100 likes | 217 Views
Intro to Statistics and Data. What is Statistics?. Statistics is the science of learning from data , and of measuring, controlling, and communicating uncertainty In a word: variation. Think , Show , Tell. Think first. Know where you are headed and why.
E N D
What is Statistics? • Statistics is the science of learning from data, and of measuring, controlling, and communicatinguncertainty • In a word: variation
Think, Show, Tell • Thinkfirst. Know where you are headed and why. • Show your work. The mechanics of the calculation are important but can not exist on there own. • Tellyour conclusion in the context of the problem
Data • Information in context • Context: • Who • What (and in what units) • Why (if possible) • Where • When • How
The W’s • Who?-- The cases about whom (or which) we gather information • MLB Seasons from 1986 to 2007 • What? – Variables are characteristics recorded about each case • Barry Bonds’ HRs (HRs) • Why? – The reasons for gathering the data can influence our analysis
The W’s cont. • When and where? – Help provide additional context for the data • How? – The methodology for gathering the data can determine whether you have useful information or pure crap
More about What • Variables come in two types • A categorical (or qualitative) variable names categories and answers questions about how cases fall into those categories. (e.g., sex, race, ethnicity) • A quantitative variable is a measured variable (with units) that answers questions about the quantity of what is being measured. (e.g. income ($), height (inches), weight (pounds)) !!!Always include units!!!
What can go wrong? • Don’t label a variable as categorical or quantitative without thinking about the question you want it to answer. • Just because your variable’s values are numbers, don’t assume that it’s quantitative. • Always be skeptical—don’t take data for granted.