Displaying and Summarizing Quantitative Variables. Ch. 4.
Quantitative Data
• A quantitative variable is a measured variable (with units) that answers questions about the quantity of what is being measured (e.g. income ($), height (inches), weight (pounds)).
• Quantitative Data Condition: the data are values of a quantitative variable whose units are known.
Barry Bonds’ HRs
Who: MLB seasons from 1986 to 2007
What: Barry Bonds’ home runs (HRs)
When: From 1986 to 2007
Where: Cities with MLB teams
Why: Mr. Gray likes baseball and needed an example
How: Data were gathered from baseball-reference.com
What to look for
When you describe a distribution, always describe the
• Shape
• Center
• Spread
Center: “One number to rule them all”
• When the distribution is skewed or has outliers, use the median
  Median -- the middle number when the set is ordered
  • If there is an even number of data values, the median is the average of the two middle values
  Has the same units as the data!
• When the distribution is unimodal and symmetric, use the mean
  Mean -- the average of the data set
  • The “balancing” point of the data
  Has the same units as the data!
Mean
• The (sample) mean is the arithmetic average of the values in a sample: add up the values and divide by the number of values n, $\bar{y} = \frac{\sum y}{n}$
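The choice between median and mean can be illustrated with a minimal Python sketch. The numbers below are hypothetical (they are not from the slides), and the standard-library `statistics` module is used only for convenience.

```python
from statistics import mean, median

# Hypothetical values, not taken from the slides.
sample = [10, 20, 30, 40]
sample_mean = mean(sample)      # arithmetic average of the values
sample_median = median(sample)  # even count: average of the two middle values

# Adding one extreme value drags the mean upward but moves the median far less.
with_outlier = sample + [150]
outlier_mean = mean(with_outlier)
outlier_median = median(with_outlier)

print("mean:", sample_mean, "median:", sample_median)
print("with outlier -> mean:", outlier_mean, "median:", outlier_median)
```

In this sketch the single extreme value shifts the mean much more than the median, which is the reason the median is preferred for skewed data or data with outliers.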
Spread
• When the distribution is skewed or has outliers, use the IQR
  Interquartile Range (IQR)
  • The difference between quartile 3 and quartile 1
  • IQR = Q3 - Q1
  Has the same units as the data!
• When the distribution is unimodal and symmetric, use the standard deviation
  Standard Deviation -- the square root of the sample variance, i.e. of the sum of the squared differences from the mean divided by n - 1
  Has the same units as the data!
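As a rough sketch of the IQR calculation, the Python below takes Q1 and Q3 as the medians of the lower and upper halves of the ordered data. The helper names `quartiles` and `iqr` and the data are hypothetical, and leaving the middle value out of both halves when the count is odd is an assumption: textbooks and calculators differ on that convention.

```python
from statistics import median

def quartiles(values):
    """Return (Q1, Q3) as the medians of the lower and upper halves.

    Assumption: for an odd number of values, the middle value is left
    out of both halves. Other conventions give slightly different results.
    """
    if len(values) < 2:
        raise ValueError("need at least two data values")
    ordered = sorted(values)
    half = len(ordered) // 2
    lower = ordered[:half]    # smallest half of the data
    upper = ordered[-half:]   # largest half of the data
    return median(lower), median(upper)

def iqr(values):
    """Interquartile range: IQR = Q3 - Q1."""
    q1, q3 = quartiles(values)
    return q3 - q1

# Hypothetical values, not taken from the slides.
data = [2, 4, 6, 8, 10, 12, 14, 16]
q1, q3 = quartiles(data)
print("Q1:", q1, "Q3:", q3, "IQR:", iqr(data))
```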
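The Path to Standard Deviation
• Deviation from the mean -- the difference of the data value from the mean of the data set, $y - \bar{y}$
• (Sample) Variance -- the sum of the squared deviations from the mean divided by n - 1, where n is the number of data values: $s^2 = \frac{\sum (y - \bar{y})^2}{n - 1}$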
The Path to Standard Deviation (cont.)
• (Sample) Standard Deviation -- the square root of the sample variance: $s = \sqrt{\frac{\sum (y - \bar{y})^2}{n - 1}}$
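The path from deviations to standard deviation can be followed step by step in a short Python sketch. The data are hypothetical, and the final comparison against `statistics.stdev` is only a sanity check that the hand calculation uses the n - 1 divisor.

```python
import math
from statistics import stdev

# Hypothetical values, not taken from the slides.
data = [2, 4, 4, 4, 5, 5, 7, 9]
n = len(data)

# Step 1: the mean of the data set.
y_bar = sum(data) / n

# Step 2: deviation of each value from the mean (these always sum to zero).
deviations = [y - y_bar for y in data]

# Step 3: sample variance, the sum of squared deviations divided by n - 1.
variance = sum(d ** 2 for d in deviations) / (n - 1)

# Step 4: sample standard deviation, the square root of the variance.
sd = math.sqrt(variance)

print("mean:", y_bar)
print("variance:", round(variance, 3))
print("standard deviation:", round(sd, 3))
print("matches statistics.stdev:", math.isclose(sd, stdev(data)))
```

Squaring keeps positive and negative deviations from cancelling, and taking the square root at the end returns the result to the units of the data.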
Barry Bonds Summary Statistics
• Median: 34 HRs
• IQR: 20 HRs
• Mean: 34.6 HRs
• SD: 14.0 HRs
Three Rules of Data Analysis
• Make a picture
• Make a picture
• Make a picture
Stem and Leaf Diagram
• An effective way to display quantitative data when the data set is not too large
• Stem -- the beginning digit(s) of the data value
• Leaf -- the final digit(s) of the data value
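A stem-and-leaf display can be built with a few lines of Python. This is a minimal sketch that assumes non-negative whole numbers, with the tens digit(s) as the stem and the units digit as the leaf. The function name `stem_and_leaf` and the data are hypothetical.

```python
from collections import defaultdict

def stem_and_leaf(values):
    """Return one text line per stem, e.g. '7|3' for the value 73.

    Assumes a non-empty list of non-negative integers.
    """
    groups = defaultdict(list)
    for value in sorted(values):
        stem, leaf = divmod(value, 10)  # 73 -> stem 7, leaf 3
        groups[stem].append(leaf)

    lines = []
    # Include empty stems so gaps in the distribution stay visible.
    for stem in range(min(groups), max(groups) + 1):
        leaves = "".join(str(leaf) for leaf in groups.get(stem, []))
        lines.append(f"{stem}|{leaves}")
    return lines

# Hypothetical values, not taken from the slides.
data = [5, 16, 19, 24, 25, 33, 34, 73]
display = stem_and_leaf(data)
print("\n".join(display))
print("Key: 7|3 represents 73")
```

Keeping the empty stems in the output is a design choice: it lets an isolated high value stand visibly apart from the rest of the data.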
Histogram
• When to use:
  Number of variables: 1
  Data type: quantitative data
  Purpose: displaying data distribution
• Include a key
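The counting behind a histogram can be sketched in plain Python: each value is assigned to an equal-width bin, and the bar height is the number of values in that bin. The function name `histogram_counts`, the bin width of 10, and the data are all hypothetical choices for illustration.

```python
from collections import Counter

def histogram_counts(values, bin_width):
    """Map each bin's lower edge to the number of values in that bin.

    Assumes a non-empty list of non-negative integers and a positive
    integer bin width. Bins cover [edge, edge + bin_width).
    """
    counts = Counter((value // bin_width) * bin_width for value in values)
    first, last = min(counts), max(counts)
    # Keep empty bins so gaps in the distribution are not hidden.
    return {edge: counts.get(edge, 0)
            for edge in range(first, last + bin_width, bin_width)}

# Hypothetical values, not taken from the slides.
data = [5, 16, 19, 24, 25, 33, 34, 73]
bins = histogram_counts(data, 10)

# Text rendering: one row per bin, one '#' per data value.
for edge, count in bins.items():
    print(f"{edge:>3}-{edge + 9:<3} {'#' * count}")
```

A plotting library would normally draw the bars. The text rendering here only shows how the bin counts determine the shape of the display.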
Stem and Leaf: Barry Bonds’ HRs (key: 7|3 represents 73)
Stem and Leaf: Albert Pujols’ RBIs (key: 9|9 represents 99)
Stem and Leaf (key: 7|3 represents 73)