1 / 48

Statistics lecture 3

Statistics lecture 3. Bell-Shaped Curves and Other Shapes. Goals for lecture 3. Realize many measurements in nature follow a bell-shaped (“normal”) curve Understand and learn to compute a standardized score Learn to find the proportion of the population that falls into a given range

banagher
Download Presentation

Statistics lecture 3

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Statistics lecture 3 Bell-Shaped Curves and Other Shapes

  2. Goals for lecture 3 • Realize many measurements in nature follow a bell-shaped (“normal”) curve • Understand and learn to compute a standardized score • Learn to find the proportion of the population that falls into a given range • Memorize the Empirical Rule

  3. Histogram

  4. Bell-Shaped “Normal” Curve

  5. Bell-Shaped “Normal” Curve

  6. Bell-Shaped “Normal” Curve

  7. Bell-Shaped “Normal” Curve

  8. Remember? • Mean(average): Sum of the values divided by the number of values • Standard deviation: A measure of how spread out the values are. Think of it as the “average distance” of all values from the mean.

  9. mean Some Characteristics of a Normal Distribution • Symmetrical (not skewed) • One peak in the middle, at the mean • The wider the curve, the greater the standard deviation • Area under the curve is 1 (or 100%)

  10. Why it looks like that With many things in nature, most individuals fall near the average. The farther you move above or below the average, the fewer individuals there are with those extreme values. Examples: Height, weight, IQ, pulse rate

  11. Bell-shaped wear

  12. Bell-shaped wear

  13. Not all curves are “normal”

  14. Normal Curve... If you know these two things: • The Mean • The Standard Deviation ...

  15. ...Normal Curve ...you can figure these things: • The proportion of individuals who fall into any range of values • The percentile of any given value • The value of any given percentile

  16. Percentiles Your percentilefor a particular measure (like height or IQ) is the percentage of the population that falls belowyou. In one of my recent classes: • My height (183 cm): 89th percentile • My weight (104 kg): 99th percentile • My age (62): 99th percentile

  17. Standardized Scores A standardized score(also called the z-score) is simply the number of standard deviations a particular value is either above or below the mean. The standardized score is: • Positive if above the mean • Negative if below the mean

  18. Standardized Score Examples Class height: Mean 170 cm, StdDev: 10 cm. What is the z-score of someone: • 160 cm • 180 cm • 175 cm • 150 cm • 170 cm • 145 cm

  19. Calculate z-score for a Particular Value z-score = (Value - mean) / StdDev 185 cm : (185 – 170) / 10 = 15 / 10 = +1.5 165 cm: (165 - 170) / 10 = -5 / 10 = -0.5 180 cm: (180 - 170) / 10 = 10 / 10 = +1.0

  20. What’s the Point? • With z-score or percentile, you can compare unlike things. For instance, I am heavier (99th pctile) than I am tall (89th pctile). • With a z-score, you can look up the percentile in a table or an online calculator

  21. The Empirical Rule For any normal curve, approximately: • 68% of values within one StdDevof the mean • 95% of values within two StdDevsof the mean • 99.7% of values within three StdDevsof the mean

  22. Empirical Rule

  23. Empirical Rule

  24. Empirical Rule

  25. Outlier • A value that is more than three standard deviations above or below the mean.

  26. Apply Empirical Ruleto Class Height Class height: Mean 170 cm., StdDev 10 cm. • About 68% of class is between what heights? 160 cm and 180 inches (+/- 10 cm) • About 95% of class is between what heights? 150 inches and 190 inches (+/- 20 cm)

  27. Data visualization goals • See different ways of graphically displaying data. • Learn the features of a good statistical picture. • Be able to identify common problems with graphs and plots. • Learn to read graphs comprehensively.

  28. Why do we turn data into graphics? • Easier to understand • Easier to see the trends • A good graphic will convey the same message you would get if you really studied the data “Graphics reveal data.” -- Edward Tufte

  29. Two kinds of variables • Categorical: Data that can be counted in categories, such as gender or race • Measurement:Data that can be recorded as a number and then put into order, such as IQ, weight, cigarettes smoked per day, etc.

  30. Pictures ofCategorical Data Three common types of graphics for categorical data: • Pie charts • Bar graphs • Pictograms

  31. 37% 63% Pie Charts • Good for showing one categorical variable, like gender • Show the percentage that falls into each category Women Men

  32. Bar Graphs • Can show two or more categorical variables simultaneously (for example, height and gender)

  33. Pictograms A B C D F • Height of pictures is used like bars number grades

  34. Pictograms can be misleading • We tend to focus on the area, rather than just the height

  35. Pictograms can be misleading • To be fair, you should keep the width of pictograms the same

  36. Pictures ofMeasurement Data Lots of ways to illustrate measurement variables: • Stemplots and histograms (lecture 2) • Line graphs (also called fever charts) • Scatter plots • Others: Area, radar, doughnut, high-low-close, surface plots, maps, et al.

  37. Stemplots 19 | 5 9 20 | 1 1 4 5 5 5 6 6 6 6 7 7 8 21 | 0 1 2 2 2 2 4 4 4 4 5 8 8 8 9 9 22 | 0 2 2 2 5 6 7 9 23 | 0 2 2 3 5 5 24 | 4 6 7 7 9 9 25 | 1 7

  38. Line Graph(Fever Chart)

  39. Scatter Plot • Good for displaying the relationship between twomeasurement variables

  40. Scatter Plot Doig

  41. Scatter Plot Doig

  42. Scatter Plot

  43. Scatter Plot

  44. Scatter Plot

  45. Difficulties and Disasters Most common problems: • No labeling on one or more axes • Not starting at zero • Changes in labeling on axes • Misleading units • Graphs based on poor information

  46. Checklist for Statistical Pictures 1. Does the message clearly stand out? 2. Is the purpose or title evident? 3. Is a source given for the data? 4. Did the data come from a reliable, believable source? 5. Is everything labeled clearly and unambiguously?

  47. Checklist for Statistical Pictures 6. Do the axes start at zero? 7. Do the axes maintain a constant scale? 8. Are there breaks in the numbers on the axes that may be easy to miss? 9. Have financial numbers been adjusted for inflation? 10. Is there extraneous information cluttering the picture or misleading the eye?

  48. Perguntas?

More Related