1 / 24

A Short Tour of Probability & Statistics

A Short Tour of Probability & Statistics. Presented by: Nick Bennett, Grass Roots Consulting & GUTS Josh Thorp, Stigmergic Consulting & GUTS Irene Lee, Santa Fe Institute, GUTS. November 6, 2010 Santa Fe Alliance for Science Professional Enrichment Activity. Outline.

sukey
Download Presentation

A Short Tour of Probability & Statistics

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. A Short Tour of Probability & Statistics Presented by: Nick Bennett, Grass Roots Consulting & GUTS Josh Thorp, Stigmergic Consulting & GUTS Irene Lee, Santa Fe Institute, GUTS November 6, 2010 Santa Fe Alliance for Science Professional Enrichment Activity

  2. Outline • Framing the problem (Nick) • Review of Statistics (Irene) • Randomness (Nick) • Dice & Data (Josh) • Problem of Points (Nick) • Crosswalk of Common Core standards (Irene)

  3. What is Statistics? • The science of collection, organization and interpertation of data.

  4. What do Statisticians do? • Data analysis • Probability • Statistical inference

  5. What is a Statistical question? • One that anticipates variability. • Compare • “How old am I?” • “How old are the students in my school?”

  6. Describing Data • Data (plural) are the raw material • Data are the numbers we use to interpret reality • We will look at a few different ways of describing data. • Dot plot • Frequency table • Stem and Leaf diagram

  7. A Sample Data Set • 92 Penn State students’ weights • MALES: • 140, 145, 160, 190, 155, 165, 150, 190, 195, 138, 160, 155, 153, 145, 170, 175, 175, 170, 180, 135, 170, 157, 130, 185, 190, 155, 170, 155, 215, 150, 145, 155, 155, 150, 155,150, 180, 160, 135, 160, 130, 155, 150, 148, 155, 150, 140, 180, 190, 145, 150, 164, 140, 142, 136, 123, 155. • FEMALES: • 140, 120, 130, 138, 121, 125, 116, 145, 150, 112, 125, 130, 120, 130, 131, 120, 118, 125, 135, 125, 118, 122, 115, 102, 115, 150, 110, 116, 108, 95, 125, 133, 110, 150, 108

  8. Dot Plot In a dot plot, one dot per student goes over each student’s reported weight.

  9. Frequency table -> Histogram Divide the number line into intervals and count the number of students weights within each interval. The “frequency” is the count in any given interval. The “relative frequency” is the proportion of weights in each interval.

  10. Histograms • From the frequency table, we can make a bar graph called a histogram. • Each bar covers an interval and is centered at the midpoint. • The height of the bar corresponds with the number of data points in the interval

  11. Stem-and-Leaf Diagram Both summarizes data and shows all data points. • The STEM shows intervals (ranges in tens) • The LEAVES show data points (ranges in ones) • Put the leaves in order • Is there evidence of reporting bias?

  12. Summary Statistics • Central or typical value • Spread about that value

  13. Measures of Center • Mean • Median

  14. The Mean • Given

  15. The Median • The midpoint of the data • If even number of data points, it is the middle • If odd number of data points, average the two data points nearest the middle.

  16. Why two measures of center?

  17. Measures of Spread • Interquartile range • Standard deviation

  18. Interquartile range • Put the data in numerical order • Divide the data set into two equal groups with the median as the center point. • The median of the low group = 1st quartile • The median of the high group = 3rd quartile

  19. Box & Whiskers plot median . 1.5 IQR 1.5 IQR

  20. Standard deviation • Average squared distance = • Sample variance • Standard deviation =

  21. Z-scores, Standardized Scores • A student weighing 175 pounds has a z-score of 1.26

  22. Summary: • Several ways to display data • Measures of Center • Measures of Spread • Standard deviations

  23. Statistical inference • Use random sampling to draw inferences about a population. • Generalizations about a population from a sample are valid only if the sample is representative of that population.

  24. Sampling • With replacement. • Without replacement.

More Related