1 / 58

Introduction to Biostatistics (Pubhlth 540) Lecture 3: Numerical Summary Measures

Introduction to Biostatistics (Pubhlth 540) Lecture 3: Numerical Summary Measures. Acknowledgement: Thanks to Professor Pagano (Harvard School of Public Health) for lecture material. Reading/Home work. -See WEB site. For after all, what is man in nature?

Download Presentation

Introduction to Biostatistics (Pubhlth 540) Lecture 3: Numerical Summary Measures

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Introduction to Biostatistics(Pubhlth 540) Lecture 3: Numerical Summary Measures Acknowledgement: Thanks to Professor Pagano (Harvard School of Public Health) for lecture material

  2. Reading/Home work • -See WEB site

  3. For after all, what is man in nature? A Nothing in relation to the infinite, All in relation to nothing, A central point between nothing and all, And infinitely far from understanding either. Blaise Pascal, (1623-1662) Pensees (1660)

  4. Example: FEV per second in 13 adolescents with asthma Let x represent FEV1 in liters

  5. Example: FEV per second in 13 adolescents with asthma Let x represent FEV1 in liters

  6. Example: FEV per second in 13 adolescents with asthma Let x represent FEV1 in liters

  7. Example: FEV per second in 13 adolescents with asthma Let x represent FEV1 in liters

  8. Example: FEV per second in 13 adolescents with asthma Let x represent FEV1 in liters

  9. Example: FEV per second in 13 adolescents with asthma Let x represent FEV1 in liters

  10. Measures of central tendency • Population Parameters • Sample Statistics • Mean • Median • Mode

  11. Measures of central tendency • Population Parameters

  12. Measures of central tendency: Mean Example: FEV per second in 13 adolescents with asthma 2.3, 2.15, 3.50, 2.60, 2.75, 2.82, 4.05, 2.25, 2.68, 3.00, 4.02, 2.85 (n=13)

  13. If we collect a man's urine during twenty four hours and mix all this urine to analyze the average, we get an analysis of a urine which simply does not exist; for urine when fasting, is different from urine during digestion. A startling instance of this kind was invented by a physiologist who took urine from a railroad station urinal where people of all nations passed, and who believed he could thus present an analysis of average European urine! Claude Bernard (1813-1878)

  14. Mean: Examples Approx 4 million singleton births, 1991 :

  15. Mean: Examples Approx 4 million singleton births, 1991 :

  16. Mean: Examples Approx 4 million singleton births, 1991 :

  17. Mean: Examples Approx 4 million singleton births, 1991 :

  18. Mean: Examples Approx 4 million singleton births, 1991 : Of 31,417 singleton births resulting in death :

  19. Mean: Properties 26.4 years years

  20. Mean: Properties Note what happens when one number, 4.02 say, becomes large, say 40.2 : 2.3, 2.15, 3.50, 2.60, 2.75, 2.82, 4.05, 2.25, 2.68, 3.00, 40.2, 2.85 (versus 2.95, from before) Mean is sensitive to every observation, it is not robust.

  21. Measures of central tendency: Median More robust, but not sensitive enough. Definition: At least 50% of the observations are greater than or equal to the median, and at least 50% of the observations are less than or equal to the median. 2.15, 2.25, 2.30 --- median = 2.25 2.15, 2.25, 2.30, 2.60 --- (2.25 + 2.30) = 2.275 median =

  22. Comparing mean and median Singleton births, 1991 :

  23. Mean = 3359 Median = 3374

  24. Mean = 30.4 Median = 30

  25. Mean = 49.4 Median=7

  26. Comparing mean and median When to use mean or median: Use both by all means. Mean performs best when we have a symmetric distribution with thin tails. If skewed, use the median. Remember: the mean follows the tail.

  27. Mode • Mode is defined as the observation that occurs most frequently • When the distribution is symmetric, all three measures of central tendency are equal

  28. Comparing mean, median and mode Bimodal distribution Mean, Median Modes

  29. Measures of spread • Range: • Simple to calculate • Very sensitive to extreme observations • Inter Quartile Range (IQR) • More robust than the range • Variance (Standard Deviation): • Quantifies the amount of variability around the mean

  30. Measures of spread: Range Singleton births, 1991 :

  31. Measures of spread: Variance

  32. Measures of spread: Variance

  33. Measures of spread: Variance

  34. Measures of spread: Variance e.g.

  35. Measures of spread: Variance Standard deviation takes on the same unit as the mean

  36. Variance & Standard deviation Empirical Rule: If dealing with a unimodal and symmetric distribution, then Mean ± 1 sd covers approx 67% obs. Mean ± 2 sd covers approx 95% obs Mean ± 3 sd covers approx all obs

  37. Variance & Standard deviation Mother’s age: mean = 26.4 yrs s.d. = 5.84 yrs Table of ± k s.d.s

  38. Variance & Standard deviation Mother’s age: mean = 26.4 yrs s.d. = 5.84 yrs Table of ± k s.d.s

  39. Variance & Standard deviation Mother’s age: mean = 26.4 yrs s.d. = 5.84 yrs Table of ± k s.d.s

  40. Mother’s age: mean = 26.4 yrs s.d. = 5.84 yrs Table of ± k s.d.s

  41. Variance & Standard deviation Mother’s age: mean = 26.4 yrs s.d. = 5.84 yrs Table of ± k s.d.s

  42. Variance & Standard deviation Mother’s age: mean = 26.4 yrs s.d. = 5.84 yrs Table of ± k s.d.s

  43. Variance & Standard deviation Mother’s age: mean = 26.4 yrs s.d. = 5.84 yrs Table of ± k s.d.s

  44. Characterizing a symmetric, unimodal distribution – mean, SD Mother’s age: mean = 26.4 yrs s.d. = 5.84 yrs Table of ± k s.d.s

  45. Characterizing a symmetric, unimodal distribution – mean, SD Area = 0.6475 20.56 32.4 years

  46. Characterizing a symmetric, unimodal distribution – mean, SD Area = 0.963 14.72 38.08 years

  47. Characterizing a symmetric, unimodal distribution – mean, SD Mother’s age: mean = 26.4 yrs s.d. = 5.84 yrs Table of ± ks.d.s

  48. Characterizing a distribution – Chebychev’s inequality Chebychev’s Inequality Table of ± k s.d.s Proportion is at least 1-1/k2 (true for any distribution.)

More Related