1 / 59

Community Health Assessment in Small Populations: Tools for Working With “Small Numbers”

Community Health Assessment in Small Populations: Tools for Working With “Small Numbers”. Region 2 Quarterly Meeting January 26, 2009. Outline. Description of the Problem Random variation Survey samples versus complete count datasets Observed events versus underlying risk Statistical Tools

randrus
Download Presentation

Community Health Assessment in Small Populations: Tools for Working With “Small Numbers”

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Community Health Assessment in Small Populations: Tools for Working With “Small Numbers” Region 2 Quarterly Meeting January 26, 2009

  2. Outline • Description of the Problem • Random variation • Survey samples versus complete count datasets • Observed events versus underlying risk • Statistical Tools • Confidence intervals • Combining data • SMR New Mexico Department of Health

  3. Small Numbers: The Problem

  4. Random Variation • Exercise: • Select a sample • Calculate the median age • State of New Mexico Median age1 • 36.0 • Why are they different? 1. 2007 American Community Survey, U.S. Census Bureau. Downloaded on 1/21/09 from http://factfinder.census.gov New Mexico Department of Health

  5. Random Variation and Sample Size • What if we had a sample of New Mexico residents that was: • Randomly selected • n=5,000 • Would it better match the state Census Bureau estimate? New Mexico Department of Health

  6. Size Matters • The larger sample helps to “cancel out” the effects of random variation. • Some sample subjects are older than the median. • Some sample subjects are younger than the median. • As you increase the number of sample subjects, the differences cancel out, and you get closer to the median. New Mexico Department of Health

  7. Reliability and Validity • The term "accuracy" is often used in relation to validity, while the term, "precision" is used to describe reliability. New Mexico Department of Health

  8. Numerator vs. Denominator • A large sample size means we have a large denominator, but the numerator also matters. • Some methods use a “Poisson distribution,” which considers ONLY the numerator size when assessing precision. • If we have only 1 event in one year, and 2 the next year, the addition of a single event doubles the rate of occurrence. New Mexico Department of Health

  9. Random Variation and Complete Count Datasets • What are some “complete count” datasets? • How do we use them for community health assessment? New Mexico Department of Health

  10. New Mexico Department of Health

  11. New Mexico Department of Health

  12. New Mexico Department of Health

  13. New Mexico Department of Health

  14. New Mexico Department of Health

  15. New Mexico Department of Health

  16. New Mexico Department of Health

  17. Summary of the Problem • Measurements are subject to sampling variability, also known as random error. • Even complete count datasets are subject to random error because we use them as a reflection of the underlying disease risk or rate. New Mexico Department of Health

  18. Summary of the Problem • A larger sample (denominator, population size) helps to “cancel out” the effects of random variation. • Size matters, in both the numerator and the denominator. • A measure that is relatively free from the effects of random variation is called “precise,” “reliable,” and “stable.” Those terms are synonymous. New Mexico Department of Health

  19. Small Numbers: Statistical Tools

  20. Tool #1. Confidence Intervals • Use confidence intervals to help you decide whether the rate is stable. • Won’t solve the problem, but will provide information to help you interpret the rates. • The stability of an observed rate is important when comparing areas or assessing whether disease risk has increased or decreased. New Mexico Department of Health

  21. New Mexico Department of Health

  22. New Mexico Department of Health

  23. New Mexico Department of Health

  24. New Mexico Department of Health

  25. Calculation of 95% C.I. • The 95% confidence interval is calculated as 1.96 x Standard Error of the estimate (s.e.). • s.e. is calculated as • So the 95% C.I. is 1.96 New Mexico Department of Health

  26. The “Normal” Distribution New Mexico Department of Health

  27. Poisson Distribution New Mexico Department of Health

  28. Calculation of 95% C.I. • p stands for “probability.” It is the rate without the multiplier (e.g., 100,000 for deaths). q is the complement of the probability (1 minus P). • In Union County, there were 2 diabetes deaths among the 4,470 population, for a probability of 0.00045 (45 in 100,000) New Mexico Department of Health

  29. Calculation of 95% C.I. • Formula: 1.96 • p=0.00045, q=0.99955, n=4,470 • (pq)/n = .000447 / 4470 = 0.000000100051 • √(pq)/n = ………………… = 0.000316 • 1.96√(pq)/n = 1.96 x 0.000316 = 0.00062 • Then we need to add the multiplier back in, so the confidence interval is: + 100,000*0.00062 = +62 New Mexico Department of Health

  30. Calculation of 95% C.I. • The diabetes death rate was 44.7 per 100,000. • The confidence interval statistic is applied both above and below the rate. • C.I. LL (lower limit) is: 44.7- 62 = -17.3, and since we cannot have a negative rate, we’ll call it “0” • C.I. UL (upper limit) is: 44.7 + 62 = 106.7 • The diabetes death rate for Union County in 2006 was 44.7 per 100,000 (95% C.I., 0 to 106.7) New Mexico Department of Health

  31. New Mexico Department of Health

  32. New Mexico Department of Health

  33. New Mexico Department of Health

  34. Confidence Interval Factoids • The confidence interval may be thought of as the range of probable true values for a statistic. • The confidence interval is an indication of the precision (stability, reliability) of the estimate. • A confidence interval is typically expressed as a symmetric value (e.g., "plus or minus 5%"). But for percentages, when the point estimate is close to 0% or 100%, a confidence interval with an asymmetric shape can be used. New Mexico Department of Health

  35. More Confidence Interval Factoids • The 95% confidence interval (calculated as 1.96 times the standard error of a statistic) indicates the range of values within which the statistic would fall 95% of the time if the researcher were to calculate the statistic from an infinite number of samples of the same size drawn from the same base population. Unless otherwise stated, a confidence interval will be the "95% confidence interval." New Mexico Department of Health

  36. More Confidence Interval Factoids • The 90% confidence interval, also commonly used, is calculated as 1.65 times the standard error of the estimate. • To calculate a confidence interval when the number of health events = 0, you may use 0 as the lower confidence limit, and for the upper confidence limit, assume a count of 3 health events in the same population. New Mexico Department of Health

  37. Tool #2. Combine Data • Combine years • Combine geographic areas (e.g., use the regional estimate rather than the county estimate) • Use a broader age group New Mexico Department of Health

  38. New Mexico Department of Health

  39. New Mexico Department of Health

  40. New Mexico Department of Health

  41. Interpretation of Diabetes Deaths in Union County • Union County’s diabetes death rate (1999-2006) was higher than the state, overall rate, but was not statistically significantly higher. • In other words, the Union County rate was “marginally higher” than the New Mexico state rate. • Was it higher than Santa Fe County? New Mexico Department of Health

  42. Differences Between Two Rates • Statistical significance of a change in a rate from time1 to time2 • Statistical significance of the difference between two rates in one time period (e.g., Union County versus Santa Fe County). **Test of Proportions** New Mexico Department of Health

  43. Test of Proportions • Proportion1: 1999-2006 Union County diabetes death rate: 41.3/100,000 = .000413 • Proportion 2: 1999-2006 Santa Fe County diabetes death rate: 20.4/100,000 = .000204 • Difference between the two proportions: .000413 - .000204 = .000209 New Mexico Department of Health

  44. Test of Proportions (cont’d) • The difference between the two rates (0.00026) must be considered in the context of the standard error of the difference between two rates (pooled standard error), computed as: • If the difference between the two rates, 0.000209, is greater than 1.96 x s.e.diff, then the difference is considered statistically significant. Bruning, J.L., and Kintz, B.L. (1977) Computational Handbook of Statistics. Scott, Foresman and Company: London. New Mexico Department of Health

  45. Union County: p1=0.000413 q1=0.999587 n1=33,929 Santa Fe County: p2=0.000204 q2=0.999796 n2=1,092,565 Calculation of s.e.diff • p=proportion, q=(1-p), n is the person-years at risk, or the sum of the population counts across all eight years. New Mexico Department of Health

  46. Calculation of s.e.diff New Mexico Department of Health

  47. Evaluation of the Difference • Union County: 41.3/100,000 = .000413 • Santa Fe County: 20.4/100,000 = .000204 • Difference: .000413 - .000204 = .000209 • s.e.diff = .0001111 • 1.96 * s.e.diff = .000218 • Is .000209 greater than .000218? • No. Union County’s rate is greater than Santa Fe County’s rate, but the difference is NOT statistically significant. New Mexico Department of Health

  48. Tool #3. SMR and ISR • Standardized Mortality (or Morbidity) Ratio (SMR) • Estimates the number of deaths (or health events) one would EXPECT, based on … • The age- and sex-specific rates in a standard population (e.g., New Mexico rate) • The age and sex distribution of the index area. • Indirectly Standardized Rates • Use SMR to perform age adjustment when the number of cases is less than 20. New Mexico Department of Health

  49. Standardized Mortality Ratio • The all-cause death rate in New Mexico in 2006 was 757.5 deaths per 100,000 population. • All other things being equal, we should expect the same death rate in Union County. New Mexico Department of Health

  50. Standardized Mortality Ratio • BUT all other things are NOT equal. • 2006, % of population over age 65 was • 18.9% in Union County, compared with • 12.3% statewide. • In an older population, we would expect a higher death rate. New Mexico Department of Health

More Related