100 likes | 228 Views
IPP Screening Audit. IPP Meeting May 31, 2006 Preeti Pathela (ppathela@health.nyc.gov). Question of interest:.
E N D
IPP Screening Audit IPP Meeting May 31, 2006 Preeti Pathela (ppathela@health.nyc.gov)
Question of interest: In stratified random sampling from a population of size N, what sample size n is necessary to determine a proportion ¶ to within the maximum allowable difference D with confidence P?To answer this, we want to use the following parameters:N= Each project area’s population size D= The maximum allowable difference (or margin of error)¶= Estimated proportion screenedP= Confidence level (or alpha)
Getting the sample size Take the smallest subgroup of STD and smallest subgroup of FP (in other words, the STD or FP project area with the smallest eligible population (N)), and calculate their required sample size (n) based on the parameters we set. Whatever percentage the n is of that area’s N will be applied to all the other subgroups of STD or FP. Eligible population for: STD: females under 30 yrs FP: females under 25 yrs for initial or annual exam
Implementation • Using information provided, project areas included were: • STD: NJ, NYS, NYC • NYS had the smallest eligible population • Range of estimated number screened was • 83.8% - 95.7% • Family planning: NJ, NYS, and NYS • [USVI FP provided eligible population, but it was so small that it would have greatly increased the sample size for all other areas]. • NYC had the smallest eligible population • Range of estimated number screened was • 74.0% - 91.1%
Parameters used (STD) • NYS clinics 2005: N=4,792 • D: Set at .03 We’re allowing a maximum allowable difference of 3% around the proportion estimate • ¶: Set at .90 We’ll estimate that of all eligible women in the population has an MD visit, 90% are screened for Ct. • P: Set at .95 We’ll estimate our proportion (within +3%) with 95% confidence; in other words, if we repeated this exercise 100 times, 95 of those times we would find that the true proportion of women screened is between 87%-93%. • If the true proportion is lower than 87%, we will perform a statistical test (z-score) to assess if the difference between the estimated and observed proportions is statistically significant.
Calculated sample size and sample sizes for all other groups 356 is 7.4% of 4,792 (NYS STD clinics’ N). Thus, we want to take a 7.4% sample from all other project areas. Using a power and sample size calculator: n= 356
Parameters used (FP) • NYC clinics 2005: N=5,621 • D: Set at .05 We’re allowing a maximum allowable difference of 5% around the proportion estimate • ¶: Set at .85 We’ll estimate that of all eligible women in the population has an MD visit, 85% are screened for Ct. • P: Set at .95 We’ll estimate our proportion (within +5%) with 95% confidence; in other words, if we repeated this exercise 100 times, 95 of those times we would find that the true proportion of women screened is between 80%-90%. • If the true proportion is lower than 80%, we will perform a statistical test (z-score) to assess if the difference between the estimated and observed proportions is statistically significant.
Calculated sample size and sample sizes for all other groups 191 is 3.4% of 5,621 (NYC FP clinics’ N). Thus, we want to take a 3.4% sample from all other project areas. Using a power and sample size calculator: n= 191
Choosing a random sample • Once you have a sample size, you can draw the random sample based on how the medical records are put in order (by date of visit, alphabetically by name, etc.) • Microsoft Excel • Random number table 3680 2231 8846 5418 0498 5245 7071 2597 If you wanted to sample two records from records numbered 1 to 48 we would read off the digits in pairs:36 80 22 31 88 46 54 18 04 98 52 45 70 71 25 97 If we wanted to sample two records from a much longer list with 140 records in it we would need to read the digits off in groups of three:368 022 318 846 541 804 985 245 707 125 97
Choosing a systematic sample • In a random sample every member of the population has an equal chance of being chosen, which is not the case with a systematic sample, but it is almost always accepted as being random. • Suppose you want to sample 8 charts from a population of 120 charts. 120/8=15, so every 15th chart is chosen after a random starting point between 1 and 15. If the random starting point is 11, then the charts selected are 11, 26, 41, 56, 71, 86, 101, and 116.