Sample Size

NTR 629 - Week 4B Sample Size

Sample Size • Sample Size = The number of elements in the obtained sample. • When determining sample size, consider: • Central Limit Theorem • Importance of Precision and Reliability • Power Analysis

1. Central Limit Theorem (CLT) • CLT says that under most distributions (normal or not), as sample size increases, distribution of means will tend towards the normal distribution. • With sample sizes of at least 30 per variable (or group if that is the variable) the sample means will approximate the population mean. The small sample sizes of 2 and 5 (above) do not approximate the mean, whereas the sample of 100 (below) does, and is much closer to a ‘normal’ (bell shaped) distribution. See also: http://www.wadsworth.com/psychology_d/templates/student_resources/workshops/stat_workshp/cnt_lim_therm/cnt_lim_therm_20.html

2. Precision and Reliability Precision = how close the statistic of the sample matches that of the study population. Determine precision based on: • Amount of error allowed, or the size of Standard Error (SE) or Standard Error. Larger sample sizes decrease SE. • Confidence Interval (CI) • How confident are you that you would achieve similar results if you repeated the study? • 95% of means for each subjects lies within +/- 1.96 sd of the mean. • The +/- 2.58 value reflects 99% of cases. Example Likert scale response GRAPH : Trochim, William M. The Research Methods Knowledge Base, 2nd Edition. Internet WWW page, at URL: <http://www.socialresearchmethods.net/kb/> (version current as of July 16, 2012).

3. Power Analysis • Power analysis allows the researcher to determine how many subjects are needed to give the hypothesis a chance of being accepted (assuming the hypothesis is true). Must consider: • Type I error (alpha, ) • Type II error (beta, ) • Effect size • Directionality of the Hypothesis

Power Analysis Considerations • Power = The probability of rejecting the null hypothesis when false (i.e., arriving at the correct conclusions). • Formula: 1-… or in this example, an 80% probability of detecting existing relationship. Type I and type II errors are inversely related. • If  (or ) level set at .05, researchers want to be 95% certain they will not commit a type I error. • Set  (for type II error) at four times the  value. • Example: If  set at .05, then  would be .20. Interpretation: There is a 20% probability that one will fail to uncover an existing relationship.

Power Analysis Considerations Effect Size Directionality of Hypothesis One-tailed test used if certain about direction (e.g., increase, decrease) of the effect. The two-tailed hypothesis (null hypothesis) examines differences in both directions. It is used if the researcher is uncertain about the effect of the independent variable on the dependent variable. • The size or magnitude of a difference. • Target having at least a ‘medium’ effect size. • Many fields not require reporting of effect size in journals. • Refer to course handout ‘Using Statistics’, page 15, for the interpretation of effect sizes for various statistical tests.

Power Analysis Computation • Refer to Power Analysis Tables as guide. • If the standard deviations in population are known, can compute sample size needed • Online (free) Sample Size Calculators: • http://www.surveysystem.com/sscalc.htm • http://www.nss.gov.au/nss/home.nsf/pages/Sample+Size+Calculator+Description?OpenDocument • Power Analysis Tables of Points to allow for 95% margin of error for various survey sample sizes • http://research-advisors.com/tools/SampleSize.htm

Sample Size Rules of Thumb • Another consideration is nature of study (impacts statistics). The recommended minimum number of subjects are as follows for the following types of studies: • 100 for a Descriptive Study • 50 for a Correlational Study • 30 in each group for Experimental and Causal-Comparative Study • The use of 15 subjects per group should likely be replicated

Determinants of Sample Size • Increase sample size: • When homogeneous sampling units. • For greater precision. • When want higher statistical significance and/ or power • When predict small differences between groups (want larger effect size) • If high attrition expected • If subsamples (thus smaller group size N for analyses) • If numerous variables are being explored • When numerous confounding variables are present • If dependent variable is expected to vary greatly • Mail surveys - Increase sample size by 40-50% • Use of less reliable and valid instrumentation (e.g., essay vs. validated Likert scale) • Balance with need for more resources to increase sample size (financial, time, manage participants)

External Validity a.k.a. Generalizability • External Validity refers to the extent that the results of a study can be generalized from a sample to a population. • Population generalizabilityis the degree to which a sample represents the population of interest. • Obtaining a representative sample becomes very important • Ecological generalizabilityrefers to the extent to which the results of a study can be generalized to conditions or settings other than those that prevailed in the study.

Nonsampling Errors • Nonsampling Errors • An inadequate sampling frame • Nonresponse from participants • Field errors • Response errors • Coding and data entry error

NTR 629 - Week 4C Recruitment Methods

Recruitment • Goals of recruitment: • Recruit participants that represent the target population • Recruit an adequate number of participants to meet the recommended sample size requirements for study • Pre-Screening Guidelines: http://healthcare.partners.org/phsirb/prescreen.htm • Recruitment Guidelines: http://healthcare.partners.org/phsirb/recruit.htm • The Response Rate is the percent of eligible subjects who agree to participate in the study.

Attrition Nonresponse Dropouts Dropouts = withdrawal after the start or unavailable for required follow-up. Count in the initial response rate, but do not count in the sample size (N) for results. If a percentage of dropouts are anticipated, a researcher should increase the sample size at the start of the study to account for this. • Compromises validity and distorts findings. Reduce nonresponse by: • Multiple methods to make contact • Several points of contact • Providing information materials • Less invasive procedures • Less time consuming and/or less visits • Incentives (e.g., gas $) • Bilingual staff

Sample Size

Sample Size

Presentation Transcript

Sample Size Calculation

Sample Size

Sample Size Determination

Surveys Sample Size

Survey Sample Size

Sample Size Estimation

SAMPLE SIZE DETERMINATION

SAMPLE SIZE ESTIMATION

Estimating Sample Size

Sample Size

Sample Size Estimation

Sample Size Calculation

Sample size determination

Sample Size Determination

Sample Size

Determining Sample Size

Sample Size

Sample Size Considerations

SAMPLE SIZE

Sample size calculation

SAMPLE SIZE:

Sample Size