350 likes | 598 Views
Epidemiology and Applied Statistics Review Module 3 – Study Design. American College of Veterinary Preventive Medicine Review Course Katherine Feldman, DVM, MPH, DACVPM kfeldman@umd.edu 301-314-6820. Plan. Students review modules on their own
E N D
Epidemiology and Applied Statistics ReviewModule 3 – Study Design American College of Veterinary Preventive Medicine Review Course Katherine Feldman, DVM, MPH, DACVPM kfeldman@umd.edu 301-314-6820
Plan • Students review modules on their own • Send questions by email to Katherine Feldman (kfeldman@umd.edu) by Friday March 23 a.m. • Conference call Friday March 23 2-3 p.m. • Watch email and Blackboard for conference call details
References • Gordis L. Epidemiology, 3rd ed. Elsevier Saunders, Philadelphia, 2004. • $47.95 from Amazon.com • Norman GR, Streiner DL. PDQ statistics, 3rd ed. BC Decker Inc., Hamilton, 2003. • $17.79 from Amazon.com
Types of Study Design • Design criteria dependent on • Whether study factor artificially manipulated • AND, if manipulated whether it is randomly allocated • Experimental studies • Observational studies
Experimental Studies • Laboratory experiments • Researcher designs and implements study • Clinical / Field trials • Therapeutics • Vaccine efficacy • Health promotion programs • Community intervention trials • Insecticide impregnated bednets
Observational Studies • Descriptive • Estimate disease frequency or time trend • Used to generate hypotheses • No comparisons or statistical analyses • Correlational studies, case reports and series • Analytic • Identify risk factors for disease • Estimate and quantify effect • Suggest possible interventions
Analytic Observational Studies • Case-control • Retrospective or prospective • Selection based on case status • Cohort • Retrospective or prospective • Selection based on exposure • Cross-sectional - snapshot • Ecologic – individual level data missing Most common designs you’ll encounter in observational epidemiology
Case-control Studies • Study participants selected based on disease status • Exposure status in diseased individuals (cases) is compared to exposure status in well individuals (controls) • Most commonly conducted observational analytic epidemiologic study
CASES CONTROLS Do not have the disease Have the disease Were not exposed Were exposed Were not exposed Were exposed Case-control Studies
When should we do a case-control study? • Can be a useful 1st step to evaluate relationship between an exposure and disease • When the disease is rare • When efficiency is needed (i.e., there isn’t time to do a cohort study)
Selection of Participants • Cases – Many sources, including • Hospital patients • Clinical practice • Diagnostic labs • Disease registries • Controls • The comparability of cases and controls is essential • Controls should be chosen from population which gives rise to cases • Can be very tricky!
Control Selection - Matching • Select controls so they are “matched” to cases on certain characteristics, such as age, race, sex, socioeconomic status and occupation • Done to adjust for the effect of the “matched” variable • Frequency matching (aka groupmatching) - select controls so proportion of controls with a certain characteristic is identical to proportion of cases with same characteristic. • E.g., If cases are 25% male, make control population 25% male • Individual matching (aka pairwisematching) – select a control for each case, so that case and control are similar with respect to variable(s) of concern. • E.g., Match 25 y.o. female case with 25 y.o. female control
Matching • Pairwise matching requires a matched analysis (we won’t get into that here) • Potential limitation is that you can no longer assess the effect of the matched variable because cases and controls are deliberately manipulated so there is no difference between the two groups
Case-control Studies - Analysis • Because the participants are selected based on disease status, there is no denominator population with which we can calculate risk • We estimate risk by comparing the odds of exposure among the diseased to the odds of exposure among the well Odds ratio = ad / bc
Case-control Studies - 2-by-2 tables I like to think of the case-control study as moving down the columns of a 2-by-2 table, because you are comparing cases to controls
Remember that the reduced formula for the OR is the CROSS-PRODUCT ratio in a 2-by-2 table adbc
Case-control Studies - Bias • Case-control studies are particularly prone to bias • Selection bias – A problem with who gets into your study • Can happen when inclusion of cases or controls somehow depends on exposure of interest • Recall bias and limitations of recall • When cases and controls recall their experiences differently • It may be difficult to remember things that happened in the past or the information might not exist
Case-control Studies – Strengths and Limitations • Strengths • Relatively quick and inexpensive • Particularly well-suited to diseases with long latent periods • Optimal for evaluating rare diseases • Can examine multiple etiologic factors for a single disease • Limitations • Inefficient for evaluating rare exposures • Cannot directly compute incidence rates of disease in exposed and nonexposed individuals, unless study is population based • May be difficult to establish a temporal relationship between exposure and disease • Is particularly prone to bias, in particular selection and recall bias
Cohort Studies • Study participants selected based on exposure status • Risk of disease in exposed individuals is compared to risk of disease in unexposed individuals • Often conducted after hypothesis has been explored in case-control study
Not Exposed Exposed Do not develop disease Do not develop disease Develop disease Develop disease Cohort Studies
Cohort Studies – Selection of Study Population • Select groups on the basis of whether or not they were exposed • E.g., To assess effects of an occupational exposure such as a solvent, the exposed group is the group that works with the solvent and the unexposed group might be office staff at the parent company OR • Select a defined population before any of the members become exposed or before the exposures are identified • Selection could be made on basis of some factor not related to exposure, such as community of residence • Take histories of, or perform blood tests or other assays on, the entire population • Using the results of the histories or the tests, the population can be separated into exposed and nonexposed groups
Retrospective vs. Prospective Cohort Studies • Corot studies may be prospective or retrospective, depending on temporal relationship between initiation of study and disease occurrence • Retrospective cohort • All relevant events (exposures and outcomes of interest) have occurred when the study is initiated • Prospective cohort (aka concurrent cohort or longitudinal studies) • Relevant exposure may or may not have occurred when study initiated but the outcomes have definitely not yet occurred • The cohort must be followed into the future
Cohort Studies - Analysis • If a positive association exists between exposure and disease, then the proportion of the exposed group that develops disease (risk in the exposed group) is greater that the proportion of the nonexposed group in which disease develops Riskexposed > Riskunexposed RR > 1
a/h1c/h2 RR= Cohort Studies - 2-by-2 tables I like to think of the cohort study as moving across the rows of a 2-by-2 table, because you are comparing exposed to unexposed
Cohort Studies - Bias • Bias in assessment of the outcome • The person who determines the disease status of each subject may be biased by knowing the study hypothesis and the exposure status of each subject • Information bias • If the quality and extent of information obtained is different for exposed and nonexposed subjects, bias can be introduced • Bias from nonparticipation • Generally, only a proportion of those who are eligible to participate actually do so • How do those who do not participate differ from those who participate? • Bias from loss to follow-up • The incidence rates (risk) for exposed and nonexposed groups may be difficult to calculate
Cohort Studies – Strengths and Limitations • Strengths • Temporal sequence between exposure and disease can be clearly established • Well suited for assessing the effects of rare exposures • Allow for the examination of multiple effects of a single exposure • Allow direct measurement of incidence of disease in exposed and nonexposed groups • Limitations • Inefficient for the evaluation of rare diseases • If prospective, can involve following large numbers of individuals into the future and are therefore very resource intensive • If retrospective, requires availability of adequate records • The possibility of loss to follow-up in cohort studies
Cross-Sectional Studies • A ‘snapshot’ of exposure and disease is determined because both exposure and disease outcome are determined simultaneously for each subject • The cases that are identified are prevalent cases, because they existed at the time of the study • Therefore, this design is aka prevalence study • When the outcome is serologic evidence of disease, this study design is called a seroprevalence study
Defined population Gather data on exposure and disease Not exposed, do not have disease Exposed, have disease Exposed, do not have disease Not exposed, have disease Cross-Sectional Studies
Cross-sectional Studies – 2-by-2 tables Compare prevalence of disease among exposed to prevalence of disease among unexposed: a vs. c Is this familiar? h1 h2 OR Compare prevalence of exposure among diseased to prevalence of exposure among those not diseased: a vs. b v1 v2
Cross-Sectional Studies – Strengths and Limitations • Strengths • May be first step in assessing an association • Efficient • Limitation • Cannot determine a temporal relationship • Therefore cannot make conclusions about causality • Because cross-sectional surveys consider prevalent rather than incident cases, the data reflect determinants of survival as well as etiology
Ecologic Studies • May be a first approach in determining whether an association exists • Rather than studying characteristics of individuals, characteristics of groups are studied • The results of ecologic studies should be interpreted with caution, but can be useful preliminary indicators to causal hypotheses that should be tested more thoroughly • Ecologic fallacy – the erroneous assumption that associations seen at group levels can be applied to the individual • Efficient study design because does not require follow-up or direct contact with individual subjects
Study validity • Internal • Whether the observed result is true • True only if following have been eliminated • Bias (systematic error) • Confounding (3rd var confounds association) • Random error (probability result due to chance) • External • Generalizability
Precise Yes No . . . . . . . . . . Yes . . . . . . . . . . . . . . . . Accurate No Precision vs. accuracy • Precision • AKA reliability, reproducibility • Extent to which repeated measurements of a relatively stable phenomenon fall closely to each other • Accuracy • AKA validity • Degree to which the results of a measurement correspond to true state of phenomenon being measured
How to design a study • Identify research question and refine to testable hypothesis • Choose design appropriate to question • Efficiency • Population • Unit of analysis • Consider • Sample size (power) • Sampling scheme • Choice of statistics • Resources (time, money, personnel) • Ethics • Data generation and management