Epidemiologic Methods- Fall 2001

Epidemiologic Methods- Fall 2001

Definitions of Epidemiology • The study of the distribution and determinants (causes) of disease • e.g. cardiovascular epidemiology • The method used to conduct human subject research • the methodologic foundation of any research where individual humans or groups of humans are the unit of observation

Understanding Measurement: Aspects of Reproducibility and Validity • Review Measurement Scales • Reproducibility • importance • sources of measurement variability • methods of assessment • by variable type: interval vs categorical • Validity • methods of assessment • gold standards present • no gold standard available

Clinical Research Sample Measure Analyze Infer

A study can only be as good as the data . . . -Martin Bland

Measurement Scales

Reproducibility vs Validity • Reproducibility • the degree to which a measurement provides the same result each time it is performed on a given subject or specimen • Validity • from the Latin validus - strong • the degree to which a measurement truly measures (represents) what it purports to measure (represent)

Reproducibility vs Validity • Reproducibility • aka: reliability, repeatability, precision, variability, dependability, consistency, stability • Validity • aka: accuracy

Relationship Between Reproducibility and Validity Good Reproducibility Poor Validity Poor Reproducibility Good Validity

Relationship Between Reproducibility and Validity Good Reproducibility Good Validity Poor Reproducibility Poor Validity

Why Care About Reproducibility? Impact on Validity • Mathematically, the upper limit of a measurement’s validity is a function of its reproducibility • Consider a study to measure height in the community: • Assume the measurement has imperfect reproducibility: if we measure height twice on a given person, we get two different values; 1 of the 2 values must be wrong (imperfect validity) • If study measures everyone only once, errors, despite being random, lead to biased inferences in descriptive and analytic work (i.e. lack validity)

Why Care About Reproducibility? Impact on Statistical Precision • Classical Measurement Theory: observed value (O) = true value (T) + measurement error (E) E is random and ~ N (0, 2E) When measuring a group of subjects, the variability of observed values is a combination of: the variability in their true values and measurement error 2O =2T + 2E

Why Care About Reproducibility? 2O =2T + 2E • More measurement error means more variability in observed measurements • More variability of observed measurements has profound influences on statistical precision/power: • RCT’s: power to detect a treatment difference is reduced • Observational studies: power to detect an influence of a particular risk factor upon a given disease is reduced.

Mathematical Definition of Reproducibility • Reproducibility • Varies from 0 (poor) to 1 (optimal) • As 2Eapproaches 0 (no error), reproducibility approaches 1

Phillips and Smith, J Clin Epi 1993

Sources of Measurement Variability • Observer • within-observer (intrarater) • between-observer (interrater) • Instrument • within-instrument • between-instrument • Subject • within-subject

Sources of Measurement Variability • e.g. plasma HIV viral load • observer: measurement to measurement differences in tube filling, time before processing • instrument: run to run differences in reagent concentration, PCR cycle times, enzymatic efficiency • subject: biologic variation in viral load

Assessing Reproducibility Depends on measurement scale • Interval Scale • within-subject standard deviation • coefficient of variation • Categorical Scale • Cohen’s Kappa

Reproducibility of an Interval Scale Measurement: Peak Flow • Assessment requires >1 measurement per subject • Peak Flow Rate in 17 adults (Bland & Altman)

Assessment by Simple Correlation

Pearson Product-Moment Correlation Coefficient • r (rho) ranges from -1 to +1 • r • r describes the strength of the association • r2 = proportion of variance (variability) of one variable accounted for by the other variable

r = 1.0 r = -1.0 r = 1.0 r = -1.0 r = 0.0 r = 0.8 r = 0.8 r = 0.0

Correlation Coefficient for Peak Flow Data r ( meas.1, meas. 2) = 0.98

Limitations of Simple Correlation for Assessment of Reproducibility • Depends upon range of data • e.g. Peak Flow • r (full range of data) = 0.98 • r (peak flow <450) = 0.97 • r (peak flow >450) = 0.94

Limitations of Simple Correlation for Assessment of Reproducibility • Depends upon ordering of data • Measures linear association only

1700 1500 1300 1100 900 Meas. 2 700 500 300 100 100 300 500 700 900 1100 1300 1500 1700 Meas 1

Limitations of Simple Correlation for Assessment of Reproducibility • Gives no meaningful parameter using the same scale as the original measurement

Within-Subject Standard Deviation • Mean within-subject standard deviation (sw) = 15.3 l/min

Computationally easier with ANOVA table: • Mean within-subject standard deviation (sw) :

sw: Further Interpretation • If assume that replicate results: • arenormally distributed • mean of replicates estimates true value • sw estimates true standard deviation • 95% of replicates within (1.96)(sw) of true value x  true value sw (1.96) (sw)

sw: Peak Flow Data • If assume that replicate results: • arenormally distributed • mean of replicates estimates true value • sw estimates true standard deviation • 95% of replicates within (1.96)(sw) = 30 l/min of true value x  true value sw = 15.3 l/min (1.96) (sw) = (1.96) (15.3) = 30

sw: Further Interpretation • Difference between any 2 replicates for same person = diff = meas1 - meas2 • Because var(diff) = var(meas1) + var(meas2), therefore, s2diff = sw2 + sw2 = 2sw2 sdiff

sw: Difference Between Two Replicates • If assume that differences: • arenormally distributed and mean of differences is 0 • sdiff estimates standard deviation • The difference between 2 measurements for the same subject is expected to be less than (1.96)(sdiff) = (1.96)(1.41)sw = 2.77sw for 95% of all pairs of measurements xdiff 0 sdiff (1.96) (sdiff)

sw: Further Interpretation • For Peak Flow data: • The difference between 2 measurements for the same subject is expected to be less than 2.77sw =(2.77)(15.3) = 42.4 l/min for 95% of all pairs • Bland-Altman refer to this as the “repeatability” of the measurement

One Common Underlying sw • Appropriate only if there is one sw • i.e, sw does not vary with true underlying value Kendall’s correlation coefficient = 0.17, p = 0.36 40 30 Within-Subject Std Deviation 20 10 0 100 300 500 700 Subject Mean Peak Flow

Another Interval Scale Example • Salivary cotinine in children (Bland-Altman) • n = 20 participants measured twice

Cotinine: Absolute Difference vs. Mean Kendall’s tau = 0.62, p = 0.001 4 3 Subject Absolute Difference 2 1 0 0 2 4 6 Subject Mean Cotinine

Logarithmic Transformation

Log Transformed: Absolute Difference vs. Mean Kendall’s tau=0.07, p=0.7 .6 .4 Subject abs log diff .2 0 -1 -.5 0 .5 1 Subject mean log cotinine

sw for log-transformed cotinine data • sw • back-transforming to native scale: • antilog(sw) = antilog(0.175) = 10 0.175 = 1.49

Coefficient of Variation • On the natural scale, there is not one common within-subject standard deviation for the cotinine data • Therefore, there is not one absolute number that can represent the difference any replicate is expected to be from the true value or from another replicate • Instead, = coefficient of variation

Cotinine Data • Coefficient of variation = 1.49 -1 = 0.49 • At any level of cotinine, the within-subject standard deviation of repeated measures is 49% of the level

Coefficient of Variation for Peak Flow Data • By definition, when the within-subject standard deviation is not proportional to the mean value, as in the Peak Flow data, then there is not a constant ratio between the within-subject standard deviation and the mean. • Therefore, there is not one common coefficient of variation • Estimating the the “average” coefficient of variation is not very meaningful

Peak Flow Data: Use of Coefficient of Variation when sw is Constant

Reproducibility of a Categorical Measurement: Chest X-Rays • On 2 different occasions, a radiologist is given the same 100 CXR’s from a group of high-risk smokers to evaluate for masses • How should reproducibility in reading be assessed?

Epidemiologic Methods- Fall 2001

Epidemiologic Methods- Fall 2001

Presentation Transcript

Epidemiologic Methods - Fall 2006

Epidemiologic Methods

The Use of Epidemiologic Methods in Disasters

Epidemiologic Methods Fall 2011 First 5 Lectures

Use of epidemiologic methods in disaster management

CMSC 671 Fall 2001

Physics 2053C – Fall 2001

Epidemiologic Methods - Fall 2009

Physics 2053C – Fall 2001

Introduction to Epidemiologic Methods

Epidemiologic Methods Fall 2012 First 5 Lectures

Epidemiologic Methods - Fall 2004

Epidemiologic Methods - Fall 2007

Physics 2053C – Fall 2001

Fall 2001 EOSP

Physics 2053C – Fall 2001

Physics 2053C – Fall 2001

CMSC 671 Fall 2001

CMS Fall 2001 Forum

CMSC 671 Fall 2001

CMSC 671 Fall 2001

CMSC 671 Fall 2001