280 likes | 465 Views
Local model uncertainty and Incomplete-data bias. S. Eguchi, ISM & GUAS This talk was a part of co-work with J. Copas, University of Warwick. Hidden Bias. Publication bias - not all studies are reviewed. Confounding - causal effect only partly explained.
E N D
Local model uncertainty and Incomplete-data bias S. Eguchi, ISM & GUAS This talk was a part of co-work with J. Copas, University of Warwick
Hidden Bias Publication bias -not all studies are reviewed Confounding -causal effect only partly explained Measurement error -errors in measure of exposure
Lung cancer & passive smoking 30 25 20 study 15 10 5 0.5 0.3 1.5 2.0 3.0 4.0 5.0 10.0 1.0 Odds ratio
Passive smoke and lung cancer Log relative risk estimates (j =1,…,30) from 30 2x2tables The estimated relative risk 1.24 with 95% confidence interval (1.13, 1.36)
Conventional analysis 30 25 20 study 15 10 5 1.24 0.5 0.3 1.5 2.0 3.0 4.0 5.0 10.0 1.0 Odds ratio
Incomplete Data z = (data on all studies, selection indicators) y = (data on selected studies) z = (response, treatment, potential confounders) y = (response, treatment) z = (disease status, true exposure, error) y = (disease status, observed exposure) y = h(z)
Level Sets of h(z) 1. One-to-one 2. Missing 3. Measurement error 5. Competing risk 4. Interval censor 6.Hidden confounder
Ignorable incompleteness Let Y= h(Z) be a many-to-one mapping. Z is complete; Y is incomplete If Zhas then Y has
Tubular Neighborhood Model Near-model M Copas, Eguchi (2001)
Near model Model Near-model
From pure misspecification biased perturbed Unbiased perturbed h
Nonignorable missingness The model assumes MCAR or MAR
Problem in estimation of bias The nonignorable model gives the worst case if However is inestimable and untestable: The profile likelihood is flat at
Sensitivity analysis The most sensitive model Estimating function of q with fixed e, w
Scenarios A, B, C Inference from using fY Scenario A: Scenario C: Scenario B:
Scenarios A and C Scenario A: Scenario C:
Scenario B Conditional confidence interval
Passive smoke and lung cancer The estimated relative risk 1.24 with 95% confidence interval (1.13, 1.36) Square root rule 95% confidence interval (1.08, 1.41)
Root-2-rule 30 25 20 study 15 10 5 1.24 0.5 0.3 1.5 2.0 3.0 4.0 5.0 10.0 1.0 Odds ratio
Present and Future Does all this matter? Statistics ( missing data, response bias, censoring) Biostatistics (drop-outs, compliance) Epidemiology ( confounding, measurement error) Econometrics (identifiability, instruments) Psychometrics (publication bias, SEM) causality, counter-factuals, ...