1 / 14

Guide to Handling Missing Information

Guide to Handling Missing Information. C ontacting researchers Algebraic recalculations, conversions and approximations Imputation method (substituting missing data). Imputation Method . - When recalculations not possible -e.g. no standard deviation for a study

kerryn
Download Presentation

Guide to Handling Missing Information

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Guide to Handling Missing Information • Contacting researchers • Algebraic recalculations, conversions and approximations • Imputation method (substituting missing data)

  2. Imputation Method - When recalculations not possible -e.g. no standard deviation for a study • Use available data from other studies or other meta-analysis Imputation Method b. Multiple imputations Within study imputation

  3. Within-study imputation _ ~ SDj= XjƩikSDi ______ _ Method 1.(Means) ƩikXi ~ SDj = Standard deviation (SD) for missing data from study j _ Xj =Mean from study with missing SD ƩikSDi =Summation of all known SD from different studies _ =Summation of means from different studies other than j (ƩikXi)

  4. - ~ SDj= XjƩikSDi Assumptions ______ _ ƩikXi • Assumes SD to mean ratio is at the same scale for all studies • - Experimental scales can differ tremendously between different taxonomic groups or experimental designs

  5. Method 2.(sample size) ~ sj=α+β(nj) -Regression techniques • Reports sample size but missing information to calculate pooled SD (required for Hedge’s d). α = Intercept β = slope of the linear regression of nvss nj = observed sample size of the study with missing data

  6. ~ Assumptions sj=α+β(nj) • Assumes n (observed sample size of the study with missing data) is a good predictor s. Method 3.No. of studies ~ sj= Ʃiksj √ni _____ K √nj K= number of studies with complete information on s and n (sample size of individual study)

  7. Method 4. Follmanet al. (1992) Furukawa et al. (2006) ~ sj= √Ʃik[(ni-1)Ϭ2i] __________ √Ʃik(ni-1) Ϭ2= variance n= sample size of individual study

  8. Assumptions • Some degree of homogeneity among the observed SD and X across studies • Assume information is missing at random and not due to reporting biases (non-random) -Imputations retain their original units. -Large variations among estimates will bias imputations. _

  9. Multiple imputations • Use random sampling approach • Average repeated sampling for missing data Overall imputed synthesis

  10. Advantage of multiple imputations • Variability is explicitly modeled therefore do no treat imputed value as true observation • e.g. Does not account for error associated with α or β. ~ sj=α+β(nj)

  11. Methods: Multiple imputations • Various methods: use maximum likelihood or Bayesian models. • Requires specialized software • e.g. Hot Deck- To calculate pooled s but several SD values missing • Random sample of s drawn with replacement possible s • Process repeated with replacement from possible s • Repeat till we get “m” number of complete data sets

  12. Methods: Hot deck _ _ calculate effect size= δ for each(m) data Calculate variance = Ϭ2 (δl) set δ = Ʃlm= 1δl . _ Pooled effect size ___ m . . Variance= Ϭ2(δ)= Ʃlm= 1 Ϭ2(δl) + (1+1) Ʃlm= 1(δl – δ)2 _ _ _________ _ _________ m m m-1 Rubin and Schenker (1991)If 30% data missing->m= 3 If 50% data missing->m= 5

  13. Non-parametric analyses and bootstrapping • Alternative to Hedge’s d • Using weighting scheme • Does not require SD • E.g log response ratio lnR= lnXT XC If sample size available but no SD Ϭ2=(lnR)= nTnCnT+nC _ T= treatmentC= control ___ _ ___ Inverse of a simplified estimate of variance

  14. Effects of Imputation • No standardized method for imputation-> bias Rubin and Schenker (1991) e.g. • Appropriateness of imputed data can be evaluated using a sensitivity analysis • Benefits despite potential bias • Improved variance estimate (i.e. smaller CI) over exclusion • May potentially improve representation of null studies

More Related