1 / 72

Causal Inference for Complex Observational Data Using Stata

Causal Inference for Complex Observational Data Using Stata. Chuck Huber StataCorp chuber@stata.com. ERMs Outline. Description of the dataset Unobserved confounding and endogeneity Nonrandom treatment assignment Missing not at random (MNAR) and selection bias Treatment effects.

olina
Download Presentation

Causal Inference for Complex Observational Data Using Stata

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Causal Inference for ComplexObservational Data Using Stata Chuck Huber StataCorp chuber@stata.com

  2. ERMs Outline • Description of the dataset • Unobserved confounding and endogeneity • Nonrandom treatment assignment • Missing not at random (MNAR) and selection bias • Treatment effects

  3. The Research Question • Fictional State University (FSU) has developed a new study-skills program with the goal of improving the grade point averages of their students.

  4. The Data

  5. The Data

  6. The Data

  7. The Data

  8. The Data

  9. The Data

  10. The Data

  11. The Data Students who participated in the program had lower GPAs?!?!?

  12. The Data

  13. The Data Students who participated in the program had higher GPAs when we account for high school GPA.

  14. The Data

  15. The Data

  16. The Data What was the effect of the study program on students GPAs?

  17. Outline • Description of the dataset • Unobserved confounding and endogeneity • Nonrandom treatment assignment • Missing not at random (MNAR) and selection bias • Treatment effects

  18. Observed and Unobserved Factors

  19. Endogeneity “An explanatory variable in a multiple regression model that is correlated with the error term…” (Wooldridge*, pg 838). *Jeffrey M. Wooldridge (2009) Introductory Econometrics: A Modern Approach, 4th ed.

  20. Omitted Variable Bias

  21. Confounding “…X and Y are confounded when there is a third variable Z that influences both X and Y…” (Pearl*, pg 193). *Judea Pearl (2009) Causality: Models, Reasoning, and Inference, 2nd ed.

  22. Unobserved Confounding

  23. Observed and Unobserved Factors High school GPA SAT Scores Parents Income Sex etc… Ability Motivation Sleep Support etc…

  24. Unobserved Confounding )

  25. Unobserved Confounding and Endogeneity ) )* )* hsgpa = (factors NOT related to Ability) + (Ability + error)

  26. Unobserved Confounding and Endogeneity ) hsgpa Ability income gpa ε

  27. Unobserved Confounding and Endogeneity )* )* hs_comp hsgpa ε2 Ability income gpa ε1

  28. Unobserved Confounding and Endogeneity hs_comp hsgpa ε2 income gpa ε1

  29. Unobserved Confounding and Endogeneity )* )* ε2 hs_comp hsgpa Ability Ability income gpa ε1

  30. Unobserved Confounding and Endogeneity )* )* hs_comp hsgpa ε2 Ability ε1 income gpa

  31. Unobserved Confounding and Endogeneity hs_comp hsgpa ε2 income gpa ε1

  32. Unobserved Confounding and Endogeneity

  33. Unobserved Confounding and Endogeneity

  34. Unobserved Confounding and Endogeneity Primary model eregressgpa income, /// endogenous(hsgpa = hs_comp income) Auxillary model

  35. Unobserved Confounding and Endogeneity

  36. Unobserved Confounding and Endogeneity

  37. Unobserved Confounding and Endogeneity

  38. Outline • Description of the dataset • Unobserved confounding and endogeneity • Nonrandom treatment assignment • Missing not at random (MNAR) and selection bias • Treatment effects

  39. Random Treatment Assignment

  40. Nonrandom Treatment Assignment

  41. Nonrandom Treatment Assignment A student’s decision to enroll in the study program is based on observed and unobserved factors.

  42. Unobserved Confounding )

  43. Endogenous Treatment ) )* )* P(program=1) = (factors NOT related to Ability) + (Ability + error)

  44. Endogenous Treatment hs_comp hsgpa ε2 income gpa ε1 scholarship P(program=1) ε3

  45. Endogenous Treatment Primary model eregressgpa income, /// endogenous(hsgpa = hs_comp income) /// entreat(program = income scholarship, nointeract) Auxillary model

  46. Endogenous Treatment

  47. Endogenous Treatment

  48. Endogenous Treatment

  49. Outline • Description of the dataset • Unobserved confounding and endogeneity • Nonrandom treatment assignment • Missing not at random (MNAR) and selection bias • Treatment effects

  50. No Missingness

More Related