1 / 31

TRYGVE HAAVELMO AND THE EMERGENCE OF CAUSAL CALCULUS

TRYGVE HAAVELMO AND THE EMERGENCE OF CAUSAL CALCULUS. Judea Pearl University of California Los Angeles (www.cs.ucla.edu/~judea). OUTLINE. Haavelmo’s major breakthroughs The evolution of causal calculus What causal calculus can do for us? Will Haavelmo’s legacy survive?.

pete
Download Presentation

TRYGVE HAAVELMO AND THE EMERGENCE OF CAUSAL CALCULUS

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. TRYGVE HAAVELMO AND THE EMERGENCE OF CAUSAL CALCULUS Judea Pearl University of California Los Angeles (www.cs.ucla.edu/~judea)

  2. OUTLINE • Haavelmo’s major breakthroughs • The evolution of causal calculus • What causal calculus can do for us? • Will Haavelmo’s legacy survive?

  3. HAAVELMO’S THREE INSIGHTS • An economic model is a set of hypothetical experiments, qualitatively encoded in a system of equations. • An economic model is capable of answering policy intervention questions, with no further assistance from the modeller. • There is a formal way of taking an arbitrary model, combining it with data, and derive valid answers to policy questions .

  4. WHAT IS AN ECONOMIC MODEL? “We have in mind some actual experiment, or some design of an experiment, which we could at least imagine arranging." (Haavelmo, 1944) His example: What information did the modeller intenda to carry in Eq. 1 and what information would a provide if we are able to estimate its value. Perhaps a=g[P(x,y,z...)] ? There is no such g[ ].

  5. P Joint Distribution Q(P) (Aspects of P) Data Inference TRADITIONAL STATISTICAL INFERENCE PARADIGM e.g., Infer whether customers who bought product A would also buy product B. Q = P(B | A)

  6. THE STRUCTURAL MODEL PARADIGM Joint Distribution Data Generating Model Q(M) (Aspects of M) Data M Inference • M – Invariant strategy (mechanism, recipe, law, protocol) by which Nature assigns values to variables in the analysis. “A painful de-crowning of a beloved oracle!”

  7. CLINGING DESPARATELY TO AN OLD ORACLE Fifty-two years later (1995): “I am speaking, of course, about the equation: {y = a + bx + }. What does it mean? The only meaning I have ever determined for such an equation is that it is a shorthand way of describing the conditional distribution of {y} given {x}.” (Holland, 1995, p. 54) “Beneath all multiple equation models, there is only a set of conditional distributions.” (Richard Berk, 2004, p. 196)

  8. ECONOMISTS CLINGING TO THE OLD ORACLE Fifty-two years later (1995): “For example, in the model: until the relationship between zt and ut is specified, the meaning of  is uncertain since E [ zt ut] could be either zero or non-zero on the information provided.”                               (Hendry, 1995) “The joint density is the basis: SEMs are merely aninterpretation of that.”     (1998, personal communication)

  9. ECONOMISTS CLINGING TO THE OLD ORACLE Sixty-four years later (2007): “A state implements tough new penalties on drunk drivers:What is the effect on highway fatalities?... [This effect] is an unknown characteristic of the population joint distribution of X and Y.”        (Stock and Watson, 2007, Introduction to Econometrics, Chapter 4, pp. 111) Sixty-eight years later (2011):“An econometric model specifies the statistical relationshipthat is believed to hold between the various economicquantities pertaining to  particular economic phenomenalunder study.” (Wikipedia – “Econometric Models,” 2011)

  10. WHAT CAN OUR NEW ORACLE DO? Haavelmo: Provide policy advice. “to predict consumption,... under the Government policy,... we may use the `theoretical’ equations obtained.”  (1943, p.12) Skeptics: “How can anyone predict outcomes of experiments that where never performed?'' Haavelmo:“this is only natural, because now the Government is, in fact, performing `experiments' of the type we had in mind when constructing each of the two equations.'' (1943, p.12)

  11. HOW CAN OUR NEW ORACLE DO ALL THAT? (THE ALGORITHMIZATION OF INTERVENTION) “Assume that the Government decides, through public spending, taxation, etc., to keep income, rt, at a given level,... the only change in the system being that, instead of we now have where giis Government expenditure, so adjusted as to keep rconstant, whatever be uand v,...'' (1943, p. 12) Haavelmo “surgery”:Modify an equation by adding an adjustable forcegi, while keeping all other equations in tact.

  12. u r y v THE EVOLUTION OF CAUSAL CALCULUS • Haavelmo's surgery (1943) • Add adjustable force • Strotz and Wold surgery (1960). “Wipe out” the equation • and replace it with • Graphical surgery (Spirtes et al., 1993; Pearl, 1993). • Wipe out incoming arrows to r • do-calculus (Pearl, 1994) •   Structural counterfactuals (Balke and Pearl, 1995) • Yr(u) = Y(u)in themodel • Unification with Neyman-Rubin Yx(u) and Lewis (1973)

  13. Joint probabilities of counterfactuals: In particular: ALGORITHMIZATION OF COUNTERFACTUALS Semantics: The counterfactual Yx(u) = y, which reads “Y would be y (in situation u), had X been x” is said to be true, if Y = y in the x-mutilated model, under exogenous conditions U = u.

  14. AXIOMS OF CAUSAL COUNTERFACTUALS Y would bey, hadXbeenx(in stateU = u) • Definiteness • Uniqueness • Effectiveness • Composition • Reversibility

  15. Data (D) Q - Estimates of Q(P) THE LOGIC OF CAUSAL ANALYSIS CAUSAL MODEL (MA) CAUSAL MODEL (MA) A - CAUSAL ASSUMPTIONS A* - Logical implications of A Causal inference Q Queries of interest Q(P) - Identified estimands T(MA) - Testable implications Statistical inference Goodness of fit Provisional claims Model testing

  16. Model 2 (Linear version) Model 1 Z1 Z2 W1 W2 Z3 X W3 Y ILLUSTRATING THE WORKING OF CAUSAL CALCULUS U’s are mutually independent

  17. Model 1 Z1 Z2 W1 Testable implications (FINITE!): Z1Z2 Z1Y | {X1,Z2,Z3}, Z2X | {Z1, Z3}. W2 Z3 X W3 Y WHAT ARE THE TESTABLE IMPLICATIONS OF THE MODEL? Missing edges: Z1–Z2, Z1 – Y, Z2– X . . . Separating sets: {0}, {X, Z2, Z3}, {Z1, Z3} . . . Zero regression coefficients: r{Z1 Z2} = 0 and r{YZ1• XZ2Z3} = 0 These imply *all* misspecification tests

  18. Z1 Z2 W1 W2 Z3 X W3 Y IMPLIED MISSPECIFICATION TESTS Question 4:If we regress Z1 on all other variables in the model, which regression coefficient will be zero? Answer: All but these three Question 5:If we regress Z1on Z3 and W1, which regression coefficient will change if we add Y as a regressor? Answer:The coefficient of Z3 will change and the coefficient of W1 will remain invariant. Non-invariance may not be misspecification.

  19. Z1 Z2 W1 W2 Z3 X W3 Y CAUSAL CALCULUS AS AN ORACLE FOR INTERVENTIONS • Suppose we wish to estimate the average causal effect of X on Y • Which subsets of variables need to be adjusted to • obtain an unbiased estimate of ACE? • Is there a single variable that, if measured, would • allow an unbiased estimate of ACE?

  20. Watch out! No, no! EFFECT OF WARM-UP ON INJURY (After Shrier & Platt, 2008) ??? Front Door Warm-up Exercises (X) Injury (Y)

  21. THE MACHINERY OF CAUSAL CALCULUS • Rule 1:Ignoring observations • P(y |do{x},z, w) = P(y |do{x},w) • Rule 2:Action/observation exchange • P(y |do{x},do{z}, w) = P(y|do{x},z,w) • Rule 3: Ignoring actions • P(y |do{x},do{z},w) = P(y|do{x},w)

  22. DERIVATION IN CAUSAL CALCULUS Genotype (Unobserved) Smoking Tar Cancer Probability Axioms P (c |do{s})=tP (c |do{s},t) P (t |do{s}) Rule 2 = tP (c |do{s},do{t})P (t |do{s}) Rule 2 = tP (c |do{s},do{t})P (t | s) Rule 3 = tP (c |do{t})P (t | s) Probability Axioms = stP (c |do{t},s) P (s|do{t})P(t |s) Rule 2 = stP (c | t, s) P (s|do{t})P(t |s) Rule 3 = stP (c | t, s) P (s) P(t |s)

  23. Z1 Z2 W1 W2 Z3 X W3 Y WHAT ELSE CAN CAUSAL CALCULUS DO FO US? • Equivalent models • Identifying Counterfactual • queries (ETT, PC) • Mediation • Causes of Effects • External validity Finding instrumentsIs there an instrumental variable for the Z3 Y relationship? Answer: NoCan we turn Z1 into an IV? Answer: Yes, condition on W1.

  24. WILL HAAVELMO'S LEGACY SURVIVE? • Two attempted hijackings • The regressionism assault (1975 - 2000)Economic model is a parameterized family ofjoint distributions.

  25. THE CAUSAL RENAISSANCE: VOCABULARY IN ECONOMICS From Hoover (2004) “Lost Causes”

  26. WILL HAAVELMO'S LEGACY SURVIVE? • Two attempted hijackings • The regressionism assault (1975 - 2000)Economic model is a parameterized family ofjoint distributions. • The “instrumentalism” assault (1995 - ) • Economic analysis is a potential clinical trial with missing • data (potential outcomes) and one treatment. • Instruments are needed, no structure. • The structuralist defense • Finding an instrument requires structural assumptions. • The "experimentalists" choice of language deprives them • from making those assumptions transparent.

  27. 1. English: Smoking (X), Cancer (Y), Tar (Z), Genotypes (U) U Y X Z 2. Counterfactuals: 3. Structural: Z X Y FORMULATING ASSUMPTIONS THREE LANGUAGES

  28. THE STRUCTURAL-COUNTERFACTUAL SYMBIOSIS • Express theoretical assumptions in structural language. • Express queries in counterfactual language. • Translate (1) into (2) for algebraic analysis,   Or (2) into (1) for graphical analysis. • Use either graphical or algebraic machinery   to answer the query in (2).

  29. CONCLUSIONS • Formal basis for causal and counterfactual inference (complete) • Unification of the graphical, potential-outcome and structural equation approaches • Friendly and formal solutions to century-old problems and confusions.

  30. CONCLUSIONS He is wise who bases causal inference on an explicit causal structure that is defensible on scientific grounds. After Haavelmo, 1943

  31. Thank you

More Related