Maria D. Fitzpatrick The B. E. Journal of Economic Analysis and Policy 2008

Starting School at Four: the Effect of Universal Pre-Kindergarten on Children’s Academic Achievement Maria D. Fitzpatrick The B. E. Journal of Economic Analysis and Policy 2008

Outline1. What effects of child care? (literature)2. The difference in difference framework3. The article’s estimation strategy & results3.1. Simple D-i-D 3.2. DDD 3.3. With synthetic control group4. Discussion

What input? preschool in the US • Preschool in the US: • Kindergarten • Pre-K • Head Start • Experiments: Abecedarian, Perry

What outcomes ? • Child development • Language abilities • Socialization, behavior ( tests) • School preparedness: later test scores • Even later: wages, crime, # of arrests … • Labor supply of mothers

Literature • Experiments: Abecedarian, Perry • Cost: • between 15 000 and 40 000 USD /year • vs. 4000 for universal Pre-K • Head Start : heterogeneous in practice  effects not that clear = trade-off between intensive targeted care and smaller-scale universal care?

 Articles to be reviewed • Cascio (JHR 2009): DiD, timing of free Kindergarten / maternal employment • Fitzpatrick (JoLE 2010): same theme, Regression Discontinuity Design • Cascio & al (QJE 2010) : effect of financial incentives / desegregation: : 2SLS + DD • Heckman & al (JPE 2010) rate of return to the HighScope Perry Preschool Program revisited (experimental design)

2. The D-in-D method

The “difference-in-difference” method • Intuition: • Comparing “before” and “after” situation of the treated  bias (many confounding factors) • Comparing “after” situation of treated and control  bias (composition effect)  rather look at change in thedifference in outcome between treated and control

The “difference-in-difference” method • Example (Wooldridge): impact of the building of an incinerator in 2005 on house prices in neighborhood • Define T=1 as “the house lies within 5 miles of the incinerator” • Estimating price2006= 0+ 1(T=1) + uyields, for example, = -30 000 (euros) = price T,2006 – price C,2006 = • Can you conclude that the building of the incinerator decreased neighborhood house prices by 30 000 € on average?

The “difference-in-difference” method • You need price data from before construction was even announced • Estimating price2002 = 0+ 1(T=1) + u yieldsdvd = -15 000 = price T,2002 – price C,2002 = • This shows that prices were already lower in the neighborhood where the incinerator would be built • The D-i-D estimator simply combines these 2 elements of information to estimate the causal impact of T

The “difference-in-difference” method • Average increase in the difference in prices between houses located near the incinerator (Near=1 : treated group) and those located far (Control) • This means we can easily derive and test whether it is significantly different from 0 by estimating which equation ? (priceC,2006 – priceT,2006) – (priceC,2002 – priceT,2002)

The “difference-in-difference” method price = 0 + 1.Near+ 0.date2 + 1.Near*date2 + u [where date2 is a variable that =1 if t = 2006] • What do 0, 0, 1 represent, in this model ?

The “difference-in-difference” method price = 0 + 1.Near+ 0.date2 + 1.Near*date2 + u b

price = 0 + 1.Near+ 0.date2 + 1.Near*date2 + u 1 measures the difference in average price change “difference”: between the neighborhood where the incinerator was built and the rest of the town “change”: between date 1 (2002) and date 2 (2006) b The “difference-in-difference” method

The “difference-in-difference” method price = 0 + 1.Near+ 0.date2 + 1.Near*date2 + u

The “difference-in-difference” method Price 0 • 1 • 1 • 0 0 1 2002 2006

The “difference-in-difference” method • In general: estimate y = 0 + 0.date2 + 1. (T=1)+ 1. (T=1)*date2 + controls + u • The treatment effect is given by coefficient 1 • It’s an ATE: Average Treatment Effect

The “difference-in-difference” method • Crucial assumption: • in the absence of treatment, the difference (once observables have been controlled for) between T & C would have remained constant = counterfactual

The “difference-in-difference” method • Same as: nothing but the treatment had an impact on T’s outcome trend but not on C’s • Under this assumption, the entire shift away from the common trend can be attributed to the treatment • If something else happens at the same time that could affect T & C differently, DD won’t identify causal effect

3. The paper’s strategy

The evaluated policy • Fall 1993: Georgia institutes lottery to fund targeted preschool • Fall 1995: too much money  preschool program becomes universal • Share of Georgian 4-year-olds enrolled: 8% (1993)  50% (1996)

The evaluated policy

The evaluated policy • Treatment on the treated? No • ATE of the availability of Pre-K • “Intention to treat” • What outcomes to consider? • Test scores in 4th grade • Grade retention

The evaluated policy

Important checks 1) Take-up and crowd out • If all increase is made of children new to preschool, total enrolment will have increased • If increase = only “crowd out” from children previously attending private preschool, no increase in total enrolment 2) Trends in Head Start • A pretty similar program  if changes at the same time, confounding factor

Important checks (1/2)

Important checks (2/2)

Estimation • The basic D-i-D model : • Where : • Yijt is test score of pupil i in school j on year t • Statei and yeart are state and year fixed effects • Georgia*after takes the value 1 if student 1 belongs to a cohort having had access to universal pre-K • Who are we comparing to whom?

Estimation • Potential bias if composition of test-taking population changed in GA relative to other states over time  added controls : • Where : • Xijt is a vector of characteristics of pupil i in school j on year t • Zjt is a vector of school j characteristics on year t

4. Results

Estimation strategy

Results

A DDD specification • Check for further confounding factors: add 8th graders to regression • The model becomes

A DDD specification • The estimated coeff before the Georgia*after*fourth dummy is [(GA 4th after - other States 4th after)- (GA 4th before - other States 4th before)] - [(GA 8th after- other States 8th after) - (GA 8th before - other States 8th before)]

A DDD specification • If something else than the policy at hand (universal Pre-K) caused school results of Georgian kids to shift away from those of other states, the D-i-D on 8th graders will catch that shift • Then the difference between that shift and the one observed on 4th graders would be the policy effect

A DDD specification • Assumption here = ? • Shift between GA & other states not caused by the program = same for 4th & 8th graders • Put differently: other factors (not the program) affected 4th & 8th graders in the same way

The “synthetic control” method • Abadie, Diamond, Hainmueller (2007): aggregrated data (e.g. at region level...) • One unit is treated • There are several control units

The “synthetic control” method • Instead of choosing one control group they construct a counterfactual outcome as a linear combination of all non treated outcomes • Each control unit is weighted by its distance to the treated unit (according to several predictors of the outcome)

The ADH method • Minimize distance between GA “before” and “before” synthetic control made of all other states (k=2 to N) with weights wk • = minimize distance between and • Uses both y and X

The ADH method Treatment effect can then be estimated the usual D-i-D way If is small enough, treatment effect is estimated by

The ADH method • Here: a “placebo DD test” with treatment between 1996 and 2000 yields an “effect” • Math scores decreased by 1.7% of an SD in GA relative to other states • Idea: compare GA to states that show the same trend before treatment • Even better: compare GA to the linear combination of states that best fits pre-treatment trend

Results

Results • Statistically insignificant though positive state-wide effects  heterogeneous effects?

Results

Discussion • Effects not very large, not always statistically significant • What could cause this even if pre-K had an effect on school preparedness? 12% in Head Start, 15 to 35% already in some non-subsidized preschool  many “treated” in the control group Only “intent to treat” effect  assuming all increase = new participants, ATT = much stronger

Discussion • Seems that benefits are more prominent for some groups : rural, non-white • Cost-benefit analysis: better grades  better wages  more taxes • but huge costs ($300M) outweigh benefits (<$50M) • Public policy recommendation: target publicly funded pre-K programs

Maria D. Fitzpatrick The B. E. Journal of Economic Analysis and Policy 2008

Maria D. Fitzpatrick The B. E. Journal of Economic Analysis and Policy 2008

Presentation Transcript

Journal : B

B D E

Macro Economic Policy Analysis Applications

Economic Analysis and Statistics, Policy Sector /

by Robert B. Fitzpatrick, Esq. Robert B. Fitzpatrick, PLLC Universal Building South

by Robert B. Fitzpatrick, Esq. Robert B. Fitzpatrick, PLLC Universal Building South

Measurement of B (D +   + ) and B ( D +   + )

G. B. E. D.

Economic Analysis of Proposed Dairy Policy Reforms

Economic Analysis of Proposed Dairy Policy Reforms

Economic Research and Policy Analysis Branch

by Robert B. Fitzpatrick, Esq. Robert B. Fitzpatrick, PLLC Universal Building South

B,C,D,E

Dr Jim Fitzpatrick Managing Director Fitzpatrick Associates, Economic Consultants

Maria NAGORNII Head of the Analysis, Monitoring and Policy Assessment Department

Structural Policy and Economic Analysis Unit

Foundations Review Questions/ answers B D B C A E B C D A B D E E C B B D C A

B,C,D,E