1 / 38

Econometric Analysis of Panel Data

Econometric Analysis of Panel Data. William Greene Department of Economics Stern School of Business. The Random Effects Model. The random effects model c i is uncorrelated with x it for all t; E[c i | X i ] = 0 E[ ε it | X i ,c i ]=0. Error Components Model.

leon
Download Presentation

Econometric Analysis of Panel Data

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Econometric Analysis of Panel Data William Greene Department of Economics Stern School of Business

  2. The Random Effects Model • The random effects model • ci is uncorrelated with xit for all t; • E[ci |Xi] = 0 • E[εit|Xi,ci]=0

  3. Error Components Model Generalized Regression Model

  4. Notation

  5. Maximum Likelihood

  6. MLE Panel Data Algebra (1)

  7. MLE Panel Data Algebra (1, cont.)

  8. MLE Panel Data Algebra (1, conc.)

  9. Maximizing the Log Likelihood • Difficult: “Brute force” + some elegant theoretical results: See Baltagi, pp. 22-23. (Back and forth from GLS to ε2 and u2.) • Somewhat less difficult and more practical: At any iteration, given estimates of ε2 and u2 the estimator of  is GLS (of course), so we iterate back and forth between these. See Hsiao, pp. 39-40.

  10. Direct Maximization of LogL

  11. Maximum Simulated Likelihood

  12. Likelihood Function for Individual i

  13. Log Likelihood Function

  14. Computing the Expected LogL Example: Hermite Quadrature Nodes and Weights, H=5 Nodes: -2.02018,-0.95857, 0.00000, 0.95857, 2.02018 Weights: 1.99532,0.39362, 0.94531, 0.39362, 1.99532 Applications usually use many more points, up to 96 and Much more accurate (more digits) representations.

  15. Quadrature

  16. Gauss-Hermite Quadrature

  17. Simulation

  18. Convergence Results

  19. MSL vs. ML .154272 = .023799

  20. Two Level Panel Data • Nested by construction • Unbalanced panels • No real obstacle to estimation • Some inconvenient algebra. • In 2 step FGLS of the RE, need “1/T” to solve for an estimate of σu2. What to use?

  21. Balanced Nested Panel Data Zi,j,k,t = test score for student t, teacher k, school j, district i L = 2 school districts, i = 1,…,L Mi = 3 schools in each district, j = 1,…,Mi Nij = 4 teachers in each school, k = 1,…,Nij Tijk = 20 students in each class, t = 1,…,Tijk Antweiler, W., “Nested Random Effects Estimation in Unbalanced Panel Data,” Journal of Econometrics, 101, 2001, pp. 295-313.

  22. Nested Effects Model

  23. GLS with Nested Effects

  24. Unbalanced Nested Data • With unbalanced panels, all the preceding results fall apart. • GLS, FGLS, even fixed effects become analytically intractable. • The log likelihood is very tractable • Note a collision of practicality with nonrobustness. (Normality must be assumed.)

  25. Log Likelihood (1)

  26. Log Likelihood (2)

  27. Maximizing Log L • Antweiler provides analytic first derivatives for gradient methods of optimization. Ugly to program. • Numerical derivatives:

  28. Asymptotic Covariance Matrix

  29. An Appropriate Asymptotic Covariance Matrix

  30. Some Observations • Assuming the wrong (e.g., nonnested) error structure • Still consistent – GLS with the wrong weights • Standard errors (apparently) biased downward (Moulton bias) • Adding “time” effects or other nonnested effects is “very challenging.” Perhaps do with “fixed” effects (dummy variables).

  31. An Application • Y1jkt = log of atmospheric sulfur dioxide concentration at observation station k at time t, in country i. • H = 2621, 293 stations, 44 countries, various numbers of observations, not equally spaced • Three levels, not 4 as in article. • Xjkt =1,log(GDP/km2),log(K/L),log(Income), Suburban, Rural,Communist,log(Oil price), average temperature, time trend.

  32. Estimates

  33. Rotating Panel-1 The structure of the sample and selection of individuals in a rotating sampling design are as follows: Let all individuals in the population be numbered consecutively. The sample in period 1 consists of N, individuals. In period 2, a fraction, met (0 <me2 < N1) of the sample in period 1 are replaced by mi2 new individuals from the population. In period 3 another fraction of the sample in the period 2, me2 (0 < me2<N2)individuals are replaced by mi3new individuals and so on. Thus the sample size in period t is Nt = {Nt-1 - met-1 + mii}. The procedure of dropping met-1 individuals selected in period t - 1 and replacing them by mitindividuals from the population in period t is called rotating sampling. In this framework total number of observations and individuals observed are ΣtNt and N1 + Σt=2 to Tmitrespectively. Heshmati, A,“Efficiency measurement in rotating panel data,” Applied Economics, 30, 1998, pp. 919-930

  34. Rotating Panel-2 The outcome of the rotating sample for farms producing dairy products is given in Table 1. Each of the annual sample is composed of four parts or subsamples. For example, in 1980 the sample contains 79, 62, 98, and 74 farms. The first three parts (79, 62, and 98) are those not replaced during the transition from 1979 to 1980. The last subsample contains 74 newly included farms from the population. At the same time 85 farms are excluded from the sample in 1979. The difference between the excluded part (85) and the included part (74) corresponds to the changes in the rotating sample size between these two periods, i.e. 313-324 = -11. This difference includes only the part of the sample where each farm is observed consecutively for four years, Nrot. The difference in the non-rotating part, N„„„, is due to those farms which are not observed consecutively. The proportion of farms not observed consecutively, Nnonin the total annual sample, Nnonvaries from 11.2 to 22.6% with an average of 18.7 per cent.

  35. Rotating Panels-3 • Simply an unbalanced panel • Treat with the familiar techniques • Accounting is complicated • Time effects may be complicated. • Biorn and Jansen (Scand. J. E., 1983) households cohort 1 has T = 1976,1977 while cohort 2 has T=1977,1978. • But,… “Time in sample bias…” may require special treatment. Mexican labor survey has 3 periods rotation. Some families in 1 or 2 or 3 periods.

  36. Pseudo Panels

More Related