340 likes | 428 Views
Chapter 15. Panel Data Analysis. What is in this Chapter?. This chapter discusses analysis of panel data. This is a situation where there are observations on individual cross-section units over a period of time. The chapter discusses several models for the analysis of panel data.
E N D
Chapter 15 Panel Data Analysis
What is in this Chapter? • This chapter discusses analysis of panel data. • This is a situation where there are observations on individual cross-section units over a period of time. • The chapter discusses several models for the analysis of panel data.
What is in this Chapter? • 1. Fixed effects models. • 2. Random effects models. • 3. Seemingly unrelated regression (SUR) model • 4. Random coefficient model.
Introduction • One of the early uses of panel data in economics was in the context of estimation of production functions. • The model used is now referred to as the "fixed effects" model and is given by
Introduction • This model is also referred to as the "least squares with dummy variables" (LSDV) model. • Theαiare estimated as coefficients of dummy variables.
The LSDV or Fixed Effects Model • Define
The LSDV or Fixed Effects Model • In the case of several explanatory variables, Wxx is a matrix and βand Wxy are vectors.
The OLS model • If we consider the hypothesis then the model is
Alternative method for the fixed effects model • where αi (i=1, 2…, N) and β (KX1 vector) are unknown parameters to be estimated.
Alternative method for the fixed effects model • As part of this study’s focus on the dynamic relationships between yit and xit(i.e.the β parameters) we take the ‘group difference’ between variables and redefine the equation as follows:
Alternative method for the fixed effects model • where * denotes variables deviated from the group mean (an example)
Industry and year dummies • Industry dummies • Using the first one-digit (or two-digit) of the firm’s SIC code. • Control for the potential variation across industries • Year dummies • Panel structure data • Year effect refers to the aggregate effects of unobserved factors in a particular year that affect all the companies equally
Industry and year dummies • Yi,t =0 + 1Xi,t + control variables + year dummies + industry dummies
The Random Effects Model • In the random effects model, the αiare treated as random variables rather than fixed constants. • The αiare assumed to be independent of the errors uu and also mutually independent. • This model is also known as the variance components model. • It became popular in econometrics following the paper by Balestra and Nerlove on the demand for natural gas.
The Random Effects Model • For the sake of simplicity we shall use only one explanatory variable. • The model is the same as equation (15.1) except that αi are random variables. • Since αi are random, the errors now are vit = αi + uit
The Random Effects Model • Since the errors are correlated, we have to use generalized least squares (GLS) to get efficient estimates. • However, after algebraic simplification the GLS estimator can be written in the simple form
The Random Effects Model • W refers to within-group • B refers to between-group • T refers to total
The SUR Model • Zeilner suggested an alternative method to analyze panel data, the seemingly unrelatedregression (SUR) estimation • In this model a GLS method is applied to exploit the correlations in the errors across cross-section units • The random effects model results in a particular type of correlation among the errors. It is an equicorrelated model. • In the SUR model the errors are independent over time but correlated across cross-section units:
The SUR Model • This type of correlation would arise if there are omitted variables that are common to all equations . • The estimation of the SUR model proceeds as follows. • We first estimate each of the N equations (for the cross-section units) by OLS. • We get the residuals . • Then we compute where k is the number of regressors. • After we get the estimates we use GLS on all the N equations jointly.
The SUR Model • If we have large N and small T this method is not feasible. • Also, the method is appropriate only if the errors are generated by a true multivariate distribution. • When the correlations are due to common omitted variables it is not clear whether the GLS method is superior to OLS. • The argument is similar to the one mentioned in Section 6.9. See "autocorrelation caused by omitted variables."
The Random Coefficient Model • If δ2 is large compared with υi, then the weights in equation (15.8) are almost equal and the weighted average would be close to simple average of the βi.
The Random Coefficient Model • In practice the GLS estimator cannot be computed because the parameters in equation (15.8 ) are not known. • To obtain these we estimate equations for the N cross-section units and get the residuals . • Then