340 likes | 357 Vues
This chapter delves into the analysis of panel data, discussing fixed effects models, random effects models, and the seemingly unrelated regression (SUR) model. Learn how these models are applied in economics and the estimation techniques used.
E N D
Chapter 15 Panel Data Analysis
What is in this Chapter? • This chapter discusses analysis of panel data. • This is a situation where there are observations on individual cross-section units over a period of time. • The chapter discusses several models for the analysis of panel data.
What is in this Chapter? • 1. Fixed effects models. • 2. Random effects models. • 3. Seemingly unrelated regression (SUR) model • 4. Random coefficient model.
Introduction • One of the early uses of panel data in economics was in the context of estimation of production functions. • The model used is now referred to as the "fixed effects" model and is given by
Introduction • This model is also referred to as the "least squares with dummy variables" (LSDV) model. • Theαiare estimated as coefficients of dummy variables.
The LSDV or Fixed Effects Model • Define
The LSDV or Fixed Effects Model • In the case of several explanatory variables, Wxx is a matrix and βand Wxy are vectors.
The OLS model • If we consider the hypothesis then the model is
Alternative method for the fixed effects model • where αi (i=1, 2…, N) and β (KX1 vector) are unknown parameters to be estimated.
Alternative method for the fixed effects model • As part of this study’s focus on the dynamic relationships between yit and xit(i.e.the β parameters) we take the ‘group difference’ between variables and redefine the equation as follows:
Alternative method for the fixed effects model • where * denotes variables deviated from the group mean (an example)
Industry and year dummies • Industry dummies • Using the first one-digit (or two-digit) of the firm’s SIC code. • Control for the potential variation across industries • Year dummies • Panel structure data • Year effect refers to the aggregate effects of unobserved factors in a particular year that affect all the companies equally
Industry and year dummies • Yi,t =0 + 1Xi,t + control variables + year dummies + industry dummies
The Random Effects Model • In the random effects model, the αiare treated as random variables rather than fixed constants. • The αiare assumed to be independent of the errors uu and also mutually independent. • This model is also known as the variance components model. • It became popular in econometrics following the paper by Balestra and Nerlove on the demand for natural gas.
The Random Effects Model • For the sake of simplicity we shall use only one explanatory variable. • The model is the same as equation (15.1) except that αi are random variables. • Since αi are random, the errors now are vit = αi + uit
The Random Effects Model • Since the errors are correlated, we have to use generalized least squares (GLS) to get efficient estimates. • However, after algebraic simplification the GLS estimator can be written in the simple form
The Random Effects Model • W refers to within-group • B refers to between-group • T refers to total
The SUR Model • Zeilner suggested an alternative method to analyze panel data, the seemingly unrelatedregression (SUR) estimation • In this model a GLS method is applied to exploit the correlations in the errors across cross-section units • The random effects model results in a particular type of correlation among the errors. It is an equicorrelated model. • In the SUR model the errors are independent over time but correlated across cross-section units:
The SUR Model • This type of correlation would arise if there are omitted variables that are common to all equations . • The estimation of the SUR model proceeds as follows. • We first estimate each of the N equations (for the cross-section units) by OLS. • We get the residuals . • Then we compute where k is the number of regressors. • After we get the estimates we use GLS on all the N equations jointly.
The SUR Model • If we have large N and small T this method is not feasible. • Also, the method is appropriate only if the errors are generated by a true multivariate distribution. • When the correlations are due to common omitted variables it is not clear whether the GLS method is superior to OLS. • The argument is similar to the one mentioned in Section 6.9. See "autocorrelation caused by omitted variables."
The Random Coefficient Model • If δ2 is large compared with υi, then the weights in equation (15.8) are almost equal and the weighted average would be close to simple average of the βi.
The Random Coefficient Model • In practice the GLS estimator cannot be computed because the parameters in equation (15.8 ) are not known. • To obtain these we estimate equations for the N cross-section units and get the residuals . • Then