Chapter 10: Basic Linear Unobserved Effects Panel Data. Models:



Similar documents
ON THE ROBUSTNESS OF FIXED EFFECTS AND RELATED ESTIMATORS IN CORRELATED RANDOM COEFFICIENT PANEL DATA MODELS

Panel Data: Linear Models

FIXED EFFECTS AND RELATED ESTIMATORS FOR CORRELATED RANDOM COEFFICIENT AND TREATMENT EFFECT PANEL DATA MODELS

SYSTEMS OF REGRESSION EQUATIONS

What s New in Econometrics? Lecture 8 Cluster and Stratified Sampling

Clustering in the Linear Model

Econometric Methods for Panel Data

Department of Economics Session 2012/2013. EC352 Econometric Methods. Solutions to Exercises from Week (0.052)

Correlated Random Effects Panel Data Models

Introduction to Regression Models for Panel Data Analysis. Indiana University Workshop in Methods October 7, Professor Patricia A.

ECON 142 SKETCH OF SOLUTIONS FOR APPLIED EXERCISE #2

A Subset-Continuous-Updating Transformation on GMM Estimators for Dynamic Panel Data Models

Employer-Provided Health Insurance and Labor Supply of Married Women

MODELS FOR PANEL DATA Q

Marketing Mix Modelling and Big Data P. M Cain

Lecture 3: Differences-in-Differences

The Loss in Efficiency from Using Grouped Data to Estimate Coefficients of Group Level Variables. Kathleen M. Lang* Boston College.

Examining the effects of exchange rates on Australian domestic tourism demand: A panel generalized least squares approach

1 Introduction. 2 The Econometric Model. Panel Data: Fixed and Random Effects. Short Guides to Microeconometrics Fall 2015

Spatial panel models

Wooldridge, Introductory Econometrics, 3d ed. Chapter 12: Serial correlation and heteroskedasticity in time series regressions

Chapter 2. Dynamic panel data models

Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model

DISCUSSION PAPER SERIES

Fixed Effects Bias in Panel Data Estimators

1. THE LINEAR MODEL WITH CLUSTER EFFECTS

Implementing Panel-Corrected Standard Errors in R: The pcse Package

Panel Data Analysis in Stata

IMPACT EVALUATION: INSTRUMENTAL VARIABLE METHOD

Is Infrastructure Capital Productive? A Dynamic Heterogeneous Approach.

Do Supplemental Online Recorded Lectures Help Students Learn Microeconomics?*

Econometrics Simple Linear Regression

On Marginal Effects in Semiparametric Censored Regression Models

ANALYSIS OF FACTOR BASED DATA MINING TECHNIQUES

Panel Data Analysis Josef Brüderl, University of Mannheim, March 2005

Accounting for Time-Varying Unobserved Ability Heterogeneity within Education Production Functions

UNIVERSITY OF WAIKATO. Hamilton New Zealand

MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS

Premium Copayments and the Trade-off between Wages and Employer-Provided Health Insurance. February 2011

The Effect of R&D Expenditures on Stock Returns, Price and Volatility

Standard errors of marginal effects in the heteroskedastic probit model

Sales forecasting # 1

Chapter 3: The Multiple Linear Regression Model

Simple Linear Regression Inference

2. Linear regression with multiple regressors

16 : Demand Forecasting

An Investigation of the Statistical Modelling Approaches for MelC

Poor identification and estimation problems in panel data models with random effects and autocorrelated errors

Minimum LM Unit Root Test with One Structural Break. Junsoo Lee Department of Economics University of Alabama

A General Approach to Variance Estimation under Imputation for Missing Survey Data

Multiple Choice Models II

Capturing Cross-Sectional Correlation with Time Series: with an Application to Unit Root Test

Estimating the marginal cost of rail infrastructure maintenance. using static and dynamic models; does more data make a


Chapter 1. Linear Panel Models and Heterogeneity

A Basic Introduction to Missing Data

problem arises when only a non-random sample is available differs from censored regression model in that x i is also unobserved

Univariate Regression

ANNUITY LAPSE RATE MODELING: TOBIT OR NOT TOBIT? 1. INTRODUCTION

From the help desk: Swamy s random-coefficients model

Longitudinal (Panel and Time Series Cross-Section) Data

Markups and Firm-Level Export Status: Appendix

TURUN YLIOPISTO UNIVERSITY OF TURKU TALOUSTIEDE DEPARTMENT OF ECONOMICS RESEARCH REPORTS. A nonlinear moving average test as a robust test for ARCH

ECON 523 Applied Econometrics I /Masters Level American University, Spring Description of the course

Least Squares Estimation

FULLY MODIFIED OLS FOR HETEROGENEOUS COINTEGRATED PANELS

SUPPLEMENT TO USAGE-BASED PRICING AND DEMAND FOR RESIDENTIAL BROADBAND : APPENDIX (Econometrica, Vol. 84, No. 2, March 2016, )

Note 2 to Computer class: Standard mis-specification tests

Example: Boats and Manatees

1. Review of the Basic Methodology

Online Appendix for Dynamic Inputs and Resource (Mis)Allocation

Online Appendix for Demand for Crash Insurance, Intermediary Constraints, and Risk Premia in Financial Markets

Internet Appendix for Money Creation and the Shadow Banking System [Not for publication]

Regression Analysis. Regression Analysis MIT 18.S096. Dr. Kempthorne. Fall 2013

Econometric analysis of the Belgian car market

Sales forecasting # 2

Reject Inference in Credit Scoring. Jie-Men Mok

LOGISTIC REGRESSION. Nitin R Patel. where the dependent variable, y, is binary (for convenience we often code these values as

Cost implications of no-fault automobile insurance. By: Joseph E. Johnson, George B. Flanigan, and Daniel T. Winkler

Please follow the directions once you locate the Stata software in your computer. Room 114 (Business Lab) has computers with Stata software

Illustration (and the use of HLM)

Average Redistributional Effects. IFAI/IZA Conference on Labor Market Policy Evaluation

3. Regression & Exponential Smoothing

1. When Can Missing Data be Ignored?

The marginal cost of rail infrastructure maintenance; does more data make a difference?

Hailong Qian. Department of Economics John Cook School of Business Saint Louis University 3674 Lindell Blvd, St. Louis, MO 63108, USA

Nominal and ordinal logistic regression

HURDLE AND SELECTION MODELS Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics July 2009

Panel Data Econometrics

Spatial Dependence in Commercial Real Estate

1 Teaching notes on GMM 1.

1 Short Introduction to Time Series

Trend and Seasonal Components

Introduction to Regression and Data Analysis

IAPRI Quantitative Analysis Capacity Building Series. Multiple regression analysis & interpreting results

Marginal Person. Average Person. (Average Return of College Goers) Return, Cost. (Average Return in the Population) (Marginal Return)

Financial Risk Management Exam Sample Questions/Answers

Panel Data Analysis Fixed and Random Effects using Stata (v. 4.2)

Chapter 6: Multivariate Cointegration Analysis

Investment and Firm-Specific Cost of Capital: Evidence from Firm-Level Panel Data

Transcription:

Chapter 10: Basic Linear Unobserved Effects Panel Data Models: Microeconomic Econometrics I Spring 2010 10.1 Motivation: The Omitted Variables Problem We are interested in the partial effects of the observable explanatory variables x j in the population regression function E(y x 1, x 2,..., x k, c) (10.1) An unobserved,time-constant variable is called an unobserved effect in panel data analysis. Such a data set is usually called a balanced panel because the same time periods are available for all cross section units. 1

10.2 Assumptions about the Unobserved Effects and Explanatory Variables 10.2.1 Random or Fixed Effects? The basic unobserved effects model (UEM) can be written, for a randomly drawn cross section observation i, as y it = x it β + c i + u it, t = 1,2,..., T (10.11) In addition to unobserved effect, there are many other names given to c i in applications: unobserved component, latent variable, and unobserved heterogeneity are common. If i indexes individuals, then c i is sometimes called an individual effect or individual heterogeneity; analogous terms apply to families, firms, cities, and other cross-sectional units. The u it are called the idiosyncratic errors or idiosyncratic disturbances because these change across t as well as across i. Nevertheless, later we will label two different estimation methods random effects estimation and fixed effects estimation. 2

10.2.2 Strict Exogeneity Assumptions on the Explanatory Variables With an unobserved effect the most revealing form of the strict exogeneity assumption is E(y it x i1, x i2,..., x it, c i ) = E(y it x it, c i ) = x it β + c i (10.12) When assumption (10.12) holds, we say that the x it : t = 1,2,..., T are strictly exogenous conditional on the unobserved effect c i. 10.2.3 Some Examples of Unobserved Effects Panel Data Models 3

Example10.1(Program Evaluation): log(wage it ) = θ t + z it γ + δ 1 prog it + c i + u it (10.16) Example 10.2 (Distributed Lag Model): patents it = θ t + z it γ + δ 0 RD it + δ 1 RD i,t 1 + + δ 5 RD i,t 5 + c i + u it (10.17) 4

Example 10.3 (Lagged Dependent Variable): log(wage it ) = β 1 log(wage i,t 1 ) + c i + u it, t = 1,2,..., T (10.18) 10.3 Estimating Unobserved Effects Models by Pooled OLS Under certain assumptions, the pooled OLS estimator can be used to obtain a consistent estimator of β in model (10.11). Write the model as y it = x it β + υ it, t = 1,2,..., T (10.21) where υ it c i + u it, t = 1,2,...T are the composite errors. 5

10.4 Random Effects Methods 10.4.1 Estimation and Inference under the Basic Random Effects Assumptions As with pooled OLS, a random effects analysis puts c i into the error term. Assumption RE.1: 1. E(u it x i, c i ) = 0, t = 1,..., T. 2. E(c i x i ) = E(c i ) = 0 where x i (x i1, x i2,..., x it ). Assumption RE.2: rank E(X i Ω 1 X i ) = K. When Ω has the form (10.31), we say it has the random effects structure. Assumption RE.3: 1. E(u i u i x i, c i ) = σ 2 u I T. 2. E(c 2 i x i) = σ 2 c In a panel data context, the FGLS estimator that uses the variance matrix (10.33) is what is known as the random effects estimator: ( N ˆβ RE = X ˆΩ ) 1 ( N 1 i X i X ˆΩ ) 1 i y i i=1 i=1 (10.34) Example 10.4(RE Estimation of the Effects of Job Training Grants): log(ŝcrap) =.415.093 d88.270 d89 +.548 union.215 grant.377 grant 1 (.243) (.109) (.132) (.411) (.148) (.205) 6

10.4.2 Robust Variance Matrix Estimator 10.4.3 A General FGLS Analysis If the idiosyncratic errors u it : t = 1,2,..., T are generally heteroskedastic and serially correlated across t, a more general estimator of Ω can be used in FGLS: ˆΩ = N 1 N ˆv i ˆv i (10.38) i=1 7

10.4.4 Testing for the Presence of an Unobserved Effect From equation (10.37), we base a test of H 0 : σ 2 c = 0 on the null asymptotic distribution of N 1/2 N T 1 T i=1 t=1 s=t+1 ˆυ it ˆυ is (10.39) which is essentially the estimator ˆσ 2 c scaled up by N. 10.5 Fixed Effects Methods 10.5.1 Consistency of the Fixed Effects Estimator Again consider the linear unobserved effects model for T time periods: y it = x it β + c i + u it, t = 1,..., T (10.41) 8

Assumption FE.1: E(u it x i, c i ) = 0, t = 1,2,..., T. In this section we study the fixed effects transformation, also called the within transformation. The fixed effects (FE) estimator, denoted by ˆβFE,is the pooled OLS estimator from the regression ÿ it on ẍ it, t = 1,2,..., T; i = 1,2,..., N (10.48) This set of equations can be obtained by premultiplying equation (10.42) by a timedemeaning matrix. Assumption FE.2: rank( Tt=1 E(ẍ itẍit) ) [ ] = rank E(Ẍ i Ẍ i) = K. It is also called the within estimator because it uses the time variation within each cross section. The between estimator, which uses only variation between the cross section observations, is the OLS estimator applied to the time-averaged equation (10.45). 9

10.5.2 Asymptotic Inference with Fixed Effects Assumption FE.3: E(u i u i x i, c i ) = σ 2 u I T. To see how to estimate σ 2 u, we use equation (10.51) summed across t : T t=1 E(ü 2 it ) = (T 1)σ 2 u, and so [N(T 1)] 1 N Tt=1 i=1 E(ü 2 it ) = σ2 u.now, define the fixed effects residuals as û it = ÿ it ẍ it ˆβFE, t = 1,2,..., T; i = 1,2,..., N (10.55) which are simply the OLS residuals from the pooled regression (10.48). Example 10.5(FE Estimation of the Effects of Job Training Grants): log(ŝcrap) =.080 d88.247 d89.252 grant.422 grant 1 (.109) (.133) (.151) (.210) 10

10.5.3 The Dummy Variable Regression The estimator of β obtained from regression (10.57) is, in fact, the fixed effects estimator. This is why ˆβFE is sometimes referred to as the dummy variable estimator. 10.5.4 Serial Correlation and the Robust Variance Matrix Estimator Applying equation (7.26), the robust variance matrix estimator of ˆβFE is Ava ˆr( ˆβFE ) = (Ẍ Ẍ) 1( N Ẍ iûiû i Ẍ i) (Ẍ Ẍ) 1 (10.59) which was suggested by Arellano (1987) and follows from the general results of White (1984, Chapter 6). i=1 11

Example 10.5 (continued): log(ŝcrap) =.080 d88.247 d89.252 grant.422 grant 1 (.109) (.133) (.151) (.210) [.096] [.193] [.140] [.276] 10.5.5 Fixed Effects GLS Assumption FEGLS.3: E(u i u i x i, c i ) = Λ, a T T positive definite matrix. The fixed effects GLS (FEGLS) estimator is defined by ˆβ FEGLS = ( N Ẍ ˆΩ 1 ) 1 ( N i Ẍ i Ẍ ˆΩ 1 ) i ÿ i i=1 i=1 where Ẍ i and ÿ i are defined with the last time period dropped. 12

Assumption FEGLS.2: ranke(ẍ i ˆΩ 1 Ẍ i ) = K. 10.5.6 Using Fixed Effects Estimation for Policy Analysis Consider the model y it = x it β + υ it = z it γ + δw it + υ it where υ it may or may not contain an unobserved effect. 13

10.6 First Differencing Methods 10.6.1 Inference Assumption FD.1: Same as Assumption FE.1. Lagging the model (10.41) one period and subtracting gives y it = x it β + u it, t = 2,3,..., T (10.63) where y it = y it y i,t 1, x it = x it x i,t 1, and u it = u it u i,t 1. As with the FE transformation, this first-differencing transformation eliminates the unobserved effect c i. The first-difference (FD) estimator, ˆβF D, is the pooled OLS estimator from the regression y it on x it, t = 2,..., T; i = 1,2,..., N (10.65) Tt=2 ) Assumption FD.2: rank( E( x it x it) = K. Assumption FD.3: E(e i e i x i1,..., x it, c i ) = σ 2 e I T 1, where e i is the (T 1) 1 vector containing e it,t = 2,..., T. 14

10.6.2 Robust Variance Matrix If Assumption FD.3 is violated, then, as usual, we can compute a robust variance matrix. The estimator in equation (7.26) applied in this context is Avar( ˆ ˆβFD ) = ( X X) 1( N ) X i êi ê i X i ( X X) 1 (10.70) where X denotes the N(T 1) K matrix of stacked first differences of x it. i=1 Example 10.6 (FD Estimation of the Effects of Job Training Grants): log(scrap it ) = δ 1 + δ 2 d89 t + β 1 grant it + β 2 grant i,t 1 + u it 15

10.6.3 Testing for Serial Correlation The regression is based on T 2 time periods: ê it = ˆp 1 ê i,t 1 + error it, t = 3,4,..., T; i = 1,2,..., N (10.71) Example10.6 (continued): 10.6.4 Policy Analysis Using First Differencing In this one case, prog i = prog i2,and the first-differenced equation can be written as y i2 = θ 2 + z i2 γ + δ 1 prog i2 + u i2 (10.72) 16

When z i2 is omitted, the estimate of δ 1 from equation (10.72) is the difference-indifferences (DID) estimator (see Problem 10.4): ˆ 1 = y t reat y c ortrol. 10.7 Comparison of Estimators 10.7.1 Fixed Effects versus First Differencing With more than two time periods, a test of strict exogeneity is a test of H 0 : γ = 0 in the expanded equation y t = x t β + w t γ + u t, t = 2,..., T where w t is a subset of x t (that would exclude time dummies). 17

10.7.2 The Relationship between the Random Effects and Fixed Effects Estimators Using the fact that j T j T = T, we can write Ω under the random effects structure as Ω = σ 2 u I T + σ 2 c j T j T = σ2 u I T + Tσ 2 c j T( j T j T) 1 j T = σ2 u I T + Tσ 2 c P T = (σ 2 u + σ2 c )(P T + ηq T ) where P T I T Q T = j T ( j T j T) 1 j T and η σ2 u /(σ2 u + Tσ2 c ). Equation (10.76) shows that the random effects estimator is obtained by a quasitime demeaning: rather than removing the time average from the explanatory and dependent variables at each t, random effects removes a fraction of the time average. Example 10.7 (Job Training Grants): 10.7.3 The Hausman Test Comparing the RE and FE Estimators 18