Three-stage least squares

Similar documents
Chapter 1. Vector autoregressions. 1.1 VARs and the identi cation problem

Chapter 4: Vector Autoregressive Models

Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model

What s New in Econometrics? Lecture 8 Cluster and Stratified Sampling

SYSTEMS OF REGRESSION EQUATIONS

Solución del Examen Tipo: 1

3.1 Least squares in matrix form

The Loss in Efficiency from Using Grouped Data to Estimate Coefficients of Group Level Variables. Kathleen M. Lang* Boston College.

2. Linear regression with multiple regressors

Chapter 3: The Multiple Linear Regression Model

Lecture 15. Endogeneity & Instrumental Variable Estimation

Chapter 2. Dynamic panel data models

IMPACT EVALUATION: INSTRUMENTAL VARIABLE METHOD

Similarity and Diagonalization. Similar Matrices

ECONOMETRIC METHODS IN MARKETING RESEARCH

Internet Appendix to CAPM for estimating cost of equity capital: Interpreting the empirical evidence

Econometrics Simple Linear Regression

Department of Economics

Lecture 2 Matrix Operations

Least Squares Estimation

CAPM, Arbitrage, and Linear Factor Models

Multiple regression - Matrices

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression

The Use of Event Studies in Finance and Economics. Fall Gerald P. Dwyer, Jr.

University of Ljubljana Doctoral Programme in Statistics Methodology of Statistical Research Written examination February 14 th, 2014.

Data Mining: Algorithms and Applications Matrix Math Review

Chapter 6: Multivariate Cointegration Analysis

Univariate Regression

Regression III: Advanced Methods

Correlated Random Effects Panel Data Models

ESTIMATING AN ECONOMIC MODEL OF CRIME USING PANEL DATA FROM NORTH CAROLINA BADI H. BALTAGI*

Causes of Inflation in the Iranian Economy

Chapter 10: Basic Linear Unobserved Effects Panel Data. Models:

Multivariate Analysis of Variance (MANOVA): I. Theory

FIXED EFFECTS AND RELATED ESTIMATORS FOR CORRELATED RANDOM COEFFICIENT AND TREATMENT EFFECT PANEL DATA MODELS

Implementing Panel-Corrected Standard Errors in R: The pcse Package

Econometric Methods for Panel Data

Chapter 7. Matrices. Definition. An m n matrix is an array of numbers set out in m rows and n columns. Examples. (

FULLY MODIFIED OLS FOR HETEROGENEOUS COINTEGRATED PANELS

Marketing Mix Modelling and Big Data P. M Cain

MULTIPLE REGRESSION WITH CATEGORICAL DATA

MATH10212 Linear Algebra. Systems of Linear Equations. Definition. An n-dimensional vector is a row or a column of n numbers (or letters): a 1.

Note 2 to Computer class: Standard mis-specification tests

ECON 142 SKETCH OF SOLUTIONS FOR APPLIED EXERCISE #2

Clustering in the Linear Model

Please follow the directions once you locate the Stata software in your computer. Room 114 (Business Lab) has computers with Stata software

a 11 x 1 + a 12 x a 1n x n = b 1 a 21 x 1 + a 22 x a 2n x n = b 2.

16 : Demand Forecasting

Panel Data Econometrics

PARTIAL LEAST SQUARES IS TO LISREL AS PRINCIPAL COMPONENTS ANALYSIS IS TO COMMON FACTOR ANALYSIS. Wynne W. Chin University of Calgary, CANADA

1 Teaching notes on GMM 1.

Impulse Response Functions

Simple Linear Regression Inference

Recall this chart that showed how most of our course would be organized:

Java Modules for Time Series Analysis

Solution to Homework 2

Trend and Seasonal Components

MATRIX ALGEBRA AND SYSTEMS OF EQUATIONS. + + x 2. x n. a 11 a 12 a 1n b 1 a 21 a 22 a 2n b 2 a 31 a 32 a 3n b 3. a m1 a m2 a mn b m

problem arises when only a non-random sample is available differs from censored regression model in that x i is also unobserved

Linear Algebra Notes

Introduction to Matrix Algebra

Multiple Linear Regression in Data Mining

Mehtap Ergüven Abstract of Ph.D. Dissertation for the degree of PhD of Engineering in Informatics

A LONGITUDINAL AND SURVIVAL MODEL WITH HEALTH CARE USAGE FOR INSURED ELDERLY. Workshop

Notes on Determinant

Introduction to Path Analysis

F Matrix Calculus F 1

5. Linear Regression

y t by left multiplication with 1 (L) as y t = 1 (L) t =ª(L) t 2.5 Variance decomposition and innovation accounting Consider the VAR(p) model where

The VAR models discussed so fare are appropriate for modeling I(0) data, like asset returns or growth rates of macroeconomic time series.

STATISTICS AND DATA ANALYSIS IN GEOLOGY, 3rd ed. Clarificationof zonationprocedure described onpp

Performing Unit Root Tests in EViews. Unit Root Testing

Testing for Granger causality between stock prices and economic growth

An Introduction into the SVAR Methodology: Identification, Interpretation and Limitations of SVAR models

UNIVERSITY OF WAIKATO. Hamilton New Zealand

Chapter 5: Bivariate Cointegration Analysis

Notes on Applied Linear Regression

E 4101/5101 Lecture 8: Exogeneity

Introduction to General and Generalized Linear Models

Poisson Models for Count Data

Linear Models for Continuous Data

The Wondrous World of fmri statistics

Introducing the Multilevel Model for Change

Panel Data: Linear Models

From the help desk: Swamy s random-coefficients model

Multivariate Normal Distribution

ON THE ROBUSTNESS OF FIXED EFFECTS AND RELATED ESTIMATORS IN CORRELATED RANDOM COEFFICIENT PANEL DATA MODELS

How To Find Out If A Firm Is Profitable

IDENTIFICATION IN A CLASS OF NONPARAMETRIC SIMULTANEOUS EQUATIONS MODELS. Steven T. Berry and Philip A. Haile. March 2011 Revised April 2011

From the help desk: Bootstrapped standard errors

An extension of the factoring likelihood approach for non-monotone missing data

Comparing Features of Convenient Estimators for Binary Choice Models With Endogenous Regressors

The Big Picture. Correlation. Scatter Plots. Data

Predict the Popularity of YouTube Videos Using Early View Data

Spatial panel models

Transcription:

Three-stage least squares Econometric Methods Lecture 7, PhD Warsaw School of Economics

Outline 1 Introduction 2 3

Outline 1 Introduction 2 3

3-Stage Least Squares: basic idea 1-stage (OLS): inconsistent for systems with simultaneous equation models 2-stage (2SLS): let's clean endogenous regressors step 1: regress endogenous regressors against all predetermined variables of the system, take theoretical value step 2: do OLS with these theoretical (instead of empirical) values in case of endogenous regressors but still: a limited-information method! focus on single equation simultaneous correlations between various equations' error terms: ignored! result: ineciency

3-Stage Least Squares: basic idea 1-stage (OLS): inconsistent for systems with simultaneous equation models 2-stage (2SLS): let's clean endogenous regressors step 1: regress endogenous regressors against all predetermined variables of the system, take theoretical value step 2: do OLS with these theoretical (instead of empirical) values in case of endogenous regressors but still: a limited-information method! focus on single equation simultaneous correlations between various equations' error terms: ignored! result: ineciency

3-Stage Least Squares: basic idea 1-stage (OLS): inconsistent for systems with simultaneous equation models 2-stage (2SLS): let's clean endogenous regressors step 1: regress endogenous regressors against all predetermined variables of the system, take theoretical value step 2: do OLS with these theoretical (instead of empirical) values in case of endogenous regressors but still: a limited-information method! focus on single equation simultaneous correlations between various equations' error terms: ignored! result: ineciency

solution do 2SLS (step 1 and 2) add step 3 to account for these correlations the dierence between 2SLS and 3SLS analogous to dierence between OLS and GLS in the single-equation context whereby the latter takes into account inter-temporal (and not simultaneous) correlations between error terms

solution do 2SLS (step 1 and 2) add step 3 to account for these correlations the dierence between 2SLS and 3SLS analogous to dierence between OLS and GLS in the single-equation context whereby the latter takes into account inter-temporal (and not simultaneous) correlations between error terms

Notation (1) Consider the entire m-equation model: y 1 = Z1 F1 + ε 1 y 2 = Z2 F2 + ε 2 y m = Zm Fm + ε m where Zj = [ ] Ỹ j Xj vector of explanatory variables (endoand exogeneous respectively) in j-th equation Let ^F 2SLS j vector of j-th equation's parameter estimates via 2SLS

Notation (2) Denote the model as matrix equation: y 1 Z 1 0 0 F 1 ε 1 y 2 0 Z 2 0 F 2 ε 2 = + y m 0 0 Z m F m ε m }{{}}{{}}{{}}{{} y Z F ε where ε 1, ε 2, are vertical vectors sized T 1

Covariances of error terms Knowing 2SLS parameter estimated (and hence 2SLS residuals), we can estimate the variance-covariance matrix of the random disturbances Var (ε t ) = E ( ) ε t ε T t element-by-element in a standard way: ˆΣ = [ˆσ ij ] ( ) ( ) T ˆσ ij = /T y i Z i^f2mnk i y j Z j^f2mnk j where T number of observations Alternatively, instead of T, one can divide the product sum in the nominator by the geometrically averaged degrees of freedom for equations i and j: ( T M i K i ) (T M j K j ) M i ( M j ) number of endogenous regressors in i-th (j-th) equation K i ( K j ) number of exogeneous regressors in i-th (j-th) equation

Kronecker product [ ] [ ] a11 a 12 b11 b 12 = a 21 a 22 b 21 b [ ] 22 [ b11 b a 11 12 b11 b a b 21 b 12 12 [ 22 ] [ b 21 b 22 b11 b a 21 12 b11 b a b 21 b 22 12 22 b 21 b 22 It can be demonstrated easily that: (A B) 1 = A 1 B 1 (A B) (C D) = AC BD ] ]

Outline 1 Introduction 2 3

3SLS step 1 Step 1 Reduced-form estimation and theoretical values for j-th equation: ( ^~ Z j = XΠ j = X X T 1 X) X T ~ Zj In the matrix notation, for the entire system: ^Z = ( X X T 1 X) X T Z1 0 0 ( 0 X X T 1 X) X T Z2 0 ( 0 0 X X T 1 X) X T Zm { I m [ X ( X T X) 1 X T ]} Z =

3SLS step 2 Step 2 Estimation of parameters for indivual equations in the structural form (2SLS): empirical endogenous explanatory variables replaced with theoretical values from step 1: ^F2SLS = ( ^Z T ^Z) 1 ^ZT y

3SLS step 3 (1) Step 3 Acknowledging the simultaneous correlations of error terms in the model If j-th equation's error term is spherical, its variance-covariance σ 2 jj 0 0 0 σjj 2 0 matrix is 0 0 σjj 2 What's the variance-covariance matrix of the entire vector ε?

3SLS step 3 (2) σ 2 11 σ 2 12 σ 2 1m σ 2 11 σ 2 12 σ 2 1m σ 2 11 σ 2 12 σ 2 1m σ 2 12 σ 2 22 σ 2 2m σ 2 12 σ 2 22 σ 2 2m σ 2 12 σ 2 22 σ 2 2m σ 2 1m σ 2 2m σ 2 mm σ 2 1m σ 2 2m σ 2 mm σ 2 1m σ 2 2m σ 2 mm

Kronecker product in 3SLS σ 2 11 σ 2 12 σ 2 1m σ 2 11 σ 2 12 σ 2 1m σ 2 11 σ 2 12 σ 2 1m σ 2 12 σ 2 22 σ 2 2m σ 2 12 σ 2 22 σ 2 2m σ 2 12 σ 2 22 σ 2 2m σ 2 1m σ 2 2m σ 2 mm σ 2 1m σ 2 2m σ 2 mm = = σ 2 11 σ 2 12 σ 2 1m σ 2 12 σ 2 22 σ 2 2m σ1m 2 σ2m 2 σmm 2 σ1m 2 σ2m 2 σmm 2 1 0 0 0 1 0 0 0 1 T T = Σ I T

Final formula for 3SLS estimator (1) Knowing the entrie variance-covariance matrix ˆΣ, we can apply GLS to the model y = ZF + ε, ε (0, Ω) ^FGLS = ( Z T Ω 1 1 Z) Z T Ω 1 y This is done jointly with using 2SLS theoretical values for endogenous regressors (obtained in step 1): ( 1^Z) 1 ^Z T Ω ^ZT Ω 1 y ^F3SLS =

Final formula for 3SLS estimator (2) We already know that ˆΩ = ˆΣ IT and ^Z = ^F3SLS consequence: = { [ [Z T Im [ Z {Im T X X ( ) X T 1 ]} T 1 { T X X (ˆΣ IT) ( ) X T 1 ]} T ) 1 T X X (ˆΣ IT y { [ ( ) Im X X T 1 ]} T X X Z In [ Im From the properties of Kronecker product: ^F3SLS [ ( ) = [Z {ˆΣ T 1 X X T 1 ]} 1 [ T X X Z] Z {ˆΣ T 1 X ( ) X T 1 ]} 1 T X X Z] X ( ) X T 1 ]} T X X y

Outline 1 Introduction 2 3

Matrix of predetermined variables: 1 0 6 1 X = [ ] 1 1 5 2 x 0 x 1 x 2 x 3 = 1 1 3 2 1 2 3 1 1 2 2 0 Matrix of endogenous variables: 1 3 4 Y = [ ] 2 3 6 y 1 y 2 y 3 = 1 2 7 2 1 5 3 1 3 Estimate with 3SLS the structural parameters of the following model: y 1t = α 0 + α 1x 1t + α 2y 2t + ε 1t y 2t = β 0 + β 1x 2t + β 2y 3t + ε 2t y 3t = γ 0 + γ 1x 3t + γ 2y 1t + ε 3t