Econometrics of Panel Data

Size: px
Start display at page:

Download "Econometrics of Panel Data"

Transcription

1 Econometrics of Panel Data 1. Basics and Examples 2. The generalized least squares estimator 3. Fixed effects model 4. Random Effects model 1

2 1 Basics and examples We observes variables for N units, called the cross-sections, for T consecutive periods: (Y it, X it ) i = 1,...,N, with N the cross-sectional dimension. t = 1,...,T, with T the temporal dimension. panel of size N T. 2

3 Y it is the income of family i during year t, for 1 i 1000, and observed in years 2000, 2001, 2002, so T = 3. Y it is the unemployment rate for EU-country i, (1 i 15), observed monthly from 1998:01 up to 2001:12, so T = 48. Note that: T large, N small multiple time series T small, N large survey data on individuals/firms for a small number of waves. 3

4 Example 1: South American countries For 8 South-American countries we want to model the Real GDP per capita in 1985 prices (=Rgdl) in function of the following explicative variables. Population in 1000 s (Pop) Real Investment share of GDP, in % (I) Real Government share of GDP, in % (G) Exchange Rate with U.S. dollar (XR) Measure of Openness of the Economy (Open) You find the data in the file penn.wmf, already in Eviews format. We are in particular interested in the effect of Openness on economic growth. 4

5 1. Create a pool object in Eviews ( /Object/New object ). Give it a name and define the cross-section identifiers. These identifiers are those parts of the names of the series identifying the cross-section. 2. Open the XR-variables as a group and make a plot of them. Compute them in log-difference, using the PoolGenr menu of the pool object and logdifxr?=dlog(xr?). The? will be substituted by every cross-section identifier. Plot the transformed variables. 3. Compute the medians of the variable I? for the different countries (use View/descriptive statistics within the Pool object.) Compute now the average value of I? for every year. 4. Estimate the regression model for Brazil, using /Quick/estimate equation and specifying in Eviews the equation dlog(rgdp bra) c dlog(pop bra) i bra g bra dlog(xr bra) open bra 5. Now we want to pool the data of all countries, to increase the sample size. Use, within the pooled object, /Estimate, and specify: dependent variable=dlog(rgdp?); common coefficients=c dlog(pop?) i? g? dlog(xr?) open?. This is a pooled regression model. 5

6 6. Pooling the data ignores the fact that the data originate from different countries. Dummy variables for the different countries need to be added. This can be done by specifying the constant term as a cross section specific coefficient. We obtain a fixed effect panel data model. Discuss the regression output. 7. The fixed effect panel data model assumes that the effect of openness is the same of all countries. How could you relax this assumption? 8. Test whether all country effects are equal (to know how Eviews labels the coefficients, use View/Representation), using a Wald test. The country effects are called the fixed effects, and if there are significantly different then there is unobserved heterogeneity. 6

7 2 The Generalized Least Squares estimator Standard linear regression model: Y i = X iβ + ε i (i = 1,...,n) with Var(ε i ) = σ 2 is constant homoscedastic errors Cov(ε i, ε j ) = 0 for i j uncorrelated errors 7

8 At the standard model, the Ordinary Least Squares (OLS) estimator is Consistent, meaning that ˆβ β for n tending to infinity. Has the smallest variance among all estimators (for normal errors) and smallest variance among all linear estimators. One has that ˆβ OLS = ( n ) 1 ( n ) X i X i X i Y i. i=1 i=1 8

9 What if the the errors are not homoscedastic and uncorrelated? E.g. for panel data: Cross-sectional heteroscedasticity Correlation among cross sections Serial correlation within and across cross-sections... The Ordinary Least Squares (OLS) estimator is still consistent, but not optimal anymore. 9

10 General linear regression model: Y i = X iβ + ε i (i = 1,...,n) with Var(ε i ) = σ 2 i heteroscedastic errors Cov(ε i, ε j ) = σ ij for i j correlated errors. One can still use OLS (not even a bad idea), if one uses White standard errors (if heteroscedasticity) Newey-West standard errors (if correlated errors + heteroscedasticity) 10

11 The Generalized Least Squares (GLS) estimator will be consistent and optimal and is given by ˆβ GLS = n n w ij X i X j 1 n n w ij X i Y j, i=1 j=1 i=1 j=1 where the weights depends on the values of σ ij. More precisely: let Σ be the n n matrix with elements σ ij, then w ij = (Σ 1 ) ij. Unfortunately, the values in Σ are unknown. 11

12 The Feasible Generalized Least Squares (GLS) proceeds in 2 steps: 1. Compute ˆβ OLS and the residuals r OLS i = Y i X i ˆβ OLS. 2. Use the above residuals to estimate the σ ij. [This will require some additional assumptions on the structure of Σ] Compute then the GLS estimator with estimated weights w ij. The above scheme can be iterated fully iterated GLS estimator. 12

13 Theoretical Example Our sample of size n = 20 consists of two groups of equal size (e.g. men and women). There is no correlation among the observations, but we think that the variances of the error terms for men and women might be of different size. [The error terms contains the omitted and unobserved variables. We might indeed think that their size is different for women than for men, e.g. when regressing salary on individual characteristics] σ 2 i = σ ii = σ 2 M for i = 1,...,10 σ 2 i = σ ii = σ 2 F for i = 11,...,20 σ ij = 0 for i j. 13

14 Computation of the (Feasible) GLS estimator: 1. Compute the OLS estimator and the residuals r OLS i. 2. Estimate ˆσ 2 M = i=1 (r OLS i ) 2 and ˆσ 2 F = (r OLS i=11 i ) 2. Due to the simple structure of the matrix Σ, we have ŵ i = 1 ˆσ 2 M (i = 1,...,10) and ŵ i = 1 ˆσ 2 F (i = 11,...,20) ˆβ GLS = ( n ) 1 ( n ) w i X i X i w i X i Y i. i=1 i=1 14

15 Application to panel data regression Let ε it be the error term of a panel data regression model, with 1 i n, and 1 t T. Three different specifications are common: 1. V ar(ε it ) = σ 2 and all covariances between error terms are zero. OLS can be applied (no weighting). 2. V ar(ε it ) = σi 2 and all covariances between error terms are zero. We have cross-sectional heteroscedasticity. GLS can be applied (cross-section weights): 3. V ar(ε it ) = σi 2, Cov(ε it, ε jt ) = σ ij, all other covariances zero. We allow now for contemporaneous correlation between cross-sections. GLS can be applied (SUR weights). 15

16 Example South American (continued) 1. Have a look at the residuals (View/residuals/Graphs) within the pool object). Compute the covariance and the correlation matrix of the residuals (i) Is there cross-sectional heteroscedasticity? (ii) Is there contemporaneous correlation? 2. Estimate now the model with the appropriate GLS estimator. Are the results depending a lot on the weighting scheme? 3. Is there still serial correlation present in the residuals, i.e. (cross)-correlation at leads and lags? Hence, is the model capturing the dynamics in the data? 16

17 3 The Fixed Effects regression model Fixed effects Model: Y it = X itβ + α i + ε it with t = 1,...T time periods and i = 1...,N cross-sectional units. The α i contain the omitted variables, constant over time, for every unit i. The α i are called the fixed effects, and induce unobserved heterogeneity in the model. The X it are the observed part of the heterogeneity. The ε it contain the remaining omitted variables. 17

18 Testing for unobserved heterogeneity: H 0 : α 1 =... = α N := α (Test for redundant fixed effects) In case H 0 holds, there is no unobserved heterogeneity, and the model reduces to the pooled regression model: Y it = X itβ + α + ε it Ignoring unobserved heterogeneity may lead to severe bias of the estimated β, see figure: 18

19 15 Cross Section 1 Cross Section 2 10 Pooled Regression y 5 Cross Section x 19

20 LSDV estimation LSDV=Least Squares Dummy Variable estimation Rewrite the model as Y it = α 1 D 1 i α n D n i + X itβ + ε it, with D j i = 1 if i = j and zero if i j. Estimate model by OLS or GLS (weighting). If necessary, use White/Newey West type of Standard Errors (also if GLS is used, see later). 20

21 Within groups estimator Compute averages of X it and Y it within each group of cross-sectional unit X i. and Ȳi. Y it = X it β + α i + ε it Ȳ i. = X i. β + α i + ε i. (Y it Ȳi.) = (X it X i. ) β + (ε it ε i. ) Regress the centered Y it on the centered X it by OLS. By centering, the fixed effects are eliminated! One can show that the within group estimator is identical to LSDV. 21

22 Comments 1. If a variable X it is constant in time for all cross-sections, the FE model cannot be estimated. Why? 2. The fixed effects model can be rewritten with a common intercept included as Y it = X itβ + α + µ i + ε it, and µ 1 + µ µ N = 0. Obviously, we have α i = α + µ i, and α is the average of the fixed effects. 22

23 3. One can add time effects (or period effects) in the model: Y it = X itβ + α i + δ t + ε it, The δ t contain the omitted variables, constant over cross-sections, at every time point t. The time effects capture the business cycle. 23

24 4. If we think that the cross-sectional units are an i.i.d. sample (typical for micro-applications), but serial correlation or period heteroscedasticity is present (within each unit), then OLS can be made more precise/efficient: (a) V ar(ε it ) = σt 2 and all covariances between error terms are zero. We have period heteroscedasticity. GLS can be applied (Period weights): (b) V ar(ε it ) = σt 2, Cov(ε it, ε is ) = σ ts, all other covariances zero. We allow for serial correlation. GLS can be applied (Period weights). 24

25 Example: Grunfeld data We consider investment data for 10 American firms from , and consider the model INV it = β i1 V AL it + β i2 CAP it + α i + ε it for 1 i N = 10, and 1 t T = 20. The variables are Gross investment for the firm (INV) Value of the firm (VAL) Real Value of the Capital stock (plant and equipment) (CAP) The data are in the excel file grunfeld2.xls. 25

26 1. Have a look at the data in the Excel File. Write up the number of observations, the number of variables, and the upper left cell of the data matrix. Close the Excel file, create an unstructured Workfile and read in the data (Proc/Import/Read Text Lotus Excel). 2. To apply a panel structure, double click on the Range: line at the top of the workfile window, or select Proc/Structure/Resize Current Page. Select Dated Panel, and enter the appropriate variables as Date Series and as Cross Section ID series. 3. Open the investment series. Explore the Descriptive Statistics and tests menu. 26

27 4. Use View/Graph to (i) Make a line plot of the time series for every cross section (ii) Make boxplots of the distribution of investment over the different cross sections and over time. 5. Use Quick/Estimate Equation to estimate the fixed effects model. Specify the equation inv c cap value and use Panel Options to indicate that you use fixed effects. 6. Interpret your outcome. Would it be useful to add period effects? Test whether they this is necessary with View/Fixed Random Effects testing. 7. Select an appropriate weighting scheme within Panel Options. Interpret your outcome. 27

28 4 Random Effects model Model Y it = c + X itβ + ε it where the error term is decomposed as ε it = α i + v it. α i is a random effect N(0, σα). 2 It is the permanent component of the error term. v it a noise term N(0, σv). 2 It is the idiosyncratic component of the error term. 28

29 (The v it are uncorrelated among cross-sections, are serially uncorrelated at all leads and lags, within and across cross sections. The random effects are uncorrelated among cross-sections.) At the price of one extra parameter σα 2, the random effects model allows for correlation within cross-section units: For every i and t s: Cov(ε it, ε is ) = Cov(α i + v it, α i + v is ) = σ 2 α The following Variance decomposition holds: Var(ε it ) = Var(α i + v it ) = σ 2 α + σ 2 v. 29

30 Within groups/cross sections correlation: ρ = Corr(ε it, ε is ) = σ2 α σ 2 α + σ 2 v. The larger the value of ρ, the more unobserved heterogeneity. One estimates β by Generalized Least Squares, and obtains the RE-estimator. Different methods are existing to make GLS feasible. 30

31 Testing for correlated random effects: The random effect α i needs to be uncorrelated with the X-variables. This is a strong assumption. If not, there is an endogeneity problem, and the RE-estimator is inconsistent. H 0 : Corr(α i, X it ) = 0 The Hausman test compares two estimators: the FE (always consistent) and the RE estimator (consistent under H 0 ). One rejects H 0 if the difference between the two estimators is large. 31

32 Using Fixed or random effects? In econometrics, the fixed effects model seems to be the most appropriate (H O not needed). If N is large, and T is small, and the cross-sectional units are a random sample from a population, then random effects model becomes attractive: It is a parsimonious model, that captures within group-correlation. (For N large, FE requires estimation of many parameters) Random effects is popular for modeling grouped data: (i) Sample of 1000 children coming from 30 different schools (ii) Sample of 1000 persons from 20 different villages... 32

33 Robust Standard Errors: For RE no weighted versions are available. Using robust standard errors (or coefficient covariance) might be appropriate. This only affects the SE, not the estimators. 1. White cross section: robust to V ar(ε it ) = σi 2 and Cov(ε it, ε jt ) = σ ij. [robust to cross-section heteroscedasticity and contemporenous correlation among cross sections; appropriate if N << T.] 2. White period: robust to V ar(ε it ) = σ 2 t and Cov(ε it, ε is ) = σ ts. [robust to serial correlation within cross-section and changing variances over time; appropriate if cross-sections are random sample and T << N.] 3. White diagonal: robust to V ar(ε it ) = σ 2 it [robust to all forms of heteroscedasticity, but not robust for any type of correlation over time of across cross-section.] Can also be used for FE. 33

34 Exercise Consider the grunfeld data in grundfeld2.wf1. The model was: INV it = β i1 V AL it + β i2 CAP it + α i + ε it 1. Estimate the model as a random effects model. 2. What is the within-group correlation? 3. Perform the Hausman test. (View/Fixed random effects testing/correlated random effects) 4. Compute different types of robust SE. How is this affecting the results? 34

What s New in Econometrics? Lecture 8 Cluster and Stratified Sampling

What s New in Econometrics? Lecture 8 Cluster and Stratified Sampling What s New in Econometrics? Lecture 8 Cluster and Stratified Sampling Jeff Wooldridge NBER Summer Institute, 2007 1. The Linear Model with Cluster Effects 2. Estimation with a Small Number of Groups and

More information

SYSTEMS OF REGRESSION EQUATIONS

SYSTEMS OF REGRESSION EQUATIONS SYSTEMS OF REGRESSION EQUATIONS 1. MULTIPLE EQUATIONS y nt = x nt n + u nt, n = 1,...,N, t = 1,...,T, x nt is 1 k, and n is k 1. This is a version of the standard regression model where the observations

More information

Chapter 10: Basic Linear Unobserved Effects Panel Data. Models:

Chapter 10: Basic Linear Unobserved Effects Panel Data. Models: Chapter 10: Basic Linear Unobserved Effects Panel Data Models: Microeconomic Econometrics I Spring 2010 10.1 Motivation: The Omitted Variables Problem We are interested in the partial effects of the observable

More information

FIXED EFFECTS AND RELATED ESTIMATORS FOR CORRELATED RANDOM COEFFICIENT AND TREATMENT EFFECT PANEL DATA MODELS

FIXED EFFECTS AND RELATED ESTIMATORS FOR CORRELATED RANDOM COEFFICIENT AND TREATMENT EFFECT PANEL DATA MODELS FIXED EFFECTS AND RELATED ESTIMATORS FOR CORRELATED RANDOM COEFFICIENT AND TREATMENT EFFECT PANEL DATA MODELS Jeffrey M. Wooldridge Department of Economics Michigan State University East Lansing, MI 48824-1038

More information

Panel Data: Linear Models

Panel Data: Linear Models Panel Data: Linear Models Laura Magazzini University of Verona laura.magazzini@univr.it http://dse.univr.it/magazzini Laura Magazzini (@univr.it) Panel Data: Linear Models 1 / 45 Introduction Outline What

More information

IMPACT EVALUATION: INSTRUMENTAL VARIABLE METHOD

IMPACT EVALUATION: INSTRUMENTAL VARIABLE METHOD REPUBLIC OF SOUTH AFRICA GOVERNMENT-WIDE MONITORING & IMPACT EVALUATION SEMINAR IMPACT EVALUATION: INSTRUMENTAL VARIABLE METHOD SHAHID KHANDKER World Bank June 2006 ORGANIZED BY THE WORLD BANK AFRICA IMPACT

More information

Correlated Random Effects Panel Data Models

Correlated Random Effects Panel Data Models INTRODUCTION AND LINEAR MODELS Correlated Random Effects Panel Data Models IZA Summer School in Labor Economics May 13-19, 2013 Jeffrey M. Wooldridge Michigan State University 1. Introduction 2. The Linear

More information

Marketing Mix Modelling and Big Data P. M Cain

Marketing Mix Modelling and Big Data P. M Cain 1) Introduction Marketing Mix Modelling and Big Data P. M Cain Big data is generally defined in terms of the volume and variety of structured and unstructured information. Whereas structured data is stored

More information

Chapter 9: Serial Correlation

Chapter 9: Serial Correlation Chapter 9: Serial Correlation In this chapter: 1. Creating a residual series from a regression model 2. Plotting the error term to detect serial correlation (UE, pp. 313-315) 3. Using regression to estimate

More information

Note 2 to Computer class: Standard mis-specification tests

Note 2 to Computer class: Standard mis-specification tests Note 2 to Computer class: Standard mis-specification tests Ragnar Nymoen September 2, 2013 1 Why mis-specification testing of econometric models? As econometricians we must relate to the fact that the

More information

Chapter 4: Vector Autoregressive Models

Chapter 4: Vector Autoregressive Models Chapter 4: Vector Autoregressive Models 1 Contents: Lehrstuhl für Department Empirische of Wirtschaftsforschung Empirical Research and und Econometrics Ökonometrie IV.1 Vector Autoregressive Models (VAR)...

More information

Clustering in the Linear Model

Clustering in the Linear Model Short Guides to Microeconometrics Fall 2014 Kurt Schmidheiny Universität Basel Clustering in the Linear Model 2 1 Introduction Clustering in the Linear Model This handout extends the handout on The Multiple

More information

Wooldridge, Introductory Econometrics, 3d ed. Chapter 12: Serial correlation and heteroskedasticity in time series regressions

Wooldridge, Introductory Econometrics, 3d ed. Chapter 12: Serial correlation and heteroskedasticity in time series regressions Wooldridge, Introductory Econometrics, 3d ed. Chapter 12: Serial correlation and heteroskedasticity in time series regressions What will happen if we violate the assumption that the errors are not serially

More information

Introduction to Regression and Data Analysis

Introduction to Regression and Data Analysis Statlab Workshop Introduction to Regression and Data Analysis with Dan Campbell and Sherlock Campbell October 28, 2008 I. The basics A. Types of variables Your variables may take several forms, and it

More information

Econometric analysis of the Belgian car market

Econometric analysis of the Belgian car market Econometric analysis of the Belgian car market By: Prof. dr. D. Czarnitzki/ Ms. Céline Arts Tim Verheyden Introduction In contrast to typical examples from microeconomics textbooks on homogeneous goods

More information

Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model

Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model 1 September 004 A. Introduction and assumptions The classical normal linear regression model can be written

More information

Multiple Linear Regression in Data Mining

Multiple Linear Regression in Data Mining Multiple Linear Regression in Data Mining Contents 2.1. A Review of Multiple Linear Regression 2.2. Illustration of the Regression Process 2.3. Subset Selection in Linear Regression 1 2 Chap. 2 Multiple

More information

ON THE ROBUSTNESS OF FIXED EFFECTS AND RELATED ESTIMATORS IN CORRELATED RANDOM COEFFICIENT PANEL DATA MODELS

ON THE ROBUSTNESS OF FIXED EFFECTS AND RELATED ESTIMATORS IN CORRELATED RANDOM COEFFICIENT PANEL DATA MODELS ON THE ROBUSTNESS OF FIXED EFFECTS AND RELATED ESTIMATORS IN CORRELATED RANDOM COEFFICIENT PANEL DATA MODELS Jeffrey M. Wooldridge THE INSTITUTE FOR FISCAL STUDIES DEPARTMENT OF ECONOMICS, UCL cemmap working

More information

Chapter 2. Dynamic panel data models

Chapter 2. Dynamic panel data models Chapter 2. Dynamic panel data models Master of Science in Economics - University of Geneva Christophe Hurlin, Université d Orléans Université d Orléans April 2010 Introduction De nition We now consider

More information

Stepwise Regression. Chapter 311. Introduction. Variable Selection Procedures. Forward (Step-Up) Selection

Stepwise Regression. Chapter 311. Introduction. Variable Selection Procedures. Forward (Step-Up) Selection Chapter 311 Introduction Often, theory and experience give only general direction as to which of a pool of candidate variables (including transformed variables) should be included in the regression model.

More information

Lecture 15. Endogeneity & Instrumental Variable Estimation

Lecture 15. Endogeneity & Instrumental Variable Estimation Lecture 15. Endogeneity & Instrumental Variable Estimation Saw that measurement error (on right hand side) means that OLS will be biased (biased toward zero) Potential solution to endogeneity instrumental

More information

Panel Data Analysis in Stata

Panel Data Analysis in Stata Panel Data Analysis in Stata Anton Parlow Lab session Econ710 UWM Econ Department??/??/2010 or in a S-Bahn in Berlin, you never know.. Our plan Introduction to Panel data Fixed vs. Random effects Testing

More information

EFFECT OF INVENTORY MANAGEMENT EFFICIENCY ON PROFITABILITY: CURRENT EVIDENCE FROM THE U.S. MANUFACTURING INDUSTRY

EFFECT OF INVENTORY MANAGEMENT EFFICIENCY ON PROFITABILITY: CURRENT EVIDENCE FROM THE U.S. MANUFACTURING INDUSTRY EFFECT OF INVENTORY MANAGEMENT EFFICIENCY ON PROFITABILITY: CURRENT EVIDENCE FROM THE U.S. MANUFACTURING INDUSTRY Seungjae Shin, Mississippi State University Kevin L. Ennis, Mississippi State University

More information

COURSES: 1. Short Course in Econometrics for the Practitioner (P000500) 2. Short Course in Econometric Analysis of Cointegration (P000537)

COURSES: 1. Short Course in Econometrics for the Practitioner (P000500) 2. Short Course in Econometric Analysis of Cointegration (P000537) Get the latest knowledge from leading global experts. Financial Science Economics Economics Short Courses Presented by the Department of Economics, University of Pretoria WITH 2015 DATES www.ce.up.ac.za

More information

MISSING DATA TECHNIQUES WITH SAS. IDRE Statistical Consulting Group

MISSING DATA TECHNIQUES WITH SAS. IDRE Statistical Consulting Group MISSING DATA TECHNIQUES WITH SAS IDRE Statistical Consulting Group ROAD MAP FOR TODAY To discuss: 1. Commonly used techniques for handling missing data, focusing on multiple imputation 2. Issues that could

More information

Models for Longitudinal and Clustered Data

Models for Longitudinal and Clustered Data Models for Longitudinal and Clustered Data Germán Rodríguez December 9, 2008, revised December 6, 2012 1 Introduction The most important assumption we have made in this course is that the observations

More information

2. Linear regression with multiple regressors

2. Linear regression with multiple regressors 2. Linear regression with multiple regressors Aim of this section: Introduction of the multiple regression model OLS estimation in multiple regression Measures-of-fit in multiple regression Assumptions

More information

Redistributional impact of the National Health Insurance System

Redistributional impact of the National Health Insurance System Redistributional impact of the National Health Insurance System in France: A microsimulation approach Valrie Albouy (INSEE) Laurent Davezies (INSEE-CREST-IRDES) Thierry Debrand (IRDES) Brussels, 4-5 March

More information

Econometric Methods for Panel Data

Econometric Methods for Panel Data Based on the books by Baltagi: Econometric Analysis of Panel Data and by Hsiao: Analysis of Panel Data Robert M. Kunst robert.kunst@univie.ac.at University of Vienna and Institute for Advanced Studies

More information

Vector Time Series Model Representations and Analysis with XploRe

Vector Time Series Model Representations and Analysis with XploRe 0-1 Vector Time Series Model Representations and Analysis with plore Julius Mungo CASE - Center for Applied Statistics and Economics Humboldt-Universität zu Berlin mungo@wiwi.hu-berlin.de plore MulTi Motivation

More information

7 Time series analysis

7 Time series analysis 7 Time series analysis In Chapters 16, 17, 33 36 in Zuur, Ieno and Smith (2007), various time series techniques are discussed. Applying these methods in Brodgar is straightforward, and most choices are

More information

Chapter 1. Vector autoregressions. 1.1 VARs and the identi cation problem

Chapter 1. Vector autoregressions. 1.1 VARs and the identi cation problem Chapter Vector autoregressions We begin by taking a look at the data of macroeconomics. A way to summarize the dynamics of macroeconomic data is to make use of vector autoregressions. VAR models have become

More information

NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( )

NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( ) Chapter 340 Principal Components Regression Introduction is a technique for analyzing multiple regression data that suffer from multicollinearity. When multicollinearity occurs, least squares estimates

More information

Regression Analysis (Spring, 2000)

Regression Analysis (Spring, 2000) Regression Analysis (Spring, 2000) By Wonjae Purposes: a. Explaining the relationship between Y and X variables with a model (Explain a variable Y in terms of Xs) b. Estimating and testing the intensity

More information

The VAR models discussed so fare are appropriate for modeling I(0) data, like asset returns or growth rates of macroeconomic time series.

The VAR models discussed so fare are appropriate for modeling I(0) data, like asset returns or growth rates of macroeconomic time series. Cointegration The VAR models discussed so fare are appropriate for modeling I(0) data, like asset returns or growth rates of macroeconomic time series. Economic theory, however, often implies equilibrium

More information

Module 5: Multiple Regression Analysis

Module 5: Multiple Regression Analysis Using Statistical Data Using to Make Statistical Decisions: Data Multiple to Make Regression Decisions Analysis Page 1 Module 5: Multiple Regression Analysis Tom Ilvento, University of Delaware, College

More information

Introduction to Regression Models for Panel Data Analysis. Indiana University Workshop in Methods October 7, 2011. Professor Patricia A.

Introduction to Regression Models for Panel Data Analysis. Indiana University Workshop in Methods October 7, 2011. Professor Patricia A. Introduction to Regression Models for Panel Data Analysis Indiana University Workshop in Methods October 7, 2011 Professor Patricia A. McManus Panel Data Analysis October 2011 What are Panel Data? Panel

More information

Department of Economics Session 2012/2013. EC352 Econometric Methods. Solutions to Exercises from Week 10 + 0.0077 (0.052)

Department of Economics Session 2012/2013. EC352 Econometric Methods. Solutions to Exercises from Week 10 + 0.0077 (0.052) Department of Economics Session 2012/2013 University of Essex Spring Term Dr Gordon Kemp EC352 Econometric Methods Solutions to Exercises from Week 10 1 Problem 13.7 This exercise refers back to Equation

More information

Forecasting Using Eviews 2.0: An Overview

Forecasting Using Eviews 2.0: An Overview Forecasting Using Eviews 2.0: An Overview Some Preliminaries In what follows it will be useful to distinguish between ex post and ex ante forecasting. In terms of time series modeling, both predict values

More information

Simple Linear Regression Inference

Simple Linear Regression Inference Simple Linear Regression Inference 1 Inference requirements The Normality assumption of the stochastic term e is needed for inference even if it is not a OLS requirement. Therefore we have: Interpretation

More information

Premium Copayments and the Trade-off between Wages and Employer-Provided Health Insurance. February 2011

Premium Copayments and the Trade-off between Wages and Employer-Provided Health Insurance. February 2011 PRELIMINARY DRAFT DO NOT CITE OR CIRCULATE COMMENTS WELCOMED Premium Copayments and the Trade-off between Wages and Employer-Provided Health Insurance February 2011 By Darren Lubotsky School of Labor &

More information

Panel Data Econometrics

Panel Data Econometrics Panel Data Econometrics Master of Science in Economics - University of Geneva Christophe Hurlin, Université d Orléans University of Orléans January 2010 De nition A longitudinal, or panel, data set is

More information

The Basic Two-Level Regression Model

The Basic Two-Level Regression Model 2 The Basic Two-Level Regression Model The multilevel regression model has become known in the research literature under a variety of names, such as random coefficient model (de Leeuw & Kreft, 1986; Longford,

More information

1/27/2013. PSY 512: Advanced Statistics for Psychological and Behavioral Research 2

1/27/2013. PSY 512: Advanced Statistics for Psychological and Behavioral Research 2 PSY 512: Advanced Statistics for Psychological and Behavioral Research 2 Introduce moderated multiple regression Continuous predictor continuous predictor Continuous predictor categorical predictor Understand

More information

FULLY MODIFIED OLS FOR HETEROGENEOUS COINTEGRATED PANELS

FULLY MODIFIED OLS FOR HETEROGENEOUS COINTEGRATED PANELS FULLY MODIFIED OLS FOR HEEROGENEOUS COINEGRAED PANELS Peter Pedroni ABSRAC his chapter uses fully modified OLS principles to develop new methods for estimating and testing hypotheses for cointegrating

More information

How Far is too Far? Statistical Outlier Detection

How Far is too Far? Statistical Outlier Detection How Far is too Far? Statistical Outlier Detection Steven Walfish President, Statistical Outsourcing Services steven@statisticaloutsourcingservices.com 30-325-329 Outline What is an Outlier, and Why are

More information

State Space Time Series Analysis

State Space Time Series Analysis State Space Time Series Analysis p. 1 State Space Time Series Analysis Siem Jan Koopman http://staff.feweb.vu.nl/koopman Department of Econometrics VU University Amsterdam Tinbergen Institute 2011 State

More information

Econometrics Simple Linear Regression

Econometrics Simple Linear Regression Econometrics Simple Linear Regression Burcu Eke UC3M Linear equations with one variable Recall what a linear equation is: y = b 0 + b 1 x is a linear equation with one variable, or equivalently, a straight

More information

Chapter 6: Multivariate Cointegration Analysis

Chapter 6: Multivariate Cointegration Analysis Chapter 6: Multivariate Cointegration Analysis 1 Contents: Lehrstuhl für Department Empirische of Wirtschaftsforschung Empirical Research and und Econometrics Ökonometrie VI. Multivariate Cointegration

More information

Solución del Examen Tipo: 1

Solución del Examen Tipo: 1 Solución del Examen Tipo: 1 Universidad Carlos III de Madrid ECONOMETRICS Academic year 2009/10 FINAL EXAM May 17, 2010 DURATION: 2 HOURS 1. Assume that model (III) verifies the assumptions of the classical

More information

HLM software has been one of the leading statistical packages for hierarchical

HLM software has been one of the leading statistical packages for hierarchical Introductory Guide to HLM With HLM 7 Software 3 G. David Garson HLM software has been one of the leading statistical packages for hierarchical linear modeling due to the pioneering work of Stephen Raudenbush

More information

ECON 142 SKETCH OF SOLUTIONS FOR APPLIED EXERCISE #2

ECON 142 SKETCH OF SOLUTIONS FOR APPLIED EXERCISE #2 University of California, Berkeley Prof. Ken Chay Department of Economics Fall Semester, 005 ECON 14 SKETCH OF SOLUTIONS FOR APPLIED EXERCISE # Question 1: a. Below are the scatter plots of hourly wages

More information

The Method of Least Squares

The Method of Least Squares Hervé Abdi 1 1 Introduction The least square methods (LSM) is probably the most popular technique in statistics. This is due to several factors. First, most common estimators can be casted within this

More information

Appendices with Supplementary Materials for CAPM for Estimating Cost of Equity Capital: Interpreting the Empirical Evidence

Appendices with Supplementary Materials for CAPM for Estimating Cost of Equity Capital: Interpreting the Empirical Evidence Appendices with Supplementary Materials for CAPM for Estimating Cost of Equity Capital: Interpreting the Empirical Evidence This document contains supplementary material to the paper titled CAPM for estimating

More information

Simple Linear Regression, Scatterplots, and Bivariate Correlation

Simple Linear Regression, Scatterplots, and Bivariate Correlation 1 Simple Linear Regression, Scatterplots, and Bivariate Correlation This section covers procedures for testing the association between two continuous variables using the SPSS Regression and Correlate analyses.

More information

Stock prices are not open-ended: Stock trading seems to be *

Stock prices are not open-ended: Stock trading seems to be * Stock prices are not open-ended: Stock trading seems to be * Gunther CAPELLE-BLANCARD Université Paris 1 Panthéon-Sorbonne gunther.capelle-blancard@univ-paris1.fr First draft: August 2014 Current draft:

More information

MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS

MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS MSR = Mean Regression Sum of Squares MSE = Mean Squared Error RSS = Regression Sum of Squares SSE = Sum of Squared Errors/Residuals α = Level of Significance

More information

Do Supplemental Online Recorded Lectures Help Students Learn Microeconomics?*

Do Supplemental Online Recorded Lectures Help Students Learn Microeconomics?* Do Supplemental Online Recorded Lectures Help Students Learn Microeconomics?* Jennjou Chen and Tsui-Fang Lin Abstract With the increasing popularity of information technology in higher education, it has

More information

1.5 Oneway Analysis of Variance

1.5 Oneway Analysis of Variance Statistics: Rosie Cornish. 200. 1.5 Oneway Analysis of Variance 1 Introduction Oneway analysis of variance (ANOVA) is used to compare several means. This method is often used in scientific or medical experiments

More information

5. Multiple regression

5. Multiple regression 5. Multiple regression QBUS6840 Predictive Analytics https://www.otexts.org/fpp/5 QBUS6840 Predictive Analytics 5. Multiple regression 2/39 Outline Introduction to multiple linear regression Some useful

More information

2.2 Elimination of Trend and Seasonality

2.2 Elimination of Trend and Seasonality 26 CHAPTER 2. TREND AND SEASONAL COMPONENTS 2.2 Elimination of Trend and Seasonality Here we assume that the TS model is additive and there exist both trend and seasonal components, that is X t = m t +

More information

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96 1 Final Review 2 Review 2.1 CI 1-propZint Scenario 1 A TV manufacturer claims in its warranty brochure that in the past not more than 10 percent of its TV sets needed any repair during the first two years

More information

ADVANCED FORECASTING MODELS USING SAS SOFTWARE

ADVANCED FORECASTING MODELS USING SAS SOFTWARE ADVANCED FORECASTING MODELS USING SAS SOFTWARE Girish Kumar Jha IARI, Pusa, New Delhi 110 012 gjha_eco@iari.res.in 1. Transfer Function Model Univariate ARIMA models are useful for analysis and forecasting

More information

5. Linear Regression

5. Linear Regression 5. Linear Regression Outline.................................................................... 2 Simple linear regression 3 Linear model............................................................. 4

More information

Course Objective This course is designed to give you a basic understanding of how to run regressions in SPSS.

Course Objective This course is designed to give you a basic understanding of how to run regressions in SPSS. SPSS Regressions Social Science Research Lab American University, Washington, D.C. Web. www.american.edu/provost/ctrl/pclabs.cfm Tel. x3862 Email. SSRL@American.edu Course Objective This course is designed

More information

I n d i a n a U n i v e r s i t y U n i v e r s i t y I n f o r m a t i o n T e c h n o l o g y S e r v i c e s

I n d i a n a U n i v e r s i t y U n i v e r s i t y I n f o r m a t i o n T e c h n o l o g y S e r v i c e s I n d i a n a U n i v e r s i t y U n i v e r s i t y I n f o r m a t i o n T e c h n o l o g y S e r v i c e s Linear Regression Models for Panel Data Using SAS, Stata, LIMDEP, and SPSS * Hun Myoung Park,

More information

16 : Demand Forecasting

16 : Demand Forecasting 16 : Demand Forecasting 1 Session Outline Demand Forecasting Subjective methods can be used only when past data is not available. When past data is available, it is advisable that firms should use statistical

More information

Implementing Panel-Corrected Standard Errors in R: The pcse Package

Implementing Panel-Corrected Standard Errors in R: The pcse Package Implementing Panel-Corrected Standard Errors in R: The pcse Package Delia Bailey YouGov Polimetrix Jonathan N. Katz California Institute of Technology Abstract This introduction to the R package pcse is

More information

MULTIPLE REGRESSION WITH CATEGORICAL DATA

MULTIPLE REGRESSION WITH CATEGORICAL DATA DEPARTMENT OF POLITICAL SCIENCE AND INTERNATIONAL RELATIONS Posc/Uapp 86 MULTIPLE REGRESSION WITH CATEGORICAL DATA I. AGENDA: A. Multiple regression with categorical variables. Coding schemes. Interpreting

More information

Lab 5 Linear Regression with Within-subject Correlation. Goals: Data: Use the pig data which is in wide format:

Lab 5 Linear Regression with Within-subject Correlation. Goals: Data: Use the pig data which is in wide format: Lab 5 Linear Regression with Within-subject Correlation Goals: Data: Fit linear regression models that account for within-subject correlation using Stata. Compare weighted least square, GEE, and random

More information

Multiple Regression: What Is It?

Multiple Regression: What Is It? Multiple Regression Multiple Regression: What Is It? Multiple regression is a collection of techniques in which there are multiple predictors of varying kinds and a single outcome We are interested in

More information

Data Analysis Tools. Tools for Summarizing Data

Data Analysis Tools. Tools for Summarizing Data Data Analysis Tools This section of the notes is meant to introduce you to many of the tools that are provided by Excel under the Tools/Data Analysis menu item. If your computer does not have that tool

More information

Analyzing Intervention Effects: Multilevel & Other Approaches. Simplest Intervention Design. Better Design: Have Pretest

Analyzing Intervention Effects: Multilevel & Other Approaches. Simplest Intervention Design. Better Design: Have Pretest Analyzing Intervention Effects: Multilevel & Other Approaches Joop Hox Methodology & Statistics, Utrecht Simplest Intervention Design R X Y E Random assignment Experimental + Control group Analysis: t

More information

T-test & factor analysis

T-test & factor analysis Parametric tests T-test & factor analysis Better than non parametric tests Stringent assumptions More strings attached Assumes population distribution of sample is normal Major problem Alternatives Continue

More information

Employer-Provided Health Insurance and Labor Supply of Married Women

Employer-Provided Health Insurance and Labor Supply of Married Women Upjohn Institute Working Papers Upjohn Research home page 2011 Employer-Provided Health Insurance and Labor Supply of Married Women Merve Cebi University of Massachusetts - Dartmouth and W.E. Upjohn Institute

More information

1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number

1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number 1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number A. 3(x - x) B. x 3 x C. 3x - x D. x - 3x 2) Write the following as an algebraic expression

More information

Module 3: Correlation and Covariance

Module 3: Correlation and Covariance Using Statistical Data to Make Decisions Module 3: Correlation and Covariance Tom Ilvento Dr. Mugdim Pašiƒ University of Delaware Sarajevo Graduate School of Business O ften our interest in data analysis

More information

Longitudinal (Panel and Time Series Cross-Section) Data

Longitudinal (Panel and Time Series Cross-Section) Data Longitudinal (Panel and Time Series Cross-Section) Data Nathaniel Beck Department of Politics NYU New York, NY 10012 nathaniel.beck@nyu.edu http://www.nyu.edu/gsas/dept/politics/faculty/beck/beck home.html

More information

The US dollar exchange rate and the demand for oil

The US dollar exchange rate and the demand for oil The US dollar exchange rate and the demand for oil Selien De Schryder Ghent University Gert Peersman Ghent University Norges Bank/ECB workshop on "Monetary Policy and Commodity Prices" 19-20 November 2012

More information

A Trading Strategy Based on the Lead-Lag Relationship of Spot and Futures Prices of the S&P 500

A Trading Strategy Based on the Lead-Lag Relationship of Spot and Futures Prices of the S&P 500 A Trading Strategy Based on the Lead-Lag Relationship of Spot and Futures Prices of the S&P 500 FE8827 Quantitative Trading Strategies 2010/11 Mini-Term 5 Nanyang Technological University Submitted By:

More information

UNIVERSITY OF WAIKATO. Hamilton New Zealand

UNIVERSITY OF WAIKATO. Hamilton New Zealand UNIVERSITY OF WAIKATO Hamilton New Zealand Can We Trust Cluster-Corrected Standard Errors? An Application of Spatial Autocorrelation with Exact Locations Known John Gibson University of Waikato Bonggeun

More information

ANNUITY LAPSE RATE MODELING: TOBIT OR NOT TOBIT? 1. INTRODUCTION

ANNUITY LAPSE RATE MODELING: TOBIT OR NOT TOBIT? 1. INTRODUCTION ANNUITY LAPSE RATE MODELING: TOBIT OR NOT TOBIT? SAMUEL H. COX AND YIJIA LIN ABSTRACT. We devise an approach, using tobit models for modeling annuity lapse rates. The approach is based on data provided

More information

Practical. I conometrics. data collection, analysis, and application. Christiana E. Hilmer. Michael J. Hilmer San Diego State University

Practical. I conometrics. data collection, analysis, and application. Christiana E. Hilmer. Michael J. Hilmer San Diego State University Practical I conometrics data collection, analysis, and application Christiana E. Hilmer Michael J. Hilmer San Diego State University Mi Table of Contents PART ONE THE BASICS 1 Chapter 1 An Introduction

More information

CHAPTER 9 EXAMPLES: MULTILEVEL MODELING WITH COMPLEX SURVEY DATA

CHAPTER 9 EXAMPLES: MULTILEVEL MODELING WITH COMPLEX SURVEY DATA Examples: Multilevel Modeling With Complex Survey Data CHAPTER 9 EXAMPLES: MULTILEVEL MODELING WITH COMPLEX SURVEY DATA Complex survey data refers to data obtained by stratification, cluster sampling and/or

More information

Performing Unit Root Tests in EViews. Unit Root Testing

Performing Unit Root Tests in EViews. Unit Root Testing Página 1 de 12 Unit Root Testing The theory behind ARMA estimation is based on stationary time series. A series is said to be (weakly or covariance) stationary if the mean and autocovariances of the series

More information

Week TSX Index 1 8480 2 8470 3 8475 4 8510 5 8500 6 8480

Week TSX Index 1 8480 2 8470 3 8475 4 8510 5 8500 6 8480 1) The S & P/TSX Composite Index is based on common stock prices of a group of Canadian stocks. The weekly close level of the TSX for 6 weeks are shown: Week TSX Index 1 8480 2 8470 3 8475 4 8510 5 8500

More information

Introduction to Data Analysis in Hierarchical Linear Models

Introduction to Data Analysis in Hierarchical Linear Models Introduction to Data Analysis in Hierarchical Linear Models April 20, 2007 Noah Shamosh & Frank Farach Social Sciences StatLab Yale University Scope & Prerequisites Strong applied emphasis Focus on HLM

More information

An analysis method for a quantitative outcome and two categorical explanatory variables.

An analysis method for a quantitative outcome and two categorical explanatory variables. Chapter 11 Two-Way ANOVA An analysis method for a quantitative outcome and two categorical explanatory variables. If an experiment has a quantitative outcome and two categorical explanatory variables that

More information

Mgmt 469. Fixed Effects Models. Suppose you want to learn the effect of price on the demand for back massages. You

Mgmt 469. Fixed Effects Models. Suppose you want to learn the effect of price on the demand for back massages. You Mgmt 469 Fixed Effects Models Suppose you want to learn the effect of price on the demand for back massages. You have the following data from four Midwest locations: Table 1: A Single Cross-section of

More information

1 Short Introduction to Time Series

1 Short Introduction to Time Series ECONOMICS 7344, Spring 202 Bent E. Sørensen January 24, 202 Short Introduction to Time Series A time series is a collection of stochastic variables x,.., x t,.., x T indexed by an integer value t. The

More information

Corporate Defaults and Large Macroeconomic Shocks

Corporate Defaults and Large Macroeconomic Shocks Corporate Defaults and Large Macroeconomic Shocks Mathias Drehmann Bank of England Andrew Patton London School of Economics and Bank of England Steffen Sorensen Bank of England The presentation expresses

More information

Multinomial and Ordinal Logistic Regression

Multinomial and Ordinal Logistic Regression Multinomial and Ordinal Logistic Regression ME104: Linear Regression Analysis Kenneth Benoit August 22, 2012 Regression with categorical dependent variables When the dependent variable is categorical,

More information

Simulation Models for Business Planning and Economic Forecasting. Donald Erdman, SAS Institute Inc., Cary, NC

Simulation Models for Business Planning and Economic Forecasting. Donald Erdman, SAS Institute Inc., Cary, NC Simulation Models for Business Planning and Economic Forecasting Donald Erdman, SAS Institute Inc., Cary, NC ABSTRACT Simulation models are useful in many diverse fields. This paper illustrates the use

More information

Regression Analysis: A Complete Example

Regression Analysis: A Complete Example Regression Analysis: A Complete Example This section works out an example that includes all the topics we have discussed so far in this chapter. A complete example of regression analysis. PhotoDisc, Inc./Getty

More information

Econometric Modelling for Revenue Projections

Econometric Modelling for Revenue Projections Econometric Modelling for Revenue Projections Annex E 1. An econometric modelling exercise has been undertaken to calibrate the quantitative relationship between the five major items of government revenue

More information

Financial Risk Management Exam Sample Questions/Answers

Financial Risk Management Exam Sample Questions/Answers Financial Risk Management Exam Sample Questions/Answers Prepared by Daniel HERLEMONT 1 2 3 4 5 6 Chapter 3 Fundamentals of Statistics FRM-99, Question 4 Random walk assumes that returns from one time period

More information

Directions for using SPSS

Directions for using SPSS Directions for using SPSS Table of Contents Connecting and Working with Files 1. Accessing SPSS... 2 2. Transferring Files to N:\drive or your computer... 3 3. Importing Data from Another File Format...

More information

1 Teaching notes on GMM 1.

1 Teaching notes on GMM 1. Bent E. Sørensen January 23, 2007 1 Teaching notes on GMM 1. Generalized Method of Moment (GMM) estimation is one of two developments in econometrics in the 80ies that revolutionized empirical work in

More information

Trend and Seasonal Components

Trend and Seasonal Components Chapter 2 Trend and Seasonal Components If the plot of a TS reveals an increase of the seasonal and noise fluctuations with the level of the process then some transformation may be necessary before doing

More information

Premaster Statistics Tutorial 4 Full solutions

Premaster Statistics Tutorial 4 Full solutions Premaster Statistics Tutorial 4 Full solutions Regression analysis Q1 (based on Doane & Seward, 4/E, 12.7) a. Interpret the slope of the fitted regression = 125,000 + 150. b. What is the prediction for

More information