An Introduction to Time Series Regression

Similar documents
Chapter 5: Bivariate Cointegration Analysis

This article appeared in a journal published by Elsevier. The attached copy is furnished to the author for internal non-commercial research and

Sales forecasting # 2

VI. Real Business Cycles Models

Chapter 9: Univariate Time Series Analysis

Minimum LM Unit Root Test with One Structural Break. Junsoo Lee Department of Economics University of Alabama

Univariate Time Series Analysis; ARIMA Models

The VAR models discussed so fare are appropriate for modeling I(0) data, like asset returns or growth rates of macroeconomic time series.

I. Basic concepts: Buoyancy and Elasticity II. Estimating Tax Elasticity III. From Mechanical Projection to Forecast

Econometrics Simple Linear Regression

Univariate Time Series Analysis; ARIMA Models

Fractionally integrated data and the autodistributed lag model: results from a simulation study

MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS

The Engle-Granger representation theorem

Rob J Hyndman. Forecasting using. 11. Dynamic regression OTexts.com/fpp/9/1/ Forecasting using R 1

Non-Stationary Time Series andunitroottests

Chapter 1. Vector autoregressions. 1.1 VARs and the identi cation problem

problem arises when only a non-random sample is available differs from censored regression model in that x i is also unobserved

Econometrics I: Econometric Methods

Integrated Resource Plan

Some useful concepts in univariate time series analysis

Part 2: Analysis of Relationship Between Two Variables

Topic 5: Stochastic Growth and Real Business Cycles

Lecture 3: Differences-in-Differences

ECON 142 SKETCH OF SOLUTIONS FOR APPLIED EXERCISE #2

The US dollar exchange rate and the demand for oil

Air passenger departures forecast models A technical note

Inflation as a function of labor force change rate: cointegration test for the USA

IAPRI Quantitative Analysis Capacity Building Series. Multiple regression analysis & interpreting results

Spatial panel models

Time Series Analysis

Testing for Granger causality between stock prices and economic growth

Probability Calculator

Wooldridge, Introductory Econometrics, 3d ed. Chapter 12: Serial correlation and heteroskedasticity in time series regressions

Week TSX Index

Basic Statistics and Data Analysis for Health Researchers from Foreign Countries

2. Illustration of the Nikkei 225 option data

2. Linear regression with multiple regressors

Lecture 2: ARMA(p,q) models (part 3)

Business cycles and natural gas prices

A Classical Monetary Model - Money in the Utility Function

Time Series Analysis

ARMA, GARCH and Related Option Pricing Method

ADVANCED FORECASTING MODELS USING SAS SOFTWARE

Sales forecasting # 1

Lecture 14 More on Real Business Cycles. Noah Williams

GRADO EN ECONOMÍA. Is the Forward Rate a True Unbiased Predictor of the Future Spot Exchange Rate?

Chapter 4: Vector Autoregressive Models

Sovereign Defaults. Iskander Karibzhanov. October 14, 2014

PITFALLS IN TIME SERIES ANALYSIS. Cliff Hurvich Stern School, NYU

Macroeconomic Effects of Financial Shocks Online Appendix

Additional sources Compilation of sources:

Estimating an ARMA Process

Is the Forward Exchange Rate a Useful Indicator of the Future Exchange Rate?

Stock Returns and Equity Premium Evidence Using Dividend Price Ratios and Dividend Yields in Malaysia

Time Series Analysis and Forecasting

2. What is the general linear model to be used to model linear trend? (Write out the model) = or

Financial Market Efficiency and Its Implications

Note 2 to Computer class: Standard mis-specification tests

13. Poisson Regression Analysis

IMPACT EVALUATION: INSTRUMENTAL VARIABLE METHOD

How Important Is the Stock Market Effect on Consumption?

Online Appendix for Demand for Crash Insurance, Intermediary Constraints, and Risk Premia in Financial Markets

ijcrb.com INTERDISCIPLINARY JOURNAL OF CONTEMPORARY RESEARCH IN BUSINESS AUGUST 2014 VOL 6, NO 4

FORECASTING DEPOSIT GROWTH: Forecasting BIF and SAIF Assessable and Insured Deposits

Why the saving rate has been falling in Japan

Forecasting the US Dollar / Euro Exchange rate Using ARMA Models

Chapter 13 Introduction to Nonlinear Regression( 非 線 性 迴 歸 )

The Relationship between Insurance Market Activity and Economic Growth

Testing for Cointegrating Relationships with Near-Integrated Data

THE EFFECTS OF BANKING CREDIT ON THE HOUSE PRICE

Impulse Response Functions

Simple approximations for option pricing under mean reversion and stochastic volatility

Introduction to Dynamic Models. Slide set #1 (Ch in IDM).

How Much Equity Does the Government Hold?

Energy consumption and GDP: causality relationship in G-7 countries and emerging markets

INDIRECT INFERENCE (prepared for: The New Palgrave Dictionary of Economics, Second Edition)

Empirical Properties of the Indonesian Rupiah: Testing for Structural Breaks, Unit Roots, and White Noise

Outline. Topic 4 - Analysis of Variance Approach to Regression. Partitioning Sums of Squares. Total Sum of Squares. Partitioning sums of squares

Lecture 15. Endogeneity & Instrumental Variable Estimation

The Loss in Efficiency from Using Grouped Data to Estimate Coefficients of Group Level Variables. Kathleen M. Lang* Boston College.

TURUN YLIOPISTO UNIVERSITY OF TURKU TALOUSTIEDE DEPARTMENT OF ECONOMICS RESEARCH REPORTS. A nonlinear moving average test as a robust test for ARCH

As we explained in the textbook discussion of statistical estimation of demand

Simple Linear Regression Inference

Java Modules for Time Series Analysis

New Estimates of the Fraction of People Who Consume Current Income

16 : Demand Forecasting

Conditional guidance as a response to supply uncertainty

Lecture Note: Self-Selection The Roy Model. David H. Autor MIT Spring 2003 November 14, 2003

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

Real Business Cycle Theory. Marco Di Pietro Advanced () Monetary Economics and Policy 1 / 35

Illustration (and the use of HLM)

Chapter 6: Multivariate Cointegration Analysis

Chapter 5. Conditional CAPM. 5.1 Conditional CAPM: Theory Risk According to the CAPM. The CAPM is not a perfect model of expected returns.

Unit Labor Costs and the Price Level

Performing Unit Root Tests in EViews. Unit Root Testing

What s New in Econometrics? Lecture 8 Cluster and Stratified Sampling

THE CONTRIBUTION OF EQUIPMENT LEASING IN THE ERROR- CORRECTION MODEL OF INVESTMENT IN MACHINERY AND EQUIPMENT: EVIDENCE FROM ITALY ZANIN, Luca *

Forecasting in supply chains

Accounting for Trends in Productivity and R&D: A. Schumpeterian Critique of Semi-Endogenous Growth Theory

Transcription:

An Introduction to Time Series Regression Henry Thompson Auburn University An economic model suggests examining the effect of exogenous x t on endogenous y t with an exogenous control variable z t. In general functional form y t = f(x t, z t ) where x t and z t can refer to more than a single variable. This introduction focuses on estimating this single equation as the reduced form of an equilibrium condition. Time series variables depend on their own history and estimating these underlying processes is the first step to estimating the relationship of interest. The best predictor of y t+1 may be its own past values that include the influence of all related variables while the estimated relationship is a hypothetical model and related variables are measured with error. Estimates of structural models isolate the effects of exogenous variables and may suggest ways to improve theory. In theory, variables in an OLS regression have a normal distribution with a constant mean and each observation equal to the mean plus a white noise error. Distributions of nonstationary variables with trends have low peaks and fat tails. Further, time plays an explicit role. With a positive trend, early observations are below the mean and later ones above the mean. Standard errors assume constant means. Non-stationary variables understate standard errors, resulting in inflation significance and explanatory power. Consider the OLS regression y t = α 0 + α 1 x t + α 2 z t + ε t (1)

with variables y t, x t, and z t in natural logs. The coefficients of log linear models are point estimates of elasticities. The explicit goal is to interpret theory in terms of the estimated coefficient α 1 = y t / x t. Begin with a theoretical model and derive (1) to relate estimated coefficients to the theoretical model. Rely on theory and regression analysis to suggest the most relevant exogenous variables. The exogenous variable z t can represent a vector. The residual ε t should be white noise WN and has to pass critical tests for a zero mean, lack of residual correlation, and constant variance. The ultimate form of the regression may not be as simple as (1) since OLS requires at least stationary variables. Time series variables may have trends, structural breaks, and heteroskedastic variance. The typical problem in applied time series analysis is that variables are not stationary implying the standard errors in (1) are understated. If theory suggests x t should affect y t but both have trends, they will be correlated and coefficients will appear significant. Significance and explanatory power, however, are overstated. The key concept in applied time series analysis is whether series are stationary. A stationary series has a long history and converges to a steady state. Stationary is weaker than normality but regressions with stationary variables typically produce reliable statistics. As the number of observations increases, reliability increases at an increasing rate. The residual error ε t in (1) should pass the tests of white noise WN. An OLS regression with non-stationary variables leads to residual correlation indicated by a significant correlation corr(ε t ε t-1 ). Residual correlation implies information resides in the residual. Something else must then affect y t in a systematic way.

A spurious OLS regression has biased and underestimated variances, inflated t-statistics, and an inflated R 2. Coefficient estimates are unbiased, however, just as likely above as below the true value. Estimated coefficients are consistent, converging to the true value as the number of observations increases and the variance approaches zero. Coefficients are in fact super consistent with accelerating convergence as the number of observations increases. Nevertheless, the underestimated standard errors are a nagging problem in application. A series that is not stationary may be a difference stationary. If so, regressions on differences of the variables will be reliable. Difference stationary random walks may also be cointegrated. An error correction process then adjusts the variable relative to the long run dynamic equilibrium in the economic model. From a practical perspective, a difference regression that is unsuccessful may conceal significant error correction process. Solve economic models with more than a single dependent variable in reduced form with each dependent variable a function of the exogenous variables. The goal is to estimate this reduced form equation. For example, the market model determines endogenous price P and quantity Q = D = S from the demand function D = D(P,y) and the supply function S = S(P,w). The exogenous variables in the model are the demand shifter y and the supply shifter w. It would be appropriate to estimate Q or P as functions of y and w but inappropriate to estimate Q as a function of P, or vice versa. Theory is flexible in that various assumptions about endogeneity lead to different models and regressions. Be very specific in developing the regression that the right hand side variables are exogenous. The ISLMBP macroeconomic model provides another example of flexibility, with national income Y as a function of exogenous government spending G, money supply M s, the

foreign interest rate r*, and foreign income Y*. The interest rate r is an endogenous variable and should not be exogenous in a regression on the dependent variable Y. It is possible to estimate a separate equation for r. A floating exchange rate e would be an endogenous variable with an exogenous balance of payments B since e adjusts if B 0. If there were a fixed exchange rate, it would be exogenous and B would then be endogenous. Time series processes will determine the form of variables in regression analysis. Lagged effects may be important. An increase in wage might lower labor input next year. The theoretical model and regression then includes lags of exogenous variables. Regression options include transforming variables with logarithms, differences, inverses, and lags. The error correction model ECM includes the residual ε t of the spurious equation (1) in a difference model that separates transitory adjustment from adjustment relative to the long term dynamic equilibrium. Let theory be the guide to variable selection and endogenous variables. Regression results may suggest ways to refine theory. Empirical analysis leads to improved theory. The rest of this Introduction is available. Sections headings are: White noise Stationary variables Stationary with a structural break Difference stationary variables Unit with a structural break Difference models Error correction model ECM Lagged transformation models Other econometric models Conclusion Contact thomph1@auburn.edu.

Stationary Flow Chart WN residuals (a) zero mean (b) zero covariance (c) constant variance (b) -t < μ ε /SD < t ρ(ε t, ε t-1 ) < ρ c = ρ(n - 2).5 /(1- ρ 2 ).5 LB DW Dubin-h ARCH1 β 0 = β 1 = 0 in (2) by F test and e t in (2) WN by (a)& AR(1) Model y t = α 0 + α 1 y t-1 + ε t (α 1 + 2σ) < 1 stationary y t non-stationary, ε t not WN by LB & ARCH AR(2) y t = α 0 + α 1 y t-1 + α 2 y t-2 + ε t a = [α 1 ± (α 2 1 + 4α 2 ).5 ]/2 y t non-stationary, ε t not WN by LB & ARCH unit s a i + 2σ i < 1 (derive σ i ) AR(1) with structural break y t = α 0 + α 1 y t-1 + α 2 D + α 3 Dy t-1 + ε t α + 2σ < 1 before & after break y t, D non-stationary, ε t not WN by LB & ARCH DF Δy t = γy t-1 + ε t γ = 0 random walk RW Δy t *Unit γ 0, ε t not WN by DW & ARCH τ stat DFc Δy t = α 0 + γy t-1 + ε t γ = 0 trend stationary ε t WN γ 0, ε t not WN by DW & ARCH τ μ stat φ 1 test α 0 = γ = 0 => RW Δy t *Unit DFt Δy t = α 0 + γy t-1 + α 1 t + ε t γ = 0 ε t WN γ 0, ε t not WN by DW& ARCH τ T stat φ 2 test α 0 = γ = α 1 = 0 Δy t *Unit ADF Δy t = α 0 + γy t-1 + α 1 Δy t-1 +α 2 t + ε t ADF γ = 0 ε t ADF WN

γ 0, ε t not WN by DW & ARCH τ T stat ADF(2) add α 1 Δy t-2 φ 3 test α 0 = γ = α 1 = α 2 = 0 Δy t *Unit Unit with structural break P y t = a 0 + a 2 t + µ 2 D + ε t Perron ε P t = a 1 ε P t-1 + e t a 1 = 1 ε P t = y t Δy t *Unit Models Spurious model y t = α 0 + α 1 x t + α 2 z t + ε t s ε t s stationary by EG Δε t s = a 1 ε t-1 s + e t with a 1 < 0 ECM Difference Δy t = α 0 + α 1 Δx t + α 2 Δz t + u t or Δy t = α 1 Δx t + α 2 Δz t + u t (spurious coefficients) ECM Δy t = β 0 + β 1 Δx t + β 2 Δz t + β y ε t-1 s + u t with lags β 1 Δx t-1 etc LTM y t = α 0 + α 1 y t-1 + α 2 x t + α 3 z t + α 4 x t-1 + α 5 z t-1 + e t if ε t s RW by EG test with a 1 = 0 Detrend y t = α 0 + α 1 t + α 2 t 2 + α 3 t 3 dt + ε t WN ε t dt

Stationary Table AR(1) DF DFc DFt ADF AR(2) P SB y t α 1 +2σ<1 t* t t t a i + 2σ i t LB* F* F F > 1 ε t DW* DW ARCH* DW ARCH x t α 1 +2σ>1 t t ε t DW* DW ARCH z t α 1 +2σ>1 t* t F* ε t τ DF φ τ τ μ φ 1 t F* τ T φ 2 t F DW ARCH τ T φ 3-3.76 Notes: 1. Variables are not AR(1) stationary, y t due to residual correlation. Not necessary to report WN test if coefficient test fails. Other AR(p) or ARMA(p,q) models can be reported. 2. y t has a unit by Perron structural break test P SB but not by the DF (t > τ), DFc (F > φ 1 ), DFt (DW), or ADF (ARCH). y t is not AR(2) stationary. 2. x t has unit by DFc test but not DF test due to residual correlation. 3. z t is RW with F < φ 3 in ADF