Note 2 to Computer class: Standard mis-specification tests


Ragnar Nymoen
September 2,

Why mis-specification testing of econometric models?

As econometricians we must relate to the fact that the data generating process (DGP) that has produced the data is not the same as the econometric model that we have specified and estimated. The situation is very different from the case where the data come from a laboratory experiment. There the DGP can in principle be regarded as known (anything else can be seen as a result of "bad experimental design"), since the experiment has been devised and supervised by the researchers themselves. The situation of an experimental researcher can be thought of as follows:

$$\underbrace{Y_i}_{\text{result}} = \underbrace{g(X_i)}_{\text{input}} + \underbrace{v_i}_{\text{shock}} \qquad (1)$$

The variable $Y_i$ is the result of the experiment, while $X_i$ is the imputed input variable, which is decided by the researcher. $g(X_i)$ is a deterministic function. The variable $v_i$ is a shock which leads to some separate variation in $Y_i$ for the chosen $X_i$. The aim of the experiment is to find the effect that $X$ has as a causal variable on $Y$. If the function $g(X_i)$ is linear, this causal relationship can be investigated with the use of OLS estimation.

In economics, the use of experimental data is increasing, but the bulk of applied econometric analysis still makes use of non-experimental data. Non-experimental economic data are usually collected for purposes other than research, and the data reflect the real-life decisions made by a vast number of heterogeneous agents. Hence, the starting point of an econometric modelling project is usually fundamentally different from the statistical analysis of experimental data. In order to maintain (1) as a model of econometrics for this kind of data, we have to invoke the axiom of correct specification, meaning that we know the DGP before the analysis is made.
If we want to avoid the axiom of correct specification then, instead of (1), we need to write

$$\underbrace{Y_i}_{\text{observed}} = \underbrace{f(X_i)}_{\text{explained}} + \underbrace{\varepsilon_i}_{\text{remainder}} \qquad (2)$$

where $Y_i$ are the observations of the dependent variable, which we seek to explain by the use of economic theory and our knowledge of the subject matter. Our explanation is given by the function $f(X_i)$, which in the regression case can be characterized precisely as the conditional expectation function. The non-experimental $Y_i$ is not determined or caused by $f(X_i)$; it is determined by a DGP that is unknown to us, and all variation in $Y_i$ that we do not account for must therefore end up in the remainder $\varepsilon_i$. Unlike (1), where $v_i$ represents free and independent variation in $Y_i$, $\varepsilon_i$ in (2) is an implied variable which gets its properties from the DGP and the explanation, in effect from the model $f(X_i)$. Hence in econometrics, we should write:

$$\varepsilon_i = Y_i - f(X_i) \qquad (3)$$

to describe that whatever we do on the right-hand side of (3), by way of changing the specification of $f(X_i)$ or by changing the measurement of $Y_i$, the left-hand side is derived as a result.

This analysis poses at least two important questions. The first is related to causation: although we can make $f(X_i)$ precise, as a conditional expectation function, we cannot claim that $X_i$ is a causal variable. Again this is different from the experimental case. However, as we shall see, we can often combine economic theory and further econometric analysis of the system that contains $Y_i$ and $X_i$ as endogenous variables, to reach a meaningful interpretation of the joint evidence in favour of one-way causality or two-way causality. Recently, there has also been a surge in micro data sets based on large registers, which has opened up a new approach to causal modelling based on natural experiments and difference-in-differences estimators (see footnote 1).

The second major issue is potential model mis-specification, and how to discover mis-specification if it occurs. Residual mis-specification, in particular, is defined relative to the classical regression model. Hence we say that there is residual mis-specification if the residuals from the model behave significantly differently from what we would expect to see if the true disturbances of the model adhered to the classical assumptions of homoscedasticity, non-autocorrelation, and no cross-sectional dependence.

(This note is a translated extract from Chapter 8 in Bårdsen and Nymoen (2011). It also draws on Hendry (1995, Ch 1) and Hendry and Nielsen (2007, Ch 11).)
Clearly, if the axiom of correct specification holds, we would see little evidence of residual mis-specification. However, even the smallest experience of applied econometrics will show that mis-specification frequently happens. As we know from elementary econometrics, the consequences of mis-specification for the properties of estimators and tests are sometimes not very serious. For example, non-normality alone only entails that there are problems with knowing the exact distribution of, for example, the t-statistic in small samples. There are ways around this problem, by use of robust methods for covariance estimation. Other forms of mis-specification give more serious problems: autocorrelated disturbances in a dynamic model may, for example, lead to coefficient estimators being biased even in very large samples (i.e., they are inconsistent).

Footnote 1: An elementary exposition is found in, for example, Hill et al. (2008, Ch 7.51). Stewart and Wallis (1981) is an early textbook presentation of the basic form of this estimator, but without the label "difference in differences", which is a much more recent innovation.

The following table gives an overview, and can serve as a review of what we know from elementary econometrics.

                   Disturbances $\varepsilon_i$ are:
                   heteroscedastic                          autocorrelated
$X_i$              $\hat\beta_1$          $Var(\hat\beta_1)$   $\hat\beta_1$            $Var(\hat\beta_1)$
exogenous          unbiased, consistent   wrong                unbiased, consistent     wrong
predetermined      biased, consistent     wrong                biased, inconsistent     wrong

Here we have in mind a linear model $Y_i = \beta_0 + \beta_1 X_i + \varepsilon_i$, where $\hat\beta_1$ is the OLS estimator of the slope coefficient $\beta_1$ and $Var(\hat\beta_1)$ denotes the usual OLS estimator of its variance (the square of the standard error). The entry "wrong" indicates that this estimator of the variance of $\hat\beta_1$ is not the correct estimator to use: it can overestimate or underestimate the uncertainty.

We assume that we estimate by OLS because we are interested in $\beta_1$ as a parameter in the conditional expectation function. This means that we can regard $X_i$ as exogenous in the sense that all the disturbances are uncorrelated with $X_i$. There is one important exception, and that is when we have time series data and $X_t$ is the lag of $Y_t$, i.e., we have $Y_{t-1}$ on the right-hand side. In this case $X_i$ in the table is not exogenous but predetermined: it is uncorrelated with current and future disturbances, but not with $\varepsilon_{t-1}$, $\varepsilon_{t-2}$, and so on backward.

Because of their importance in the assessment of the quality of econometric models, most programs contain a battery of mis-specification tests. PcGive is no exception, and in fact PcGive reports such tests in the default output. The default output is a little different for cross-section and time series models, so we show examples of both types of models and comment on the differences. We give references to Davidson and MacKinnon's Econometric Theory and Methods, to the book by Hill, Griffiths and Lim used in ECON-3150/4150, and to Bårdsen and Nymoen (2011); the last may be useful for Norwegian students since it has a separate chapter on mis-specification testing.
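The "biased, inconsistent" entry in the table can be illustrated with a small simulation. The following is a minimal sketch that is not part of the original note: the function names, the parameter values ($\beta_1 = 0.5$, autocorrelation coefficient $\rho = 0.5$) and the simplification $\beta_0 = 0$ are all assumptions made for illustration. The regressor is $Y_{t-1}$ and the disturbance follows a first-order autoregression, so the OLS slope converges to the first autocorrelation of $Y_t$ rather than to $\beta_1$.

```python
import numpy as np


def ols_slope_lagged_y(beta=0.5, rho=0.5, T=100_000, seed=0):
    """OLS slope from regressing Y_t on Y_{t-1} (no intercept, beta_0 = 0).

    Hypothetical DGP for illustration:
        Y_t   = beta * Y_{t-1} + eps_t
        eps_t = rho * eps_{t-1} + u_t,   u_t ~ N(0, 1)
    """
    rng = np.random.default_rng(seed)
    u = rng.standard_normal(T)
    eps = np.zeros(T)
    y = np.zeros(T)
    for t in range(1, T):
        eps[t] = rho * eps[t - 1] + u[t]  # autocorrelated disturbance
        y[t] = beta * y[t - 1] + eps[t]   # dynamic model with lagged Y
    x_lag, y_now = y[:-1], y[1:]
    return float(x_lag @ y_now / (x_lag @ x_lag))


def plim_ols_slope(beta, rho):
    """Large-sample limit of the OLS slope (valid for |beta|, |rho| < 1).

    Y_t is then an AR(2) process, and the OLS slope converges to its
    first autocorrelation, (beta + rho) / (1 + beta * rho).
    """
    return (beta + rho) / (1.0 + beta * rho)


if __name__ == "__main__":
    print("true beta:", 0.5)
    print("OLS estimate, rho = 0.5:", ols_slope_lagged_y(rho=0.5))
    print("probability limit:", plim_ols_slope(0.5, 0.5))
    print("OLS estimate, rho = 0.0:", ols_slope_lagged_y(rho=0.0))
```

With $\rho = 0$ the same regression recovers $\beta_1$ in large samples, which corresponds to the case where the predetermined regressor causes no consistency problem.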
2 Mis-specification tests for cross-section data

We take the simple regression on the konsum sim data set as our example; see the note called Seminar PcGive intro.pdf:

[Screen capture: PcGive regression output]

The default mis-specification tests are at the bottom of the screen capture.

Normality test

The normality assumption for the disturbances is important for the exact statistical distribution of OLS estimators and the associated test statistics. Concretely, it determines which p-values to use for t-tests and F-tests, and which critical values to use for confidence intervals and prediction intervals. If the normality assumption holds, it is correct inference to use the t-distribution to test hypotheses about single parameters of the model, and the F-distribution to test joint hypotheses. If the normality assumption cannot be maintained, inference with the t- and F-distributions is no longer exact, but it can still be a good approximation, and it gets increasingly good with increasing sample size.

In the output above, the normality test is Chi-square distributed with two degrees of freedom, $\chi^2(2)$, reported as Normality test: Chi^2(2). This test is based on the two moments

$$\hat\kappa_3 = \frac{1}{n}\sum_{i=1}^{n} \hat\varepsilon_i^3 / \hat\sigma^3 \quad \text{(skewness)}$$

and

$$\hat\kappa_4 = \frac{1}{n}\sum_{i=1}^{n} \hat\varepsilon_i^4 / \hat\sigma^4 - 3 \quad \text{(kurtosis)}$$

where $\hat\varepsilon_i$ denotes a residual from the estimated model. Skewness refers to how symmetric the residuals are around zero. Kurtosis refers to the peakedness of the distribution. For a normal distribution the kurtosis value is 3, so $\hat\kappa_4$ measures excess kurtosis. These two moments are used to construct the test statistics

$$\chi^2_{skew} = n\,\frac{\hat\kappa_3^2}{6}, \qquad \chi^2_{kurt} = n\,\frac{\hat\kappa_4^2}{24}$$

and, jointly,

$$\chi^2_{norm} = \chi^2_{skew} + \chi^2_{kurt}$$

with degrees of freedom 1, 1 and 2 under the null hypothesis of normality of $\varepsilon_i$. As you can guess, $\chi^2_{norm}$ corresponds to Normality test: Chi^2(2) in the screen capture. The p-value is in brackets and refers to the joint null of no skewness and no excess kurtosis. To reject that null you would have to accept a significance level at least as large as the reported p-value. Hence, there is no formal evidence of non-normality for this model. PcGive calculates the skewness and kurtosis moments, but they are not reported as part of the default output.
To access the more detailed information, click Model-Test from the main menu and then check the box for Tests.., click OK, and in the next menu check Normality test and click OK. The $\chi^2_{norm}$ statistic is often referred to as the Jarque-Bera test, due to Jarque and Bera (1980).
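The skewness and kurtosis statistics described above can be computed directly from a vector of residuals. The sketch below is an illustration rather than a reproduction of PcGive's calculation: the function name `jarque_bera` and the returned dictionary keys are hypothetical, $\hat\sigma^2$ is taken to be the mean squared residual (an assumption), and the number PcGive reports need not match this textbook formula exactly. The p-value uses the fact that the survival function of a $\chi^2(2)$ variable is $\exp(-x/2)$.

```python
import math

import numpy as np


def jarque_bera(resid):
    """Textbook skewness/kurtosis normality test on a residual vector."""
    e = np.asarray(resid, dtype=float)
    n = e.size
    e = e - e.mean()           # OLS residuals with an intercept already have mean zero
    sigma2 = np.mean(e ** 2)   # assumption: divisor n, not n - k
    skewness = np.mean(e ** 3) / sigma2 ** 1.5
    excess_kurtosis = np.mean(e ** 4) / sigma2 ** 2 - 3.0
    chi2_skew = n * skewness ** 2 / 6.0
    chi2_kurt = n * excess_kurtosis ** 2 / 24.0
    chi2_norm = chi2_skew + chi2_kurt
    p_value = math.exp(-chi2_norm / 2.0)  # P(chi2(2) > chi2_norm)
    return {
        "skewness": skewness,
        "excess_kurtosis": excess_kurtosis,
        "chi2_skew": chi2_skew,
        "chi2_kurt": chi2_kurt,
        "chi2_norm": chi2_norm,
        "p_value": p_value,
    }


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    print(jarque_bera(rng.standard_normal(47)))
```

A small p-value would be evidence against the joint null of no skewness and no excess kurtosis.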

Davidson and MacKinnon: the unnumbered section "Tests for Skewness and Kurtosis". The exposition is rather technical. It refers, for example, to the concept of an Outer-Product-of-the-Gradient (OPG) estimator, which we do not assume familiarity with in this course. Nevertheless, note equation (15.34) on page 663, and that they too refer to this joint test as the Jarque-Bera test.
Textbook in ECON 4150: Hill, Griffiths and Lim, p 89
Textbook in Norwegian: Bårdsen and Nymoen

Heteroscedasticity tests (White-test)

Formal tests of the homoscedasticity assumption were proposed by White (1980), so these tests are often referred to as White-tests. In the simplest case, which we have here, the test is based on the auxiliary regression:

$$\hat\varepsilon_i^2 = a_0 + a_1 X_i + a_2 X_i^2, \qquad i = 1, 2, \ldots, n, \qquad (4)$$

where, as stated above, the $\hat\varepsilon_i$ are the OLS residuals from the model. Under the null hypothesis of homoscedasticity we have

$$H_0: a_1 = a_2 = 0$$

which can be tested by the usual F-test on (4). This statistic, which we will refer to by the symbol $F_{het}$, is F-distributed with 2 and $n - 3$ degrees of freedom under $H_0$, where $n$ denotes the number of observations. In our example, this means that we use $F(2, 47 - 3)$, i.e., $F(2, 44)$, and it is reported as Hetero test: F(2,44) in the screen capture. Note that you would reject the null hypothesis at the 5 % level based on this evidence, but not reject at the stricter 2.5 % level.

You will often see in textbooks that there are Chi-square distributed versions of the mis-specification tests that are based on auxiliary regressions. This is the case for White's test, which is distributed $\chi^2(2)$ in the present example. It is calculated as $nR^2_{het}$, where $R^2_{het}$ is the squared multiple correlation coefficient from (4).
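The auxiliary regression (4) and the two versions of the test statistic can be sketched in a few lines. This is an illustrative implementation under stated assumptions, not PcGive's code: the function name `white_test` and the simulated data are hypothetical, and the F-form is computed as the usual F-test of $a_1 = a_2 = 0$ written in terms of $R^2_{het}$.

```python
import numpy as np


def white_test(y, x):
    """White's test for a simple regression of y on a constant and x.

    Returns (r2_het, f_het, chi2_het), where f_het is compared with
    F(2, n - 3) and chi2_het = n * r2_het with chi2(2).
    """
    y = np.asarray(y, dtype=float)
    x = np.asarray(x, dtype=float)
    n = y.size

    # Step 1: OLS on the original model, keep the squared residuals.
    X = np.column_stack([np.ones(n), x])
    b, *_ = np.linalg.lstsq(X, y, rcond=None)
    e2 = (y - X @ b) ** 2

    # Step 2: auxiliary regression of squared residuals on 1, x, x^2.
    Z = np.column_stack([np.ones(n), x, x ** 2])
    a, *_ = np.linalg.lstsq(Z, e2, rcond=None)
    u = e2 - Z @ a
    r2_het = 1.0 - (u @ u) / np.sum((e2 - e2.mean()) ** 2)

    # Step 3: F-form (2 restrictions, n - 3 residual df) and n*R^2 form.
    f_het = (r2_het / (1.0 - r2_het)) * (n - 3) / 2.0
    chi2_het = n * r2_het
    return float(r2_het), float(f_het), float(chi2_het)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    x = rng.uniform(1.0, 5.0, size=500)
    # Hypothetical heteroscedastic DGP: disturbance scale grows with x.
    y = 1.0 + 2.0 * x + x * rng.standard_normal(500)
    print(white_test(y, x))
```

Because both statistics are functions of the same $R^2_{het}$, they differ only in how degrees of freedom enter.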
From elementary econometrics we know that the F-distributed statistic can be written as

$$F_{het} = \frac{R^2_{het}}{1 - R^2_{het}} \cdot \frac{n - 3}{2},$$

confirming that the two versions of the test use the same basic information, and that the difference is that the F-version adjusts for degrees of freedom. Usually the effect is to keep control over the level (or size) of the test, so that the evidence against the null is not overstated.

In PcGive you get the $nR^2_{het}$ version of the test by using Model-Test from the main menu and then checking the box for Tests.., clicking OK, and in the next menu checking Heteroscedasticity test (using squares) and clicking OK.

With two or more explanatory variables there is an extended version of White's test that includes cross-products of the regressors in the auxiliary regression. In the screen capture, this test is Hetero-X test: F(2,44). Since we have one regressor it is identical to the first test. If we include a second regressor in the model, the test would be Hetero-X test: F(5,41), since the auxiliary regression contains $X_{1i}$, $X_{2i}$, $X_{1i}^2$, $X_{2i}^2$ and $X_{1i}X_{2i}$.

Davidson and MacKinnon: Section 7.5. The Gauss-Newton Regression (GNR) they refer to is the same as what we have called the auxiliary regression above. The theoretical motivation for the GNR is given in Chapter 6, Non-linear regression (the concept is introduced on page 236), which is very useful as a reference, although we do not require any detailed knowledge about numerical methods of optimization in this course.
Textbook used in ECON 4150: Hill, Griffiths and Lim, p 215
Textbook in Norwegian: Bårdsen and Nymoen

Regression Specification Error Test, RESET

The RESET test in the last line of the screen capture is based on the auxiliary regression

$$Y_i = a_0 + a_1 X_i + a_2 \hat{Y}_i^2 + a_3 \hat{Y}_i^3 + v_i, \qquad i = 1, 2, \ldots, n, \qquad (5)$$

where $\hat{Y}_i$ denotes the fitted values. The label RESET23 test indicates that there is both a squared and a cubic term in (5), so that the joint null hypothesis is $a_2 = a_3 = 0$. If you access the Model-Test menu, you also get the RESET test that only includes the squares $\hat{Y}_i^2$. Note that there are $\chi^2$ distributed versions of both tests.

As the name suggests, the RESET test is sometimes interpreted as a test of the correctness of the model, the functional form in particular. However, most modern textbooks now stress that the RESET test is non-constructive (i.e., it gives no indication about what to do if the test is significant). Hence, the consensus is to interpret the RESET test as a general mis-specification test.

Davidson and MacKinnon: Section 15.2
Textbook in ECON 4150: Hill, Griffiths and Lim
Textbook in Norwegian: Bårdsen and Nymoen

Mis-specification tests for time series data

We can re-estimate the same regression as above, but as an explicit model for time series data.
Follow the instructions in Seminar PcGive intro.pdf to obtain:

[Screen capture: PcGive regression output for the time series model]

The only difference is that we have two new mis-specification tests, labelled AR 1-2 test and ARCH 1-1 test in the output. This gives us a double message: first, that all the mis-specification tests for cross-section data are equally relevant for models that use time series data; second, that there are special mis-specification issues for time series data. This is because of three features. First, with time series data, we have a natural ordering of the observations, from the past to the present. Second, time series data are usually autocorrelated, meaning that $Y_t$ is correlated with $Y_{t-1}$, $Y_{t-2}$ and usually also longer lags (and leads). Third, unless $f(X_t)$ in (2), interpreted as a time series model, explains all the autocorrelation in $Y_t$, there will be residual autocorrelation in $\varepsilon_t$, meaning that the classical assumption about uncorrelated disturbances does not hold.

Residual autocorrelation

AR 1-2 test is a standard test of autocorrelation up to degree 2. It tests the joint hypothesis that $\hat\varepsilon_t$ is uncorrelated with $\hat\varepsilon_{t-j}$ for any choice of $j$, against the alternative that $\hat\varepsilon_t$ is correlated with $\hat\varepsilon_{t-1}$ or $\hat\varepsilon_{t-2}$. The test makes use of the auxiliary regression

$$\hat\varepsilon_t = a_0 + a_1 \hat\varepsilon_{t-1} + a_2 \hat\varepsilon_{t-2} + a_3 X_t + v_t \qquad (6)$$

and the null hypothesis tested is

$$H_0: a_1 = a_2 = 0.$$

Many textbooks (Greene, and also Hill, Griffiths and Lim) refer to this, rather technically, as the Lagrange multiplier test, but then one should add "for autocorrelation", since the other tests can also be interpreted statistically as Lagrange multiplier tests. As noted by Bårdsen and Nymoen (2011), several researchers have contributed to this test for autocorrelation, notably Godfrey (1978) and Harvey (1981, page 173). Based on the evidence (note the F distribution again; the $\chi^2$ form is available from the Model-Test menu), there is no sign of autocorrelation in this case.

This test is flexible. If you have reason to believe that the likely form of autocorrelation is of the first degree, it is efficient to base the test on an auxiliary regression with only a single lag. Extension to higher-order autocorrelation is also straightforward and is easily done in the Model-Test menu in PcGive.
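The auxiliary regression (6) can be sketched as follows. This is a minimal illustration, not PcGive's implementation: the function name `ar_test` is hypothetical, the missing initial lagged residuals are set to zero (one common convention, assumed here), and the degrees of freedom used by PcGive may differ. The F-form compares the residual sums of squares with and without the lagged residuals; the $\chi^2$ form is $T R^2$ from the auxiliary regression.

```python
import numpy as np


def ar_test(y, x, lags=2):
    """Residual autocorrelation test of order `lags` for y on (1, x).

    Returns (f_ar, chi2_ar): f_ar is compared with F(lags, T - 2 - lags)
    and chi2_ar = T * R^2 with chi2(lags).
    """
    y = np.asarray(y, dtype=float)
    x = np.asarray(x, dtype=float)
    T = y.size

    # Step 1: OLS residuals from the original model.
    X = np.column_stack([np.ones(T), x])
    b, *_ = np.linalg.lstsq(X, y, rcond=None)
    e = y - X @ b

    # Step 2: lagged residuals, with zeros for the unavailable start values.
    lagged = np.column_stack(
        [np.concatenate([np.zeros(j), e[:-j]]) for j in range(1, lags + 1)]
    )

    # Step 3: auxiliary regression of e_t on 1, x_t and the lagged residuals.
    Z = np.column_stack([X, lagged])
    a, *_ = np.linalg.lstsq(Z, e, rcond=None)
    v = e - Z @ a
    rss_u = v @ v   # unrestricted residual sum of squares
    rss_r = e @ e   # restricted: e is orthogonal to (1, x) by construction

    f_ar = ((rss_r - rss_u) / lags) / (rss_u / (T - Z.shape[1]))
    chi2_ar = T * (1.0 - rss_u / rss_r)
    return float(f_ar), float(chi2_ar)


if __name__ == "__main__":
    rng = np.random.default_rng(1)
    T = 300
    x = rng.standard_normal(T)
    u = rng.standard_normal(T)
    eps = np.zeros(T)
    for t in range(1, T):
        eps[t] = 0.7 * eps[t - 1] + u[t]  # hypothetical AR(1) disturbance
    print("autocorrelated errors:", ar_test(1.0 + 2.0 * x + eps, x))
    print("independent errors:   ", ar_test(1.0 + 2.0 * x + u, x))
```

Setting `lags=1` gives the single-lag version of the test, and larger values extend it to higher-order autocorrelation.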

Importantly, the test is also valid for dynamic models, where $Y_{t-1}$ is among the explanatory variables. This is not the case for the older Durbin-Watson test, for example (which can still be found in the Model-Test menu, though).

Davidson and MacKinnon: section 7.7
Textbook used in ECON 4150: Hill, Griffiths and Lim, p 242
Textbook in Norwegian: Bårdsen and Nymoen

Autoregressive Conditional Heteroscedasticity (ARCH)

With time series data it is possible that the variance of $\varepsilon_t$ is non-constant. If the variance follows an autoregressive model of the first order, this type of heteroscedasticity is represented as

$$Var(\varepsilon_t \mid \varepsilon_{t-1}) = \alpha_0 + \alpha_1 \varepsilon_{t-1}^2.$$

The null hypothesis of constant variance can be tested by using the auxiliary regression:

$$\hat\varepsilon_t^2 = a_0 + a_1 \hat\varepsilon_{t-1}^2 + v_t, \qquad (7)$$

where $\hat\varepsilon_t^2$ ($t = 1, 2, \ldots, T$) are squared residuals. The coefficient of determination, $R^2_{arch}$, from (7) is used to calculate $T R^2_{arch}$, which is $\chi^2(1)$ under the null hypothesis. As with many of the other tests, the F-form of the test is preferred, as the screen capture above also shows. Extensions to higher-order residual ARCH are done in the Model-Test menu.

We use the ARCH model as a mis-specification test here, but this class of models has become widely used for modelling volatile time series, especially in finance. The ARCH model is due to Engle (1982).

Davidson and MacKinnon: section 13.6, page 589
Textbook in ECON 4150: Hill, Griffiths and Lim, p 369
Textbook in Norwegian: Bårdsen and Nymoen

References

Bårdsen, G. and R. Nymoen (2011). Innføring i økonometri. Fagbokforlaget.
Engle, R. F. (1982). Autoregressive conditional heteroscedasticity, with estimates of the variance of United Kingdom inflation. Econometrica, 50.
Godfrey, L. G. (1978). Testing for Higher Order Serial Correlation When the Regressors Include Lagged Dependent Variables. Econometrica, 46.
Greene, W. (2012). Econometric Analysis. Pearson, 7th edn.

Harvey, A. C. (1981). The Econometric Analysis of Time Series. Philip Allan, Oxford.
Hendry, D. F. (1995). Dynamic Econometrics. Oxford University Press, Oxford.
Hendry, D. F. and B. Nielsen (2007). Econometric Modeling. Princeton University Press, Princeton and Oxford.
Hill, R. C., W. Griffiths and G. Lim (2008). Principles of Econometrics. Wiley, 3rd edn.
Jarque, C. M. and A. K. Bera (1980). Efficient tests for normality, homoscedasticity and serial independence of regression residuals. Economics Letters, 6.
Stewart, M. B. and K. F. Wallis (1981). Introductory Econometrics. Basil Blackwell.
White, H. (1980). A Heteroskedasticity-Consistent Covariance Matrix Estimator and a Direct Test for Heteroskedasticity. Econometrica, 48.


Sales forecasting # 1

Sales forecasting # 1 Arthur Charpentier arthur.charpentier@univ-rennes1.fr 1 Agenda Qualitative and quantitative methods, a very general introduction Series decomposition Short versus long term forecasting

HLM software has been one of the leading statistical packages for hierarchical

Introductory Guide to HLM With HLM 7 Software 3 G. David Garson HLM software has been one of the leading statistical packages for hierarchical linear modeling due to the pioneering work of Stephen Raudenbush

Do Supplemental Online Recorded Lectures Help Students Learn Microeconomics?*

Do Supplemental Online Recorded Lectures Help Students Learn Microeconomics?* Jennjou Chen and Tsui-Fang Lin Abstract With the increasing popularity of information technology in higher education, it has

Sales forecasting # 2

Sales forecasting # 2 Arthur Charpentier arthur.charpentier@univ-rennes1.fr 1 Agenda Qualitative and quantitative methods, a very general introduction Series decomposition Short versus long term forecasting

Granger Causality between Government Revenues and Expenditures in Korea

Volume 23, Number 1, June 1998 Granger Causality between Government Revenues and Expenditures in Korea Wan Kyu Park ** 2 This paper investigates the Granger causal relationship between government revenues

Least Squares Estimation

Least Squares Estimation SARA A VAN DE GEER Volume 2, pp 1041 1045 in Encyclopedia of Statistics in Behavioral Science ISBN-13: 978-0-470-86080-9 ISBN-10: 0-470-86080-4 Editors Brian S Everitt & David

Chapter 7: Simple linear regression Learning Objectives

Chapter 7: Simple linear regression Learning Objectives Reading: Section 7.1 of OpenIntro Statistics Video: Correlation vs. causation, YouTube (2:19) Video: Intro to Linear Regression, YouTube (5:18) -

Financial Risk Management Exam Sample Questions/Answers

Financial Risk Management Exam Sample Questions/Answers Prepared by Daniel HERLEMONT 1 2 3 4 5 6 Chapter 3 Fundamentals of Statistics FRM-99, Question 4 Random walk assumes that returns from one time period

Chapter 2. Dynamic panel data models

Chapter 2. Dynamic panel data models Master of Science in Economics - University of Geneva Christophe Hurlin, Université d Orléans Université d Orléans April 2010 Introduction De nition We now consider

Forecasting Using Eviews 2.0: An Overview

Forecasting Using Eviews 2.0: An Overview Some Preliminaries In what follows it will be useful to distinguish between ex post and ex ante forecasting. In terms of time series modeling, both predict values

GRADO EN ECONOMÍA. Is the Forward Rate a True Unbiased Predictor of the Future Spot Exchange Rate?

FACULTAD DE CIENCIAS ECONÓMICAS Y EMPRESARIALES GRADO EN ECONOMÍA Is the Forward Rate a True Unbiased Predictor of the Future Spot Exchange Rate? Autor: Elena Renedo Sánchez Tutor: Juan Ángel Jiménez Martín

Chapter 10: Basic Linear Unobserved Effects Panel Data. Models:

Chapter 10: Basic Linear Unobserved Effects Panel Data Models: Microeconomic Econometrics I Spring 2010 10.1 Motivation: The Omitted Variables Problem We are interested in the partial effects of the observable

HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION

HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION HOD 2990 10 November 2010 Lecture Background This is a lightning speed summary of introductory statistical methods for senior undergraduate

From the help desk: Bootstrapped standard errors

The Stata Journal (2003) 3, Number 1, pp. 71 80 From the help desk: Bootstrapped standard errors Weihua Guan Stata Corporation Abstract. Bootstrapping is a nonparametric approach for evaluating the distribution

Testing Group Differences using T-tests, ANOVA, and Nonparametric Measures

Testing Group Differences using T-tests, ANOVA, and Nonparametric Measures Jamie DeCoster Department of Psychology University of Alabama 348 Gordon Palmer Hall Box 870348 Tuscaloosa, AL 35487-0348 Phone:

INDIRECT INFERENCE (prepared for: The New Palgrave Dictionary of Economics, Second Edition)

INDIRECT INFERENCE (prepared for: The New Palgrave Dictionary of Economics, Second Edition) Abstract Indirect inference is a simulation-based method for estimating the parameters of economic models. Its

problem arises when only a non-random sample is available differs from censored regression model in that x i is also unobserved

4 Data Issues 4.1 Truncated Regression population model y i = x i β + ε i, ε i N(0, σ 2 ) given a random sample, {y i, x i } N i=1, then OLS is consistent and efficient problem arises when only a non-random

THE IMPACT OF EXCHANGE RATE VOLATILITY ON BRAZILIAN MANUFACTURED EXPORTS

THE IMPACT OF EXCHANGE RATE VOLATILITY ON BRAZILIAN MANUFACTURED EXPORTS ANTONIO AGUIRRE UFMG / Department of Economics CEPE (Centre for Research in International Economics) Rua Curitiba, 832 Belo Horizonte

An analysis appropriate for a quantitative outcome and a single quantitative explanatory. 9.1 The model behind linear regression

Chapter 9 Simple Linear Regression An analysis appropriate for a quantitative outcome and a single quantitative explanatory variable. 9.1 The model behind linear regression When we are examining the relationship

11. Analysis of Case-control Studies Logistic Regression

Research methods II 113 11. Analysis of Case-control Studies Logistic Regression This chapter builds upon and further develops the concepts and strategies described in Ch.6 of Mother and Child Health:

Lecture 15. Endogeneity & Instrumental Variable Estimation

Lecture 15. Endogeneity & Instrumental Variable Estimation Saw that measurement error (on right hand side) means that OLS will be biased (biased toward zero) Potential solution to endogeneity instrumental

Bill Burton Albert Einstein College of Medicine william.burton@einstein.yu.edu April 28, 2014 EERS: Managing the Tension Between Rigor and Resources 1

Bill Burton Albert Einstein College of Medicine william.burton@einstein.yu.edu April 28, 2014 EERS: Managing the Tension Between Rigor and Resources 1 Calculate counts, means, and standard deviations Produce

Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics.

Business Course Text Bowerman, Bruce L., Richard T. O'Connell, J. B. Orris, and Dawn C. Porter. Essentials of Business, 2nd edition, McGraw-Hill/Irwin, 2008, ISBN: 978-0-07-331988-9. Required Computing

Corporate Defaults and Large Macroeconomic Shocks

Corporate Defaults and Large Macroeconomic Shocks Mathias Drehmann Bank of England Andrew Patton London School of Economics and Bank of England Steffen Sorensen Bank of England The presentation expresses

SPSS TUTORIAL & EXERCISE BOOK

UNIVERSITY OF MISKOLC Faculty of Economics Institute of Business Information and Methods Department of Business Statistics and Economic Forecasting PETRA PETROVICS SPSS TUTORIAL & EXERCISE BOOK FOR BUSINESS

Minimum LM Unit Root Test with One Structural Break. Junsoo Lee Department of Economics University of Alabama

Minimum LM Unit Root Test with One Structural Break Junsoo Lee Department of Economics University of Alabama Mark C. Strazicich Department of Economics Appalachian State University December 16, 2004 Abstract

The Use of Event Studies in Finance and Economics. Fall 2001. Gerald P. Dwyer, Jr.

The Use of Event Studies in Finance and Economics University of Rome at Tor Vergata Fall 2001 Gerald P. Dwyer, Jr. Any views are the author s and not necessarily those of the Federal Reserve Bank of Atlanta

Univariate Regression

Univariate Regression Correlation and Regression The regression line summarizes the linear relationship between 2 variables Correlation coefficient, r, measures strength of relationship: the closer r is

STOCK MARKET VOLATILITY AND REGIME SHIFTS IN RETURNS

STOCK MARKET VOLATILITY AND REGIME SHIFTS IN RETURNS Chia-Shang James Chu Department of Economics, MC 0253 University of Southern California Los Angles, CA 90089 Gary J. Santoni and Tung Liu Department

A model to predict client s phone calls to Iberdrola Call Centre

A model to predict client s phone calls to Iberdrola Call Centre Participants: Cazallas Piqueras, Rosa Gil Franco, Dolores M Gouveia de Miranda, Vinicius Herrera de la Cruz, Jorge Inoñan Valdera, Danny

Introduction to Quantitative Methods

Introduction to Quantitative Methods October 15, 2009 Contents 1 Definition of Key Terms 2 2 Descriptive Statistics 3 2.1 Frequency Tables......................... 4 2.2 Measures of Central Tendencies.................

The Loss in Efficiency from Using Grouped Data to Estimate Coefficients of Group Level Variables. Kathleen M. Lang* Boston College.

The Loss in Efficiency from Using Grouped Data to Estimate Coefficients of Group Level Variables Kathleen M. Lang* Boston College and Peter Gottschalk Boston College Abstract We derive the efficiency loss

Teaching model: C1 a. General background: 50% b. Theory-into-practice/developmental 50% knowledge-building: c. Guided academic activities:

1. COURSE DESCRIPTION Degree: Double Degree: Derecho y Finanzas y Contabilidad (English teaching) Course: STATISTICAL AND ECONOMETRIC METHODS FOR FINANCE (Métodos Estadísticos y Econométricos en Finanzas

Chapter 13 Introduction to Linear Regression and Correlation Analysis

Chapter 3 Student Lecture Notes 3- Chapter 3 Introduction to Linear Regression and Correlation Analsis Fall 2006 Fundamentals of Business Statistics Chapter Goals To understand the methods for displaing

Part 2: Analysis of Relationship Between Two Variables

Part 2: Analysis of Relationship Between Two Variables Linear Regression Linear correlation Significance Tests Multiple regression Linear Regression Y = a X + b Dependent Variable Independent Variable

DOES IT PAY TO HAVE FAT TAILS? EXAMINING KURTOSIS AND THE CROSS-SECTION OF STOCK RETURNS

DOES IT PAY TO HAVE FAT TAILS? EXAMINING KURTOSIS AND THE CROSS-SECTION OF STOCK RETURNS By Benjamin M. Blau 1, Abdullah Masud 2, and Ryan J. Whitby 3 Abstract: Xiong and Idzorek (2011) show that extremely

From the help desk: Swamy s random-coefficients model

The Stata Journal (2003) 3, Number 3, pp. 302 308 From the help desk: Swamy s random-coefficients model Brian P. Poi Stata Corporation Abstract. This article discusses the Swamy (1970) random-coefficients

Moderation. Moderation

Stats - Moderation Moderation A moderator is a variable that specifies conditions under which a given predictor is related to an outcome. The moderator explains when a DV and IV are related. Moderation

Overview of Factor Analysis

Overview of Factor Analysis Jamie DeCoster Department of Psychology University of Alabama 348 Gordon Palmer Hall Box 870348 Tuscaloosa, AL 35487-0348 Phone: (205) 348-4431 Fax: (205) 348-8648 August 1,

ECON 523 Applied Econometrics I /Masters Level American University, Spring 2008. Description of the course

ECON 523 Applied Econometrics I /Masters Level American University, Spring 2008 Instructor: Maria Heracleous Lectures: M 8:10-10:40 p.m. WARD 202 Office: 221 Roper Phone: 202-885-3758 Office Hours: M W

COURSES: 1. Short Course in Econometrics for the Practitioner (P000500) 2. Short Course in Econometric Analysis of Cointegration (P000537)

Get the latest knowledge from leading global experts. Financial Science Economics Economics Short Courses Presented by the Department of Economics, University of Pretoria WITH 2015 DATES www.ce.up.ac.za