SPURIOUS REGRESSIONS IN ECONOMETRICS
|
|
- Ilene Fleming
- 7 years ago
- Views:
Transcription
1 Journal of Econometrics 2 (1974) (6 North-Holland Publishing Company SPURIOUS REGRESSIONS IN ECONOMETRICS C.W.J. GRANGER and P. NEWBOLD University of Nottingham, Nottingham NG7 ZRD, England Received May 1973, revised version received December Introduction It is very common to see reported in applied econometric literature time series regression equations with an apparently high degree of fit, as measured by the coefficient of multiple correlation R2 or the corrected coefficient R2, but with an extremely low value for the Durbin-Watson statistic. We find it very curious that whereas virtually every textbook on econometric methodology contains explicit warnings of the dangers of autocorrelated errors, this phenomenon crops up so frequently in well-respected applied work. Numerous examples could be cited, but doubtless the reader has met sufficient cases to accept our point. It would, for example, be easy to quote published equations for which R2 = and the Durbin-Watson statistic (d) is The most extreme example we have met is an equation for which R2 = 0.99 and d = I-Iowever, we shall suggest that cases with much less extreme values may well be entirely spurious. The recent experience of one of us [see Box and Newbold (1971)] has indicated just how easily one can be led to produce a spurious model if sufficient care is not taken over an appropriate formulation for the autocorrelation structure of the errors from the regression equation. We felt, then, that we should undertake a more detailed enquiry seeking to determine what, if anything, could be inferred from those regression equations having the properties just described. There are, in fact, as is well-known, three major consequences of autocorrelated errors in regression analysis : (i) Estimates of the regression coefficients are inefficient. (ii) Forecasts based on the regression equations are sub-optimal. (iii) The usual significance tests on the coefficients are invalid. The first two points are well documented. For the remainder of this paper, we shall concentrate on the third point, and, in particular, examine the potent& alities for discovering spurious relationships which appear to us to be inherent in a good deal of current econometric methodology. The point of view we intend
2 112 C. W.J. Granger, P. Newbold, Regressions in econometrics to take is that of the statistical time series analyst, rather than the more classic econometric approach. In this way it is hoped that we might be able to illuminate the problem from a new angle, and hence perhaps present new insights. Accordingly, in the following section we summarize some relevant results in time series analysis. In sect. 3 we indicate how nonsense regressions relating economic time series can arise, and illustrate these points in sect. 4 with the results of a simulation study. Finally, in sect. 5, we re-emphasize the importance of error specification and draw a distinction between the philosophy of time series analysis and econometric methodology, which we feel to be of great importance to practitioners of the latter. 2. Some results in time series analysis Let W, denote a time series which is stationary (it could represent deviation from some deterministic trend). Then, the so-called mixed autoregressive moving average process, w,--+,w,_,--...-c#jpw,_, = a,-8~a,_~ scar-,, (1) where a, represents a sequence of uncorrelated deviates, each with the same variance, is commonly employed to model such series. The sequence ut is referred to as white noise. For brevity, eq. (1) can be written as W)K = WW,, (2) where 4(B) and 8(B) are polynomial lag operators with appropriate roots to ensure stationarity of W, and uniqueness of representation. Suppose, now, that one has a given time series X,. Box and Jenkins (1970) urge that, while this series itself may not be stationary, it can often be reduced to stationarity by differencing a sufficient number of times; that is, there exists an integer d such that vdx, = w, (3) is a stationary time series. Combining eqs. (2) and (3), the series X, can be represented by the model, &B)VdX, = &@a,. (4) Eq. (4) is said to represent an autoregressive integrated moving average process of order (p, d, q), denoted as A.R.I.M.A. (p, d, q). As regards economic time series, one typically finds a very high serial correlation between adjacent values, particularly if the sampling interval is small, such as a week or a month. This is because many economic series are rather smooth, with changes being small in magnitude compared to the current level. There is thus a good deal of evidence to suggest that the appropriate value for d in
3 C. W.J. Granger, P. Newbold, Regressions in econometrics 113 eq. (4) is very often one. [See, for example, Granger (1966), Reid (1969) and Newbold and Granger (1974).] Alternatively, if d = 0 in eq. (4) we would expect 4(B) to have a root (1- ~/JB) with 4 very close to unity. The implications of this statement are extremely important, as will be seen in the following section. The simplest example of the kind of series we have in mind is the random walk, VX, = a,. This model has been found to represent well certain price series, particularly in speculative markets. For many other series, the integrated moving average process, VX, = a,-8a,_l, has been found to provide good representation. A consequence of this behaviour of economic time series is that a naive no change model will often provide adequate, though by no means optimal, forecasts. Such models are often employed as bench-marks against which the forecast performance of econometric models can be judged. [For a criticism of this approach to evaluation, see Granger and Newbold (1973).] 3. How nonsense regressions can arise Let us consider the usual linear regression model with stochastic regressors : Y = x/3+8, (5) where Y is a T x 1 vector of observations on a dependent variable, /I is a Kx 1 vector of coefficients whose first member &, represents a constant term and X is a T x K matrix containing a column of ones and T observations on each of (K- 1) independent variables which are stochastic, but distributed independently of the T x 1 vector of errors a. It is generally assumed that and E(8) = 0, (6) E(se ) = a21. (7) A test of the null hypothesis that the independent variables contribute nothing towards explaining variation in the dependent variable can be framed in terms of the coefficient of multiple correlation R2. The null hypothesis is and the test statistic HI): j?i = p2 = *.. = &-I = 0, (8) F T-K R2 =K_11- (9)
4 114 C. W.J. Granger, P. Newbold, Regressions in econometrics is compared with tabulated values of Fisher s F distribution with (K- 1) and (T-K) degrees of freedom, normality being assumed. Of course, it is entirely possible that, whatever the properties of the individual time series, there does exist some /I so that e = Y-xp satisfies the conditions (6) and (7). However, to the extent that the Y, s do not constitute a white noise process, the null hypothesis (8) cannot be true, and tests of it are inappropriate. Next, let us suppose that the null hypothesis is correct and one attempts to fit a regression of the form (5) to the leuels of economic time series. Suppose, further, that, as we have argued in the previous section is often the case, these series are non-stationary or, at best, highly autocorrelated. In such a situation the test procedure just described breaks down, since the quantity F in eq. (9) will not follow Fisher s F distribution under the null hypothesis (8). This follows since under that hypothesis the residuals from eq. (5), s, = Y,-P,; t = 1,2,..., T, will have the same autocorrelation properties as the Y, series. Some idea of the distributional problems involved can be obtained from consideration of the case : where it is assumed that Y, and X, follow the independent regressive processes, first order auto- Y, = 4Y,-,+-a,, x, = 4*x,_,+cc,. (10) In this case, R2 is simply the square of the ordinary sample correlation between Y, and X,. Kendall (1954) gives : var (R) = T- (1 +~#$*)/(l Since R is constrained to lie in the region (- 1, l), if its variance is greater than $ then its distribution cannot have a single mode at zero. The necessary condition is 44* > (T- 3)/(T+ 3). Thus, for example, if T = 20 and $J = 4*, a distribution which is not unimodal at the origin will arise if C$ > 0.86, and if C$ = 0.9, E(R2) = Thus a high value of R2 should not, on the grounds of traditional tests, be regarded as evidence of a significant relationship between autocorrelated series. Also a low value of L! strongly suggests that there does not exist a B such that E in eq. (5) satisfies eq. (7). Thus, the phenomenon we have described might well arise from an attempt to fit regression equations relating the levels of independent time series. TO examine this possibility, we conducted a number of simulation experiments which are reported in the following section.
5 C. W.J. Granger, P. Newbold, Regressions in econometrics Some simulation results As a preliminary, we looked at the regression Y, = PO+&X*9 where Y, and X, were, in fact, generated as independent random walks each of length 50. Table 1 shows values of the customary statistic for testing the significance of fil, for 100 simulations. Table 1 Regressing two independent random walks. s: o-1 l Frequency : a 8 5 s: Frequency : Using the traditional t test at the 5 % level, the null hypothesis of no relationship between the two series would be-rejected (wrongly) on approximately threequarters of all occasions. If fll/s.e.(i$) were distributed as N(0, l), then the expected value of S would be 1/2/n N 0.8. In fact, the observed average value of S was 4.5, suggesting that the standard deviation of & is being underestimated by the multiple factor 5.6. Thus, instead of using a t-value of approximately 2.0, one should use a value of 11.2, when attributing a coefficient value to be significant at the 5 % level. To put these results in context, they may be compared with results reported by Malinvaud (1966). Suppose that X, follows the process (10) and the error series obeys the model Et = &-l+a,, so that, under the null hypothesis, Yt will also follow this process, where a, and ~1, are independent white noise series. In the case C$ = 4* = 0.8, it is shown that the estimated variance of 8, should be multiplied by a factor 5.8, when the length of the series is T = 50. The approximations on which this result is based break down as both 4 and c$* tend to unity, but our simulation indicates that the estimated variance of fll should be multiplied by (5.6) N when T = 50 and random walks are involved. Our second simulation was more comprehensive. A series Yt was regressed on m independent series Xj,t; j = 1,2,..., m, with m taking values from one to
6 116 C. W.J. Granger, P. Newbold, Regressions in econometrics five. Each of the series involved obey the same model, the models being (i) random walks, (ii) white noises, (iii) A.R.I.M.A. (0, 1, I), (iv) changes in A.R.I.M.A. (0, 1, I), i.e., first order moving average. Table 2 Regressions of a series on m independent explanatory series. Series either all random walks or all A.R.I.M.A. (0, 1, 1) series, or changes in these. Y, = 100, Y, = Y,_l+a,, Y, = Y,+kb,; X,., = 100, XL,= X~.,_l+aj.~ Xj., =XJ,t+kbJ,r;a,,r,a,,b,,bJ,r sets of independent N(0, 1) white noises. k = 0 gives random walks, k = 1 gives A.R.I.M.A. (0, 1, 1) series. Ho = no relationship, is true. Series length = 50, number of simulations = 100, R = corrected RZ. Per cent times Average Average Per cent Ho rejected Durbin-Watson d R2 Rf > 0.1 Levels m=l 76 m=2 78 m=3 93 m=4 95 m=5 96 Random walks Changes m = m= m= m= m= Levels m=l 64 m=2 81 m=3 82 m=4 90 m=5 90 A.R.I.M.A. (0, 1, I) Changes m = 1 m=2 m=3 m=4 m= Test at 5% level, using an overall test on RZ. All error terms were distributed as N(0, 1) and the A.R.I.M.A. (0, 1, 1) series was derived as the sum of a random walk and independent white noise. The results of the simulations, with 100 replications and series of length 50 are shown in table 2. It is seen that the probability of accepting H,, the hypothesis of no relationship, becomes very small indeed for m > 3 when regressions involve indepen-
7 C. W.J. Granger, P. Newbold, Regressions in econometrics 117 dent random walks. The average R2 steadily rises with m, as does the average d, in this case. Similar conclusions hold for the A.R.I.M.A. (0, 1, 1) process. When white noise series, i.e., changes in random walks, are related, classical regression yields satisfactory results, since the error series will be white noise and least squares fully efficient. However, in the case where changes in the A.R.I.M.A. (0, 1, 1) series are considered-that is, first order moving average processes-the null hypothesis is rejected, on average twice as often as it should be. It is quite clear from these simulations that if one s variables are random walks, or near random walks, and one includes in regression equations variables which should in fact not be included, then it will be the rule rather than the exception to find spurious relationships. It is also clear that a high value for R2 or R2, combined with a low value of d, is no indication of a true relationship. 5. Discussion and conclusion It has been well known for some time now that if one performs a regression and finds the residual series is strongly autocorrelated, then there are serious problems in interpreting the coefficients of the equation. Despite this, many papers still appear with equations having such symptoms and these equations are presented as though they have some worth. It is possible that earlier warnings have been stated insufficiently strongly. From our own studies we would conclude that if a regression equation relating economic variables is found to have strongly autocorrelated residuals, equivalent to a low Durbin-Watson value, the onzy conclusion that can be reached is that the equation is mis-specified, whatever the value of R2 observed. If such a conclusion is accepted, the question then arises of what to do about the mis-specification. The form of the mis-specification can be considered to be either (i) the omission of relevant variables or (ii) the inclusion of irrelevant variables or (iii) autocorrelated residuals. In general, the mis-specification is best considered to be a combination of these possibilities. The usual recommendations are to either include a lagged dependent variable or take first differences of the variables involved in the equation or to assume a simple first-order autoregressive form for the residual of the equation. Although any of these methods will undoubtedly alleviate the problem in general, it is doubtful if they will completely remove it. It is not our intention in this paper to go deeply into the problem of how one should estimate equations in econometrics, but rather to point out the difficulties involved. In our opinion the econometrician can no longer ignore the time series properties of the variables with which he is concerned - except at his peril. The fact that many economic levels are near random walks or integrated processes means that considerable care has to be taken in specifying one s equations. One
8 118 C. W.J. G-anger, P. Newbold, Regressions in econometrics method we are currently considering is to build single series models for each variable, using the methods of Box and Jenkins (1970) for example, and then searching for relationships between series by relating the residuals from these single models. The rationale for such an approach is as follows. In building a forecasting model, the time series analyst regards the series to be forecast as containing two components. The first is that part of the series which can be explained in terms of its own past behaviour and the second is the residual part [at in eq. (4)] which cannot. Thus, in order to explain this residual element one must look for other sources of information-related time series, or perhaps considerations of a non-quantitative nature. Hence, in building regression equations, the quantity to be explained is variation in LI, - not variation in the original series. This study is, however, still in its formative stages. Until a really satisfactory procedure is available, we recommend taking first differences of all variables that appear to be highly autocorrelated. Once more, this may not completely remove the problem but should considerably improve the interpretability of the coefficients. Perhaps at this point we should make it clear that we are not advocating first differencing as a universal sure-fire solution to any problem encountered in applied econometric work. One cannot propose universal rules about how to analyse a group of time series as it is virtually always possible to find examples that could occur for which the rule would not apply. However, one can suggest a rule that is useful for a class of series that very frequently occur in practice. As has been noted, very many economic series are rather smooth, in that the first serial correlation coefficient is very near unity and the other low-order serial correlations are also positive and large. Thus, if one has a small sample, of say twenty terms, the addition of a further term adds very little to the information available, as this term is so highly correlated with its predecessor. It follows that the total information available is very limited and the estimates of parameters associated with this data will have high variance values. However, a simple calculation shows that the first differences of such a series will necessarily have serial correlations that are small in magnitude, so that a new term of the differenced series adds information that is almost uncorrelated to that already available and this means that estimates are more efficient. One is much less likely to be misled by efficient estimates. The suggested rule perhaps should be to build one s models both with levels and also with changes, and then interpret the combined results so obtained. As an example (admittedly extreme) of the changes that can occur in one s results from differencing, Sheppard (1971) regressed U.K. consumption on autonomous expenditure and mid-year money stock, both for levels and changes for the time period The regression on levels yielded a corrected R2 of 0.99 and a d of 0.59, whilst for changes these quantities were and 2.21 respectively. This provides an indication of just how one can be misled by regressions involving levels if the message of the d statistic is unheeded.
9 C. P. Newbold, Regressions in econometrics 119 It has been suggested by a referee that our results have relevance to the structural model - unrestricted reduced form controversy, the feeling being that the structural model is less vulnerable to the problems we have described since its equations are in the main based on well developed economic theory and contain relatively few variables on the right-hand side. There is some force to this argument, in theory at least, although we believe that in practice things are much less clear cut. When considering this problem the question immediately arises of what is meant by a good theory. To the time series analyst a good theory is one that provides a structure to a model such that the errors or residuals of the fitted equations are white noises that cannot be explained or forecast from other economic variables. On the other hand, some econometricians seem to view a good theory as one that appears inherently correct and thus does not need testing. We would suggest that in fact most economic theories are insufficient in these respects as even if the variables to be included in a model are well specified, the theory generally is imprecise about the lag structure to be used and typically says nothing about the time-series properties of the residuals. There are also data problems in that the true lags need not necessarily be integer multiples of the sampling interval of the available data and there will almost certainly be added measurement errors to the true values of the variables being considered. All of these considerations suggest that a simple-minded application of regression techniques to levels could produce unacceptable results. If one does obtain a very high R2 value from a fitted equation, one is forced to rely on the correctness of the underlying theory, as testing the significance of adding further variables becomes impossible. It is one of the strengths of using changes, or some similar transformations, that typically lower R2 values result and so more experimentation and testing can be contemplated. In any case, if a good theory holds for levels, but is unspecific about the time-series properties of the residuals, then an equivalent theory holds for changes so that nothing is lost by model building with both levels and changes. However, much could be gained from this strategy as it may prevent the presentation in econometric literature of possible spurious regressions, which we feel is still prevalent despite the warnings given in the text books about this possibility. References Box, G.E.P. and G.M. Jenkins, 1970, Time series analysis, forecasting and control (Holden- Day, San Francisco). Box, G.E.P. and P. Newbold, 1971, Some comments on a paper of Coen, Gomme and Kendall, J. R. Statist. Sot. A 134, Granger, C.W.J., 1966, The typical spectral shape of an economic variable, Econometrica 34, Granger, C.W.J. and P. Newbold, 1973, Some comments on the evaluation of economic forecasts, Applied Economics 5,
10 120 C. W.J. Granger, P. Newbold, Regressions in econometrics Kendall, M.G., 1954, Exercises in theoretical statistics (Griffin, London). Malinvaud, E., 1966, Statistical methods of econometrics (North Holland, Amsterdam). Newbold, P. and C.W.J. Granger, 1974, Experience with forecasting univariate time series and the combination of forecasts, J. R. Statist. Sot. A 137, forthcoming. Reid, D.J., 1969, A comparative study of time series prediction techniques on economic data, Ph.D. Thesis (University of Nottingham, U.K.). Sheppard, D.K., 1971, The growth and role of U.K. financial institutions (Methuen. London).
MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS
MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS MSR = Mean Regression Sum of Squares MSE = Mean Squared Error RSS = Regression Sum of Squares SSE = Sum of Squared Errors/Residuals α = Level of Significance
More informationChapter 4: Vector Autoregressive Models
Chapter 4: Vector Autoregressive Models 1 Contents: Lehrstuhl für Department Empirische of Wirtschaftsforschung Empirical Research and und Econometrics Ökonometrie IV.1 Vector Autoregressive Models (VAR)...
More informationThe VAR models discussed so fare are appropriate for modeling I(0) data, like asset returns or growth rates of macroeconomic time series.
Cointegration The VAR models discussed so fare are appropriate for modeling I(0) data, like asset returns or growth rates of macroeconomic time series. Economic theory, however, often implies equilibrium
More informationTesting for Granger causality between stock prices and economic growth
MPRA Munich Personal RePEc Archive Testing for Granger causality between stock prices and economic growth Pasquale Foresti 2006 Online at http://mpra.ub.uni-muenchen.de/2962/ MPRA Paper No. 2962, posted
More informationPITFALLS IN TIME SERIES ANALYSIS. Cliff Hurvich Stern School, NYU
PITFALLS IN TIME SERIES ANALYSIS Cliff Hurvich Stern School, NYU The t -Test If x 1,..., x n are independent and identically distributed with mean 0, and n is not too small, then t = x 0 s n has a standard
More informationTime Series and Forecasting
Chapter 22 Page 1 Time Series and Forecasting A time series is a sequence of observations of a random variable. Hence, it is a stochastic process. Examples include the monthly demand for a product, the
More informationOverview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model
Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model 1 September 004 A. Introduction and assumptions The classical normal linear regression model can be written
More informationPerforming Unit Root Tests in EViews. Unit Root Testing
Página 1 de 12 Unit Root Testing The theory behind ARMA estimation is based on stationary time series. A series is said to be (weakly or covariance) stationary if the mean and autocovariances of the series
More informationTime Series Analysis
Time Series Analysis Identifying possible ARIMA models Andrés M. Alonso Carolina García-Martos Universidad Carlos III de Madrid Universidad Politécnica de Madrid June July, 2012 Alonso and García-Martos
More informationTIME SERIES ANALYSIS
TIME SERIES ANALYSIS L.M. BHAR AND V.K.SHARMA Indian Agricultural Statistics Research Institute Library Avenue, New Delhi-0 02 lmb@iasri.res.in. Introduction Time series (TS) data refers to observations
More informationWooldridge, Introductory Econometrics, 3d ed. Chapter 12: Serial correlation and heteroskedasticity in time series regressions
Wooldridge, Introductory Econometrics, 3d ed. Chapter 12: Serial correlation and heteroskedasticity in time series regressions What will happen if we violate the assumption that the errors are not serially
More informationForecasting areas and production of rice in India using ARIMA model
International Journal of Farm Sciences 4(1) :99-106, 2014 Forecasting areas and production of rice in India using ARIMA model K PRABAKARAN and C SIVAPRAGASAM* Agricultural College and Research Institute,
More information16 : Demand Forecasting
16 : Demand Forecasting 1 Session Outline Demand Forecasting Subjective methods can be used only when past data is not available. When past data is available, it is advisable that firms should use statistical
More informationI. Introduction. II. Background. KEY WORDS: Time series forecasting, Structural Models, CPS
Predicting the National Unemployment Rate that the "Old" CPS Would Have Produced Richard Tiller and Michael Welch, Bureau of Labor Statistics Richard Tiller, Bureau of Labor Statistics, Room 4985, 2 Mass.
More informationIs the Forward Exchange Rate a Useful Indicator of the Future Exchange Rate?
Is the Forward Exchange Rate a Useful Indicator of the Future Exchange Rate? Emily Polito, Trinity College In the past two decades, there have been many empirical studies both in support of and opposing
More informationForecasting of Paddy Production in Sri Lanka: A Time Series Analysis using ARIMA Model
Tropical Agricultural Research Vol. 24 (): 2-3 (22) Forecasting of Paddy Production in Sri Lanka: A Time Series Analysis using ARIMA Model V. Sivapathasundaram * and C. Bogahawatte Postgraduate Institute
More informationADVANCED FORECASTING MODELS USING SAS SOFTWARE
ADVANCED FORECASTING MODELS USING SAS SOFTWARE Girish Kumar Jha IARI, Pusa, New Delhi 110 012 gjha_eco@iari.res.in 1. Transfer Function Model Univariate ARIMA models are useful for analysis and forecasting
More informationTrend and Seasonal Components
Chapter 2 Trend and Seasonal Components If the plot of a TS reveals an increase of the seasonal and noise fluctuations with the level of the process then some transformation may be necessary before doing
More informationUnivariate and Multivariate Methods PEARSON. Addison Wesley
Time Series Analysis Univariate and Multivariate Methods SECOND EDITION William W. S. Wei Department of Statistics The Fox School of Business and Management Temple University PEARSON Addison Wesley Boston
More informationUnit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression
Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression Objectives: To perform a hypothesis test concerning the slope of a least squares line To recognize that testing for a
More informationChapter 6: Multivariate Cointegration Analysis
Chapter 6: Multivariate Cointegration Analysis 1 Contents: Lehrstuhl für Department Empirische of Wirtschaftsforschung Empirical Research and und Econometrics Ökonometrie VI. Multivariate Cointegration
More informationTIME SERIES ANALYSIS
TIME SERIES ANALYSIS Ramasubramanian V. I.A.S.R.I., Library Avenue, New Delhi- 110 012 ram_stat@yahoo.co.in 1. Introduction A Time Series (TS) is a sequence of observations ordered in time. Mostly these
More informationExamining the Relationship between ETFS and Their Underlying Assets in Indian Capital Market
2012 2nd International Conference on Computer and Software Modeling (ICCSM 2012) IPCSIT vol. 54 (2012) (2012) IACSIT Press, Singapore DOI: 10.7763/IPCSIT.2012.V54.20 Examining the Relationship between
More informationChapter 1. Vector autoregressions. 1.1 VARs and the identi cation problem
Chapter Vector autoregressions We begin by taking a look at the data of macroeconomics. A way to summarize the dynamics of macroeconomic data is to make use of vector autoregressions. VAR models have become
More information99.37, 99.38, 99.38, 99.39, 99.39, 99.39, 99.39, 99.40, 99.41, 99.42 cm
Error Analysis and the Gaussian Distribution In experimental science theory lives or dies based on the results of experimental evidence and thus the analysis of this evidence is a critical part of the
More informationLuciano Rispoli Department of Economics, Mathematics and Statistics Birkbeck College (University of London)
Luciano Rispoli Department of Economics, Mathematics and Statistics Birkbeck College (University of London) 1 Forecasting: definition Forecasting is the process of making statements about events whose
More informationTime Series Analysis
JUNE 2012 Time Series Analysis CONTENT A time series is a chronological sequence of observations on a particular variable. Usually the observations are taken at regular intervals (days, months, years),
More informationINDIRECT INFERENCE (prepared for: The New Palgrave Dictionary of Economics, Second Edition)
INDIRECT INFERENCE (prepared for: The New Palgrave Dictionary of Economics, Second Edition) Abstract Indirect inference is a simulation-based method for estimating the parameters of economic models. Its
More informationUK GDP is the best predictor of UK GDP, literally.
UK GDP IS THE BEST PREDICTOR OF UK GDP, LITERALLY ERIK BRITTON AND DANNY GABAY 6 NOVEMBER 2009 UK GDP is the best predictor of UK GDP, literally. The ONS s preliminary estimate of UK GDP for the third
More informationAdvanced Forecasting Techniques and Models: ARIMA
Advanced Forecasting Techniques and Models: ARIMA Short Examples Series using Risk Simulator For more information please visit: www.realoptionsvaluation.com or contact us at: admin@realoptionsvaluation.com
More informationSimple Regression Theory II 2010 Samuel L. Baker
SIMPLE REGRESSION THEORY II 1 Simple Regression Theory II 2010 Samuel L. Baker Assessing how good the regression equation is likely to be Assignment 1A gets into drawing inferences about how close the
More informationTime Series Analysis: Basic Forecasting.
Time Series Analysis: Basic Forecasting. As published in Benchmarks RSS Matters, April 2015 http://web3.unt.edu/benchmarks/issues/2015/04/rss-matters Jon Starkweather, PhD 1 Jon Starkweather, PhD jonathan.starkweather@unt.edu
More informationForecasting Tourism Demand: Methods and Strategies. By D. C. Frechtling Oxford, UK: Butterworth Heinemann 2001
Forecasting Tourism Demand: Methods and Strategies By D. C. Frechtling Oxford, UK: Butterworth Heinemann 2001 Table of Contents List of Tables List of Figures Preface Acknowledgments i 1 Introduction 1
More information3. Regression & Exponential Smoothing
3. Regression & Exponential Smoothing 3.1 Forecasting a Single Time Series Two main approaches are traditionally used to model a single time series z 1, z 2,..., z n 1. Models the observation z t as a
More informationChapter 9: Univariate Time Series Analysis
Chapter 9: Univariate Time Series Analysis In the last chapter we discussed models with only lags of explanatory variables. These can be misleading if: 1. The dependent variable Y t depends on lags of
More informationWhat s New in Econometrics? Lecture 8 Cluster and Stratified Sampling
What s New in Econometrics? Lecture 8 Cluster and Stratified Sampling Jeff Wooldridge NBER Summer Institute, 2007 1. The Linear Model with Cluster Effects 2. Estimation with a Small Number of Groups and
More informationChapter Seven. Multiple regression An introduction to multiple regression Performing a multiple regression on SPSS
Chapter Seven Multiple regression An introduction to multiple regression Performing a multiple regression on SPSS Section : An introduction to multiple regression WHAT IS MULTIPLE REGRESSION? Multiple
More informationHow To Model A Series With Sas
Chapter 7 Chapter Table of Contents OVERVIEW...193 GETTING STARTED...194 TheThreeStagesofARIMAModeling...194 IdentificationStage...194 Estimation and Diagnostic Checking Stage...... 200 Forecasting Stage...205
More informationIntroduction to Regression and Data Analysis
Statlab Workshop Introduction to Regression and Data Analysis with Dan Campbell and Sherlock Campbell October 28, 2008 I. The basics A. Types of variables Your variables may take several forms, and it
More informationForecasting the US Dollar / Euro Exchange rate Using ARMA Models
Forecasting the US Dollar / Euro Exchange rate Using ARMA Models LIUWEI (9906360) - 1 - ABSTRACT...3 1. INTRODUCTION...4 2. DATA ANALYSIS...5 2.1 Stationary estimation...5 2.2 Dickey-Fuller Test...6 3.
More informationTime Series Analysis and Forecasting
Time Series Analysis and Forecasting Math 667 Al Nosedal Department of Mathematics Indiana University of Pennsylvania Time Series Analysis and Forecasting p. 1/11 Introduction Many decision-making applications
More informationCOURSES: 1. Short Course in Econometrics for the Practitioner (P000500) 2. Short Course in Econometric Analysis of Cointegration (P000537)
Get the latest knowledge from leading global experts. Financial Science Economics Economics Short Courses Presented by the Department of Economics, University of Pretoria WITH 2015 DATES www.ce.up.ac.za
More informationTesting The Quantity Theory of Money in Greece: A Note
ERC Working Paper in Economic 03/10 November 2003 Testing The Quantity Theory of Money in Greece: A Note Erdal Özmen Department of Economics Middle East Technical University Ankara 06531, Turkey ozmen@metu.edu.tr
More informationState Space Time Series Analysis
State Space Time Series Analysis p. 1 State Space Time Series Analysis Siem Jan Koopman http://staff.feweb.vu.nl/koopman Department of Econometrics VU University Amsterdam Tinbergen Institute 2011 State
More informationNote 2 to Computer class: Standard mis-specification tests
Note 2 to Computer class: Standard mis-specification tests Ragnar Nymoen September 2, 2013 1 Why mis-specification testing of econometric models? As econometricians we must relate to the fact that the
More informationPart 2: Analysis of Relationship Between Two Variables
Part 2: Analysis of Relationship Between Two Variables Linear Regression Linear correlation Significance Tests Multiple regression Linear Regression Y = a X + b Dependent Variable Independent Variable
More informationPremaster Statistics Tutorial 4 Full solutions
Premaster Statistics Tutorial 4 Full solutions Regression analysis Q1 (based on Doane & Seward, 4/E, 12.7) a. Interpret the slope of the fitted regression = 125,000 + 150. b. What is the prediction for
More informationTime Series Analysis
Time Series Analysis Autoregressive, MA and ARMA processes Andrés M. Alonso Carolina García-Martos Universidad Carlos III de Madrid Universidad Politécnica de Madrid June July, 212 Alonso and García-Martos
More informationAPPLICATION OF THE VARMA MODEL FOR SALES FORECAST: CASE OF URMIA GRAY CEMENT FACTORY
APPLICATION OF THE VARMA MODEL FOR SALES FORECAST: CASE OF URMIA GRAY CEMENT FACTORY DOI: 10.2478/tjeb-2014-0005 Ramin Bashir KHODAPARASTI 1 Samad MOSLEHI 2 To forecast sales as reliably as possible is
More informationGranger Causality between Government Revenues and Expenditures in Korea
Volume 23, Number 1, June 1998 Granger Causality between Government Revenues and Expenditures in Korea Wan Kyu Park ** 2 This paper investigates the Granger causal relationship between government revenues
More informationSample Size and Power in Clinical Trials
Sample Size and Power in Clinical Trials Version 1.0 May 011 1. Power of a Test. Factors affecting Power 3. Required Sample Size RELATED ISSUES 1. Effect Size. Test Statistics 3. Variation 4. Significance
More informationProbability and Random Variables. Generation of random variables (r.v.)
Probability and Random Variables Method for generating random variables with a specified probability distribution function. Gaussian And Markov Processes Characterization of Stationary Random Process Linearly
More informationTURUN YLIOPISTO UNIVERSITY OF TURKU TALOUSTIEDE DEPARTMENT OF ECONOMICS RESEARCH REPORTS. A nonlinear moving average test as a robust test for ARCH
TURUN YLIOPISTO UNIVERSITY OF TURKU TALOUSTIEDE DEPARTMENT OF ECONOMICS RESEARCH REPORTS ISSN 0786 656 ISBN 951 9 1450 6 A nonlinear moving average test as a robust test for ARCH Jussi Tolvi No 81 May
More informationDepartment of Economics
Department of Economics Working Paper Do Stock Market Risk Premium Respond to Consumer Confidence? By Abdur Chowdhury Working Paper 2011 06 College of Business Administration Do Stock Market Risk Premium
More informationModule 5: Multiple Regression Analysis
Using Statistical Data Using to Make Statistical Decisions: Data Multiple to Make Regression Decisions Analysis Page 1 Module 5: Multiple Regression Analysis Tom Ilvento, University of Delaware, College
More informationBig Data, Socio- Psychological Theory, Algorithmic Text Analysis, and Predicting the Michigan Consumer Sentiment Index
Big Data, Socio- Psychological Theory, Algorithmic Text Analysis, and Predicting the Michigan Consumer Sentiment Index Rickard Nyman *, Paul Ormerod Centre for the Study of Decision Making Under Uncertainty,
More informationThreshold Autoregressive Models in Finance: A Comparative Approach
University of Wollongong Research Online Applied Statistics Education and Research Collaboration (ASEARC) - Conference Papers Faculty of Informatics 2011 Threshold Autoregressive Models in Finance: A Comparative
More informationFinancial Risk Management Exam Sample Questions/Answers
Financial Risk Management Exam Sample Questions/Answers Prepared by Daniel HERLEMONT 1 2 3 4 5 6 Chapter 3 Fundamentals of Statistics FRM-99, Question 4 Random walk assumes that returns from one time period
More informationA Trading Strategy Based on the Lead-Lag Relationship of Spot and Futures Prices of the S&P 500
A Trading Strategy Based on the Lead-Lag Relationship of Spot and Futures Prices of the S&P 500 FE8827 Quantitative Trading Strategies 2010/11 Mini-Term 5 Nanyang Technological University Submitted By:
More informationSYSTEMS OF REGRESSION EQUATIONS
SYSTEMS OF REGRESSION EQUATIONS 1. MULTIPLE EQUATIONS y nt = x nt n + u nt, n = 1,...,N, t = 1,...,T, x nt is 1 k, and n is k 1. This is a version of the standard regression model where the observations
More information4. Simple regression. QBUS6840 Predictive Analytics. https://www.otexts.org/fpp/4
4. Simple regression QBUS6840 Predictive Analytics https://www.otexts.org/fpp/4 Outline The simple linear model Least squares estimation Forecasting with regression Non-linear functional forms Regression
More information1 Short Introduction to Time Series
ECONOMICS 7344, Spring 202 Bent E. Sørensen January 24, 202 Short Introduction to Time Series A time series is a collection of stochastic variables x,.., x t,.., x T indexed by an integer value t. The
More informationIntroduction to time series analysis
Introduction to time series analysis Margherita Gerolimetto November 3, 2010 1 What is a time series? A time series is a collection of observations ordered following a parameter that for us is time. Examples
More informationBusiness Cycles and Natural Gas Prices
Department of Economics Discussion Paper 2004-19 Business Cycles and Natural Gas Prices Apostolos Serletis Department of Economics University of Calgary Canada and Asghar Shahmoradi Department of Economics
More informationCentre for Central Banking Studies
Centre for Central Banking Studies Technical Handbook No. 4 Applied Bayesian econometrics for central bankers Andrew Blake and Haroon Mumtaz CCBS Technical Handbook No. 4 Applied Bayesian econometrics
More informationIntegrated Resource Plan
Integrated Resource Plan March 19, 2004 PREPARED FOR KAUA I ISLAND UTILITY COOPERATIVE LCG Consulting 4962 El Camino Real, Suite 112 Los Altos, CA 94022 650-962-9670 1 IRP 1 ELECTRIC LOAD FORECASTING 1.1
More informationSecond Order Linear Nonhomogeneous Differential Equations; Method of Undetermined Coefficients. y + p(t) y + q(t) y = g(t), g(t) 0.
Second Order Linear Nonhomogeneous Differential Equations; Method of Undetermined Coefficients We will now turn our attention to nonhomogeneous second order linear equations, equations with the standard
More informationUnivariate Time Series Analysis; ARIMA Models
Econometrics 2 Spring 25 Univariate Time Series Analysis; ARIMA Models Heino Bohn Nielsen of4 Outline of the Lecture () Introduction to univariate time series analysis. (2) Stationarity. (3) Characterizing
More informationChapter 5: Bivariate Cointegration Analysis
Chapter 5: Bivariate Cointegration Analysis 1 Contents: Lehrstuhl für Department Empirische of Wirtschaftsforschung Empirical Research and und Econometrics Ökonometrie V. Bivariate Cointegration Analysis...
More informationSome useful concepts in univariate time series analysis
Some useful concepts in univariate time series analysis Autoregressive moving average models Autocorrelation functions Model Estimation Diagnostic measure Model selection Forecasting Assumptions: 1. Non-seasonal
More informationPPP Hypothesis and Multivariate Fractional Network Marketing
econstor www.econstor.eu Der Open-Access-Publikationsserver der ZBW Leibniz-Informationszentrum Wirtschaft The Open Access Publication Server of the ZBW Leibniz Information Centre for Economics Caporale,
More informationVector Time Series Model Representations and Analysis with XploRe
0-1 Vector Time Series Model Representations and Analysis with plore Julius Mungo CASE - Center for Applied Statistics and Economics Humboldt-Universität zu Berlin mungo@wiwi.hu-berlin.de plore MulTi Motivation
More informationForecasting in STATA: Tools and Tricks
Forecasting in STATA: Tools and Tricks Introduction This manual is intended to be a reference guide for time series forecasting in STATA. It will be updated periodically during the semester, and will be
More informationAnalysis of algorithms of time series analysis for forecasting sales
SAINT-PETERSBURG STATE UNIVERSITY Mathematics & Mechanics Faculty Chair of Analytical Information Systems Garipov Emil Analysis of algorithms of time series analysis for forecasting sales Course Work Scientific
More informationCAPM, Arbitrage, and Linear Factor Models
CAPM, Arbitrage, and Linear Factor Models CAPM, Arbitrage, Linear Factor Models 1/ 41 Introduction We now assume all investors actually choose mean-variance e cient portfolios. By equating these investors
More informationWorking Papers. Cointegration Based Trading Strategy For Soft Commodities Market. Piotr Arendarski Łukasz Postek. No. 2/2012 (68)
Working Papers No. 2/2012 (68) Piotr Arendarski Łukasz Postek Cointegration Based Trading Strategy For Soft Commodities Market Warsaw 2012 Cointegration Based Trading Strategy For Soft Commodities Market
More informationSoftware Review: ITSM 2000 Professional Version 6.0.
Lee, J. & Strazicich, M.C. (2002). Software Review: ITSM 2000 Professional Version 6.0. International Journal of Forecasting, 18(3): 455-459 (June 2002). Published by Elsevier (ISSN: 0169-2070). http://0-
More informationAn introduction to Value-at-Risk Learning Curve September 2003
An introduction to Value-at-Risk Learning Curve September 2003 Value-at-Risk The introduction of Value-at-Risk (VaR) as an accepted methodology for quantifying market risk is part of the evolution of risk
More informationMinimum LM Unit Root Test with One Structural Break. Junsoo Lee Department of Economics University of Alabama
Minimum LM Unit Root Test with One Structural Break Junsoo Lee Department of Economics University of Alabama Mark C. Strazicich Department of Economics Appalachian State University December 16, 2004 Abstract
More informationSales forecasting # 2
Sales forecasting # 2 Arthur Charpentier arthur.charpentier@univ-rennes1.fr 1 Agenda Qualitative and quantitative methods, a very general introduction Series decomposition Short versus long term forecasting
More informationExact Nonparametric Tests for Comparing Means - A Personal Summary
Exact Nonparametric Tests for Comparing Means - A Personal Summary Karl H. Schlag European University Institute 1 December 14, 2006 1 Economics Department, European University Institute. Via della Piazzuola
More informationTesting for Unit Roots: What Should Students Be Taught?
Testing for Unit Roots: What Should Students Be Taught? John Elder and Peter E. Kennedy Abstract: Unit-root testing strategies are unnecessarily complicated because they do not exploit prior knowledge
More informationTime Series Analysis of Aviation Data
Time Series Analysis of Aviation Data Dr. Richard Xie February, 2012 What is a Time Series A time series is a sequence of observations in chorological order, such as Daily closing price of stock MSFT in
More informationMarketing Mix Modelling and Big Data P. M Cain
1) Introduction Marketing Mix Modelling and Big Data P. M Cain Big data is generally defined in terms of the volume and variety of structured and unstructured information. Whereas structured data is stored
More informationA Century of PPP: Supportive Results from non-linear Unit Root Tests
Mohsen Bahmani-Oskooee a, Ali Kutan b and Su Zhou c A Century of PPP: Supportive Results from non-linear Unit Root Tests ABSTRACT Testing for stationarity of the real exchange rates is a common practice
More informationFEGYVERNEKI SÁNDOR, PROBABILITY THEORY AND MATHEmATICAL
FEGYVERNEKI SÁNDOR, PROBABILITY THEORY AND MATHEmATICAL STATIsTICs 4 IV. RANDOm VECTORs 1. JOINTLY DIsTRIBUTED RANDOm VARIABLEs If are two rom variables defined on the same sample space we define the joint
More informationStatistical estimation using confidence intervals
0894PP_ch06 15/3/02 11:02 am Page 135 6 Statistical estimation using confidence intervals In Chapter 2, the concept of the central nature and variability of data and the methods by which these two phenomena
More informationThe Engle-Granger representation theorem
The Engle-Granger representation theorem Reference note to lecture 10 in ECON 5101/9101, Time Series Econometrics Ragnar Nymoen March 29 2011 1 Introduction The Granger-Engle representation theorem is
More informationEmpirical Methods in Applied Economics
Empirical Methods in Applied Economics Jörn-Ste en Pischke LSE October 2005 1 Observational Studies and Regression 1.1 Conditional Randomization Again When we discussed experiments, we discussed already
More informationCURVE FITTING LEAST SQUARES APPROXIMATION
CURVE FITTING LEAST SQUARES APPROXIMATION Data analysis and curve fitting: Imagine that we are studying a physical system involving two quantities: x and y Also suppose that we expect a linear relationship
More informationUsing JMP Version 4 for Time Series Analysis Bill Gjertsen, SAS, Cary, NC
Using JMP Version 4 for Time Series Analysis Bill Gjertsen, SAS, Cary, NC Abstract Three examples of time series will be illustrated. One is the classical airline passenger demand data with definite seasonal
More informationCHI-SQUARE: TESTING FOR GOODNESS OF FIT
CHI-SQUARE: TESTING FOR GOODNESS OF FIT In the previous chapter we discussed procedures for fitting a hypothesized function to a set of experimental data points. Such procedures involve minimizing a quantity
More informationIBM SPSS Forecasting 22
IBM SPSS Forecasting 22 Note Before using this information and the product it supports, read the information in Notices on page 33. Product Information This edition applies to version 22, release 0, modification
More informationChapter 10 Introduction to Time Series Analysis
Chapter 1 Introduction to Time Series Analysis A time series is a collection of observations made sequentially in time. Examples are daily mortality counts, particulate air pollution measurements, and
More informationAdaptive Demand-Forecasting Approach based on Principal Components Time-series an application of data-mining technique to detection of market movement
Adaptive Demand-Forecasting Approach based on Principal Components Time-series an application of data-mining technique to detection of market movement Toshio Sugihara Abstract In this study, an adaptive
More informationBusiness cycles and natural gas prices
Business cycles and natural gas prices Apostolos Serletis and Asghar Shahmoradi Abstract This paper investigates the basic stylised facts of natural gas price movements using data for the period that natural
More informationLecture 4: Seasonal Time Series, Trend Analysis & Component Model Bus 41910, Time Series Analysis, Mr. R. Tsay
Lecture 4: Seasonal Time Series, Trend Analysis & Component Model Bus 41910, Time Series Analysis, Mr. R. Tsay Business cycle plays an important role in economics. In time series analysis, business cycle
More informationChapter 1 Introduction. 1.1 Introduction
Chapter 1 Introduction 1.1 Introduction 1 1.2 What Is a Monte Carlo Study? 2 1.2.1 Simulating the Rolling of Two Dice 2 1.3 Why Is Monte Carlo Simulation Often Necessary? 4 1.4 What Are Some Typical Situations
More informationTime series Forecasting using Holt-Winters Exponential Smoothing
Time series Forecasting using Holt-Winters Exponential Smoothing Prajakta S. Kalekar(04329008) Kanwal Rekhi School of Information Technology Under the guidance of Prof. Bernard December 6, 2004 Abstract
More informationRelationship among crude oil prices, share prices and exchange rates
Relationship among crude oil prices, share prices and exchange rates Do higher share prices and weaker dollar lead to higher crude oil prices? Akira YANAGISAWA Leader Energy Demand, Supply and Forecast
More information