Explaining Cointegration Analysis: Part II

Size: px
Start display at page:

Download "Explaining Cointegration Analysis: Part II"

Transcription

1 Explaining Cointegration Analysis: Part II David F. Hendry and Katarina Juselius Nuffield College, Oxford, OX1 1NF. Department of Economics, University of Copenhagen, Denmark Abstract We describe the concept of cointegration, its implications in modelling and forecasting, and discuss inference procedures appropriate in integrated-cointegrated vector autoregressive processes (VARs). Particular attention is paid to the properties of VARs, to the modelling of deterministic terms, and to the determination of the number of cointegration vectors. The analysis is illustrated by empirical examples. 1. Introduction Hendry and Juselius (2000) investigated the properties of economic time series that were integrated processes, such as random walks, which contained a unit root in their dynamics. Here we extend the analysis to the multivariate context, and focus on cointegration in systems of equations. We showed in Hendry and Juselius (2000) that when data were non-stationary purely due to unit roots (integrated once, denoted I(1)), they could be brought back to stationarity by the linear transformation of differencing, as in x t x t 1 = x t. For example, if the data generation process (DGP) were the simplest random walk with an independent normal (IN) error having mean zero and constant variance σ 2 ɛ : x t = x t 1 + ɛ t where ɛ t IN [ 0,σ 2 ɛ ], (1) then subtracting x t 1 from both sides delivers x t IN [ 0,σ 2 ɛ ] which is certainly stationary. 1 Since x t cannot have a unit root, it must be I(0). Such an analysis generalizes to (say) twice-integrated series which are I(2) so must become I(0) after differencing twice. It is natural to enquire if other linear transformations than differencing will also induce stationarity. The answer is possibly, but unlike differencing, there is no guarantee that the outcome must be I(0): cointegration analysis is designed to find linear combinations of variables that also remove unit roots. In a bivariate context, if y t and x t are both I(1), there may (but need not) be a unique value of β such that y t βx t is I(0): in other words, there is no unit root in the relation linking y t and x t. Consequently, cointegration is a restriction on a dynamic model, and so is testable. Cointegration vectors are of considerable interest when they exist, since they determine I(0) relations that hold between variables which are individually non-stationary. Such relations are often called long-run equilibria, since it can be proved that they act as attractors towards which convergence occurs whenever there are departures therefrom (see e.g., Granger (1986), and Banerjee, Dolado, Galbraith, and Hendry (1993), ch. 2). Since I(1) variables wander (often quite widely) because of their stochastic trends, whereas (weakly) stationary variables have constant means and variances, if there exists a linear combination 1 Notice that differencing is not an operator for equations: one can difference data (to create x t), but attempting to difference equation (1) would lead to x t = x t 1 + ɛ t. Such an equation is not well defined, since can be cancelled on both sides, so is redundant.

2 2 David F. Hendry and Katarina Juselius that delivers an I(0) relation, it might be thought that it would be obvious from graphs of the variables. Unfortunately, that need not be the case, as figure 1 shows. In panel a, three variables (denoted y a, y b, y c ) are plotted that are actually very strongly cointegrated, whereas panel b plots another two (denoted Y a, Y b ) that are neither cointegrated, nor linked in any causal way. Panels c and d respectively show the changes in these variables. It is not obvious from the graphs that the first set are closely linked whereas the second are not connected at all. Nevertheless, the PcGive cointegration test, described in Hendry and Juselius (2000), applied to simple dynamic models relating y a to y b and y c,andy a to Y b, respectively takes the values t ur = 6.25 and t ur = 2.31, so the first strongly rejects a unit root in the relation, whereas the second does not. Thus, cointegration may or may not exist between variables that do or do not look cointegrated, and the only way to find out is through a careful statistical analysis, rather than rely on visual inspection. These two points, namely the importance but non-obvious nature of cointegrated relations, motivates our discussion. 7.5 y a y b y c 6 Y a Y b y a y b y c.2 Y a Y b Figure 1. Cointegrated and non-cointegrated time series The organization of this paper is as follows. Section 2 begins by illustrating the inherently multivariate nature of cointegration analysis: several variables must be involved, and this determines the form of the statistical tools required. Section 3 then discusses the conditions under which a vector autoregressive process (VAR) would provide a feasible empirical model for integrated economic time series, spelling out both its statistical and economic requirements, illustrated by the empirical example used in Hendry and Juselius (2000). In section 4, we consider alternative representations of the VAR that yield different insights into its properties under stationarity, and also set the scene for deriving the necessary and sufficient conditions that deliver an integrated-cointegrated process. The purpose of section 5 is to define cointegration via restrictions on the VAR model, and relate the properties of the vector process to stochastic trends and stationary components based on the moving-average representa-

3 Explaining Cointegration Analysis: Part II 3 tion. Section 6 considers the key role of deterministic terms (like constants and trends) in cointegration analyses. At that stage, the formalization of the model and analysis of its properties are complete, so we turn to issues of estimation (section 7) and inference (section 8), illustrated empirically in section 9. Section 10 considers the identification of the cointegration parameters, and hypothesis tests thereon, and section 11 discusses issues that arise in the analysis of partial systems (conditional on a subset of the variables) and the closely related concept of exogeneity. Finally, we discuss forecasting in cointegrated systems (section 12) and the associated topic of parameter constancy (section 13), also relevant to any policy applications of cointegrated systems. Section 14 concludes. The paper uses matrix algebra extensively to explain the main ideas, so we adopt the following notation: bold-face capital letters for matrices, bold-face lower case for vectors, normal case for variables and coefficients, and Greek letters for parameters. We generally assume all variables are in logs, which transformation produces more homogeneous series for inherently-positive variables (see e.g., Hendry (1995a), ch. 2), but we will not distinguish explicitly between logs and the original units The multivariate nature of cointegration analysis Cointegration analysis is inherently multivariate, as a single time series cannot be cointegrated. Consequently, consider a set of integrated variables, such as gasoline prices at different locations as in Hendry and Juselius (2000), where each individual gasoline price (denoted p i,t )isi(1), but follows a common long-run path, affected by the world price of oil (p o,t ). Cointegration between the gasoline prices could arise, for example, if the price differentials between any two locations were stationary. However, cointegration as such does not say anything about the direction of causality. For example, one of the locations could be a price leader and the others price followers; or, alternatively, none of the locations might be more important than the others. In the first case, the price of the leading location would be driving the prices of the other locations (be exogenous to the other prices) and cointegration could be analyzed from the equations for the other adjusting prices, given the price of the leader. In the second case, all prices would be equilibrium adjusting and, hence, all equations would contain information about the cointegration relationships. In the bivariate analysis in Hendry and Juselius (2000), cointegration was found in a single-equation model of p 1,t given p 2,t, thereby assuming that p 2,t was a price leader. If this assumption was incorrect, then the estimates of the cointegration relation would be inefficient, and could be seriously biased. To find out which variables adjust, and which do not adjust, to the long-run cointegration relations, an analysis of the full system of equations is required, as illustrated in Section 11. Here, we will focus on a vector autoregression (VAR) as a description of the system to be investigated. In a VAR, each variable is explained by its own lagged values, and the lagged values of all other variables in the system. To see which questions can be asked within a cointegrated VAR, we postulate a trivariate VAR model for the two gasoline prices p 1,t and p 2,t, together with the price of crude oil, p o,t. We restrict the analysis to one lagged change for simplicity, and allow for 2 cointegration relations. 2 If a variable had a unit root in its original units of measurement, it would become essentially deterministic over time if it had a constant error variance. Thus, absolute levels must have heteroscedastic errors to make sense; but if so, that is not a sensible place to start modeling. Moreover, if the log had a unit root, then the original must be explosive. Many economic variables seem to have that property, appearing to show quadratic trends in absolute levels.

4 4 David F. Hendry and Katarina Juselius Then the system can be written as: p 1,t p 2,t = φ 11 φ 12 φ 13 φ 21 φ 22 φ 23 p 1,t 1 p 2,t 1 + α 11 α 12 ( ) α 21 α 22 (p1 p 2 ) t 1 (p p o,t φ 31 φ 32 φ 33 p o,t 1 α 31 α 2 p o ) t ψ 1 ψ 2 d 1,t + π 1 π 2 + ɛ 1,t ɛ 2,t, (2) ψ 3 π 3 ɛ 3,t where ɛ t is assumed IN 3 [0, Ω ɛ ],andω ɛ is the (positive-definite, symmetric) covariance matrix of the error process. Within the hypothetical system (2), we could explain the three price changes from period t 1 (previous week) to t (this week) as a result of: (i) an adjustment to previous price changes, with impacts φ ij for the j th lagged change in the i th equation; (ii) an adjustment to previous disequilibria between prices in different locations, (p 1 p 2 ),and between the price in location 2 and the price of crude oil, (p 2 p o ), with impacts α i1 and α i2 respectively in the i th equation; (iii) an extraordinary intervention in the whole market, such as the outbreak of the Kuwait war, described by the intervention dummy d 1,t ; (iv) a constant term π; and (v) random shocks, ɛ t. When all three prices are I(1), whereas (p 1,t p 2,t ) and (p 2,t p o,t ) are I(0), then the latter describe cointegrated relations, i.e., relations that are stationary even when the variables themselves are non-stationary. Cointegration between the prices means that the three prices follow the same longrun trends, which then cancel in the price differentials. This may seem reasonable apriori, but could nevertheless be incorrect empirically: using multivariate cointegration analysis, we can formally test whether such is indeed the case. In general, we write cointegration relations in the form: β 11 p 1,t + β 12 p 2,t + β 13 p o,t ; and β 21 p 1,t + β 22 p 2,t + β 32 p o,t (3) etc., where, in (2), we have normalized β 11 = β 22 =1,andsetβ 13 = β 21 =0. Such restrictions cannot be imposed arbitrarily in empirical research, so we will discuss how to test restrictions on cointegration relations in Section 10. The existence of cointegration by itself does imply which prices equilibrium adjust and which do not; nor does it entail whether any adjustment is fast or slow. Information about such features can be provided by the α ij coefficients. For example, α 31 = α 32 =0, would tell us that there were no feed-back effects onto the price of crude oil from deviant price behavior in the gasoline market. In this case, the price of crude oil would influence gasoline prices, but would not be influenced by them. Next, consider, for example, when α 11 = 0.6 whereas α 21 = 0.1. Then, gasoline prices at location 1 adjust more quickly to restore an imbalance between its own price and the price at location 2 than the other way around. Finally, consider α 22 = 0.4: then the location-2 price would adjust quite quickly to changes in the level of crude oil price. In that case, we would be inclined to say that the price of crude oil influenced the price of location 2 which influenced the price at location 1. This would certainly be the case if the covariance matrix Ω ε was diagonal, so there were no contemporaneous links: if Ω ε was not diagonal, revealing cross-correlated residuals, one would have to be careful about causal

5 Explaining Cointegration Analysis: Part II 5 interpretations. The interpretation of the parameter estimates is generally more straightforward when Ω ε is diagonal, but this is seldom the case; price shocks are often correlated, sometimes indicating an modeled causal link. For example, if a shock to crude-oil prices affects gasoline prices within the same week (say), the correlations between the residuals from the crude-oil equation and the gasoline equations are the result of a current oil-price effect. Another explanation for residual cross-correlation is omitted variables that simultaneously influence several variables in the system. As already discussed in Hendry and Juselius (2000), the constant terms, π i, can both describe an intercept in the cointegration relations and linear trends in the variables, and the empirical analysis can be used to estimate both effects. Finally, the random shocks, ɛ i,t, are assumed to be serially independent, and homoscedastic. All these, and other issues, will be discussed and illustrated with an empirical example below. It should now be evident that a cointegrated VAR provides a rich model: the β ij coefficients characterize long-run relationships between levels of variables; the α ij coefficients describe changes that help restore an equilibrium market position; the φ ij coefficients describe short-term changes resulting from previous changes in the market which need not have permanent effects on the levels; we will comment on the intercepts in Section 6; and the intervention effects, ψ i, describe extraordinary events in the market, like the Kuwait war. We might, for example, ask whether such an event affected the gasoline prices differently at different locations (i.e., whether ψ 1 ψ 2 ), and hence, permanently changed the mean of the price differential (p 1 p 2 ): this will be investigated empirically in Section 9.1. First, we must analyze the VAR model more generally. 3. The statistical adequacy of a VAR model To understand when a VAR is an adequate description of reality, it is important to know the limitations as well as the possibilities of that model. The purpose of this section is, therefore, to demonstrate that a VAR model can be a convenient way of summarizing the information given by the autocovariances of the data under certain assumptions about the DGP: see Hendry (1995a) for details. However, the required assumptions may not hold in any given instance, so the first step in any empirical analysis of a VAR is to test if these assumptions are indeed appropriate. In section 3.1, we first assume that there are p 2 variables x i,t under analysis, and that the p- dimensional process x t is stationary, so does not contain any unit roots. We then derive the VAR model under this simplifying assumption. In section 3.2, we discuss an interpretation of the VAR in terms of rational economic behavior, and finally, in section 3.4, we extend the discussion to consider stability and unit-root properties. Notice that unit roots are a restriction of the initial VAR model, so can be tested, but it transpires that the tests are not standard t, F or χ Stochastic properties A stationary VAR arises naturally as a model of a data set (x 1,...,x T ) viewed as a sequence of T realizations from the p-dimensional process {x t }, given the two general simplifying assumptions of multivariate normality and time-invariant covariances. Derivations of a VAR from a general DGP are described in (e.g.) Hendry (1995a, ch. 9). The resulting VAR could have many lagged variables, but for simplicity of notation, we restrict attention here to the case with 2 lags denoted VAR(2), which

6 6 David F. Hendry and Katarina Juselius suffices to illustrate all the main properties and problems (the results generalize easily, but lead to more cumbersome notation). We write the simplest VAR(2) as: x t = π + Π 1 x t 1 + Π 2 x t 2 + ɛ t (4) where ɛ t IN p [0, Ω ɛ ], (5) t =1,...,T and the parameters (π, Π 1, Π 2, Ω ɛ ) are constant and unrestricted, except for Ω ɛ being positive-definite and symmetric. Given (4), the conditional mean of x t is: say, and the deviation of x t from m t defines ɛ t : E [x t x t 1, x t 2 ]=π + Π 1 x t 1 + Π 2 x t 2 = m t, x t m t = ɛ t. Hence, if the assumptions of multivariate normality, time-constant covariances, and truncation at lag 2 are correct, then (4): is linear in the parameters; has constant parameters; has normally distributed errors ɛ t, with: (approximate) independence between ɛ t and ɛ t h for lags h =1, 2,.... These conditions provide the model builder with testable hypotheses on the assumptions needed to justify the VAR. In economic applications, the multivariate normality assumption is seldom satisfied. This is potentially a serious problem, since derivations of the VAR from a general DGP rely heavily on multivariate normality, and statistical inference is only valid to the extent that the assumptions of the underlying model are correct. An important question is, therefore, how we should modify the standard VAR model in practice. We would like to preserve its attractiveness as a reasonably tractable description of the basic characteristics of the data, while at the same time, achieving valid inference. Simulation studies have demonstrated that statistical inference is sensitive to the validity of some of the assumptions, such as, parameter non-constancy, serially-correlated residuals and residual skewness, while moderately robust to others, such as excess kurtosis (fat-tailed distributions) and residual heteroscedasticity. Thus, it seems advisable to ensure the first three are valid. Both direct and indirect testing of the assumptions can enhance the success of the empirical application. It is often useful to calculate descriptive statistics combined with a graphical inspection of the residuals as a first check of the adequacy of the VAR model, then undertake formal mis-specification tests of each key assumption (see Doornik and Hendry (1999, ch. 10): all later references to specific tests are explained there). Once we understand why a model fails to satisfy the assumptions, we can often modify it to end with a well-behaved model. Precisely how depends on the application, as will be illustrated in section 3.3 for the gasoline price series discussed in Hendry and Juselius (2000) Economic interpretation and estimation As discussed in Hendry (1995a), the conditional mean m t can be given an economic interpretation as the agents plans at time t 1 given the past information of the process, x t 1, x t 2, etc., denoted

7 Explaining Cointegration Analysis: Part II 7 X 0 t 1. The IN distributional assumption in (5) implies that agents are rational, in the sense that the deviation between the actual outcome x t and the plan E t 1 [x t X 0 t 1 ] is a white-noise innovation, not explicable by the past of the process. Thus, the VAR model is consistent with economic agents who seek to avoid systematic forecast errors when they plan for time t based on the information available at time t 1. By way of contrast, a VAR with autocorrelated residuals would describe agents that do not use all information in the data as efficiently as possible. This is because they could do better by including the systematic variation left in the residuals, thereby improving the accuracy of their expectations about the future. Checking the assumptions of the model, (i.e., checking the white-noise requirement of the residuals, and so on), is not only crucial for correct statistical inference, but also for the economic interpretation of the model as a description of the behavior of rational agents. To derive a full-information maximum likelihood (FIML) estimator requires an explicit probability formulation of the model. Doing so has the advantage of forcing us to take the statistical assumptions seriously. Assume that we have derived an estimator under the assumption of multivariate normality. We then estimate the model, and find that the residuals are not normally distributed, or that the residual variance is heteroscedastic instead of homoscedastic, or that residuals exhibit significant autocorrelation, etc. The parameter estimates (based on an incorrectly-derived estimator) may not have any meaning, and since we do not know their true properties, inference is likely to be hazardous. Therefore, to claim that conclusions are based on FIML inference is to claim that the empirical model is capable of accounting for all the systematic information in the data in a satisfactory way. Although the derivation of a FIML estimator subject to parameter restrictions can be complicated, this is not so when the parameters (π, Π 1, Π 2, Ω ɛ ) of the VAR model (4) are unrestricted. In that case, the ordinary least squares (OLS) estimator is equivalent to FIML. After the model has been estimated by OLS, the IN distributional assumption can be checked against the data using the residuals ɛ t. As already mentioned, the white-noise assumption is often rejected for a first tentatively-estimated model, and one has to modify the specification of the VAR model accordingly. This can be done, for example, by: investigating parameter constancy (e.g., is there a structural shift in the model parameters?); increasing the information set by adding new variables; increasing the lag length; changing the sample period; adding intervention dummies to account for significant political or institutional events; conditioning on weakly-exogenous variables; checking the adequacy of the measurements of the chosen variables. Any or all of these steps may be needed, but we stress the importance of checking that the initial VAR is congruent with the data evidence before proceeding with empirical analysis A tentatively-estimated VAR As a first step in the analysis, the unrestricted VAR(2) model, with a constant term and without dummy variables, was estimated by OLS for the two gasoline prices at different locations. Table 1 reports some descriptive statistics for the logs of the variables in levels, differences, and for the residuals. As discussed in Hendry (1995a), since the gasoline prices are apparently non-stationary, the empirical density is not normal, but instead bimodal. The price changes on the other hand, seem to be stationary

8 8 David F. Hendry and Katarina Juselius 1987:26 98:29 x s x Skew Ex.K. Jarq.Bera min max p p p p ɛ p ɛ p Table 1. Descriptive statistics around a constant mean. From Table 1, the mean is not significantly different from zero for either price change. Normality is tested with the Jarque Bera test, distributed as χ 2 (2) under the null, so is strongly rejected for both series of price changes and the VAR residuals. Since rejection could be due to either excess kurtosis (the normal has kurtosis of 3), or skewness, we report these statistics separately in Table 1. It appears that excess kurtosis is violated for both equations, but that for p 1,t also seemstobeskewedtotheright. The graphs (in logs) of the two gasoline prices are shown in Figure 2 in levels and differences (with 99% confidence bands), as well as the residuals (similarly with 99% confidence bands). There seem to be some outlier observations (larger than ±3 σ), both in the differenced prices and in the residuals. The largest is at 1990:31, the week of the outbreak of the Kuwait war. This is clearly not a normal observation, but must be adequately accounted for in the specification of the VAR model. The remaining large price changes (> ±3 σ) can (but need not necessarily) be intervention outliers. It is, however, always advisable to check whether any large changes in the data correspond to some extraordinary events: exactly because they are big, they will influence the estimates with a large weight and, hence, potentially bias the estimates if they are indeed outliers. The role of deterministic components, such as intervention dummies, will be discussed in more detail in Section 6. In Figure 3, the autocorrelograms and the empirical densities (with the normal density) are reported for the two VAR residuals. There should be no significant autocorrelation, if the truncation after the second lag is appropriate. Since all the autocorrelation coefficients are very small, this seems to be the case. Furthermore, the empirical density should not deviate too much from the normal density; and the residuals should be homoscedastic, so have similar variances over time. The empirical densities seem to have longer tails (excess kurtosis) than the normal density, and the Kuwait war outlier sticks out, confirming our previous finding of non-normality Stability and unit-root properties Up to this point, we have discussed (and estimated) the VAR model as if it were stationary, i.e., without considering unit roots. 3 The dynamic stability of the process in (4) can be investigated by calculating the roots of: ( Ip Π 1 L Π 2 L 2) x t = Π(L)x t, 3 One can always estimate the unrestricted VAR with OLS, but if there are unit roots in the data, some inferences are no longer standard, as discussed in Hendry and Juselius (2000).

9 Explaining Cointegration Analysis: Part II 9 1 Price 1 Price Diff p Diff p Residual Residual Figure 2. Graphs of gasoline prices 1 and 2 in levels and differences, with residuals from a VAR(2) and 99% confidence bands where L i x t = x t i. Define the characteristic polynomial: Π(z) = ( I p Π 1 z Π 2 z 2). The roots of Π (z) = 0 contain all necessary information about the stability of the process and, therefore, whether it is stationary or non-stationary. In econometrics, it is more usual to discuss stability in terms of the companion matrix of the system, obtained by stacking the variables such that a firstorder system results. Ignoring deterministic terms, we have: ( xt x t 1 ) = ( Π1 Π 2 I p 0 )( xt 1 x t 2 ) + ( ɛt 0 ), (6) where the first block is the original system, and the second merely an identity for x t 1. Now, stability ( depends on the eigenvalues of the coefficient matrix in (6), and these are precisely the roots of ) Π z 1 =0(see e.g., Banerjee, Dolado, Galbraith, and Hendry (1993)). For a p-dimensional VAR with 2 lags, there are 2p eigenvalues. The following results apply: (a) if all the eigenvalues of the companion matrix are inside the unit circle, then {x t } is stationary; (b) if all the eigenvalues are inside or on the unit circle, then {x t } is non-stationary; (c) if any of the eigenvalues are outside the unit circle, then {x t } is explosive. For the bivariate gasoline-price VAR(2) model, we have 2 2=4roots, the moduli of which are: 0.93, 0.72, 0.72, 0.56.

10 10 David F. Hendry and Katarina Juselius Figure 3. The autocorrelogram and the empirical and normal density of the two VAR residuals Figure?? illustrates these in relation to the unit circle:

11 Explaining Cointegration Analysis: Part II 11 The roots of the characteristic polynomial for the VAR(2) model. We note that the system is stable (no explosive roots), that there is one near-unit root, suggesting the presence of a stochastic trend, as well as a pair of complex roots (whether a pair of roots is real or complex can depend on the third or smaller digits of the estimated coefficients, so is not usually a fundamental property of such a figure). Since there is only one root close to unity for the two variables, the series seem non-stationary and possibly cointegrated. When there are unit roots in the model, it is convenient to reformulate the VAR into an equilibriumcorrection model (EqCM). The next section discusses different ways of formulating such models. 4. Different representations of the VAR The purpose of this section is to demonstrate that the unrestricted VAR can be given different parametrizations without imposing any binding restrictions on the parameters of the model (i.e., without changing the value of the likelihood function). At this stage, we do not need to specify the order of integration of x t : as long as the parameters (π, Π 1, Π 2, Ω ɛ ) are unrestricted, OLS can be used to estimate them, as discussed in the previous section. Thus, any of the four parameterizations below, namely (4), (7), (8), or (9) can be used to obtain the first unrestricted estimates of the VAR. Although the parameters differ in the four representations, each of them explains exactly as much of the variation in x t.

12 12 David F. Hendry and Katarina Juselius The first reformulation of (4) is into the following equilibrium-correction form: x t = Φ 1 x t 1 Πx t 1 + π + ɛ t, (7) where ɛ t IN p [0, Ω ɛ ], with the lagged levels matrix Π = I p Π 1 Π 2 and Φ 1 = Π 2. 4 In (7), the lagged levels matrix, Π, has been placed at time t 1, but could be chosen at any feasible lag without changing the likelihood. For example, placing the Π matrix at lag 2 yields the next parameterization: x t = Φ 1 x t 1 Πx t 2 + π + ɛ t (8) where Φ 1 =(Π 1 I p ), with an unchanged Π matrix. 5 In a sense, (7) is more appropriate if one wants to discriminate between the short-run adjustment effects to the long-run relations (in levels), and the effects of changes in the lagged differences (the transitory effects). The estimated coefficients and their p-values can vary considerably between the two formulations (7) and (8), despite their being identical in terms of explanatory power, and likelihood (note that they have identical errors {ɛ t }). Often, many more significant coefficients are obtained with (8) than with (7), illustrating the increased difficulty of interpreting coefficients in dynamic models relative to static regression models: many significant coefficients need not imply high explanatory power, but could result from the parameterization of the model. The final convenient reformulation of the VAR model is into second-order differences (acceleration rates), changes, and levels: 2 x t = Φ x t 1 Πx t 1 + π + ɛ t (9) where Φ = Φ 1 I p = I p Π 2,andΠ remains as before. This formulation is most convenient when x t contains I(2) variables, but from an economic point of view it also provides a natural decomposition of economic data which cover periods of rapid change, when acceleration rates (in addition to growth rates) become relevant, and more stable periods, when acceleration rates are zero, but growth rates still matter. It also clarifies that the ultimate variable to be explained is 2 x t, which is often treated as a surprise, but as (9) demonstrates, can be explained by the determinants of the model. Indeed, it can be seen that treating 2 x t purely as a surprise imposes Φ = 0 and Π = 0, and makes the differences behave as random walks. Although the above reformulations are equivalent in terms of explanatory power, and can be estimated by OLS without considering the order of integration, inferences on some the parameters will not be standard unless x t I(0). For example, when x t is non-stationary, the joint significance of the estimated coefficients cannot be based on standard F-tests (see Hendry and Juselius (2000) for a discussion in the context of single-equation models). We will now turn to the issue of non-stationarity in the VAR model. 5. Cointegration in the VAR We first note that the general condition for x t I(0) is that Π has full rank, so is non-singular. In this case, Π (1) = Π 0, which corresponds to condition (a) in Section 3.4 that all the eigenvalues should lie within the unit circle. Stationarity can be seen as follows: stationary variables cannot grow 4 In the general VAR(k) model, Φ matrices cumulate the longer lag coefficients of the levels representation. 5 In the general VAR(k), Φ matrices cumulate the earlier lag coefficients.

13 Explaining Cointegration Analysis: Part II 13 systematically over time (that would violate the constant-mean requirement), so if x t I(0) in (7), then E[ x t ]=0. Taking expectations yields: ΠE [x t 1 ]+π = 0 (10) so when Π has full rank, E [x t ] = Π 1 π. Thus, the levels of stationary variables have a unique equilibrium mean this is precisely why stationarity is so unreasonable for economic variables which are usually evolving! When Π is not full rank (i.e., when x t exhibits I(1) behavior), (10) leaves some of the levels indeterminate. At the other extreme, where Π = 0, the VAR becomes one in the differences x t, and these are stationary if Φ 1 I p has full rank, in which case x t I(1). Notice that Φ 1 = I p when Π = 0 makes x t a vector of random walks, so x t I(2). Section 5.1 presents the conditions for cointegration in the I(1) model as restrictions on the Π matrix and Section 5.2 discusses the properties of the vector process when the data are I(1) and cointegrated, based on the moving-average representation Determining cointegration in the VAR model As discussed in Section 3.4, when some of the roots of the system (4) are on the unit circle (case (b)), the vector process x t is non-stationary. However, some linear combinations, denoted β x t, might be stationary even though the variables themselves are non-stationary. Then the variables are cointegrated from I(1), down one step to I(0), which Engle and Granger (1987) expressed as being CI(1, 1). There are two general conditions for x t I(1), which we now discuss. The first condition, needed to ensure that the data are not I(0),isthatΠ has reduced rank r<p,so can be written as: Π = αβ (11) where α and β are p r matrices, both of rank r. Substituting (11) into (9) delivers the cointegrated VAR model: 2 x t = Φ x t 1 + α ( β x t 1 ) + π + ɛt. (12) An important feature of reduced rank matrices like α and β is that they have orthogonal complements, which we denote by α and β : i.e., α and β are p (p r) matrices orthogonal to α and β (so α α = 0 and β β = 0), where the p p matrices (α α ) and (β β ) both have full rank p. These orthogonal matrices play a crucial role in understanding the relationship between cointegration and common trends as we explain below (a simple algorithm for constructing α and β from α and β is given in Hendry and Doornik (1996)). Note that multiplying (12) by α will eliminate the cointegrating relations since α α = 0. The second condition, which is needed to ensure that the data are not I(2), is somewhat more technical, and requires that a transformation of Φ in (12) must be of full rank. 6 Here, we will disregard the I(2) problem and only discuss the case when the footnoted condition is satisfied. If r = p, thenx t is stationary, so standard inference (based on t, F, and χ 2 ) applies. If r =0, then x t is stationary, but it is not possible to obtain stationary relations between the levels of the 6 Specifically: α Φβ = ζη (13) where ζ and η are (p r) s matrices for s = p r, which is the number of common stochastic trends of first order when x t I(1). Thus, (13) must have full rank (s = p r) forx t I(1). If s<p r, then the model contains p r s second-order stochastic trends, and x t I(2).

14 14 David F. Hendry and Katarina Juselius variables by linear combinations. Such variables do not have any cointegration relations, and hence, cannot move together in the long run. In this case, each of (7) (9) becomes a VAR model in differences but, since x t I(0), standard inference still applies. If p>r>0, thenx t I(1) and there exist r directions in which the process can be made stationary by linear combinations, β x t. These are the cointegrating relations, exemplified in (2) above by the two price differentials The VAR model in moving-average form When the characteristic polynomial Π(z) =I p Π 1 z Π 2 z 2 contains a unit root, the determinant Π(z) =0for z =1,soΠ(z) cannot be inverted to express x t as a moving average of current and past ɛ t. Instead, we must decompose the characteristic polynomial into a unit-root part and a stationary invertible part, written as the product: Π(z) =(1 z)π (z), where Π (z) has no unit roots, and is invertible. The VAR model can now be written as: (1 L)x t = x t =[Π (L)] 1 ɛ t. (14) Thus, x t is a moving average of current and past ɛ t. To see the nature of that relation, expand [Π (L)] 1 as a power series in L: [Π (L)] 1 = C 0 + C 1 L + C 2 L = C(L) (say). In turn, express C(L) as: C (L) =C + C (L)(1 L), so C(1) = C as C (1)(1 1) = 0. We can now rewrite (14) as: x t =[C + C (L)(1 L)] ɛ t. By integration (dividing by the difference operator, (1 L)): ( ) ɛt x t = C + C (L)ɛ t + x 0, 1 L for some initial condition denoted x 0, which we set to zero to express x t as: x t = C t ɛ i + C (L)ɛ t. (15) i=1 In (15), x t is decomposed into a stochastic trend, C t i=1 ɛ i, and a stationary stochastic component, e t = C (L)ɛ t. 7 There are (p r) linear combinations between the cumulated residuals, α t i=1 ɛ i 7 It can be shown that: C = β (α Φβ ) 1 α, (16) so the C matrix is directly related to (13), and can be calculated from estimates of α, β, and Φ: see e.g., Johansen (1992a). Letting B = β (α Φβ ) 1, then C = Bα, so the common stochastic trends have a reduced-rank representation similar to the stationary cointegration relations.

15 Explaining Cointegration Analysis: Part II 15 which define the common stochastic trends that affect the variables x t with weights B, wherec = Bα. In this sense, there exists a beautiful duality between cointegration and common trends. The following example illustrates. Assume that there exists one common trend between the two gasoline price series, and hence one cointegration relation as reported in Hendry and Juselius (2000). Then, r =1and p r =1,andwe can write the moving-average (common-trends) representation as: ( p1,t ) = p 2,t ( b11 b 21 ) ( t i=1 ) ( ) ( ) δ1 e1,t u i + t +, (17) δ 2 e 2,t where B = (b 11,b 21 ) are the weights of the estimated common trend given by û i = α ɛ i, and δ 1,δ 2 are the coefficients of linear deterministic trends in p 1,t,p 2,t, respectively. Hendry and Juselius (2000) showed that (p 1,t p 2,t ) I(0), i.e., that the two prices were cointegrated with β =(1, 1). Furthermore, the assumption behind the single-equation model in Hendry and Juselius (2000) was that α =(, 0). These estimates of β and α correspond to B =(1, 1) and α =(0, 1) in (17). We express this outcome as: cumulated shocks to p 2,t give an estimate of the common stochastic trend in this small system. Both prices are similarly affected by the stochastic trend, so that in: (p 1,t p 2,t )=(b 11 b 21 ) t u i +(δ 1 δ 2 )t +(e 1,t e 2,t ) (18) i=1 we have (b 11 b 21 )=0, and the linear relation (p 1,t p 2,t ) has no stochastic trend left, thereby defining a cointegration relation. Note that (17) also allows for a deterministic linear trend t in x t. If, in addition, δ 1 = δ 2, then both the stochastic and the linear trend will cancel in the linear relation (p 1,t p 2,t ) in (18). If δ 1 δ 2, then we need to allow for a linear trend in the cointegration relation, such that (p 1,t p 2,t d 1 t) contains neither stochastic nor deterministic trends. In this case, we say that the price differential is trend stationary, and that there is a trend in the cointegration space. We have reached the stage where we need a more complete discussion of the key role played by deterministic terms in cointegrated models. 6. Deterministic components in a cointegrated VAR A characteristic feature of the equilibrium-correction formulation (12) is the inclusion of both differences and levels in the same model, allowing us to investigate both short-run and long-run effects in the data. As discussed in Hendry and Juselius (2000), however, the interpretation of the coefficients in terms of dynamic effects is difficult. This is also true for the trend and the constant term,as well as other deterministic terms like dummy variables. The following treatment starts from the discussion of the dual role of the constant term and the trend in the dynamic regression model in Hendry and Juselius (2000), and extends the results to the cointegrated VAR model. When two (or more) variables share the same stochastic and deterministic trends, it is possible to find a linear combination that cancels both the trends. The resulting cointegration relation is not trending, even if the variables by themselves are. In the cointegrated VAR model, this case can be accounted for by including a trend in the cointegration space. In other cases, a linear combination of variables removes the stochastic trend(s), but not the deterministic trend, so we again need to allow for

16 16 David F. Hendry and Katarina Juselius a linear trend in the cointegration space. Similar arguments can be used for an intervention dummy: the intervention might have influenced several variables similarly, such that the intervention effect cancels in a linear combination of them, and no dummy is needed. Alternatively, if an intervention only affects a subset of the variables (or several, but asymmetrically), the effect will not disappear in the cointegration relation, so we need to include an intervention dummy. These are only a few examples showing that the role of the deterministic and stochastic components in the cointegrated VAR is quite complicated. However, it is important to understand their role in the model, partly because one can obtain misleading (biased) parameter estimates if the deterministic components are incorrectly formulated, partly because the asymptotic distributions of the cointegration tests are not invariant to the specifications of these components. Furthermore, the properties of the resulting formulation may prove undesirable for (say) forecasting, by inadvertently retaining unwanted components such as quadratic trends, as illustrated by Case 1 below. In general, parameter inference, policy simulations, and forecasting are much more sensitive to the specification of the deterministic than the stochastic components of the VAR model. Doornik, Hendry, and Nielsen (1998) provide a comprehensive discussion Intercepts in cointegration relations and growth rates Another important aspect is to decompose the intercept, π, into components that induce growth in the system, and those that capture the means of the cointegration relations. 8 Reconsider the VAR representation (7): x t = Φ 1 x t 1 + α ( β ) x t 1 + π + ɛt (19) where x t I(0) and ɛ t I(0). To balance (19), β x t 1 must be I(0) also. Since the r cointegration relations β x t 1 are stationary, each of them has a constant mean. Similarly, x t is stationary with a constant mean, which we denote by E[ x t ]=γ, describing a (p 1) vector of growth rates. This was illustrated in (17) by allowing for linear trends in the two prices with slope coefficients δ 1 and δ 2. Furthermore, let E [ β ] x t 1 = µ describe a (r 1) vector of intercepts in the cointegrating relations. We now take expectations in (19): (I p Φ 1 ) γ = αe [ β x t 1 ] + π = αµ + π. Consequently, π =(I p Φ 1 ) γ αµ. Note that the constant term π in the VAR model does indeed consist of two components: one related to the linear growth rates in the data, and the other related to the mean values of the cointegrating relations (i.e., the intercepts in the long-run relations). This decomposition is similar to the simpler single-equation case discussed in Hendry and Juselius (2000). When the cointegration relations are trend free as in (19): so we can express (19) in mean-deviation form as: β E [ x t ]=E [ β x t ] = β γ = 0, ( x t γ) =Φ 1 ( x t 1 γ)+α ( β x t 1 µ ) + ɛ t. (20) There are two forms of equilibrium correction in (20): that of the growth x t in the system to its mean γ; and of the cointegrating vectors β x t 1 to their mean µ. The two mean values, γ and 8 One of the reasons we assume all the variables are in logs is to avoid the growth rates depending on the levels of the variables.

17 Explaining Cointegration Analysis: Part II 17 µ, play an important role in the cointegrated VAR model, and it is important to ascertain whether they are significantly different from zero or not at the outset of the empirical analysis. In the next subsection, we will present five baseline cases describing how the trend and the intercept can enter the VAR specification Five cases for trends and intercepts The basic ideas are illustrated using the p-dimensional cointegrated VAR with a constant and a linear trend, but to simplify notations we assume that only one lag is needed, so Φ 1 = 0. As before, ɛ t IN p [0, Ω ɛ ]: x t = αβ x t 1 + π + δt + ɛ t. (21) Without loss of generality, the two (p 1) vectors π and δ can each be decomposed into two new vectors, of which one is related to the mean value of the cointegrating relations, β x t 1 (case (3) in section 2 of Hendry and Juselius (2000)), and the other to growth rates in x t : π = αµ + γ 0 δ = αρ + τ (22) Substituting (22) into (21) yields: x t = αβ x t 1 + αµ + γ + αρt + τ t + ɛ t, (23) so, collecting terms in (23): x t = α ( β : µ : ρ ) x t 1 1 t +(γ + τ t)+ɛ t. (24) Thus, we can rewrite (21) as: x t = α β µ ρ x t 1 +(γ + τ t)+ɛ t, (25) where x t 1 =(x t 1, 1,t). We can always choose µ and ρ such that the equilibrium error (β ) x t = v t has mean zero (where β = ( β, µ, ρ ) ), so the trend component in (25) can be interpreted from the equation: E [ x t ]=γ + τ t. (26) Thus, γ 0 corresponds to constant growth in the variables x t (case 1 in section 2 of Hendry and Juselius (2000)), whereas τ 0 corresponds to linear trends in growth, and so quadratic trends in the variables. Hence, the constant term and the deterministic linear trend play a dual role in the cointegrated model: in the α directions they describe a linear trend and an intercept in the steady-state relations; in the remaining directions, they describe quadratic and linear trends in the data. To correctly interpret the model, one has to understand the distinction between the part of the deterministic component that belongs to the cointegration relations, and the part that belongs to the differences. In empirical work, usually one has some idea whether there are linear deterministic trends in some (or all) of the variables. It might, however, be more difficult to know if they cancel in the cointegrating

18 18 David F. Hendry and Katarina Juselius relations or not. Luckily, we do not need to know beforehand, because the econometric analysis can be used to find out. As discussed below, all these cases can be expressed as linear restrictions on the deterministic components of the VAR model and, hence, can be tested. We now discuss five of the most frequently used models arising from restricting the deterministic components in (21): see Johansen (1994). Case 1. Case 2. No restrictions on π and δ, so the trend and intercept are unrestricted in the VAR model. With unrestricted parameters,π, δ, the model is consistent with linear trends in the differenced series x t as shown in (26) and, thus, quadratic trends in x t. Although quadratic trends may sometimes improve the fit within the sample, forecasting outside the sample is likely to produce implausible results. Be careful with this option: it is preferable to find out what induced the apparent quadratic growth and, if possible, increase the information set of the model. Moreover, as shown in Doornik, Hendry, and Nielsen (1998), estimation and inference can be unreliable. τ = 0, but γ, µ, ρ remain unrestricted, so the trend is restricted to lie in the cointegration space, but the constant is unrestricted in the model. Thus, τ being zero in (25) still allows linear, but precludes quadratic, trends in the data. As illustrated in the previous section, E[ x t ]=γ 0 implies linear deterministic trends in the level x t. When, in addition, ρ 0, these linear trends in the variables do not cancel in the cointegrating relations, so the model contains trend-stationary relations which can either describe a single trend-stationary variable, (x 1,t b 1 t) I(0), oran equilibrium relation (β 1x t b 2 t) I(0). Therefore, the hypothesis that a variable is trendstationary can be tested in this model. Case 3. δ = 0, so there are no linear trends in (21). Since the constant term π is unrestricted, thereare still linear trends in the data, but no deterministic trends in any cointegration relations. Also, E[ x t ]=γ 0, is consistent with linear deterministic trends in the variables but, since ρ = 0, these trends cancel in the cointegrating relations. It appears from (24) that π 0 accounts for both linear trends in the DGP and a non-zero intercept in the cointegration relations. Case 4. Case 5. δ = 0, γ = 0, butµ 0,so the constant term is restricted to lie in the cointegration space in (25). In this case, there are no linear deterministic trends in the data, consistent with E[ x t ]=0. The only deterministic components in the model are the intercepts in any cointegrating relations, implying that some equilibrium means are different from zero. δ = 0 and π = 0, so the model excludes all deterministic components in the data, with both E[ x t ]=0 and E[β x t ]=0, implying no growth and zero intercepts in every cointegrating relation. Since an intercept is generally needed to account for the initial level of measurements, x 0, only in the exceptional case when the measurements start from zero, or when the measurements cancel in the cointegrating relations, can the restriction π = 0 be justified. Turning to our empirical example, Table 1 showed that E[ p i,t ]=0could not be rejected. Hence, there is no evidence of linear deterministic trends in the gasoline prices, at least not over the sample period. The graphs in Figure 1 support this conclusion. We conclude that the cointegrated VAR model should be formulated according to case 4 here, with the constant term restricted to the cointegration space, and no deterministic trend terms. 7. The likelihood-based procedure So far, we have discussed the formulation of the VAR model in terms of well-specified stochastic and deterministic properties. All this can be done before addressing the unit-root problem. As in Hendry

19 Explaining Cointegration Analysis: Part II 19 and Juselius (2000), we will now assume that some of the roots of the characteristic polynomial are on the unit circle. This means that x t I(1), and we consider the cointegrated VAR(2) model (7) where Π = αβ with a constant, a linear trend and a vector of dummy variables d t included: x t = Φ 1 x t 1 + αβ x t 1 + π + δt + Ψd t + ɛ t. (27) Since x t I(0) and ɛ t I(0), all stochastic components in (27) are stationary by definition except for β x t 1. For (27) to be internally consistent, given that x t I(1), β cannot be a full-rank p p matrix (because then something stationary would be equal to something non-stationary). The only possible solution is that β is a reduced rank (p r) matrix with r<p,sor linear combinations cancel stochastic trends as shown in section 5. Below, we will only discuss the broad ideas of the maximum likelihood estimation procedure, and will not go through the derivations of the results. The interested reader is referred to Johansen (1988, Johansen (1995) and Banerjee, Dolado, Galbraith, and Hendry (1993), inter alia, for details. The derivation of the maximum likelihood estimator (MLE) is done via the concentrated likelihood of the VAR model. Since the latter is crucial for understanding both the statistical and economic properties of the VAR, we will demonstrate how it is defined. We use the following shorthand notation: z 0,t = x t z 1,t = x t 1 z 2,t = ( x t 1, 1, t, d t ). (additional lagged differences are easily included in z 2,t ). Rewrite (27) as: z 0,t = αβ z 1,t 1 + Θz 2,t + ɛ t, where Θ =(Φ 1, π, δ, Ψ). By concentrating out the short-run dynamic adjustment effects, Θz 2,t, we are able to obtain a simpler model. This is done by first defining the following auxiliary OLS regressions: z 0,t = D 1 z 2,t + x t z 1,t = D 2 z 2,t + x t 1, where x t = z 0,t M 02 M 1 22 z 2,t and x t 1 = z 1,t M 12 M 1 22 z 2,t are the OLS residuals, and M ij = t (z i,tz j,t )/T is a product-moment matrix, so that D 1 = M 02M 1 22 and D 2 = M 12M The concentrated model can now be written as: x t = αβ x t 1 + u t, (28) so we have transformed the original VAR containing short-run adjustments and intervention effects into the baby model form, in which the adjustments are exclusively towards the long-run steady-state relations. The MLE is close to limited-information maximum likelihood (LIML: see Hendry (1976) for a consolidation) in that the key issue is to handle a reduced-rank problem, which essentially amounts to solving an eigenvalue problem. In practice, the estimators are derived in two steps. First, to derive an estimator of α, assume that β is known: then β x t 1 becomes a known variable in (28), so α can be estimated by OLS. Next, insert that α = α(β) in the expression for the concentrated likelihood function, which becomes a function of β alone, and no longer depends on α. To find the value of β that maximizes this likelihood function is a non-linear problem, but one that can be solved by reduced-rank regression (see Johansen (1988)). The solution delivers p eigenvalues λ i where 0 λ i 1: λ =(λ 1,λ 2,...,λ p ),

20 20 David F. Hendry and Katarina Juselius which are ordered such that λ 1 λ 2 λ p. The estimate of β for r cointegrating vectors is givenbythep r matrix of eigenvectors corresponding to the largest r eigenvalues (the selection of r isdiscussedinsection8). GiventheMLE β of β, calculate α = α( β). The estimates of the two eigenvectors, and the corresponding α weights for the empirical example are reported in Table 3. Each λ i can be interpreted as the squared canonical correlation between linear combinations of the levels, β i x t 1, and a linear combination of the differences, ϕ i x t. In this sense, the magnitude of λ i is an indication of how strongly the linear combination β i x t 1 is correlated with the stationary part of the process ϕ i x t. If λ i 0, the linear combination β i x t 1 is not at all correlated with the stationary part of the process, and hence is non-stationary. In the situation where both α and β are unrestricted (beyond normalizations), standard errors cannot be obtained, but the imposition of rank and other identifying restrictions usually allows appropriate standard errors to be obtained for α and β. 8. Testing cointegration rank Given that the unrestricted VAR model has been found to satisfactorily describe the data (is a congruent representation), one can start the simplification search, which means imposing valid restrictions on the model such as a reduced-rank restriction, restrictions on the long-run parameters β, and finally restrictions on the short-run adjustment parameters α and Φ. The first, and most crucial, step is to discriminate empirically between zero and non-zero eigenvalues when allowing for sample variation, and then impose an appropriate cointegration rank restriction r on the Π matrix. Note that: if we underestimate r, then empirically-relevant equilibrium-correction mechanisms (EqCMs) will be omitted; whereas if we overestimate r, the distributions of some statistics will be non-standard, so that incorrect inferences may result from using conventional critical test values (based on t, F, χ 2 ); forecasts will be less accurate due to incorrectly retaining I(1) components, which will increase forecast variances. Atestforr cointegrating vectors can be based on the maximum likelihood approach proposed by Johansen (1988). The statistical problem is to derive a test procedure to discriminate between the λ i,i=1,...,r, which are large enough to correspond to stationary β i x t 1, and those λ i,i= r +1,...,p, which are small enough to correspond to non-stationary eigenvectors. The rank r is determined by a likelihood-ratio test procedure between the two hypotheses: H p :rank= p, i.e., full rank, so x t is stationary; H r :rank= r<p, i.e., r cointegration relations. The test is: LR(H r H p )= T ln [(1 λ r+1 ) (1 λ p )] = T p i=r+1 ln(1 λ i ). If λ r+1 = = λ p =0, the test statistic should be small (close to zero), which delivers the critical value under the null. The test is based on non-standard asymptotic distributions that have been simulated for the five cases discussed in section 6. There is an additional problem, in that H r may be correctly accepted when λ r =0, or even λ r 1 =0. Therefore, if H r is accepted, we conclude that

21 Explaining Cointegration Analysis: Part II 21 there are at least p r unit roots, i.e., p r common trends in the process (but there can be more) correspondingto at most r stationary relations. However, if LR(H r 1 H p ) is calculated, the test statistic includes ln(1 λ r ), which will not be close to zero, so an outcome in excess of the critical value should be obtained, correctly rejecting the false null of fewer that r cointegration relations. As discussed above, the asymptotic distributions depend on whether there is a constant and/or a trend; and whether these are unrestricted or not in the model. However other deterministic components, such as intervention dummies, are also likely to influence the shape of the test distributions. In particular, care should be taken when a deterministic component generates trending behavior in the levels of the data such as an unrestricted shift dummy (,0,0,0,1,1,1, ): an explanation of the procedure is provided in Johansen, Nielsen, and Mosconi (2000), and Juselius (2000): Doornik, Hendry, and Nielsen (1998) also consider the estimation and inference problems resulting from including dummies. Because the asymptotic distributions for the rank test depend on the deterministic components in the model and on whether these are restricted or unrestricted, the rank and the specification of the deterministic components have to be determined jointly. Nielsen and Rahbek (1998) have demonstrated that a test procedure based on a model formulation that allows a deterministic component, for example a deterministic trend t, to be restricted to the cointegration relations and the differenced component, t =1, to be unrestricted in the model induces similarity in the test procedure (i.e., the critical values do not depend on the parameter values, so can be tabulated). This is because when there are linear trends in the data, i.e. E[ x t ] 0, they can enter the model through the constant term, γ 0 in (26) or through the cointegration relations, ρ 0 in (25). Hence, given linear trends in the data, case 2 is the most general case. When the rank has been determined, it is always possible to test the hypothesis ρ = 0, as a linear restriction on the cointegrating relations. If, on the other hand, E[ x t ]=0, so there are no linear trends in the data, then the baseline model has the constant term restricted to the cointegration space, which is case 4 above. Therefore, based on the similarity argument, the rank should be based on either case 4 (trends in the data) or case 2 (no trends in the data). Nevertheless, if there is strong prior information that there are trends in the data, but they do not appear in the cointegration relations, then case 3 is the appropriate choice. 9. Empirical model specification The rank test is defined for a correctly specified model. Prior to the determination of the cointegration rank we should make sure that the empirical model is well-behaved. In Section 9.1, we choose the trend and constant in the baseline model, and account for the extraordinary events in the sample period; whereas in Section 9.2, we discuss the difficult choice of cointegration rank Model specification For the gasoline example, we found in Table 1 that E[ x t ]=0 cannot be rejected. Hence, there is little evidence of linear deterministic trends in the data, at least not over the sample period, so we should determine the rank based on a case-4 model. But before testing the rank of the Π matrix, we need to account for the effect of the Kuwait war on gasoline prices as discussed in Section 3.2, plus possibly some of the extraordinary price changes in this period that violate the normality assumption.

22 22 David F. Hendry and Katarina Juselius p 1,t 0.52 (11.3) p 2,t 0.05 (1.3) p 1,t 0.08 (3.6) p 2,t 0.07 (3.7) Table 2. The estimates of the short-run effects Φ The Kuwait war effects Ωε p 1,t 1 p 2,t 1 Ds90.31 t Ds90.31 t 1 ɛ 1,t ɛ 2,t 0.14 (2.3) 0.53 (12.0) 0.18 (7.8) 0.11 (5.8) 0.09 (4.0) 0.02 (1.2) ɛ 1,t 1.0 ɛ 2,t The estimates of the extraordinary price changes ( ɛ i,t > 3.3) Di89.13 Di89.39 Di89.51 Di90.49 Di9103 Di93.43 Dti (3.9) 0.04 (2.0) 0.09 (4.1) 0.05 (2.9) 0.08 (3.5) 0.07 (3.6) 0.09 (3.9) 0.08 (4.2) 0.05 (2.5) 0.08 (4.6) 0.08 (4.7) 0.03 (2.5) The question is now which kind of intervention dummies should be used to describe these extraordinary events. Juselius and Johansen (2000), show that a shift in the level of the variables corresponds to a permanent impulse blip in the differences, and an impulse effect in the levels corresponds to a transitory blip in the differences. The Kuwait war could have caused either a permanent or a transitory shift in the level of gasoline prices. In the latter case, we would see an initial rise in the gasoline prices in 1990:31 followed later on by a return to the previous price level. Based on the graphs in Figure 1, the differences show a large positive spike followed by another more moderately sized negative spike a few weeks later. This suggests that the initial increase in prices was partly permanent, partly transitory. To account for this extraordinary event on the price levels, the model includes a step dummy, Ds90.31 t beingzerouptot = 1990:31 and unity after that, restricted to lie in the cointegration space (if significant, then the permanent price increase in crude oil as a result of the war had a different effect on gasoline prices in the two locations). To account for the effect on the price changes, we also need to include current and lagged values of the impulse dummy Di90.31 t, defined below, as unrestricted in the model. Table 2 shows that the estimated direct price effect at the outbreak of the Kuwait war was an increase of 18% in location 1 and 11% in location 2, followed by a drop in the price by 9% in location 1 and 2% in location 2. Hence, the permanent effect of the war seemed to be approximately 9% in both locations. Altogether, the immediate price reaction in location 1 seemed to be stronger than in location 2. Based on the criterion ɛ i,t > 3.3, we detected seven additional outlier observations, accounted for by dummy variables defined as follows: the impulse dummy Dixx.yy t is unity for t =19xx:yy, and zero otherwise; the transitory impulse dummy Dtixx.yy t is unity for t =19xx:yy, 1 for t = 19xx:yy+1, and zero otherwise. Table2 reports the estimates of these very large price changes. Again, we note that the price reactions in location 1 seem stronger than in location 2: except for 1993:43, gasoline prices in location 1 change more dramatically than in location 2. The estimates of the Φ matrix in Table2 demonstrate quite strong autoregressive price behavior: a price change one week tends to be followed by a similar change but only half the size, next week. It also appears that a price change by p 2,t is followed by a lagged change in p 1,t, but not the other way around. The estimates of the residual covariance matrix Ω ε show a large positive correlation between price shocks to the two gasoline prices, which suggests that there may be current, as well as lagged, price effects. This will be further discussed in Section 11. After having accounted for these extraordinary events, the distributions of the residuals became

23 Explaining Cointegration Analysis: Part II 23 much closer to a normal distribution than in the first tentatively estimated model. However, the empirical model still showed some evidence of excess kurtosis and ARCH (autoregressive conditional heteroscedasticity, so the squared residuals are serially correlated: see Engle (1982)), as appears from Table 3, but neither of these usually causes serious problems for the properties of the estimates (Gonzalo, 199*). A plausible explanation for these problems is that the residual variance of the process changes in the middle of the sample, at around 1992:40. We have analyzed the data separately for the split sample, and the ARCH and excess kurtosis disappear. However, the basic results remained unchanged in the two sub-samples and there is, therefore, no obvious need to report the results separately. We conclude that the empirical model is reasonably well specified, and turn to the determination of the cointegration rank Rank determination As already mentioned, a correct choice of the cointegration rank is crucial for the analysis, but in practice, is far from easy. In many economic applications, the size of the sample is often quite small, and the tabulated asymptotic distributions can be rather poor approximations as has been demonstrated in many papers. See, for instance, Johansen (1999) for analytical results, and (ref.) for Monte Carlo or bootstrap results. Another reason for concern is that when using correct small-sample distributions for the trace test, the size of the test is correct, but the power can be low, sometimes even of the same magnitude as the size. In such cases, a 5% test procedure will reject a unit root incorrectly 5% of the time, but accept a unit root incorrectly 95% of the time! Thus, unless a unit root is given a structural interpretation (and hence, should be tested rigorously), it is important to make the decision based on as much information as possible, including prior economic information, and sensitivity analyses of doubtful cases to find out if important information is lost by leaving out the r th +1cointegration vector, or if anything is gained by including it. The following information is often useful when deciding on the choice of cointegration rank: 1. the trace test for cointegration rank; 2. the characteristic roots of the model: if the r th +1cointegration vector is non-stationary and is wrongly included in the model, then the largest characteristic root will be close to the unit circle; 3. the t-values of the α-coefficients for the r th +1cointegration vector; if these are all small, say less than 3.0, then one would not gain greatly by including that vector as a cointegrating relation in the model; 4. the recursive graphs of the trace statistic for r =1, 2,...,p: since the variable T j ln(1 λ i ), for j = T 1,...,T, grows linearly over time when λ i 0, the recursively-calculated components of the trace statistic should increase linearly for the first r components, but stay constant for the remainder; 5. the graphs of the cointegrating relations: if the graphs reveal distinctly non-stationary behavior of a cointegration relation, which is supposedly stationary, one should reconsider the choice of r, or find out if the model specification is in fact incorrect, for example, if the data are I(2) instead of I(1); 6. the economic interpretability of the results. We will consider all the above pieces of information in turn: The trace test (item 1) in the bivariate gasoline price case, should be able to discriminate between the following alternatives: no unit roots, one unit root, or two unit roots. The first case corresponds to

24 24 David F. Hendry and Katarina Juselius Table 3. Rank and specification tests Rank determination The two largest roots Misspecification tests λ i Trace Q 95 (r =2) (r =1) ARCH Norm. R p p The unrestricted cointegration vectors and their weights p 1 p 2 Ds90.31 constant β 1 xt β 2 xt β p 1 : ˆα (7.5) 0.02 (2.1) β p 2 : ˆα (1.2) 0.03 (3.8) prices being stationary, the second to them being I(1) with one stationary cointegration relation, and the last case to them being I(1) but with no cointegrating relation between them. The two trace test statistics in Table 3 are larger than their 95% quantiles, so that both λ 1 and λ 2 have to be considered different from zero, suggesting that both price series are stationary. As mentioned above λ i can be interpreted as a squared canonical correlation coefficient. It appears from the table that both λ 1 and λ 2 are small in absolute value, indicating a fairly low correlation with the stationary part, and that λ 1 is much larger than λ 2, suggesting that the adjustment to the first cointegration relation is much stronger than to the second. The graphs of the cointegration vectors (item 5) in Figure 4 indicate that both are mean-reverting, but the second one much more strongly so. The two largest characteristic roots (item 2) show that r =2implies a fairly large root, 0.93, in the model. The recursive graphs (item 4) of T j ln(1 λ 1j ) and T j ln(1 λ 2j ) in Figure?? show that the second component remains almost constant, whereas the first grows linearly over time. All this suggests that the first cointegrating relation is stationary and that the second is near integrated, but with significant mean reversion. A similar result was found in Hendry and Juselius (2000).

25 Explaining Cointegration Analysis: Part II 25 ecm ecm Figure 4. Graphs of the unrestricted cointegration relations β 1x t and β 2x t Recursively calculated trace tests.

26 26 David F. Hendry and Katarina Juselius Why does the trace test find that the small value of λ 2 =0.02 is still significantly different from zero? The simple explanation is that the sample size is very large here, 576 observations. Because the trace test is calculated as T ln(1 λ), even a small deviation from zero can be found to be significant when T is large enough. However, inference is much closer to the so called Dicky Fuller distributions than to standard t-, F-, and χ 2 -distributions when there is a near unit root in the model. Hence, to make inference more robust, it is often a good idea to approximate a near unit root by a unit root even when it is found to be statistically different from one. Before finally deciding about the rank, we first check the economic interpretability of the second cointegration relation to see if it contains valuable information for the analysis. It appears from Table 3 that the adjustment coefficients to the second relation α 21 and α 22 (item 3) are very small, α 21 is hardly significant in the first gasoline price equation, whereas α 22 is more significant in the second price equation. Nevertheless, the coefficients of β 2 x t (item 6) indicate that it is essentially a unit vector describing price 2. This is confirmed by the graphs of the cointegration vectors (item 5) in Figure 4, where the second graph closely resembles p 2,t in Figure 2. Therefore, accepting r =2is equivalent to saying that the gasoline prices are stationary, albeit subject to substantial persistence. As already discussed in Hendry and Juselius (2000), this might be the case, but choosing r =2would leave a near unit root in the model, and conventional (t-, F-, χ 2 ) inference is likely to be misleading. Moreover, asserting a constant long-run mean for nominal gasoline prices does not seem plausible. Finally, since β 2x t will not add much valuable information about the co-movements of the two gasoline prices, we conclude that the empirical analysis will not benefit from choosing r =2. We therefore continue the empirical analysis assuming r = Identification and hypothesis testing Given the choice of the number of cointegrating relations, r, the Johansen procedure gives the maximum likelihood estimates of the unrestricted cointegrating relations β x t. Although the unrestricted β is uniquely determined based on the chosen normalization, the latter is not necessarily meaningful from an economic point of view. Therefore, an important part of a long-run cointegration analysis is to impose (over-) identifying restrictions on β to achieve economic interpretability. In section 10.1, we discuss a typology of restrictions on β and note the problem of calculating the degrees of freedom in the case of over-identifying restrictions. Section 10.2 provides some examples of how to specify hypotheses in a testable form Restrictions on β As an example of just-identifying restrictions, consider the following design matrix Q =(β 1 ) where β a is a (r r) non-singular matrix defined by β =(β a, β b ). In this case, αβ = α(β a β 1 a β )= α(i r, β) where I r is the (r r) unit matrix, and β = β 1 a β b is an (r (p r)) full-rank matrix. These just-identifying restrictions have transformed β to the long-run reduced form. Because justidentifying restrictions do not change the likelihood function, no tests are involved. In general, just identification can be achieved by imposing one appropriate normalization and (r 1) restrictions on each β i. Care is required to ensure that the coefficient which is normalized is non-zero. The calculation of the degrees of freedom when testing structural hypotheses on the cointegration relations is often quite difficult. It is useful from the outset to distinguish between:

27 Explaining Cointegration Analysis: Part II pseudo restrictions that can be obtained by linear manipulations, because it is always possible to impose r 1 restrictions and one normalization without changing the value of the likelihood function no testing is involved for such restrictions ; 2. additional testable restrictions on the parameters, which change the value of the likelihood function; in the latter group there are two kinds of testable restrictions: (a) restrictions that are not identifying, for example: (i) the same restrictions on all cointegrating vectors, (ii) one vector assumed known and the remaining vectors unrestricted. (b) genuine over-identifying restrictions. The first step is to examine whether the restrictions satisfy the rank and order condition for identification: luckily, many available software packages do the checking, and will usually inform the user when identification is violated. Also, note that when normalizing a β i vector by diving through by a nonzero element β ij, the corresponding α i vector will by multiplied by the same element, so normalization does not change Π = αβ Hypotheses testing Hypotheses on the cointegration vectors can be formulated in two alternative ways: either by specifying the s i free parameters in each β vector, or by specifying the m i restrictions on each vector. We consider each in turn. First: β =(H 1 κ 1,..., H r κ r ), where β is (p 1 r), κ i are (s i 1) coefficient matrices, and H i are (p 1 s i ) design matrices where p 1 is the dimension of x t 1 in (25). In this case, we use the design matrices to determine the s i free parameters in each cointegration vector. The other way of formulating restrictions is: R 1β 1 = 0,..., R rβ r = 0 where R i is a p m i restrictions matrix. Note that R i = H,i, i.e., R i H i = 0. In some cases, we may want to test restrictions which are not identifying. Such restrictions could be the same restriction on all cointegration relations, for example long-run price homogeneity on all vectors. In this case, the H i (or R i ) are all identical, and we can formulate the hypothesis as β = Hκ, where κ is now an s r matrix of free parameters. Another possibility is to test a hypothesis on just one of the cointegration vectors. In this case, we formulate the hypothesis as β = {H 1 κ 1, κ 2,..., κ r }, where H 1 is a (p s 1 ) matrix, κ 1 is (s 1 1), and the other vectors are unrestricted. All these hypotheses can be tested by a likelihood-ratio procedure described in detail in Johansen and Juselius (1990, Johansen and Juselius (1992) Johansen. and Juselius (1994). Our simple empirical example consists of only two gasoline prices, so the number of interesting hypotheses to test is limited. Here, we will only test one hypothesis as an illustration and refer the interested reader to the many published papers containing a wide variety of testable hypotheses, for example Juselius.... Table 3 showed that the unrestricted coefficients of the first cointegration vector were almost equal with opposite signs, indicating long-run price homogeneity. This hypothesis can be formulated either as β =(Hκ) or, equivalently, as R β = 0, where: H = , κ = κ 1 κ 2 κ 3, and R = ( ).

28 28 David F. Hendry and Katarina Juselius The first alternative specifies the hypothesis in terms of the s 1 =3remaining free parameters; the second in terms of the m 1 =1imposed restriction. The hypothesis is tested by the LR test procedure described in Johansen and Juselius (1992). The test statistic value was 0.55, distributed as χ 2 (1), and hence accepted with a p-value of The restricted cointegration relation became: p 1 p Ds (1.8) (0.6) α 11 = 0.14, α 12 = 0.01 (7.3) (0.8) The constant term is not significantly different from zero, but for reasons discussed above in Section 6, we prefer to keep it in the cointegration relation. Similarly, the step dummy is only borderline significant (cf. the discussion in Section 9.1), and suggests that the permanent increase in gasoline prices at location 1 might (possibly) have been slightly higher than at location 2. The estimates of the adjustment coefficients α 1i show significant adjustment in the first price, but not in the second. This suggests that a price shock leading to an increase in the price differential between the two locations will initiate an adjustment in the first price, but not in the second: in this sense, the second location is a price leader, i.e., is driving the gasoline prices in this market. This takes us to the next issue, which is exogeneity and partial systems. 11. Partial systems and exogeneity The hypothesis that a variable is influencing the long-run development of the other variables of the system, but is not influenced by them, is called the hypothesis of no levels feedback, or long-run weak exogeneity (when the parameters of interest are β). We test the following hypothesis on α: H α (r) :α = A α (29) where α is p r, A is a p s matrix, α is an s r matrix of nonzero α-coefficients and s r. (Compare this formulation with the hypothesis of the same restriction on all β, i.e. β =(H β).) As with tests on β, we can express the restriction (29) in the equivalent form: H α (r) :Bα = 0 (30) where B = A. The condition s r implies that the number of non-zero rows in α must not be greater than r. This is because a variable that has a zero row in α is not adjusting to the long-run relations and, hence, can be considered as a driving trend in the system, i.e., as a common stochastic trend. Since there can at most be (p r) common trends, the number of zero-row restrictions can at most be equal to (p r). The hypothesis (29) can be expressed as: ( ) ( ) αa α =. α b 0 Under H 0 : x t = Φ 1 x t 1 + αβ x t 1 + π + Ψd t + ɛ t. (31)

29 Explaining Cointegration Analysis: Part II 29 The weak exogeneity hypothesis can be tested with a LR test procedure described in Johansen and Juselius (1992). It is asymptotically distributed as χ 2 with the degrees of freedom υ = s r, i.e., equal to the number of zero restrictions on the α-coefficients. If the zero-row restriction on α is accepted, we can partition the p variables into (p s) variables which exhibit levels feedback, and s variables with no levels feedback. We say that the s variables are weakly exogenous when the parameters of interest are β. Because the s weakly-exogenous variables do not contain information about the long-run parameters, we can obtain fully-efficient estimates of β from the (p s) adjustment equations, conditional on the marginal models of the s weakly-exogenous variables (see Engle, Hendry, and Richard (1983), Johansen (1992b), and Hendry (1995b)). This gives the condition for when partial models can be used to estimate β without losing information. More formally, this can be stated as: let {x t } = {x a,t, x b,t } where x b,t is weakly exogenous when β is the parameter of interest; then a fully efficient estimate of β can be obtained from the partial model: x a,t = A 0 x b,t + Φ 11 x t 1 + α 1 β x t 1 + π 1 + Ψ 1 d t + v 1,t. (32) In our simple bivariate model of the two gasoline prices, weak exogeneity can be tested as a hypothesis on one α coefficient. Because p =2and r =1, there can at most be one weakly exogenous variable. It appeared from Table 3 that α 11 is statistically significant, whereas α 21 is not. Hence, in this case p 2,t is weakly exogenous based on a t-test. In the bivariate system, the t-test and the χ 2 -test are asymptotically equivalent. In a larger system with several cointegrating relations, the hypothesis that a variable is long-run weakly exogenous is that a row of α-coefficients is zero, i.e., R α = 0. In our simple example, R =(0, 1). The test statistic value, distributed as χ 2 (1), was 0.48 and, hence, is accepted. Long-run weak exogeneity does not imply short-run weak-exogeneity. Hence, it does not exclude the possibility that p 1,t is reacting in the short run to changes in p 2,t. But in many cases, the economic interest is in the long-run effects, and establishing long-run weak exogeneity for a variable is often used as a justification for continuing the model analysis conditional on the weakly exogenous variable(s). Although a full system analysis is usually needed to test for weak exogeneity (see Harbo, Johansen, Nielsen, and Rahbek (1999), for a test procedure based on a partial system), there can be some advantages to continuing the analysis in a partial system. For example, if the weakly-exogenous variable has been subject to many interventions (large shocks) and current changes in this variable have significantly affected the other variables in system, so A 0 0 in (32), then conditioning on the weakly-exogenous variable is likely to reduce the need for intervention dummies in the model, and possibly lead to more constant parameter estimates. For example, if the model had additionally included the price of crude oil and it had been found to be weakly exogenous, conditioning on the current change of the price of crude oil would probably have taken care of many of the large changes in gasoline prices that in the present model had to be accounted for by dummy variables. Moreover, the residual variance in the conditional model is often much reduced relative to the marginal model, hence improving the precision of statistical inference. In our simple bivariate model, p 2,t was found to be weakly exogenous and we present the estimation results when conditioning on it in (33). Note that this model is the equivalent of the dynamic, singleequation EqCM in Hendry and Juselius (2000): p 1,t = 0.48 p 1,t p 2,t 0.33 p 2,t (14.8) (25.1) (7.2) (8.7) σ ɛ =0.016, R 2 =0.77 ecm t 1 + As discussed above, the need for dummy variables changed in this model version: five of the seven dummy variables became insignificant after conditioning on p 2,t. But, because the residual standard (33)

30 30 David F. Hendry and Katarina Juselius error dropped from to 0.016, some of the larger price changes no longer passed the outlier criterion ɛ i,t > 3.3, so in the final model we needed the same number of outlier dummies as before, albeit at different dates. 12. Forecasting in cointegrated processes The existence of cointegration between series importantly affects forecasts of them, since some linear combinations remain linked, whereas others drift apart. It seems less important whether or not such cointegration links are imposed in estimation: the additional errors from not doing so are O(1/T ). The topic is treated at length in Clements and Hendry (1998, Clements and Hendry (1999); here we consider the simplest case, where there are no unmodeled structural changes in the data generation process. Consider an h-period ahead forecast based on a dynamic system for the vector of p variables x t, using the simplest EqCM: x t = π + Πx t 1 + ɛ t where ɛ t IN n [0, Ω ɛ ]. (34) Then, h-periods ahead, from an end-of-sample point T (the forecast origin), the outcome is: The VAR forecast is: h 1 x T +h = Π h x T + Π i (π + ɛ T +h i ). (35) i=0 x T +h = Π h 1 h x T + i=0 Π i π, which can be rather uncertain due to powering up Π (which contains estimated unit roots), and not partitioning the componentsof π between equilibrium means and growth rates. When there is cointegration, we can write (34) as the EqCM: x t = γ + α ( β x t 1 µ ) + ɛ t. (36) It can be shown that when forecasting increasingly further ahead, forecasts of cointegrating combinations converge on their equilibrium means, as is appropriate for stationary combinations: β x T +h µ. Consequently, forecasts of changes converge on γ, so any EqCM effect on x t dies out: x T +h γ. However, for levels ( x T +h ), the EqCM coefficient increases: thus, a small feedback coefficient α may nevertheless entail a large long-run effect from the disequilibrium. Imposing valid cointegration restrictions can improve forecast accuracy somewhat (see Clements and Hendry (1995)), but for short-horizon forecasts, any potential benefit can be overwhelmed by changes in the parameters γ and µ (surprisingly, changes to the other parameters are less pernicious: see Hendry (2000)). Imagine an unmodeled change in µ to µ (say): the model treats the discrepancy ( β x T µ ) as a disequilibrium, so forecasts of changes go in the opposite direction to the data if growth actually increases, the model will forecast a fall. For example, if the price differential between the two gasoline prices changed permanently at the outbreak of the Kuwait war (i.e., relative gasoline prices moved to a new equilibrium position), without us modeling it explicitly, then the model will interpret the increased price differential as a disequilibrium, so (wrongly) forecast a reduction back

31 Explaining Cointegration Analysis: Part II 31 to the old equilibrium. This is why it is important to view the model as equilibrium correcting : it always adjusts to the imposed equilibrium (whether that is correct or not), and certainly does not error correct after a shift to a new equilibrium position. However, Clements and Hendry (1999) consider a variety of solutions to this problem. 13. Parameter constancy and policy Constancy of the parameters of models is a fundamental attribute for inference, theory, forecasting and policy, and is no less relevant in cointegrated systems. Many tests exist for that hypothesis (see e.g., Hendry (1995a), and Doornik and Hendry (1999)), and it has been established that the conventional distributions of diagnostic test statistics remain valid in integrated-cointegrated systems (see e.g., Wooldridge (1999)), so the usual tests remain applicable. We remarked in the previous section on the crucial role of changes in deterministic terms in inducing forecast failure, and the consequential ease of detection, as against the difficulty of detecting changes in (say) α. Unfortunately, some procedures for analyzing policy in cointegrated VARs, such as impulse-response analyses, are heavily dependent on the absence of changes in the parameters of the dynamics the very feature that is difficult to detect. Thus, their implications must remain tentative pending better tests for mean-zero parameter changes (for additional critiques, see Ericsson, Hendry, and Mizon (1998), and Hendry and Mizon (1998): Banerjee, Hendry, and Mizon (1996), provide an overview). We also remarked on the potential advantages of open or partial systems which validly conditioned on weakly-exogenous variables. At first sight, it would seem unlikely that policy variables could be weakly exogenous, since the disequilibria in an economy are a major determinant of policy actions, and cointegration deviations comprise an important class of disequilibria. However, provided the cointegration analysis is appropriately undertaken in the full (closed) system, and only then are the cointegration vectors computed, weak exogeneity (necessary for efficient conditional inference) need not be invoked. Once the β x t are estimated, they become variables, the presence of which in several equations, including those equations determining conditional policy variables, is inconsequential. Thus, conditional policy models can be constructed. This is invaluable, since most practical policy involves changes in variables with non-zero means, and hence unmodeled changes would be easily detected. Thus, one can learn of any mistakes quickly, and rapidly adapt to them, rather than persist with an incorrect policy. 14. Conclusions The recognition that economic time series are non-stationary has profoundly altered the technology of econometrics, introducing the concepts and tools associated with integrated-cointegrated data that have been the focus of our two-part exposition. In part, plus ça change, plus c est la meme chose rules: many inferences can be conducted as in a stationary world, since unit roots can be removed by the linear transformations of cointegration and differencing. But, additional sources of non-stationarity still remain problematic, especially structural change; changes in the covariances of the data violate one of the underlying assumptions of the VAR model. Nevertheless, we can understand, and resolve other aspects of unit-root data such as nonsense regressions (dating back to the century before last),

32 32 David F. Hendry and Katarina Juselius how to avoid them (as well as deliberately create them, as in Hendry (1980)). In part I, we formalized an appropriate canonical model, the equilibrium-correction model, relevant for non-stationarity (unit-root) economic data, which resolves the nonsense regression problem. Here we have analyzed the properties of this model in further detail, demonstrated the changes needed to validate inference procedures, and illustrated the powerful new modeling procedures with gasoline price series relevant for energy economics. Application of cointegration analysis requires careful thought about model specification and interpretation, and an increased emphasis on the appropriate treatment of deterministic terms. How they enter a model, and whether the same terms also enter the DGP both affects parameter inference and the behavior of the model for such purposes as policy and forecasting. Recent developments in the theory of forecasting, summarized in the two volumes Clements and Hendry (1998, Clements and Hendry (1999), have highlighted the key role in forecast failure of shifts in the deterministic terms, so more attention should be paid to the theory and practice of modeling deterministic terms. The reader who has followed both parts of our overview in their entirety has traveled a long route. We have explained the many concepts, and the associated language of the unit-root revolution in econometrics, which admittedly has been a slow revolution! Regrettably we suspect that the worry expressed in Working (1934) that Economic theory has fallen far short of recognizing the full implications of the resemblance of many economic time series to random-difference series still remains. Modeling cointegrated series is difficult because of the need to model systems of equations in which one has to simultaneously specify the deterministic terms and how they enter, determine the lag length, and ensure a congruent representation. Nevertheless, powerful software facilitates the task for those wishing to undertake their own analyses, including the programs PcGive and CATS in RATS that we have utilized. We also hope that publications on the topic will be more comprehensible in the future. To those who found the material hard going, we also draw on an old quote, by a discussant of Yule (1926), who complained about the mathematical difficulty of that paper yet school mathematics today easily suffices to understand it. There is no going back Acknowledgments All the computations reported in this paper were carried out using the PcGive suite (see especially Doornik and Hendry (1996, Doornik and Hendry (1999)), and CATS in RATS. We are pleased to acknowledge financial support from the Danish Social Sciences Research Council, and the UK Economic and Social Research Council under grant L We are grateful to Campbell Watkins for helpful comments on a previous draft. References Banerjee, A., J. J. Dolado, J. W. Galbraith, and D. F. Hendry (1993). Co-integration, Error Correction and the Econometric Analysis of Non-Stationary Data. Oxford: Oxford University Press. Banerjee, A., D. F. Hendry, and G. E. Mizon (1996). The econometric analysis of economic policy. Oxford Bulletin of Economics and Statistics 58, Clements, M. P. and D. F. Hendry (1995). Forecasting in cointegrated systems. Journal of Applied Econometrics 10, Clements, M. P. and D. F. Hendry (1998). Forecasting Economic Time Series. Cambridge: Cambridge University Press. The Marshall Lectures on Economic Forecasting.

33 Explaining Cointegration Analysis: Part II 33 Clements, M. P. and D. F. Hendry (1999). Forecasting Non-stationary Economic Time Series: The Zeuthen Lectures on Economic Forecasting. Cambridge, Mass.: MIT Press. Doornik, J. A. and D. F. Hendry (1996). GiveWin: An Interactive Empirical Modelling Program. London: Timberlake Consultants Press. Doornik, J. A. and D. F. Hendry (1999). Modelling Dynamic Systems using PcFiml 9 for Windows. London: Timberlake Consultants Press. Doornik, J. A., D. F. Hendry, and B. Nielsen (1998). Inference in cointegrated models: UK M1 revisited. Journal of Economic Surveys 12, Engle, R. F. (1982). Autoregressive conditional heteroscedasticity, with estimates of the variance of United Kingdom inflation. Econometrica 50, Engle, R. F. and C. W. J. Granger (1987). Cointegration and error correction: Representation, estimation and testing. Econometrica 55, Engle, R. F., D. F. Hendry, and J.-F. Richard (1983). Exogeneity. Econometrica 51, Reprinted in Hendry, D. F., Econometrics: Alchemy or Science? Oxford: Blackwell Publishers, 1993; and in Ericsson, N. R. and Irons, J. S.(eds.) Testing Exogeneity, Oxford: Oxford University Press, Ericsson, N. R., D. F. Hendry, and G. E. Mizon (1998). Exogeneity, cointegration and economic policy analysis. Journal of Business and Economic Statistics 16, Granger, C. W. J. (1986). Developments in the study of cointegrated economic variables. Oxford Bulletin of Economics and Statistics 48, Harbo, I., S. Johansen, B. Nielsen, and A. Rahbek (1999). Asymptotic inference on cointegrating rank in partial systems. Journal of Business and Economic Statistics.. Hendry, D. F. (1976). The structure of simultaneous equations estimators. Journal of Econometrics 4, Reprinted in Hendry, D. F. (1993), Econometrics: Alchemy or Science? Oxford: Blackwell Publishers. Hendry, D. F. (1980). Econometrics: Alchemy or science? Economica 47, Reprinted in Hendry, D. F. (1993), Econometrics: Alchemy or Science? Oxford: Blackwell Publishers. Hendry, D. F. (1995a). Dynamic Econometrics. Oxford: Oxford University Press. Hendry, D. F. (1995b). On the interactions of unit roots and exogeneity. Econometric Reviews 14, Hendry, D. F. (2000). On detectable and non-detectable structural change. Structural Change and Economic Dynamics 11, Hendry, D. F. and J. A. Doornik (1996). Empirical Econometric Modelling using PcGive for Windows. London: Timberlake Consultants Press. Hendry, D. F. and K. Juselius (2000). Explaining cointegration analysis: Part I. Energy Journal 21, Hendry, D. F. and G. E. Mizon (1998). Exogeneity, causality, and co-breaking in economic policy analysis of a small econometric model of money in the UK. Empirical Economics 23, Johansen, S. (1988). Statistical analysis of cointegration vectors. Journal of Economic Dynamics and Control 12, Reprinted in R.F. Engle and C.W.J. Granger (eds), Long-Run Economic Relationships, Oxford: Oxford University Press, 1991, Johansen, S. (1992a). A representation of vector autoregressive processes integrated of order 2. Econometric Theory 8, Johansen, S. (1992b). Testing weak exogeneity and the order of cointegration in UK money demand. Journal of Policy Modeling 14, Johansen, S. (1994). The role of the constant and linear terms in cointegration analysis of nonstationary variables. Econometric Reviews 13, Johansen, S. (1995). Likelihood-based Inference in Cointegrated Vector Autoregressive Models. Oxford: Oxford University Press. Johansen, S. and K. Juselius (1990). Maximum likelihood estimation and inference on cointegration With application to the demand for money. Oxford Bulletin of Economics and Statistics 52, Johansen, S. and K. Juselius (1992). Testing structural hypotheses in a multivariate cointegration analysis of the PPP and the UIP for UK. Journal of Econometrics 53, Wooldridge, J. M. (1999). Asymptotic properties of some specification tests in linear models with integrated processes. In R. F. Engle and H. White (Eds.), Cointegration, Causality and Forecasting, pp Oxford: Oxford University Press.

34 34 David F. Hendry and Katarina Juselius Working, H. (1934). A random difference series for use in the analysis of time series. Journal of the American Statistical Association 29, Yule, G. U. (1926). Why do we sometimes get nonsense-correlations between time-series? A study in sampling and the nature of time series (with discussion). Journal of the Royal Statistical Society 89, 1 64.

Chapter 6: Multivariate Cointegration Analysis

Chapter 6: Multivariate Cointegration Analysis Chapter 6: Multivariate Cointegration Analysis 1 Contents: Lehrstuhl für Department Empirische of Wirtschaftsforschung Empirical Research and und Econometrics Ökonometrie VI. Multivariate Cointegration

More information

Chapter 4: Vector Autoregressive Models

Chapter 4: Vector Autoregressive Models Chapter 4: Vector Autoregressive Models 1 Contents: Lehrstuhl für Department Empirische of Wirtschaftsforschung Empirical Research and und Econometrics Ökonometrie IV.1 Vector Autoregressive Models (VAR)...

More information

Normalization and Mixed Degrees of Integration in Cointegrated Time Series Systems

Normalization and Mixed Degrees of Integration in Cointegrated Time Series Systems Normalization and Mixed Degrees of Integration in Cointegrated Time Series Systems Robert J. Rossana Department of Economics, 04 F/AB, Wayne State University, Detroit MI 480 E-Mail: [email protected]

More information

Trends and Breaks in Cointegrated VAR Models

Trends and Breaks in Cointegrated VAR Models Trends and Breaks in Cointegrated VAR Models Håvard Hungnes Thesis for the Dr. Polit. degree Department of Economics, University of Oslo Defended March 17, 2006 Research Fellow in the Research Department

More information

Testing The Quantity Theory of Money in Greece: A Note

Testing The Quantity Theory of Money in Greece: A Note ERC Working Paper in Economic 03/10 November 2003 Testing The Quantity Theory of Money in Greece: A Note Erdal Özmen Department of Economics Middle East Technical University Ankara 06531, Turkey [email protected]

More information

Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model

Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model 1 September 004 A. Introduction and assumptions The classical normal linear regression model can be written

More information

Time Series Analysis

Time Series Analysis Time Series Analysis Identifying possible ARIMA models Andrés M. Alonso Carolina García-Martos Universidad Carlos III de Madrid Universidad Politécnica de Madrid June July, 2012 Alonso and García-Martos

More information

The VAR models discussed so fare are appropriate for modeling I(0) data, like asset returns or growth rates of macroeconomic time series.

The VAR models discussed so fare are appropriate for modeling I(0) data, like asset returns or growth rates of macroeconomic time series. Cointegration The VAR models discussed so fare are appropriate for modeling I(0) data, like asset returns or growth rates of macroeconomic time series. Economic theory, however, often implies equilibrium

More information

Vector Time Series Model Representations and Analysis with XploRe

Vector Time Series Model Representations and Analysis with XploRe 0-1 Vector Time Series Model Representations and Analysis with plore Julius Mungo CASE - Center for Applied Statistics and Economics Humboldt-Universität zu Berlin [email protected] plore MulTi Motivation

More information

Chapter 5: Bivariate Cointegration Analysis

Chapter 5: Bivariate Cointegration Analysis Chapter 5: Bivariate Cointegration Analysis 1 Contents: Lehrstuhl für Department Empirische of Wirtschaftsforschung Empirical Research and und Econometrics Ökonometrie V. Bivariate Cointegration Analysis...

More information

Chapter 9: Univariate Time Series Analysis

Chapter 9: Univariate Time Series Analysis Chapter 9: Univariate Time Series Analysis In the last chapter we discussed models with only lags of explanatory variables. These can be misleading if: 1. The dependent variable Y t depends on lags of

More information

SYSTEMS OF REGRESSION EQUATIONS

SYSTEMS OF REGRESSION EQUATIONS SYSTEMS OF REGRESSION EQUATIONS 1. MULTIPLE EQUATIONS y nt = x nt n + u nt, n = 1,...,N, t = 1,...,T, x nt is 1 k, and n is k 1. This is a version of the standard regression model where the observations

More information

Note 2 to Computer class: Standard mis-specification tests

Note 2 to Computer class: Standard mis-specification tests Note 2 to Computer class: Standard mis-specification tests Ragnar Nymoen September 2, 2013 1 Why mis-specification testing of econometric models? As econometricians we must relate to the fact that the

More information

PITFALLS IN TIME SERIES ANALYSIS. Cliff Hurvich Stern School, NYU

PITFALLS IN TIME SERIES ANALYSIS. Cliff Hurvich Stern School, NYU PITFALLS IN TIME SERIES ANALYSIS Cliff Hurvich Stern School, NYU The t -Test If x 1,..., x n are independent and identically distributed with mean 0, and n is not too small, then t = x 0 s n has a standard

More information

Univariate and Multivariate Methods PEARSON. Addison Wesley

Univariate and Multivariate Methods PEARSON. Addison Wesley Time Series Analysis Univariate and Multivariate Methods SECOND EDITION William W. S. Wei Department of Statistics The Fox School of Business and Management Temple University PEARSON Addison Wesley Boston

More information

Working Papers. Cointegration Based Trading Strategy For Soft Commodities Market. Piotr Arendarski Łukasz Postek. No. 2/2012 (68)

Working Papers. Cointegration Based Trading Strategy For Soft Commodities Market. Piotr Arendarski Łukasz Postek. No. 2/2012 (68) Working Papers No. 2/2012 (68) Piotr Arendarski Łukasz Postek Cointegration Based Trading Strategy For Soft Commodities Market Warsaw 2012 Cointegration Based Trading Strategy For Soft Commodities Market

More information

Jim Gatheral Scholarship Report. Training in Cointegrated VAR Modeling at the. University of Copenhagen, Denmark

Jim Gatheral Scholarship Report. Training in Cointegrated VAR Modeling at the. University of Copenhagen, Denmark Jim Gatheral Scholarship Report Training in Cointegrated VAR Modeling at the University of Copenhagen, Denmark Xuxin Mao Department of Economics, the University of Glasgow [email protected] December

More information

TEMPORAL CAUSAL RELATIONSHIP BETWEEN STOCK MARKET CAPITALIZATION, TRADE OPENNESS AND REAL GDP: EVIDENCE FROM THAILAND

TEMPORAL CAUSAL RELATIONSHIP BETWEEN STOCK MARKET CAPITALIZATION, TRADE OPENNESS AND REAL GDP: EVIDENCE FROM THAILAND I J A B E R, Vol. 13, No. 4, (2015): 1525-1534 TEMPORAL CAUSAL RELATIONSHIP BETWEEN STOCK MARKET CAPITALIZATION, TRADE OPENNESS AND REAL GDP: EVIDENCE FROM THAILAND Komain Jiranyakul * Abstract: This study

More information

NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( )

NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( ) Chapter 340 Principal Components Regression Introduction is a technique for analyzing multiple regression data that suffer from multicollinearity. When multicollinearity occurs, least squares estimates

More information

The Engle-Granger representation theorem

The Engle-Granger representation theorem The Engle-Granger representation theorem Reference note to lecture 10 in ECON 5101/9101, Time Series Econometrics Ragnar Nymoen March 29 2011 1 Introduction The Granger-Engle representation theorem is

More information

Chapter 1. Vector autoregressions. 1.1 VARs and the identi cation problem

Chapter 1. Vector autoregressions. 1.1 VARs and the identi cation problem Chapter Vector autoregressions We begin by taking a look at the data of macroeconomics. A way to summarize the dynamics of macroeconomic data is to make use of vector autoregressions. VAR models have become

More information

1 Short Introduction to Time Series

1 Short Introduction to Time Series ECONOMICS 7344, Spring 202 Bent E. Sørensen January 24, 202 Short Introduction to Time Series A time series is a collection of stochastic variables x,.., x t,.., x T indexed by an integer value t. The

More information

Non-Stationary Time Series andunitroottests

Non-Stationary Time Series andunitroottests Econometrics 2 Fall 2005 Non-Stationary Time Series andunitroottests Heino Bohn Nielsen 1of25 Introduction Many economic time series are trending. Important to distinguish between two important cases:

More information

Time Series Analysis

Time Series Analysis Time Series Analysis Forecasting with ARIMA models Andrés M. Alonso Carolina García-Martos Universidad Carlos III de Madrid Universidad Politécnica de Madrid June July, 2012 Alonso and García-Martos (UC3M-UPM)

More information

Analysis of Bayesian Dynamic Linear Models

Analysis of Bayesian Dynamic Linear Models Analysis of Bayesian Dynamic Linear Models Emily M. Casleton December 17, 2010 1 Introduction The main purpose of this project is to explore the Bayesian analysis of Dynamic Linear Models (DLMs). The main

More information

E 4101/5101 Lecture 8: Exogeneity

E 4101/5101 Lecture 8: Exogeneity E 4101/5101 Lecture 8: Exogeneity Ragnar Nymoen 17 March 2011 Introduction I Main references: Davidson and MacKinnon, Ch 8.1-8,7, since tests of (weak) exogeneity build on the theory of IV-estimation Ch

More information

Business cycles and natural gas prices

Business cycles and natural gas prices Business cycles and natural gas prices Apostolos Serletis and Asghar Shahmoradi Abstract This paper investigates the basic stylised facts of natural gas price movements using data for the period that natural

More information

Statistics in Retail Finance. Chapter 6: Behavioural models

Statistics in Retail Finance. Chapter 6: Behavioural models Statistics in Retail Finance 1 Overview > So far we have focussed mainly on application scorecards. In this chapter we shall look at behavioural models. We shall cover the following topics:- Behavioural

More information

A Trading Strategy Based on the Lead-Lag Relationship of Spot and Futures Prices of the S&P 500

A Trading Strategy Based on the Lead-Lag Relationship of Spot and Futures Prices of the S&P 500 A Trading Strategy Based on the Lead-Lag Relationship of Spot and Futures Prices of the S&P 500 FE8827 Quantitative Trading Strategies 2010/11 Mini-Term 5 Nanyang Technological University Submitted By:

More information

Example G Cost of construction of nuclear power plants

Example G Cost of construction of nuclear power plants 1 Example G Cost of construction of nuclear power plants Description of data Table G.1 gives data, reproduced by permission of the Rand Corporation, from a report (Mooz, 1978) on 32 light water reactor

More information

Chapter 5: The Cointegrated VAR model

Chapter 5: The Cointegrated VAR model Chapter 5: The Cointegrated VAR model Katarina Juselius July 1, 2012 Katarina Juselius () Chapter 5: The Cointegrated VAR model July 1, 2012 1 / 41 An intuitive interpretation of the Pi matrix Consider

More information

INDIRECT INFERENCE (prepared for: The New Palgrave Dictionary of Economics, Second Edition)

INDIRECT INFERENCE (prepared for: The New Palgrave Dictionary of Economics, Second Edition) INDIRECT INFERENCE (prepared for: The New Palgrave Dictionary of Economics, Second Edition) Abstract Indirect inference is a simulation-based method for estimating the parameters of economic models. Its

More information

Time Series Analysis III

Time Series Analysis III Lecture 12: Time Series Analysis III MIT 18.S096 Dr. Kempthorne Fall 2013 MIT 18.S096 Time Series Analysis III 1 Outline Time Series Analysis III 1 Time Series Analysis III MIT 18.S096 Time Series Analysis

More information

FORECASTING DEPOSIT GROWTH: Forecasting BIF and SAIF Assessable and Insured Deposits

FORECASTING DEPOSIT GROWTH: Forecasting BIF and SAIF Assessable and Insured Deposits Technical Paper Series Congressional Budget Office Washington, DC FORECASTING DEPOSIT GROWTH: Forecasting BIF and SAIF Assessable and Insured Deposits Albert D. Metz Microeconomic and Financial Studies

More information

ECON20310 LECTURE SYNOPSIS REAL BUSINESS CYCLE

ECON20310 LECTURE SYNOPSIS REAL BUSINESS CYCLE ECON20310 LECTURE SYNOPSIS REAL BUSINESS CYCLE YUAN TIAN This synopsis is designed merely for keep a record of the materials covered in lectures. Please refer to your own lecture notes for all proofs.

More information

MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS

MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS MSR = Mean Regression Sum of Squares MSE = Mean Squared Error RSS = Regression Sum of Squares SSE = Sum of Squared Errors/Residuals α = Level of Significance

More information

Introduction to Regression and Data Analysis

Introduction to Regression and Data Analysis Statlab Workshop Introduction to Regression and Data Analysis with Dan Campbell and Sherlock Campbell October 28, 2008 I. The basics A. Types of variables Your variables may take several forms, and it

More information

Fractionally integrated data and the autodistributed lag model: results from a simulation study

Fractionally integrated data and the autodistributed lag model: results from a simulation study Fractionally integrated data and the autodistributed lag model: results from a simulation study Justin Esarey July 1, 215 Abstract Two contributions in this issue, Grant and Lebo (215) and Keele, Linn

More information

Testing for Unit Roots: What Should Students Be Taught?

Testing for Unit Roots: What Should Students Be Taught? Testing for Unit Roots: What Should Students Be Taught? John Elder and Peter E. Kennedy Abstract: Unit-root testing strategies are unnecessarily complicated because they do not exploit prior knowledge

More information

Multivariate Normal Distribution

Multivariate Normal Distribution Multivariate Normal Distribution Lecture 4 July 21, 2011 Advanced Multivariate Statistical Methods ICPSR Summer Session #2 Lecture #4-7/21/2011 Slide 1 of 41 Last Time Matrices and vectors Eigenvalues

More information

Module 5: Multiple Regression Analysis

Module 5: Multiple Regression Analysis Using Statistical Data Using to Make Statistical Decisions: Data Multiple to Make Regression Decisions Analysis Page 1 Module 5: Multiple Regression Analysis Tom Ilvento, University of Delaware, College

More information

1 Solving LPs: The Simplex Algorithm of George Dantzig

1 Solving LPs: The Simplex Algorithm of George Dantzig Solving LPs: The Simplex Algorithm of George Dantzig. Simplex Pivoting: Dictionary Format We illustrate a general solution procedure, called the simplex algorithm, by implementing it on a very simple example.

More information

Recall this chart that showed how most of our course would be organized:

Recall this chart that showed how most of our course would be organized: Chapter 4 One-Way ANOVA Recall this chart that showed how most of our course would be organized: Explanatory Variable(s) Response Variable Methods Categorical Categorical Contingency Tables Categorical

More information

Is the Forward Exchange Rate a Useful Indicator of the Future Exchange Rate?

Is the Forward Exchange Rate a Useful Indicator of the Future Exchange Rate? Is the Forward Exchange Rate a Useful Indicator of the Future Exchange Rate? Emily Polito, Trinity College In the past two decades, there have been many empirical studies both in support of and opposing

More information

Marketing Mix Modelling and Big Data P. M Cain

Marketing Mix Modelling and Big Data P. M Cain 1) Introduction Marketing Mix Modelling and Big Data P. M Cain Big data is generally defined in terms of the volume and variety of structured and unstructured information. Whereas structured data is stored

More information

The Relationship Between Crude Oil and Natural Gas Prices

The Relationship Between Crude Oil and Natural Gas Prices The Relationship Between Crude Oil and Natural Gas Prices by Jose A. Villar Natural Gas Division Energy Information Administration and Frederick L. Joutz Department of Economics The George Washington University

More information

15.062 Data Mining: Algorithms and Applications Matrix Math Review

15.062 Data Mining: Algorithms and Applications Matrix Math Review .6 Data Mining: Algorithms and Applications Matrix Math Review The purpose of this document is to give a brief review of selected linear algebra concepts that will be useful for the course and to develop

More information

Some useful concepts in univariate time series analysis

Some useful concepts in univariate time series analysis Some useful concepts in univariate time series analysis Autoregressive moving average models Autocorrelation functions Model Estimation Diagnostic measure Model selection Forecasting Assumptions: 1. Non-seasonal

More information

State Space Time Series Analysis

State Space Time Series Analysis State Space Time Series Analysis p. 1 State Space Time Series Analysis Siem Jan Koopman http://staff.feweb.vu.nl/koopman Department of Econometrics VU University Amsterdam Tinbergen Institute 2011 State

More information

Least Squares Estimation

Least Squares Estimation Least Squares Estimation SARA A VAN DE GEER Volume 2, pp 1041 1045 in Encyclopedia of Statistics in Behavioral Science ISBN-13: 978-0-470-86080-9 ISBN-10: 0-470-86080-4 Editors Brian S Everitt & David

More information

Financial Integration of Stock Markets in the Gulf: A Multivariate Cointegration Analysis

Financial Integration of Stock Markets in the Gulf: A Multivariate Cointegration Analysis INTERNATIONAL JOURNAL OF BUSINESS, 8(3), 2003 ISSN:1083-4346 Financial Integration of Stock Markets in the Gulf: A Multivariate Cointegration Analysis Aqil Mohd. Hadi Hassan Department of Economics, College

More information

EMPIRICAL INVESTIGATION AND MODELING OF THE RELATIONSHIP BETWEEN GAS PRICE AND CRUDE OIL AND ELECTRICITY PRICES

EMPIRICAL INVESTIGATION AND MODELING OF THE RELATIONSHIP BETWEEN GAS PRICE AND CRUDE OIL AND ELECTRICITY PRICES Page 119 EMPIRICAL INVESTIGATION AND MODELING OF THE RELATIONSHIP BETWEEN GAS PRICE AND CRUDE OIL AND ELECTRICITY PRICES Morsheda Hassan, Wiley College Raja Nassar, Louisiana Tech University ABSTRACT Crude

More information

Chapter 5. Analysis of Multiple Time Series. 5.1 Vector Autoregressions

Chapter 5. Analysis of Multiple Time Series. 5.1 Vector Autoregressions Chapter 5 Analysis of Multiple Time Series Note: The primary references for these notes are chapters 5 and 6 in Enders (2004). An alternative, but more technical treatment can be found in chapters 10-11

More information

The Best of Both Worlds:

The Best of Both Worlds: The Best of Both Worlds: A Hybrid Approach to Calculating Value at Risk Jacob Boudoukh 1, Matthew Richardson and Robert F. Whitelaw Stern School of Business, NYU The hybrid approach combines the two most

More information

7 Time series analysis

7 Time series analysis 7 Time series analysis In Chapters 16, 17, 33 36 in Zuur, Ieno and Smith (2007), various time series techniques are discussed. Applying these methods in Brodgar is straightforward, and most choices are

More information

Granger Causality between Government Revenues and Expenditures in Korea

Granger Causality between Government Revenues and Expenditures in Korea Volume 23, Number 1, June 1998 Granger Causality between Government Revenues and Expenditures in Korea Wan Kyu Park ** 2 This paper investigates the Granger causal relationship between government revenues

More information

Univariate Time Series Analysis; ARIMA Models

Univariate Time Series Analysis; ARIMA Models Econometrics 2 Spring 25 Univariate Time Series Analysis; ARIMA Models Heino Bohn Nielsen of4 Outline of the Lecture () Introduction to univariate time series analysis. (2) Stationarity. (3) Characterizing

More information

Maximum likelihood estimation of mean reverting processes

Maximum likelihood estimation of mean reverting processes Maximum likelihood estimation of mean reverting processes José Carlos García Franco Onward, Inc. [email protected] Abstract Mean reverting processes are frequently used models in real options. For

More information

South Carolina College- and Career-Ready (SCCCR) Algebra 1

South Carolina College- and Career-Ready (SCCCR) Algebra 1 South Carolina College- and Career-Ready (SCCCR) Algebra 1 South Carolina College- and Career-Ready Mathematical Process Standards The South Carolina College- and Career-Ready (SCCCR) Mathematical Process

More information

Integrated Resource Plan

Integrated Resource Plan Integrated Resource Plan March 19, 2004 PREPARED FOR KAUA I ISLAND UTILITY COOPERATIVE LCG Consulting 4962 El Camino Real, Suite 112 Los Altos, CA 94022 650-962-9670 1 IRP 1 ELECTRIC LOAD FORECASTING 1.1

More information

CORRELATED TO THE SOUTH CAROLINA COLLEGE AND CAREER-READY FOUNDATIONS IN ALGEBRA

CORRELATED TO THE SOUTH CAROLINA COLLEGE AND CAREER-READY FOUNDATIONS IN ALGEBRA We Can Early Learning Curriculum PreK Grades 8 12 INSIDE ALGEBRA, GRADES 8 12 CORRELATED TO THE SOUTH CAROLINA COLLEGE AND CAREER-READY FOUNDATIONS IN ALGEBRA April 2016 www.voyagersopris.com Mathematical

More information

REASSESSMENT OF SUSTAINABILITY OF CURRENT ACCOUNT DEFICIT IN INDIA

REASSESSMENT OF SUSTAINABILITY OF CURRENT ACCOUNT DEFICIT IN INDIA South-Eastern Europe Journal of Economics 1 (2012) 67-79 REASSESSMENT OF SUSTAINABILITY OF CURRENT ACCOUNT DEFICIT IN INDIA AVIRAL KUMAR TIWARI * ICFAI University, Tripura Abstract In this study, we examined

More information

FULLY MODIFIED OLS FOR HETEROGENEOUS COINTEGRATED PANELS

FULLY MODIFIED OLS FOR HETEROGENEOUS COINTEGRATED PANELS FULLY MODIFIED OLS FOR HEEROGENEOUS COINEGRAED PANELS Peter Pedroni ABSRAC his chapter uses fully modified OLS principles to develop new methods for estimating and testing hypotheses for cointegrating

More information

STOCK MARKET VOLATILITY AND REGIME SHIFTS IN RETURNS

STOCK MARKET VOLATILITY AND REGIME SHIFTS IN RETURNS STOCK MARKET VOLATILITY AND REGIME SHIFTS IN RETURNS Chia-Shang James Chu Department of Economics, MC 0253 University of Southern California Los Angles, CA 90089 Gary J. Santoni and Tung Liu Department

More information

Poisson Models for Count Data

Poisson Models for Count Data Chapter 4 Poisson Models for Count Data In this chapter we study log-linear models for count data under the assumption of a Poisson error structure. These models have many applications, not only to the

More information

Introduction to time series analysis

Introduction to time series analysis Introduction to time series analysis Margherita Gerolimetto November 3, 2010 1 What is a time series? A time series is a collection of observations ordered following a parameter that for us is time. Examples

More information

Bias in the Estimation of Mean Reversion in Continuous-Time Lévy Processes

Bias in the Estimation of Mean Reversion in Continuous-Time Lévy Processes Bias in the Estimation of Mean Reversion in Continuous-Time Lévy Processes Yong Bao a, Aman Ullah b, Yun Wang c, and Jun Yu d a Purdue University, IN, USA b University of California, Riverside, CA, USA

More information

NEW YORK STATE TEACHER CERTIFICATION EXAMINATIONS

NEW YORK STATE TEACHER CERTIFICATION EXAMINATIONS NEW YORK STATE TEACHER CERTIFICATION EXAMINATIONS TEST DESIGN AND FRAMEWORK September 2014 Authorized for Distribution by the New York State Education Department This test design and framework document

More information

Sales forecasting # 2

Sales forecasting # 2 Sales forecasting # 2 Arthur Charpentier [email protected] 1 Agenda Qualitative and quantitative methods, a very general introduction Series decomposition Short versus long term forecasting

More information

FIXED EFFECTS AND RELATED ESTIMATORS FOR CORRELATED RANDOM COEFFICIENT AND TREATMENT EFFECT PANEL DATA MODELS

FIXED EFFECTS AND RELATED ESTIMATORS FOR CORRELATED RANDOM COEFFICIENT AND TREATMENT EFFECT PANEL DATA MODELS FIXED EFFECTS AND RELATED ESTIMATORS FOR CORRELATED RANDOM COEFFICIENT AND TREATMENT EFFECT PANEL DATA MODELS Jeffrey M. Wooldridge Department of Economics Michigan State University East Lansing, MI 48824-1038

More information

Fairfield Public Schools

Fairfield Public Schools Mathematics Fairfield Public Schools AP Statistics AP Statistics BOE Approved 04/08/2014 1 AP STATISTICS Critical Areas of Focus AP Statistics is a rigorous course that offers advanced students an opportunity

More information

Clustering in the Linear Model

Clustering in the Linear Model Short Guides to Microeconometrics Fall 2014 Kurt Schmidheiny Universität Basel Clustering in the Linear Model 2 1 Introduction Clustering in the Linear Model This handout extends the handout on The Multiple

More information

Penalized regression: Introduction

Penalized regression: Introduction Penalized regression: Introduction Patrick Breheny August 30 Patrick Breheny BST 764: Applied Statistical Modeling 1/19 Maximum likelihood Much of 20th-century statistics dealt with maximum likelihood

More information

Simple Linear Regression Inference

Simple Linear Regression Inference Simple Linear Regression Inference 1 Inference requirements The Normality assumption of the stochastic term e is needed for inference even if it is not a OLS requirement. Therefore we have: Interpretation

More information

CHAPTER 8 FACTOR EXTRACTION BY MATRIX FACTORING TECHNIQUES. From Exploratory Factor Analysis Ledyard R Tucker and Robert C.

CHAPTER 8 FACTOR EXTRACTION BY MATRIX FACTORING TECHNIQUES. From Exploratory Factor Analysis Ledyard R Tucker and Robert C. CHAPTER 8 FACTOR EXTRACTION BY MATRIX FACTORING TECHNIQUES From Exploratory Factor Analysis Ledyard R Tucker and Robert C MacCallum 1997 180 CHAPTER 8 FACTOR EXTRACTION BY MATRIX FACTORING TECHNIQUES In

More information

Overview of Factor Analysis

Overview of Factor Analysis Overview of Factor Analysis Jamie DeCoster Department of Psychology University of Alabama 348 Gordon Palmer Hall Box 870348 Tuscaloosa, AL 35487-0348 Phone: (205) 348-4431 Fax: (205) 348-8648 August 1,

More information

Tutorial 5: Hypothesis Testing

Tutorial 5: Hypothesis Testing Tutorial 5: Hypothesis Testing Rob Nicholls [email protected] MRC LMB Statistics Course 2014 Contents 1 Introduction................................ 1 2 Testing distributional assumptions....................

More information

Financial Risk Forecasting Chapter 8 Backtesting and stresstesting

Financial Risk Forecasting Chapter 8 Backtesting and stresstesting Financial Risk Forecasting Chapter 8 Backtesting and stresstesting Jon Danielsson London School of Economics 2015 To accompany Financial Risk Forecasting http://www.financialriskforecasting.com/ Published

More information

Auxiliary Variables in Mixture Modeling: 3-Step Approaches Using Mplus

Auxiliary Variables in Mixture Modeling: 3-Step Approaches Using Mplus Auxiliary Variables in Mixture Modeling: 3-Step Approaches Using Mplus Tihomir Asparouhov and Bengt Muthén Mplus Web Notes: No. 15 Version 8, August 5, 2014 1 Abstract This paper discusses alternatives

More information

Analysis and Computation for Finance Time Series - An Introduction

Analysis and Computation for Finance Time Series - An Introduction ECMM703 Analysis and Computation for Finance Time Series - An Introduction Alejandra González Harrison 161 Email: [email protected] Time Series - An Introduction A time series is a sequence of observations

More information

The Basic Two-Level Regression Model

The Basic Two-Level Regression Model 2 The Basic Two-Level Regression Model The multilevel regression model has become known in the research literature under a variety of names, such as random coefficient model (de Leeuw & Kreft, 1986; Longford,

More information

On the long run relationship between gold and silver prices A note

On the long run relationship between gold and silver prices A note Global Finance Journal 12 (2001) 299 303 On the long run relationship between gold and silver prices A note C. Ciner* Northeastern University College of Business Administration, Boston, MA 02115-5000,

More information

Time Series Analysis 1. Lecture 8: Time Series Analysis. Time Series Analysis MIT 18.S096. Dr. Kempthorne. Fall 2013 MIT 18.S096

Time Series Analysis 1. Lecture 8: Time Series Analysis. Time Series Analysis MIT 18.S096. Dr. Kempthorne. Fall 2013 MIT 18.S096 Lecture 8: Time Series Analysis MIT 18.S096 Dr. Kempthorne Fall 2013 MIT 18.S096 Time Series Analysis 1 Outline Time Series Analysis 1 Time Series Analysis MIT 18.S096 Time Series Analysis 2 A stochastic

More information

Electricity Demand for Sri Lanka: A Time Series Analysis

Electricity Demand for Sri Lanka: A Time Series Analysis SEEDS Surrey Energy Economics Discussion paper Series SURREY ENERGY ECONOMICS CENTRE Electricity Demand for Sri Lanka: A Time Series Analysis Himanshu A. Amarawickrama and Lester C Hunt October 2007 SEEDS

More information

COURSES: 1. Short Course in Econometrics for the Practitioner (P000500) 2. Short Course in Econometric Analysis of Cointegration (P000537)

COURSES: 1. Short Course in Econometrics for the Practitioner (P000500) 2. Short Course in Econometric Analysis of Cointegration (P000537) Get the latest knowledge from leading global experts. Financial Science Economics Economics Short Courses Presented by the Department of Economics, University of Pretoria WITH 2015 DATES www.ce.up.ac.za

More information

You Are What You Bet: Eliciting Risk Attitudes from Horse Races

You Are What You Bet: Eliciting Risk Attitudes from Horse Races You Are What You Bet: Eliciting Risk Attitudes from Horse Races Pierre-André Chiappori, Amit Gandhi, Bernard Salanié and Francois Salanié March 14, 2008 What Do We Know About Risk Preferences? Not that

More information

Econometrics Simple Linear Regression

Econometrics Simple Linear Regression Econometrics Simple Linear Regression Burcu Eke UC3M Linear equations with one variable Recall what a linear equation is: y = b 0 + b 1 x is a linear equation with one variable, or equivalently, a straight

More information

I. Basic concepts: Buoyancy and Elasticity II. Estimating Tax Elasticity III. From Mechanical Projection to Forecast

I. Basic concepts: Buoyancy and Elasticity II. Estimating Tax Elasticity III. From Mechanical Projection to Forecast Elements of Revenue Forecasting II: the Elasticity Approach and Projections of Revenue Components Fiscal Analysis and Forecasting Workshop Bangkok, Thailand June 16 27, 2014 Joshua Greene Consultant IMF-TAOLAM

More information

4. Simple regression. QBUS6840 Predictive Analytics. https://www.otexts.org/fpp/4

4. Simple regression. QBUS6840 Predictive Analytics. https://www.otexts.org/fpp/4 4. Simple regression QBUS6840 Predictive Analytics https://www.otexts.org/fpp/4 Outline The simple linear model Least squares estimation Forecasting with regression Non-linear functional forms Regression

More information

Introduction to Principal Components and FactorAnalysis

Introduction to Principal Components and FactorAnalysis Introduction to Principal Components and FactorAnalysis Multivariate Analysis often starts out with data involving a substantial number of correlated variables. Principal Component Analysis (PCA) is a

More information

3. Regression & Exponential Smoothing

3. Regression & Exponential Smoothing 3. Regression & Exponential Smoothing 3.1 Forecasting a Single Time Series Two main approaches are traditionally used to model a single time series z 1, z 2,..., z n 1. Models the observation z t as a

More information

Masters in Financial Economics (MFE)

Masters in Financial Economics (MFE) Masters in Financial Economics (MFE) Admission Requirements Candidates must submit the following to the Office of Admissions and Registration: 1. Official Transcripts of previous academic record 2. Two

More information

How To Model A Series With Sas

How To Model A Series With Sas Chapter 7 Chapter Table of Contents OVERVIEW...193 GETTING STARTED...194 TheThreeStagesofARIMAModeling...194 IdentificationStage...194 Estimation and Diagnostic Checking Stage...... 200 Forecasting Stage...205

More information

Java Modules for Time Series Analysis

Java Modules for Time Series Analysis Java Modules for Time Series Analysis Agenda Clustering Non-normal distributions Multifactor modeling Implied ratings Time series prediction 1. Clustering + Cluster 1 Synthetic Clustering + Time series

More information

Multiple Regression: What Is It?

Multiple Regression: What Is It? Multiple Regression Multiple Regression: What Is It? Multiple regression is a collection of techniques in which there are multiple predictors of varying kinds and a single outcome We are interested in

More information

Elements of statistics (MATH0487-1)

Elements of statistics (MATH0487-1) Elements of statistics (MATH0487-1) Prof. Dr. Dr. K. Van Steen University of Liège, Belgium December 10, 2012 Introduction to Statistics Basic Probability Revisited Sampling Exploratory Data Analysis -

More information