Threshold Autoregressive Model for a Time Series Data

Similar documents
TIME SERIES ANALYSIS

TIME SERIES ANALYSIS

Forecasting areas and production of rice in India using ARIMA model

Forecasting of Paddy Production in Sri Lanka: A Time Series Analysis using ARIMA Model

Advanced Forecasting Techniques and Models: ARIMA

Time Series - ARIMA Models. Instructor: G. William Schwert

Using JMP Version 4 for Time Series Analysis Bill Gjertsen, SAS, Cary, NC

Some useful concepts in univariate time series analysis

USE OF ARIMA TIME SERIES AND REGRESSORS TO FORECAST THE SALE OF ELECTRICITY

Energy Load Mining Using Univariate Time Series Analysis

Studying Achievement

Threshold Autoregressive Models in Finance: A Comparative Approach

TIME-SERIES ANALYSIS, MODELLING AND FORECASTING USING SAS SOFTWARE

Time Series Analysis

Univariate and Multivariate Methods PEARSON. Addison Wesley

Causes of Inflation in the Iranian Economy

Time Series Analysis

Rob J Hyndman. Forecasting using. 11. Dynamic regression OTexts.com/fpp/9/1/ Forecasting using R 1

The SAS Time Series Forecasting System

MGT 267 PROJECT. Forecasting the United States Retail Sales of the Pharmacies and Drug Stores. Done by: Shunwei Wang & Mohammad Zainal

Session 7 Bivariate Data and Analysis

COMP6053 lecture: Time series analysis, autocorrelation.

Regression and Time Series Analysis of Petroleum Product Sales in Masters. Energy oil and Gas

JOHANNES TSHEPISO TSOKU NONOFO PHOKONTSI DANIEL METSILENG FORECASTING SOUTH AFRICAN GOLD SALES: THE BOX-JENKINS METHODOLOGY

Predicting Indian GDP. And its relation with FMCG Sales

Testing for Granger causality between stock prices and economic growth

Graphical Tools for Exploring and Analyzing Data From ARIMA Time Series Models

Chapter 27 Using Predictor Variables. Chapter Table of Contents

Software Review: ITSM 2000 Professional Version 6.0.

Time Series Analysis

ADVANCED FORECASTING MODELS USING SAS SOFTWARE

THE UNIVERSITY OF CHICAGO, Booth School of Business Business 41202, Spring Quarter 2014, Mr. Ruey S. Tsay. Solutions to Homework Assignment #2

Module 5: Multiple Regression Analysis

The Combination Forecasting Model of Auto Sales Based on Seasonal Index and RBF Neural Network

5. Multiple regression

AUTOMATION OF ENERGY DEMAND FORECASTING. Sanzad Siddique, B.S.

Integrated Resource Plan

Traffic Safety Facts. Research Note. Time Series Analysis and Forecast of Crash Fatalities during Six Holiday Periods Cejun Liu* and Chou-Lin Chen

Time Series and Forecasting

Chapter 1. Vector autoregressions. 1.1 VARs and the identi cation problem

Time series Forecasting using Holt-Winters Exponential Smoothing

Predictability of Non-Linear Trading Rules in the US Stock Market Chong & Lam 2010

Time Series Analysis

9th Russian Summer School in Information Retrieval Big Data Analytics with R

Luciano Rispoli Department of Economics, Mathematics and Statistics Birkbeck College (University of London)

ITSM-R Reference Manual

Booth School of Business, University of Chicago Business 41202, Spring Quarter 2015, Mr. Ruey S. Tsay. Solutions to Midterm

Turkey s Energy Demand

Forecasting of Economic Quantities using Fuzzy Autoregressive Model and Fuzzy Neural Network

Lecture 2: ARMA(p,q) models (part 3)

16 : Demand Forecasting

Module 3: Correlation and Covariance

FORECAST MODEL USING ARIMA FOR STOCK PRICES OF AUTOMOBILE SECTOR. Aloysius Edward. 1, JyothiManoj. 2

This content downloaded on Tue, 19 Feb :28:43 PM All use subject to JSTOR Terms and Conditions

I. Introduction. II. Background. KEY WORDS: Time series forecasting, Structural Models, CPS

3. Regression & Exponential Smoothing

Exam Solutions. X t = µ + βt + A t,

Sales forecasting # 2

Univariate Regression

How To Model A Series With Sas

Time Series Analysis

Time Series Analysis of Household Electric Consumption with ARIMA and ARMA Models

Readers will be provided a link to download the software and Excel files that are used in the book after payment. Please visit

Time Series Analysis: Basic Forecasting.

EMPIRICAL INVESTIGATION AND MODELING OF THE RELATIONSHIP BETWEEN GAS PRICE AND CRUDE OIL AND ELECTRICITY PRICES

Forecasting the US Dollar / Euro Exchange rate Using ARMA Models

Macroeconomic drivers of private health insurance coverage. nib Health Insurance

TIME SERIES ANALYSIS & FORECASTING

Analysis of the Volatility of the Electricity Price in Kenya Using Autoregressive Integrated Moving Average Model

Analysis of algorithms of time series analysis for forecasting sales

Section A. Index. Section A. Planning, Budgeting and Forecasting Section A.2 Forecasting techniques Page 1 of 11. EduPristine CMA - Part I

Chapter Seven. Multiple regression An introduction to multiple regression Performing a multiple regression on SPSS

Chapter 10. Key Ideas Correlation, Correlation Coefficient (r),

Forecasting Using Eviews 2.0: An Overview

2.2 Elimination of Trend and Seasonality

Time-Series Analysis CHAPTER General Purpose and Description 18-1

JetBlue Airways Stock Price Analysis and Prediction

4. Simple regression. QBUS6840 Predictive Analytics.

Forecasting Analytics. Group members: - Arpita - Kapil - Kaushik - Ridhima - Ushhan

Analysis and Computation for Finance Time Series - An Introduction

Time Series Analysis and Forecasting

MSwM examples. Jose A. Sanchez-Espigares, Alberto Lopez-Moreno Dept. of Statistics and Operations Research UPC-BarcelonaTech.

Time Series Laboratory

Multiple Linear Regression

Time Series Analysis and Forecasting Methods for Temporal Mining of Interlinked Documents

PITFALLS IN TIME SERIES ANALYSIS. Cliff Hurvich Stern School, NYU

THE SVM APPROACH FOR BOX JENKINS MODELS

An Effective Time Series Analysis for Stock Trend Prediction Using ARIMA Model for Nifty Midcap-50

Univariate Time Series Analysis; ARIMA Models

Vector Time Series Model Representations and Analysis with XploRe

THE IMPACT OF EXCHANGE RATE VOLATILITY ON BRAZILIAN MANUFACTURED EXPORTS

DEPARTMENT OF PSYCHOLOGY UNIVERSITY OF LANCASTER MSC IN PSYCHOLOGICAL RESEARCH METHODS ANALYSING AND INTERPRETING DATA 2 PART 1 WEEK 9

Transcription:

Jour. Ind. Soc. Ag. Statistics 53(2),2000: 151-160 Threshold Autoregressive Model for a Time Series Data C. Kesavan Nampoothiri and N. Balakrishna Cochin University of Science and Technology, Cochin-682022 (Received: May, 1999) SUMMARY In this paper we try to fit a threshold autoregressive (TAR) model to time series data of monthly coconut oil prices at Cochin market. The procedure proposed by Tsay [7] for fitting the TAR model is briefly presented. The fitted model is compared with a simple autoregressive (AR) model. The results are in favour of TAR process. Thus the monthly coconut oil prices exhibit a type of non-linearity which can be accounted for by a threshold model. Key Words : Autocorrelation functions. Coconut oil, Threshold autoregressive model. 1. Introduction Coconut is one of the most important crops cultivated by a large number of farmers in Kerala, India. Coconut oil (a major product of coconut) is a vegetable oil used in every household for culinary and toiletry purposes. Since the production of coconuts involves large investments, stability in price is needed for the development of this crop. But the coconut oil prices undergo wide and violent fluctuations. These fluctuations can generally be attributed to various factors. Since the observed prices arise in a time sequence. it is possible that the consecutive observations are dependent. In this study, a time series model based approach has been tried to explain the variations other than trend and seasonal variations. Box and Jenkins [1] provided a method for constructing a time series model in practice. Using this method the current value of the series can be linearly expressed as a function of its previous values. But in practice, non-linearity can often be detected in the underlying structure of the time-series. There are several non-linear models introduced by Tong [4]. Here we consider only the piece-wise models, that is, the models for which discontinuities arise resulting in the change of regimes. Because of the complexities of the method introduced by Tong, it is not widely used in practice. Tsay [7] proposed a simultaneous method for testing the non-linearity and the identification of

152 JOURNAL OF THE INDIAN SOCIETY OF AGRICULTURAL STATISTICS the delay parameter. In this paper, a threshold autoregressive (TAR) model is tried for the monthly coconut oil prices using the method proposed by Tsay and compared it with that of a simple autoregressive model. In the following sections the TAR modelling technique is briefly discussed followed by the results and discussions. 2. Methodology A class of threshold autoregressive models can be defined by (j) (j) (j). Zt=ao+ L 3.j ~-1+~ Ifrj_l::;~ d::;rj (2.1) where j = 1,2,... k is the number of regimes being separated by (k - 1) threshold values ( - 00 ::; TO < r 1 <... < r k = + 00), d is the delay parameter (d ::; p), (ag), ap» i = 1,2,...p and j = 1,2,...,k are model parameters of the regime j and (~(j), j = 1, 2,..., k) is a sequence of independent normal variables with mean zero and variance ~t' Such a process partitions the one-dimensional Eucleadian space into k regimes. The model (2.1) is non-linear when there are at least two regimes with different linear models. For these models, the autoregressive coefficients change over time, changes which are determined by comparing the previous values (back shifted by a delay parameter d) to fixed threshold values. The order of AR models may also change over regimes. The first step in the Tsay's procedure is to test for non-linearity. Consider a TAR model with only two regimes. Let the delay parameter be 'd' and threshold value be rl' then the model can be written as Zt = ahl)+lap)~-i+~p) if ~-d::; rl = ll~) + La(2) 7. + ~(2) if 7 > r (2.2) -v 1 "'t-i ~ "'t-d 1 where t E (p + 1,...,n ), n is the number of observations. Now arrange the observations in ascending order. Let 1t i be the time index of the i th smallest observation, then the model can be rewritten as Z7ti+d = a8)+l~i)zni+d-i+~+d if i ::; s = ag) + La?) Z7ti + d _ j + ~(+ d if i> s (2.3) where i E (p + 1 - d,... n - d ) and Z1IS < ZltS + l' The arranged autoregression grouped the data points into two, so that all the observations in one group I follow the same AR process.

THRESHOWAUTOREGRESSIVE MODEL FOR A TIME SERIES DATA 153 The non-linearity can be tested using the classical F-statistics (Tsay [7]) corresponding to the predictive residuals ~i + d (recursively estimated) of the arranged autoregresion on the regressors (1, Ziti + d _ l',", Ziti + P _ d) (i = P+ 1- d, '",n - d). The recursive estimate of the parameters were computed by the algorithm developed by Ertel and Fowlkes [2], The recursion starts with m = (1';)1+ p observations so that there are (n - p - m) predictive residuals available. tonsider the least squares regression (2.4) for i = m + 1,..., n - p and compute the statistic I~ [ (p+ -11:] 1) F(p,d} =----2::------... (2.5) Ill t (n-i-2p-m) follows approximately an F-distribution with (p + 1) and (n - 1-2p - m) degrees of freedom. The significance of the F-distribution detects the presence of non-linearity. Now for fixed 'p' and 'd', the value of 'd p ' is chosen from the values (1,2,... p) as d = max (F(P,S» p ls5sp where F(P,5) is the statistic as defined by (2.5). After obtaining the delay parameter the next step is to obtain the values of the thresholds. To obtain these values, use the scatter plots of the t ratios of the recursive estimates of the AR coefficients versus ZI_ dp' The t ratio starts to turn at the threshold values, otherwise it would smoothly converge to a fixed point as the number of observations in the recursive estimation increases. After identifying AR order 'p' and the delay parameter 'd p ' we can refine the order for each regime and the value of the threshold based on the Akaike's Information Criterion (AIC). The AIC is calculated for different values 'p' and for each potential threshold values. This criterion is defined as AIC(p) = N 10g(RSS/N) +(p+ I) log(n) RSS is the residual sum of squares, N is the effective number of observations and p is the number of independent parameters of the model.

154 JOURNAL OF THE INDIAN SOCIETY OF AGRICULTURAL STATISTICS 3. Analysis of the Monthly Coconut Oil Prices The monthly coconut oil prices show an upward trend during the period 1978-96 (Fig-I). Apart from the sharp increase in prices, fluctuations in the prices of coconut oil within the year have also been seen. Generally speaking, the highest price in any year was received during the period November to January whereas the price tended to decline during the peak monsoon months (June- July) or the months immediately preceding it. These seasonal fluctuations in prices may be due to the seasonality in the production of the coconuts. About 60 percent of the nuts were harvested during the first six months of the year. Because of the abundant availability of oil in the market, the prices are generally low during the summer months and during the early periods of monsoon. During the second half of the year, the production of the coconuts and the availability of copra are low and these cause the rise in the market prices of coconut oil. Monthly Coconut Oil Prices 1978-96 7r---------------------------------------------~ s 4 2.1 ~ ~ ~--~---L---L---L~~--~--~--~--~~ 2 22.42 62 82 102 122 142 162 182 202 222 -x- ORIGINAL PRICE -0- DIFFERENCE Fig. 1 Monthly coconut oil prices (1978-96)

THRESHOWA UTOREGRESSIVE MODEL FOR A TIME SERIES DATA 155 In this paper we are interested in the univariate time series analysis of the monthly coconut oil prices in the Cochin market. The data consists of 228 monthly observations from January 1978 to December 1996. Since a fairly good estimate of the parameters of the series in obtained only if the series is stationary, we take the first order difference of the prices (i.e., if Zt is the price then the first order difference is VZ t = ZI- Zt I) for further analysis (Fig-I). Firstly we try to model the monthly coconut oil prices using Univariate Box-Jenkins (UBJ) method and then using the threshold autoregressive (TAR) method. In the UBJ technique a model can be fitted to the data by studying the characteristics of the auto correlation functions (ACF) and partial autocorrelation functions (PACF). After identifying the order and nature of the relationship, the parameters can be estimated. These models are used for short-term forecasting, because most of the autoregressive integrated moving average models place emphasis on recent past rather than the distant past. The ACF and PACF of the differenced series converge to zero reasonably quickly. The cutoff of the PACF after the lag two (Fig. 2) recommends an autoregressive process of order two for the series. Also an examination of the AlC and SSR ACF and PACF of first order difference 0.4 r-------------------------...(). '-7-~~---I~I-~16:----::2!-:-1--!:26---.!3~1--:L-----L..--..1.-..-1 36 41 6 -x- ACF -.':)- PACF Fig. 2. ACF and PACF of first order difference

156 JOURNAL OF THE INDIAN SOCIETY OF AGRICULTURAL STATISTICS for different orders of 'p' and 'q' (Table-l) suggests an AR(2) is more appropriate for the series. After fitting a UBJ model a non-linear model was also tried to get a better representation. Table 1. Estimates of the parameters of ARMA (p, q) p=l q=o p=2 q:::o p=l q=l p=2 q=l p:::2 q=2 Constant 22.77 (16.37) 24.89 (21.16) 25.10 (21.31) 24.90 (21.23) 22.36 (16.92) AR(I) -0.308 (0.065) 0.243 (0.066) 0.688 (0.136) 0.247 (0.286) 1.446 (0.217) AR (2) 0.236 (0.067) MA(I) 0.401 (0.166) 0.235 (0.108) 0.004 (0.293) -0.616 (0.163) 1.193 (0.217) MA(2) -0.575 (0.129) AIC 2981 2306 2974 2973 2971 RSS 6592777 6262937 6362139 6243104 6169479 Error Variance 40833 27844 28381 27699 27753 AIC - Akaike Information Criteria RSS - Residual Sum of Square AR (p) - Autoregressive process of order p MA (q) - Moving average process of order q After selecting the order 'po of the AR process the next step is to test the non-linearity' using the Tsay's statistic (2.5). Here the AR order is p = 2 therefore the delay parameter can take the value either 1 or 2. The recursion starts with 25 observations so that there are 200 predictive residuals. The values of the F-statistics for the delay parameter d = 1 and d = 2 are given in Table-2. Since the F value is maximum for d = 1. we choose the delay parameter as d p = 1. The second step is to locate the threshold value using the scatter plot of the t-ratios of the recursive estimates of an AR coefficient versus VZ. _dp' The scatter diagram tells the location of the delay parameter directly. The t ratios of the estimates behave exactly as those of a linear time series before the recursion reaches the threshold value r i, Once

THRESHOWAUTOREGRESSIVE MODEL FOR A TIME SERIES DATA 157 rj is reached t ratio begins to deviate. The scatter plot (Fig-3) of the t ratios indicate the possible threshold values are around -100, 40 and 100. Since it needs a minimum of 50 observations for an accurate parameter estimation, we choose the values as -100 and 40, that is, a threshold model with three regimes. There are 51, 74, and 100 observations in the first, second and third regimes respectively. In order to refine the AR order in each regimes and threshold values, we used the Akaike Information Criterion (AIC). Since AIC is minimum for p = 2 and for r l = - 100 and r 2 = 40 we choose the order of AR as two and the threshold values as -100 and 40. The parameters are estimated for all the models (Table-2). The identified TAR (2, 2, 1) is as follows V'Zt = -45.11+0.1088V'Zt_I-0.OO418V'Zt_2+~~I) if V'Zt_1 $ -100 = 10.88 + 0.0297 V'~-l - 0.12370 V'~-2 + ~~2) if -100 < V'Zt_l $ 40 = 67.69+0.3419V'~_1-0.3250V'~_2+~~3) other wise (3.1) Table 2. Estimates of the autoregressive parameters for TAR (2, 3, 1) and AR (2) TAR (2, 3,1) model AR (2) model Constant -45.11 10.88 67.67 24.89 1 0.1088 0.0297 0.3419 0.243 (0.145) (0.118) (0.092) (0.066) 2-0.00418-0.123 0.0169 0.236 (0.146) (0.121) (0.125) (0.067) AIC 528 702 1022 2306 RSS 2168589 1044825 3191737 6262937 Residual 44257 14313 32569 27844 Variance Table 2a. Values of the F-statistic d F (8, d) 1.81 2 1.20

158 JOURNAL OF THE INDIAN SOCIETY OF AGRICULTURAL STATISTICS t ratios of the Coefficients 0<) 025 5 '13 0,2 IS 0.1$ r1 01 ii: 0<05 -O<os -01 119 110.$4 29 42 11 102 1S6 2S} Zt-l Fig, 3a. t ratios of the Coefficients t ratios of the Coefficients 0.4 0.3 i:!.~ u,0.2 1 fil 0.1 0-0,1 119 110 54 29 42 7a 101 IS6 153 Zt-2 Fig. 3b, t ratios of the Coefficients

THRESHOLD AUTOREGRESSIVE MODEL FOR A TIME SERIES DATA 159 In the diagnostic stage, we compute the ACF of the residuals for each models. Most of the ACF are within the 2cr limits shows the independence of the residuals. The sum of squared residuals and the AIC are less for TAR model than that for ordinary AR model. The forecast percent error (Fig-4) is also minimum for TAR model. These observations are in favour of modeling the series by a TAR process. 40 Forecast Percent Error 30... 20 E 10 III... C III ~ 0 III Jl. 10 20 30 20S 210 2., 220 -AR Fig. 4. Forecast Percent Error TAR 4. Concluding Remarks The values of the F-statistic do not show any non-linearity in the series. But other factors like the scatter plot of the t ratios, the values of RSS and AIC are in favour of the TAR process. The percent forecast error for TAR process is lower than that for an AR process. These factors are in favour of modeling the coconut oil prices using TAR process. Tsay used the scatter plot to identify the threshold values. The pattern of the gradual convergence of the t ratios starts to tum when the recursion reaches the threshold values. Here the scatter diagram starts to turn when the value of difference between the previous two observations reaches around - 100 and 40. Thus the TAR process gives a better fit for time series of coconut oil prices. For an observation Zt' the model change is identified using the difference of previous two observations.

160 JOURNAL OF THE INDIAN SOCIE7Y OF AGRICULTURAL STATISTICS REFERENCES [1] Box, G.E.P. and Jenkins, G.M. (1970). Time Series Analysis: Forecasting and Control. San Francisco: Holden-Day. [2] Ertel, J.E. and Fowlkes, E.B. (1976). Some algorithms for linear spline and piecewise multiple regressions. J. Am. Statist. Assoc., 71, 640-648. [3] Jacob, Mathew (1978). Trend and fluctuations in the prices of coconut and coconut oil. M. Phil dissertation, Jawaharlal Nehru University, New Delhi. [4] Tong, H. (1990). Non-linear Time Series: A Dynamic System Approach. Oxford University Press, New York. [5] Tong, H. and Lim, K.S. (1980). Threshold autoregression, limit cycles and cyclical data (with discussion). J. Roy. Statist. Soc., B 142, 245-292. [6] Tsay, R.S. (1986). Non-linearity tests for time series. Biometrica, 73,461-466. [7] Tsay, R.S. (1989). Testing and modelling threshold autoregressive processes. J. Am. Statist. Assoc., 84, 231-240. [8] Watier, L. and Richardson, S. (1995). Modelling of an epidemiological time series by a threshold autoregressive model. The Statistician, 44, 353-364.