Co-integration of Stock Markets using Wavelet Theory and Data Mining



Similar documents
WAVEFORM DICTIONARIES AS APPLIED TO THE AUSTRALIAN EXCHANGE RATE

GLOBAL STOCK MARKET INTEGRATION - A STUDY OF SELECT WORLD MAJOR STOCK MARKETS

ANALYSIS OF EUROPEAN, AMERICAN AND JAPANESE GOVERNMENT BOND YIELDS

Introduction to time series analysis

A Wavelet Based Prediction Method for Time Series

Is the Forward Exchange Rate a Useful Indicator of the Future Exchange Rate?

A Novel Method to Improve Resolution of Satellite Images Using DWT and Interpolation

Wavelet analysis. Wavelet requirements. Example signals. Stationary signal 2 Hz + 10 Hz + 20Hz. Zero mean, oscillatory (wave) Fast decay (let)

Prediction of Stock Performance Using Analytical Techniques

Univariate and Multivariate Methods PEARSON. Addison Wesley

Examining the Relationship between ETFS and Their Underlying Assets in Indian Capital Market

Hong Kong Stock Index Forecasting

Concepts in Investments Risks and Returns (Relevant to PBE Paper II Management Accounting and Finance)

SPEECH SIGNAL CODING FOR VOIP APPLICATIONS USING WAVELET PACKET TRANSFORM A

COMPARISON OF CURRENCY CO-MOVEMENT BEFORE AND AFTER OCTOBER 2008

An Empirical Comparison of Moving Average Envelopes and Bollinger Bands

Real Estate as a Strategic Asset Class. Less is More: Private Equity Investments` Benefits. How to Invest in Real Estate?

by Maria Heiden, Berenberg Bank

8.1 Summary and conclusions 8.2 Implications

GLOBAL CURRENCY MOVEMENT AND GENERALIZED RULE INDUCTION

This study concludes that Asian investors would benefit from greater global diversification.

Section A. Index. Section A. Planning, Budgeting and Forecasting Section A.2 Forecasting techniques Page 1 of 11. EduPristine CMA - Part I

A Review of Cross Sectional Regression for Financial Data You should already know this material from previous study

Summary Nonstationary Time Series Multitude of Representations Possibilities from Applied Computational Harmonic Analysis Tests of Stationarity

This document is downloaded from DR-NTU, Nanyang Technological University Library, Singapore.

JPEG Image Compression by Using DCT

Open Access A Facial Expression Recognition Algorithm Based on Local Binary Pattern and Empirical Mode Decomposition

Sampling Theorem Notes. Recall: That a time sampled signal is like taking a snap shot or picture of signal periodically.

The European Central Bank s Minimum Bid Rate and Its Effect on Major Currency Pairs

International Transmission of Stock Market Movements : A Wavelet Analysis on MENA Stock Markets

How To Segmentate Heart Sound

Data Mining for Manufacturing: Preventive Maintenance, Failure Prediction, Quality Control

Share Price Change: the Efficient Market Hypothesis and the whitenoise Hypothesis Dichotomy

Goldman Sachs Electronic Trading India: Algorithmic Trading. FIXGlobal Face2Face Electronic Trading Forum - India

A Study of the Relation Between Market Index, Index Futures and Index ETFs: A Case Study of India ABSTRACT

A Trading Strategy Based on the Lead-Lag Relationship of Spot and Futures Prices of the S&P 500

International linkages of Japanese bond markets: an empirical analysis

Sachin Patel HOD I.T Department PCST, Indore, India. Parth Bhatt I.T Department, PCST, Indore, India. Ankit Shah CSE Department, KITE, Jaipur, India

SGN-1158 Introduction to Signal Processing Test. Solutions

Working Papers. Cointegration Based Trading Strategy For Soft Commodities Market. Piotr Arendarski Łukasz Postek. No. 2/2012 (68)

Wavelet Analysis Based Estimation of Probability Density function of Wind Data

The Dynamic Co-movements between Oil and Stock Market returns in: The Case of GCC Countries

Machine Learning in Statistical Arbitrage

The Merits of Absolute Return Quantitative Investment Strategies

Agilent Creating Multi-tone Signals With the N7509A Waveform Generation Toolbox. Application Note

TDS5000B, TDS6000B, TDS/CSA7000B Series Acquisition Modes

Modelling, Extraction and Description of Intrinsic Cues of High Resolution Satellite Images: Independent Component Analysis based approaches

U.S. SHORT TERM. Equity Risk Model IDENTIFY ANALYZE QUANTIFY NORTHFIELD

INVESTMENT BARRIERS AND STOCK MARKET PERFORMANCE An Evidence from Emerging Markets

Artificial Neural Network and Non-Linear Regression: A Comparative Study

A Big Data Analytical Framework For Portfolio Optimization Abstract. Keywords. 1. Introduction

Improved Trend Following Trading Model by Recalling Past Strategies in Derivatives Market

An Evaluation of Irregularities of Milled Surfaces by the Wavelet Analysis

Time-Frequency Detection Algorithm of Network Traffic Anomalies

Gerard Mc Nulty Systems Optimisation Ltd BA.,B.A.I.,C.Eng.,F.I.E.I

Midwest Symposium On Circuits And Systems, 2004, v. 2, p. II137-II140. Creative Commons: Attribution 3.0 Hong Kong License

Neural Network Applications in Stock Market Predictions - A Methodology Analysis

Chapter 10 Introduction to Time Series Analysis

Global Currency Hedging

The Scientific Data Mining Process

Matlab GUI for WFB spectral analysis

Redundant Wavelet Transform Based Image Super Resolution

STRESS TESTING GUIDELINE

Final Year Project Progress Report. Frequency-Domain Adaptive Filtering. Myles Friel. Supervisor: Dr.Edward Jones

SOFTWARE FOR GENERATION OF SPECTRUM COMPATIBLE TIME HISTORY

This document introduces the principles behind LDI, how LDI strategies work and how to decide on an appropriate approach for your pension scheme.

Correlation between Growth Rate and Non Performing Retail Loans in the Republic of Macedonia

High Frequency Equity Pairs Trading: Transaction Costs, Speed of Execution and Patterns in Returns

By choosing to view this document, you agree to all provisions of the copyright laws protecting it.

Dirac Live & the RS20i

Import Prices and Inflation

Answers to Concepts in Review

Marketing Mix Modelling and Big Data P. M Cain

Or part of or all the risk is dynamically hedged trading regularly, with a. frequency that needs to be appropriate for the trade.

Computer Networks and Internets, 5e Chapter 6 Information Sources and Signals. Introduction

Image Compression through DCT and Huffman Coding Technique

Discussion Paper 01: Aviation Demand Forecasting

Broadband Networks. Prof. Dr. Abhay Karandikar. Electrical Engineering Department. Indian Institute of Technology, Bombay. Lecture - 29.

A new approach for event study of private placement announcement effect: Evidence from China Yuling Zhao 1,a*

not possible or was possible at a high cost for collecting the data.

Jim Gatheral Scholarship Report. Training in Cointegrated VAR Modeling at the. University of Copenhagen, Denmark

Low Contrast Image Enhancement Based On Undecimated Wavelet Transform with SSR

Predictive modelling around the world

Filtering method in wireless sensor network management based on EMD algorithm and multi scale wavelet analysis

Financial Market Efficiency and Its Implications

Internet Traffic Variability (Long Range Dependency Effects) Dheeraj Reddy CS8803 Fall 2003

A Secure File Transfer based on Discrete Wavelet Transformation and Audio Watermarking Techniques

Application of Predictive Model for Elementary Students with Special Needs in New Era University

SNMP Simple Network Measurements Please!

Stock Market Forecasting Using Machine Learning Algorithms

CONTRACTS FOR DIFFERENCE

SIGNAL PROCESSING & SIMULATION NEWSLETTER

Asset allocation A key component of a successful investment strategy

Time-Correlated Multi-domain RF Analysis with the MSO70000 Series Oscilloscope and SignalVu Software

Financial Markets of Emerging Economies Part I: Do Foreign Investors Contribute to their Volatility? Part II: Is there Contagion from Mature Markets?

Futures Trading: The Way to Go?

Analysis/resynthesis with the short time Fourier transform

The Dangers of Using Correlation to Measure Dependence

CFDs and Liquidity Provision

Co-movements of NAFTA trade, FDI and stock markets

Transcription:

Co-integration of Stock Markets using Wavelet Theory and Data Mining R.Sahu P.B.Sanjeev rsahu@iiitm.ac.in sanjeev@iiitm.ac.in ABV-Indian Institute of Information Technology and Management, India Abstract The most powerful argument for cross-border investing is the risk reduction due to low correlation of world s stock markets. Risk diversification has become more important as financial markets globalize. Today, financial markets are becoming increasingly related due to advanced information technology which lowers the transaction costs. Diversifying internationally in markets with low correlation with domestic markets reduces the systematic risk and investors must be willing to take advantage of these correlations to reduce volatility in their portfolios. The paper tries to analyze the long-run relationship among seven prominent stock indices using Wavelet theory. The emanating cointegration results are further substantiated using the Apriori association rule mining. The findings suggest that there is strong to moderate co-integration among many stock markets. 1.0 Introduction After the inception of stock indices, economists have attempted to explain and forecast the movements of indices using macroeconomic fundamentals. Most of these analysis techniques are based on the quantitative approach that is often implemented with the classical data analysis technique that is ideal for stationary signals (time invariant). However, the real data for the stock indices, using tick-by-tick observations obtained, are no longer accepted to be stationary. Since the conventional statistical methods have proven to be inadequate to describe the evolutionary nature of most of the real-world time series data, the research community has provided us an alternative perspective of looking into the data by using the signal processing approach. The paper uses wavelet transformation to study co-movement of stock markets that are highly non-linear and non-stationary dynamic processes. The joint time-frequency nature of the wavelet analysis helps to separate the underlying trends found in the stock data for identification of local patterns at various at varying time granularities and the same is compared with Apriori association rule mining. 2.0 Co integration of Stock Markets In an era of increasing globalization, the transmission of movements in international financial markets is an important issue for economic policy, especially in periods where markets are highly volatile. The determination of diversification strategies by an international investor also depends crucially on the nature and magnitude of the relationships existing between different stock markets. Thus it becomes important for international investors to understand the interrelations among the various markets to diversify risk and to derive high return. The present work tries to investigate the relationship the prominent stock markets particularly the Cac40 of France, the of Germany, the of Great Britan, the Sensex of India, the of USA, the HangSeng of Hong Kong and the Kospi of South Korea.

The selection of these markets has been based on the following sapient considerations- Each business day begins in each financial center, first at Tokyo, to the Asean markets, to Europe, to London and further towards New York. Investors can respond to currency fluctuations caused by economic, social and political events at the time frames they occur (day or night) and hence mitigate their portfolio risks. The large capitalization of the indices and the huge volume of shares traded in each of the indices. The developed and developing nation status of the countries housing these indices. Affiliation of most of these indices to Asean block, European Free Trade Agreement (EFTA) and North Atlantic Free Trade Agreement (NAFTA). These three primary economic blocks account for 80 percent of the world trade. The data of the Stock market indices for of the selected countries for a period a period to 10 years (1997-2007) totaling to 2,375 days of observations were and tested for existence of long run relationships. The data is publicly available at Source: http://finance.yahoo.com. The Economic and financial data are intrinsically discrete, and hence the discrete wavelet transforms offers a parsimonious and cost effective discrete transformation of the data. The wavelet analysis can also differentiate between short (high-frequency) shocks and longer (low-frequency) shocks and subsequently their impact on the other markets. 3.0 Wavelet Transform Transformations are applied to signals/data to obtain further information from them which is not readily visible in the raw signal. In time series data, when we plot timedomain signals, we obtain a time-amplitude representation of the signal. This representation is not always the best representation of the signal for most signal processing related applications. In many cases, the most distinguished information is hidden in the frequency content of the signal. In the conventional methods, the Fourier Transform (FT) of a signal in time domain gives the frequency-amplitude representation of that signal which reveals the extent of each frequency in the signal. But it does not reveal when in time these frequency components exist. This information is not required when the signal is stationary where all frequency components exist at all times. Hence, the FT is not a suitable technique for a non-stationary signal. On the other hand, the Wavelet transform is capable of providing the time and frequency information simultaneously, thereby giving a time-frequency representation of the signal. Often, a particular spectral component occurring at a particular instant can be of special interest. In these cases it may be very beneficial to know the time intervals these particular spectral components occur. The Wavelet Transform (WT) is a suitable method for analyzing non-stationary signals, i.e., whose frequency response varies in time. To overcome the non-stationarity, the signal is divided into smaller segments that can be assumed to be stationary. For this purpose, a window function "w" is chosen. The width of this window must be equal to the segment of the signal where its stationarity is valid. In case of wavelet transform, the width of the window changes as the transform is computed for every single spectral component, which is probably the most significant

characteristic of the wavelet transform. This overcomes the problem of resolution (a measure of the amount of detail information in the signal) by suitably selecting the appropriate window size to capture the detailed information within the signal/data. In wavelet analysis, the choice of a prototype for generating window functions is important. This implies that the functions with different region of support that are used in the transformation process are derived from one main function, or the mother wavelet. The choice of the mother wavelet is important when analyzing a given time series for two reasons. First, the length of the discrete wavelet function determines how well it will be able to isolate features to specific frequency intervals. Second, it is being used to represent the hidden information contained in time series. 3.1 Discrete Wavelet Transform Coefficients using Haar Wavelets The Discrete Wavelet Transformation, DWT, analyzes the input signal for different frequency bands with different resolutions by decomposing the original signal into a coarse approximation and detail information through the coefficients. DWT employs two sets of functions, called scaling functions and wavelet functions, which are associated with low-pass and high-pass filters, respectively. The decomposition of the signal into different frequency bands is simply obtained by successive high-pass and low-pass filtering of the time domain signal. The original signal x[n] is first passed through a halfband high-pass filter g[n] and a low-pass filter h[n] as shown in Figure 1. h[n] g[n] Figure 1: Approximation and Detail Coefficients using DWT After each filtering, half of the samples/data are can be segregated (or eliminated) as per the Nyquist s rule. Since the high-pass filtered signal has now the highest frequency of π/2 radians instead of π, the signal can therefore be sub-sampled by 2. This constitutes one level of decomposition. Output from the high pass filter is downloaded as the Level 1 Detail D1, and the output from the low pass filter becomes the Level 1 Approximation, A1. Starting afresh with A1, the process can be successively repeated as per requirements. Again, as h(n) and g(n) form perfect reconstruction filters, the original data can be reconstructed from the down-sampled coefficients. With successive filtering, the level of frequency resolution increases at the expense of time resolution. There are a wide variety of popular wavelet algorithms, like Daubechies wavelets, Mexican Hat wavelets and Morlet wavelets. The present work adopts DWT using Haar wavelet as the mother wavelet as it is the simplest and comparatively computationally less intensive. Some of the advantages of Haar wavelet transform are: conceptually simple, computationally fast, memory efficient as they can be calculated without a temporary array, are exactly reversible without the edge effects unlike the other wavelet transforms and are good choice to detect localized information.

4.0 Results from DWT As stated earlier, The DWT yields two types of coefficients: Approximation Coefficients and Detail Coefficients. The Detail Coefficients indicate the short bursts in the financial data which are more of impulsive reactions to news and events. On the other hand, the Approximation Coefficients denote the average behaviour of the indices in the long run and are considered for determination relationship of movement between the stock indices. Following the computation of the approximation and detail coefficients, the approximation coefficients were also subjected to SPSS for determining the pair wise correlation coefficients between the indices. The plots of coefficients from wavelet analysis of the stock indices are shown in Figure 2 through Figure 15 where the x-axis denotes the serial of the approximation / detail coefficients and the y-axis denotes the value of the corresponding coefficients. Figure 2: Approximation coefficients for CAC 40 Figure 3: Detail coefficients for CAC 40. Figure 4: Approximation coefficients for Figure 5: Detail coefficients for

Figure 6: Approximation coefficients for XetraDax Figure 7: Detail coefficients for XetraDax Figure 8: Approximation coefficients for Sensex Figure 9: Detail coefficients for Sensex Figure 10: Approximation coefficients for Figure 11: Detail coefficients for

Figure 12: Approximation coefficients for HangSeng. Figure 13: Detail coefficients for HangSeng Figure 14: Approximation coefficients for Kospi Figure 15: Detail coefficients for Kospi Based on the coefficient of correlation that lies between -1 to +1, the test pairs can be objectively classified for strong, moderate and weak long-term correlation based on the following corollary(s). The Pearson correlation above 0.75 indicates a relationship and that below 0.50 is adopted as a relationship. The Pearson coefficient between 0.50 and 0.75 is considered as a relationship. Table 1 summarizes the pair-wise Pearson correlation coefficient using SPSS 10.0. Table 2 summarizes the cointegration pairs with their respective degrees of association.

Table 1: Pair wise Pearson Correlation coefficients among the stock indices Sensex Cac40 XetraDax HangSeng Kospi Sensex 1.000.341.361.279.168.756.900 Cac40.341 1.000.946.914.802.644.269.361.946 1.000.825.833.695.345.279.914.825 1.000.721.523.178.168.802.833.721 1.000.534.233 HangSeng.756.644.695.523.534 1.000.749 Kospi.900.269.345.178.233.749 1.000 Table 2: Degrees of Cointegration among stock indices using DWT Stock Indices Degree of Association Sensex- Cac40 Sensex- Sensex- Sensex- Sensex-HangSeng Sensex-Kospi Cac40- XetraDax Cac40- FTSE100 Cac40- Cac40- HangSeng Cac40- Kospi XetraDax XetraDax XetraDax HangSeng XetraDax Kospi HangSeng - Kospi HangSeng - Kospi HangSeng - Kospi 4.0 Co-integration using Association Rule Mining The association rules try to discover association or correlation relationships among a large set of data items. They identify collections of data attributes that are statistically related in the underlying data. An association rule is of the form X => Y where X and Y are disjoint conjunctions of attribute-value pairs. The confidence (probability count) of the rule is the conditional probability of Y given X, Probability(Y X), and the support of the rule is the prior probability of X and Y, Probability(X and Y). Support (A => B) = P (A U B). Confidence (A => B) = P (B A).

The support count aims at alienating the item sets which occur infrequently within the data set and hence are irrelevant in the final associations. The confidence or the probability count establishes the intensity of the association whether it is weak, moderate or strong. The present work uses Apriori, a classic algorithm for implementing and finding out association rules, that attempts to find subsets which are common to a minimum support count and confirm to a minimum probability count. Table 3 summarizes the significant outcomes from Apriori association rule mining at various support counts. Table 4 summarizes the cointegration pairs with their respective degrees of association. Support Count Probability Count Greater than 40 Greater than 60 Greater than 80 Table 3: Significant Stock Market Relations at various Support Counts 150 140 130 120 Cac 40 FTSE 100 Cac 40 HangSeng Cac 40 Xetra Dax Kospi HangSeng Kospi HangSeng Sensex Cac 40 Cac 40 FTSE 100 Cac 40 HangSeng Cac 40 Xetra Dax Cac 40! XetraDax! XetraDax FTSE 100 HangSeng Kospi Cac 40 FTSE 100 Cac 40 Xetra Dax Cac40! Xetra Dax! XetraDax FTSE 100! represents a downward movement Cac 40 Ftse100 ^ XetraDax Cac 40 ^ Ftse100 XetraDax Cac 40 ^ XetraDax Ftse100 Sensex ^ HangSeng Kospi Sensex Kospi XetraDax HangSeng Sensex ^ Cac 40 Xetra Dax Cac 40 Ftse100 ^ XetraDax Cac 40 ^ Ftse100 XetraDax Cac 40 ^ XetraDax Ftse100 Sensex ^ HangSeng Kospi Sensex Kospi XetraDax HangSeng Sensex Kospi XetraDax Cac 40 Hang Seng ^ ^ ^ ^ HangSeng ^ HangSeng HangSeng ^ Kospi ^ HangSeng Sensex Cac 40 ^ Sensex Kospi ^ HangSeng ^ ^ ^ ^ HangSeng ^ HangSeng HangSeng ^ ^ Cac 40 ^ Cac 40 Cac 40 ^ ^ ^ HangSeng ^ Sensex Kospi Cac 40 ^ ^ Cac 40 ^ ^ Cac 40 ^ ^ Cac 40 ^ ^ Cac 40 HangSeng ^ Kospi HangSeng Sensex HangSeng Cac 40 ^ Cac 40 ^ Cac 40 ^ Cac 40 HangSeng Sensex HangSeng Cac40 ^ Cac40^ Cac40^

Table 4: Degrees of Cointegration among stock indices using Apriori Algorithm Stock Indices Cac40 Cac 40 - Cac 40 Sensex-Kospi Cac 40 HangSeng XetraDax XetraDax Cac40- FTSE100-XetraDax XetraDax HangSeng Cac40- HangSeng HangSeng Kospi HangSeng HangSeng XetraDax Kospi Sensex Cac 40 Sensex Cac40 - - - Degree of Association 5.0 Conclusion The assumption of stationary series is hardly satisfied in economic and financial modeling. Transformation of the data to reduce the effect of non-stationarity may lead to inaccurate inferences. To achieve stationariity, the signal is decomposed into small series of segments which can be assumed to be near stationary without any loss of information. The paper outlines such a noble approach of determining long-run relationships among stock markets. The approach is a loss-less approach that filters the high-frequency shocks from the low frequency signals. The approach also provides a measure of information variations incorporated at different levels of decompositions in-depth analysis. Using data from the stock markets of France, Germany, England, USA, India, Hong Kong and South Korea, the Wavelet analysis finds a strong co-integration between France-UK, France- Germany and France-US, India-South Korea, Germany UK and Germany-US stock indices. It also exposes a moderate relationship between the India Hong Kong, France- Hong Kong, Germany Hong Kong, UK- USA and Hong Kong-South Korea stock indices. The analysis using Apiori association data mining reveals a similar scenario of cointegration among the selected stock markets. However, the association mining that suggests a strong co-integration between France-Hong Kong indices though the Wavelet approach suggests a moderate co-integration between them. The wavelet theory presents a faster and a more efficient method to derive the results as it gives a cumulative information variation at various levels of decomposition, which are most suitable for deriving long-run relationships. Morover, the Apriori algorithm calls for assigning the support counts and this demands a large number of trials to derive the appropriate results.

References 1. Hall, J.; Mani, G.; and Barr, D. (1996), Applying Computational Intelligence to the Investment Process, Proceedings of CIFER-96: Computational Intelligence in Financial Engineering, Washington, D.C. 2. Cheng, B., and Titterington, D. M. (1994), Neural Networks A Review from a Statistical Perspective. Statistical Science, Vol. 9, No. 1, pp. 2 30. 3. Prabakaran, S., R. Sahu, and S. Verma (2006), Feature Selection using Haar Wavelet Power Spectrum, International Journal on BMC Bioinformatics, Vol. 7, Issue 1, pp. 1 11. 4. S. Prabakaran, R. Sahu, S. Verma (2006), Characteristics of Microarray Data using Wavelet Power Spectrum, International Journal of Knowledge-based and Intelligence Engineering Systems, Vol. 10, No.6, pp. 493 501. 5. S. Prabakaran, R. Sahu, and S. Verma (2006), Analysis of Microarray Data under Wavelet Power Spectrum, International Journal of Engineering Letters, Vol. 13 No. 3, EL_13_3_17 (Advance online publication: 4 November 2006). 6. S. Prabakaran, R. Sahu and S. Verma (2006), A Feature Selection Method using Wavelet Power Spectrum, Print Version Available: December 2006,, IAENG International Journal of Computer Science, Vol. 32, No. 4, pp. 419-423.