Machine Learning in Statistical Arbitrage
|
|
|
- Pearl Cameron
- 10 years ago
- Views:
Transcription
1 Machine Learning in Statistical Arbitrage Xing Fu, Avinash Patra December 11, 2009 Abstract We apply machine learning methods to obtain an index arbitrage strategy. In particular, we employ linear regression and support vector regression (SVR) onto the prices of an exchange-traded fund and a stream of stocks. By using principal component analysis (PCA) in reducing the dimension of feature space, we observe the benefit and note the issues in application of SVR. To generate trading signals, we model the residuals from the previous regression as a mean reverting process. At the end, we show how our trading strategies beat the market. 1 Introduction In the field of investment, statistical arbitrage refers to attempting to profit from pricing inefficiencies identified through mathematical models. The basic assumption is that prices will move towards a historical average. The most commonly used and simplest case of statistical arbitrage is pairs trading. If stocks P and Q are in the same industry or have similar characteristics, we expect the returns of the two stocks to track each other. Accordingly, if P t and Q t denote the corresponding price time series, then we can model the system as dp t P t = αdt + β dq t Q t + dx t, where X t is a mean reverting Ornstein-Uhlenbeck stochastic process. In many cases of interest, the drift α is small compared to the fluctuations of X t and can therefore be neglected. The model suggests a contrarian investment strategy in which we go long one dollar of stock P and short beta dollars of stock Q if X t is small and, conversely, go short P and long Q if X t is large. Opportunities for this kind of pairs trading depend upon the existence of similar pairs of assets, and thus are naturally limited. Here, we use a natural extension of pairs trading called index arbitrage, as described in [1]. The trading system we develop tries to exploit discrepancies between a target asset, the ishares FTSE/MACQ traded in the London Stock Exchange, and a synthetic artificial asset represented by a data stream obtained as a linear combination of a possibly large set of explanatory streams assumed to be correlated with the target stream. In our case, we use the stock prices of the 100 constituents of FTSE 100 Index to reproduce the target asset. We start with linear regression on the 100 constituents and take the window as 101 days from April to September After extracting significant factors by Principal Component Analysis, we calibrate the model cutting the window size down to 60 days. Closed prices of each asset are used. The emphasis here is to decompose the stock data into systematic and idiosyncratic components and statistically model the idiosyncratic part. We apply both Linear Regression and Supported Vector Regression, and collect the residual that remains after the decomposition is done. Finally, we study the mean-reversion using auto-regression model in the residuals to generate trading signals for our target asset. 1
2 2 Linear Regression We consider the data of 101 days from Apr.28 to Sep.18, The reason we choose this time period is that given 100 feature variables, we need at least 101 observations to train the 101 parameters in the linear regression model. To avoid overfitting parameters, we only use 101 training examples. Denote P t to be target asset and Q it to be the ith stock constituent at time t. The linear model can be written as 100 P t = θ 0 + θ i Q it. i=1 Implementing the ordinary least squares algorithm in MATLAB, we get parameters θ and training errors, i.e. residuals. we apply Principal Component Analysis to reduce the dimension of the model. Figure 2: Generalization error on 30 testing days 3 Principal Component Analysis (PCA) Now we apply PCA to analyze the 100 stocks. The estimation window for the correlation matrix is 101 days. The eigenvalues at the top of the spectrum which are isolated from the bulk spectrum are obviously significant. The problem becomes evident by looking at eigenvalues of the correlation matrix in Figures 3. Apparently, the Figure 1: Residuals of linear regression over 100 constituents From Figure 1, we see that the empirical error is acceptable. However, when we test this linear model using 30 examples not in the training set, i.e. prices of each asset from Sep. 21 to Oct. 30, 2009, the generalization error is far from satisfying, as shown in Figure 2. This implies that we are facing an overfitting problem, even though we ve used the smallest possible training set. There are so many parameters in the linear model, which gets us into this problem. Hence, Figure 3: eigenvalue of the correlation matrix eigenvalues within the first twenty reveal almost 2
3 all information of the matrix. Now we apply validation rules to find how many principal components to use that can give us the least generalization error. Given that the dimension of the model is reduced, we reset our window size to 60 days to avoid overfitting problem. After running multivariate linear regression within the first 20 components using 60 days training set, we find that the first 12 components give the smallest generalization error on the 30 testing days. The out-of-sample residual is shown in Figure 4. Figure 5: Empirical error on the first 12 components over 60 training days 4 Support Vector Regression (SVR) Figure 4: Generalization error on the first 12 components over 30 out-of-sample days We apply SVR on the 12 feature attributes obtained through PCA. We use a Gaussian kernel and empirically decide on the kernel bandwidth, cost and epsilon (slack variable) parameters. (We used SVM and Kernel Methods MAT- LAB toolbox, available online for implementation). We do not notice any improvement in this approach,as seen in the plots of training error and test error, Figure 6 and Figure 7. A main issue here is to be able to determine appropriate SVR parameters. We take a quick look back at the empirical error of these 12 components over 60 training days. From Figure 5, we see that the residuals are not as satisfying as in Figure 1, in terms of magnitude, but residuals here successfully reveal the trend of residuals using 100 constituents. Thus, by applying PCA to reduce the dimension of the model, we avoid overfitting parameters. We will use the in-sample residuals from regression on principal components to generate trading signals in a following section. 5 Mean Reverting Process We want to model the price of the target asset such that it accounts for a drift which measures systematic deviations from the sector and a price fluctuation that is mean-reverting to the overall industry level. We construct a trading strategy when significant fluctuations from equilibrium are observed. We introduce a parametric mean reverting model for the asset, the Ornstein-Uhlembeck 3
4 Figure 6: Empirical error on the first 12 components over 60 training days using SVR process, dx(t) = κ(m X(t))dt + σdw (t), κ > 0. The term dx(t) is assumed to be the increment of a stationary stochastic process which models price fluctuations corresponding to idiosyncratic fluctuations in the prices which are not reflected in the industry sector, i.e. the residuals from linear regression on principal components in the previous section. Note that the increment dx(t) has unconditional mean zero and conditional mean equal to E{dX(t) X(s), s t} = κ(m X(t))dt. The conditional mean, i.e. the forecast of expected daily returns, is positive or negative according to the sign of m X(t). This process is stationary and can be estimated by an auto-regression with lag 1. We use the residuals on a window of length 60 days, assuming the parameters are constant over the window. In fact, the model is shown as X t+1 = a + bx t + ɛ t+1, t = 1, 2,, 59, where we have a = m(1 e κ t ) Figure 7: Generalization error on the first 12 components over 30 testing days using SVR Hence, we get b = e κ t V ar(ɛ) = σ 2 1 e κ t. 2κ κ = log(b) 101 = a m = 1 b = V ar(ɛ) 2κ σ = 1 b 2 = V ar(ɛ) σ eq = 1 b 2 = Signal Generation We define a scalar varaible called the s-score s = X(t) m σ eq. The s-score measures the distance of the cointegrated residual from equilibrium per unit standard deviation, i.e. how far away a given stock is from the theoretical equilibrium value associated with our model. According to empirical studies in this field, our basic trading signal based on mean-reversion is 4
5 1. Buy ishare FTSE if s < Sell ishare FTSE if s > Close short position in ishare FTSE if s < Close long position in ishare FTSE if s > 0.5 The rationale for opening trades only when the s-score s is far from equilibrium is to trade only when we think that we detected an anomalous escursion of the co-integration residual. We then need to consider when we close trades. Closing trades when the s-score is near zero makes sense, since we expect most stocks to be near equilibrium most of the time. Thus, our trading rule detects stocks with large excursions and trades assuming these excursions will revert to the mean. Figure 8 shows the signals we generate over 60 days. 60 days data) that using our strategy based on the trading signals clearly makes a trading profit as on average we purchase ishare FTSE when it is low and sell it when it is high. On the other hand, in the future, idiosyncratic factors behaving erratically (such as occurrence of some specific unforeseen event) might lead to the index systematically under-performing or doing much better than the significant factors found from PCA and this could seriously undermine the effectiveness of our approach. To achieve a systematic method, online learning would be a good way to try out, in order to update our feature set with the latest information. References [1] G. Montana, K. Triantafyllopoulos and T. Tsagaris, Flexible least squares for temporal data mining and statistical arbitrage, Expert Systems with Applications, Vol. 36, Issue 2, Part 2, pp , [2] A. Marco and Jeong-Hyun Lee, Statistical Arbitrage in the U.S. Equities Market, Social Science Research Network, [3] T. Fletcher, Support Vector Macines Explained, 2009 Figure 8: Trading signals over 60 days 7 Conclusion We note that PCA helped in getting rid of overfitting problem with 100 feature attributes, while conducting linear regression. However, we see that to effectively apply Support Vector Regression, a technique to learn SVR-parameters might have to be developed. We see on testing (over 5
A Regime-Switching Model for Electricity Spot Prices. Gero Schindlmayr EnBW Trading GmbH [email protected]
A Regime-Switching Model for Electricity Spot Prices Gero Schindlmayr EnBW Trading GmbH [email protected] May 31, 25 A Regime-Switching Model for Electricity Spot Prices Abstract Electricity markets
Simple Linear Regression Inference
Simple Linear Regression Inference 1 Inference requirements The Normality assumption of the stochastic term e is needed for inference even if it is not a OLS requirement. Therefore we have: Interpretation
CS 2750 Machine Learning. Lecture 1. Machine Learning. http://www.cs.pitt.edu/~milos/courses/cs2750/ CS 2750 Machine Learning.
Lecture Machine Learning Milos Hauskrecht [email protected] 539 Sennott Square, x5 http://www.cs.pitt.edu/~milos/courses/cs75/ Administration Instructor: Milos Hauskrecht [email protected] 539 Sennott
Statistical Arbitrage in the U.S. Equities Market
Statistical Arbitrage in the U.S. Equities Market Marco Avellaneda and Jeong-Hyun Lee First draft: July 11, 2008 This version: June 15, 2009 Abstract We study model-driven statistical arbitrage in U.S.
Volatility in the Overnight Money-Market Rate
5 Volatility in the Overnight Money-Market Rate Allan Bødskov Andersen, Economics INTRODUCTION AND SUMMARY This article analyses the day-to-day fluctuations in the Danish overnight money-market rate during
Drugs store sales forecast using Machine Learning
Drugs store sales forecast using Machine Learning Hongyu Xiong (hxiong2), Xi Wu (wuxi), Jingying Yue (jingying) 1 Introduction Nowadays medical-related sales prediction is of great interest; with reliable
Chapter 4: Vector Autoregressive Models
Chapter 4: Vector Autoregressive Models 1 Contents: Lehrstuhl für Department Empirische of Wirtschaftsforschung Empirical Research and und Econometrics Ökonometrie IV.1 Vector Autoregressive Models (VAR)...
Some Quantitative Issues in Pairs Trading
Research Journal of Applied Sciences, Engineering and Technology 5(6): 2264-2269, 2013 ISSN: 2040-7459; e-issn: 2040-7467 Maxwell Scientific Organization, 2013 Submitted: October 30, 2012 Accepted: December
AUTOMATION OF ENERGY DEMAND FORECASTING. Sanzad Siddique, B.S.
AUTOMATION OF ENERGY DEMAND FORECASTING by Sanzad Siddique, B.S. A Thesis submitted to the Faculty of the Graduate School, Marquette University, in Partial Fulfillment of the Requirements for the Degree
Forecasting methods applied to engineering management
Forecasting methods applied to engineering management Áron Szász-Gábor Abstract. This paper presents arguments for the usefulness of a simple forecasting application package for sustaining operational
A Trading Strategy Based on the Lead-Lag Relationship of Spot and Futures Prices of the S&P 500
A Trading Strategy Based on the Lead-Lag Relationship of Spot and Futures Prices of the S&P 500 FE8827 Quantitative Trading Strategies 2010/11 Mini-Term 5 Nanyang Technological University Submitted By:
Risk Decomposition of Investment Portfolios. Dan dibartolomeo Northfield Webinar January 2014
Risk Decomposition of Investment Portfolios Dan dibartolomeo Northfield Webinar January 2014 Main Concepts for Today Investment practitioners rely on a decomposition of portfolio risk into factors to guide
Detecting Network Anomalies. Anant Shah
Detecting Network Anomalies using Traffic Modeling Anant Shah Anomaly Detection Anomalies are deviations from established behavior In most cases anomalies are indications of problems The science of extracting
Stock Price Dynamics, Dividends and Option Prices with Volatility Feedback
Stock Price Dynamics, Dividends and Option Prices with Volatility Feedback Juho Kanniainen Tampere University of Technology New Thinking in Finance 12 Feb. 2014, London Based on J. Kanniainen and R. Piche,
Statistical Machine Learning
Statistical Machine Learning UoC Stats 37700, Winter quarter Lecture 4: classical linear and quadratic discriminants. 1 / 25 Linear separation For two classes in R d : simple idea: separate the classes
BEHAVIOR BASED CREDIT CARD FRAUD DETECTION USING SUPPORT VECTOR MACHINES
BEHAVIOR BASED CREDIT CARD FRAUD DETECTION USING SUPPORT VECTOR MACHINES 123 CHAPTER 7 BEHAVIOR BASED CREDIT CARD FRAUD DETECTION USING SUPPORT VECTOR MACHINES 7.1 Introduction Even though using SVM presents
Financial Market Efficiency and Its Implications
Financial Market Efficiency: The Efficient Market Hypothesis (EMH) Financial Market Efficiency and Its Implications Financial markets are efficient if current asset prices fully reflect all currently available
Multiple Kernel Learning on the Limit Order Book
JMLR: Workshop and Conference Proceedings 11 (2010) 167 174 Workshop on Applications of Pattern Analysis Multiple Kernel Learning on the Limit Order Book Tristan Fletcher Zakria Hussain John Shawe-Taylor
VI. Real Business Cycles Models
VI. Real Business Cycles Models Introduction Business cycle research studies the causes and consequences of the recurrent expansions and contractions in aggregate economic activity that occur in most industrialized
Machine Learning and Pattern Recognition Logistic Regression
Machine Learning and Pattern Recognition Logistic Regression Course Lecturer:Amos J Storkey Institute for Adaptive and Neural Computation School of Informatics University of Edinburgh Crichton Street,
DATA ANALYSIS II. Matrix Algorithms
DATA ANALYSIS II Matrix Algorithms Similarity Matrix Given a dataset D = {x i }, i=1,..,n consisting of n points in R d, let A denote the n n symmetric similarity matrix between the points, given as where
A Study on the Comparison of Electricity Forecasting Models: Korea and China
Communications for Statistical Applications and Methods 2015, Vol. 22, No. 6, 675 683 DOI: http://dx.doi.org/10.5351/csam.2015.22.6.675 Print ISSN 2287-7843 / Online ISSN 2383-4757 A Study on the Comparison
Is the Forward Exchange Rate a Useful Indicator of the Future Exchange Rate?
Is the Forward Exchange Rate a Useful Indicator of the Future Exchange Rate? Emily Polito, Trinity College In the past two decades, there have been many empirical studies both in support of and opposing
Acknowledgments. Data Mining with Regression. Data Mining Context. Overview. Colleagues
Data Mining with Regression Teaching an old dog some new tricks Acknowledgments Colleagues Dean Foster in Statistics Lyle Ungar in Computer Science Bob Stine Department of Statistics The School of the
Effective Linear Discriminant Analysis for High Dimensional, Low Sample Size Data
Effective Linear Discriant Analysis for High Dimensional, Low Sample Size Data Zhihua Qiao, Lan Zhou and Jianhua Z. Huang Abstract In the so-called high dimensional, low sample size (HDLSS) settings, LDA
Cointegration Pairs Trading Strategy On Derivatives 1
Cointegration Pairs Trading Strategy On Derivatives Cointegration Pairs Trading Strategy On Derivatives 1 By Ngai Hang CHAN Co-Authors: Dr. P.K. LEE and Ms. Lai Fun PUN Department of Statistics The Chinese
Chapter 6: Multivariate Cointegration Analysis
Chapter 6: Multivariate Cointegration Analysis 1 Contents: Lehrstuhl für Department Empirische of Wirtschaftsforschung Empirical Research and und Econometrics Ökonometrie VI. Multivariate Cointegration
University of Lille I PC first year list of exercises n 7. Review
University of Lille I PC first year list of exercises n 7 Review Exercise Solve the following systems in 4 different ways (by substitution, by the Gauss method, by inverting the matrix of coefficients
3. Regression & Exponential Smoothing
3. Regression & Exponential Smoothing 3.1 Forecasting a Single Time Series Two main approaches are traditionally used to model a single time series z 1, z 2,..., z n 1. Models the observation z t as a
Time Series Analysis of Aviation Data
Time Series Analysis of Aviation Data Dr. Richard Xie February, 2012 What is a Time Series A time series is a sequence of observations in chorological order, such as Daily closing price of stock MSFT in
Algorithmic Trading: Model of Execution Probability and Order Placement Strategy
Algorithmic Trading: Model of Execution Probability and Order Placement Strategy Chaiyakorn Yingsaeree A dissertation submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy
Introduction to Support Vector Machines. Colin Campbell, Bristol University
Introduction to Support Vector Machines Colin Campbell, Bristol University 1 Outline of talk. Part 1. An Introduction to SVMs 1.1. SVMs for binary classification. 1.2. Soft margins and multi-class classification.
Online Appendix. Supplemental Material for Insider Trading, Stochastic Liquidity and. Equilibrium Prices. by Pierre Collin-Dufresne and Vyacheslav Fos
Online Appendix Supplemental Material for Insider Trading, Stochastic Liquidity and Equilibrium Prices by Pierre Collin-Dufresne and Vyacheslav Fos 1. Deterministic growth rate of noise trader volatility
PITFALLS IN TIME SERIES ANALYSIS. Cliff Hurvich Stern School, NYU
PITFALLS IN TIME SERIES ANALYSIS Cliff Hurvich Stern School, NYU The t -Test If x 1,..., x n are independent and identically distributed with mean 0, and n is not too small, then t = x 0 s n has a standard
Chapter 1. Vector autoregressions. 1.1 VARs and the identi cation problem
Chapter Vector autoregressions We begin by taking a look at the data of macroeconomics. A way to summarize the dynamics of macroeconomic data is to make use of vector autoregressions. VAR models have become
15.062 Data Mining: Algorithms and Applications Matrix Math Review
.6 Data Mining: Algorithms and Applications Matrix Math Review The purpose of this document is to give a brief review of selected linear algebra concepts that will be useful for the course and to develop
Exploratory Factor Analysis and Principal Components. Pekka Malo & Anton Frantsev 30E00500 Quantitative Empirical Research Spring 2016
and Principal Components Pekka Malo & Anton Frantsev 30E00500 Quantitative Empirical Research Spring 2016 Agenda Brief History and Introductory Example Factor Model Factor Equation Estimation of Loadings
Subspace Analysis and Optimization for AAM Based Face Alignment
Subspace Analysis and Optimization for AAM Based Face Alignment Ming Zhao Chun Chen College of Computer Science Zhejiang University Hangzhou, 310027, P.R.China [email protected] Stan Z. Li Microsoft
Sales forecasting # 2
Sales forecasting # 2 Arthur Charpentier [email protected] 1 Agenda Qualitative and quantitative methods, a very general introduction Series decomposition Short versus long term forecasting
High Frequency Equity Pairs Trading: Transaction Costs, Speed of Execution and Patterns in Returns
High Frequency Equity Pairs Trading: Transaction Costs, Speed of Execution and Patterns in Returns David Bowen a Centre for Investment Research, UCC Mark C. Hutchinson b Department of Accounting, Finance
Rob J Hyndman. Forecasting using. 11. Dynamic regression OTexts.com/fpp/9/1/ Forecasting using R 1
Rob J Hyndman Forecasting using 11. Dynamic regression OTexts.com/fpp/9/1/ Forecasting using R 1 Outline 1 Regression with ARIMA errors 2 Example: Japanese cars 3 Using Fourier terms for seasonality 4
Working Papers. Cointegration Based Trading Strategy For Soft Commodities Market. Piotr Arendarski Łukasz Postek. No. 2/2012 (68)
Working Papers No. 2/2012 (68) Piotr Arendarski Łukasz Postek Cointegration Based Trading Strategy For Soft Commodities Market Warsaw 2012 Cointegration Based Trading Strategy For Soft Commodities Market
2.2 Elimination of Trend and Seasonality
26 CHAPTER 2. TREND AND SEASONAL COMPONENTS 2.2 Elimination of Trend and Seasonality Here we assume that the TS model is additive and there exist both trend and seasonal components, that is X t = m t +
CHAPTER 11: ARBITRAGE PRICING THEORY
CHAPTER 11: ARBITRAGE PRICING THEORY 1. The revised estimate of the expected rate of return on the stock would be the old estimate plus the sum of the products of the unexpected change in each factor times
Machine Learning and Data Mining. Regression Problem. (adapted from) Prof. Alexander Ihler
Machine Learning and Data Mining Regression Problem (adapted from) Prof. Alexander Ihler Overview Regression Problem Definition and define parameters ϴ. Prediction using ϴ as parameters Measure the error
Least-Squares Intersection of Lines
Least-Squares Intersection of Lines Johannes Traa - UIUC 2013 This write-up derives the least-squares solution for the intersection of lines. In the general case, a set of lines will not intersect at a
Forecasting in supply chains
1 Forecasting in supply chains Role of demand forecasting Effective transportation system or supply chain design is predicated on the availability of accurate inputs to the modeling process. One of the
1.3. DOT PRODUCT 19. 6. If θ is the angle (between 0 and π) between two non-zero vectors u and v,
1.3. DOT PRODUCT 19 1.3 Dot Product 1.3.1 Definitions and Properties The dot product is the first way to multiply two vectors. The definition we will give below may appear arbitrary. But it is not. It
Impulse Response Functions
Impulse Response Functions Wouter J. Den Haan University of Amsterdam April 28, 2011 General definition IRFs The IRF gives the j th -period response when the system is shocked by a one-standard-deviation
Earnings Announcement and Abnormal Return of S&P 500 Companies. Luke Qiu Washington University in St. Louis Economics Department Honors Thesis
Earnings Announcement and Abnormal Return of S&P 500 Companies Luke Qiu Washington University in St. Louis Economics Department Honors Thesis March 18, 2014 Abstract In this paper, I investigate the extent
Estimating Volatility
Estimating Volatility Daniel Abrams Managing Partner FAS123 Solutions, LLC Copyright 2005 FAS123 Solutions, LLC Definition of Volatility Historical Volatility as a Forecast of the Future Definition of
The VAR models discussed so fare are appropriate for modeling I(0) data, like asset returns or growth rates of macroeconomic time series.
Cointegration The VAR models discussed so fare are appropriate for modeling I(0) data, like asset returns or growth rates of macroeconomic time series. Economic theory, however, often implies equilibrium
Least Squares Estimation
Least Squares Estimation SARA A VAN DE GEER Volume 2, pp 1041 1045 in Encyclopedia of Statistics in Behavioral Science ISBN-13: 978-0-470-86080-9 ISBN-10: 0-470-86080-4 Editors Brian S Everitt & David
Introduction to General and Generalized Linear Models
Introduction to General and Generalized Linear Models General Linear Models - part I Henrik Madsen Poul Thyregod Informatics and Mathematical Modelling Technical University of Denmark DK-2800 Kgs. Lyngby
STRATEGIC CAPACITY PLANNING USING STOCK CONTROL MODEL
Session 6. Applications of Mathematical Methods to Logistics and Business Proceedings of the 9th International Conference Reliability and Statistics in Transportation and Communication (RelStat 09), 21
6.2 Permutations continued
6.2 Permutations continued Theorem A permutation on a finite set A is either a cycle or can be expressed as a product (composition of disjoint cycles. Proof is by (strong induction on the number, r, of
Neural Networks for Sentiment Detection in Financial Text
Neural Networks for Sentiment Detection in Financial Text Caslav Bozic* and Detlef Seese* With a rise of algorithmic trading volume in recent years, the need for automatic analysis of financial news emerged.
On the long run relationship between gold and silver prices A note
Global Finance Journal 12 (2001) 299 303 On the long run relationship between gold and silver prices A note C. Ciner* Northeastern University College of Business Administration, Boston, MA 02115-5000,
Dynamic Relationship between Interest Rate and Stock Price: Empirical Evidence from Colombo Stock Exchange
International Journal of Business and Social Science Vol. 6, No. 4; April 2015 Dynamic Relationship between Interest Rate and Stock Price: Empirical Evidence from Colombo Stock Exchange AAMD Amarasinghe
How can we discover stocks that will
Algorithmic Trading Strategy Based On Massive Data Mining Haoming Li, Zhijun Yang and Tianlun Li Stanford University Abstract We believe that there is useful information hiding behind the noisy and massive
Structural Analysis of Network Traffic Flows Eric Kolaczyk
Structural Analysis of Network Traffic Flows Eric Kolaczyk Anukool Lakhina, Dina Papagiannaki, Mark Crovella, Christophe Diot, and Nina Taft Traditional Network Traffic Analysis Focus on Short stationary
Yao Zheng University of New Orleans. Eric Osmer University of New Orleans
ABSTRACT The pricing of China Region ETFs - an empirical analysis Yao Zheng University of New Orleans Eric Osmer University of New Orleans Using a sample of exchange-traded funds (ETFs) that focus on investing
Analysis of Bayesian Dynamic Linear Models
Analysis of Bayesian Dynamic Linear Models Emily M. Casleton December 17, 2010 1 Introduction The main purpose of this project is to explore the Bayesian analysis of Dynamic Linear Models (DLMs). The main
D-optimal plans in observational studies
D-optimal plans in observational studies Constanze Pumplün Stefan Rüping Katharina Morik Claus Weihs October 11, 2005 Abstract This paper investigates the use of Design of Experiments in observational
17. SIMPLE LINEAR REGRESSION II
17. SIMPLE LINEAR REGRESSION II The Model In linear regression analysis, we assume that the relationship between X and Y is linear. This does not mean, however, that Y can be perfectly predicted from X.
Lecture 3: Linear methods for classification
Lecture 3: Linear methods for classification Rafael A. Irizarry and Hector Corrada Bravo February, 2010 Today we describe four specific algorithms useful for classification problems: linear regression,
Testing for Granger causality between stock prices and economic growth
MPRA Munich Personal RePEc Archive Testing for Granger causality between stock prices and economic growth Pasquale Foresti 2006 Online at http://mpra.ub.uni-muenchen.de/2962/ MPRA Paper No. 2962, posted
Estimating an ARMA Process
Statistics 910, #12 1 Overview Estimating an ARMA Process 1. Main ideas 2. Fitting autoregressions 3. Fitting with moving average components 4. Standard errors 5. Examples 6. Appendix: Simple estimators
Jim Gatheral Scholarship Report. Training in Cointegrated VAR Modeling at the. University of Copenhagen, Denmark
Jim Gatheral Scholarship Report Training in Cointegrated VAR Modeling at the University of Copenhagen, Denmark Xuxin Mao Department of Economics, the University of Glasgow [email protected] December
by Maria Heiden, Berenberg Bank
Dynamic hedging of equity price risk with an equity protect overlay: reduce losses and exploit opportunities by Maria Heiden, Berenberg Bank As part of the distortions on the international stock markets
Trading Basket Construction. Mean Reversion Trading. Haksun Li [email protected] www.numericalmethod.com
Trading Basket Construction Mean Reversion Trading Haksun Li [email protected] www.numericalmethod.com Speaker Profile Dr. Haksun Li CEO, Numerical Method Inc. (Ex-)Adjunct Professors, Industry
Maximum likelihood estimation of mean reverting processes
Maximum likelihood estimation of mean reverting processes José Carlos García Franco Onward, Inc. [email protected] Abstract Mean reverting processes are frequently used models in real options. For
Vector Spaces; the Space R n
Vector Spaces; the Space R n Vector Spaces A vector space (over the real numbers) is a set V of mathematical entities, called vectors, U, V, W, etc, in which an addition operation + is defined and in which
Studying Achievement
Journal of Business and Economics, ISSN 2155-7950, USA November 2014, Volume 5, No. 11, pp. 2052-2056 DOI: 10.15341/jbe(2155-7950)/11.05.2014/009 Academic Star Publishing Company, 2014 http://www.academicstar.us
Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model
Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model 1 September 004 A. Introduction and assumptions The classical normal linear regression model can be written
Designing a learning system
Lecture Designing a learning system Milos Hauskrecht [email protected] 539 Sennott Square, x4-8845 http://.cs.pitt.edu/~milos/courses/cs750/ Design of a learning system (first vie) Application or Testing
Part 2: Analysis of Relationship Between Two Variables
Part 2: Analysis of Relationship Between Two Variables Linear Regression Linear correlation Significance Tests Multiple regression Linear Regression Y = a X + b Dependent Variable Independent Variable
Univariate and Multivariate Methods PEARSON. Addison Wesley
Time Series Analysis Univariate and Multivariate Methods SECOND EDITION William W. S. Wei Department of Statistics The Fox School of Business and Management Temple University PEARSON Addison Wesley Boston
PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 4: LINEAR MODELS FOR CLASSIFICATION
PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 4: LINEAR MODELS FOR CLASSIFICATION Introduction In the previous chapter, we explored a class of regression models having particularly simple analytical
Component Ordering in Independent Component Analysis Based on Data Power
Component Ordering in Independent Component Analysis Based on Data Power Anne Hendrikse Raymond Veldhuis University of Twente University of Twente Fac. EEMCS, Signals and Systems Group Fac. EEMCS, Signals
Discussion of Momentum and Autocorrelation in Stock Returns
Discussion of Momentum and Autocorrelation in Stock Returns Joseph Chen University of Southern California Harrison Hong Stanford University Jegadeesh and Titman (1993) document individual stock momentum:
JetBlue Airways Stock Price Analysis and Prediction
JetBlue Airways Stock Price Analysis and Prediction Team Member: Lulu Liu, Jiaojiao Liu DSO530 Final Project JETBLUE AIRWAYS STOCK PRICE ANALYSIS AND PREDICTION 1 Motivation Started in February 2000, JetBlue
4. Simple regression. QBUS6840 Predictive Analytics. https://www.otexts.org/fpp/4
4. Simple regression QBUS6840 Predictive Analytics https://www.otexts.org/fpp/4 Outline The simple linear model Least squares estimation Forecasting with regression Non-linear functional forms Regression
TWO L 1 BASED NONCONVEX METHODS FOR CONSTRUCTING SPARSE MEAN REVERTING PORTFOLIOS
TWO L 1 BASED NONCONVEX METHODS FOR CONSTRUCTING SPARSE MEAN REVERTING PORTFOLIOS XIAOLONG LONG, KNUT SOLNA, AND JACK XIN Abstract. We study the problem of constructing sparse and fast mean reverting portfolios.
Nonlinear Regression:
Zurich University of Applied Sciences School of Engineering IDP Institute of Data Analysis and Process Design Nonlinear Regression: A Powerful Tool With Considerable Complexity Half-Day : Improved Inference
The Effect of Short-selling Restrictions on Liquidity: Evidence from the London Stock Exchange
The Effect of Short-selling Restrictions on Liquidity: Evidence from the London Stock Exchange Matthew Clifton ab and Mark Snape ac a Capital Markets Cooperative Research Centre 1 b University of Technology,
Private Equity Fund Valuation and Systematic Risk
An Equilibrium Approach and Empirical Evidence Axel Buchner 1, Christoph Kaserer 2, Niklas Wagner 3 Santa Clara University, March 3th 29 1 Munich University of Technology 2 Munich University of Technology
Modelling electricity market data: the CARMA spot model, forward prices and the risk premium
Modelling electricity market data: the CARMA spot model, forward prices and the risk premium Formatvorlage des Untertitelmasters Claudia Klüppelberg durch Klicken bearbeiten Technische Universität München
Trend and Seasonal Components
Chapter 2 Trend and Seasonal Components If the plot of a TS reveals an increase of the seasonal and noise fluctuations with the level of the process then some transformation may be necessary before doing
TEMPORAL CAUSAL RELATIONSHIP BETWEEN STOCK MARKET CAPITALIZATION, TRADE OPENNESS AND REAL GDP: EVIDENCE FROM THAILAND
I J A B E R, Vol. 13, No. 4, (2015): 1525-1534 TEMPORAL CAUSAL RELATIONSHIP BETWEEN STOCK MARKET CAPITALIZATION, TRADE OPENNESS AND REAL GDP: EVIDENCE FROM THAILAND Komain Jiranyakul * Abstract: This study
Two Topics in Parametric Integration Applied to Stochastic Simulation in Industrial Engineering
Two Topics in Parametric Integration Applied to Stochastic Simulation in Industrial Engineering Department of Industrial Engineering and Management Sciences Northwestern University September 15th, 2014
Stock Market Forecasting Using Machine Learning Algorithms
Stock Market Forecasting Using Machine Learning Algorithms Shunrong Shen, Haomiao Jiang Department of Electrical Engineering Stanford University {conank,hjiang36}@stanford.edu Tongda Zhang Department of
Using Options Trading Data to Algorithmically Detect Insider Trading
MS&E 444 Final Project Report Using Options Trading Data to Algorithmically Detect Insider Trading Instructor: Prof. Kay Giesecke TA: Benjamin Armbruster Group Members: 1 Youdan Li Elaine Ou Florin Ratiu
Logistic Regression. Jia Li. Department of Statistics The Pennsylvania State University. Logistic Regression
Logistic Regression Department of Statistics The Pennsylvania State University Email: [email protected] Logistic Regression Preserve linear classification boundaries. By the Bayes rule: Ĝ(x) = arg max
Econometrics Simple Linear Regression
Econometrics Simple Linear Regression Burcu Eke UC3M Linear equations with one variable Recall what a linear equation is: y = b 0 + b 1 x is a linear equation with one variable, or equivalently, a straight
8.1 Summary and conclusions 8.2 Implications
Conclusion and Implication V{tÑàxÜ CONCLUSION AND IMPLICATION 8 Contents 8.1 Summary and conclusions 8.2 Implications Having done the selection of macroeconomic variables, forecasting the series and construction
