Nonparametric Time Series Analysis: A review of Peter Lewis contributions to the field



Similar documents
Threshold Autoregressive Models in Finance: A Comparative Approach

Analysis of Financial Time Series

Regression Modeling Strategies

16 : Demand Forecasting

TEMPORAL CAUSAL RELATIONSHIP BETWEEN STOCK MARKET CAPITALIZATION, TRADE OPENNESS AND REAL GDP: EVIDENCE FROM THAILAND

Predictability of Non-Linear Trading Rules in the US Stock Market Chong & Lam 2010

Energy Demand Forecasting Industry Practices and Challenges

Data Mining and Data Warehousing. Henryk Maciejewski. Data Mining Predictive modelling: regression

BayesX - Software for Bayesian Inference in Structured Additive Regression

Applied Data Mining Analysis: A Step-by-Step Introduction Using Real-World Data Sets

Example: Credit card default, we may be more interested in predicting the probabilty of a default than classifying individuals as default or not.

TOWARD BIG DATA ANALYSIS WORKSHOP

SureSense Software Suite Overview

Two Topics in Parametric Integration Applied to Stochastic Simulation in Industrial Engineering

Better decision making under uncertain conditions using Monte Carlo Simulation

Predictive time series analysis of stock prices using neural network classifier

College Readiness LINKING STUDY

How To Understand The Theory Of Probability

REPORT DOCUMENTATION PAGE

Detection of changes in variance using binary segmentation and optimal partitioning

9.2 User s Guide SAS/STAT. Introduction. (Book Excerpt) SAS Documentation

Quantifying Seasonal Variation in Cloud Cover with Predictive Models

Learning outcomes. Knowledge and understanding. Competence and skills

Data Mining mit der JMSL Numerical Library for Java Applications

Advanced Forecasting Techniques and Models: ARIMA

Solving Regression Problems Using Competitive Ensemble Models

THE HYBRID CART-LOGIT MODEL IN CLASSIFICATION AND DATA MINING. Dan Steinberg and N. Scott Cardell

Introduction to Regression and Data Analysis

Data Preparation and Statistical Displays

Modeling and Analysis of Call Center Arrival Data: A Bayesian Approach

Energy Load Mining Using Univariate Time Series Analysis

A Regime-Switching Model for Electricity Spot Prices. Gero Schindlmayr EnBW Trading GmbH

Chapter 4: Vector Autoregressive Models

Handling attrition and non-response in longitudinal data

Adaptive Demand-Forecasting Approach based on Principal Components Time-series an application of data-mining technique to detection of market movement

Model-Based Recursive Partitioning for Detecting Interaction Effects in Subgroups

Predictive Modeling Techniques in Insurance

The primary goal of this thesis was to understand how the spatial dependence of

Big Data Techniques Applied to Very Short-term Wind Power Forecasting

SPSS TRAINING SESSION 3 ADVANCED TOPICS (PASW STATISTICS 17.0) Sun Li Centre for Academic Computing lsun@smu.edu.sg

Big Data Big Knowledge?

Statistics Graduate Courses

Artificial Neural Networks and Support Vector Machines. CS 486/686: Introduction to Artificial Intelligence

Location matters. 3 techniques to incorporate geo-spatial effects in one's predictive model

Studying Achievement

Least Squares Estimation

Supervised and unsupervised learning - 1

Java Modules for Time Series Analysis

STA 4273H: Statistical Machine Learning

PITFALLS IN TIME SERIES ANALYSIS. Cliff Hurvich Stern School, NYU

> plot(exp.btgpllm, main = "treed GP LLM,", proj = c(1)) > plot(exp.btgpllm, main = "treed GP LLM,", proj = c(2)) quantile diff (error)

Predict the Popularity of YouTube Videos Using Early View Data

System Informatics: Combine Engineering Knowledge and Big Data for Model Building and Decision Making

Marketing Mix Modelling and Big Data P. M Cain

Forecasting methods applied to engineering management

Machine Learning Methods for Demand Estimation

The Operational Value of Social Media Information. Social Media and Customer Interaction

List of Ph.D. Courses

NTC Project: S01-PH10 (formerly I01-P10) 1 Forecasting Women s Apparel Sales Using Mathematical Modeling

Simple Predictive Analytics Curtis Seare

Statistical issues in the analysis of microarray data

D-optimal plans in observational studies

OUTLIER ANALYSIS. Data Mining 1

Chapter 1. Vector autoregressions. 1.1 VARs and the identi cation problem

Data Mining Part 5. Prediction

ANALYTICS IN BIG DATA ERA

Statistical machine learning, high dimension and big data

The Combination Forecasting Model of Auto Sales Based on Seasonal Index and RBF Neural Network

Causal Leading Indicators Detection for Demand Forecasting

Integrated Resource Plan

Linear Models and Conjoint Analysis with Nonlinear Spline Transformations

Penalized Logistic Regression and Classification of Microarray Data

DATA MINING TECHNIQUES AND APPLICATIONS

Machine Learning in FX Carry Basket Prediction

The relation between news events and stock price jump: an analysis based on neural network

Executive Summary. Abstract. Heitman Analytics Conclusions:

Module 6: Introduction to Time Series Forecasting

Publication List. Chen Zehua Department of Statistics & Applied Probability National University of Singapore

STATISTICA Formula Guide: Logistic Regression. Table of Contents

CIESIN Columbia University

On the effect of data set size on bias and variance in classification learning

Principles of Data Mining by Hand&Mannila&Smyth

Environment Protection Engineering APPROXIMATION OF IMISSION LEVEL AT AIR MONITORING STATIONS BY MEANS OF AUTONOMOUS NEURAL MODELS

Design and Deployment of Specialized Visualizations for Weather-Sensitive Electric Distribution Operations

DATA MINING - SELECTED TOPICS

AUTOMATION OF ENERGY DEMAND FORECASTING. Sanzad Siddique, B.S.

8.1 Summary and conclusions 8.2 Implications

Time Series Analysis

Software Review: ITSM 2000 Professional Version 6.0.

Regression III: Advanced Methods

Overview. Longitudinal Data Variation and Correlation Different Approaches. Linear Mixed Models Generalized Linear Mixed Models

Transcription:

Nonparametric Time Series Analysis: A review of Peter Lewis contributions to the field Bonnie Ray IBM T. J. Watson Research Center Joint Statistical Meetings 2012

Outline Background My connection to Peter Joint work: Nonparametric time series analysis Motivation Methodological foundation Applications and Extensions Current related work Impact of Peter s work 2

How I knew Peter IBM T. J. Watson Research Center Yorktown Heights, NY Carmel Beach Carmel, CA Naval Postgraduate School Monterey, CA 3

Underlying motivation for much of our joint work 4

Key idea Peter recognized that non-parametric regression techniques, under development in the late 80 s and early 90 s, could be applied in the time series context to model non-linear time series phenomena Focused on Multivariate Adaptive Regression Splines technique (MARS) Fits truncated linear splines functions to the data with optimal knot points selected automatically Extended MARS to TS-MARS Used MARS algorithm to modeled nonlinear univariate time series using lagged values of the series itself and possible exogenous covariates Results in nonlinear threshold models that are continuous in the domain of the predictor variables (ASTAR, SMASTAR) More general than self-exciting threshold-type models (SETAR, TARSO), which identify piecewise linear functions over disjoint subregions and are discontinuous at the boundaries of the domain of interest Main publications Lewis, P., and J. Stevens (1991): Nonlinear modeling of time series using multivariate adaptive regression splines (MARS), Journal of the American Statistical Association, 87, 864 877. (130+ citations) Lewis, P., and B. Ray (1997): Modeling nonlinearity, long-range dependence, and periodic phenomena in sea surface temperatures using TSMARS, Journal of the American Statistical Association, 92, 881-893. (50+ citations) 5

MARS Applied to Granite Canyon Data Model for 5 Years of SSTs using Wind Direction and Wind Speed) X t = 2.192(0.0036) + 0.878(0.0079)(X t 1 2.13)+ +1.616(0.2770)(2.22 X t 34 )+ +0.013(0.0018)(WS t 1 1.10)+I(WD t 1 {1, 2}) 0.035(0.0018)(WS t 1 1.10)+I(WD t 1 {2, 3}).499(0.0060)(X t 1 1 2.13)+(2.75 X t 8 8)+(2.68 X t 17 )+ 0.584(0.0999)(2.27 X t 34 )+(WS t 1 1.10)+I(WD t 1 {2, 3}) Suggests that when the wind blows from the Northwest on the previous day, the SST tends to decrease 0.517(0.1174)(X t 49 2.510)+(WS t 1 3.00)+I(WD t 1 {1, 4, 5}) +4.665(1.0344)(2.51 X t 49 49) )+(2.26 X t 24 24) )+I(WD t 1 1 {2, 3}) Reflects the fact that the average time between storm fronts in the vicinity i it of Granite Canyon in the winter is about 8 days Suggests a coupling of SSTs with SSTs approximately 49 days previous, dependent on the wind direction and speed 6

Periodic Autoregressive Models: Characterizing River Flows Periodic time series Correlation structure does not change from cycle to cycle, but differs from period to period within a cycle For example, monthly data may have a yearly cycle, but the correlation between observations in Jan and Feb is different from the correlation structure between observations in Feb and Mar Scatter plots of the logarithm of mean monthly flow of the Fraser River over the time period March 1913 December 1991 7

Innovations in Modeling Periodic Time Series: P-CASTAR Adapted nonlinearity tests for threshold-type behavior to the case of periodic time series Applied MARS algorithm to time series exhibiting periodic behavior to capture non-linear relationships Initially modeled each subseries separately using MARS algorithm Introduced the use of categorical predictors representing each period within a cycle to simultaneously model nonlinear behavior for each period Each response weighted to adjust for heteroskedasticity of the residuals in different periods and weights updated iteratively 8 Lewis, P. and Ray, B. (2002). Modeling periodic threshold autoregressions using TSMARS, Journal of Time Series Analysis, 23, 459-471. Change the mean level of the model only Reduces the correlation between May and June riverflows and July and August riverflows

Impact of Peter s work using MARS to model nonlinear structure in time series TSMARS methodology has been used to model energy price series, mobile communication channels, foreign exchange rates, brain dynamics, ozone extremes, nuclear safeguards and non-proliferation,.. For example, Krzyzscinki, J.W. Nonlinear (MARS) modeling of long-term variations of surface UV-B radiation as revealed from the analysis of Belsk, Poland data for the period 1976 2000. Annales Geophysicae (2003) 21: 1887 1896 De Gooijer, J., Ray, B. and Krager, H. (1998). Forecasting exchange rates using TSMARS, International Journal of Money and Finance, 17, 513-534. Ideas extended d to multivariate i t time series modeling Kooperberg Bose and Stone (JASA, 1997) developed PolyMARS (PMARS) algorithm to extend the advantages of the MARS algorithm over simple recursive partitioning to the multiple classification problem DeGooijer and Ray (CSDA, 2003) applied PMARS algorithm to model vector thresholdtype nonlinearity in multivariate time series 9

Using PMARS to model Electricity Load Data Three weeks of half-hourly electricity load data (n=1008) from the Australian states of New South Wales (NSW) and Victoria (VIC) 7500 7000 6500 6000 5500 5000 4500 4000 3500 10000 9000 8000 7000 Victoria 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 Time (in days) New South Wales Used temperature data, available only for NSW, as an additional predictor, along with Time of Day (TOD) and Time of Week(TOW) indicator variables Results Interactions between the TOD t and lagged loads, suggesting that prior electricity usage acts to modulate TOD t effects Several terms involving lagged loads contained thresholds, indicating that electricity loads exhibit different behavior when usage is above or below certain levels Model showed a feedback relationship between the electricity loads of the two states 6000 5000 4000 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 Time (in days) See De Gooijer, J. and Ray, B.(2003). Modeling vector nonlinear time series using POLYMARS, Computational Statistics and Data Analysis, 42,73-90.

Related work from IBM Research: Scalable Matrix-valued Kernel Learning and High-dimensional Nonlinear Causal Inference Innovations Propose a general matrix-valued multiple kernel learning framework to fit non- parametric models to multivariate time series, i.e. kernels are selected dynamically from a library of kernels based on the local structure of the data Allow a broad class of mixed norm regularizers, including those that induce sparsity, to be imposed on a dictionary of vector-valued Reproducing Kernel HilbertSpaces (RKHS) Resulting models Non-parametric nonlinear, sparse temporal-causal models May be viewed as non-parametric multivariate extension of Group Lasso and related sparse learning models Applications Weekly log returns of multiple related stocks Time-course gene expression microarray data Modeled the expression levels of 2397 unique genes simultaneously measured at 66 time points corresponding to various developmental stages and grouped into 35 functional groups based on their gene to infer causal interactions between functional groups, as well obtain insight on within group relationships between genes. 11

Continuing impact of Peter s work using MARS to model nonlinear structure in time series Motivated theoretical work on limiting properties, boosting, improved partitioning algorithms, etc., e.g. K. S. CHAN and RUEY S. TSAY. Limiting properties of the least squares estimator of a continuous threshold autoregressive model Biometrika (1998) 85(2): 413-426 Thomas R. Boucher and Daren B. H. Cline, STABILITY OF CYCLIC THRESHOLD AND THRESHOLD-LIKE AUTOREGRESSIVE TIME SERIES S MODELS. Statistica Sinica 17(2007), 43-62 P Bühlmann. Dynamic adaptive partitioning for nonlinear time series. Biometrika (1999) 86(3): 555-571. Robinzonov, Nikolay, Tutz, Gerhard, Hothorn, Torsten. Boosting techniques for nonlinear time series models. AStA Advances in Statistical Analysis. (2012). 96 (1). 99-122. Ideas extended to nonlinear transfer function-type models Liu J.M., Chen R., Yao Q. Nonparametric transfer function models (2010) Journal of Econometrics, 157 (1), pp. 151-164 Most recent work found directly linked to work of Lewis and Ray 12 Nonlinearity, Breaks, and Long-Range Dependence in Time-Series Models, Eric Hillebrand and Marcelo C. Medeiros, CREATES Research Paper 2012-30, Department t of Economics and Business, Aarhus University, Bartholins Allé 10, DK-8000 Aarhus C, Denmark

Conclusion Operations Research Distinguished Professor Emeritus Peter Lewis, 1932 2011 A leader in the fields of computer simulation, applied statistics and probability, and operations research...a common theme running through the comments about Peter Lewis by many of his colleagues and former students is his extraordinary influence on their professional careers and his steadfast encouragement and support of their work. 13