Time Series Forecasting Real World Data: Auto Regression Using a Neural Network Forecaster with Weighted Windows

Similar documents
Cross Validation. Dr. Thomas Jensen Expedia.com

Univariate and Multivariate Methods PEARSON. Addison Wesley

The Combination Forecasting Model of Auto Sales Based on Seasonal Index and RBF Neural Network

NEURAL NETWORKS A Comprehensive Foundation

A spreadsheet Approach to Business Quantitative Methods

Data Mining Practical Machine Learning Tools and Techniques

Forecasting Tourism Demand: Methods and Strategies. By D. C. Frechtling Oxford, UK: Butterworth Heinemann 2001

ER Volatility Forecasting using GARCH models in R

Data-stream Mining for Rule-based Access Control. Andrii Shalaginov, 13 th of October 2014 COINS PhD seminar

Studying Achievement

Ch.3 Demand Forecasting.

Adaptive Demand-Forecasting Approach based on Principal Components Time-series an application of data-mining technique to detection of market movement

Some useful concepts in univariate time series analysis

Learning Example. Machine learning and our focus. Another Example. An example: data (loan application) The data and the goal

A New Method for Electric Consumption Forecasting in a Semiconductor Plant

Forecasting the first step in planning. Estimating the future demand for products and services and the necessary resources to produce these outputs

Sales Forecast for Pickup Truck Parts:

Outline: Demand Forecasting

NTC Project: S01-PH10 (formerly I01-P10) 1 Forecasting Women s Apparel Sales Using Mathematical Modeling

How To Predict Web Site Visits

Segmenting sales forecasting accuracy

A Comparative Study of the Pickup Method and its Variations Using a Simulated Hotel Reservation Data

Java Modules for Time Series Analysis

Week TSX Index

Advanced analytics at your hands

Linear regression methods for large n and streaming data

Forecasting methods applied to engineering management

Predictive Coding Defensibility and the Transparent Predictive Coding Workflow

Supervised Feature Selection & Unsupervised Dimensionality Reduction

Demand Management Where Practice Meets Theory

Time Series and Forecasting

Predictive Coding Defensibility and the Transparent Predictive Coding Workflow

A Forecasting Decision Support System

A New Quantitative Behavioral Model for Financial Prediction

Students will become familiar with the Brandeis Datastream installation as the primary source of pricing, financial and economic data.

Chapter 20: Data Analysis

Neural Network and Genetic Algorithm Based Trading Systems. Donn S. Fishbein, MD, PhD Neuroquant.com

Supply Chain Forecasting Model Using Computational Intelligence Techniques

VENDOR MANAGED INVENTORY

Experiment #1, Analyze Data using Excel, Calculator and Graphs.

International Journal of Computer Trends and Technology (IJCTT) volume 4 Issue 8 August 2013

Simple and efficient online algorithms for real world applications

Introduction to Financial Models for Management and Planning

Studying Auto Insurance Data

Statistics Graduate Courses

A Wavelet Based Prediction Method for Time Series

Time series forecasting

Chapter 13 Introduction to Nonlinear Regression( 非 線 性 迴 歸 )

Joseph Twagilimana, University of Louisville, Louisville, KY

OBJECTIVE ASSESSMENT OF FORECASTING ASSIGNMENTS USING SOME FUNCTION OF PREDICTION ERRORS

Non Linear Dependence Structures: a Copula Opinion Approach in Portfolio Optimization

Characterizing Task Usage Shapes in Google s Compute Clusters

Modelling and Forecasting Packaged Food Product Sales Using Mathematical Programming

CHAPTER 11 FORECASTING AND DEMAND PLANNING

Location matters. 3 techniques to incorporate geo-spatial effects in one's predictive model

Rotating Machinery Diagnostics & Instrumentation Solutions for Maintenance That Matters

SINGULAR SPECTRUM ANALYSIS HYBRID FORECASTING METHODS WITH APPLICATION TO AIR TRANSPORT DEMAND

DATA MINING IN FINANCE

HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION

Not Your Dad s Magic Eight Ball

Corporate Defaults and Large Macroeconomic Shocks

A FUZZY LOGIC APPROACH FOR SALES FORECASTING

Simple Methods and Procedures Used in Forecasting

IDENTIFICATION OF DEMAND FORECASTING MODEL CONSIDERING KEY FACTORS IN THE CONTEXT OF HEALTHCARE PRODUCTS

ElegantJ BI. White Paper. The Competitive Advantage of Business Intelligence (BI) Forecasting and Predictive Analysis

STATISTICAL DATA ANALYSIS COURSE VIA THE MATLAB WEB SERVER

2) The three categories of forecasting models are time series, quantitative, and qualitative. 2)

Credit Ratings and Bank Loan Monitoring

Stock Prediction using Artificial Neural Networks

Chapter 5 Analysis of variance SPSS Analysis of variance

Indian School of Business Forecasting Sales for Dairy Products

Practical Applications of Evolutionary Computation to Financial Engineering

Data Mining Techniques Chapter 6: Decision Trees

Event driven trading new studies on innovative way. of trading in Forex market. Michał Osmoła INIME live 23 February 2016

Research Methods & Experimental Design

Regression and Time Series Analysis of Petroleum Product Sales in Masters. Energy oil and Gas

Hong Kong Stock Index Forecasting

IBM SPSS Forecasting 22

Demand Forecasting When a product is produced for a market, the demand occurs in the future. The production planning cannot be accomplished unless

Forecasting Of Indian Stock Market Index Using Artificial Neural Network

The SAS Time Series Forecasting System

Conditional guidance as a response to supply uncertainty

BOOSTED REGRESSION TREES: A MODERN WAY TO ENHANCE ACTUARIAL MODELLING

Feature Subset Selection in Spam Detection

Bayesian networks - Time-series models - Apache Spark & Scala

Predicting the Stock Market with News Articles

Overview Classes Logistic regression (5) 19-3 Building and applying logistic regression (6) 26-3 Generalizations of logistic regression (7)

ADVANCED FORECASTING MODELS USING SAS SOFTWARE

Predictive Modeling and Big Data

16 : Demand Forecasting

Software Review: ITSM 2000 Professional Version 6.0.

MSCA Introduction to Statistical Concepts

PERFORMANCE ANALYSIS OF HYBRID FORECASTING MODEL IN STOCK MARKET FORECASTING

Promotional Forecast Demonstration

Software Development Cost and Time Forecasting Using a High Performance Artificial Neural Network Model

Review of basic statistics and the simplest forecasting model: the sample mean

How To Understand The Theory Of Probability

Aachen Summer Simulation Seminar 2014

An empirical analysis of the dynamic relationship between investment grade bonds and credit default swaps

Power Prediction Analysis using Artificial Neural Network in MS Excel

Transcription:

Time Series Forecasting Real World Data: Auto Regression Using a Neural Network Forecaster with Weighted Windows By Brad Morantz Ph.D. Copyright 2004, 2008

Time Series Forecasting Used when don t have causal variables to build model Don t know all of the variables Can t measure all of the variables Do not have metrics to describe variables Do not know the model Assume whatever has been happening will continue (from near past to near future) Effect of causal variables reflected in criterion/dependent lagged values Use lagged values to predict future

Nonlinearity Problems The real world is not linear Usually not even Gaussian Changing Causal Factors Time of occurrence is important Underlying factors change Slowly Quickly Combination of both

Changing Causal Factors When there is a system shock Example: Sept 11 effect on the stock market When there is a slow change over time in causal factors Example: the price of computer memory New processes and methods New standards Economies of scale New or different needs Competition

Tracker Passive object Behaves according to projectile theory Other items & gravity act upon it Active object Fins move, motors turn on Engine causes maneuver Sudden changes in direction Sudden changes in acceleration

Remediation Transform Data Square Square root Trig function Exponential Log etc. Must convert back in order to interpret

Pros and Cons of Transforms Pros More linear Better fit Less error Cons Need to convert back before can assess error or fit or apply results Does not always work Requires Time Requires operator skill

Current Methods OLS autoregression Best fit linear model incorporating all data Random Walk Naïve forecast expects no changes ARIMA/Box-Jenkins Requires model Incorporates all data Moving average

Methods Cont d Exponential smoothing Specialized non-linear ARCH, GARCH, Bilinear, TAR, STAR Kalman Filter, extended Kalman Batch Least Squares Artificial neural network Many more

Neural Network Can fit non-linear models Not affected by multi-colinearity Can solve multiple step horizon Learns input output relationship More immune to noise The general approximator Data driven - few a priori rules Successfully used in forecasting

Ways to Cope with Changes Short forecast horizons use only more recent data Weighted regression NIST states effective for heteroscedasticity Moving versus Rolling Windows Control number of observations used for training/building model

Used to build model Training Data Must clean data before use Garbage in = garbage out Methods to use data Rolling Window Moving Window Weighted Window (WW) Must maintain positive degrees of freedom

Rolling vs. Moving Window Rolling Window Rolls forward including all data behind Constant starting point with ever increasing size Moving Window Deletes the oldest as it adds the newest Constant size with ever increasing starting point

Rolling vs. Moving Window Rolling Window Moving Window

Advantages Rolling Window Learns from all Doesn t forget Everything is equally important Can reduce variance and confidence interval Moving Window Not affected by causal changes Not as affected by system shocks

Disadvantages Rolling Window Might include too much information May include rule that now has smaller affect May include rule that has no affect Moving Window May not include enough data May have lost valuable information Might have deleted important rule

Weighted Window (WW) Currently uses linear ramp Until improvement developed Deletes information that is too old The newest information is the most important (recency rule) Older information is retained but is not as important as new data Superset of rolling & moving windows

Linear Ramp Weight 1 0 Support Core

Weighted Window

Selecting Window Size Trial and Error Try many Select the one with least error A priori knowledge Heuristics or formulae Application of Chaos theory

Macro View Neural Network Forecaster with Weighted window (WW) Learns from history Newest information is most important Older knowledge is not lost, it is just not as important as the most recent Can fit non-linear applications

Results Data set ANN* WW* Change CD Rates 11.4% 5.36% 52.98% Eurodollar 4.98% 4.13% 17.17% Fed Funds 4.41% 2.49% 43.39% N Hse Sals 20.12% 11.19% 44.38% FR Franc 3.26% 2.79% 14.48% *Mean Absolute Percentage Error (MAPE)(best of rolling or moving window) Change is reduction in MAPE

Potential Applications Time series with changing causal variables Time series that has system shocks Examples: Projectile with some active control Stock market Sales of popcorn

Future Optimizing Window size Application of Chaos Theory [Frank et al] Selecting similar data [Shimodiara] Use of genetic algorithm (GA) Optimizing weighting pattern A priori knowledge Heuristics & formulae

Future Cont d Weighted window with neural network classifier Optimizing Architecture of ANN Genetic Algorithm Embedding theorem [Frank et al] A Priori knowledge Combination of above

Home Project Just built quad core machine Projectile simulation Active Passive Compare accuracy, precision, computation time Use Kalman vs Neural Analyze with multi-factor ANOVA

Appendix Statistical Analysis Methods Two factor ANOVA Time of forecast Treatment (type of window) MAPE derived from 31 forecasts One unit (month) horizon ANOVA performed Tukey s HST performed Longer horizons also forecast with comparable results

Results of Two Factor ANOVA Statistically Significant CD rates Federal Funds Rate New House Sales Not significant - excessive variance within treatment Eurodollar French Franc

More Statistical Analysis Three Factor ANOVA Treatment (type of window) Time of forecast Data set Results All treatments were statistically significant

Final Note This was part of a more exhaustive study. Academic references are available upon request Yes, more research is continuing in this area

References www.machine-cognition.com IEEE computational intelligence society TIMS/ORSA Times Series Forecasting Bauerman & Occonel

My Contact Information Brad Morantz PhD www.machine-cognition.com bradscientist@ieee.org www.cognitive-decisions.com

Questions?