Detekce změn v autoregresních posloupnostech

Size: px
Start display at page:

Download "Detekce změn v autoregresních posloupnostech"

Transcription

1 Nové Hrady 2012

2 Outline 1 Introduction 2 3 4

3 Change point problem (retrospective) The data Y 1,..., Y n follow a statistical model, which may change once or several times during the observation period Task: decide whether change or changes occurred locate change(s) estimate model before and after change(s) Many theoretical results, many applications Basic reference Csörgő and Horváth, (1997) Chen and Gupta (2000) Antoch, Hušková, Jarušková (2000)

4 On-line approach (sequential monitoring), Chu et al (1996) The observations are coming one by one, after each new observations arrives we make a decision whether change occurs or not. Y 1,... Y m - historical data that satisfy a model We use them as training data and start monitoring in time m + 1 We construct a detector statistic Q(k, m) based on all observations up to m + k and when it cross a prescribed boundary, we stop the monitoring immediately Usually, we solve the problem via the sequential hypotheses testing

5 Change in an autoregression model Model: Y i = β 1 Y i β p Y i p + ε i, i m = (β 1 + δ 1 )Y i (β p + δ p )Y i p + ε i, i > m variance of ε i can be constant or varying

6 Regression model with random regressors Y i = x T i β + x T i δ I {i > m} + ε i, i = p + 1,..., n x i = Y i 1 = (Y i 1,..., Y i p ) T Y i 1 delayed observations, β = (β 1,..., β p ) T, δ = δ n = (δ 1n,..., δ pn ) T 0 Var ε i = σ 2 unknown H 0 : m = n H 1 : p < m < n p < m = m n n change point

7 Tests based on maximum likelihood principle Testing procedures lead to functionals of the partial sums of residuals (Kulperger, 1985, Horváth, 1993, Bai, 1993) S k = k i=p+1 (Y i x T i β n ), k = p + 1,..., n or weighted residuals S k = k i=p+1 x i (Y i x T i β n ), k = p + 1,..., n, β n - LSE based on Y 1,..., Y n

8 Typical test statistics { T n = max S T k C 1 p k n k C n(c }/ k ) 1 S k σ 2 n, (1) T n (ɛ) = { max S T k C 1 k C n(c }/ k nɛ k n(1 ɛ) ) 1 S k σ 2 n, (2) { S T T n (q) = max k C 1 n p k n q 2 (k/n) S k }/ σ 2 n, (3) σ 2 n is a suitable estimator of σ 2, and q is a positive weight function C k = k n x i x T i, C k = x i x T i i=1 i=k+1

9 Limit behaviour under the null hypothesis Let σ 2 n be an estimator of σ 2 such that σ 2 n σ 2 = o P (1). Then, for ɛ (0, 1/2), under causality, as n, ( Tn (ɛ) ) 1/2 B(t) sup ɛ t 1 ɛ t(1 t) ( Tn (q) ) 1/2 B(t) sup 0<t<1 q(t), where {B(t), t [0, 1]} is a p-dimensional Brownian bridge process. HPS (2007)

10 Let σ 2 n be an estimator of σ 2 such that Then, σ 2 n σ 2 = o P ( (log log n) 1 ) lim P( a(log n)tn 1/2 b(log n) t ) = exp{ 2e t } n where a(y) = (2 log y) 1/2, b p (y) = 2 log y + p 2 log log y log Γ( p ) 2 Davis, Huang and Yao (1995); HPS(2007)

11 The limit distribution in the last theorem belongs to the Gumbel-type extreme value distributions, and the convergence is quite slow. Approximations to the critical values can easily be calculated, but they are reliable only for very large n. These approximations can be improved by using resampling techniques HPS+Kirch, 2008 regression bootstrap - resampling residuals only pair bootstrap - resampling pairs (Y i, Y i 1 )

12 Limit behaviour under alternatives Estimators of the change point m m 1 = argmax p<l<n ( S T l C 1 l C n (C l ) 1 S l ) m 2 = argmax p<l<n ( S T l C 1 n S l ) Then, under causality and regularity conditions, m j m = o P (b n ), j = 1, 2, for any sequence {b n } satisfying b n.

13 Estimators of σ 2 or σ 2 n = 1 n p n i=p+1 ( Y in x T i β n ) 2, σn( 2 m) = 1 { m ( Yi x T i n p i=p+1 ) 2 n ( β m + Yi x T i i= m+1 β 0 m ) } 2 where m is an estimator of the change point m, β m and β 0 m are the least squares estimators of β based on Y 1,..., Y m and Y m+1,..., Y n, respectively

14 Application 700 Px50, PX50, daily returns, Index PX50, left, daily returns - right

15 PX50, , ACF PV50, , PACF Sample Autocorrelation Sample Partial Autocorrelations Lag Lag returns PX50, : ACF - left, PACF - right

16 25 PX50, , detection of change PX50 - daily returns, change in autoregression, change

17 PX50, PX50, , ACF Sample Autocorrelation Sample Autocorrelation Lag Lag Index PX50, ACF: before change - left, after change - right

18 PX50, PX50, , PACF Sample Partial Autocorrelations Sample Partial Autocorrelations Lag Lag Index PX50, PACF: before change - left, after change - right

19 model before change: AR(1) X t = X t 1 + ɛ t errors ɛ t satisfy GARCH(1,1) model ɛ t = Z t σ t σ 2 t = ɛ 2 t σ 2 t 1 model after change: GARCH (1,1): X t = σ t Z t σ 2 t = X 2 t σ 2 t 1

20 Efficient score test statistic model: Y i µ = p j=1 β j(y i j µ) + ε i change in ξ = (µ, β 1,..., β p, σ 2 ) ξ n - MLE of ξ L(ξ) score vector (gradient of the log-likelihood function) I(ξ) - Fisher information matrix Test statistic based on Y 1,..., Y n : under H 0 (no change): max B(t) = n 1 2 I 1 2 ( ξ nt )L( ξ nt ) sup 1 j p+2 0 t 1 B j (t) B j (t) = o p (1) B j (t) independent Brownian bridges (Gombay 2008, under normality, Starinská 2011 for general distribution of errors)

21 Sunspots data

22 1 Wolf numbers, sample ACF 1 Wolf numbers, sample PACF 0.8 Sample Autocorrelation Sample Partial Autocorrelations Lag Lag Sunspots: ACF - left, PACF - right

23 150 Wolf index, Sunspots data, prediction in AR(9)

24 change in the mean: AR(2) AR(9) µ n σ n max B(t) change in the mean in AR(2) (at time 1935) mean before change: , mean after change: no change in AR(9) change in AR(2) parameters: changes in years 1826 and 1920 change in variance - ARMA(2,1) model Berkes at al (2009): 1936 (extremal theorem), 1946 (efficient score test)

25 file:///e /beamer/novehrady/sunspotmotyl.gif Sunspots data file:///e /beamer/novehrady/sunspotmotyl.gif [ :23:07]

26 Change in Vector autoregressive process VAR(p) y t = c + β 1 y t β p y t p + ɛ t, t = 1..., k = c + β 1 y t β p y t p + ɛ t, t = k , T H 0 : k = T, H 1 : k = k 0 < T Dvořák (2011,12) - asymptotic pseudo-likelihood ratio test, based on the strong invariance principle, test based on the Gumbel distribution

27 Varying parameters 0.15 MSCI financial index, MSCI financial index, MSCI sector financial index

28 Random coefficients autoregression Basic RCA(p) model: X t = p (β i + B it )X t i + Y t = (β + B t ) T X t 1 + Y t, t Z i=1 β = (β 1..., β p ) T - parameters (constant) B t = (B 1t,..., B pt ) T - iid (0, Σ) - random coefficients X t 1 = (X t 1,..., X t p ) T Y t - iid (0, σ 2 ) - errors

29 Parameters estimation Assume: EB t = 0, E(B t B T t ) = Σ; EY t = 0, EY t = σ 2 ; E(Y t B t ) = 0 X t = (β + B t ) T X t 1 + Y t = β T X t 1 + u t (4) u t = B T t X t 1 + Y t Eu 2 t F t 1 = X T t 1ΣX t 1 + σ 2 = ω T z t 1 + σ 2 (5) F t = σ{(b s, Y s ), s t} ω = vech Σ z t 1 = Kvec (X t 1 X T t 1) K - duplication matrix θ = (β, ω, σ 2 ) T - parameters of the model

30 Maximum- and quasi-maximum likelihood estimators Nichols and Quinn (1982), Aue et al (2006) two-step procedures non-linear optimization nonstationary RCA(1) models - Berkes at al (2009) unified procedures for stationary and nonstationary RCA(1) models - Aue and Horváth (2010)

31 RCA(p) model with parameters θ = (β, ω, σ 2 ) T Γ conditional log-likelihood function under joint normality of (B t, Y t ) (given X 0, X 1,... X p+1 ) L n (θ) = 1 2 n [ (Xt β T X t 1 ) 2 ] ω T z t 1 + σ 2 + log(ω T z t 1 + σ 2 ) t=1 quasi-maximum likelihood estimators - without normality transformed log-likelihood: l n (θ) = 2L n (θ)/n QMLE: l n ( θ) = inf θ Γ l n(θ)

32 l n (θ) = 1 n = 1 n n [ (Xt β T X t 1 ) 2 ] ω T z t 1 + σ 2 + log(ω T z t 1 + σ 2 ) n g t (θ) t=1 t=1 l n (θ) := l n(θ) = 1 n g t(θ) = ( gt (θ) β n g t(θ) t=1, g t(θ) ω {g t(θ)} is strictly stationary mds, g ) t(θ) T σ 2 Eg 1(θ) = 0, Eg 1(θ)g 1(θ) T = A(θ)

33 ( l n 2 l n (θ) ) (θ) = = 1 θ i θ j n n t=1 Eg 1 (θ) = H(θ) = El n (θ) A(θ 0 ) = A, H(θ 0 ) = H nonsingular, g t (θ) n( θ θ0 ) N (0, H 1 AH 1 ) (Nicholls and Quinn 1982, Aue, Horváth, Steinebach, 2006)

34 Monitoring in RCA(p) Model: X t = (β t + Σ 1/2 t b t ) T X t 1 + σ t y t b t iid(0, I), y t iid(0, 1), mutually independent θ t = (β t, ω t, σ 2 t ) T, ω t = vechσ t X 1,..., X m are stable (training, historical data) θ 1 = = θ m = θ 0 continue in observation and stop as soon as a change occurs

35 Sequential testing problem H 0 : θ i = θ 0, i = 1, 2,..., against the alternative that a change occurs, i.e., H A : there exists K 1, θ i = θ 0, 1 i < m + K, θ i = θ 0 + δ m, m + K i <, δ m 0, θ 0, δ m, K - unknown

36 Stopping time of test procedure: τ(m) = inf{k 1, Q(m, k) c q(m, k)} Q(m, k) is detector statistic q(m, k) is a boundary function c > 0 is a constant such that lim P(τ(m) < H 0) = α m (α is the prescribed probability of false alarm), and lim P(τ(m) < H 1) = 1 m (consistency of the test procedure)

37 detector: where Q(m, k) = ( m+k 1 m i=m+1 g i ( θ m ) ) T Â 1 m ( m+k 1 m i=m+1 g i (θ) is gradient vector θ m is QMLE of θ based on the training sample ) g i ( θ m ) Â m = 1 m m i=1 g i ( θ m )g i ( θ m ) T estimator of A based on the training sample boundary function: q(t) = (1 + t) 2 ( t 1+t )2γ, γ [0, 1/2)

38 Theorem. Let {b t }, {y t } be iid, mutually independent, Eb0i 4 <, i = 1,..., p, Ey0 4 <, and condition on strict stationarity holds. Let θ 0 be an inner point of Γ. Then under H 0, as m P( max Q(m, k)/q(k/m) x) P( sup W(t) 2 /t 2γ < x) 1 k< 0<t<1 where W is d-dimensional standard Wiener process with independent components, d = (p + 1)(1 + p/2).

39 Proof: FCLT: nt {n 1/2 A(θ) 1/2 g i (θ), t [0, 1]} {W(t), t [0, 1]} i=1 a few steps that approximate the asymptotic behaviour of by sup 1 k< sup 1 k< m+k i=m+1 g i ( θ) q(k/m) m+k i=m+1 g i (θ) k m m i=1 g i (θ) q(k/m)

40 monitoring under H A δ m is such that δ m 0, m δ m as m, and K = O(m η ), η = η(γ) < 1/2 Then max Q(m, k)/q(k/m) 1 k< in probability (test has asymptotic power 1).

41 0.018 density of stopping time, LSE density of stopping time, CWLSE Densities of stopping times, γ = 0, (blue), γ = 0.25 (green), γ = 0.49 (red). Left: LSE, right: QMLE. True value K=5, m=500, β = 0.3, δ = 0.5, σ 2 B = 0.25, σ 2 Y = 0.25, 3000 observations

42 0.014 density of stopping time, LSE density of stopping time, CWLS Densities of stopping times, γ = 0, (blue), γ = 0.25 (green), γ = 0.49 (red). Left: LSE, right: QMLE. True value K=50, m=500, β = 0.3, δ = 0.5, σ 2 B = 0.25, σ 2 Y = 0.25, 3000 observations

43 THANK YOU!!!

The VAR models discussed so fare are appropriate for modeling I(0) data, like asset returns or growth rates of macroeconomic time series.

The VAR models discussed so fare are appropriate for modeling I(0) data, like asset returns or growth rates of macroeconomic time series. Cointegration The VAR models discussed so fare are appropriate for modeling I(0) data, like asset returns or growth rates of macroeconomic time series. Economic theory, however, often implies equilibrium

More information

Estimating an ARMA Process

Estimating an ARMA Process Statistics 910, #12 1 Overview Estimating an ARMA Process 1. Main ideas 2. Fitting autoregressions 3. Fitting with moving average components 4. Standard errors 5. Examples 6. Appendix: Simple estimators

More information

Sales forecasting # 2

Sales forecasting # 2 Sales forecasting # 2 Arthur Charpentier arthur.charpentier@univ-rennes1.fr 1 Agenda Qualitative and quantitative methods, a very general introduction Series decomposition Short versus long term forecasting

More information

Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model

Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model 1 September 004 A. Introduction and assumptions The classical normal linear regression model can be written

More information

Chapter 5: Bivariate Cointegration Analysis

Chapter 5: Bivariate Cointegration Analysis Chapter 5: Bivariate Cointegration Analysis 1 Contents: Lehrstuhl für Department Empirische of Wirtschaftsforschung Empirical Research and und Econometrics Ökonometrie V. Bivariate Cointegration Analysis...

More information

A Unified Approach to Structural Change Tests Based on ML Scores, F Statistics, and OLS Residuals

A Unified Approach to Structural Change Tests Based on ML Scores, F Statistics, and OLS Residuals A Unified Approach to Structural Change Tests Based on ML Scores, F Statistics, and OLS Residuals Achim Zeileis Wirtschaftsuniversität Wien Abstract Three classes of structural change tests (or tests for

More information

Software Review: ITSM 2000 Professional Version 6.0.

Software Review: ITSM 2000 Professional Version 6.0. Lee, J. & Strazicich, M.C. (2002). Software Review: ITSM 2000 Professional Version 6.0. International Journal of Forecasting, 18(3): 455-459 (June 2002). Published by Elsevier (ISSN: 0169-2070). http://0-

More information

Time Series Analysis 1. Lecture 8: Time Series Analysis. Time Series Analysis MIT 18.S096. Dr. Kempthorne. Fall 2013 MIT 18.S096

Time Series Analysis 1. Lecture 8: Time Series Analysis. Time Series Analysis MIT 18.S096. Dr. Kempthorne. Fall 2013 MIT 18.S096 Lecture 8: Time Series Analysis MIT 18.S096 Dr. Kempthorne Fall 2013 MIT 18.S096 Time Series Analysis 1 Outline Time Series Analysis 1 Time Series Analysis MIT 18.S096 Time Series Analysis 2 A stochastic

More information

Analysis and Computation for Finance Time Series - An Introduction

Analysis and Computation for Finance Time Series - An Introduction ECMM703 Analysis and Computation for Finance Time Series - An Introduction Alejandra González Harrison 161 Email: mag208@exeter.ac.uk Time Series - An Introduction A time series is a sequence of observations

More information

Time Series Analysis III

Time Series Analysis III Lecture 12: Time Series Analysis III MIT 18.S096 Dr. Kempthorne Fall 2013 MIT 18.S096 Time Series Analysis III 1 Outline Time Series Analysis III 1 Time Series Analysis III MIT 18.S096 Time Series Analysis

More information

ITSM-R Reference Manual

ITSM-R Reference Manual ITSM-R Reference Manual George Weigt June 5, 2015 1 Contents 1 Introduction 3 1.1 Time series analysis in a nutshell............................... 3 1.2 White Noise Variance.....................................

More information

Threshold Autoregressive Models in Finance: A Comparative Approach

Threshold Autoregressive Models in Finance: A Comparative Approach University of Wollongong Research Online Applied Statistics Education and Research Collaboration (ASEARC) - Conference Papers Faculty of Informatics 2011 Threshold Autoregressive Models in Finance: A Comparative

More information

Vector Time Series Model Representations and Analysis with XploRe

Vector Time Series Model Representations and Analysis with XploRe 0-1 Vector Time Series Model Representations and Analysis with plore Julius Mungo CASE - Center for Applied Statistics and Economics Humboldt-Universität zu Berlin mungo@wiwi.hu-berlin.de plore MulTi Motivation

More information

Testing, Monitoring, and Dating Structural Changes in Exchange Rate Regimes

Testing, Monitoring, and Dating Structural Changes in Exchange Rate Regimes Testing, Monitoring, and Dating Structural Changes in Exchange Rate Regimes Achim Zeileis http://eeecon.uibk.ac.at/~zeileis/ Overview Motivation Exchange rate regimes Exchange rate regression What is the

More information

Time Series Analysis

Time Series Analysis Time Series Analysis Identifying possible ARIMA models Andrés M. Alonso Carolina García-Martos Universidad Carlos III de Madrid Universidad Politécnica de Madrid June July, 2012 Alonso and García-Martos

More information

Rob J Hyndman. Forecasting using. 11. Dynamic regression OTexts.com/fpp/9/1/ Forecasting using R 1

Rob J Hyndman. Forecasting using. 11. Dynamic regression OTexts.com/fpp/9/1/ Forecasting using R 1 Rob J Hyndman Forecasting using 11. Dynamic regression OTexts.com/fpp/9/1/ Forecasting using R 1 Outline 1 Regression with ARIMA errors 2 Example: Japanese cars 3 Using Fourier terms for seasonality 4

More information

Chapter 4: Vector Autoregressive Models

Chapter 4: Vector Autoregressive Models Chapter 4: Vector Autoregressive Models 1 Contents: Lehrstuhl für Department Empirische of Wirtschaftsforschung Empirical Research and und Econometrics Ökonometrie IV.1 Vector Autoregressive Models (VAR)...

More information

Testing against a Change from Short to Long Memory

Testing against a Change from Short to Long Memory Testing against a Change from Short to Long Memory Uwe Hassler and Jan Scheithauer Goethe-University Frankfurt This version: December 9, 2007 Abstract This paper studies some well-known tests for the null

More information

Lecture 8: Signal Detection and Noise Assumption

Lecture 8: Signal Detection and Noise Assumption ECE 83 Fall Statistical Signal Processing instructor: R. Nowak, scribe: Feng Ju Lecture 8: Signal Detection and Noise Assumption Signal Detection : X = W H : X = S + W where W N(, σ I n n and S = [s, s,...,

More information

Chapter 13 Introduction to Nonlinear Regression( 非 線 性 迴 歸 )

Chapter 13 Introduction to Nonlinear Regression( 非 線 性 迴 歸 ) Chapter 13 Introduction to Nonlinear Regression( 非 線 性 迴 歸 ) and Neural Networks( 類 神 經 網 路 ) 許 湘 伶 Applied Linear Regression Models (Kutner, Nachtsheim, Neter, Li) hsuhl (NUK) LR Chap 10 1 / 35 13 Examples

More information

Introduction to General and Generalized Linear Models

Introduction to General and Generalized Linear Models Introduction to General and Generalized Linear Models General Linear Models - part I Henrik Madsen Poul Thyregod Informatics and Mathematical Modelling Technical University of Denmark DK-2800 Kgs. Lyngby

More information

Linear Threshold Units

Linear Threshold Units Linear Threshold Units w x hx (... w n x n w We assume that each feature x j and each weight w j is a real number (we will relax this later) We will study three different algorithms for learning linear

More information

PITFALLS IN TIME SERIES ANALYSIS. Cliff Hurvich Stern School, NYU

PITFALLS IN TIME SERIES ANALYSIS. Cliff Hurvich Stern School, NYU PITFALLS IN TIME SERIES ANALYSIS Cliff Hurvich Stern School, NYU The t -Test If x 1,..., x n are independent and identically distributed with mean 0, and n is not too small, then t = x 0 s n has a standard

More information

Appendix 1: Time series analysis of peak-rate years and synchrony testing.

Appendix 1: Time series analysis of peak-rate years and synchrony testing. Appendix 1: Time series analysis of peak-rate years and synchrony testing. Overview The raw data are accessible at Figshare ( Time series of global resources, DOI 10.6084/m9.figshare.929619), sources are

More information

Factor analysis. Angela Montanari

Factor analysis. Angela Montanari Factor analysis Angela Montanari 1 Introduction Factor analysis is a statistical model that allows to explain the correlations between a large number of observed correlated variables through a small number

More information

Time Series Analysis

Time Series Analysis Time Series 1 April 9, 2013 Time Series Analysis This chapter presents an introduction to the branch of statistics known as time series analysis. Often the data we collect in environmental studies is collected

More information

Assessing the Relative Power of Structural Break Tests Using a Framework Based on the Approximate Bahadur Slope

Assessing the Relative Power of Structural Break Tests Using a Framework Based on the Approximate Bahadur Slope Assessing the Relative Power of Structural Break Tests Using a Framework Based on the Approximate Bahadur Slope Dukpa Kim Boston University Pierre Perron Boston University December 4, 2006 THE TESTING

More information

Monitoring Structural Change in Dynamic Econometric Models

Monitoring Structural Change in Dynamic Econometric Models Monitoring Structural Change in Dynamic Econometric Models Achim Zeileis Friedrich Leisch Christian Kleiber Kurt Hornik http://www.ci.tuwien.ac.at/~zeileis/ Contents Model frame Generalized fluctuation

More information

Modelling electricity market data: the CARMA spot model, forward prices and the risk premium

Modelling electricity market data: the CARMA spot model, forward prices and the risk premium Modelling electricity market data: the CARMA spot model, forward prices and the risk premium Formatvorlage des Untertitelmasters Claudia Klüppelberg durch Klicken bearbeiten Technische Universität München

More information

Chapter 6: Multivariate Cointegration Analysis

Chapter 6: Multivariate Cointegration Analysis Chapter 6: Multivariate Cointegration Analysis 1 Contents: Lehrstuhl für Department Empirische of Wirtschaftsforschung Empirical Research and und Econometrics Ökonometrie VI. Multivariate Cointegration

More information

Testing against a Change from Short to Long Memory

Testing against a Change from Short to Long Memory Testing against a Change from Short to Long Memory Uwe Hassler and Jan Scheithauer Goethe-University Frankfurt This version: January 2, 2008 Abstract This paper studies some well-known tests for the null

More information

Traffic Safety Facts. Research Note. Time Series Analysis and Forecast of Crash Fatalities during Six Holiday Periods Cejun Liu* and Chou-Lin Chen

Traffic Safety Facts. Research Note. Time Series Analysis and Forecast of Crash Fatalities during Six Holiday Periods Cejun Liu* and Chou-Lin Chen Traffic Safety Facts Research Note March 2004 DOT HS 809 718 Time Series Analysis and Forecast of Crash Fatalities during Six Holiday Periods Cejun Liu* and Chou-Lin Chen Summary This research note uses

More information

Statistics in Retail Finance. Chapter 6: Behavioural models

Statistics in Retail Finance. Chapter 6: Behavioural models Statistics in Retail Finance 1 Overview > So far we have focussed mainly on application scorecards. In this chapter we shall look at behavioural models. We shall cover the following topics:- Behavioural

More information

Non-Stationary Time Series andunitroottests

Non-Stationary Time Series andunitroottests Econometrics 2 Fall 2005 Non-Stationary Time Series andunitroottests Heino Bohn Nielsen 1of25 Introduction Many economic time series are trending. Important to distinguish between two important cases:

More information

ARMA, GARCH and Related Option Pricing Method

ARMA, GARCH and Related Option Pricing Method ARMA, GARCH and Related Option Pricing Method Author: Yiyang Yang Advisor: Pr. Xiaolin Li, Pr. Zari Rachev Department of Applied Mathematics and Statistics State University of New York at Stony Brook September

More information

A course in Time Series Analysis. Suhasini Subba Rao Email: suhasini.subbarao@stat.tamu.edu

A course in Time Series Analysis. Suhasini Subba Rao Email: suhasini.subbarao@stat.tamu.edu A course in Time Series Analysis Suhasini Subba Rao Email: suhasini.subbarao@stat.tamu.edu August 5, 205 Contents Introduction 8. Time Series data................................. 8.. R code....................................2

More information

SYSTEMS OF REGRESSION EQUATIONS

SYSTEMS OF REGRESSION EQUATIONS SYSTEMS OF REGRESSION EQUATIONS 1. MULTIPLE EQUATIONS y nt = x nt n + u nt, n = 1,...,N, t = 1,...,T, x nt is 1 k, and n is k 1. This is a version of the standard regression model where the observations

More information

The CUSUM algorithm a small review. Pierre Granjon

The CUSUM algorithm a small review. Pierre Granjon The CUSUM algorithm a small review Pierre Granjon June, 1 Contents 1 The CUSUM algorithm 1.1 Algorithm............................... 1.1.1 The problem......................... 1.1. The different steps......................

More information

Lecture 3: Linear methods for classification

Lecture 3: Linear methods for classification Lecture 3: Linear methods for classification Rafael A. Irizarry and Hector Corrada Bravo February, 2010 Today we describe four specific algorithms useful for classification problems: linear regression,

More information

Some useful concepts in univariate time series analysis

Some useful concepts in univariate time series analysis Some useful concepts in univariate time series analysis Autoregressive moving average models Autocorrelation functions Model Estimation Diagnostic measure Model selection Forecasting Assumptions: 1. Non-seasonal

More information

Is a Brownian motion skew?

Is a Brownian motion skew? Is a Brownian motion skew? Ernesto Mordecki Sesión en honor a Mario Wschebor Universidad de la República, Montevideo, Uruguay XI CLAPEM - November 2009 - Venezuela 1 1 Joint work with Antoine Lejay and

More information

The Power of the KPSS Test for Cointegration when Residuals are Fractionally Integrated

The Power of the KPSS Test for Cointegration when Residuals are Fractionally Integrated The Power of the KPSS Test for Cointegration when Residuals are Fractionally Integrated Philipp Sibbertsen 1 Walter Krämer 2 Diskussionspapier 318 ISNN 0949-9962 Abstract: We show that the power of the

More information

GLM, insurance pricing & big data: paying attention to convergence issues.

GLM, insurance pricing & big data: paying attention to convergence issues. GLM, insurance pricing & big data: paying attention to convergence issues. Michaël NOACK - michael.noack@addactis.com Senior consultant & Manager of ADDACTIS Pricing Copyright 2014 ADDACTIS Worldwide.

More information

Random Effects Models for Longitudinal Survey Data

Random Effects Models for Longitudinal Survey Data Analysis of Survey Data. Edited by R. L. Chambers and C. J. Skinner Copyright 2003 John Wiley & Sons, Ltd. ISBN: 0-471-89987-9 CHAPTER 14 Random Effects Models for Longitudinal Survey Data C. J. Skinner

More information

Examples. David Ruppert. April 25, 2009. Cornell University. Statistics for Financial Engineering: Some R. Examples. David Ruppert.

Examples. David Ruppert. April 25, 2009. Cornell University. Statistics for Financial Engineering: Some R. Examples. David Ruppert. Cornell University April 25, 2009 Outline 1 2 3 4 A little about myself BA and MA in mathematics PhD in statistics in 1977 taught in the statistics department at North Carolina for 10 years have been in

More information

Is the Basis of the Stock Index Futures Markets Nonlinear?

Is the Basis of the Stock Index Futures Markets Nonlinear? University of Wollongong Research Online Applied Statistics Education and Research Collaboration (ASEARC) - Conference Papers Faculty of Engineering and Information Sciences 2011 Is the Basis of the Stock

More information

. (3.3) n Note that supremum (3.2) must occur at one of the observed values x i or to the left of x i.

. (3.3) n Note that supremum (3.2) must occur at one of the observed values x i or to the left of x i. Chapter 3 Kolmogorov-Smirnov Tests There are many situations where experimenters need to know what is the distribution of the population of their interest. For example, if they want to use a parametric

More information

Time Series Analysis

Time Series Analysis Time Series Analysis hm@imm.dtu.dk Informatics and Mathematical Modelling Technical University of Denmark DK-2800 Kgs. Lyngby 1 Outline of the lecture Identification of univariate time series models, cont.:

More information

Time Series Analysis

Time Series Analysis JUNE 2012 Time Series Analysis CONTENT A time series is a chronological sequence of observations on a particular variable. Usually the observations are taken at regular intervals (days, months, years),

More information

Using JMP Version 4 for Time Series Analysis Bill Gjertsen, SAS, Cary, NC

Using JMP Version 4 for Time Series Analysis Bill Gjertsen, SAS, Cary, NC Using JMP Version 4 for Time Series Analysis Bill Gjertsen, SAS, Cary, NC Abstract Three examples of time series will be illustrated. One is the classical airline passenger demand data with definite seasonal

More information

3.1 Stationary Processes and Mean Reversion

3.1 Stationary Processes and Mean Reversion 3. Univariate Time Series Models 3.1 Stationary Processes and Mean Reversion Definition 3.1: A time series y t, t = 1,..., T is called (covariance) stationary if (1) E[y t ] = µ, for all t Cov[y t, y t

More information

Christfried Webers. Canberra February June 2015

Christfried Webers. Canberra February June 2015 c Statistical Group and College of Engineering and Computer Science Canberra February June (Many figures from C. M. Bishop, "Pattern Recognition and ") 1of 829 c Part VIII Linear Classification 2 Logistic

More information

Univariate and Multivariate Methods PEARSON. Addison Wesley

Univariate and Multivariate Methods PEARSON. Addison Wesley Time Series Analysis Univariate and Multivariate Methods SECOND EDITION William W. S. Wei Department of Statistics The Fox School of Business and Management Temple University PEARSON Addison Wesley Boston

More information

STATISTICA Formula Guide: Logistic Regression. Table of Contents

STATISTICA Formula Guide: Logistic Regression. Table of Contents : Table of Contents... 1 Overview of Model... 1 Dispersion... 2 Parameterization... 3 Sigma-Restricted Model... 3 Overparameterized Model... 4 Reference Coding... 4 Model Summary (Summary Tab)... 5 Summary

More information

Time Series - ARIMA Models. Instructor: G. William Schwert

Time Series - ARIMA Models. Instructor: G. William Schwert APS 425 Fall 25 Time Series : ARIMA Models Instructor: G. William Schwert 585-275-247 schwert@schwert.ssb.rochester.edu Topics Typical time series plot Pattern recognition in auto and partial autocorrelations

More information

PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 4: LINEAR MODELS FOR CLASSIFICATION

PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 4: LINEAR MODELS FOR CLASSIFICATION PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 4: LINEAR MODELS FOR CLASSIFICATION Introduction In the previous chapter, we explored a class of regression models having particularly simple analytical

More information

CARNEGIE MELLON UNIVERSITY NONPARAMETRIC TIME SERIES ANALYSIS USING GAUSSIAN PROCESSES A DISSERTATION SUBMITTED TO THE GRADUATE SCHOOL

CARNEGIE MELLON UNIVERSITY NONPARAMETRIC TIME SERIES ANALYSIS USING GAUSSIAN PROCESSES A DISSERTATION SUBMITTED TO THE GRADUATE SCHOOL CARNEGIE MELLON UNIVERSITY NONPARAMETRIC TIME SERIES ANALYSIS USING GAUSSIAN PROCESSES A DISSERTATION SUBMITTED TO THE GRADUATE SCHOOL IN PARTIAL FULFILLMENT OF THE REQUIREMENTS for the degree DOCTOR OF

More information

Minimum LM Unit Root Test with One Structural Break. Junsoo Lee Department of Economics University of Alabama

Minimum LM Unit Root Test with One Structural Break. Junsoo Lee Department of Economics University of Alabama Minimum LM Unit Root Test with One Structural Break Junsoo Lee Department of Economics University of Alabama Mark C. Strazicich Department of Economics Appalachian State University December 16, 2004 Abstract

More information

Pattern Analysis. Logistic Regression. 12. Mai 2009. Joachim Hornegger. Chair of Pattern Recognition Erlangen University

Pattern Analysis. Logistic Regression. 12. Mai 2009. Joachim Hornegger. Chair of Pattern Recognition Erlangen University Pattern Analysis Logistic Regression 12. Mai 2009 Joachim Hornegger Chair of Pattern Recognition Erlangen University Pattern Analysis 2 / 43 1 Logistic Regression Posteriors and the Logistic Function Decision

More information

Data Mining and Data Warehousing. Henryk Maciejewski. Data Mining Predictive modelling: regression

Data Mining and Data Warehousing. Henryk Maciejewski. Data Mining Predictive modelling: regression Data Mining and Data Warehousing Henryk Maciejewski Data Mining Predictive modelling: regression Algorithms for Predictive Modelling Contents Regression Classification Auxiliary topics: Estimation of prediction

More information

Statistics Graduate Courses

Statistics Graduate Courses Statistics Graduate Courses STAT 7002--Topics in Statistics-Biological/Physical/Mathematics (cr.arr.).organized study of selected topics. Subjects and earnable credit may vary from semester to semester.

More information

LOGISTIC REGRESSION. Nitin R Patel. where the dependent variable, y, is binary (for convenience we often code these values as

LOGISTIC REGRESSION. Nitin R Patel. where the dependent variable, y, is binary (for convenience we often code these values as LOGISTIC REGRESSION Nitin R Patel Logistic regression extends the ideas of multiple linear regression to the situation where the dependent variable, y, is binary (for convenience we often code these values

More information

Discrete Time Series Analysis with ARMA Models

Discrete Time Series Analysis with ARMA Models Discrete Time Series Analysis with ARMA Models Veronica Sitsofe Ahiati (veronica@aims.ac.za) African Institute for Mathematical Sciences (AIMS) Supervised by Tina Marquardt Munich University of Technology,

More information

Chapter 5. Analysis of Multiple Time Series. 5.1 Vector Autoregressions

Chapter 5. Analysis of Multiple Time Series. 5.1 Vector Autoregressions Chapter 5 Analysis of Multiple Time Series Note: The primary references for these notes are chapters 5 and 6 in Enders (2004). An alternative, but more technical treatment can be found in chapters 10-11

More information

Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics

Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics For 2015 Examinations Aim The aim of the Probability and Mathematical Statistics subject is to provide a grounding in

More information

Basics of Statistical Machine Learning

Basics of Statistical Machine Learning CS761 Spring 2013 Advanced Machine Learning Basics of Statistical Machine Learning Lecturer: Xiaojin Zhu jerryzhu@cs.wisc.edu Modern machine learning is rooted in statistics. You will find many familiar

More information

Univariate Time Series Analysis; ARIMA Models

Univariate Time Series Analysis; ARIMA Models Econometrics 2 Spring 25 Univariate Time Series Analysis; ARIMA Models Heino Bohn Nielsen of4 Outline of the Lecture () Introduction to univariate time series analysis. (2) Stationarity. (3) Characterizing

More information

MATHEMATICAL METHODS OF STATISTICS

MATHEMATICAL METHODS OF STATISTICS MATHEMATICAL METHODS OF STATISTICS By HARALD CRAMER TROFESSOK IN THE UNIVERSITY OF STOCKHOLM Princeton PRINCETON UNIVERSITY PRESS 1946 TABLE OF CONTENTS. First Part. MATHEMATICAL INTRODUCTION. CHAPTERS

More information

NOV - 30211/II. 1. Let f(z) = sin z, z C. Then f(z) : 3. Let the sequence {a n } be given. (A) is bounded in the complex plane

NOV - 30211/II. 1. Let f(z) = sin z, z C. Then f(z) : 3. Let the sequence {a n } be given. (A) is bounded in the complex plane Mathematical Sciences Paper II Time Allowed : 75 Minutes] [Maximum Marks : 100 Note : This Paper contains Fifty (50) multiple choice questions. Each question carries Two () marks. Attempt All questions.

More information

Statistical Machine Learning

Statistical Machine Learning Statistical Machine Learning UoC Stats 37700, Winter quarter Lecture 4: classical linear and quadratic discriminants. 1 / 25 Linear separation For two classes in R d : simple idea: separate the classes

More information

Logistic Regression. Vibhav Gogate The University of Texas at Dallas. Some Slides from Carlos Guestrin, Luke Zettlemoyer and Dan Weld.

Logistic Regression. Vibhav Gogate The University of Texas at Dallas. Some Slides from Carlos Guestrin, Luke Zettlemoyer and Dan Weld. Logistic Regression Vibhav Gogate The University of Texas at Dallas Some Slides from Carlos Guestrin, Luke Zettlemoyer and Dan Weld. Generative vs. Discriminative Classifiers Want to Learn: h:x Y X features

More information

Master s Theory Exam Spring 2006

Master s Theory Exam Spring 2006 Spring 2006 This exam contains 7 questions. You should attempt them all. Each question is divided into parts to help lead you through the material. You should attempt to complete as much of each problem

More information

Multivariate Normal Distribution

Multivariate Normal Distribution Multivariate Normal Distribution Lecture 4 July 21, 2011 Advanced Multivariate Statistical Methods ICPSR Summer Session #2 Lecture #4-7/21/2011 Slide 1 of 41 Last Time Matrices and vectors Eigenvalues

More information

Linear Classification. Volker Tresp Summer 2015

Linear Classification. Volker Tresp Summer 2015 Linear Classification Volker Tresp Summer 2015 1 Classification Classification is the central task of pattern recognition Sensors supply information about an object: to which class do the object belong

More information

Powerful new tools for time series analysis

Powerful new tools for time series analysis Christopher F Baum Boston College & DIW August 2007 hristopher F Baum ( Boston College & DIW) NASUG2007 1 / 26 Introduction This presentation discusses two recent developments in time series analysis by

More information

Studying Achievement

Studying Achievement Journal of Business and Economics, ISSN 2155-7950, USA November 2014, Volume 5, No. 11, pp. 2052-2056 DOI: 10.15341/jbe(2155-7950)/11.05.2014/009 Academic Star Publishing Company, 2014 http://www.academicstar.us

More information

Bias in the Estimation of Mean Reversion in Continuous-Time Lévy Processes

Bias in the Estimation of Mean Reversion in Continuous-Time Lévy Processes Bias in the Estimation of Mean Reversion in Continuous-Time Lévy Processes Yong Bao a, Aman Ullah b, Yun Wang c, and Jun Yu d a Purdue University, IN, USA b University of California, Riverside, CA, USA

More information

These slides follow closely the (English) course textbook Pattern Recognition and Machine Learning by Christopher Bishop

These slides follow closely the (English) course textbook Pattern Recognition and Machine Learning by Christopher Bishop Music and Machine Learning (IFT6080 Winter 08) Prof. Douglas Eck, Université de Montréal These slides follow closely the (English) course textbook Pattern Recognition and Machine Learning by Christopher

More information

Response variables assume only two values, say Y j = 1 or = 0, called success and failure (spam detection, credit scoring, contracting.

Response variables assume only two values, say Y j = 1 or = 0, called success and failure (spam detection, credit scoring, contracting. Prof. Dr. J. Franke All of Statistics 1.52 Binary response variables - logistic regression Response variables assume only two values, say Y j = 1 or = 0, called success and failure (spam detection, credit

More information

Testing The Quantity Theory of Money in Greece: A Note

Testing The Quantity Theory of Money in Greece: A Note ERC Working Paper in Economic 03/10 November 2003 Testing The Quantity Theory of Money in Greece: A Note Erdal Özmen Department of Economics Middle East Technical University Ankara 06531, Turkey ozmen@metu.edu.tr

More information

Introduction to Time Series Analysis. Lecture 1.

Introduction to Time Series Analysis. Lecture 1. Introduction to Time Series Analysis. Lecture 1. Peter Bartlett 1. Organizational issues. 2. Objectives of time series analysis. Examples. 3. Overview of the course. 4. Time series models. 5. Time series

More information

Logistic Regression. Jia Li. Department of Statistics The Pennsylvania State University. Logistic Regression

Logistic Regression. Jia Li. Department of Statistics The Pennsylvania State University. Logistic Regression Logistic Regression Department of Statistics The Pennsylvania State University Email: jiali@stat.psu.edu Logistic Regression Preserve linear classification boundaries. By the Bayes rule: Ĝ(x) = arg max

More information

FULLY MODIFIED OLS FOR HETEROGENEOUS COINTEGRATED PANELS

FULLY MODIFIED OLS FOR HETEROGENEOUS COINTEGRATED PANELS FULLY MODIFIED OLS FOR HEEROGENEOUS COINEGRAED PANELS Peter Pedroni ABSRAC his chapter uses fully modified OLS principles to develop new methods for estimating and testing hypotheses for cointegrating

More information

Graphical Tools for Exploring and Analyzing Data From ARIMA Time Series Models

Graphical Tools for Exploring and Analyzing Data From ARIMA Time Series Models Graphical Tools for Exploring and Analyzing Data From ARIMA Time Series Models William Q. Meeker Department of Statistics Iowa State University Ames, IA 50011 January 13, 2001 Abstract S-plus is a highly

More information

Wes, Delaram, and Emily MA751. Exercise 4.5. 1 p(x; β) = [1 p(xi ; β)] = 1 p(x. y i [βx i ] log [1 + exp {βx i }].

Wes, Delaram, and Emily MA751. Exercise 4.5. 1 p(x; β) = [1 p(xi ; β)] = 1 p(x. y i [βx i ] log [1 + exp {βx i }]. Wes, Delaram, and Emily MA75 Exercise 4.5 Consider a two-class logistic regression problem with x R. Characterize the maximum-likelihood estimates of the slope and intercept parameter if the sample for

More information

Time Series Analysis in Economics. Klaus Neusser

Time Series Analysis in Economics. Klaus Neusser Time Series Analysis in Economics Klaus Neusser May 26, 2015 Contents I Univariate Time Series Analysis 3 1 Introduction 1 1.1 Some examples.......................... 2 1.2 Formal definitions.........................

More information

Forecasting model of electricity demand in the Nordic countries. Tone Pedersen

Forecasting model of electricity demand in the Nordic countries. Tone Pedersen Forecasting model of electricity demand in the Nordic countries Tone Pedersen 3/19/2014 Abstract A model implemented in order to describe the electricity demand on hourly basis for the Nordic countries.

More information

Joint Exam 1/P Sample Exam 1

Joint Exam 1/P Sample Exam 1 Joint Exam 1/P Sample Exam 1 Take this practice exam under strict exam conditions: Set a timer for 3 hours; Do not stop the timer for restroom breaks; Do not look at your notes. If you believe a question

More information

Seemingly Unrelated Regressions

Seemingly Unrelated Regressions Seemingly Unrelated Regressions Hyungsik Roger Moon Department of Economics University of Southern California moonr@usc.edu Benoit Perron Département de sciences économiques, CIREQ, and CIRANO Université

More information

Signal detection and goodness-of-fit: the Berk-Jones statistics revisited

Signal detection and goodness-of-fit: the Berk-Jones statistics revisited Signal detection and goodness-of-fit: the Berk-Jones statistics revisited Jon A. Wellner (Seattle) INET Big Data Conference INET Big Data Conference, Cambridge September 29-30, 2015 Based on joint work

More information

INDIRECT INFERENCE (prepared for: The New Palgrave Dictionary of Economics, Second Edition)

INDIRECT INFERENCE (prepared for: The New Palgrave Dictionary of Economics, Second Edition) INDIRECT INFERENCE (prepared for: The New Palgrave Dictionary of Economics, Second Edition) Abstract Indirect inference is a simulation-based method for estimating the parameters of economic models. Its

More information

Lecture 6: Logistic Regression

Lecture 6: Logistic Regression Lecture 6: CS 194-10, Fall 2011 Laurent El Ghaoui EECS Department UC Berkeley September 13, 2011 Outline Outline Classification task Data : X = [x 1,..., x m]: a n m matrix of data points in R n. y { 1,

More information

Extracting correlation structure from large random matrices

Extracting correlation structure from large random matrices Extracting correlation structure from large random matrices Alfred Hero University of Michigan - Ann Arbor Feb. 17, 2012 1 / 46 1 Background 2 Graphical models 3 Screening for hubs in graphical model 4

More information

Time Series Analysis

Time Series Analysis Time Series Analysis Autoregressive, MA and ARMA processes Andrés M. Alonso Carolina García-Martos Universidad Carlos III de Madrid Universidad Politécnica de Madrid June July, 212 Alonso and García-Martos

More information

Simple and efficient online algorithms for real world applications

Simple and efficient online algorithms for real world applications Simple and efficient online algorithms for real world applications Università degli Studi di Milano Milano, Italy Talk @ Centro de Visión por Computador Something about me PhD in Robotics at LIRA-Lab,

More information

TIME SERIES ANALYSIS

TIME SERIES ANALYSIS TIME SERIES ANALYSIS Ramasubramanian V. I.A.S.R.I., Library Avenue, New Delhi- 110 012 ram_stat@yahoo.co.in 1. Introduction A Time Series (TS) is a sequence of observations ordered in time. Mostly these

More information

Example: Credit card default, we may be more interested in predicting the probabilty of a default than classifying individuals as default or not.

Example: Credit card default, we may be more interested in predicting the probabilty of a default than classifying individuals as default or not. Statistical Learning: Chapter 4 Classification 4.1 Introduction Supervised learning with a categorical (Qualitative) response Notation: - Feature vector X, - qualitative response Y, taking values in C

More information

i=1 In practice, the natural logarithm of the likelihood function, called the log-likelihood function and denoted by

i=1 In practice, the natural logarithm of the likelihood function, called the log-likelihood function and denoted by Statistics 580 Maximum Likelihood Estimation Introduction Let y (y 1, y 2,..., y n be a vector of iid, random variables from one of a family of distributions on R n and indexed by a p-dimensional parameter

More information

Random graphs with a given degree sequence

Random graphs with a given degree sequence Sourav Chatterjee (NYU) Persi Diaconis (Stanford) Allan Sly (Microsoft) Let G be an undirected simple graph on n vertices. Let d 1,..., d n be the degrees of the vertices of G arranged in descending order.

More information

Chapter 2: Binomial Methods and the Black-Scholes Formula

Chapter 2: Binomial Methods and the Black-Scholes Formula Chapter 2: Binomial Methods and the Black-Scholes Formula 2.1 Binomial Trees We consider a financial market consisting of a bond B t = B(t), a stock S t = S(t), and a call-option C t = C(t), where the

More information