Chapter 10 Introduction to Time Series Analysis



Similar documents
Univariate and Multivariate Methods PEARSON. Addison Wesley

Summary Nonstationary Time Series Multitude of Representations Possibilities from Applied Computational Harmonic Analysis Tests of Stationarity

Statistical Analysis of Arterial Blood Pressure (ABP), Central Venous. Pressure (CVP), and Intracranial Pressure (ICP)

Probability and Random Variables. Generation of random variables (r.v.)

Advanced Signal Processing and Digital Noise Reduction

The Calculation of G rms

Time Series Analysis in WinIDAMS

Software Review: ITSM 2000 Professional Version 6.0.

7. DYNAMIC LIGHT SCATTERING 7.1 First order temporal autocorrelation function.

HERMES: Human Exposure and Radiation Monitoring of Electromagnetic Sources.

Univariate Time Series Analysis; ARIMA Models

Lecture 4: Seasonal Time Series, Trend Analysis & Component Model Bus 41910, Time Series Analysis, Mr. R. Tsay

Convolution, Correlation, & Fourier Transforms. James R. Graham 10/25/2005

Chapter 9: Univariate Time Series Analysis

Sales forecasting # 2

Time Series Analysis

VCO Phase noise. Characterizing Phase Noise

Speech Signal Processing: An Overview

RANDOM VIBRATION AN OVERVIEW by Barry Controls, Hopkinton, MA

Time series Forecasting using Holt-Winters Exponential Smoothing

Time Series Analysis

TIME SERIES ANALYSIS: AN OVERVIEW

Geography 4203 / GIS Modeling. Class (Block) 9: Variogram & Kriging

Is the Forward Exchange Rate a Useful Indicator of the Future Exchange Rate?

1 Short Introduction to Time Series

Introduction to Time Series Analysis. Lecture 1.

Matrices and Polynomials

NRZ Bandwidth - HF Cutoff vs. SNR

Promotional Forecast Demonstration

Chapter 1. Vector autoregressions. 1.1 VARs and the identi cation problem

1 Example of Time Series Analysis by SSA 1

Time Series Analysis of Aviation Data

Lesson19: Comparing Predictive Accuracy of two Forecasts: Th. Diebold-Mariano Test

Working Papers. Cointegration Based Trading Strategy For Soft Commodities Market. Piotr Arendarski Łukasz Postek. No. 2/2012 (68)

THE SVM APPROACH FOR BOX JENKINS MODELS

The VAR models discussed so fare are appropriate for modeling I(0) data, like asset returns or growth rates of macroeconomic time series.

Adaptive Demand-Forecasting Approach based on Principal Components Time-series an application of data-mining technique to detection of market movement

Time series analysis as a framework for the characterization of waterborne disease outbreaks

Discrete Time Series Analysis with ARMA Models

Frequency domain application of the Hot-Spot method for the fatigue assessment of the weld seams

CHAPTER 6 PRINCIPLES OF NEURAL CIRCUITS.

Chapter 5: Bivariate Cointegration Analysis

Brain Maps The Sensory Homunculus

Lecture 8: Signal Detection and Noise Assumption

Chapter 6: Multivariate Cointegration Analysis

SGN-1158 Introduction to Signal Processing Test. Solutions

The Fourier Analysis Tool in Microsoft Excel

Moving Holidays and Seasonality: An Application in the Time and the Frequency Domains For Turkey

Using Options Trading Data to Algorithmically Detect Insider Trading

Introduction to time series analysis

How To Model A Series With Sas

Trend and Seasonal Components

Selected Radio Frequency Exposure Limits

A course in Time Series Analysis. Suhasini Subba Rao

MGT 267 PROJECT. Forecasting the United States Retail Sales of the Pharmacies and Drug Stores. Done by: Shunwei Wang & Mohammad Zainal

Time series analysis of data from stress ECG

Linear-Quadratic Optimal Controller 10.3 Optimal Linear Control Systems

Luciano Rispoli Department of Economics, Mathematics and Statistics Birkbeck College (University of London)

Time Series - ARIMA Models. Instructor: G. William Schwert

Univariate Time Series Analysis; ARIMA Models

State Space Time Series Analysis

Time Series Analysis by Higher Order Crossings. Benjamin Kedem Department of Mathematics & ISR University of Maryland College Park, MD

INTRODUCTION TO GEOSTATISTICS And VARIOGRAM ANALYSIS

NONLINEAR TIME SERIES ANALYSIS

Tutorial on Markov Chain Monte Carlo

4. Simple regression. QBUS6840 Predictive Analytics.

ITSM-R Reference Manual

Power Spectral Density

A Regime-Switching Model for Electricity Spot Prices. Gero Schindlmayr EnBW Trading GmbH

Overview. Longitudinal Data Variation and Correlation Different Approaches. Linear Mixed Models Generalized Linear Mixed Models

SYSTEMS OF REGRESSION EQUATIONS

NAG C Library Chapter Introduction. g13 Time Series Analysis

AP Physics 1 and 2 Lab Investigations

Statistical Tests for Multiple Forecast Comparison

Statistical Analysis of Non-stationary Waves off the Savannah Coast, Georgia, USA

Time Series HILARY TERM 2010 PROF. GESINE REINERT

Analysis and Computation for Finance Time Series - An Introduction

Introduction to Time Series Analysis and Forecasting. 2nd Edition. Wiley Series in Probability and Statistics

business statistics using Excel OXFORD UNIVERSITY PRESS Glyn Davis & Branko Pecar

Application Note Noise Frequently Asked Questions

Studying Achievement

Measuring Line Edge Roughness: Fluctuations in Uncertainty

Time Series Analysis and Forecasting Methods for Temporal Mining of Interlinked Documents

Spectrum Level and Band Level

Autocovariance and Autocorrelation

Vector Time Series Model Representations and Analysis with XploRe

3. Regression & Exponential Smoothing

The Information Processing model

Bandwidth-dependent transformation of noise data from frequency into time domain and vice versa

Time Series Analysis

Introduction to Engineering System Dynamics

SR2000 FREQUENCY MONITOR

Time Series Analysis in Economics. Klaus Neusser

An Interactive Tool for Residual Diagnostics for Fitting Spatial Dependencies (with Implementation in R)

Transcription:

Chapter 1 Introduction to Time Series Analysis A time series is a collection of observations made sequentially in time. Examples are daily mortality counts, particulate air pollution measurements, and temperature data. Figure 1 shows these for the city of Chicago from 1987 to 1994. The public health question is whether daily mortality is associated with particle levels, controlling for temperature. 134

13 178 17 136 11 94 73 183 14 17 69 31-7 89 68 47 26-16 Mortality Pollution Temperature 1987 1988 1989 199 1991 1992 1993 1994 We represent time series measurements with Y 1,..., Y T where T is the total number of measurements. In order to analyze a time series, it is useful to set down a statistical model in the form of a stochastic process. A stochastic process can be described as a statistical phenomenon that evolves in time. While most statistical problems are concerned with estimating properties of a population from a sample, in time series analysis there is a different situation. Although it might be possible to vary the length of the observed sample, it is usually impossible to make multiple observations at any single time (for example, one can t observe today s mortality count more than once). This makes the conventional statistical procedures, based on large sample estimates, inappropriate. Stationarity is a convenient assumption that permits us to describe the statistical properties of a time series.

136 CHAPTER 1. INTRODUCTION TO TIME SERIES ANALYSIS 1.1 Stationarity Broadly speaking, a time series is said to be stationary if there is no systematic trend, no systematic change in variance, and if strictly periodic variations or seasonality do not exist. Most processes in nature appear to be non-stationary. Yet much of the theory in time-series literature is only applicable to stationary processes. One way of describing a stochastic process is to specify the joint distribution of the observations Y (t 1 ),..., Y (t n ) for any set of times t 1,..., t n and any value of n. A time series is said to be strictly stationary if the joint distribution of Y (t 1 ),..., Y (t n ) is the same as that of Y (t 1 + h),... Y (t n + h) for all t 1,..., t n and h. To see how this is a useful assumption, notice that the above condition implies that the expected value and covariance structure of any two components, Y a (t) and Y b (t), of a time series are constant in time E{Y a (t)} = µ a, var{y a (t)} = σ 2 a and corr{y a (t), Y b (t + h)} = γ ab (h). (1.1) The function γ ab (h) is called the cross-correlation function if a b and the autocorrelation function if a = b. In practice it is often useful to define stationarity in a less restricted way than that described above. In many cases, the statistical structure of the processes can be completely described with the second-order properties of equation (1.1). We can estimate the quantities in (1.1) using standard statistical procedures, for example we may estimate the cross-correlation at lag h, γ a,b (h) with the sample correlation of Y a (1),..., Y a (T h) and Y b (h + 1),..., Y b (T ). 1.1.1 An example: Fetal Monitoring Measurements of fetal heart rate (FHR) and fetal movement (FM) are generated by maternal-fetal monitoring. Approximately measurements per second are taken during minutes on 12 subjects that are monitored at 2,24,28,32,36, and 38-39 weeks of gestation. Both FHR and FM are recorded giving us a multiple time series Y (t), t = 1,..., 6, where Y (t) is a vector with 2 entries.

1.1. STATIONARITY 137 The association between accelerations of FHR and FM has been documented since the 193s. For example, it has been observed that in the third trimester most large fetal heart accelerations are associated with fetal activity. In Section 4.2 we will describe how relatively straight-forward time series techniques provide a visual descriptions of how these associations vary with weeks gestation. These description have motivated a methodology that will provide us is with a more rigorous assessment of this relationship. If we consider the FM and FHR measurements, seen in Figure, as outcomes from a two component time series, we may consider the cross-correlation function of these two components as a description of the association between these two processes. Notice that the measurements taken for each fetus at each gestation week has a cross-correlation function associated with them. In Figure 6, as a descriptive plot, we show the average, over individuals, of these functions for each gestation week. Notice that a peak at around the 6 second lag starts to appear in the plot for the 24 week gestation. As the fetus gets older, this peak grows and becomes more defined. This result can be considered a first step in the characterization of the relationship between FM and FHR.

138 CHAPTER 1. INTRODUCTION TO TIME SERIES ANALYSIS Time Series Plots of Fetal Activity and Heart Rate 64 Activity 36 16 4 1 2 3 4 Time in minutes Heart Rate 12 13 14 1 16 1 2 3 4 Time in minutes

1.1. STATIONARITY 139 Average cross-correlations over subjects Week 2 Week 24 Correlation..1.2.3 Correlation..1.2.3-4 -2 2 4 Lag Week 28-4 -2 2 4 Lag Week 36 Correlation..1.2.3 Correlation..1.2.3-4 -2 2 4 Lag -4-2 2 4 Lag Week 38 Correlation..1.2.3-4 -2 2 4 Lag

14 CHAPTER 1. INTRODUCTION TO TIME SERIES ANALYSIS 1.2 Spectral Analysis Sometimes it is useful to describe the properties of the time series in a frequency domain. The spectrum is defined as f ab (λ) = σ2 2π h= γ ab (h) exp( iλh) There is a one-to-one correspondence between the spectrum and the autocovariance function σ 2 γ ab (h) = π π f(λ) exp(iλh)dλ We call f aa 2 the power spectrum. A natural way of estimating a power spectrum is using the periodogram which is the modulus of the Fourier transform of the data I(λ) = 1 T 2πT Y t exp( iλt) 2 t=1 We usually compute the periodogram at the Fourier frequencies λ j = (2πj)/T, j = 1,..., T/2. These have desirable statistical properties The periodogram is also useful for detecting periodicities (deterministic ones) in the signal. It is a mathematical fact that if the data Y 1,...Y T has a period p, the the periodogram will have peaks at frequencies λ = 2πT/p and its multiples.

1.2. SPECTRAL ANALYSIS 141 Time Representation 2 Pollution 1 1 1987 1988 1989 199 1991 1992 1993 1994 Time in years Frequency Representation Square Root of Power 4 3 2 1 2 7 1 12 1 17 Frequency in cycles per year 1.2.1 An Application Figured 4a and 4c shows recorded ECoG signal for two channels for a subject that has received a sensory stimulus at some point during the recording. A straightforward way of estimating the spectrum of a stationary process is the periodogram I(λ) = 1 2πT Y (t) exp(iλt) t In Figures 4b and 4c the periodogram of this data is shown. Brain researchers have speculated that the so-called α (8 13Hz.), β (1 2Hz.), and γ (> 3Hz.) bands of human brain signals can indicate functional activation of sensorimotor cortex. Notice that the periodogram exhibits a peak around frequencies 1 Hz., 2 Hz. and 6 Hz.. If we were to approximate the ECoG signal as a stationary 2.

142 CHAPTER 1. INTRODUCTION TO TIME SERIES ANALYSIS processes, we would describe it as having periodic components around these frequencies. However, we are interested in learning how the signal changes when the subjects are given a stimuli. Thus it seems more appropriate to model the signal as a non-stationary processes and study the time-varying spectral density. A straightforward estimate of a time-varying spectral density would be the dynamic periodogram. Basically, for each time t we consider a window around that point of size h(t ) and estimate a weighted periodogram I(t ; λ) = 1 2πh(t ) w t ( ) t t h(t ) Y (t) exp(iλt) Figures 4c and 4e show the estimated time-varying spectral densities for the signals of channels 19 and 2 (lighter colors represent higher values) The figure seems to suggest that the α band changes power and frequency after the stimulus (time ). 2. Trial 1 Trial 9 Trial 8 Trial 7 Trial 6 Trial Trial 4 Trial 3 Trial 2 Trial 1 Mean of these 1 trials Mean of all trials. 1 1. 2 2. 3 Time (seconds)

1.2. SPECTRAL ANALYSIS 143 1.2 1 1 1.8 Frequency (hertz) 1 1.6.4 1 1 2 1 2 1 2 1 2 1 2 1 2 Time (seconds) 1 2 1 2.2

144 CHAPTER 1. INTRODUCTION TO TIME SERIES ANALYSIS.2.1 1 Frequency (hertz) 1 1.1.2.3 1.4 1 1 2 1 2 1 2 1 2 1 2 1 2 Time (seconds) 1 2 1 2.