An Introduction to Extreme Value Theory

Similar documents
An introduction to the analysis of extreme values using R and extremes

CATASTROPHIC RISK MANAGEMENT IN NON-LIFE INSURANCE

Estimation and attribution of changes in extreme weather and climate events

Applying Generalized Pareto Distribution to the Risk Management of Commerce Fire Insurance

COMPARISON BETWEEN ANNUAL MAXIMUM AND PEAKS OVER THRESHOLD MODELS FOR FLOOD FREQUENCY PREDICTION

An analysis of the dependence between crude oil price and ethanol price using bivariate extreme value copulas

Statistical methods to expect extreme values: Application of POT approach to CAC40 return index

An application of extreme value theory to the management of a hydroelectric dam

Statistical Software for Weather and Climate

Operational Risk Management: Added Value of Advanced Methodologies

The R programming language. Statistical Software for Weather and Climate. Already familiar with R? R preliminaries

A Uniform Asymptotic Estimate for Discounted Aggregate Claims with Subexponential Tails

Extreme Value Theory with Applications in Quantitative Risk Management

EVIM: A Software Package for Extreme Value Analysis in MATLAB

The Extremes Toolkit (extremes)

A Comparative Software Review for Extreme Value Analysis

Corrected Diffusion Approximations for the Maximum of Heavy-Tailed Random Walk

Generating Random Samples from the Generalized Pareto Mixture Model

MATH4427 Notebook 2 Spring MATH4427 Notebook Definitions and Examples Performance Measures for Estimators...

A software review for extreme value analysis

Extreme Value Theory for Heavy-Tails in Electricity Prices

Contributions to extreme-value analysis

Inference on the parameters of the Weibull distribution using records

Tail-Dependence an Essential Factor for Correctly Measuring the Benefits of Diversification

Monte Carlo Simulation

The Extreme-Value Dependence Between the Chinese and. Other International Stock Markets

Maximum Likelihood Estimation

GENERALIZED PARETO FIT TO THE SOCIETY OF ACTUARIES LARGE CLAIMS DATABASE

FULL LIST OF REFEREED JOURNAL PUBLICATIONS Qihe Tang

Intra-Day Seasonality in Foreign Exchange Market Transactions. John Cotter and Kevin Dowd *

A Simple Formula for Operational Risk Capital: A Proposal Based on the Similarity of Loss Severity Distributions Observed among 18 Japanese Banks

Uncertainty quantification for the family-wise error rate in multivariate copula models

Probabilistic Real-Time Systems

Volatility modeling in financial markets

Quantitative Models for Operational Risk: Extremes, Dependence and Aggregation

A Coefficient of Variation for Skewed and Heavy-Tailed Insurance Losses. Michael R. Powers[ 1 ] Temple University and Tsinghua University

NIST GCR Probabilistic Models for Directionless Wind Speeds in Hurricanes

Armenian State Hydrometeorological and Monitoring Service

Un point de vue bayésien pour des algorithmes de bandit plus performants

Extreme Value Analysis of Corrosion Data. Master Thesis

UNIVERSITY OF OSLO. The Poisson model is a common model for claim frequency.

Multivariate Extremes, Max-Stable Process Estimation and Dynamic Financial Modeling

Multivariate Normal Distribution

Asymptotics for ruin probabilities in a discrete-time risk model with dependent financial and insurance risks

Int. Statistical Inst.: Proc. 58th World Statistical Congress, 2011, Dublin (Session STS040) p.2985

MONITORING OF DROUGHT ON THE CHMI WEBSITE

Optimal reinsurance with ruin probability target

THE IMPORTANCE OF OPERATIONAL RISK MODELING FOR ECONOMIC CAPITAL MANAGEMENT IN BANKING 1

Analysis and Classification of Volcanic Eruptions

How to Model Operational Risk, if You Must

Fuzzy Probability Distributions in Bayesian Analysis

Java Modules for Time Series Analysis

Operational Risk Modeling Analytics

Properties of Future Lifetime Distributions and Estimation

Contributions to Modeling Extreme Events on Financial and Electricity Markets

Tutorial on Markov Chain Monte Carlo

APPLYING COPULA FUNCTION TO RISK MANAGEMENT. Claudio Romano *

Operational Risk - Scenario Analysis

Measurement and Modelling of Internet Traffic at Access Networks

Visualization Tools for Insurance Risk Processes

5.1 Derivatives and Graphs

MODELLING, ESTIMATING AND VALIDATING MULTIDIMENSIONAL DISTRIBUTION FUNCTIONS -WITH APPLICATIONS TO RISK MANAGEMENT- By Markus Junker D 368

Master s Theory Exam Spring 2006

A simple scaling approach to produce climate scenarios of local precipitation extremes for the Netherlands

Gamma Distribution Fitting

Message-passing sequential detection of multiple change points in networks

Large Insurance Losses Distributions

Statistical Machine Learning

Financial Time Series Analysis (FTSA) Lecture 1: Introduction

Climate Change risk and Agricultural Productivity in the Sahel

INTELLIGENT ENERGY MANAGEMENT OF ELECTRICAL POWER SYSTEMS WITH DISTRIBUTED FEEDING ON THE BASIS OF FORECASTS OF DEMAND AND GENERATION Chr.

Frequency analysis of the hydrological drought regime. Case of oued Mina catchment in western of Algeria

THE CENTRAL LIMIT THEOREM TORONTO

Loss Distribution Approach for operational risk

Transcription:

An Introduction to Extreme Value Theory Petra Friederichs Meteorological Institute University of Bonn COPS Summer School, July/August, 2007

Applications of EVT Finance distribution of income has so called fat tails value-at-risk: maximal daily lost re-assurance Hydrology protection against flood Q100: maximal flow that is expected once every 100 years Meteorology extreme winds risk assessment (e.g. ICE, power plants) heavy precipitation events heat waves, hurricanes, droughts extremes in a changing climate

Application In classical statistics: focus on AVERAGE behavior of stochastic process central limit theorem In extreme value theory: focus on extreme and rare events Fisher-Tippett theorem What is an extreme? from Ulrike Schneider

Extreme Value Theory Block Maximum M n =max {X 1,, X n } for n M n follows a Generalized Extreme Value (GEV) distribution Peak over Threshold (POT) {X i u X i u} very large threshold u follow a Generalized Pareto Distribution (GPD) Poisson-Point GPD Process combines POT with Poisson point process

Extreme Value Theory Block Maximum M n =max {X 1,, X n } Example: station precipitation (DWD) at Dresden 1948 2004: 57 seasonal (Nov-March and May-Sept) (maxima over approximately 150 days) Summer Winter Dresden

GEV Fisher-Tippett Theorem The distribution of n converges to ( ) M n =max {X 1,, X n } 0 G y=exp [1 y ] 1/ =0 G y=exp exp y which is called the Generalized Extreme Value (GEV) distribution. It has three parameters location parameter scale parameter shape parameter

GEV Types of Distributions GEV has 3 types depending on shape parameter =0 G x=exp exp[ x ] Gumbel x= y Fréchet =1/0 G x=exp [1 x ] Weibull = 1/0 G x=exp [1 x ]

GEV Types of Distributions GEV has 3 types depending on shape parameter Gumbel =0 exponential tail x= y Fréchet =1/0 so called fat tail Weibull = 1/0 upper finite endpoint

GEV Types of Distribution Conditions to the sample X i {X 1,, X n } from which the maxima are drawn must be independently identically distributed (i.i.d.) Let F x be the distribution of X i F x is in the domain of attraction of a Gumbel type GEV iff lim x 1 F xtb x 1 F x =e t p=0.99 p=0.9 F(x) for all t0 Exponential decay in the tail of F x x 0.9 x 0.99

GEV Types of Distribution Conditions to the sample X i {X 1,, X n } from which the maxima are drawn must be independently identically distributed (i.i.d.) Let F x be the distribution of X i F x is in the domain of attraction of a Frechet type GEV iff lim x 1 F x 1 F x = 1/ for all 0 polynomial decay in the tail of F x

GEV Types of Distribution Conditions to the sample X i {X 1,, X n } from which the maxima are drawn must be independently identically distributed (i.i.d.) Let F x be the distribution of X i F x there exists and is in the domain of attraction of a Weibull type GEV iff lim x F with F F =1 1 F F 1 x 1 F F 1 x 1 = 1/ for all 0 F x has a finite upper end point F

GEV return level Of interest often is return level z m value expected every m observation (block maxima) Prob yz m =1 G yz m = 1 m calculated using invers distribution function (quantile function) z m = G 1 1 1 m = 1 log1 1/m z m = log log 1 1/m can be estimated empirically as the z m = G 1 1 1 m = inf y F y 1 m

Block Maxima Maximum daily precipitation for Nov-March (Winter) and May-Sept (Summer) Dresden Summer Winter

Block Maxima

Block Maxima

Peak over Threshold Peak over Threshold (POT) {X i u X i u} very large threshold u follow a Generalized Pareto Distribution (GPD) Daily precipitation for Nov-March (green) and May-Sept (red) Dresden

Peak over Threshold Peak over Threshold (POT) The distribution of exceedances over large threshold u are asymptotically distributed following a Generalized Pareto Distribution (GPD) two parameters scale parameter shape parameter Y i :=X i u X i u H y X i u=1 1 y 1/ u advantage: more efficient use of data disadvantage: how to choose threshold not evident

POT Types of Distribution GDP has same 3 types as GEV depending on shape parameter Gumbel =0 H y=1 exp y u exponential tail Pareto (Fréchet) 0 1 H y~c y 1/ polynomial tail behavior Weibull 0 has upper end point F = u

Peak over Threshold POT: threshold u = 15mm

Poisson Point GPD Process Poisson point GPD process with intensity A=t 2 t 1 [1 y u 1/ ] u on A=t 1, t 2 y,

Estimation and Uncertainty General concept of estimating parameters from a sample { y i }, i=1,,n Maximum Likelihood (ML) Method Assume and PDF { y i } are draw from a GEV (GPD,...) with unknown parameters y i ~F y,, f y,,=f ' y,,

Maximum Likelihood Method The likelihood L of the sample is then L,,= i=1 n f y i,, It is easier to minimize the negative logarithm of the likelihood l,,= i=1 n log f y i,, in general there is no analytical solution for the minimum with respect to the parameters Minimize using numerical algorithms. The estimates,, maximize the likelihood of the data.

Profile log Likelihood

To Take Home There exists a well elaborated statistical theory for extreme values. It applies to (almost) all (univariate) extremal problems. EVT: extremes from a very large domain of stochastic processes follow one of the three types: Gumbel, Frechet/Pareto, or Weibull Only those three types characterize the behavior of extremes! Note: Data need to be in the asymptotic limit of a EVD!

References Coles, S (2001): An Introduction to Statistical Modeling of Extreme Values. Springer Series in Statistics. Springer Verlag London. 208p Beirlant, J; Y. Goegebeur; J. Segers; J. Teugels (2005): Statistics of Extremes. Theory and Applications. John Wiley & Sons Ltd. 490p Embrechts, Küppelberg, Mikosch (1997): Modelling Extremal Events for Insurance and Finance. Springer Verlag Heidelberg.648p Gumbel, E.J. (1958): Statistics of Extremes. (Dover Publication, New York 2004) R Development Core Team (2003): R: A language and environment for statistical computing, available at http://www.r-project.org The evd and ismev Packages by Alec Stephenson and Stuard Coles