Estimation and attribution of changes in extreme weather and climate events



Similar documents
An Introduction to Extreme Value Theory

STT315 Chapter 4 Random Variables & Probability Distributions KM. Chapter 4.5, 6, 8 Probability Distributions for Continuous Random Variables

An introduction to the analysis of extreme values using R and extremes

Errata and updates for ASM Exam C/Exam 4 Manual (Sixteenth Edition) sorted by page

CATASTROPHIC RISK MANAGEMENT IN NON-LIFE INSURANCE

MATH 10: Elementary Statistics and Probability Chapter 5: Continuous Random Variables

MATH4427 Notebook 2 Spring MATH4427 Notebook Definitions and Examples Performance Measures for Estimators...

Exploratory Data Analysis

A Uniform Asymptotic Estimate for Discounted Aggregate Claims with Subexponential Tails

Binomial random variables

Applying Generalized Pareto Distribution to the Risk Management of Commerce Fire Insurance

The Extremes Toolkit (extremes)

Overview of Monte Carlo Simulation, Probability Review and Introduction to Matlab

Least Squares Estimation

Interpretation of Somers D under four simple models

Math 425 (Fall 08) Solutions Midterm 2 November 6, 2008

Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics

PROPERTIES OF THE SAMPLE CORRELATION OF THE BIVARIATE LOGNORMAL DISTRIBUTION

Questions and Answers

UNIT I: RANDOM VARIABLES PART- A -TWO MARKS

seven Statistical Analysis with Excel chapter OVERVIEW CHAPTER

Monte Carlo Simulation

Binomial random variables (Review)

SUMAN DUVVURU STAT 567 PROJECT REPORT

Comparison of resampling method applied to censored data

Supplement to Call Centers with Delay Information: Models and Insights

Confidence Intervals for the Difference Between Two Means

Lecture 2 ESTIMATING THE SURVIVAL FUNCTION. One-sample nonparametric methods

Statistical Rules of Thumb

Statistics I for QBIC. Contents and Objectives. Chapters 1 7. Revised: August 2013

Nonparametric adaptive age replacement with a one-cycle criterion

Risk and return (1) Class 9 Financial Management,

For a partition B 1,..., B n, where B i B j = for i. A = (A B 1 ) (A B 2 ),..., (A B n ) and thus. P (A) = P (A B i ) = P (A B i )P (B i )

Another Look at Sensitivity of Bayesian Networks to Imprecise Probabilities

THE KRUSKAL WALLLIS TEST

Extreme Value Modeling for Detection and Attribution of Climate Extremes

Modeling the Implied Volatility Surface. Jim Gatheral Stanford Financial Mathematics Seminar February 28, 2003

12: Analysis of Variance. Introduction

Department of Mathematics, Indian Institute of Technology, Kharagpur Assignment 2-3, Probability and Statistics, March Due:-March 25, 2015.

Normality Testing in Excel

Maximum Likelihood Estimation

EVALUATION OF PROBABILITY MODELS ON INSURANCE CLAIMS IN GHANA

Introduction to Hypothesis Testing. Hypothesis Testing. Step 1: State the Hypotheses

Package SHELF. February 5, 2016

Gamma Distribution Fitting

Probability Calculator

Machine Learning and Pattern Recognition Logistic Regression

Modeling Individual Claims for Motor Third Party Liability of Insurance Companies in Albania

Independent t- Test (Comparing Two Means)

Two-sample hypothesis testing, II /16/2004

CA200 Quantitative Analysis for Business Decisions. File name: CA200_Section_04A_StatisticsIntroduction

Statistical Analysis of Life Insurance Policy Termination and Survivorship

Analysis and Interpretation of Clinical Trials. How to conclude?

VISUALIZATION OF DENSITY FUNCTIONS WITH GEOGEBRA

Part 2: Analysis of Relationship Between Two Variables

Protein Prospector and Ways of Calculating Expectation Values

A Basic Introduction to Missing Data

Calculating VaR. Capital Market Risk Advisors CMRA

Survival Analysis of Left Truncated Income Protection Insurance Data. [March 29, 2012]

Tutorial on Markov Chain Monte Carlo

Jitter Measurements in Serial Data Signals

INSURANCE RISK THEORY (Problems)

Dongfeng Li. Autumn 2010

Generating Random Samples from the Generalized Pareto Mixture Model

Schools Value-added Information System Technical Manual

Permutation Tests for Comparing Two Populations

Missing data and net survival analysis Bernard Rachet

Contents. List of Figures. List of Tables. List of Examples. Preface to Volume IV

Multiple Optimization Using the JMP Statistical Software Kodak Research Conference May 9, 2005

Lecture 5 : The Poisson Distribution

Algorithmic Trading: Model of Execution Probability and Order Placement Strategy

How to assess the risk of a large portfolio? How to estimate a large covariance matrix?

Example: Credit card default, we may be more interested in predicting the probabilty of a default than classifying individuals as default or not.

Determining distribution parameters from quantiles

Descriptive Statistics

How To Test For Significance On A Data Set

Course 4 Examination Questions And Illustrative Solutions. November 2000

Estimating the Degree of Activity of jumps in High Frequency Financial Data. joint with Yacine Aït-Sahalia

An Introduction to Modeling Stock Price Returns With a View Towards Option Pricing

Joint Exam 1/P Sample Exam 1

Statistical Confidence Calculations

Statistics in Retail Finance. Chapter 6: Behavioural models

The Wilcoxon Rank-Sum Test

Hypothesis Testing for Beginners

E3: PROBABILITY AND STATISTICS lecture notes

A Coefficient of Variation for Skewed and Heavy-Tailed Insurance Losses. Michael R. Powers[ 1 ] Temple University and Tsinghua University

Distribution (Weibull) Fitting

A Regime-Switching Model for Electricity Spot Prices. Gero Schindlmayr EnBW Trading GmbH

Extreme Value Analysis of Corrosion Data. Master Thesis

1 Sufficient statistics

An Empirical Examination of Investment Risk Management Models for Serbia, Hungary, Croatia and Slovenia

Extreme-Value Analysis of Corrosion Data

Geostatistics Exploratory Analysis

Statistical Machine Learning

Transcription:

IPCC workshop on extreme weather and climate events, 11-13 June 2002, Beijing. Estimation and attribution of changes in extreme weather and climate events Dr. David B. Stephenson Department of Meteorology University of Reading www.met.rdg.ac.uk/ cag

Plan of the talk 1. Introduction 2. Descriptive methods 3. Understanding the changes 4. Modelling the tail of the pdf 5. Recommendations

The definition problem: Extreme events can be defined by: Gare Montparnasse, 22 October 1895 Train crash here Maxima/ minima Magnitude Rarity Impact/ losses Man can believe the impossible, but man can never believe the improbable. - Oscar Wilde

IPCC 2001 definition of extreme event: An extreme weather event is an event that is rare within its statistical reference distribution at a particular place. Definitions of "rare" vary, but an extreme weather event would normally be as rare or rarer than the 10th or 90th percentile.

Daily CET winter temperatures (Nov-Mar) Sample 1: 1772-1900 n= 19232 values Sample 2: 1900-2000 n= 14933 values

Change in the p.d.f. of winter temp. Above freezing Below freezing Note shift in probability density for cold temperatures

Order statistics x [i] Simply rank sample values in ascending order Largest values in sample from 1900-2000

Empirical Distribution Function (e.d.f.) = =

Empirical return level plot x = A way of plotting the distribution function x vs. 1/(1-F)

Empirical quantile-quantile plots Note: quantiles differ not just a shift due to means 1772-1900: mean=-4.31 deg C stdev= 3.45 deg C 1900-2000: mean=-4.94 deg C stdev= 3.26 deg C

Negated Central England Temperature

Marked point process for exceedances Marked point process = random point process in time + random amount at each event

Principal attributes of exceedances Rate of exceedance R= Pr{ X> u} Mean excess M= E(X-u X> u) Volatility V= Stdev(X-u X> u)/ M Clustering in time L= 1/ Extremal Index Note: In many weather/ climate studies only rate is discussed yet the other attributes also merit attention!

Attributes as a function of threshold Rate R Mean excess M Volatility V Cluster time C

Changes in the rate of exceedance R R = X > u = F u Note reduced rate of exceedance in the later period.

Changes in the mean excess = = > M mean excess E( X u X u)

Changes in the volatility of excesses Volatility = = Coefficien t of Variation of std. deviation( excess)/mean(excess) excesses Note: values less than one imply Weibull type distribution with finite upper limit

Clustering in time For an unclustered point process the interarrival times T(i+1)-T(i) are exponentially distributed. However, extremes often come in clusters and it is important to take account of this dependency:

Changes in cluster length L = (Extremal Index) 1 = 2 T ( 2 T )2

Summary statistics Period R Log R M V L 1799-2000 0.091-2.39 1.74 0.92 3.45 1799-1900 0.108-2.23 1.82 0.93 3.98 1900-2000 0.070-2.66 1.59 0.90 3.55 Change -0.038-0.43-0.23-0.03-0.43 Fewer extremes Less intense extremes Shorter clusters

Changes in location, scale, and shape Fig. 5.1 from PhD thesis of Tahl Kestin: a) Shift in mean b) Increase in variance c) Change in shape

Hypotheses about changing extremes H0: No change (variation due to sampling only) Sampling uncertainty can be tested using tail model H1: Change due to mean effect e.g. Mearns et al. (1984), Wigley (1985), H2: Change due to variance effect e.g. Katz and Brown (1992), Katz and coworkers H3: Change due to mean and variance effects e.g. Brown and Katz (1995), H4: Structural change in shape etc e.g. Kestin 2001, Antoniadou et al. 2001

Response to change in the mean Change in exceedance rate for standard normal distribution Dashed lines show 95% confidence intervals for estimates based on 400 samples Note: nonlinear response in p2/p1 p2/p1 is very sensitive especially for small p1 sampling uncertainty also increases for small p1

Signal and noise for change in mean Change in exceedance rate for shift of 0.5 in mean of the standard normal distribution. Red lines show 95% confidence intervalsfor estimates based on 400 samples Largest uncertainty for most extreme events σ 1 np 1 p p 2 1 Note: largest change ( signal ) for the most extreme events BUT also much more sampling uncertainty ( noise ) in the tail signal/noise ratio decreases for more extreme events!

Response to change in variance Change in exceedance rate for standard normal distribution Dashed lines show 95% confidence intervalsfor estimates based on 400 samples Note: very nonlinear response in p2/p1 p2/p1 is very sensitive especially for small p1 sampling uncertainty also increases substantially for small p1

Signal diagnosis When δ log p 2 R p 1 = Pr{ X = f R x 1 F e -0. 43 x > x} δµ { h( δµ + zδσ )} 1 F + x σ µ Example:for CET threshold x dlog R = = = 0. 5(-0. 63-0. 24 x σ µ δσ 0 : mean and variance change both contribute = )

The hazard function for CET h x = f F x x

Structural change Antoniadou et al. 2001 Tail shape parameter has become more sensitive to westerly flow NAO+ winters in the 20 th century

Why do we need to model the tail? Quantify sampling uncertainty Make predictions beyond what is in the data sample e.g. larger values Use asymptotic theory to get the most information from the tail behaviour Summarise tail behaviour at ALL thresholds using just a few parameters Note: Modelling whole of parent distribution can fail to capture proper tail behaviour e.g. rainfall extremes and gamma fits to parent.

Tail parameter estimation methods 1. GEV fits to block maxima Uses only one value per block! (wastes data!) Choice of block length? 2. GEV fits to r-largest order statistics Choice of r? Choice of block length? Uses r values per block (better than maxima method) 3. GPD fits to peaks over threshold Uses ALL data above threshold Results should be insensitive to choice of threshold Choice of threshold? 4. Point process fits to peaks over threshold Uses ALL data above threshold Stochastic process that can be generalised Choice of threshold?

The Generalised Extreme Value distribution G Maxima/minima and the extreme tail of distributions can be modelled asymptotically by the 3-parameter Generalised Extreme Value (GEV) distribution: z = x µ ξ = + ξ σ z 1 ξ

Parametric description of attributes = R(u) ξ ξ µ ξ σ σ µ ξ ξ 2 1 1 ) ( 1 ) ( ) ( 1 1 exp ) ( 1 = + = + = u V u u M u n u R Note: These expressions can be used to calculate the attributes for ANY threshold once the 3 tail parameters are known

The Generalised Pareto Distribution Excesses X-u above a high threshold u can be modelled using the 2-parameter Generalised Pareto Distribution: X σ u u > z = + ξ z 1 ξ σ u = σ + ξ u µ

Poisson process above high threshold Probability of finding m points in region above x = m! e λλ m ξ σ ξ λ 1/ 1 2 1 ) ( ) ( where rate + = Ω u u x t t Ω

Point process estimates (with std. errors) µ σ ξ Period µ σ ξ 1799-2000 -5.70± 0.22 2.75± 0.15-0.124± 0.014 16.5 1799-1900 -5.47± 0.26 2.81± 0.18-0.123± 0.019 17.3 1900-2000 -6.20± 0.43 2.82± 0.27-0.147± 0.024 13.0 Change -0.73(-2.03) 0.01(0.06) -0.024 (-1.10) -4.3 (*) are t values change divided by standard error: t >1.96 is significant at 5% level statistically significant decrease in tail mean (p<0.05) hardly any change in tail variance large decrease in shape parameter (more convex) large reduction in upper limit of distribution

Main conclusions Extremes are fascinating but require careful statistical analysis ALL attributes of extremes should be investigated NOT just the rate of exceedence Tail parameters should be estimated using as much data as possible (e.g. not block maxima, use of pooling, etc.) Changes in attributes should be diagnosed in terms of changes in mean and variance of parent distribution (trends in variance=?) More complicated structural change might occur and can be diagnosed from regression on large-scale factors (covariates)

This document was created with Win2PDF available at http://www.daneprairie.com. The unregistered version of Win2PDF is for evaluation or non-commercial use only.