Retrospective Test for Loss Reserving Methods - Evidence from Auto Insurers



Similar documents
One-year reserve risk including a tail factor : closed formula and bootstrap approaches

Bootstrap Modeling: Beyond the Basics

PREDICTIVE DISTRIBUTIONS OF OUTSTANDING LIABILITIES IN GENERAL INSURANCE

How To Calculate Multi-Year Horizon Risk Capital For Non-Life Insurance Risk

THE MULTI-YEAR NON-LIFE INSURANCE RISK

STOCHASTIC CLAIMS RESERVING IN GENERAL INSURANCE. By P. D. England and R. J. Verrall. abstract. keywords. contact address. ".

Modelling the Claims Development Result for Solvency Purposes

The Study of Chinese P&C Insurance Risk for the Purpose of. Solvency Capital Requirement

Intermediate Track II

ON THE PREDICTION ERROR IN SEVERAL CLAIMS RESERVES ESTIMATION METHODS IEVA GEDIMINAITĖ

RISK MARGINS FOR HEALTH INSURANCE. Adam Searle and Dimity Wall

STOCHASTIC CLAIM RESERVING BASED ON CRM FOR SOLVENCY II PURPOSES

Options on Stock Indices, Currencies and Futures

Chapter 10 Run-off triangles

FIXED INCOME ATTRIBUTION

Bayesian prediction of disability insurance frequencies using economic indicators

Stochastic claims reserving in non-life insurance

Micro-level stochastic loss reserving for general insurance

Financial Services [Applications]

Power functions: f(x) = x n, n is a natural number The graphs of some power functions are given below. n- even n- odd

High School Students Who Take Acceleration Mechanisms Perform Better in SUS Than Those Who Take None

Solvency capital for insurance company : modelling dependence using copula

Tail-Dependence an Essential Factor for Correctly Measuring the Benefits of Diversification

Stochastic Reserving: Lessons Learned from General Insurance

SENSITIVITY ANALYSIS AND INFERENCE. Lecture 12

Combining GLM and datamining techniques for modelling accident compensation data. Peter Mulquiney

TESTING THE ASSUMPTIONS OF AGE-TO-AGE FACTORS. Abstract

Performance Metrics for Graph Mining Tasks

Munich Re INSURANCE AND REINSURANCE. May 6 & 7, 2010 Dave Clark Munich Reinsurance America, Inc 4/26/2010 1

A MPCP-Based Centralized Rate Control Method for Mobile Stations in FiWi Access Networks

Integrated Resource Plan

Development Period Observed Payments

Labor Demand. 1. The Derivation of the Labor Demand Curve in the Short Run:

Risk and Return: Estimating Cost of Capital

Is the trailing-stop strategy always good for stock trading?

Simple Linear Regression

S T O C H A S T I C C L A I M S R E S E R V I N G O F A S W E D I S H L I F E I N S U R A N C E P O R T F O L I O

Jinadasa Gamage, Professor of Mathematics, Illinois State University, Normal, IL, e- mail:

STATISTICA Formula Guide: Logistic Regression. Table of Contents

SOA Health 2007 Spring Meeting Valuation and Reserving Techniques

!!! Technical Notes : The One-click Installation & The AXIS Internet Dynamic DNS Service. Table of contents

IASB Educational Session Non-Life Claims Liability

GLMs: Gompertz s Law. GLMs in R. Gompertz s famous graduation formula is. or log µ x is linear in age, x,

Assessing Inflation Risk in Non-Life Insurance

Obtaining Predictive Distributions for Reserves Which Incorporate Expert Opinion. R. J. Verall

An Anatomy of Futures Returns: Risk Premiums and Trading Strategies

How To Model Claims Inflation

We discuss 2 resampling methods in this chapter - cross-validation - the bootstrap

**BEGINNING OF EXAMINATION** The annual number of claims for an insured has probability function: , 0 < q < 1.

A distribution-based stochastic model of cohort life expectancy, with applications

Stochastic Claims Reserving Methods in Non-Life Insurance

Quantitative Impact Study 1 (QIS1) Summary Report for Belgium. 21 March 2006

A RAPID METHOD FOR WATER TARGET EXTRACTION IN SAR IMAGE

USING A CLAIM SIMULATION MODEL FOR RESERVING & LOSS FORECASTING FOR MEDICAL PROFESSIONAL LIABILITY. By Rajesh V. Sahasrabuddhe, FCAS, MAAA

A performance analysis of EtherCAT and PROFINET IRT

SYLLABUS OF BASIC EDUCATION 2016 Estimation of Policy Liabilities, Insurance Company Valuation, and Enterprise Risk management Exam 7

RESP Investment Strategies

How to assess the risk of a large portfolio? How to estimate a large covariance matrix?

Introduction to Predictive Modeling Using GLMs

Java Modules for Time Series Analysis

Integrating Financial Statement Modeling and Sales Forecasting

Examples. David Ruppert. April 25, Cornell University. Statistics for Financial Engineering: Some R. Examples. David Ruppert.

Methodology. Discounting. MVM Methods

Investment Analysis (FIN 670) Fall Homework 5

Factors Affecting the Adoption of Global Sourcing: A Case Study of Tuskys Supermarkets, Kenya

Numbers 101: Growth Rates and Interest Rates

PARAMETRIC DISTRIBUTION FAMILIES USEFUL FOR MODELLING NON-LIFE INSURANCE PAYMENTS DATA. TAIL BEHAVIOUR. Sandra Teodorescu 1

Solving Newton s Second Law Problems

SPSS TRAINING SESSION 3 ADVANCED TOPICS (PASW STATISTICS 17.0) Sun Li Centre for Academic Computing lsun@smu.edu.sg

Practical Applications of Stochastic Modeling for Disability Insurance

Advanced Fixed Income Analytics Lecture 1

Dirty Data on Both Sides of the Pond

Runoff of the Claims Reserving Uncertainty in Non-Life Insurance: A Case Study

RISKY LOSS DISTRIBUTIONS AND MODELING THE LOSS RESERVE PAY-OUT TAIL

Two Correlated Proportions (McNemar Test)

MGT 267 PROJECT. Forecasting the United States Retail Sales of the Pharmacies and Drug Stores. Done by: Shunwei Wang & Mohammad Zainal

Ratemakingfor Maximum Profitability. Lee M. Bowron, ACAS, MAAA and Donnald E. Manis, FCAS, MAAA

GLM I An Introduction to Generalized Linear Models

Claims reserving with R: ChainLadder Package Vignette

Local outlier detection in data forensics: data mining approach to flag unusual schools

Property Casualty Insurer Economic Capital using a VAR Model Thomas Conway and Mark McCluskey

8. Hardware Acceleration and Coprocessing

Multivariate Analysis of Ecological Data

Pricing Alternative forms of Commercial Insurance cover

Financial Time Series Analysis (FTSA) Lecture 1: Introduction

Transcription:

Retrospective Test or Loss Reserving Methods - Evidence rom Auto Insurers eng Shi - Northern Illinois University joint work with Glenn Meyers - Insurance Services Oice CAS Annual Meeting, November 8, 010

Outline Introduction Loss reserving methods Sampling o NAIC Schedule Analysis or the industry Analysis or individual insurers Concluding remarks

Introduction A loss reserving model rom a upper triangle training data, one is interested in whether it is a good or bad predictive distribution. Standard error is commonly used measure o variability, does a small standard error mean a good predictive model? Hold-out observations are needed to answer the above question. For a run-o triangle o incremental paid losses, suppose we observe all the losses in the lower triangle hold-out sample, the retrospective test in this study is based on the ollowing well-know result: I X is a random variable with distribution F, then the transormation FX ollows a uniorm distribution on 0,1.

Introduction X : total reserve - Use a sample o independent insurers. - Test whether the percentiles o total reserves are rom uniorm 0,1. - Inorms us whether a predictive model is good or the whole industry. X : incremental paid losses in each cell o the lower triangle - Test or each single insurer. - Test whether the percentiles o incremental losses in the lower triangle hold-out sample are rom uniorm 0,1 - Inorms us whether a predictive model perorms well or a particular insurer

Loss reserving methods Three methods are considered: Mack chain ladder, bootstrap overdispersed oisson, Bayesian log-normal An industry benchmark: Chain-Ladder technique - Large literature on CL, see England and Verrall 00, Wüthrich and Merz 008 - Many stochastic models reproduce CL estimates, e.g. Mack 1993,1999, Renshaw and Verrall 1998, Verrall 000 - Modiications o CL, e.g. Barnnett and Zehnwirth 000 Mack CL: - Variability can be rom recursive relationship, see Mack 1999 - Assume normality in the calculation o percentiles Bootstrap OD: - Resample residuals o GLM - Fit CL to pseudo data - Simulate incremental loss or each cell

A Bayesian Log-normal Model revious studies: Alba 00,006, Ntzouras and Dellaportas 00 Calendar year eect has been ignored We propose log Y µ ij ij ~ N µ, σ ij = αi + β j + t= i+ j, i, j = 1, N Y α - trend or accident year i β ij - normalized incremental paid lossor cell i,j - trend or development lag j - trend or calendar year t t i j We use accident year premium as exposure variable

A Bayesian Log-normal Model Dierent ways to speciy calendar year trend Calendar year trend introduce correlation due to calendar year eects The state space speciication could be used on accident year or development year trend We ocus on AR and RW speciications in the ollowing analysis 0, ~ 0, ~ Random Walk :, ~ 0, ~ Autoregressive Model: 0, ~ IIDSpeciication : 1 1 σ σ η η σ µ σ η η φ σ η η N N N N N t t t t t t t t t + = + =

A Bayesian Log-normal Model The likelihood unction can be derived as ollows We perorm the analysis using WinBUGS,,, where where we use log - normal speciication,,, then },,, { }and,,, { Let 3 1 3 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 φ σ σ η η η β α φ σ σ σ β α η η y n n n n n n n i n j j i n i n j ij t j i = = = = = = = = = = = = y y y y

Sampling o NAIC Schedule

Sampling o NAIC Schedule Training data is rom 1997 schedule Accident year 1988 1997 Hold-out sample is rom schedule o subsequent years e.g. actual paid losses or AY 1989 is rom 1998 schedule actual paid losses or AY 1990 is rom 1999 schedule actual paid losses or AY 1997 is rom 006 schedule Limit to group insurers or single entities Use data or personal auto and commercial auto or our analysis Check overlapping periods in training data and hold-out sample e.g. Training 1988 1989 1990 1991 199 1993 1994 1995 1996 1997 AY 1989 Hold-out 1989 1990 1991 199 1993 1994 1995 1996 1997 1998 AY 1989

Analysis or the industry Consider largest 50 insurers or personal and commercial auto lines Use net premiums written to measure size For each line o business: - derive the predictive distribution o total reserves or insurer i, say F i - calculate the percentile o the actual losses p i = F i loss i - repeat or all 50 irms Test i p i ollows uniorm 0,1 Implications: - i a predictive model perorms well, percentiles should be a realization rom uniorm 0,1 - an outcome that alls on the lower or higher percentile o the distribution does not suggest a bad model

Commercial Auto Consider Mack CL and bootstrap OD or top 50 insurers Compare point estimate o total reserve and prediction error 1 st igure compares point prediction that conirms two methods provide same estimates nd igure compares percentiles o actual losses, indicating a similar predictive distribution

Commercial Auto Next two slides present the percentiles p i i = 1,,50 o total reserves or the 50 insurers under dierent loss reserving methods Histogram and uniorm pp-plot are produced or our methods K-S test is used to test i p i ollows uniorm We observe: - again Mack CL and bootstrap OD provides similar results - pp-plots show both might have overitting problem - among state space modeling, AR1 speciication perorms better with a high p-value in the K-S test

Mack CL Bootstrap OD

LN - RW LN AR: p-value o K-S test is 0.43

ersonal Auto Repeat above analysis o total reserves or personal auto First we consider Mack CL and bootstrap OD using data rom largest 50 insurers Comparison o point prediction and percentile o actual losses conirms the close results rom the two chain ladder models

ersonal Auto As done or commercial auto, next two slides present the percentiles p i i = 1,,50 o total reserves or the 50 insurers under dierent loss reserving methods We exhibit both histogram and uniorm pp-plot, and K-S test is used to test i p i ollows uniorm We observe: - again Mack CL and bootstrap OD provides similar results - the perormance i worse than the commercial auto, since most realized outcomes lie on the lower percentile o the predictive distribution - Log-normal model does not suer like the above two, and a high p-value o the K-S test suggests the good perormance o the AR1 speciication

Mack CL Bootstrap OD

LN - RW LN AR: p-value o K-S test is 0.1

Analysis or individual insurers Consider individual insurers For illustrative purposes, we pick out insurers or each line Compare OD and LN-AR model Out o the two individual insurers or each line, we show that OD is better or one irm and LN model is better or the other one Though the analysis, we hope to explain why a certain method outperorms the other one

Commercial Auto Insurer A For insurer A, we derive the predictive distribution or each cell in the lower part o the triangle Then calculate the percentiles or actual incremental paid losses in the hold-out sample Uniorm pp-plots o percentiles with the p-value o K-S tests are shown in next slide LN model outperorms OD slightly We also compare mean error and mean absolute percentage error o the two methods over the 9 testing periods The result, to a great extent, agrees with K-S test

Commercial Auto Insurer A

Commercial Auto Insurer A In the next two slides, we analyze the predictive distributions rom the two methods 1st slide shows the predictive distributions or calendar year reserves - or early calendar years, LN provides wider distribution, as one moves to the bottom right o the lower triangle, LN provides narrow distribution - recall calendar year reserve is the sum o losses rom cells in the same diagonal nd slide shows the predictive distribution o each cell in calendar year CY=, that is calendar year 1998 - or top right cells on the diagonal, LN provides narrower distribution, and or bottom let cells, LN provides wider distribution - LN provides higher volatility or early development year

redictive distribution or calendar year reserves

redictive distribution or each cell o irst calendar year

Commercial Auto Insurer A We look into the pattern o the training data We show time-series plot o incremental losses or each accident year and over development lag Let panel shows losses and right panel shows loss ratio We observe high volatility in early development lag that might explain the better perormance o LN model

Commercial Auto Insurer B We did similar analysis or insurer B and the results are summarized in the ollowing three slides For insurer B, OD outperorms LN model slightly Again we observe that the wider predictive distribution or early calendar years rom LN model is explained by the wider distribution or early development year

Commercial Auto Insurer B

redictive distribution or calendar year reserves

redictive distribution or each cell o irst calendar year

Commercial Auto Insurer B Figures below show time-series plot o incremental losses or each accident year and over development lag Let panel shows losses and right panel shows loss ratio Dierent with insurer A, there is less volatility in early development years LN model might underit the data

ersonal Auto Insurer A For insurer A, we derive the predictive distribution or each cell in the lower part o the triangle Then calculate the percentiles or actual incremental paid losses in the hold-out sample Uniorm pp-plots o percentiles with the p-value o K-S tests are shown in next slide LN model outperorms OD We also compare mean error and mean absolute percentage error o the two methods over the 9 testing periods For each testing period, LN perorms better than OD

ersonal Auto Insurer A

ersonal Auto Insurer A In the next two slides, we analyze the predictive distributions rom the two methods 1st slide shows the predictive distributions or cells with development year 10 - the predictive distribution or all accident year are similar - LN provides narrower distributions nd slide shows the predictive distribution or cells in accident year 1997 - we want to see the eects over development lags - LN provides wider distribution or early development years and narrower distribution or later development years

redictive distributions or cells with development year 10

redictive distributions or cells with accident year 1997

ersonal Auto Insurer A Again look into the pattern o the training data We show time-series plot o incremental losses or each accident year and over development lag Let panel shows losses and right panel shows loss ratio Again the high volatility in early development lag that might explain the better perormance o LN model

ersonal Auto Insurer B We did similar analysis or insurer B and the results are summarized in the ollowing three slides For insurer B, OD outperorms LN model From the predictive distributions, we observe again LN provides wider distributions or early development years, while the distributions across accident years are similar under two methods

ersonal Auto Insurer B

redictive distributions or cells with development year 10

redictive distributions or cells with accident year 1997

ersonal Auto Insurer B Figures below show time-series plot o incremental losses or each accident year and over development lag Let panel shows losses and right panel shows loss ratio Dierent with insurer A, there is less volatility in early development years, especially or the loss ratio Thus LN model introduces more volatility and does not work well or this insurer

Concluding Remarks We use simple test to examine the perormance o loss reserving methods Our analysis is based on hold-out sample We ind the current industry standard over-estimate reserves or the industry We compare chain ladder and LN model on individual insurers Chain ladder ails in case o higher volatility Bayesian methods mitigates the potential overitting problem