Finite Mixtures for Insurance Modeling



Similar documents
Offset Techniques for Predictive Modeling for Insurance

The zero-adjusted Inverse Gaussian distribution as a model for insurance claims

A Deeper Look Inside Generalized Linear Models

The Distribution of Aggregate Life Insurance Claims

A Model of Optimum Tariff in Vehicle Fleet Insurance

Longitudinal Modeling of Singapore Motor Insurance

Development Period Observed Payments

More Flexible GLMs Zero-Inflated Models and Hybrid Models

A revisit of the hierarchical insurance claims modeling

A LOGNORMAL MODEL FOR INSURANCE CLAIMS DATA

AT&T Global Network Client for Windows Product Support Matrix January 29, 2015

Stochastic programming approaches to pricing in non-life insurance

SUMAN DUVVURU STAT 567 PROJECT REPORT

Modeling Individual Claims for Motor Third Party Liability of Insurance Companies in Albania

Analysis One Code Desc. Transaction Amount. Fiscal Period

Simulation Models for Business Planning and Economic Forecasting. Donald Erdman, SAS Institute Inc., Cary, NC

Executive Master of Public Administration. QUANTITATIVE TECHNIQUES I For Policy Making and Administration U6311, Sec. 003

Errata and updates for ASM Exam C/Exam 4 Manual (Sixteenth Edition) sorted by page

COMPARISON OF FIXED & VARIABLE RATES (25 YEARS) CHARTERED BANK ADMINISTERED INTEREST RATES - PRIME BUSINESS*

COMPARISON OF FIXED & VARIABLE RATES (25 YEARS) CHARTERED BANK ADMINISTERED INTEREST RATES - PRIME BUSINESS*

Travelers Analytics: U of M Stats 8053 Insurance Modeling Problem

Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics

Logistic Regression (1/24/13)

HURDLE AND SELECTION MODELS Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics July 2009

Service courses for graduate students in degree programs other than the MS or PhD programs in Biostatistics.

Monte Carlo Simulation

The Study of Chinese P&C Insurance Risk for the Purpose of. Solvency Capital Requirement

SAS Certificate Applied Statistics and SAS Programming

Methods for Interaction Detection in Predictive Modeling Using SAS Doug Thompson, PhD, Blue Cross Blue Shield of IL, NM, OK & TX, Chicago, IL

STT315 Chapter 4 Random Variables & Probability Distributions KM. Chapter 4.5, 6, 8 Probability Distributions for Continuous Random Variables

Proposal to Reduce Opening Hours at the Revenues & Benefits Coventry Call Centre

VI. Introduction to Logistic Regression

SAS Code to Select the Best Multiple Linear Regression Model for Multivariate Data Using Information Criteria

Predicting Customer Default Times using Survival Analysis Methods in SAS

Benchmark Rates for XL Reinsurance Revisited: Model Comparison for the Swiss MTPL Market

CHAPTER 6: Continuous Uniform Distribution: 6.1. Definition: The density function of the continuous random variable X on the interval [A, B] is.

How To Build A Predictive Model In Insurance

Maximum Likelihood Estimation

Case 2:08-cv ABC-E Document 1-4 Filed 04/15/2008 Page 1 of 138. Exhibit 8

i=1 In practice, the natural logarithm of the likelihood function, called the log-likelihood function and denoted by

A Primer on Mathematical Statistics and Univariate Distributions; The Normal Distribution; The GLM with the Normal Distribution

OMBU ENTERPRISES, LLC. Process Metrics. 3 Forest Ave. Swanzey, NH Phone: Fax: OmbuEnterprises@msn.

ANNUITY LAPSE RATE MODELING: TOBIT OR NOT TOBIT? 1. INTRODUCTION

SAS Software to Fit the Generalized Linear Model

Pricing Alternative forms of Commercial insurance cover. Andrew Harford

Modeling Customer Lifetime Value Using Survival Analysis An Application in the Telecommunications Industry

Approximation of Aggregate Losses Using Simulation

Variable Selection for Health Care Demand in Germany

Probability and Statistics Prof. Dr. Somesh Kumar Department of Mathematics Indian Institute of Technology, Kharagpur

Pricing Alternative forms of Commercial Insurance cover

Sales Forecast for Pickup Truck Parts:

Statistical Analysis of Life Insurance Policy Termination and Survivorship

Detection of changes in variance using binary segmentation and optimal partitioning

Model Fitting in PROC GENMOD Jean G. Orelien, Analytical Sciences, Inc.

Predicting Customer Churn in the Telecommunications Industry An Application of Survival Analysis Modeling Using SAS

Department of Epidemiology and Public Health Miller School of Medicine University of Miami

Operational Risk Modeling Analytics

Location matters. 3 techniques to incorporate geo-spatial effects in one's predictive model

RISKY LOSS DISTRIBUTIONS AND MODELING THE LOSS RESERVE PAY-OUT TAIL

Modeling the Implied Volatility Surface. Jim Gatheral Stanford Financial Mathematics Seminar February 28, 2003

Model Selection and Claim Frequency for Workers Compensation Insurance

Master s Theory Exam Spring 2006

Enhanced Vessel Traffic Management System Booking Slots Available and Vessels Booked per Day From 12-JAN-2016 To 30-JUN-2017

SAS Syntax and Output for Data Manipulation:

10/27/14. Consumer Credit Risk Management. Tackling the Challenges of Big Data Big Data Analytics. Andrew W. Lo

Probability Calculator

Hypothesis testing - Steps

Testing, Monitoring, and Dating Structural Changes in Exchange Rate Regimes

The Probit Link Function in Generalized Linear Models for Data Mining Applications

Measuring the Return on Marketing Investment

STOCHASTIC ANALYTICS. Chris Jermaine Rice University

lavaan: an R package for structural equation modeling

Share of Dealership Profits. Auto Lending Abuses: The Pitfalls of Financing Cars. Car Lending Abuses. The Trust Tax.

Using the Delta Method to Construct Confidence Intervals for Predicted Probabilities, Rates, and Discrete Changes


Operational Value-at-Risk in Case of Zero-inflated Frequency

Chapter 3: DISCRETE RANDOM VARIABLES AND PROBABILITY DISTRIBUTIONS. Part 3: Discrete Uniform Distribution Binomial Distribution

Logistic Regression (a type of Generalized Linear Model)

Reject Inference in Credit Scoring. Jie-Men Mok

Is log ratio a good value for measuring return in stock investments

Problem of Missing Data

Univariate Regression

I L L I N O I S UNIVERSITY OF ILLINOIS AT URBANA-CHAMPAIGN

PREDICTIVE DISTRIBUTIONS OF OUTSTANDING LIABILITIES IN GENERAL INSURANCE

The Combination Forecasting Model of Auto Sales Based on Seasonal Index and RBF Neural Network

CHAPTER 8 EXAMPLES: MIXTURE MODELING WITH LONGITUDINAL DATA

Package SHELF. February 5, 2016

Statistics in Retail Finance. Chapter 6: Behavioural models

Multiple Choice: 2 points each

Multivariate Normal Distribution

Hierarchical Insurance Claims Modeling

Transcription:

Matthew J. Flynn, PhD mattflynn@me.com 860-895-3530 Finite Mixtures for Insurance Modeling

Outline - Finite Mixture Models (FMM) Motivation Grouped claims, long-tailed lines JMP 9 Distribution Platform finite mixtures Interactive JMP Two-Component Normal mixture R two packages - flexmix, gamlss JMP s Nonlinear Platform SAS Proc NLMIXED More Examples Poisson counts, WC Losses

Joint work with Luyang Fu and Doug Pirtle of State Auto

HO by-peril example: heavier tail than lognormal

Motivation Personal Lines Auto Comprehensive Losses

JMP 9 includes finite 2,3+ component Normal mixtures

Interactive JMP Two-Component Normal mixture

Via R library(gamlss); library(gamlss.mx); m2 <- gamlssmx( waiting ~ 1, data=faithful, family=no, k=2); m2 library("flexmix") fl <- flexmix(waiting ~ 1, data = faithful, k = 2)

Via SAS Proc UNIVARIATE

SAS obtain starting values via Proc SQL /* two-component normal mixture */ proc sql; select log(mean(waiting)-0.5*var(waiting)**0.5) as mu1start, log(mean(waiting)+0.5*var(waiting)**0.5) as mu2start into :mu1start, :mu2start from faithful; quit;

SAS obtain starting values via Proc SQL /* two-component normal mixture */ proc sql; select log(mean(waiting)-0.5*var(waiting)**0.5) as mu1start, log(mean(waiting)+0.5*var(waiting)**0.5) as mu2start into :mu1start, :mu2start from faithful; quit; Create SAS Macro variables note: separation

Proc NLMIXED data=faithful; parms eta_mu1=&mu1start. eta_mu2=&mu2start. eta_sigma1=1.8 eta_sigma2=1.8 eta_p1=0.57 ; mu1 = exp(eta_mu1); mu2 = exp(eta_mu2); sigma1 = exp(eta_sigma1); sigma2 = exp(eta_sigma2); p1 = exp(eta_p1)/(1 + exp(eta_p1)); p2 = 1 - p1; y = waiting; Starting values (from above) Log link functions Normal 2 Component Finite Mixture loglikelihood loglike = logpdf('normalmix', y, 2, p1, p2, mu1, mu2, sigma1, sigma2) ; *loglike = logpdf('normal', y, mu1, sigma1)*p1 + (1 - p1)*logpdf('normal', y, mu2, sigma2); model y ~ general(loglike); estimate 'mu1' mu1; estimate 'mu2' mu2; estimate 'sigma1' sigma1; estimate 'sigma2' sigma2; estimate 'p1' p1; estimate 'p2' p2; run;

Via JMP nonlinear platform - setup dt = Current Data Table(); // set up the negative log likelihood with // starting values ll = dt << new column("normmix"); Exform = expr( ll << set formula(parameter( { eta_mu1=4.160438, eta_mu2=4.352785, eta_sigma1=1.8, eta_sigma2=1.8, eta_p1=-0.57 }, mu1 = exp(eta_mu1); mu2 = exp(eta_mu2); sigma1 = exp(eta_sigma1); sigma2 = exp(eta_sigma2); p1 = exp(eta_p1)/(1 + exp(eta_p1)); p2 = 1 - p1; -log( Normal Mixture Density( :waiting, mu1 / mu2, sigma1 / sigma2, p1 / p2 ) ) ) )); eval(eval Expr(exform));

Via JMP Nonlinear Platform nl = Nonlinear( Loss( :Normmix ), Numeric Derivatives Only( 1 ), Loss is Neg LogLikelihood( 1 ), QuasiNewton BFGS, Finish, Custom Estimate( exp(eta_mu1) ), Custom Estimate( exp(eta_mu2) ), Custom Estimate( exp(eta_sigma1) ), Custom Estimate( exp(eta_sigma2) ), Custom Estimate( exp(eta_p1)/(1 + exp(eta_p1)) ), Custom Estimate( 1 - exp(eta_p1)/(1 + exp(eta_p1)) ), );

Via JMP Nonlinear Platform Output

JMP Analyze, Distribution Platform

Time Permitting Additional Examples Proc FMM.sas FMM(2) Poisson Counts - regressors Exp_mix.sas FMM Exponential, Gamma dists WC_Loss.sas FMM Gamma with regressors

Further Reading Deb, Partha and J. F. Burgess Jr., A quasi-experimental comparison of statistical models for health care expenditures, 2003, wp, http://urban.hunter.cuny.edu/repec/htr/papers/debburgess10.pdf Deb, Partha and P. K. Trivedi, Demand for Medical Care by the Elderly: A Finite Mixture Approach, Journal of Applied Econometrics, 1997, 12, 313-336, http://qed.econ.queensu.ca/jae/1997- v12.3/deb-trivedi/ Grun, Bettina and Friedrich Leisch, Fitting Finite Mixtures of Generalized Linear Regressions in R, Computational Statistics and Data Analysis, 2006, http://statmath.wu.ac.at/projects/aasc/mixtures/gruen+leisch-2007b.pdf Klugman, Stuart and Jacques Rioux, Toward a unified approach to fitting loss models, North American Actuarial Journal, Jan-06, 10, 1, 63-83, http://www.iowaactuariesclub.org/library/lossmodels.pdf Lee, Andy H., Kui Wang, Kelvin K.W. Yau, Geoffrey J. McLachlan and S.K. Ng Maternity length of stay modeling by gamma mixture regression with random effects Biometrical Journal, Aug-2007, v49, n5, p750-764 http://www.maths.uq.edu.au/~gjm/lwymn_biomj07.doc Leisch, Friederich and Bettina Gruen, FlexMix Version 2: Finite mixtures with concomitant variables and varying and constant parameters, Journal of Statistical Software, 2007, 28(4), 1-35, http://cran.rproject.org/web/packages/flexmix/vignettes/mixture-regressions.pdf Park, Byung-Jung and Dominique Lord, Application of Finite Mixture Models for Vehicle Crash Data Analysis, wp, Feb-2009, https://ceprofs.civil.tamu.edu/dlord/papers/park_lord_%20finite_mixture_model.pdf

Further Reading Rempala, Grzegorz A. and Richard A. Derrig, Modeling Hidden Exposures in Claim Severity via the EM Algorithm, ASTIN Colloquia - Bergen, Norway Jun-2004, http://www.actuaries.org/astin/colloquia/bergen/rempala_derrig.pdf Stokes, Maura E., Fang Chen, and Ying So, On Deck SAS/STAT 9.3, SAS Global Forum, 2011, 331, http://support.sas.com/resources/papers/proceedings11/331-2011.pdf Teodorescu, Sandra, Different approaches to model the loss distribution of a real data set from motor third party liability insurance, Romanian Journal of Insurance, Apr-2010, 93-104, http://www.imaimi.ro/en/publications/assets/pdf/romanian%20journal%20of%20insurance%20year%202010%20no.4.pdf#page=94

Thank you Questions? Matt Flynn mjflynn@travelers.com 860.954.0894 #analytics2011