PREDICTIVE MODELLING FOR COMMERCIAL INSURANCE



Similar documents
Predictive Analytics for Life Insurance: How Data and Advanced Analytics are Changing the Business of Life Insurance Seminar May 23, 2012

How Will Predictive Modeling Change the P&C Industry in the Next 5-10 Years?

Model Validation Techniques

Data Mining Algorithms Part 1. Dejan Sarka

Predictive Modeling in Workers Compensation 2008 CAS Ratemaking Seminar

Data Mining Methods: Applications for Institutional Research

Anti-Trust Notice. Agenda. Three-Level Pricing Architect. Personal Lines Pricing. Commercial Lines Pricing. Conclusions Q&A

Introduction to Predictive Modeling Using GLMs

Risk pricing for Australian Motor Insurance

Combining Linear and Non-Linear Modeling Techniques: EMB America. Getting the Best of Two Worlds

Applied Data Mining Analysis: A Step-by-Step Introduction Using Real-World Data Sets

Predictive Modeling Techniques in Insurance

Gerry Hobbs, Department of Statistics, West Virginia University

Predictive Modeling and Big Data

Staying Ahead of the Analytical Competitive Curve: Integrating the Broad Range Applications of Predictive Modeling in a Competitive Market Environment

Insurance Analytics - analýza dat a prediktivní modelování v pojišťovnictví. Pavel Kříž. Seminář z aktuárských věd MFF 4.

Nine Common Types of Data Mining Techniques Used in Predictive Analytics

Location matters. 3 techniques to incorporate geo-spatial effects in one's predictive model

THE HYBRID CART-LOGIT MODEL IN CLASSIFICATION AND DATA MINING. Dan Steinberg and N. Scott Cardell

Easily Identify Your Best Customers

New Work Item for ISO Predictive Analytics (Initial Notes and Thoughts) Introduction

A Property & Casualty Insurance Predictive Modeling Process in SAS

Customer Segmentation and Predictive Modeling It s not an either / or decision.

Innovations and Value Creation in Predictive Modeling. David Cummings Vice President - Research

Why do statisticians "hate" us?

An Overview of Data Mining: Predictive Modeling for IR in the 21 st Century

How To Make A Credit Risk Model For A Bank Account

BOOSTED REGRESSION TREES: A MODERN WAY TO ENHANCE ACTUARIAL MODELLING

The Data Mining Process

BIG DATA and Opportunities in the Life Insurance Industry

Challenge. Solutions. Early results. Personal Lines Case Study Celina Insurance Reduces Expenses & Improves Processes Across the Business.

How To Build A Predictive Model In Insurance

Cross Validation. Dr. Thomas Jensen Expedia.com

EMB America PREDICTIVE MODELING: PITFALLS AND POTENTIALS A CONSULTANT S VIEWPOINT. Jeff Kucera Senior Consultant CAS Annual Meeting

not possible or was possible at a high cost for collecting the data.

Combining GLM and datamining techniques for modelling accident compensation data. Peter Mulquiney

Revising the ISO Commercial Auto Classification Plan. Anand Khare, FCAS, MAAA, CPCU

Assessing Data Mining: The State of the Practice

Insurance Premium Increase Optimization: Case Study. Charles Pollack B.Ec F.I.A.A.

Insurance Fraud Detection: MARS versus Neural Networks?

Benchmarking of different classes of models used for credit scoring

Umbrella & Excess Liability - Understanding & Quantifying Price Movement

A Short Tour of the Predictive Modeling Process

Data Mining Techniques Chapter 6: Decision Trees

Driving Insurance World through Science Murli D. Buluswar Chief Science Officer

Data Mining Applications in Higher Education

Why Use Dashboard Metrics? Because the right metrics matter. RKM Research and Communications, Inc., Portsmouth, NH. All Rights Reserved.

STATISTICA Formula Guide: Logistic Regression. Table of Contents

Predictive modelling around the world

Statistics for BIG data

Data Mining. Nonlinear Classification

SOA 2013 Life & Annuity Symposium May 6-7, Session 30 PD, Predictive Modeling Applications for Life and Annuity Pricing and Underwriting

Data Mining Techniques

Summary. WHITE PAPER Using Segmented Models for Better Decisions

Development Period Observed Payments

BIG DATA Driven Innovations in the Life Insurance Industry

Personal Auto Predictive Modeling Update: What s Next? Roosevelt Mosley, FCAS, MAAA CAS Predictive Modeling Seminar October 6 7, 2008 San Diego, CA

Comparing the Results of Support Vector Machines with Traditional Data Mining Algorithms

AN INTRODUCTION TO PREMIUM TREND

How To Price Insurance In Canada

OPTIMIZING YOUR MARKETING STRATEGY THROUGH MODELED TARGETING

Data Mining: An Introduction

An Introduction to Advanced Analytics and Data Mining

Better credit models benefit us all

A Property and Casualty Insurance Predictive Modeling Process in SAS

Football Match Winner Prediction

Data Mining - Evaluation of Classifiers

Chapter 12 Discovering New Knowledge Data Mining

Report to the 79 th Legislature. Use of Credit Information by Insurers in Texas

A Comparison of Decision Tree and Logistic Regression Model Xianzhe Chen, North Dakota State University, Fargo, ND

Studying Auto Insurance Data

A Review of Data Mining Techniques

Addressing Analytics Challenges in the Insurance Industry. Noe Tuason California State Automobile Association

Machine Learning Methods for Causal Effects. Susan Athey, Stanford University Guido Imbens, Stanford University

Data mining and statistical models in marketing campaigns of BT Retail

DATA MINING TECHNOLOGY. Keywords: data mining, data warehouse, knowledge discovery, OLAP, OLAM.

International Journal of Computer Science Trends and Technology (IJCST) Volume 2 Issue 3, May-Jun 2014

Agenda. Mathias Lanner Sas Institute. Predictive Modeling Applications. Predictive Modeling Training Data. Beslutsträd och andra prediktiva modeller

Data Mining Solutions for the Business Environment

DATA MINING TECHNIQUES AND APPLICATIONS

Information Visualization WS 2013/14 11 Visual Analytics

Building risk prediction models - with a focus on Genome-Wide Association Studies. Charles Kooperberg

We discuss 2 resampling methods in this chapter - cross-validation - the bootstrap

Session 62 TS, Predictive Modeling for Actuaries: Predictive Modeling Techniques in Insurance Moderator: Yonasan Schwartz, FSA, MAAA

Direct Marketing of Insurance. Integration of Marketing, Pricing and Underwriting

Predictive Analytics: Achieving Greater Decision Accuracy, Better Risk Segmentation, and Greater Profitability

Predictive Analytics Worksheet Using Data to Reveal Opportunities

HOW PREDICTIVE ANALYTICS DRIVES PROFITABILITY IN ASSET FINANCE

Data Mining in Pharmaceutical Marketing and Sales Analysis. Andrew Chabak Rembrandt Group

Handling attrition and non-response in longitudinal data

Variable Reduction for Predictive Modeling with Clustering Robert Sanche, and Kevin Lonergan, FCAS

Marketing Mix Modelling and Big Data P. M Cain

Tree Ensembles: The Power of Post- Processing. December 2012 Dan Steinberg Mikhail Golovnya Salford Systems

Data Mining for Fun and Profit

Application of SAS! Enterprise Miner in Credit Risk Analytics. Presented by Minakshi Srivastava, VP, Bank of America

White Paper. Redefine Your Analytics Journey With Self-Service Data Discovery and Interactive Predictive Analytics

Data Mining Practical Machine Learning Tools and Techniques

Silvermine House Steenberg Office Park, Tokai 7945 Cape Town, South Africa Telephone:

MSCA Introduction to Statistical Concepts

Data Mining Approaches to Modeling Insurance Risk. Dan Steinberg, Mikhail Golovnya, Scott Cardell. Salford Systems 2009

Transcription:

PREDICTIVE MODELLING FOR COMMERCIAL INSURANCE General Insurance Pricing Seminar 13 June 2008 London James Guszcza, FCAS, MAAA jguszcza@deloitte.com

General Themes

Predictive modelling: 3 Levels of Discussion Strategy Profitable Growth Right-pricing Improved retention Methodology Model design (actuarial) Modelling process (modern machine learning POV) Technique GLM vs classification trees vs neural networks

Methodology vs Technique Technique is only one facet of overall methodology. It s not enough to be statisticians we must be actuarial statisticians. How does predictive modelling need actuarial science? Variable creation Model design Model validation How does actuarial science need predictive modelling? Advances in computing, modelling techniques Ideas from other fields can be applied to insurance problems

Semantics: Data Mining vs Predictive Modelling Data Mining: knowledge discovery, often in large industrial databases KDD Data exploration techniques (some brute force) Data visualization e.g. discover strength of credit variables Predictive Modelling: Application statistical techniques (like GLM) after knowledge discovery phase is completed. Quantify & synthesize relationships found during KDD phase e.g. build a credit model

Aside: A Famous Example of KDD in Insurance Mid-90 s: insurers discovered a strikingly powerful relationship between personal credit score and personal motor / homeowners claim propensity. The reason why was (is?) mysterious. The discovery and the business benefit did not hinge on particularly advanced statistical techniques. A dramatic illustration of the business value of the data mining / KDD paradigm. KDD is fact-finding.

Commercial Insurance vs Personal Insurance Personal insurance modelling is a nice statistical problem. Many data points Straightforward exposure base (car-year) Many well understood pricing factors In the UK s liberal market especially, prices can be determined scientifically GLM-based loss cost modelling Elasticity modelling, price optimisation Controlled pricing experiments

Commercial Insurance vs Personal Insurance Commercial insurance modelling is a messy statistical problem. Fewer data points especially for new business Often lower frequency / higher severity Heterogeneous risks The corner bakery vs the suburban über-market Complex exposure bases (sales, payroll, feet 2 ) Messy data Risk selection/pricing often a free for all Underwriter Subjectivity

Strategy: Why Undertake a Modelling Project?

The Parable of Moneyball (Or: How Underwriting is Like Baseball) In 1999 Billy Beane (manager of the Oakland Athletics) found a novel use of data mining. A s not a wealthy team: ranked 12th (out of 14) in payroll How could the A s compete with the rich teams? Beane hired a junior statistician (Paul depodesta) to analyze statistics advocated by baseball guru Bill James. Using predictive analytics, Beane was able to hire excellent players undervalued by the market. A year after Beane took over, the A s ranked 2nd!

The Implication Beane quantified how well a player would do. Not perfectly, just better than his peers He realized that statistical regularities are more reliable than baseball scouts subjective, expert judgments. Implication: Be on the lookout for fields where an expert is required to reach a decision based on judgmentally synthesizing quantifiable information across many dimensions. (Does this sound like commercial insurance underwriting?) Maybe a predictive model can beat the human expert.

Mental Accounting Take a guess: which is a worse EL risk?... and by how much? Flower shop 4 employees 5 year-old business 2 EL claims in past 5 years Credit: 70 th %ile Pub 10 employees 15 year-old business Most recent EL claim: 4 years ago Credit: 90 th %ile Unlike a human decision-maker, a predictive algorithm knows how much weight to give each consideration. Just as the A s used models to select players, commercial insurers use models to select and price risks. Humans are predictably irrational but models don t engage in creative mental accounting.

Keeping Score Billy Beane Beane s Scouts Potential Team Member Bill James stats Billy Bean s Super Cruncher CEO who wants to run the next Progressive Insurance Commercial Insurance Underwriters Potential Policyholder Innovative collection of predictive variables You and me

The Moral of Our Parable Billy Beane has arguably transformed US professional sports by introducing the strategic use of predictive analytics to baseball. The way Beane crunched his numbers was determined by his business strategy: Exploit an inefficient and subjective market for baseball players. Similarly in the commercial insurance domain: Start off by trying to understand the business/strategic context. Allow the modelling strategy to conform to the business strategy, not vice versa.

Competing on Analytics In Competing on Analytics, Tom Davenport defines: An analytic competitor [is] an organization that uses analytics extensively and systematically to outthink and out-execute the competition. Think of predictive modelling as a strategic capacity not just another actuarial tool. The most valuable modelling projects are an integral part of a company s core strategy.

More Business Considerations Davenport: truly analytic competitors promulgate an analytic and fact-based culture from the top down. A related point: culture change is often a critical part of implementing a predictive model. A model can be worse than nothing if it is implemented improperly and/or if critical users do not buy into it. Building models is only a one phase of a predictive modelling project. Planning, data scrubbing, project management, IT implementation, business implementation often dwarf the modelling part of the project. Modelling is the fun part, not the hard part! Highly multi-disciplinary process.

Methodology: Integrating Concepts from Statistics, Actuarial Science, Machine Learning

Concepts from Modern Statistics Generalized Linear Models Goodness-of-fit measures R 2, AIC, BIC, Nested models, analysis of deviance, F-tests, Graphical analysis of model fit Graphical residual analysis Variance estimators Bayesian credibility Bootstrapping, simulation ( you know the drill) But these doesn t exhaust modern predictive modelling

Concepts from Modern Machine Learning Data Mining and KDD Brute-force search techniques Scoring engines A predictive model by any other name Lift Curves Operationally meaningful measure of predictive power Out-of-sample model tests, cross-validation Ideally yield unbiased estimates of predictive power Alternative to AIC, BIC

Scoring Engines Scoring engine: (non)linear function of multiple predictors: score = f(x 1, X 2,, X N ) Used for segmentation. The X 1, X 2,, X N are as important as the f( ) Major reason why actuarial expertise is necessary. A large part of the modelling process consists of variable creation and selection Often possible to generate 100 s of variables Steepest part of the learning curve Data scrubbing / variable creation is time-consuming

Model Evaluation the Lift Curve Sort data by model score Policy Profitability Model -- Lift Curve Break the dataset into 10 equal pieces Best decile : lowest score lowest LR Worst decile : highest score highest LR Difference: Lift Lift = segmentation power Loss Ratio Relativity -0.5 0.0 0.5 Lift ROI of the modelling project model decile

Out-of-Sample Model Validation Randomly divide data into 3 pieces Training data, Test data, Validation data Use Training data to fit models Score the Test data to create a lift curve Perform the train/test steps iteratively until you have a model you re happy with Test data is implicitly used in building the final model test lift is overly optimistic During this iterative phase, validation data is set aside in a lock box Once model has been finalized, score the Validation data and produce a lift curve Unbiased estimate of future performance

Credit Scoring is a Classic Example All four of our machine learning concepts apply to Credit Scoring. Knowledge discovery in databases (KDD) Scoring engine Lift Curve evaluation translates to LR improvement ROI Blind-test validation Credit scoring has been the insurance industry s segue into the modern synthesis of classical statistics with machine learning concepts. Very useful paradigm in the context of commercial insurance modelling.

Concepts from Actuarial Science Overall design of model / analysis What are we trying to predict? At what level? Predictive variable creation Calls on subject-matter expertise of insurance Target variable creation Loss development and trending Whether/how to use premium Deductibles, claim/claimant level, etc Considerations of time periods Analysis file creation Level of the analysis risk, policy, account, Inclusions / exclusions

What are we Trying to Predict? Pricing: Pure Premium Underwriting: Profitability Premium audit: Additional / returned premium Retention models Cross-sell models Elasticity models Agent/agency profitability Target marketing Fraud detection Again the modelling strategy should follow the business strategy. No one-size-fits-all answer

Variable Creation Research possible data sources Extract/purchase data Check data for quality (QA) Messy! (we are still toiling deep in the data mines) Create Predictive and Target Variables Opportunity to quantify tribal wisdom and come up with new ideas Can be a very big task! Steepest part of the learning curve

Types of Predictive Variables Behavioral Prior claims, bill-paying, credit Policyholder Business class, age, # employees Policy specifics Number of buildings, Construction Type Territorial Geo-demographic, economic, weather

Data Exploration & Variable Transformation 1-way analyses of predictive variables Weed out weak / redundant variables Correlation study of predictive variables Avoid multicollineariliy further weeding out Exploratory Data Analysis (EDA) Advanced techniques can be helpful Data Visualization very helpful here Use EDA to cap / transform predictive variables Extreme values, missing values, etc

Modeling Process 1. Finalize set of transformed predictive variables 2. Iterative training / testing of candidate models Build candidate models on training data Evaluate on test data Many things to tweak Different target variables Different predictive variables Different modelling techniques # NN nodes, hidden layers; tree splitting rules; tuning parameters 3. Select & validate final model Use as-yet untouched validation data

Some Pragmatic Considerations Do signs / magnitudes of parameters make sense? Statistically significant? Is the model biased for/against certain types of policies? Regions? Policy sizes? Business classes?... If so, is that an appropriate thing, or not? Predictive power holds up for larger policies? Continuity Are there small changes in input values resulting in large score swings? Could an agent or underwriter game the model?

Model Analysis & Implementation Perform model analytics Necessary for client to gain comfort with the model Calibrate Models Create user-friendly scale client dictates Implement models Technical: IT skills are critical here Business: Culture change can be critical Monitor performance Distribution of scores over time, predictiveness, usage of model... Plan model maintenance

Technique: Regressions and its Relations Artificial Neural Networks MARS CART

Regression and its Relations GLM: relaxes some regression assumptions Assume linearity on link function scale Variance is modeled as a function of expected value MARS & Neural Networks Clever ways of automatically transforming and interacting input variables Why: sometimes the true relationships aren t linear Universal approximators: model any functional form CART is simplified MARS

Uses of Advanced Techniques Alternatives to GLM models Provide benchmarks for GLM models Exploratory data analysis (especially CART) Variable selection Detection of interaction terms Detection of optimal variable transformations

Neural Networks: Architecture A neural net models Y as a complicated non-linear function of X. 1 Lingo Green: input layer Red: hidden layer Yellow: output layer X 1 a 11 a 12 a 01 Z 1 b 1 1 b 0 The {a, b} numbers are weights to be estimated. X 2 a 21 a 22 Y The network architecture and the weights constitute the model. X 3 a 31 Z 2 a 02 b 2 a 32 1

Neural Networks: Functional Form Z 1 e 01 11 1 1 a b x b x b x 1 21 2 31 3 Z 1 e Y 1 e b 02 0 1 b 12 1 1 2 a b x b x b x z 1 1 b 22 2 z 2 2 32 3 X 1 X 2 a 11 a 12 a 21 a 22 1 a 01 Z 1 b 1 1 b 0 Y These look like logit models. NN is thus related to GLM. X 3 a 31 Z 2 a 02 b 2 a 32 1

y y y y y y y y y y y y y y y y MARS Multivariate Adaptive Regression Splines Automatically searches a space of basis functions for the right combination to model complex, multi-dimensional, non-linear patterns. Basis functions look like hockey sticks MARS model is a linear model of hockey sticks and interactions of hockey sticks. Cross-validation is built into the core MARS algorithm. y -2-1 0 1 2 0 15 0 20 0 30 70 0 40 100 0 20 40 60 80 100 x 0 20 40 60 80 100 x 0 20 40 60 80 100 x 0 10 20 0 20 40 0 30 0 40 0 20 40 60 80 100 x 0 20 40 60 80 100 x 0 20 40 60 80 100 x 0 5 15 0 15 35 0 30 60 0 40 80 0 20 40 60 80 100 x 0 20 40 60 80 100 x 0 20 40 60 80 100 x 0 4 8 0 15 30 0 30 0 40 0 20 40 60 80 100 x 0 20 40 60 80 100 x 0 20 40 60 80 100 x 0 20 40 60 80 100 0 20 40 60 80 100 0 20 40 60 80 100 0 20 40 60 80 100 x x x x 0 20 40 60 80 100 Linear model offers a poor fit x MARS considers basis function transformations

MARS Result y = 0.29 + 0.02*x y = 0.29 + 0.02*x - 0.086*max(0,x-35) MARS performs a stepwise search and the prunes back. Cross-validation is used to determine optimally complex model. y -2-1 0 1 2 0 20 40 60 80 100 y -2-1 0 1 2 0 20 40 60 80 100 x x The final MARS model is: y^ = 0.29 + 0.02*x - 0.086*max(0,x-35) + 0.084*max(0,x-65) This is a GLM model! A more complex example would have multiple variables and interactions. y -2-1 0 1 2 y = 0.29 + 0.02*x - 0.086*max(0,x-35) + 0.084*max(0,x-65) 0 20 40 60 80 100 x

CART: Recursive Partitioning Classification And Regression Trees Key idea: recursive partitioning Take all of the data. Consider all possible values of all variables. Select the variable/value (X=t 1 ) that produces the greatest separation in the target. (X=t 1 ) is called a split. If X< t 1 then send the data to the left ; otherwise, send data point to the right. Now repeat same process on these two nodes. You get a tree-structured model. As with MARS, cross-validation is used to prune back.

Commercial Insurance Example Suppose you have 3 variables: # vehicles: {1,2,3 10 + } Age category: Liability-only: {0,1} {1,2,3 6} At each iteration, CART tests all 15 splits. (#veh<2), (#veh<3),, (#veh<10) (age<2),, (age<6) (lia<1) Select split resulting in greatest increase in purity. Perfect purity: each split has either all claims or all no-claims. Perfect impurity: each split has same proportion of claims as overall population. Then iterate grow the tree out then prune back

Example of a Split Commercial Auto Dataset 57,000 policies 34% claim frequency Predict likelihood of claim Classification Tree using Gini splitting rule NUM_V EH <= 4.500 Node 1 NUM_V EH Clas s Cas es % 0 37891 66.2 1 19312 33.8 N = 57203 NUM_V EH > 4.500 First split: Policies with 5 vehicles have 58% claim frequency Else 20% Big increase in purity Terminal Node 1 Clas s Cas es % 0 29083 80.0 1 7276 20.0 N = 36359 Terminal Node 2 Clas s Cas es % 0 8808 42.3 1 12036 57.7 N = 20844

Growing The Tree Node 1 NUM_V EH N = 57203 NUM_V EH <= 4.500 Node 2 LIAB_ONLY N = 36359 NUM_V EH > 4.500 Node 4 NUM_V EH N = 20844 LIAB_ONLY > 0.500 NUM_V EH > 10.500 LIAB_ONLY <= 0.500 Node 3 FREQ1_F_RPT N = 28489 Terminal Node 3 Class = 0 Class Cases % 0 7591 96.5 1 279 3.5 N = 7870 NUM_V EH <= 10.500 Node 5 AVGAGE_CAT N = 11707 Terminal Node 6 Class = 1 Class Cases % 0 2409 26.4 1 6728 73.6 N = 9137 FREQ1_F_RPT <= 0.500 FREQ1_F_RPT > 0.500 AVGAGE_CAT <= 8.500 AVGAGE_CAT > 8.500 Terminal Node 1 Class = 0 Class Cases % 0 18984 78.7 1 5138 21.3 N = 24122 Terminal Node 2 Class = 1 Class Cases % 0 2508 57.4 1 1859 42.6 N = 4367 Terminal Node 4 Class = 1 Class Cases % 0 4327 48.1 1 4671 51.9 N = 8998 Terminal Node 5 Class = 0 Class Cases % 0 2072 76.5 1 637 23.5 N = 2709

Bringing it All Back Home Remember that a MARS model is a GLM model fit on basisfunction-transformed variables. as well as interactions thereof A CART model is like a MARS model in which the hockey stick basis functions are replaced with {0,1} step functions. tree-structured regression Thus like MARS and NNET models CART models are relatives of regression models. Only connect. E.M. Forster

References For Beginners: Data Mining Techniques --Michael Berry & Gordon Linhoff For Mavens: The Elements of Statistical Learning --Jerome Friedman, Trevor Hastie, Robert Tibshirani

PREDICTIVE MODELLING FOR COMMERCIAL INSURANCE General Insurance Pricing Seminar 13 June 2008 London James Guszcza, FCAS, MAAA jguszcza@deloitte.com