Predictive Modeling Techniques in Insurance



Similar documents
Session 62 TS, Predictive Modeling for Actuaries: Predictive Modeling Techniques in Insurance Moderator: Yonasan Schwartz, FSA, MAAA

Machine Learning with MATLAB David Willingham Application Engineer

SOA 2013 Life & Annuity Symposium May 6-7, Session 30 PD, Predictive Modeling Applications for Life and Annuity Pricing and Underwriting

Big Data and Advanced Analytics: Are You Behind the Competition?

Predictive Modeling and Big Data

Predictive modelling around the world

Session C-31: Predictive Analytics in P&C Insurance

Predictive Modeling in Workers Compensation 2008 CAS Ratemaking Seminar

Is a Data Scientist the New Quant? Stuart Kozola MathWorks

Combining Linear and Non-Linear Modeling Techniques: EMB America. Getting the Best of Two Worlds

ElegantJ BI. White Paper. The Competitive Advantage of Business Intelligence (BI) Forecasting and Predictive Analysis

Example: Credit card default, we may be more interested in predicting the probabilty of a default than classifying individuals as default or not.

CONTENTS PREFACE 1 INTRODUCTION 1 2 DATA VISUALIZATION 19

BIG DATA and Opportunities in the Life Insurance Industry

Local classification and local likelihoods

Azure Machine Learning, SQL Data Mining and R

BIG DATA Driven Innovations in the Life Insurance Industry

A Property & Casualty Insurance Predictive Modeling Process in SAS

Logistic Regression (a type of Generalized Linear Model)

APPROACHABLE ANALYTICS MAKING SENSE OF DATA

Lavastorm Analytic Library Predictive and Statistical Analytics Node Pack FAQs

DATA MINING TECHNIQUES AND APPLICATIONS

Principles of Data Mining by Hand&Mannila&Smyth

Predictive Analytics Techniques: What to Use For Your Big Data. March 26, 2014 Fern Halper, PhD

Data Mining + Business Intelligence. Integration, Design and Implementation

The Data Mining Process

A Deeper Look Inside Generalized Linear Models

Data Mining for Business Intelligence. Concepts, Techniques, and Applications in Microsoft Office Excel with XLMiner. 2nd Edition

BIOINF 585 Fall 2015 Machine Learning for Systems Biology & Clinical Informatics

Practical Data Science with Azure Machine Learning, SQL Data Mining, and R

An Introduction to Data Mining. Big Data World. Related Fields and Disciplines. What is Data Mining? 2/12/2015

Insurance Analytics - analýza dat a prediktivní modelování v pojišťovnictví. Pavel Kříž. Seminář z aktuárských věd MFF 4.

WebFOCUS RStat. RStat. Predict the Future and Make Effective Decisions Today. WebFOCUS RStat

BayesX - Software for Bayesian Inference in Structured Additive Regression

Data Mining in CRM & Direct Marketing. Jun Du The University of Western Ontario jdu43@uwo.ca

Maximizing Return and Minimizing Cost with the Decision Management Systems

How Organisations Are Using Data Mining Techniques To Gain a Competitive Advantage John Spooner SAS UK

Predictive Analytics

Virtual Site Event. Predictive Analytics: What Managers Need to Know. Presented by: Paul Arnest, MS, MBA, PMP February 11, 2015

ANALYTICS CENTER LEARNING PROGRAM

Practical applications of Predictive Modelling Overview of the process, the techniques and the applications

Regression Modeling Strategies

Nine Common Types of Data Mining Techniques Used in Predictive Analytics

Data Mining Algorithms Part 1. Dejan Sarka

Data Mining mit der JMSL Numerical Library for Java Applications

Introduction to Predictive Modeling Using GLMs

Predictive Modeling for Life Insurers

Maschinelles Lernen mit MATLAB

Data Mining with SAS. Mathias Lanner Copyright 2010 SAS Institute Inc. All rights reserved.

Get to Know the IBM SPSS Product Portfolio

Turning Data into Actionable Insights: Predictive Analytics with MATLAB WHITE PAPER

Statistics in Retail Finance. Chapter 6: Behavioural models

KATE GLEASON COLLEGE OF ENGINEERING. John D. Hromi Center for Quality and Applied Statistics

Operationalising Predictive Insights

How To Understand The Theory Of Probability

Using Data Mining for Mobile Communication Clustering and Characterization

Using Ensemble of Decision Trees to Forecast Travel Time

Internal audit value optimization for insurance organizations

Data Mining. Nonlinear Classification

High Performance Predictive Analytics in R and Hadoop:

A Basic Guide to Modeling Techniques for All Direct Marketing Challenges

TNS EX A MINE BehaviourForecast Predictive Analytics for CRM. TNS Infratest Applied Marketing Science

Advanced analytics at your hands

SAS Fraud Framework for Banking

Better credit models benefit us all

EARLY WARNING INDICATOR FOR TURKISH NON-LIFE INSURANCE COMPANIES

Predictive Analytics 101

Learning outcomes. Knowledge and understanding. Competence and skills

Chapter 12 Discovering New Knowledge Data Mining

Data Mining - Evaluation of Classifiers

business statistics using Excel OXFORD UNIVERSITY PRESS Glyn Davis & Branko Pecar

BOOSTED REGRESSION TREES: A MODERN WAY TO ENHANCE ACTUARIAL MODELLING

Studying Auto Insurance Data

Data Mining with SQL Server Data Tools

Data Mining: Overview. What is Data Mining?

240ST014 - Data Analysis of Transport and Logistics

MS1b Statistical Data Mining

An Introduction to Data Mining

HT2015: SC4 Statistical Data Mining and Machine Learning

Predictive Data modeling for health care: Comparative performance study of different prediction models

Silvermine House Steenberg Office Park, Tokai 7945 Cape Town, South Africa Telephone:

Whitepaper. Power of Predictive Analytics. Published on: March 2010 Author: Sumant Sahoo

Easily Identify Your Best Customers

Travelers Analytics: U of M Stats 8053 Insurance Modeling Problem

DecisionCraft Proposition

Predict Influencers in the Social Network

Technology and Trends for Smarter Business Analytics

Knowledge Discovery and Data Mining

Course Syllabus. Purposes of Course:

Maximize Revenues on your Customer Loyalty Program using Predictive Analytics

Quick Start. Creating a Scoring Application. RStat. Based on a Decision Tree Model

Risk pricing for Australian Motor Insurance

Transcription:

Predictive Modeling Techniques in Insurance Tuesday May 5, 2015 JF. Breton Application Engineer 2014 The MathWorks, Inc. 1

Opening Presenter: JF. Breton: 13 years of experience in predictive analytics and risk management in insurance and banking This session: Cover different best-practice predictive modeling techniques in insurance Show how these can answer practical business questions with MATLAB Takeaways: Big data and new available technology are creating new opportunities in the insurance industry MATLAB can quickly help you get value from your data with its predictive analytics capability 2

Agenda Predictive modeling background Case study Claim settlements forecasting Conclusion (Warning: Crowded and busy slides ahead!) 3

What is predictive modeling? Use of mathematical language to make predictions about the future More of an art than a science (lots of trial and error) Input/ Predictors Predictive model Output/ Response 4

Predictive modeling applications in insurance Price risks Identify target clienteles Predict fraud Claims triage Set assumptions and reserves And many more reasons 5

Where does it fit? 6

Trends that drive the use of these techniques Available technology and large amount of data Increased need for customized products/services Pressure on top and bottom lines of income statement (ref: Insurance Risk and Reward, The Economist 3/18/2015) 7

Historical perspective: P&C vs. Life & Annuity P&C industry has matured much faster Life & Annuity Credit and risk scores have been used to predict future P&C claims for over 20 years Short duration P&C products have limited tail risk compared to most life contracts Mortality studies can require several years of data to analyze but Life & annuity insurers are catching up 8

2013 Insurance predictive modeling survey Impacts Predictive models now widely used yet still mainly in pricing and underwriting Benefits seen on profitability, risk reduction and operational efficiency Challenges Lack of sufficient data attributes and skilled modelers Data prep and model deployment can often take +3 months each Big Data is currently mainly leveraged by large insurers (Source: Earnix) 9

The future: Increased focus on these techniques? Disruptions may occur from outside the industry From risk pooling to Big Mother Application completed Disruption From Non-Traditional Insurance Providers Predictive Model Determines UW class Identify lifestyle based risks after policy underwritten and issued Health /Lifestyle feedback Provided to p/h Positive feedback/coaching included in contract Life Insurance Policy Issued Predictive Model run annually on all policies Policyholder Chooses to incorporate feedback Policyholder incentive to reduce risk Social Media Data Geospatial Data Premium Adjusted / Lapse Decision Reduce tail risk with adjustable premium 10

100 90 80 70 60 50 40 30 20 10 0 Bank Marketing Campaign Misclassification Rate No Misclassified Yes Misclassified Some applications and associated techniques Predicting loss severity (parametric) Generalized Linear Model Predicting S&P 500 for annuities (time series) ARIMA modeling GARCH modeling S&P 500 1800 1700 1600 1500 1400 1300 1200 1100 1000 Original Data Simulated Data Realized vs Median Forecasted Path 900 800 May-01 Feb-04 Nov-06 Aug-09 May-12 Predicting customer response (non-parametric) Classification learning techniques Percentage Measure accuracy and compare models Neural Net Logistic Regression Discriminant Analysis k-nearest Neighbors Naive Bayes Support VM Decision Trees TreeBagger Reduced TB 11

Our focus on today: Learning Techniques Parametric (Statistical) Non-parametric Supervised Learning (The target is known) Linear Regression Time Series Generalized Linear Models Hazard Models Mixed Effect Models Neural Networks CART (Classification and Regression Trees) Random Forests MARS (Multivariate Adaptive Regression Splines) Unsupervised Learning (The target is unknown) Cluster Analysis (i.e. K-means) Principal Components Analysis Neural Networks 12

GLM as primary method of loss cost analysis Primary method Additional methods Principal component analysis Source: Towers Watson survey in 2014 13 13

Flexibility of GLMs Common GLM Applications: Technique Link Function Distribution Application Classical Regression (Ordinary Least Squares) Identity: g(µ)=µ Normal General Scoring Models Logistical Regression Logit: g(µ)= log[µ/(1-µ)] Binomial Binary Target Applications (i.e. Retention) Frequency Modeling Log: g(µ)=log(µ) Poisson Negative Binomial Count Target Variable Frequency Modelnig Severity Modeling Inverse: g(µ)=(-1/µ) Gamma Size of claim modeling Severity Modeling Inverse Squared: g(µ)=(-1/µ^2)) Inverse Gaussian Size of claim modeling 14

Other popular techniques and applications GLM Product Development Develop product assumptions based on prior products CART (Tree models) Neural Networks Marketing Underwriting Targeted marketing campaigns Customer segmentation Streamline/reduce UW requirements Audit UW process Clustering Retention Align customer retention with customer value Random Forests Time Series/ Survival Models Claims Inforce Management Enhance claims forecasting Fraud detection Refine valuation assumptions Understand drivers of policyholder behavior 15

Predictive modeling workflow Select Model & Predictors Data Train the Model Use for Prediction Import Data Explore Data Prepare Data Known data Known responses Model Model New Data Predicted Responses Measure Accuracy 16

Best practice #1: Testing and performance monitoring Modeling Data Iterative model building Train X% Test Y% Model Validation Model Implementation Ongoing Performance Monitoring Validation Z% 17

Best practice #2: Refining the model Simplify your model when possible Remove predictors, decrease iterations Adding synthetic variables Example: is the contract owner the insured? Refining the results with additional models or rules Underwriting rules Principle Components Analysis Combinations of different models 18

Agenda Predictive modeling background Case study Claim settlements forecasting Conclusion 19

Claim Settlements Forecasting Goal: Improve reserves by producing accurate model to predict insurance claim settlement amounts Approach: Train a regression model using different techniques Measure accuracy and compare approaches Use model for prediction 20

Takeaways of case study Interactive environment Visual tools for exploratory data analysis Easy to evaluate and choose best algorithm Apps available to help you get started (e.g,. neural network tool, curve fitting tool) Multiple algorithms to choose from Clustering Classification Regression 21

Agenda Predictive modeling background Case study Claim settlements forecasting Conclusion 22

Summary Big data and new available technology are creating new opportunities in the insurance industry MATLAB can quickly help you get value from your data with its predictive analytics capability 23

Q&A Questions!?! 24