A Deeper Look Inside Generalized Linear Models

Similar documents
How To Build A Predictive Model In Insurance

Travelers Analytics: U of M Stats 8053 Insurance Modeling Problem

Offset Techniques for Predictive Modeling for Insurance

Anti-Trust Notice. Agenda. Three-Level Pricing Architect. Personal Lines Pricing. Commercial Lines Pricing. Conclusions Q&A

Combining Linear and Non-Linear Modeling Techniques: EMB America. Getting the Best of Two Worlds

My name is Steven Lehmann. I am a Principal with Pinnacle Actuarial Resources, Inc., an actuarial consulting

GLM I An Introduction to Generalized Linear Models

How To Price Insurance In Canada

Introduction to Predictive Modeling Using GLMs

Predictive Modeling Techniques in Insurance

Predictive Modeling and Big Data

Personal Auto Predictive Modeling Update: What s Next? Roosevelt Mosley, FCAS, MAAA CAS Predictive Modeling Seminar October 6 7, 2008 San Diego, CA

Stochastic programming approaches to pricing in non-life insurance

Poisson Models for Count Data

BOOSTED REGRESSION TREES: A MODERN WAY TO ENHANCE ACTUARIAL MODELLING

Predictive Modeling in Workers Compensation 2008 CAS Ratemaking Seminar

Staying Ahead of the Analytical Competitive Curve: Integrating the Broad Range Applications of Predictive Modeling in a Competitive Market Environment

We hope you will find this sample a useful supplement to your existing educational materials, and we look forward to receiving your comments.

Development Period Observed Payments

SAS Software to Fit the Generalized Linear Model

Math 370/408, Spring 2008 Prof. A.J. Hildebrand. Actuarial Exam Practice Problem Set 2 Solutions

Major Trends in the Insurance Industry

Revising the ISO Commercial Auto Classification Plan. Anand Khare, FCAS, MAAA, CPCU

Usage-based Auto Insurance (UBI)

More Flexible GLMs Zero-Inflated Models and Hybrid Models

Measuring per-mile risk for pay-as-youdrive automobile insurance. Eric Minikel CAS Ratemaking & Product Management Seminar March 20, 2012

GENERALIZED LINEAR MODELS IN VEHICLE INSURANCE

SOA 2013 Life & Annuity Symposium May 6-7, Session 30 PD, Predictive Modeling Applications for Life and Annuity Pricing and Underwriting

Predictive Modeling for Life Insurers

EDUCATION AND EXAMINATION COMMITTEE SOCIETY OF ACTUARIES RISK AND INSURANCE. Copyright 2005 by the Society of Actuaries

Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics

Mathematics of Risk. Introduction. Case Study #1 Personal Auto Insurance Pricing. Mathematical Concepts Illustrated. Background

Economic Education for Consumers Chapter 14 Study Guide Automobile and Home Insurance: Sharing the Risk

AN INTRODUCTION TO PREMIUM TREND

Insurance Solutions. 17 October Risk Solutions

Glossary of Insurance Terms

Predictive modelling around the world

Emerging Leaders Program Business Operations Track

The Relationship of Credit-Based Insurance Scores to Private Passenger Automobile Insurance Loss Propensity

GLM, insurance pricing & big data: paying attention to convergence issues.

Math 370, Spring 2008 Prof. A.J. Hildebrand. Practice Test 1 Solutions

Optimization approaches to multiplicative tariff of rates estimation in non-life insurance

Overall Level of Home Insurance Rates

Analysis of Internet Purchasing Behavior. October 3, Roosevelt Mosley, FCAS, MAAA Principal Pinnacle Actuarial Resources

Why it Matters: The Tyranny of the Average

When You Have Options, You Can Save Big!

GAP Insurance Techniques and Challenges

Matβlas Underwriting and Actuarial Consulting, Training and Research

Predictive Analytics White Paper

Package insurancedata

INSURANCE BASICS (DON T RISK IT)

Age and Insurance: Helping older customers find the cover they need. February 2009

Innovations and Value Creation in Predictive Modeling. David Cummings Vice President - Research

Principle Component Analysis and Partial Least Squares: Two Dimension Reduction Techniques for Regression

RE: Disclosure Requirements for Short Duration Insurance Contracts

Automobile Insurance Guide

Service courses for graduate students in degree programs other than the MS or PhD programs in Biostatistics.

NEFE High School Financial Planning Program Unit 6 Your Money: Keeping it Safe and Secure. Unit 6 - Insurance: Protecting What You Have

Predictive Analytics: Achieving Greater Decision Accuracy, Better Risk Segmentation, and Greater Profitability

CHAPTER 28 Insurance

EMPIRICAL RISK MINIMIZATION FOR CAR INSURANCE DATA

Chapter 9 Experience rating

Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics.

STATE OF CALIFORNIA DEPARTMENT OF INSURANCE 45 Fremont Street, 21 st Floor San Francisco, California 94105

The Use of Education and Occupation as Underwriting Factors in Determining Policyholder Premiums for Private Passenger Auto Insurance

Driving Insurance World through Science Murli D. Buluswar Chief Science Officer

UBI-5: Usage-Based Insurance from a Regulatory Perspective

Affordable Auto Insurance

Lecture 8: Gamma regression

Motor and Household Insurance: Pricing to Maximise Profit in a Competitive Market

Curriculum Map Statistics and Probability Honors (348) Saugus High School Saugus Public Schools

Risk pricing for Australian Motor Insurance

Own Damage, Third Party Property Damage Claims and Malaysian Motor Insurance: An Empirical Examination

Direct Marketing of Insurance. Integration of Marketing, Pricing and Underwriting

NAIC Consumer Shopping Tool for Auto Insurance

Logistic Regression (a type of Generalized Linear Model)

Introduction to Longitudinal Data Analysis

PREDICTIVE LOSS RATIO

STATE OF CALIFORNIA DEPARTMENT OF INSURANCE 45 Fremont Street, 21 st Floor San Francisco, California 94105

Automobile Insurance in Pennsylvania a supplement to the Automobile Insurance Guide

Probabilistic concepts of risk classification in insurance

Predictive Modeling and Claims Analytics to Incorporate Leakage Analyses

IMPACT OF THE DEREGULATION OF MASSACHUSETTS PERSONAL AUTO INSURANCE. November 17, 2009

Transcription:

A Deeper Look Inside Generalized Linear Models University of Minnesota February 3 rd, 2012 Nathan Hubbell, FCAS

Agenda Property & Casualty (P&C Insurance) in one slide The Actuarial Profession Travelers Broad Overview Actuarial & Analytics Career Opportunities GLMs Background Insurance Applications 2

What is insurance and what good does it serve? Insurance restores individuals to the financial state they were in prior to a loss (e.g. car accident; tree fell on house) For this benefit, customers pay a premium to the insurance company If a customer doesn t have a loss, then their premium: A. Helps the insurance company cover the loss another customer did have B. Keeps the insurance company functioning so it can continue providing this service If the customer does have a loss, it s the other insureds that are helping them! 3

The Actuarial Profession Some statistics on the actuarial profession: Approximately 20,000 actuaries in the U.S. Projected 21% Increase in Employment from 2008 2018 Over half of All Actuaries are Employed by Insurance Carriers 4

Actuarial Salaries Source: DW Simpson October 2011 Salary Survey; includes base pay and bonus. 5

Two Actuarial Societies Casualty Actuarial Society (CAS) and the Society of Actuaries (SOA) Exams 1 through 4 Lower Level Exams Similar for the two societies Exams 5 through 9 Upper Level Exams Different for the two societies Exams are offered 1 6 times a year Validation by Educational Experience (VEEs) 6

Lower Level Exams Exam 1/P Probability Exam 2/FM Financial Mathematics Exam 3F Financial Economics Exam 3L Life Contingences and Statistics Exam 4/C Construction and Evaluation of Actuarial Models Exams are multiple choice, range from 2 to 3 hours in length, and require about 300 hours of studying Very challenging exams you are competing against many other exam takers 7

Actuarial Exams Computer Based Testing Available for first exam. Offered at Prometric Testing Locations in Woodbury and Edina. Register online at http://www.prometric.com/soa 8

About Travelers Offers property and casualty solutions to individuals and companies of all sizes Second largest commercial insurer in the U.S. Second largest personal insurer through the independent agency channel No. 98 on the Fortune 500 list of largest U.S. companies Representatives in every U.S. state, Canada, Ireland and the United Kingdom A member of the Dow Jones Industrial Average the only insurance company on the list 9

Analytics at Travelers Who are we? Across Travelers, we form a large (300+) and diverse community of Ph.D., Masters and Bachelors holders in the following disciplines: mathematics statistics physics actuarial science computer science business and more! 10

Actuaries at Travelers What do we do? We ask and answer questions requiring sophisticated analyses How much will it cost to insure a customer? What amount of loss reserves need to be set aside to pay future claims? How likely is it that this customer will purchase our product? How many claims adjusters will we need in two years? What new statistical methods will help move our business into the future? 11

Why are these questions hard? Example: The cost to insure an auto customer It s impossible to predict if someone is going to A. Get into an accident B. The type of accident (hit a telephone pole, hit another vehicle, bodily injury) C. How bad the accident will be But if we have enough customers, we can start to group them in group A) we expect 1 / 10 to get into an accident, costing on average $1000 in group B) etc. 12

Why it matters The more finely we group, the more accurate the price. In that case: 1. If our competitors charge more, the customer will choose us and we will grow profitably 2. If our competitors charge less, the customer will choose them and they ll grow unprofitably Either way, we win! (or if we price inaccurately or too coarsely, we lose!) The trick is finding the right groups, and getting the right price for them Now, how might one go about doing this? 13

Generalized Linear Models 14

Predictive Modeling Using Generalized Linear Models (GLM s) and other statistical methods to predict exposure to loss at detailed level. Recently, property casualty insurance companies have embraced predictive modeling as a strategic tool for competing in the marketplace. Originally introduced as a method of increasing precision for personal auto insurance pricing Extended to homeowners and commercial lines Today, it is applied in areas such as marketing, underwriting, pricing, and claims management 15

How are rates differentiated? Automobile Age Gender Marital status # vehicles # drivers Home policy Driving record Years Licensed Limits Prior Insurance Student/Nonstudent Location (Garage/driven) Annual Mileage Homeowners Age of home # occupants Primary / Secondary Prior claim experience Construction Protection Roof Type Location (CAT?) Amount of Insurance Auto Policy Responsibility of owner 16

Possible Solution: Multiple Linear Regression / Ordinary Least Squares E[Y] = a 0 + a 1 X 1 + + a n X n Two Key Assumptions: Y is Normally distributed random variable. Variance of Y is constant (homoscedastic). 17

Problems With OLS Regression Part I Y is NOT normally distributed. Number of claims is discrete (and most policies have zero claims) Claim sizes are skewed to the right Probability of an event is in [0,1] 18

Problems With OLS Regression Part II Variance of Y is NOT constant. Varies by expected loss. High frequency losses have less variance. High severity losses have more variance. Varies by exposure. 19

Problems With OLS Regression Part III Nonlinear 3.5 relationship between X s and Y s. Example: 3 Age of driver Loss Relativity 2.5 2 1.5 1 0.5 0 16 19 22 25 28 31 34 37 40 43 46 49 52 55 58 61 64 67 70 73 76 79 82 85 88 Driver Age (numbers are illustrative only) 20

Introducing: Generalized Linear Models (GLMs) E[Y] = g 1 (a 0 + a 1 X 1 + + a n X n ) Fewer restrictions: Non linear relationships. g(x) = x Additive model g(x) = exp(x) Multiplicative model g(x) = 1 / (1+exp(x)) Logistic model 21

Generalized Linear Models (GLMs) E[Y] = g 1 (a 0 + a 1 X 1 + + a n X n ) Fewer restrictions: Y can be from any exponential family of distributions. Poisson (number of claims) Binomial (probability of renewing) Gamma (loss severity) 22

Generalized Linear Models (GLMs) E[Y] = g 1 (a 0 + a 1 X 1 + + a n X n ) Fewer restrictions: Variance depends on the expected mean. Normal: Variance is constant. ( ) Poisson: Variance equals mean. ( E[Y]) Gamma: Variance equals mean squared ( E[Y] 2 ) 23

Exponential Family ABCs Source: A Practitioner's Guide to Generalized Linear Models 24

Exponential Family Mean and Variance Tweedie V( ) = p ; 1 < p < 2 Source: A Practitioner's Guide to Generalized Linear Models 25

Uncle Tweedie Source: A Practitioner's Guide to Generalized Linear Models 26

Effect of Exponential Family Choice on Fit Source: A Practitioner's Guide to Generalized Linear Models 27

Link Functions: How to Translate Linear Predictors Source: A Practitioner's Guide to Generalized Linear Models 28

What Link to Choose? Source: A Practitioner's Guide to Generalized Linear Models 29

Maximum Likelihood Looks Pretty Ugly Source: A Practitioner's Guide to Generalized Linear Models 30

Maximum Likelihood Never Looked So Pretty SAS: PROC GENMOD PROG LOGISTIC R: glm {base} Source: A Practitioner's Guide to Generalized Linear Models 31

Common Response Variables in Insurance Source: A Practitioner's Guide to Generalized Linear Models 32

References Generalized Linear Models in Insurance A Practitioner's Guide to Generalized Linear Models Casualty Actuarial Society Discussion Paper Program Casualty Actuarial Society Arlington, Virginia (2004), p. 1 116 http://www.casact.org/pubs/dpp/dpp04/04dpp1.pdf Actuarial Exams and Career Information http://www.beanactuary.org/ http://www.casact.org/ http://www.soa.org/ Travelers Careers http://www.travelers.com/careers Research Internship and Full Time Actuarial & Analytics Leadership Development Program Internships and Full Time 33