Using Predictive Analytics to Detect Fraudulent Claims



Similar documents
Personal Auto Predictive Modeling Update: What s Next? Roosevelt Mosley, FCAS, MAAA CAS Predictive Modeling Seminar October 6 7, 2008 San Diego, CA

8/24/2012. No Fault: Concept versus Reality

Regarding: 2012 Questionable Claim Referral Analysis for Palm Beach, Broward, and Miami-Dade Counties

Insurance Fraud Overview. Florida Department of Financial Services Division of Insurance Fraud Captain David Lindsay,

Predictive Modeling and Big Data

Chapter 12 Discovering New Knowledge Data Mining

Predictive Modeling in Workers Compensation 2008 CAS Ratemaking Seminar

Appendix A. CAR Special Investigative Units Standards

Insurance Fraud Detection: MARS versus Neural Networks?

Innovations and Value Creation in Predictive Modeling. David Cummings Vice President - Research

12 Common Auto Accident Myths: Protecting your rights and wallet after being injured in an auto accident.

Dan French Founder & CEO, Consider Solutions

CITY UNIVERSITY OF HONG KONG 香 港 城 市 大 學. Self-Organizing Map: Visualization and Data Handling 自 組 織 神 經 網 絡 : 可 視 化 和 數 據 處 理

Florida No-Fault Law Reform. CAS Antitrust Notice. Background - Example. 1 August 10, 2012 [Enter presentation title in footer] Copyright 2007

Azure Machine Learning, SQL Data Mining and R

Social Media Mining. Data Mining Essentials

Segmentation of stock trading customers according to potential value

Session 94 L, Detecting Fraudulent Claims in General Insurance Moderator: Anthony E. Cappelletti, FSA, FCAS, FCIA

DECISION TREE ANALYSIS: PREDICTION OF SERIOUS TRAFFIC OFFENDING

Practical Data Science with Azure Machine Learning, SQL Data Mining, and R

Data Mining + Business Intelligence. Integration, Design and Implementation

Data Mining: Overview. What is Data Mining?

COPYRIGHTED MATERIAL. Contents. List of Figures. Acknowledgments

In comparison, much less modeling has been done in Homeowners

An Introduction to Data Mining. Big Data World. Related Fields and Disciplines. What is Data Mining? 2/12/2015

THE TOP 10 QUESTIONS YOU SHOULD ASK YOUR CAR ACCIDENT LAWYER

Predictive Modeling for Workers Compensation Claims

Actuarial Roundtable Presented by Milliman

POINT OF SALE FRAUD PREVENTION

Health Spring Meeting May 2008 Session # 42: Dental Insurance What's New, What's Important

FLORIDA DEPARTMENT OF FINANCIAL SERVICES TARGET MARKET CONDUCT EXAMINATION OF UNITED AUTOMOBILE INSURANCE COMPANY AS OF.

Data Mining with SAS. Mathias Lanner Copyright 2010 SAS Institute Inc. All rights reserved.

Data Mining Techniques Chapter 7: Artificial Neural Networks

Data Mining and Neural Networks in Stata

^rfc #007. Re: Proposed Medallion Rulemaking Changes- Insurance. Gentlemen:

Why Alerts Suck and Monitoring Solutions need to become Smarter

Big Data Analytics for SCADA

Martian Chronicles: Is MARS better than Neural Networks? by Louise Francis, FCAS, MAAA

Credit Card Fraud Detection Using Self Organised Map

USING DATA SCIENCE TO DISCOVE INSIGHT OF MEDICAL PROVIDERS CHARGE FOR COMMON SERVICES

Insuring Business Vehicles

AUTO INSURANCE CLAIMS & CLAIMANTS RIGHTS

The State of Insurance Fraud Technology. A study of insurer use, strategies and plans for anti-fraud technology

Data Mining Using SAS Enterprise Miner Randall Matignon, Piedmont, CA

Model Validation Techniques

Using multiple models: Bagging, Boosting, Ensembles, Forests

Predictive Analytics Techniques: What to Use For Your Big Data. March 26, 2014 Fern Halper, PhD

Auto. The Instant Insurance Guide: What To Do If You re In An Accident. Info and tips for buying automobile and motorcycle insurance in Delaware

Auto. The Instant Insurance Guide: Info and tips for buying automobile and motorcycle insurance in Delaware.

Profit from Big Data flow. Hospital Revenue Leakage: Minimizing missing charges in hospital systems

Using Analytics to detect and prevent Healthcare fraud. Copyright 2010 SAS Institute Inc. All rights reserved.

SAS Fraud Framework for Banking

practical problems. life) property) 11) Health Care Insurance. 12) Medical care insurance.

Anti-Trust Notice. Agenda. Three-Level Pricing Architect. Personal Lines Pricing. Commercial Lines Pricing. Conclusions Q&A

Easily Identify Your Best Customers

Using Business Analytics within the insurance claims process reducing the loss ratio by 2 to 5%

The point of this essay is to learn how to assess risk and put an economic value on it!

The Florida Senate. Issue Brief August 2011 PERSONAL INJURY PROTECTION (PIP) Statement of the Issue. Discussion

Data Mining Applications in Higher Education

Data Mining Techniques

Uncovering More Insurance Fraud with Predictive Analytics Strategies for Improving Results and Reducing Losses

Application of SAS! Enterprise Miner in Credit Risk Analytics. Presented by Minakshi Srivastava, VP, Bank of America

General Insurance Definitions:

Florida Workers Compensation: A Guide for the HR Professional

Self-Organizing g Maps (SOM) COMP61021 Modelling and Visualization of High Dimensional Data

CLUSTER ANALYSIS FOR SEGMENTATION

Driving Down Claim Costs With PREDICTIVE MODELING. December Sponsored by:

New Jersey Anti-Fraud Prevention and Detection Plan Protocol GROUP NAME NAIC #999 MEMBER COMPANIES:

PERSONAL INJURY/AUTO ACCIDENT INTAKE FORM

OFFICE OF INSURANCE REGULATION Property and Casualty Product Review

Self Organizing Maps: Fundamentals

Insurance Bureau of Canada

Transcription:

Using Predictive Analytics to Detect Fraudulent Claims May 17, 211 Roosevelt C. Mosley, Jr., FCAS, MAAA CAS Spring Meeting Palm Beach, FL Experience the Pinnacle Difference!

Predictive Analysis for Fraud Claim fraud is increasing, focus on fraud is magnified There are special investigators in the industry that are good at detecting fraud As good as they are, they can t review every claim and detect all fraud Predictive i analytics can bring the expertise to bear on all claims Predictive analytics can enhance the work of investigators by uncovering complexities the human eye may miss

Claim Fraud is Increasing, and the Focus on Claim Fraud is Increasing as Well

Increasing Claim Fraud 211 Headlines March 3 Suspicious claims rise 34% in Florida April 17 The Battle Against Insurance Fraud in Georgia April 26 Insurance Groups Stress Need for N.Y. No-Fault Reform at Hearing April 26 PIP Bills Crash in Florida May 2 Four Women Booked with Insurance Fraud in Louisiana May 5 Council Woman Gets Jail Time for Insurance Fraud May 6 - Allstate Files $4 Million Insurance Fraud Case in New York May 8 Questionable Claims on the Rise in Oklahoma (+15%) May 12 NY State Must Stand Against No Fault Car Insurance Fraud

Increase in Questionable Claims 4,5 4, 3,5 3, 2,5 2, 1,5 1, 5-1,578 1,82 1,82 867 867 868 647 734 446 3,613 4,232 4,16 1,982 1,825 1,654 Tampa Miami Orlando New York City Los Angeles 28 29 21 Source: National Insurance Crime Bureau

Fraud Detection Process

Geneal Fraud Identification Process Identify triggers that alert the claim adjuster to potential fraud (fraud indicators) Rely on claim adjusters to identify potentially fraudulent claims (recognition, intuition) Potentially fraudulent claims are referred to SIU Smaller group of SIU investigators handle the investigation of fraudulent claims

Recognition (I ve Seen This Before) Examples Repeat offenders Provider/patient/attorney combinations Approach Advisory claim database Experience of adjuster Disadvantages Assumes adjuster has seen it before Aliases Fraud becomes smarter

Fraud Indicators Rules based system Identify known or potential fraud scenarios Advantages Easy to implement and modify Easy to understand Effective to attack specific problems Disadvantages Doesn t detect new and unknown fraud Creates smarter fraud

Fraud Indicators - Examples Distance between claimant s home address and medical provider Multiple medical opinions/providers Certain claim types (e.g., soft tissue) Changing providers for the same treatment (possibly correlated with other claim activity) High number of treatments for type of injury Abnormally long treatment time off for the type of injury Accident severity does not correlate with severity of injury

Intuition (Something Smells Funny) Something about the claim doesn t seem right to the adjuster, and it is referred to the SIU Relies on ability and experience of adjuster to see suspicious cases Inexperienced adjusters will not have the ability to detect suspicious as well

As Good as the SIU Is

Concerns with the Current Process Claim referral can be inconsistent heavy dependence on claim adjuster False positives Claim adjuster may not be aware of all suspicious relationships Not all historical fraud has been identified Pi Prioritization iti of potentially ti fraudulent claims

Using Predictive Analytics to Address These Concerns Predictive analysis of historical referrals (consistent treferrals) Predictive analysis of historical fraudulent claims (false positives) Association analysis (recognition of claim patterns) Clustering Methods (missed claims, prioritization) K-mean clustering Kohonen self-organizing maps PRIDIT (consistent referrals,,p prioritization)

Analysis of Historical Referrals Target: history of claim referrals to SIU Independent Factors: details of claim Models Tested Decision tree Neural network Linear regression Ensemble Result: given the history of claim referrals, the likelihood that a new claim should be referred to SIU based on the claim characteristics

Decision Tree Most serious injury: neck sprain/strain Claimant's hospital treatment: did not go, outpatient Arbitration: non-binding Impact severity to claimant's vehicle: none, minor Was claimant represented by an attorney? Y

Regression: First Report of Claim First Report of Claim 1.4 1.3155 1.2 1. 1. 1.834 1.1742.8.6.4.2. Insured Claimant Attorney Other

Referral Score 18 % Referral Score 14.% 16.% 18.% 8.% 1.% 12.% 4.% 6.% 8.%.% 2.%..4.8.12.16.2.24.28.32.36.4.44.48.52.56.6.64.68.72.76

Analysis of Historical Fraudulent Claims Target: history of actionable claim referrals to SIU Independent Factors: details of claim Models Tested Result: Decision tree Neural network Linear regression Ensemble given the history of claim referrals, the likelihood that action will be taken on a new claim based on the claim characteristics Comparison to referral claims

Decision Tree Comparison Variable Importance Actionable SIU Referral Variable Importance Importance Ratio Central City 1..464 46.4% Replace:Claimant's state of residence.967 1. 13.5% Impact severity to claimant's vehicle.962.828 86.2% Was claimant represented by an attorney?.85.95 16.4% Policy coverage limits per person.75.411 54.9% Arbitration.547.368 67.2% Most serious injury.53.375 7.9% Settlement_lag.456..% Who reported injury to insurer.439.374 85.3% Most expensive injury.423.239 56.5% DRAGE.312.36 98.% Lawsuit status.295..% Driver, other violation.285..% Amount Spent on Medical Professionals.255.412 161.6% 6%

Difference in Referred vs. Actionable Claims 5.% % D 45.% i s 4.% t 35.% r 3.% i 25.% b 2.% u t 15.% i 1.% o 5.% n %.% Referred Minus Actionable Should have been referred? 3.9%.%.%.%.1%.2%.4%.3%.2% 35.7% 43.5% False Positives 5.9% 3.2% 2.2% 1.8% 1.2%.2%.8%.2%.2% -1. -.9 -.8 -.7 -.6 -.5 -.3 -.2 -.1..1.2.3.4.5.6.7.8.9 1. Difference

Association Analysis (recognition of patterns) Technique used in market basket analysis Identification of items that occur together in the same record Produces event occurrence as well as confidence interval around the occurrence likelihood Can lead to sequence analysis as well, which h considers timing and ordering of events

Association Analysis Measurements Support how often items occur together Transactions that contain items A & B All transactions Confidence strength th of association Transactions that contain items A & B Transactions that contain item A Expected Confidence proportion of items that satisfy right side of rule Transactions that contain item B All transactions

Association Analysis Output

Association Output Example

Self Organizing Maps Topological mapping from input space to clusters Observations from the input space are mapped onto an organized grids Neurons are determined initially, and as inputs are mapped to the grids the neurons are adjusted As a input is matched to the grid, all the neurons around that grid are updated

SOM SIU Indicator

Clustering/Segmentation Unsupervised classification technique Groups data into set of discrete clusters or contiguous groups of cases Performs disjoint cluster analysis on the basis of Euclidean distances computed from one or more quantitative input variables and cluster seeds Objects in each cluster tend to be similar, objects in different clusters tend to be dissimilar Can be used as a dimension reduction technique

Cluster Evaluation Suspicion Scores Root Mean Square Standard Deviation variability of claims within a cluster Distance to Nearest Cluster group of outlier claims Distance from Cluster Seed the distance of the claim from the average Review of cluster summary statistics

Homeowner Contents Analysis Claim values by detailed category Replacement cost value Depreciation Number of items Age Property characteristics (age, bathrooms, bedrooms) Coverage details (coverage C) Insured demographics (age, education, income)

Cluster Proximities All Causes of Loss

Cumulative Distribution Distance to Nearest est Cluster (Theft)

Review of Cluster Summary Statistics

Distance from Cluster Mean 9 8 7 6 5 4 3 2 1 8.8 Public Adjuster 2.3 2.5 18.3 Theft Fire Water Weather Other

Final Fraud Calculations Factor Name Description Input Value insured_kids_2 Y, N, or U u peril_2 Cause of Loss Fire public_adjuster or 1 IMP_REP_Coverage_C Coverage C Amount 19,5 IMP_REP_Insured_Home_Bathrooms Number of Bathrooms 2 IMP_REP_Insured_Home_Bedrooms Number of Bedrooms 3 IMP_REP_Insured_Home_SqFt Insured Square Footage 1412 1,412 IMP_REP_Insured_Home_YearBuilt Year Built 1973 IMP_REP_Insured_Homeowner Homeowner (Y or N) Y IMP_REP_acvloss_rcttotal Ratio of ACV Loss to RCT Total 1.13 IMP_REP_create_lag Delay in Creating Record 9 IMP_REP_insured_age_2 Insured Age 5 IMP_REP_insured_educationlevel_2 Years of Education 12 IMP_REP_insured_homevalue_calc_r Home Value Calculation Rounded 149 IMP_REP_insured_yearsinhome_2 Insured Years in Home 6 Suspicion Score Root Mean Square Error 99.7% Distance to Nearest Cluster 99.4% Distance from Mean 96.5% Combined 98.3%

PRIDIT Comparison

Wrap Up - Predictive Analysis for Fraud Claim fraud is increasing, focus on fraud is magnified There are special investigators in the industry that are good at detecting fraud As good as they are, they can t review every claim and detect all fraud Predictive i analytics can bring the expertise to bear on all claims Predictive analytics can enhance the work of investigators by uncovering complexities the human eye may miss