Insurance Fraud Detection: MARS versus Neural Networks?

Size: px
Start display at page:

Download "Insurance Fraud Detection: MARS versus Neural Networks?"

Transcription

1 Insurance Fraud Detection: MARS versus Neural Networks? Louise A Francis FCAS, MAAA [email protected] 1

2 Objectives Introduce a relatively new data mining method which can be used as an alternative to neural networks Compare the method to neural networks Apply the methods to fraud data 2

3 MARS Acronym for Multivariate Adaptive Regression Splines In many ways it is similar to regression, but: It can deal with data complexities that ordinary linear regression had difficulties 3

4 Data Complexities Nonlinear functions Interactions Missing Data 4

5 The Fraud Study Data 1993 Automobile Insurers Bureau closed Personal Injury Protection claims Dependent Variables Suspicion Score Expert assessment of liklihood of fraud or abuse Predictor Variables Red flag indicators Claim file variables 5

6 Example: Nonlinear Function Neural Network Fit of SUSPICION vs Provider Bill netfraud Provider Bill 6

7 MARS Fit to Nonlinear Function MARS Fit of SUSPICION vs Provider Bill 4 Fitted Suspicion Score Provider Bill 7

8 How MARS Fits Nonlinear Function MARS fits a piecewise regression BF1 = max(0, 2185 X ) Y = * BF1 BF1 is basis function MARS uses statistical optimization to find best basis function Basis function similar to dummy variable in regression 8

9 Interactions Effect of a predictor variable on dependent variable depends on the values of another variable(s) Neural Network Predicted for Provider Bill and Injury Type inj.type: Neural Net Predicted inj.type: 03 in j.type: 04 inj.type: 01 in j.type: Provider Bill

10 Interactions: MARS Fit MARS Predicted for Provider Bill and Injury Type inj.type: 05 inj.type: 06 3 Fitted Suspicion Score 0 6 inj.type: 03 inj.type: 04 inj.type: 01 inj.type: Provider 1 Bill 10

11 Interactions: The Basis Functions Injury type 4 (neck sprain), and type 5 (back sprain) increase faster and have higher scores than the other injury types BF1 = max(0, X ) BF2 = ( INJTYPE = 4 OR INJTYPE = 5) BF3 = max(0, X - 159) * BF2 Y = * BF * BF E-03 * BF3 where X is the provider bill INJTYPE is the injury type 11

12 Missing Data Occurs frequently in insurance data There are some sophisticated methods for addressing this (i.e., EM algorithm) MARS uses basis functions to find surrogates for variables with missing values 12

13 Missing Data Example: Health Insurance (Claimant has Health Insurance) Value Frequency Percent Cumulative Percent No Missing

14 Missing Data Example BF1 = max(0, MP_BILL ) BF2 = max(0, MP_BILL ) BF3 = (HEALTHIN MISSING) BF4 = (HEALTHIN = MISSING) BF5 = (HEALTHIN = N) BF7 = max(0, MP_BILL ) * BF5 BF8 = max(0, MP_BILL ) * BF5 BF9 = max(0, MP_BILL - 98) * BF4 BF10 = max(0, 98 - MP_BILL ) * BF4 BF11 = max(0, MP_BILL - 710) * BF3 BF13 = max(0, MP_BILL ) BF15 = BF3 * BF2 Y = * BF * BF * BF E-04 * BF E-03 * BF * BF * BF * BF E-03 * BF E-03 * BF15 14

15 More Complex Example Dependent variable: Expert s assessment of liklihood claim is legitimate A classification application Predictor variables: Combination of claim file variables (age of claimant, legal representation) red flag variables (injury is strain/sprain only, claimant has history of previous claim) 15

16 More Complex Example BF1 = (LEGALREP = 1) BF2 = (LEGALREP = 2) BF3 = ( TRTLAG = missing) BF4 = ( TRTLAG NE missing) BF5 = ( INJ01 = 1) * BF2 BF7 = ( ACC04 = 1) * BF4 BF9 = ( ACC14 = 1) BF11 = ( PARTDIS = 1) * BF4 BF15 = max(0, AGE - 36) * BF4 BF16 = max(0, 36 - AGE ) * BF4 BF18 = max(0, 55 - AMBUL ) * BF15 BF20 = max(0, 10 - RPTLAG ) * BF4 BF21 = ( CLT02 = 1) BF23 = POLLAG * BF21 BF24 = ( ACC15 = 1) * BF16 Y = * BF * BF * BF * BF * BF * BF * BF * BF E-03 * BF * BF E-03 * BF * BF24 16

17 Evaluating Predictor Variables: Generalized Cross-validation GCV = 1 N y ˆ ( i f x [ N 1 k / N i= 1 i ) ] 2 where N is the number of observations y is the dependent variable x is the independent variable(s) k is the effective number of parameters or degrees of freedom in the model. 17

18 Variable Importance Ranking Rank Variable MARS Ranking of Variables Description 1 LEGALREP Legal Representation 2 TRTMIS Treatment lag missing 3 ACC04 Single vehicle accident 4 INJ01 Injury consisted of strain or sprain only 5 AGE Claimant age 6 PARTDIS Claimant partially disabled Property damage was inconsistent with 7 ACC14 accident 8 CLT02 Had a history of previous claims 9 POLLAG Policy lag 10 RPTLAG Report lag 11 AMBUL Ambulance charges 12 ACC15 Francis Very Analytics minor impact and Actuarial collision 18

19 Methods of Assessing Fit Cross Validation Confusion Matrix Sensitivity Specificity ROC Curve Area Under the ROC Curve 19

20 Cross-validation Four Fold Cross-validation Percent Technique R^2 Correct MARS Neural Network

21 Confusion Matrix MARS Predicted * Actual Predicted Actual No Yes Total No Yes Total

22 Sensitivity/Specificity Sensitivity: Percent of targets correctly predicted Specificity: Percent of non-targets correctly predicted Model Sensitivity Specificity MARS Neural Network

23 ROC Curve ROC Curve 1.0 Sensitivity Neural Net MARS BASE Specificity 23

24 Area Under the ROC Curve Statistics for Area Under the ROC Curve Test Result Variables Area Std Error Asymptotic Sig Lower 95% Bound Upper 95% Bound MARS Probability Neural Probability

25 Which One is Better? Depends on application MARS handles missing values better MARS clusters categories on nominal variables with many categories MARS can be explained more easily On applications where analyst believes neural networks will outperform MARS, use them Also use hybrid models to improve performance 25

26 Using the Model Results Both claim file variables and red flag variables appear to be significant in predicting fraud Other research supports value of using statistical and data mining models to predict fraud Derrig (Journal of Risk and Insurance, 2002) advocates using analytic models to sort claims Pay claims with low score Devote resources to claims with high scores 26

Martian Chronicles: Is MARS better than Neural Networks? by Louise Francis, FCAS, MAAA

Martian Chronicles: Is MARS better than Neural Networks? by Louise Francis, FCAS, MAAA Martian Chronicles: Is MARS better than Neural Networks? by Louise Francis, FCAS, MAAA Abstract: A recently developed data mining technique, Multivariate Adaptive Regression Splines (MARS) has been hailed

More information

Predictive Modeling in Workers Compensation 2008 CAS Ratemaking Seminar

Predictive Modeling in Workers Compensation 2008 CAS Ratemaking Seminar Predictive Modeling in Workers Compensation 2008 CAS Ratemaking Seminar Prepared by Louise Francis, FCAS, MAAA Francis Analytics and Actuarial Data Mining, Inc. www.data-mines.com [email protected]

More information

Session 94 L, Detecting Fraudulent Claims in General Insurance Moderator: Anthony E. Cappelletti, FSA, FCAS, FCIA

Session 94 L, Detecting Fraudulent Claims in General Insurance Moderator: Anthony E. Cappelletti, FSA, FCAS, FCIA Session 94 L, Detecting Fraudulent Claims in General Insurance Moderator: Anthony E. Cappelletti, FSA, FCAS, FCIA Presenters: Richard A. Derrig Louise A. Francis, FCAS, MAAA Using Predictive Analytics

More information

Predictive Modeling in Automobile Insurance: A Preliminary Analysis. Stephen P. D Arcy. Paper to be Presented at the

Predictive Modeling in Automobile Insurance: A Preliminary Analysis. Stephen P. D Arcy. Paper to be Presented at the Predictive Modeling in Automobile Insurance: A Preliminary Analysis by Stephen P. D Arcy Paper to be Presented at the World Risk and Insurance Economics Congress Salt Lake City, Utah August 2005 Contact

More information

Using Predictive Analytics to Detect Fraudulent Claims

Using Predictive Analytics to Detect Fraudulent Claims Using Predictive Analytics to Detect Fraudulent Claims May 17, 211 Roosevelt C. Mosley, Jr., FCAS, MAAA CAS Spring Meeting Palm Beach, FL Experience the Pinnacle Difference! Predictive Analysis for Fraud

More information

A Property & Casualty Insurance Predictive Modeling Process in SAS

A Property & Casualty Insurance Predictive Modeling Process in SAS Paper AA-02-2015 A Property & Casualty Insurance Predictive Modeling Process in SAS 1.0 ABSTRACT Mei Najim, Sedgwick Claim Management Services, Chicago, Illinois Predictive analytics has been developing

More information

Predictive Modeling and Big Data

Predictive Modeling and Big Data Predictive Modeling and Presented by Eileen Burns, FSA, MAAA Milliman Agenda Current uses of predictive modeling in the life insurance industry Potential applications of 2 1 June 16, 2014 [Enter presentation

More information

Predictive Modeling Techniques in Insurance

Predictive Modeling Techniques in Insurance Predictive Modeling Techniques in Insurance Tuesday May 5, 2015 JF. Breton Application Engineer 2014 The MathWorks, Inc. 1 Opening Presenter: JF. Breton: 13 years of experience in predictive analytics

More information

Model Validation Techniques

Model Validation Techniques Model Validation Techniques Kevin Mahoney, FCAS kmahoney@ travelers.com CAS RPM Seminar March 17, 2010 Uses of Statistical Models in P/C Insurance Examples of Applications Determine expected loss cost

More information

Azure Machine Learning, SQL Data Mining and R

Azure Machine Learning, SQL Data Mining and R Azure Machine Learning, SQL Data Mining and R Day-by-day Agenda Prerequisites No formal prerequisites. Basic knowledge of SQL Server Data Tools, Excel and any analytical experience helps. Best of all:

More information

Applied Data Mining Analysis: A Step-by-Step Introduction Using Real-World Data Sets

Applied Data Mining Analysis: A Step-by-Step Introduction Using Real-World Data Sets Applied Data Mining Analysis: A Step-by-Step Introduction Using Real-World Data Sets http://info.salford-systems.com/jsm-2015-ctw August 2015 Salford Systems Course Outline Demonstration of two classification

More information

Predictive modelling around the world 28.11.13

Predictive modelling around the world 28.11.13 Predictive modelling around the world 28.11.13 Agenda Why this presentation is really interesting Introduction to predictive modelling Case studies Conclusions Why this presentation is really interesting

More information

BOOSTED REGRESSION TREES: A MODERN WAY TO ENHANCE ACTUARIAL MODELLING

BOOSTED REGRESSION TREES: A MODERN WAY TO ENHANCE ACTUARIAL MODELLING BOOSTED REGRESSION TREES: A MODERN WAY TO ENHANCE ACTUARIAL MODELLING Xavier Conort [email protected] Session Number: TBR14 Insurance has always been a data business The industry has successfully

More information

Customer and Business Analytic

Customer and Business Analytic Customer and Business Analytic Applied Data Mining for Business Decision Making Using R Daniel S. Putler Robert E. Krider CRC Press Taylor &. Francis Group Boca Raton London New York CRC Press is an imprint

More information

Practical Data Science with Azure Machine Learning, SQL Data Mining, and R

Practical Data Science with Azure Machine Learning, SQL Data Mining, and R Practical Data Science with Azure Machine Learning, SQL Data Mining, and R Overview This 4-day class is the first of the two data science courses taught by Rafal Lukawiecki. Some of the topics will be

More information

Data Preparation Part 1: Exploratory Data Analysis & Data Cleaning, Missing Data

Data Preparation Part 1: Exploratory Data Analysis & Data Cleaning, Missing Data Data Preparation Part 1: Exploratory Data Analysis & Data Cleaning, Missing Data CAS Predictive Modeling Seminar Louise Francis Francis Analytics and Actuarial Data Mining, Inc. www.data-mines.com [email protected]

More information

Predictive Modeling for Workers Compensation Claims

Predictive Modeling for Workers Compensation Claims Predictive Modeling for Workers Compensation Claims AASCIF Super Conference Kirsten C. Hernan Deloitte Consulting LLP October 4, 2012 NOTICE: THIS DOCUMENT IS PROPRIETARY AND CONFIDENTIAL This document

More information

Data Mining. Nonlinear Classification

Data Mining. Nonlinear Classification Data Mining Unit # 6 Sajjad Haider Fall 2014 1 Nonlinear Classification Classes may not be separable by a linear boundary Suppose we randomly generate a data set as follows: X has range between 0 to 15

More information

Improving the Performance of Data Mining Models with Data Preparation Using SAS Enterprise Miner Ricardo Galante, SAS Institute Brasil, São Paulo, SP

Improving the Performance of Data Mining Models with Data Preparation Using SAS Enterprise Miner Ricardo Galante, SAS Institute Brasil, São Paulo, SP Improving the Performance of Data Mining Models with Data Preparation Using SAS Enterprise Miner Ricardo Galante, SAS Institute Brasil, São Paulo, SP ABSTRACT In data mining modelling, data preparation

More information

May provide differing lists of damaged/stolen property regarding an insurable loss. May provide questionable proof of purchase for expensive items. May provide invoices or estimates that appear to be inflated

More information

COMPARING NEURAL NETWORK ALGORITHM PERFORMANCE USING SPSS AND NEUROSOLUTIONS

COMPARING NEURAL NETWORK ALGORITHM PERFORMANCE USING SPSS AND NEUROSOLUTIONS COMPARING NEURAL NETWORK ALGORITHM PERFORMANCE USING SPSS AND NEUROSOLUTIONS AMJAD HARB and RASHID JAYOUSI Faculty of Computer Science, Al-Quds University, Jerusalem, Palestine Abstract This study exploits

More information

A Basic Guide to Modeling Techniques for All Direct Marketing Challenges

A Basic Guide to Modeling Techniques for All Direct Marketing Challenges A Basic Guide to Modeling Techniques for All Direct Marketing Challenges Allison Cornia Database Marketing Manager Microsoft Corporation C. Olivia Rud Executive Vice President Data Square, LLC Overview

More information

How To Build A Predictive Model In Insurance

How To Build A Predictive Model In Insurance The Do s & Don ts of Building A Predictive Model in Insurance University of Minnesota November 9 th, 2012 Nathan Hubbell, FCAS Katy Micek, Ph.D. Agenda Travelers Broad Overview Actuarial & Analytics Career

More information

Benchmarking of different classes of models used for credit scoring

Benchmarking of different classes of models used for credit scoring Benchmarking of different classes of models used for credit scoring We use this competition as an opportunity to compare the performance of different classes of predictive models. In particular we want

More information

Predictive Analytics 101

Predictive Analytics 101 Predictive Analytics 101 Current Trends in Predictive Modeling and Analysis Frank A. Alerte, Esq.* *We would like to acknowledge the following individuals for their contribution and valuable input in preparing

More information

Corporate Defaults and Large Macroeconomic Shocks

Corporate Defaults and Large Macroeconomic Shocks Corporate Defaults and Large Macroeconomic Shocks Mathias Drehmann Bank of England Andrew Patton London School of Economics and Bank of England Steffen Sorensen Bank of England The presentation expresses

More information

TNS EX A MINE BehaviourForecast Predictive Analytics for CRM. TNS Infratest Applied Marketing Science

TNS EX A MINE BehaviourForecast Predictive Analytics for CRM. TNS Infratest Applied Marketing Science TNS EX A MINE BehaviourForecast Predictive Analytics for CRM 1 TNS BehaviourForecast Why is BehaviourForecast relevant for you? The concept of analytical Relationship Management (acrm) becomes more and

More information

COPYRIGHTED MATERIAL. Contents. List of Figures. Acknowledgments

COPYRIGHTED MATERIAL. Contents. List of Figures. Acknowledgments Contents List of Figures Foreword Preface xxv xxiii xv Acknowledgments xxix Chapter 1 Fraud: Detection, Prevention, and Analytics! 1 Introduction 2 Fraud! 2 Fraud Detection and Prevention 10 Big Data for

More information

CART 6.0 Feature Matrix

CART 6.0 Feature Matrix CART 6.0 Feature Matri Enhanced Descriptive Statistics Full summary statistics Brief summary statistics Stratified summary statistics Charts and histograms Improved User Interface New setup activity window

More information

THE HYBRID CART-LOGIT MODEL IN CLASSIFICATION AND DATA MINING. Dan Steinberg and N. Scott Cardell

THE HYBRID CART-LOGIT MODEL IN CLASSIFICATION AND DATA MINING. Dan Steinberg and N. Scott Cardell THE HYBID CAT-LOGIT MODEL IN CLASSIFICATION AND DATA MINING Introduction Dan Steinberg and N. Scott Cardell Most data-mining projects involve classification problems assigning objects to classes whether

More information

Combining GLM and datamining techniques for modelling accident compensation data. Peter Mulquiney

Combining GLM and datamining techniques for modelling accident compensation data. Peter Mulquiney Combining GLM and datamining techniques for modelling accident compensation data Peter Mulquiney Introduction Accident compensation data exhibit features which complicate loss reserving and premium rate

More information

Data Mining: STATISTICA

Data Mining: STATISTICA Data Mining: STATISTICA Outline Prepare the data Classification and regression 1 Prepare the Data Statistica can read from Excel,.txt and many other types of files Compared with WEKA, Statistica is much

More information

Chapter 6. The stacking ensemble approach

Chapter 6. The stacking ensemble approach 82 This chapter proposes the stacking ensemble approach for combining different data mining classifiers to get better performance. Other combination techniques like voting, bagging etc are also described

More information

Data Mining - Evaluation of Classifiers

Data Mining - Evaluation of Classifiers Data Mining - Evaluation of Classifiers Lecturer: JERZY STEFANOWSKI Institute of Computing Sciences Poznan University of Technology Poznan, Poland Lecture 4 SE Master Course 2008/2009 revised for 2010

More information

A Property and Casualty Insurance Predictive Modeling Process in SAS

A Property and Casualty Insurance Predictive Modeling Process in SAS Paper 11422-2016 A Property and Casualty Insurance Predictive Modeling Process in SAS Mei Najim, Sedgwick Claim Management Services ABSTRACT Predictive analytics is an area that has been developing rapidly

More information

Methods for Interaction Detection in Predictive Modeling Using SAS Doug Thompson, PhD, Blue Cross Blue Shield of IL, NM, OK & TX, Chicago, IL

Methods for Interaction Detection in Predictive Modeling Using SAS Doug Thompson, PhD, Blue Cross Blue Shield of IL, NM, OK & TX, Chicago, IL Paper SA01-2012 Methods for Interaction Detection in Predictive Modeling Using SAS Doug Thompson, PhD, Blue Cross Blue Shield of IL, NM, OK & TX, Chicago, IL ABSTRACT Analysts typically consider combinations

More information

A Deeper Look Inside Generalized Linear Models

A Deeper Look Inside Generalized Linear Models A Deeper Look Inside Generalized Linear Models University of Minnesota February 3 rd, 2012 Nathan Hubbell, FCAS Agenda Property & Casualty (P&C Insurance) in one slide The Actuarial Profession Travelers

More information

How To Understand The Theory Of Probability

How To Understand The Theory Of Probability Graduate Programs in Statistics Course Titles STAT 100 CALCULUS AND MATR IX ALGEBRA FOR STATISTICS. Differential and integral calculus; infinite series; matrix algebra STAT 195 INTRODUCTION TO MATHEMATICAL

More information

Prediction of Stock Performance Using Analytical Techniques

Prediction of Stock Performance Using Analytical Techniques 136 JOURNAL OF EMERGING TECHNOLOGIES IN WEB INTELLIGENCE, VOL. 5, NO. 2, MAY 2013 Prediction of Stock Performance Using Analytical Techniques Carol Hargreaves Institute of Systems Science National University

More information

CONTENTS PREFACE 1 INTRODUCTION 1 2 DATA VISUALIZATION 19

CONTENTS PREFACE 1 INTRODUCTION 1 2 DATA VISUALIZATION 19 PREFACE xi 1 INTRODUCTION 1 1.1 Overview 1 1.2 Definition 1 1.3 Preparation 2 1.3.1 Overview 2 1.3.2 Accessing Tabular Data 3 1.3.3 Accessing Unstructured Data 3 1.3.4 Understanding the Variables and Observations

More information

A Short Tour of the Predictive Modeling Process

A Short Tour of the Predictive Modeling Process Chapter 2 A Short Tour of the Predictive Modeling Process Before diving in to the formal components of model building, we present a simple example that illustrates the broad concepts of model building.

More information

Statistics in Retail Finance. Chapter 7: Fraud Detection in Retail Credit

Statistics in Retail Finance. Chapter 7: Fraud Detection in Retail Credit Statistics in Retail Finance Chapter 7: Fraud Detection in Retail Credit 1 Overview > Detection of fraud remains an important issue in retail credit. Methods similar to scorecard development may be employed,

More information

The Data Mining Process

The Data Mining Process Sequence for Determining Necessary Data. Wrong: Catalog everything you have, and decide what data is important. Right: Work backward from the solution, define the problem explicitly, and map out the data

More information

Maschinelles Lernen mit MATLAB

Maschinelles Lernen mit MATLAB Maschinelles Lernen mit MATLAB Jérémy Huard Applikationsingenieur The MathWorks GmbH 2015 The MathWorks, Inc. 1 Machine Learning is Everywhere Image Recognition Speech Recognition Stock Prediction Medical

More information

FRAUD CLASSIFICATION USING PRINCIPAL COMPONENT ANALYSIS OF RIDITs

FRAUD CLASSIFICATION USING PRINCIPAL COMPONENT ANALYSIS OF RIDITs The Journal of Risk and Insurance, 2002, Vol. 69, No. 3, 341-371 FRAUD CLASSIFICATION USING PRINCIPAL COMPONENT ANALYSIS OF RIDITs Patrick L. Brockett Richard A. Derrig Linda L. Golden Arnold Levine Mark

More information

Data Mining. Dr. Saed Sayad. University of Toronto 2010 [email protected]. http://chem-eng.utoronto.ca/~datamining/

Data Mining. Dr. Saed Sayad. University of Toronto 2010 saed.sayad@utoronto.ca. http://chem-eng.utoronto.ca/~datamining/ Data Mining Dr. Saed Sayad University of Toronto 2010 [email protected] http://chem-eng.utoronto.ca/~datamining/ 1 Data Mining Data mining is about explaining the past and predicting the future by

More information

Using Data Mining to Predict Automobile Insurance Fraud

Using Data Mining to Predict Automobile Insurance Fraud UNIVERSIDADE CATÓLICA PORTUGUESA CATÓLICA LISBON SCHOOL OF BUSINESS AND ECONOMICS Master of Science in Business Administration Using Data Mining to Predict Automobile Insurance Fraud JOÃO BERNARDO DO VALE

More information

answers to some of the tough questions that insurers get asked in Ontario. We hope it helps you own the road this summer.

answers to some of the tough questions that insurers get asked in Ontario. We hope it helps you own the road this summer. Dear colleague, We ve prepared this Q&A backgrounder to arm you with answers to some of the tough questions that insurers get asked in Ontario. We hope it helps you own the road this summer. We ll follow

More information

Cleaned Data. Recommendations

Cleaned Data. Recommendations Call Center Data Analysis Megaputer Case Study in Text Mining Merete Hvalshagen www.megaputer.com Megaputer Intelligence, Inc. 120 West Seventh Street, Suite 10 Bloomington, IN 47404, USA +1 812-0-0110

More information

IBM's Fraud and Abuse, Analytics and Management Solution

IBM's Fraud and Abuse, Analytics and Management Solution Government Efficiency through Innovative Reform IBM's Fraud and Abuse, Analytics and Management Solution Service Definition Copyright IBM Corporation 2014 Table of Contents Overview... 1 Major differentiators...

More information

Application of Predictive Analytics to Higher Degree Research Course Completion Times

Application of Predictive Analytics to Higher Degree Research Course Completion Times Application of Predictive Analytics to Higher Degree Research Course Completion Times Application of Decision Theory to PhD Course Completions (2006 2013) Rachna 1 I Dhand, Senior Strategic Information

More information

Insurance Analytics - analýza dat a prediktivní modelování v pojišťovnictví. Pavel Kříž. Seminář z aktuárských věd MFF 4.

Insurance Analytics - analýza dat a prediktivní modelování v pojišťovnictví. Pavel Kříž. Seminář z aktuárských věd MFF 4. Insurance Analytics - analýza dat a prediktivní modelování v pojišťovnictví Pavel Kříž Seminář z aktuárských věd MFF 4. dubna 2014 Summary 1. Application areas of Insurance Analytics 2. Insurance Analytics

More information

ROI CASE STUDY SPSS INFINITY PROPERTY & CASUALTY

ROI CASE STUDY SPSS INFINITY PROPERTY & CASUALTY ROI CASE STUDY SPSS INFINITY PROPERTY & CASUALTY THE BOTTOM LINE Infinity Property & Casualty Corporation (IPACC) deployed SPSS to reduce its payments on fraudulent claims and improve its ability to collect

More information

An effective approach to preventing application fraud. Experian Fraud Analytics

An effective approach to preventing application fraud. Experian Fraud Analytics An effective approach to preventing application fraud Experian Fraud Analytics The growing threat of application fraud Fraud attacks are increasing across the world Application fraud is a rapidly growing

More information

Principles of Data Mining by Hand&Mannila&Smyth

Principles of Data Mining by Hand&Mannila&Smyth Principles of Data Mining by Hand&Mannila&Smyth Slides for Textbook Ari Visa,, Institute of Signal Processing Tampere University of Technology October 4, 2010 Data Mining: Concepts and Techniques 1 Differences

More information

PREDICTIVE MODELLING FOR COMMERCIAL INSURANCE

PREDICTIVE MODELLING FOR COMMERCIAL INSURANCE PREDICTIVE MODELLING FOR COMMERCIAL INSURANCE General Insurance Pricing Seminar 13 June 2008 London James Guszcza, FCAS, MAAA [email protected] General Themes Predictive modelling: 3 Levels of Discussion

More information

The New NCCI Hazard Groups

The New NCCI Hazard Groups The New NCCI Hazard Groups Greg Engl, PhD, FCAS, MAAA National Council on Compensation Insurance CAS Reinsurance Seminar June, 2006 Workers Compensation Session Agenda History of previous work Impact of

More information

Gerry Hobbs, Department of Statistics, West Virginia University

Gerry Hobbs, Department of Statistics, West Virginia University Decision Trees as a Predictive Modeling Method Gerry Hobbs, Department of Statistics, West Virginia University Abstract Predictive modeling has become an important area of interest in tasks such as credit

More information

How To Price Insurance In Canada

How To Price Insurance In Canada The New Paradigm of Property & Casualty Insurance Pricing: Multivariate analysis and Predictive Modeling The ability to effectively price personal lines insurance policies to accurately match rate with

More information

WebFOCUS RStat. RStat. Predict the Future and Make Effective Decisions Today. WebFOCUS RStat

WebFOCUS RStat. RStat. Predict the Future and Make Effective Decisions Today. WebFOCUS RStat Information Builders enables agile information solutions with business intelligence (BI) and integration technologies. WebFOCUS the most widely utilized business intelligence platform connects to any enterprise

More information

Chapter 12 Discovering New Knowledge Data Mining

Chapter 12 Discovering New Knowledge Data Mining Chapter 12 Discovering New Knowledge Data Mining Becerra-Fernandez, et al. -- Knowledge Management 1/e -- 2004 Prentice Hall Additional material 2007 Dekai Wu Chapter Objectives Introduce the student to

More information

Detection. Perspective. Network Anomaly. Bhattacharyya. Jugal. A Machine Learning »C) Dhruba Kumar. Kumar KaKta. CRC Press J Taylor & Francis Croup

Detection. Perspective. Network Anomaly. Bhattacharyya. Jugal. A Machine Learning »C) Dhruba Kumar. Kumar KaKta. CRC Press J Taylor & Francis Croup Network Anomaly Detection A Machine Learning Perspective Dhruba Kumar Bhattacharyya Jugal Kumar KaKta»C) CRC Press J Taylor & Francis Croup Boca Raton London New York CRC Press is an imprint of the Taylor

More information

Application of SAS! Enterprise Miner in Credit Risk Analytics. Presented by Minakshi Srivastava, VP, Bank of America

Application of SAS! Enterprise Miner in Credit Risk Analytics. Presented by Minakshi Srivastava, VP, Bank of America Application of SAS! Enterprise Miner in Credit Risk Analytics Presented by Minakshi Srivastava, VP, Bank of America 1 Table of Contents Credit Risk Analytics Overview Journey from DATA to DECISIONS Exploratory

More information

Accurately and Efficiently Measuring Individual Account Credit Risk On Existing Portfolios

Accurately and Efficiently Measuring Individual Account Credit Risk On Existing Portfolios Accurately and Efficiently Measuring Individual Account Credit Risk On Existing Portfolios By: Michael Banasiak & By: Daniel Tantum, Ph.D. What Are Statistical Based Behavior Scoring Models And How Are

More information

Predictive Modeling from a Risk Management Perspective Recording of this session via any media type is strictly prohibited.

Predictive Modeling from a Risk Management Perspective Recording of this session via any media type is strictly prohibited. Predictive Modeling from a Risk Management Perspective Page 1 David P. Duden Director Deloitte Consulting Hartford, CT Eric Oldroyd Group Manager Target Corporation Minneapolis, MN Page 2 NOTICE: THIS

More information

The Relationship of Credit-Based Insurance Scores to Private Passenger Automobile Insurance Loss Propensity

The Relationship of Credit-Based Insurance Scores to Private Passenger Automobile Insurance Loss Propensity The Relationship of Credit-Based Insurance Scores to Private Passenger Automobile Insurance Loss Propensity An Actuarial Study by EPIC Actuaries, LLC Principal Authors: Michael J. Miller, FCAS, MAAA Richard

More information

Knowledge Discovery and Data Mining

Knowledge Discovery and Data Mining Knowledge Discovery and Data Mining Unit # 10 Sajjad Haider Fall 2012 1 Supervised Learning Process Data Collection/Preparation Data Cleaning Discretization Supervised/Unuspervised Identification of right

More information

Linear Models and Conjoint Analysis with Nonlinear Spline Transformations

Linear Models and Conjoint Analysis with Nonlinear Spline Transformations Linear Models and Conjoint Analysis with Nonlinear Spline Transformations Warren F. Kuhfeld Mark Garratt Abstract Many common data analysis models are based on the general linear univariate model, including

More information

Risk pricing for Australian Motor Insurance

Risk pricing for Australian Motor Insurance Risk pricing for Australian Motor Insurance Dr Richard Brookes November 2012 Contents 1. Background Scope How many models? 2. Approach Data Variable filtering GLM Interactions Credibility overlay 3. Model

More information

Data Mining in CRM & Direct Marketing. Jun Du The University of Western Ontario [email protected]

Data Mining in CRM & Direct Marketing. Jun Du The University of Western Ontario jdu43@uwo.ca Data Mining in CRM & Direct Marketing Jun Du The University of Western Ontario [email protected] Outline Why CRM & Marketing Goals in CRM & Marketing Models and Methodologies Case Study: Response Model Case

More information

Knowledge Discovery from patents using KMX Text Analytics

Knowledge Discovery from patents using KMX Text Analytics Knowledge Discovery from patents using KMX Text Analytics Dr. Anton Heijs [email protected] Treparel Abstract In this white paper we discuss how the KMX technology of Treparel can help searchers

More information

Driving Down Claim Costs With PREDICTIVE MODELING. December 2011. Sponsored by:

Driving Down Claim Costs With PREDICTIVE MODELING. December 2011. Sponsored by: Driving Down Claim Costs With PREDICTIVE MODELING December 2011 Sponsored by: Driving Down Claim Costs With PREDICTIVE MODELING Executive Summary Tools based on predictive modeling are transforming claims

More information

Detecting Money Laundering Actions Using Data Mining and Expert Systems

Detecting Money Laundering Actions Using Data Mining and Expert Systems Detecting Money Laundering Actions Using Data Mining and Expert Systems Ekrem Duman Dogus University Industrial Engineering Department Istanbul, TURKEY Joint work with Ayse Buyukkaya Outline Our experience

More information

Enhancing Compliance with Predictive Analytics

Enhancing Compliance with Predictive Analytics Enhancing Compliance with Predictive Analytics FTA 2007 Revenue Estimation and Research Conference Reid Linn Tennessee Department of Revenue [email protected] Sifting through a Gold Mine of Tax Data

More information

Data Mining Applications in Higher Education

Data Mining Applications in Higher Education Executive report Data Mining Applications in Higher Education Jing Luan, PhD Chief Planning and Research Officer, Cabrillo College Founder, Knowledge Discovery Laboratories Table of contents Introduction..............................................................2

More information

Data Mining + Business Intelligence. Integration, Design and Implementation

Data Mining + Business Intelligence. Integration, Design and Implementation Data Mining + Business Intelligence Integration, Design and Implementation ABOUT ME Vijay Kotu Data, Business, Technology, Statistics BUSINESS INTELLIGENCE - Result Making data accessible Wider distribution

More information

Local classification and local likelihoods

Local classification and local likelihoods Local classification and local likelihoods November 18 k-nearest neighbors The idea of local regression can be extended to classification as well The simplest way of doing so is called nearest neighbor

More information

Anomaly detection. Problem motivation. Machine Learning

Anomaly detection. Problem motivation. Machine Learning Anomaly detection Problem motivation Machine Learning Anomaly detection example Aircraft engine features: = heat generated = vibration intensity Dataset: New engine: (vibration) (heat) Density estimation

More information

Didacticiel Études de cas

Didacticiel Études de cas 1 Theme Data Mining with R The rattle package. R (http://www.r project.org/) is one of the most exciting free data mining software projects of these last years. Its popularity is completely justified (see

More information

Performance Measures in Data Mining

Performance Measures in Data Mining Performance Measures in Data Mining Common Performance Measures used in Data Mining and Machine Learning Approaches L. Richter J.M. Cejuela Department of Computer Science Technische Universität München

More information

Using Predictive Analytics to Detect Contract Fraud, Waste, and Abuse Case Study from U.S. Postal Service OIG

Using Predictive Analytics to Detect Contract Fraud, Waste, and Abuse Case Study from U.S. Postal Service OIG Using Predictive Analytics to Detect Contract Fraud, Waste, and Abuse Case Study from U.S. Postal Service OIG MACPA Government & Non Profit Conference April 26, 2013 Isaiah Goodall, Director of Business

More information

Data Mining Part 5. Prediction

Data Mining Part 5. Prediction Data Mining Part 5. Prediction 5.1 Spring 2010 Instructor: Dr. Masoud Yaghini Outline Classification vs. Numeric Prediction Prediction Process Data Preparation Comparing Prediction Methods References Classification

More information

Digging for Gold: Business Usage for Data Mining Kim Foster, CoreTech Consulting Group, Inc., King of Prussia, PA

Digging for Gold: Business Usage for Data Mining Kim Foster, CoreTech Consulting Group, Inc., King of Prussia, PA Digging for Gold: Business Usage for Data Mining Kim Foster, CoreTech Consulting Group, Inc., King of Prussia, PA ABSTRACT Current trends in data mining allow the business community to take advantage of

More information

Welcome. Data Mining: Updates in Technologies. Xindong Wu. Colorado School of Mines Golden, Colorado 80401, USA

Welcome. Data Mining: Updates in Technologies. Xindong Wu. Colorado School of Mines Golden, Colorado 80401, USA Welcome Xindong Wu Data Mining: Updates in Technologies Dept of Math and Computer Science Colorado School of Mines Golden, Colorado 80401, USA Email: xwu@ mines.edu Home Page: http://kais.mines.edu/~xwu/

More information

Predicting Bankruptcy with Robust Logistic Regression

Predicting Bankruptcy with Robust Logistic Regression Journal of Data Science 9(2011), 565-584 Predicting Bankruptcy with Robust Logistic Regression Richard P. Hauser and David Booth Kent State University Abstract: Using financial ratio data from 2006 and

More information

Claims Auditing in Automobile Insurance

Claims Auditing in Automobile Insurance Claims Auditing in Automobile Insurance by Pau Salsas-Forn University of Barcelona and Sharon Tennyson Cornell University February, 2001 We are grateful to Richard Derrig for making the data available

More information

Knowledge Discovery and Data Mining

Knowledge Discovery and Data Mining Knowledge Discovery and Data Mining Unit # 11 Sajjad Haider Fall 2013 1 Supervised Learning Process Data Collection/Preparation Data Cleaning Discretization Supervised/Unuspervised Identification of right

More information

ElegantJ BI. White Paper. The Competitive Advantage of Business Intelligence (BI) Forecasting and Predictive Analysis

ElegantJ BI. White Paper. The Competitive Advantage of Business Intelligence (BI) Forecasting and Predictive Analysis ElegantJ BI White Paper The Competitive Advantage of Business Intelligence (BI) Forecasting and Predictive Analysis Integrated Business Intelligence and Reporting for Performance Management, Operational

More information