Insurance Fraud Detection: MARS versus Neural Networks?
|
|
|
- Mervin Harmon
- 10 years ago
- Views:
Transcription
1 Insurance Fraud Detection: MARS versus Neural Networks? Louise A Francis FCAS, MAAA [email protected] 1
2 Objectives Introduce a relatively new data mining method which can be used as an alternative to neural networks Compare the method to neural networks Apply the methods to fraud data 2
3 MARS Acronym for Multivariate Adaptive Regression Splines In many ways it is similar to regression, but: It can deal with data complexities that ordinary linear regression had difficulties 3
4 Data Complexities Nonlinear functions Interactions Missing Data 4
5 The Fraud Study Data 1993 Automobile Insurers Bureau closed Personal Injury Protection claims Dependent Variables Suspicion Score Expert assessment of liklihood of fraud or abuse Predictor Variables Red flag indicators Claim file variables 5
6 Example: Nonlinear Function Neural Network Fit of SUSPICION vs Provider Bill netfraud Provider Bill 6
7 MARS Fit to Nonlinear Function MARS Fit of SUSPICION vs Provider Bill 4 Fitted Suspicion Score Provider Bill 7
8 How MARS Fits Nonlinear Function MARS fits a piecewise regression BF1 = max(0, 2185 X ) Y = * BF1 BF1 is basis function MARS uses statistical optimization to find best basis function Basis function similar to dummy variable in regression 8
9 Interactions Effect of a predictor variable on dependent variable depends on the values of another variable(s) Neural Network Predicted for Provider Bill and Injury Type inj.type: Neural Net Predicted inj.type: 03 in j.type: 04 inj.type: 01 in j.type: Provider Bill
10 Interactions: MARS Fit MARS Predicted for Provider Bill and Injury Type inj.type: 05 inj.type: 06 3 Fitted Suspicion Score 0 6 inj.type: 03 inj.type: 04 inj.type: 01 inj.type: Provider 1 Bill 10
11 Interactions: The Basis Functions Injury type 4 (neck sprain), and type 5 (back sprain) increase faster and have higher scores than the other injury types BF1 = max(0, X ) BF2 = ( INJTYPE = 4 OR INJTYPE = 5) BF3 = max(0, X - 159) * BF2 Y = * BF * BF E-03 * BF3 where X is the provider bill INJTYPE is the injury type 11
12 Missing Data Occurs frequently in insurance data There are some sophisticated methods for addressing this (i.e., EM algorithm) MARS uses basis functions to find surrogates for variables with missing values 12
13 Missing Data Example: Health Insurance (Claimant has Health Insurance) Value Frequency Percent Cumulative Percent No Missing
14 Missing Data Example BF1 = max(0, MP_BILL ) BF2 = max(0, MP_BILL ) BF3 = (HEALTHIN MISSING) BF4 = (HEALTHIN = MISSING) BF5 = (HEALTHIN = N) BF7 = max(0, MP_BILL ) * BF5 BF8 = max(0, MP_BILL ) * BF5 BF9 = max(0, MP_BILL - 98) * BF4 BF10 = max(0, 98 - MP_BILL ) * BF4 BF11 = max(0, MP_BILL - 710) * BF3 BF13 = max(0, MP_BILL ) BF15 = BF3 * BF2 Y = * BF * BF * BF E-04 * BF E-03 * BF * BF * BF * BF E-03 * BF E-03 * BF15 14
15 More Complex Example Dependent variable: Expert s assessment of liklihood claim is legitimate A classification application Predictor variables: Combination of claim file variables (age of claimant, legal representation) red flag variables (injury is strain/sprain only, claimant has history of previous claim) 15
16 More Complex Example BF1 = (LEGALREP = 1) BF2 = (LEGALREP = 2) BF3 = ( TRTLAG = missing) BF4 = ( TRTLAG NE missing) BF5 = ( INJ01 = 1) * BF2 BF7 = ( ACC04 = 1) * BF4 BF9 = ( ACC14 = 1) BF11 = ( PARTDIS = 1) * BF4 BF15 = max(0, AGE - 36) * BF4 BF16 = max(0, 36 - AGE ) * BF4 BF18 = max(0, 55 - AMBUL ) * BF15 BF20 = max(0, 10 - RPTLAG ) * BF4 BF21 = ( CLT02 = 1) BF23 = POLLAG * BF21 BF24 = ( ACC15 = 1) * BF16 Y = * BF * BF * BF * BF * BF * BF * BF * BF E-03 * BF * BF E-03 * BF * BF24 16
17 Evaluating Predictor Variables: Generalized Cross-validation GCV = 1 N y ˆ ( i f x [ N 1 k / N i= 1 i ) ] 2 where N is the number of observations y is the dependent variable x is the independent variable(s) k is the effective number of parameters or degrees of freedom in the model. 17
18 Variable Importance Ranking Rank Variable MARS Ranking of Variables Description 1 LEGALREP Legal Representation 2 TRTMIS Treatment lag missing 3 ACC04 Single vehicle accident 4 INJ01 Injury consisted of strain or sprain only 5 AGE Claimant age 6 PARTDIS Claimant partially disabled Property damage was inconsistent with 7 ACC14 accident 8 CLT02 Had a history of previous claims 9 POLLAG Policy lag 10 RPTLAG Report lag 11 AMBUL Ambulance charges 12 ACC15 Francis Very Analytics minor impact and Actuarial collision 18
19 Methods of Assessing Fit Cross Validation Confusion Matrix Sensitivity Specificity ROC Curve Area Under the ROC Curve 19
20 Cross-validation Four Fold Cross-validation Percent Technique R^2 Correct MARS Neural Network
21 Confusion Matrix MARS Predicted * Actual Predicted Actual No Yes Total No Yes Total
22 Sensitivity/Specificity Sensitivity: Percent of targets correctly predicted Specificity: Percent of non-targets correctly predicted Model Sensitivity Specificity MARS Neural Network
23 ROC Curve ROC Curve 1.0 Sensitivity Neural Net MARS BASE Specificity 23
24 Area Under the ROC Curve Statistics for Area Under the ROC Curve Test Result Variables Area Std Error Asymptotic Sig Lower 95% Bound Upper 95% Bound MARS Probability Neural Probability
25 Which One is Better? Depends on application MARS handles missing values better MARS clusters categories on nominal variables with many categories MARS can be explained more easily On applications where analyst believes neural networks will outperform MARS, use them Also use hybrid models to improve performance 25
26 Using the Model Results Both claim file variables and red flag variables appear to be significant in predicting fraud Other research supports value of using statistical and data mining models to predict fraud Derrig (Journal of Risk and Insurance, 2002) advocates using analytic models to sort claims Pay claims with low score Devote resources to claims with high scores 26
Martian Chronicles: Is MARS better than Neural Networks? by Louise Francis, FCAS, MAAA
Martian Chronicles: Is MARS better than Neural Networks? by Louise Francis, FCAS, MAAA Abstract: A recently developed data mining technique, Multivariate Adaptive Regression Splines (MARS) has been hailed
Predictive Modeling in Workers Compensation 2008 CAS Ratemaking Seminar
Predictive Modeling in Workers Compensation 2008 CAS Ratemaking Seminar Prepared by Louise Francis, FCAS, MAAA Francis Analytics and Actuarial Data Mining, Inc. www.data-mines.com [email protected]
Session 94 L, Detecting Fraudulent Claims in General Insurance Moderator: Anthony E. Cappelletti, FSA, FCAS, FCIA
Session 94 L, Detecting Fraudulent Claims in General Insurance Moderator: Anthony E. Cappelletti, FSA, FCAS, FCIA Presenters: Richard A. Derrig Louise A. Francis, FCAS, MAAA Using Predictive Analytics
Predictive Modeling in Automobile Insurance: A Preliminary Analysis. Stephen P. D Arcy. Paper to be Presented at the
Predictive Modeling in Automobile Insurance: A Preliminary Analysis by Stephen P. D Arcy Paper to be Presented at the World Risk and Insurance Economics Congress Salt Lake City, Utah August 2005 Contact
Using Predictive Analytics to Detect Fraudulent Claims
Using Predictive Analytics to Detect Fraudulent Claims May 17, 211 Roosevelt C. Mosley, Jr., FCAS, MAAA CAS Spring Meeting Palm Beach, FL Experience the Pinnacle Difference! Predictive Analysis for Fraud
A Property & Casualty Insurance Predictive Modeling Process in SAS
Paper AA-02-2015 A Property & Casualty Insurance Predictive Modeling Process in SAS 1.0 ABSTRACT Mei Najim, Sedgwick Claim Management Services, Chicago, Illinois Predictive analytics has been developing
Predictive Modeling and Big Data
Predictive Modeling and Presented by Eileen Burns, FSA, MAAA Milliman Agenda Current uses of predictive modeling in the life insurance industry Potential applications of 2 1 June 16, 2014 [Enter presentation
Predictive Modeling Techniques in Insurance
Predictive Modeling Techniques in Insurance Tuesday May 5, 2015 JF. Breton Application Engineer 2014 The MathWorks, Inc. 1 Opening Presenter: JF. Breton: 13 years of experience in predictive analytics
Model Validation Techniques
Model Validation Techniques Kevin Mahoney, FCAS kmahoney@ travelers.com CAS RPM Seminar March 17, 2010 Uses of Statistical Models in P/C Insurance Examples of Applications Determine expected loss cost
Azure Machine Learning, SQL Data Mining and R
Azure Machine Learning, SQL Data Mining and R Day-by-day Agenda Prerequisites No formal prerequisites. Basic knowledge of SQL Server Data Tools, Excel and any analytical experience helps. Best of all:
Applied Data Mining Analysis: A Step-by-Step Introduction Using Real-World Data Sets
Applied Data Mining Analysis: A Step-by-Step Introduction Using Real-World Data Sets http://info.salford-systems.com/jsm-2015-ctw August 2015 Salford Systems Course Outline Demonstration of two classification
Predictive modelling around the world 28.11.13
Predictive modelling around the world 28.11.13 Agenda Why this presentation is really interesting Introduction to predictive modelling Case studies Conclusions Why this presentation is really interesting
BOOSTED REGRESSION TREES: A MODERN WAY TO ENHANCE ACTUARIAL MODELLING
BOOSTED REGRESSION TREES: A MODERN WAY TO ENHANCE ACTUARIAL MODELLING Xavier Conort [email protected] Session Number: TBR14 Insurance has always been a data business The industry has successfully
Customer and Business Analytic
Customer and Business Analytic Applied Data Mining for Business Decision Making Using R Daniel S. Putler Robert E. Krider CRC Press Taylor &. Francis Group Boca Raton London New York CRC Press is an imprint
Practical Data Science with Azure Machine Learning, SQL Data Mining, and R
Practical Data Science with Azure Machine Learning, SQL Data Mining, and R Overview This 4-day class is the first of the two data science courses taught by Rafal Lukawiecki. Some of the topics will be
Data Preparation Part 1: Exploratory Data Analysis & Data Cleaning, Missing Data
Data Preparation Part 1: Exploratory Data Analysis & Data Cleaning, Missing Data CAS Predictive Modeling Seminar Louise Francis Francis Analytics and Actuarial Data Mining, Inc. www.data-mines.com [email protected]
Predictive Modeling for Workers Compensation Claims
Predictive Modeling for Workers Compensation Claims AASCIF Super Conference Kirsten C. Hernan Deloitte Consulting LLP October 4, 2012 NOTICE: THIS DOCUMENT IS PROPRIETARY AND CONFIDENTIAL This document
Data Mining. Nonlinear Classification
Data Mining Unit # 6 Sajjad Haider Fall 2014 1 Nonlinear Classification Classes may not be separable by a linear boundary Suppose we randomly generate a data set as follows: X has range between 0 to 15
Improving the Performance of Data Mining Models with Data Preparation Using SAS Enterprise Miner Ricardo Galante, SAS Institute Brasil, São Paulo, SP
Improving the Performance of Data Mining Models with Data Preparation Using SAS Enterprise Miner Ricardo Galante, SAS Institute Brasil, São Paulo, SP ABSTRACT In data mining modelling, data preparation
May provide differing lists of damaged/stolen property regarding an insurable loss. May provide questionable proof of purchase for expensive items. May provide invoices or estimates that appear to be inflated
COMPARING NEURAL NETWORK ALGORITHM PERFORMANCE USING SPSS AND NEUROSOLUTIONS
COMPARING NEURAL NETWORK ALGORITHM PERFORMANCE USING SPSS AND NEUROSOLUTIONS AMJAD HARB and RASHID JAYOUSI Faculty of Computer Science, Al-Quds University, Jerusalem, Palestine Abstract This study exploits
A Basic Guide to Modeling Techniques for All Direct Marketing Challenges
A Basic Guide to Modeling Techniques for All Direct Marketing Challenges Allison Cornia Database Marketing Manager Microsoft Corporation C. Olivia Rud Executive Vice President Data Square, LLC Overview
How To Build A Predictive Model In Insurance
The Do s & Don ts of Building A Predictive Model in Insurance University of Minnesota November 9 th, 2012 Nathan Hubbell, FCAS Katy Micek, Ph.D. Agenda Travelers Broad Overview Actuarial & Analytics Career
Benchmarking of different classes of models used for credit scoring
Benchmarking of different classes of models used for credit scoring We use this competition as an opportunity to compare the performance of different classes of predictive models. In particular we want
Predictive Analytics 101
Predictive Analytics 101 Current Trends in Predictive Modeling and Analysis Frank A. Alerte, Esq.* *We would like to acknowledge the following individuals for their contribution and valuable input in preparing
Corporate Defaults and Large Macroeconomic Shocks
Corporate Defaults and Large Macroeconomic Shocks Mathias Drehmann Bank of England Andrew Patton London School of Economics and Bank of England Steffen Sorensen Bank of England The presentation expresses
TNS EX A MINE BehaviourForecast Predictive Analytics for CRM. TNS Infratest Applied Marketing Science
TNS EX A MINE BehaviourForecast Predictive Analytics for CRM 1 TNS BehaviourForecast Why is BehaviourForecast relevant for you? The concept of analytical Relationship Management (acrm) becomes more and
COPYRIGHTED MATERIAL. Contents. List of Figures. Acknowledgments
Contents List of Figures Foreword Preface xxv xxiii xv Acknowledgments xxix Chapter 1 Fraud: Detection, Prevention, and Analytics! 1 Introduction 2 Fraud! 2 Fraud Detection and Prevention 10 Big Data for
CART 6.0 Feature Matrix
CART 6.0 Feature Matri Enhanced Descriptive Statistics Full summary statistics Brief summary statistics Stratified summary statistics Charts and histograms Improved User Interface New setup activity window
THE HYBRID CART-LOGIT MODEL IN CLASSIFICATION AND DATA MINING. Dan Steinberg and N. Scott Cardell
THE HYBID CAT-LOGIT MODEL IN CLASSIFICATION AND DATA MINING Introduction Dan Steinberg and N. Scott Cardell Most data-mining projects involve classification problems assigning objects to classes whether
Combining GLM and datamining techniques for modelling accident compensation data. Peter Mulquiney
Combining GLM and datamining techniques for modelling accident compensation data Peter Mulquiney Introduction Accident compensation data exhibit features which complicate loss reserving and premium rate
Data Mining: STATISTICA
Data Mining: STATISTICA Outline Prepare the data Classification and regression 1 Prepare the Data Statistica can read from Excel,.txt and many other types of files Compared with WEKA, Statistica is much
Chapter 6. The stacking ensemble approach
82 This chapter proposes the stacking ensemble approach for combining different data mining classifiers to get better performance. Other combination techniques like voting, bagging etc are also described
Data Mining - Evaluation of Classifiers
Data Mining - Evaluation of Classifiers Lecturer: JERZY STEFANOWSKI Institute of Computing Sciences Poznan University of Technology Poznan, Poland Lecture 4 SE Master Course 2008/2009 revised for 2010
A Property and Casualty Insurance Predictive Modeling Process in SAS
Paper 11422-2016 A Property and Casualty Insurance Predictive Modeling Process in SAS Mei Najim, Sedgwick Claim Management Services ABSTRACT Predictive analytics is an area that has been developing rapidly
Methods for Interaction Detection in Predictive Modeling Using SAS Doug Thompson, PhD, Blue Cross Blue Shield of IL, NM, OK & TX, Chicago, IL
Paper SA01-2012 Methods for Interaction Detection in Predictive Modeling Using SAS Doug Thompson, PhD, Blue Cross Blue Shield of IL, NM, OK & TX, Chicago, IL ABSTRACT Analysts typically consider combinations
A Deeper Look Inside Generalized Linear Models
A Deeper Look Inside Generalized Linear Models University of Minnesota February 3 rd, 2012 Nathan Hubbell, FCAS Agenda Property & Casualty (P&C Insurance) in one slide The Actuarial Profession Travelers
How To Understand The Theory Of Probability
Graduate Programs in Statistics Course Titles STAT 100 CALCULUS AND MATR IX ALGEBRA FOR STATISTICS. Differential and integral calculus; infinite series; matrix algebra STAT 195 INTRODUCTION TO MATHEMATICAL
Prediction of Stock Performance Using Analytical Techniques
136 JOURNAL OF EMERGING TECHNOLOGIES IN WEB INTELLIGENCE, VOL. 5, NO. 2, MAY 2013 Prediction of Stock Performance Using Analytical Techniques Carol Hargreaves Institute of Systems Science National University
CONTENTS PREFACE 1 INTRODUCTION 1 2 DATA VISUALIZATION 19
PREFACE xi 1 INTRODUCTION 1 1.1 Overview 1 1.2 Definition 1 1.3 Preparation 2 1.3.1 Overview 2 1.3.2 Accessing Tabular Data 3 1.3.3 Accessing Unstructured Data 3 1.3.4 Understanding the Variables and Observations
A Short Tour of the Predictive Modeling Process
Chapter 2 A Short Tour of the Predictive Modeling Process Before diving in to the formal components of model building, we present a simple example that illustrates the broad concepts of model building.
Statistics in Retail Finance. Chapter 7: Fraud Detection in Retail Credit
Statistics in Retail Finance Chapter 7: Fraud Detection in Retail Credit 1 Overview > Detection of fraud remains an important issue in retail credit. Methods similar to scorecard development may be employed,
The Data Mining Process
Sequence for Determining Necessary Data. Wrong: Catalog everything you have, and decide what data is important. Right: Work backward from the solution, define the problem explicitly, and map out the data
Maschinelles Lernen mit MATLAB
Maschinelles Lernen mit MATLAB Jérémy Huard Applikationsingenieur The MathWorks GmbH 2015 The MathWorks, Inc. 1 Machine Learning is Everywhere Image Recognition Speech Recognition Stock Prediction Medical
FRAUD CLASSIFICATION USING PRINCIPAL COMPONENT ANALYSIS OF RIDITs
The Journal of Risk and Insurance, 2002, Vol. 69, No. 3, 341-371 FRAUD CLASSIFICATION USING PRINCIPAL COMPONENT ANALYSIS OF RIDITs Patrick L. Brockett Richard A. Derrig Linda L. Golden Arnold Levine Mark
Data Mining. Dr. Saed Sayad. University of Toronto 2010 [email protected]. http://chem-eng.utoronto.ca/~datamining/
Data Mining Dr. Saed Sayad University of Toronto 2010 [email protected] http://chem-eng.utoronto.ca/~datamining/ 1 Data Mining Data mining is about explaining the past and predicting the future by
Using Data Mining to Predict Automobile Insurance Fraud
UNIVERSIDADE CATÓLICA PORTUGUESA CATÓLICA LISBON SCHOOL OF BUSINESS AND ECONOMICS Master of Science in Business Administration Using Data Mining to Predict Automobile Insurance Fraud JOÃO BERNARDO DO VALE
answers to some of the tough questions that insurers get asked in Ontario. We hope it helps you own the road this summer.
Dear colleague, We ve prepared this Q&A backgrounder to arm you with answers to some of the tough questions that insurers get asked in Ontario. We hope it helps you own the road this summer. We ll follow
Cleaned Data. Recommendations
Call Center Data Analysis Megaputer Case Study in Text Mining Merete Hvalshagen www.megaputer.com Megaputer Intelligence, Inc. 120 West Seventh Street, Suite 10 Bloomington, IN 47404, USA +1 812-0-0110
IBM's Fraud and Abuse, Analytics and Management Solution
Government Efficiency through Innovative Reform IBM's Fraud and Abuse, Analytics and Management Solution Service Definition Copyright IBM Corporation 2014 Table of Contents Overview... 1 Major differentiators...
Application of Predictive Analytics to Higher Degree Research Course Completion Times
Application of Predictive Analytics to Higher Degree Research Course Completion Times Application of Decision Theory to PhD Course Completions (2006 2013) Rachna 1 I Dhand, Senior Strategic Information
Insurance Analytics - analýza dat a prediktivní modelování v pojišťovnictví. Pavel Kříž. Seminář z aktuárských věd MFF 4.
Insurance Analytics - analýza dat a prediktivní modelování v pojišťovnictví Pavel Kříž Seminář z aktuárských věd MFF 4. dubna 2014 Summary 1. Application areas of Insurance Analytics 2. Insurance Analytics
ROI CASE STUDY SPSS INFINITY PROPERTY & CASUALTY
ROI CASE STUDY SPSS INFINITY PROPERTY & CASUALTY THE BOTTOM LINE Infinity Property & Casualty Corporation (IPACC) deployed SPSS to reduce its payments on fraudulent claims and improve its ability to collect
An effective approach to preventing application fraud. Experian Fraud Analytics
An effective approach to preventing application fraud Experian Fraud Analytics The growing threat of application fraud Fraud attacks are increasing across the world Application fraud is a rapidly growing
Principles of Data Mining by Hand&Mannila&Smyth
Principles of Data Mining by Hand&Mannila&Smyth Slides for Textbook Ari Visa,, Institute of Signal Processing Tampere University of Technology October 4, 2010 Data Mining: Concepts and Techniques 1 Differences
PREDICTIVE MODELLING FOR COMMERCIAL INSURANCE
PREDICTIVE MODELLING FOR COMMERCIAL INSURANCE General Insurance Pricing Seminar 13 June 2008 London James Guszcza, FCAS, MAAA [email protected] General Themes Predictive modelling: 3 Levels of Discussion
The New NCCI Hazard Groups
The New NCCI Hazard Groups Greg Engl, PhD, FCAS, MAAA National Council on Compensation Insurance CAS Reinsurance Seminar June, 2006 Workers Compensation Session Agenda History of previous work Impact of
Gerry Hobbs, Department of Statistics, West Virginia University
Decision Trees as a Predictive Modeling Method Gerry Hobbs, Department of Statistics, West Virginia University Abstract Predictive modeling has become an important area of interest in tasks such as credit
How To Price Insurance In Canada
The New Paradigm of Property & Casualty Insurance Pricing: Multivariate analysis and Predictive Modeling The ability to effectively price personal lines insurance policies to accurately match rate with
WebFOCUS RStat. RStat. Predict the Future and Make Effective Decisions Today. WebFOCUS RStat
Information Builders enables agile information solutions with business intelligence (BI) and integration technologies. WebFOCUS the most widely utilized business intelligence platform connects to any enterprise
Chapter 12 Discovering New Knowledge Data Mining
Chapter 12 Discovering New Knowledge Data Mining Becerra-Fernandez, et al. -- Knowledge Management 1/e -- 2004 Prentice Hall Additional material 2007 Dekai Wu Chapter Objectives Introduce the student to
Detection. Perspective. Network Anomaly. Bhattacharyya. Jugal. A Machine Learning »C) Dhruba Kumar. Kumar KaKta. CRC Press J Taylor & Francis Croup
Network Anomaly Detection A Machine Learning Perspective Dhruba Kumar Bhattacharyya Jugal Kumar KaKta»C) CRC Press J Taylor & Francis Croup Boca Raton London New York CRC Press is an imprint of the Taylor
Application of SAS! Enterprise Miner in Credit Risk Analytics. Presented by Minakshi Srivastava, VP, Bank of America
Application of SAS! Enterprise Miner in Credit Risk Analytics Presented by Minakshi Srivastava, VP, Bank of America 1 Table of Contents Credit Risk Analytics Overview Journey from DATA to DECISIONS Exploratory
Accurately and Efficiently Measuring Individual Account Credit Risk On Existing Portfolios
Accurately and Efficiently Measuring Individual Account Credit Risk On Existing Portfolios By: Michael Banasiak & By: Daniel Tantum, Ph.D. What Are Statistical Based Behavior Scoring Models And How Are
Predictive Modeling from a Risk Management Perspective Recording of this session via any media type is strictly prohibited.
Predictive Modeling from a Risk Management Perspective Page 1 David P. Duden Director Deloitte Consulting Hartford, CT Eric Oldroyd Group Manager Target Corporation Minneapolis, MN Page 2 NOTICE: THIS
The Relationship of Credit-Based Insurance Scores to Private Passenger Automobile Insurance Loss Propensity
The Relationship of Credit-Based Insurance Scores to Private Passenger Automobile Insurance Loss Propensity An Actuarial Study by EPIC Actuaries, LLC Principal Authors: Michael J. Miller, FCAS, MAAA Richard
Knowledge Discovery and Data Mining
Knowledge Discovery and Data Mining Unit # 10 Sajjad Haider Fall 2012 1 Supervised Learning Process Data Collection/Preparation Data Cleaning Discretization Supervised/Unuspervised Identification of right
Linear Models and Conjoint Analysis with Nonlinear Spline Transformations
Linear Models and Conjoint Analysis with Nonlinear Spline Transformations Warren F. Kuhfeld Mark Garratt Abstract Many common data analysis models are based on the general linear univariate model, including
Risk pricing for Australian Motor Insurance
Risk pricing for Australian Motor Insurance Dr Richard Brookes November 2012 Contents 1. Background Scope How many models? 2. Approach Data Variable filtering GLM Interactions Credibility overlay 3. Model
Data Mining in CRM & Direct Marketing. Jun Du The University of Western Ontario [email protected]
Data Mining in CRM & Direct Marketing Jun Du The University of Western Ontario [email protected] Outline Why CRM & Marketing Goals in CRM & Marketing Models and Methodologies Case Study: Response Model Case
Knowledge Discovery from patents using KMX Text Analytics
Knowledge Discovery from patents using KMX Text Analytics Dr. Anton Heijs [email protected] Treparel Abstract In this white paper we discuss how the KMX technology of Treparel can help searchers
Driving Down Claim Costs With PREDICTIVE MODELING. December 2011. Sponsored by:
Driving Down Claim Costs With PREDICTIVE MODELING December 2011 Sponsored by: Driving Down Claim Costs With PREDICTIVE MODELING Executive Summary Tools based on predictive modeling are transforming claims
Detecting Money Laundering Actions Using Data Mining and Expert Systems
Detecting Money Laundering Actions Using Data Mining and Expert Systems Ekrem Duman Dogus University Industrial Engineering Department Istanbul, TURKEY Joint work with Ayse Buyukkaya Outline Our experience
Enhancing Compliance with Predictive Analytics
Enhancing Compliance with Predictive Analytics FTA 2007 Revenue Estimation and Research Conference Reid Linn Tennessee Department of Revenue [email protected] Sifting through a Gold Mine of Tax Data
Data Mining Applications in Higher Education
Executive report Data Mining Applications in Higher Education Jing Luan, PhD Chief Planning and Research Officer, Cabrillo College Founder, Knowledge Discovery Laboratories Table of contents Introduction..............................................................2
Data Mining + Business Intelligence. Integration, Design and Implementation
Data Mining + Business Intelligence Integration, Design and Implementation ABOUT ME Vijay Kotu Data, Business, Technology, Statistics BUSINESS INTELLIGENCE - Result Making data accessible Wider distribution
Local classification and local likelihoods
Local classification and local likelihoods November 18 k-nearest neighbors The idea of local regression can be extended to classification as well The simplest way of doing so is called nearest neighbor
Anomaly detection. Problem motivation. Machine Learning
Anomaly detection Problem motivation Machine Learning Anomaly detection example Aircraft engine features: = heat generated = vibration intensity Dataset: New engine: (vibration) (heat) Density estimation
Didacticiel Études de cas
1 Theme Data Mining with R The rattle package. R (http://www.r project.org/) is one of the most exciting free data mining software projects of these last years. Its popularity is completely justified (see
Performance Measures in Data Mining
Performance Measures in Data Mining Common Performance Measures used in Data Mining and Machine Learning Approaches L. Richter J.M. Cejuela Department of Computer Science Technische Universität München
Using Predictive Analytics to Detect Contract Fraud, Waste, and Abuse Case Study from U.S. Postal Service OIG
Using Predictive Analytics to Detect Contract Fraud, Waste, and Abuse Case Study from U.S. Postal Service OIG MACPA Government & Non Profit Conference April 26, 2013 Isaiah Goodall, Director of Business
Data Mining Part 5. Prediction
Data Mining Part 5. Prediction 5.1 Spring 2010 Instructor: Dr. Masoud Yaghini Outline Classification vs. Numeric Prediction Prediction Process Data Preparation Comparing Prediction Methods References Classification
Digging for Gold: Business Usage for Data Mining Kim Foster, CoreTech Consulting Group, Inc., King of Prussia, PA
Digging for Gold: Business Usage for Data Mining Kim Foster, CoreTech Consulting Group, Inc., King of Prussia, PA ABSTRACT Current trends in data mining allow the business community to take advantage of
Welcome. Data Mining: Updates in Technologies. Xindong Wu. Colorado School of Mines Golden, Colorado 80401, USA
Welcome Xindong Wu Data Mining: Updates in Technologies Dept of Math and Computer Science Colorado School of Mines Golden, Colorado 80401, USA Email: xwu@ mines.edu Home Page: http://kais.mines.edu/~xwu/
Predicting Bankruptcy with Robust Logistic Regression
Journal of Data Science 9(2011), 565-584 Predicting Bankruptcy with Robust Logistic Regression Richard P. Hauser and David Booth Kent State University Abstract: Using financial ratio data from 2006 and
Claims Auditing in Automobile Insurance
Claims Auditing in Automobile Insurance by Pau Salsas-Forn University of Barcelona and Sharon Tennyson Cornell University February, 2001 We are grateful to Richard Derrig for making the data available
Knowledge Discovery and Data Mining
Knowledge Discovery and Data Mining Unit # 11 Sajjad Haider Fall 2013 1 Supervised Learning Process Data Collection/Preparation Data Cleaning Discretization Supervised/Unuspervised Identification of right
ElegantJ BI. White Paper. The Competitive Advantage of Business Intelligence (BI) Forecasting and Predictive Analysis
ElegantJ BI White Paper The Competitive Advantage of Business Intelligence (BI) Forecasting and Predictive Analysis Integrated Business Intelligence and Reporting for Performance Management, Operational
