Smart Sell Re-quote project for an Insurance company.



Similar documents
Predictive Modeling of Titanic Survivors: a Learning Competition

Internet Gambling Behavioral Markers: Using the Power of SAS Enterprise Miner 12.1 to Predict High-Risk Internet Gamblers

Methods for Interaction Detection in Predictive Modeling Using SAS Doug Thompson, PhD, Blue Cross Blue Shield of IL, NM, OK & TX, Chicago, IL

Role of Customer Response Models in Customer Solicitation Center s Direct Marketing Campaign

A Comparison of Decision Tree and Logistic Regression Model Xianzhe Chen, North Dakota State University, Fargo, ND

Reevaluating Policy and Claims Analytics: a Case of Non-Fleet Customers In Automobile Insurance Industry

Improving performance of Memory Based Reasoning model using Weight of Evidence coded categorical variables

Data Mining Techniques Chapter 6: Decision Trees

Paper Downtime of a truck = Truck repair end date - Truck repair start date

Data Mining - Evaluation of Classifiers

Text Analytics using High Performance SAS Text Miner

Agenda. Mathias Lanner Sas Institute. Predictive Modeling Applications. Predictive Modeling Training Data. Beslutsträd och andra prediktiva modeller

Gerry Hobbs, Department of Statistics, West Virginia University

PAKDD 2006 Data Mining Competition

DECISION TREE ANALYSIS: PREDICTION OF SERIOUS TRAFFIC OFFENDING

Survival Analysis of the Patients Diagnosed with Non-Small Cell Lung Cancer Using SAS Enterprise Miner 13.1

Improving the Performance of Data Mining Models with Data Preparation Using SAS Enterprise Miner Ricardo Galante, SAS Institute Brasil, São Paulo, SP

Application of SAS! Enterprise Miner in Credit Risk Analytics. Presented by Minakshi Srivastava, VP, Bank of America

Data Mining Using SAS Enterprise Miner Randall Matignon, Piedmont, CA

Data mining and statistical models in marketing campaigns of BT Retail

Survey Analysis: Data Mining versus Standard Statistical Analysis for Better Analysis of Survey Responses

EXPLORING & MODELING USING INTERACTIVE DECISION TREES IN SAS ENTERPRISE MINER. Copyr i g ht 2013, SAS Ins titut e Inc. All rights res er ve d.

Enhancing Compliance with Predictive Analytics

An Overview of Data Mining: Predictive Modeling for IR in the 21 st Century

Class #6: Non-linear classification. ML4Bio 2012 February 17 th, 2012 Quaid Morris

Data Mining Classification: Decision Trees

Predictive Analytics in the Public Sector: Using Data Mining to Assist Better Target Selection for Audit

Microsoft Azure Machine learning Algorithms

Data Mining Applications in Higher Education

Using multiple models: Bagging, Boosting, Ensembles, Forests

Applied Data Mining Analysis: A Step-by-Step Introduction Using Real-World Data Sets

A Property and Casualty Insurance Predictive Modeling Process in SAS

Knowledge Discovery and Data Mining. Bootstrap review. Bagging Important Concepts. Notes. Lecture 19 - Bagging. Tom Kelsey. Notes

Data Mining Algorithms Part 1. Dejan Sarka

MERGING BUSINESS KPIs WITH PREDICTIVE MODEL KPIs FOR BINARY CLASSIFICATION MODEL SELECTION

Data Mining Methods: Applications for Institutional Research

Successfully Implementing Predictive Analytics in Direct Marketing

Insurance Premium Increase Optimization: Case Study. Charles Pollack B.Ec F.I.A.A.

Social Media Mining. Data Mining Essentials

Data Mining: A Magic Technology for College Recruitment. Tongshan Chang, Ed.D.

A fast, powerful data mining workbench designed for small to midsize organizations

College Tuition: Data mining and analysis

!"!!"#$$%&'()*+$(,%!"#$%$&'()*""%(+,'-*&./#-$&'(-&(0*".$#-$1"(2&."3$'45"

THE HYBRID CART-LOGIT MODEL IN CLASSIFICATION AND DATA MINING. Dan Steinberg and N. Scott Cardell

Beating the MLB Moneyline

How To Make A Credit Risk Model For A Bank Account

ASSIGNMENT 4 PREDICTIVE MODELING AND GAINS CHARTS

STATISTICA. Financial Institutions. Case Study: Credit Scoring. and

Big Data Analytics. Benchmarking SAS, R, and Mahout. Allison J. Ames, Ralph Abbey, Wayne Thompson. SAS Institute Inc., Cary, NC

Insurance Analytics - analýza dat a prediktivní modelování v pojišťovnictví. Pavel Kříž. Seminář z aktuárských věd MFF 4.

A Study to Predict No Show Probability for a Scheduled Appointment at Free Health Clinic

Improving Demand Forecasting

Comparing the Results of Support Vector Machines with Traditional Data Mining Algorithms

Benchmarking of different classes of models used for credit scoring

Alex Vidras, David Tysinger. Merkle Inc.

Better credit models benefit us all

A Property & Casualty Insurance Predictive Modeling Process in SAS

Crawling and Detecting Community Structure in Online Social Networks using Local Information

Five Ways Retailers Can Profit from Customer Intelligence

Make Better Decisions Through Predictive Intelligence

Start-up Companies Predictive Models Analysis. Boyan Yankov, Kaloyan Haralampiev, Petko Ruskov

Companies already have customer data Creating a more effective sales team With all of this improved technology Most companies have a CRM

Improving SAS Global Forum Papers

The Operational Value of Social Media Information. Social Media and Customer Interaction

ECLT 5810 E-Commerce Data Mining Techniques - Introduction. Prof. Wai Lam

Predictive Data Mining in Very Large Data Sets: A Demonstration and Comparison Under Model Ensemble

Chapter 7: Data Mining

Index Contents Page No. Introduction . Data Mining & Knowledge Discovery

Modeling to improve the customer unit target selection for inspections of Commercial Losses in Brazilian Electric Sector - The case CEMIG

10 Ingredients of a Successful Marketing Plan SDB Creative Group, Inc.

Identifying At-Risk Students Using Machine Learning Techniques: A Case Study with IS 100

Sharing the experiences of teaching business analytics in a University course

New Work Item for ISO Predictive Analytics (Initial Notes and Thoughts) Introduction

Data Mining and Visualization

Skills Knowledge Energy Time People and decide how to use themto accomplish your objectives.


Text Analytics Illustrated with a Simple Data Set

Predicting Readmission of Diabetic Patients using the high performance Support Vector Machine algorithm of SAS Enterprise Miner

USING LOGISTIC REGRESSION TO PREDICT CUSTOMER RETENTION. Andrew H. Karp Sierra Information Services, Inc. San Francisco, California USA

Improving the Thermal Efficiency of Coal-Fired Power Plants: A Data Mining Approach

Fast Analytics on Big Data with H20

Predictive Modeling and Big Data

How Organisations Are Using Data Mining Techniques To Gain a Competitive Advantage John Spooner SAS UK

Data Mining: Overview. What is Data Mining?

Cloud Analytics for Capacity Planning and Instant VM Provisioning

RECOMMENDATIONS HOW TO ATTRACT CLIENTS TO ROBOFOREX

Leveraging Ensemble Models in SAS Enterprise Miner

What is Data Mining? MS4424 Data Mining & Modelling. MS4424 Data Mining & Modelling. MS4424 Data Mining & Modelling. MS4424 Data Mining & Modelling

MHI3000 Big Data Analytics for Health Care Final Project Report

Transcription:

SAS Analytics Day Smart Sell Re-quote project for an Insurance company. A project by Ajay Guyyala Naga Sudhir Lanka Narendra Babu Merla Kiran Reddy Samiullah Bramhanapalli Shaik

Business Situation XYZ is the leading Insurance company specializing in Auto Insurances. Re-quote project is an independent initiative by the company where the agent will be able to sell the unsold quotes back to the customers. Identifying the shared characteristics of agents and factors influencing them will help the company emerge as the leader in the market.

Need for Prediction There is an imminent need to predict the characteristics of agents which will really influence the agents to use the re-quote project effectively. This information is necessary for the company to use the strategies/patterns and sell higher percentage of quotes. Discovery of different trends through this whole process will help better predict the percentage of re-quotes done.

Data Preparation The modeling dataset has 42,322 observations and more than 300 variables. The data is messy, inconsistent, redundant and it also has many missing values. Some of the binary variables have more than 2 levels. Binary variables were recoded accordingly and were brought down to 2 levels. Measurement levels of certain variables needs to be recoded in order to obtain correct representation of the data. The missing values were imputed and the data was transformed to obtain normality using SAS Enterprise Miner.

Sample dataset The data has a disproportionate distribution of the levels in target variable. The level HIGH has very less number of observations. We sampled out a dataset which has exactly 5320 observations of each level. This is done to have equal representation of the levels in the data which will in-turn enable us to build a good model.

Prior Probabilities The random sampling dataset has a total of 15960 observations. This dataset is adjusted using prior probabilities to make it have all the original properties of the modeling dataset. Adjusted prior probabilities are entered in the exact same proportion as of the original modeling dataset.

Model Building Different models like Autonomous Decision tree, Gini Tree, Entropy Tree, Probability tree, Forward Regression, Stepwise Regression, Polynomial Regression, Neural Network, Auto neural, Radial basis neural networks(equal and unequal width) are some of the models that we considered and built. More than 25 models were considered and analyzed for our project. Gini Decision tree was selected as the best model with an accuracy of 62.8% and sensitivity of 85.48%.

Overview of model statistics Model Misclassification Rate Average Square Error Sensitivity Specificity Autonomous Tree 0.4142 0.1519 87.88% 58.12% Probability Tree 0.4200 0.1473 89.00% 57.09% 3-way Split tree 0.3920 0.1470 84.49% 63.52% Gini Tree 0.3720 0.1506 85.48% 62.96% Entropy Tree 0.3878 0.1555 79.33% 69.35% Forward Regression 0.3781 0.1443 83.65% 65.51% Step Wise Regression 0.3781 0.1444 83.65% 65.64% Large P value 0.6666 0.2222 0 1 Polynomial Regression 0.3900 0.1504 82.48% 65.86% Neural Network 0.3854 0.1453 82.85% 65.14% Auto Neural (Regression) 0.3994 0.1555 84.68% 61.62% Neural Network 0.3824 0.1433 85.76% 63.48% (Regression) Neural Network 0.3807 0.1449 81.25% 66.65% (6Nodes) RBF Unequal Width 0.4277 0.1630 86.47% 57.49% RBF Equal Width 0.3786 0.1433 85.48% 64.32% Ensemble 0.3889 0.1643 86.61% 62.77%

Scoring We applied the learning from the modeling on the scoring data. From the scoring results it was found that 6.235% was predicted as the HIGH percentage of requotes done by agents.

Business Insights and Opportunities The most notable pattern in our predictions is that the agent who is active on FAO website would generate more revenue. The percentage change in the number of producers should be low over time as it would impact the requote rate. The number of sales representatives should be increased so as to target more number of agency locations. The effective buying income and population of an area are some key demographic factors influencing the percentage of re-quotes.

Business Recommendations The agents with VQAR comparative status should be encouraged. The agent who maintains special relationship with the company would add more credit in re-quote rate. The agent who is available on Find An Agent actively will be easily accessible to customers and should be encouraged. The company needs to focus on agents who handles lesser number of competitors. Establishing more number of ad campaigns might increase the probability of targeting large agencies and in-turn increasing the percentage of re-quotes.

For more details contact presenters at: Ajay Guyyala Phone:703-470-1455 Email:ajju425@gmail.com Naga Sudhir Lanka Phone:405-612-1805 Email:vsudhir90@gmail.com Narendra Babu Merla Phone:862-576-2040 Email:narendra.itian@gmail.com Samiullah Bramhanapalli Shaik Phone: 405-408-9236 Email: sami.bramhanapalli_shaik@okstate.edu Kiran Reddy Kondamadugula Phone:940-435-8509 Email: Faculty Advisor: Dr. Goutam Chakraborty