SOA 2013 Life & Annuity Symposium May 6-7, Session 30 PD, Predictive Modeling Applications for Life and Annuity Pricing and Underwriting
|
|
- Beverly Randall
- 8 years ago
- Views:
Transcription
1 SOA 2013 Life & Annuity Symposium May 6-7, 2013 Session 30 PD, Predictive Modeling Applications for Life and Annuity Pricing and Underwriting Moderator: Barry D. Senensky, FSA, FCIA, MAAA Presenters: Jonathan P. Polon, FSA Qichun (Richard) Xu, FSA, Ph.D. Primary Competency Technical Skills and Analytical Problem Solving
2 Predictive Modeling for Life and Annuity Pricing and Underwriting Life and Annuity Symposium Session 30 May 6, 2013 Jonathan Polon FSA Overview One Key Takeaway Traditional Actuarial Techniques vs. Predictive Modeling Credibility vs. Validation Benefits of Predictive Modeling Data Considerations Performing the Analysis 2 1
3 One Key Takeaway 3 One Key Takeaway Mortality experience takes several years to develop Your ability to analyze internal mortality experience in the future will be affected by the way you capture data today Invest the time and resources today to develop and implement a data collection strategy What data to collect Where to collect the data from How to structure the data storage to facilitate analysis 4 2
4 Data Quality vs. Data Quantity The Law of Diminishing Marginal Returns applies Select the data elements that you believe are most important Focus on accuracy, completeness and structure for the key data elements rather than simply maximizing the number of data elements; this will: Minimize the cost of data storage and capture Greatly decrease time to create models Increase the interpretability of the models Reduce the risk of overfit Probably improve model accuracy 5 Traditional Actuarial Techniques vs. Predictive Modeling 6 3
5 Traditional Actuarial Techniques Its purpose is not to classify individual risks Rather, it is used to determine average cost for each class of risk Objective is to be accurate at the class level in aggregate, not at the level of the individual case Typically applied in low number of dimensions (e.g., age, gender, smoking, underwriting class, duration) Techniques include tables and classical statistics 7 Predictive Modeling Its purpose is to make predictions at the individual case level Output for each case could be a risk class, number of debits or qx vector Each case is unique and has its own combination of characteristics Typically applied in high number of dimensions Techniques include machine learning and numerical analysis Iteratively improving fit of model to the historic data 8 4
6 Credibility vs. Validation 9 Credibility Actuaries apply credibility theory to ensure their mortality analysis is based upon a sufficient number of observations From CIA Educational Note on Expected Mortality, July 2002 Goal of credibility theory is to provide a framework for combining data from different sources Typically company data, which may not be fully credible, and industry data, which is assumed to be fully credible The Normalized Method is the preferred credibility method and 3,007 is the suggested number of deaths needed for full credibility Credibility is as much an art is it is a science Barry Senensky FSA FCIA MAAA, April
7 Validation Credibility is not independent of dimensionality If # observations is small, can still model primary predictors As # observations increases, models can be expanded to include predictors of secondary and tertiary importance Danger of using predictive modeling is overfit modeling noise rather than signal Validation i is applied to protect against overfit Validate using out-of-sample data to ensure models are robust Withhold 10-20% of data until models are deemed complete 11 Benefits of Predictive Modeling 12 6
8 Benefits of Predictive Modeling The potential benefits of predictive modeling vs. traditional techniques: 1. Improved accuracy 2. Reduced time to decision 3. Lower expense 13 Improved Accuracy Improved accuracy is most important for larger risks E.g., large amounts or impaired lives These cases are typically fully underwritten and a lot of information is generated E.g., application, lab results, APS, financial underwriting Predictive models can provide more accurate estimates of the risk of each applicant This can be used as a decision support tool for the underwriting decision 14 7
9 Reduced Time to Decision and Lower Expense Quicker turnaround and lower cost of underwriting decisions can be especially important in the middle markets Can increase close rates and reduce issue expenses Why can t an insurance policy be sold in real-time? Whether at the agent/broker office or direct online No medical exam or fluid samples required Base underwriting decision on other sources of information such as prescription drug history 15 Data Considerations 16 8
10 Big Data Big data is the catch phrase of 2013 Not only are we creating data in new ways, such as: social media, cell phone GPS, webclicks The information created by our interactions with the world is being stored electronically to a greater and greater extent Sources of data for life pricing and underwriting: An insurer s internal data Data for sale from external data aggregators Webscraping 17 Internal Data Sources Application for insurance Lab results Attending physician statements Underwriter s notes Data generated from other product lines 18 9
11 External Data Sources MIB MVR Prescription drug history Credit score Public records Consumer data There may be regulatory, legal and reputational risk involved with the use of some external data sources. Be sure to research before using. 19 Webscraping Crawling the internet to uncover information about an entity More difficult to perform for an individual than a business Names are unlikely to be unique Personal Facebook, Twitter and other social media accounts are often set to private May be greater reputational risk to an insurer that is webscraping for information about an individual as opposed to searching for information about a business 20 10
12 Vital Status Analysis should probably include all applicants for insurance not just written cases May need to determine vital status of non-written cases In the US, the SS DMF is a good start, but not complete Companies that aggregate public records may be of help Can validate these data sources against insured lives and develop assumptions to account for the missing deaths 21 Performing the Analysis 22 11
13 Steps Define objective Identify data sources Acquire and clean data Analyze data and train models Validate models 23 Define objective Sounds trivial but is of critical importance Will drive all other steps of the modeling process What should the model output be? For example: Replicate underwriting decisions (doesn t require vital status) Risk classification (e.g., preferred, standard, substandard) Number of debits to apply to the base table Applicant-specific mortality rates (qx) for the first several years 24 12
14 Identify Data Sources Must have the target (independent) variable available in the historic data What predictor data is available? Internal sources External sources Webscraping 25 Acquire and Clean Data May be 80% of the total effort required for the project Data collected from different sources must be linked Raw data is seldom in a form appropriate for modeling Text mine documents, such as APS or underwriter notes Perform some basic calculations like age or BMI Some data elements will need to be transformed to optimize modeling, depending on modeling techniques to be applied 26 13
15 Analyze Data and Train Models Can begin with analysis of current basis Identify types of applications where actual outcomes are similar to or different from expected outcomes Train the new models Iterative process: will require testing of various modeling techniques and data transformations Evaluate new models Typically on a hold-out sample of testing data Consider: goodness-of-fit metrics, univariate analysis, model complexity vs. interpretability, consistency with expectations 27 Model Validation Final test on out-of-sample ( validation ) data Can really only be performed once then data is no longer unseen Goodness-of-fit should, at a minimum, be improved relative to current basis Requires a goodness-of-fit metric such as mean squared error 28 14
16 Predictive Modeling Applications Case Study Richard Xu Global R&D RGA LAS May 2013 UW Model GLM model Contents Experience Study GLM model Pricing Model CART model Client Segmentation Clustering 2 1
17 Underwriting Identify best risks Be fast & consistent Prioritize cases Reduce not-taken rates Claims Predict claim frequency Identify claim severity Prioritize resources Identify claims most likely fraudulent/rescinded PM Applications Pricing/ Reserves Improve pricing accuracy Identify deviation of pricing variables Reserve more accurate Compute reserve variance Experience Analysis Identify drivers in experience Handle low credibility data Create own mortality/lapse tables Sales & Marketing Make effective campaigns Recommend products Select new agents Monitor existing agents In Force Business Client segmentation Predict lapses Design retention strategies Offer other products 3 3 Generalized Linear Model OLS(LM) GLM Random Systematic Link OLS Normal only GLM Various distributions Generalized Linear Model (GLM) Inclusion of most distributions related to insurance data Normal, binomial, Poisson, Gamma, inverse-gaussian, etc. Ordinary Least Square (OLS) is a special case of GLM Great flexibility in variance structure Weights & offset to be more flexible Multiplicative model intuitive & consistent with insurance practice Easy to understand & communicate 4 2
18 Case Study 1: UW Model Goal: to predict UW decisions on its existing customers Bancassurance in Asia with large customer pool, but low penetration in life product Identify certain pre-qualified existing customers, & offer guaranteed issue (GI) or simplified issue (SI) without medical UW Acquisition costs will be significantly reduced Market penetration will be deeper, and sales will increase Bancassurance is unique for PM Financial/demographic information about customers Major challenges - very limited data A total of about 8k-9k full UW cases Target variable UW decision, with very low declined/rated cases, ~3.0% Many missing values due to old time, especially for sub-std Not all information collected at the time of UW 5 Key Variables GLM with binomial and logistic link function About a dozen of predictor variables that are statistically significant for prediction & readily available in client database Key predictor variables Positive means the probability to be STD increases if the value goes up; otherwise, it is Negative Name Type Note Age_At_Entry Numeric Negative; less likely to qualify for STD as age goes up Branch Categorical Proxy of geographic locations AUM Numeric Positive; more likely to qualify for STD with large AUM Customer_Segment Categorical Positive for Premier, negative for non-premier Nationality Categorical Positive for domestic; negative for certain others 6 3
19 STD Rate non 18.0% 16.0% 14.0% 12.0% 10.0% 8.0% 6.0% 4.0% 2.0% 0.0% Lift Plot for In Sample Results Declined Rated Average nonstd Rate 3.0% 0.6% 0.5% 0.2% Sorted Model Output In-sample results show model performance under optimal condition May over-fit data 0.5% of sub-std in top 30% of model output non STD Rate Model Results Validation results are a better test of model performance in real business 0.6% sub-std in the top 30% of model outputs, about 80% reduction Declined vs. Rated 16.0% 14.0% 12.0% 10.0% 8.0% 6.0% 4.0% 2.0% 0.0% Lift Plot for Validation Results Declined Rated Average nonstd Rate 3.0% 0.5% 0.4% 0.8% 0.5% Sorted Model Output 7 Model results Gain curve, another way to understand model capability to differentiate STD from sub-std Best 30% of model outputs t contains about 5% of total non-std Lowest 30% captures about 75% of bad risks Model implementation Results delivered to the client Final implementation stage Final control on offers by insurer non STD % Model Gain Curve In sample results Validation results Random Sorted Model Output 8 4
20 Case Study 2: Experience Study PM vs. traditional actuarial approach True multivariate approach vs. univariate, under/over-estimation Impact of interaction term on target More efficient use of data, and handle low credibility data Establish own assumption based on experience data Type of studies Mortality, lapse, claim severity, incidence rate, continue table Major challenge - Data! Data! Data! Understanding business; clean data; mapping of data; data legacy; missing values; timing, etc. Not enough credible data Too much data, in big data territory 9 Post level Lapse Rates vs. Duration Term Tail Lapse Rates Tail lapse rates for 10-year term product Duration, premium jump, face amount, UW class, issue age, gender, etc. Post level Lapse Rate vs. Premium Jump Formula-based results with uncertainty estimated Business insights 10 5
21 CART Model Classification And Regression Tree (CART) Both classification and regression Non-parametric approach (no insight in data structure) CART tree is generated by repeated partitioning i of data set Data is split into two partitions (binary partition) Partitions can also be split into sub-partitions (recursive) Until data in end node(leaf) is homogeneous (more or less) Results are very intuitive Identify specific groups that deviate in target variable Yet, algorithm is very sophisticated 11 Case Study 3: LTD Pricing Business: US group Long-Term Disability(LTD) About 13k policies, with lives per policies from 10 to 30k Current pricing variables: about Experience data of past 5 years with >80 variables Major pricing variables: age, gender, industry, location, benefit structure Objective To determine additional pricing variables and possible interaction terms (for pricing) To identify groups with experience deviating from pricing assumptions (for UW) Client has experience with PM Minimum efforts on business & data understanding Profit margin as target variable 12 6
22 CART Model results Results Easy to develop, interpret and understand; business insights Not efficient for linear function; sensitive to noise; over-fitting 13 CART Model results Results improve profit margin and pricing accuracy Useful tool for both pricing and UW of group LTD business Model implementation Client is very interested in model results; approved by management team Implemented in Q1 13 Quartile # of cases Actual EPM Model Predicted EPM (0.28) (0.32) (0.088) 088) (0.060) 060)
23 Clustering algorithm Data Clustering Find similarities in data according to features found in data and group similar objects into clusters Unsurprised (no pre-defined), classification, non-parametric How to measure similarities/dissimilarities, e.g. distance Numeric, categorical, and ordinal variables Partitioning (k-means), Hierarchical, Density-based, etc. 15 Case Study 4: Client Segmentation Existing client segmentation is based on geographic location, a more self-serving approach for own benefit rather than market and needs Objective To better understand client base, identifying knowledge gaps To capture tacit knowledge; create structured data on our clients & a tool for client analysis and strategic decision-making purposes on an ongoing basis To identify opportunities to better serve our clients needs and grow business To help better optimizing resourcing requirements 16 8
24 Client Segmentation Business team survey data Three main data categories Description of clients Behavior when facing of risks Needs to deal with risks Clustering algorithm & principal component analysis Algorithms find clusters that clients in same cluster are more similar to each other than to those in other clusters Un-supervised algorithm without target variable Data is dominated by categorical variables 17 Results on 5 clusters Number of clusters is a free parameters Example: opportunity Clustering Model Results 18 9
25 Clustering Model Example: Two High Level Clusters Direct distribution and Living Benefits related products Data quality is very important Prefer objective variables than subjective variables 5 (12% by NB volume) Want Direct 32 (25% by NB volume) Want Direct 8 (7% by NB volume) Want Direct e-sales 16 (5% by NB volume) Want Direct Traditional 17 (16% by NB volume) Want living benefits - related 13 (11% by NB volume) Want Living Benefits 4 (5% by NB volume) Want Combination 3 (1% by NB volume) Want Direct to in-force 19 PM is skills for actuaries in future Conclusion PM is to find knowledge in data so that we can understand and gain advantage Everything eyt gshould be made as simple as possible, but not simpler. Albert Einstein 20 10
26 Predictive Modeling Applications Case Study Richard Xu Global R&D RGA LAS May
Predictive modelling around the world 28.11.13
Predictive modelling around the world 28.11.13 Agenda Why this presentation is really interesting Introduction to predictive modelling Case studies Conclusions Why this presentation is really interesting
More informationBIG DATA and Opportunities in the Life Insurance Industry
BIG DATA and Opportunities in the Life Insurance Industry Marc Sofer BSc FFA FIAA Head of Strategic Initiatives North Asia & India RGA Reinsurance Company BIG DATA I keep saying the sexy job in the next
More informationPredictive Modeling Techniques in Insurance
Predictive Modeling Techniques in Insurance Tuesday May 5, 2015 JF. Breton Application Engineer 2014 The MathWorks, Inc. 1 Opening Presenter: JF. Breton: 13 years of experience in predictive analytics
More informationBIG DATA Driven Innovations in the Life Insurance Industry
BIG DATA Driven Innovations in the Life Insurance Industry Edmund Fong FIAA Vincent Or FSA RGA Reinsurance Company 13 November 2015 I keep saying the sexy job in the next ten years will be statisticians.
More informationSession 62 TS, Predictive Modeling for Actuaries: Predictive Modeling Techniques in Insurance Moderator: Yonasan Schwartz, FSA, MAAA
Session 62 TS, Predictive Modeling for Actuaries: Predictive Modeling Techniques in Insurance Moderator: Yonasan Schwartz, FSA, MAAA Presenters: Jean-Frederic Breton David A. Moore, FSA, MAAA Session 62:
More informationSession 26 Predictive Modeling How Can it Help? Jonathan Polon, FSA
Session 26 Predictive Modeling How Can it Help? Jonathan Polon, FSA Predictive Modeling Applications Marketing / cross-selling Customer analytics / policyholder behaviour Underwriting / risk selection
More informationAdvanced Statistical Analysis of Mortality. Rhodes, Thomas E. and Freitas, Stephen A. MIB, Inc. 160 University Avenue. Westwood, MA 02090
Advanced Statistical Analysis of Mortality Rhodes, Thomas E. and Freitas, Stephen A. MIB, Inc 160 University Avenue Westwood, MA 02090 001-(781)-751-6356 fax 001-(781)-329-3379 trhodes@mib.com Abstract
More informationPredictive Modeling and Big Data
Predictive Modeling and Presented by Eileen Burns, FSA, MAAA Milliman Agenda Current uses of predictive modeling in the life insurance industry Potential applications of 2 1 June 16, 2014 [Enter presentation
More informationA Deeper Look Inside Generalized Linear Models
A Deeper Look Inside Generalized Linear Models University of Minnesota February 3 rd, 2012 Nathan Hubbell, FCAS Agenda Property & Casualty (P&C Insurance) in one slide The Actuarial Profession Travelers
More informationData Mining Methods: Applications for Institutional Research
Data Mining Methods: Applications for Institutional Research Nora Galambos, PhD Office of Institutional Research, Planning & Effectiveness Stony Brook University NEAIR Annual Conference Philadelphia 2014
More informationReport on the Lapse and Mortality Experience of Post-Level Premium Period Term Plans (2014)
Report on the Lapse and Mortality Experience of Post-Level Premium Period Term Plans (2014) REVISED MAY 2014 SPONSORED BY Society of Actuaries PREPARED BY Derek Kueker, FSA Tim Rozar, FSA, CERA, MAAA Michael
More informationHow To Calculate Pbr Reserves
Credibility Procedures: VM-20 (Life Insurance Mortality) and Proposed VM-22 (Annuity) Session 156 PD, Application of Credibility Theory Thomas E Rhodes, FSA, MAAA AVP & Actuarial Director, MIB October
More informationApplications of Credit in Life Insurance
Applications of Credit in Life Insurance Southeastern Actuaries Conference Derek Kueker, FSA MAAA June 25, 2015 1 Proprietary & Confidential All of the information contained in this document is proprietary
More informationRisk pricing for Australian Motor Insurance
Risk pricing for Australian Motor Insurance Dr Richard Brookes November 2012 Contents 1. Background Scope How many models? 2. Approach Data Variable filtering GLM Interactions Credibility overlay 3. Model
More informationLessons From Down Under The D2C Market in Australia and what the UK can learn.
Lessons From Down Under The D2C Market in Australia and what the UK can learn. Kate Gillmore, Business Development Director, RGA Mick James, Business Development Director, RGA 7 May 2015 Australia: The
More informationPredictive Modeling in Workers Compensation 2008 CAS Ratemaking Seminar
Predictive Modeling in Workers Compensation 2008 CAS Ratemaking Seminar Prepared by Louise Francis, FCAS, MAAA Francis Analytics and Actuarial Data Mining, Inc. www.data-mines.com Louise.francis@data-mines.cm
More informationModel Validation Techniques
Model Validation Techniques Kevin Mahoney, FCAS kmahoney@ travelers.com CAS RPM Seminar March 17, 2010 Uses of Statistical Models in P/C Insurance Examples of Applications Determine expected loss cost
More informationThe Data Mining Process
Sequence for Determining Necessary Data. Wrong: Catalog everything you have, and decide what data is important. Right: Work backward from the solution, define the problem explicitly, and map out the data
More informationReport on the Lapse and Mortality Experience of Post-Level Premium Period Term Plans
Report on the Lapse and Mortality Experience of Post-Level Premium Period Term Plans Sponsored by The Product Development Section and The Committee on Life Insurance Research of the Society of Actuaries
More informationSocial Media Mining. Data Mining Essentials
Introduction Data production rate has been increased dramatically (Big Data) and we are able store much more data than before E.g., purchase data, social media data, mobile phone data Businesses and customers
More informationWebFOCUS RStat. RStat. Predict the Future and Make Effective Decisions Today. WebFOCUS RStat
Information Builders enables agile information solutions with business intelligence (BI) and integration technologies. WebFOCUS the most widely utilized business intelligence platform connects to any enterprise
More informationSession 35 PD, Predictive Modeling for Actuaries: Integrating Predictive Analytics in Assumption Setting Moderator: David Wang, FSA, FIA, MAAA
Session 35 PD, Predictive Modeling for Actuaries: Integrating Predictive Analytics in Assumption Setting Moderator: David Wang, FSA, FIA, MAAA Presenters: Guillaume Briere-Giroux, FSA, MAAA Eileen Sheila
More informationAddressing Analytics Challenges in the Insurance Industry. Noe Tuason California State Automobile Association
Addressing Analytics Challenges in the Insurance Industry Noe Tuason California State Automobile Association Overview Two Challenges: 1. Identifying High/Medium Profit who are High/Low Risk of Flight Prospects
More informationCombining Linear and Non-Linear Modeling Techniques: EMB America. Getting the Best of Two Worlds
Combining Linear and Non-Linear Modeling Techniques: Getting the Best of Two Worlds Outline Who is EMB? Insurance industry predictive modeling applications EMBLEM our GLM tool How we have used CART with
More informationHealth Spring Meeting May 2008 Session # 42: Dental Insurance What's New, What's Important
Health Spring Meeting May 2008 Session # 42: Dental Insurance What's New, What's Important Floyd Ray Martin, FSA, MAAA Thomas A. McInteer, FSA, MAAA Jonathan P. Polon, FSA Dental Insurance Fraud Detection
More informationData Mining: An Overview of Methods and Technologies for Increasing Profits in Direct Marketing. C. Olivia Rud, VP, Fleet Bank
Data Mining: An Overview of Methods and Technologies for Increasing Profits in Direct Marketing C. Olivia Rud, VP, Fleet Bank ABSTRACT Data Mining is a new term for the common practice of searching through
More informationModerator: Gregory A. Brandner, FSA, MAAA. Presenters: Gregory A. Brandner, FSA, MAAA Sean J. Conrad, FSA, MAAA Lisa Hollenbeck Renetzky, FSA, MAAA
Session 138 PD, Streamlined Underwriting and Product Development: Good, Fast and Cheap? Can I Get More than Just Two of Three? Moderator: Gregory A. Brandner, FSA, MAAA Presenters: Gregory A. Brandner,
More informationPractical applications of Predictive Modelling Overview of the process, the techniques and the applications
Practical applications of Predictive Modelling Overview of the process, the techniques and the applications Jean-Yves Rioux CIA 2014 Annual Meeting, June 19, 2014 Agenda What it is and why do it 2 Process
More informationModeling Lifetime Value in the Insurance Industry
Modeling Lifetime Value in the Insurance Industry C. Olivia Parr Rud, Executive Vice President, Data Square, LLC ABSTRACT Acquisition modeling for direct mail insurance has the unique challenge of targeting
More informationSession 54 PD, Credibility and Pooling for Group Life and Disability Insurance Moderator: Paul Luis Correia, FSA, CERA, MAAA
Session 54 PD, Credibility and Pooling for Group Life and Disability Insurance Moderator: Paul Luis Correia, FSA, CERA, MAAA Presenters: Paul Luis Correia, FSA, CERA, MAAA Brian N. Dunham, FSA, MAAA Credibility
More informationA Property & Casualty Insurance Predictive Modeling Process in SAS
Paper AA-02-2015 A Property & Casualty Insurance Predictive Modeling Process in SAS 1.0 ABSTRACT Mei Najim, Sedgwick Claim Management Services, Chicago, Illinois Predictive analytics has been developing
More informationInnovations and Value Creation in Predictive Modeling. David Cummings Vice President - Research
Innovations and Value Creation in Predictive Modeling David Cummings Vice President - Research ISO Innovative Analytics 1 Innovations and Value Creation in Predictive Modeling A look back at the past decade
More informationChallenge. Solutions. Early results. Personal Lines Case Study Celina Insurance Reduces Expenses & Improves Processes Across the Business.
Celina Insurance Reduces Expenses & Improves Processes Across the Business About Celina Insurance Group Founded in 1914, Celina Insurance Group is composed of four mutual property and casualty insurance
More informationEXPLORING & MODELING USING INTERACTIVE DECISION TREES IN SAS ENTERPRISE MINER. Copyr i g ht 2013, SAS Ins titut e Inc. All rights res er ve d.
EXPLORING & MODELING USING INTERACTIVE DECISION TREES IN SAS ENTERPRISE MINER ANALYTICS LIFECYCLE Evaluate & Monitor Model Formulate Problem Data Preparation Deploy Model Data Exploration Validate Models
More informationSession 60 PD, Predictive Modeling Real Applications in Life Insurance and Annuities. Moderator: Ricardo Trachtman, FSA, MAAA
Session 60 PD, Predictive Modeling Real Applications in Life Insurance and Annuities Moderator: Ricardo Trachtman, FSA, MAAA Presenters: JJ Lane Carroll, FSA, MAAA Allen M. Klein, FSA, MAAA Scott Anthony
More informationPractical Applications of Stochastic Modeling for Disability Insurance
Practical Applications of Stochastic Modeling for Disability Insurance Society of Actuaries Session 8, Spring Health Meeting Seattle, WA, June 007 Practical Applications of Stochastic Modeling for Disability
More informationData Analytical Framework for Customer Centric Solutions
Data Analytical Framework for Customer Centric Solutions Customer Savviness Index Low Medium High Data Management Descriptive Analytics Diagnostic Analytics Predictive Analytics Prescriptive Analytics
More informationSTATISTICA. Financial Institutions. Case Study: Credit Scoring. and
Financial Institutions and STATISTICA Case Study: Credit Scoring STATISTICA Solutions for Business Intelligence, Data Mining, Quality Control, and Web-based Analytics Table of Contents INTRODUCTION: WHAT
More informationData Mining. Nonlinear Classification
Data Mining Unit # 6 Sajjad Haider Fall 2014 1 Nonlinear Classification Classes may not be separable by a linear boundary Suppose we randomly generate a data set as follows: X has range between 0 to 15
More informationCONTENTS PREFACE 1 INTRODUCTION 1 2 DATA VISUALIZATION 19
PREFACE xi 1 INTRODUCTION 1 1.1 Overview 1 1.2 Definition 1 1.3 Preparation 2 1.3.1 Overview 2 1.3.2 Accessing Tabular Data 3 1.3.3 Accessing Unstructured Data 3 1.3.4 Understanding the Variables and Observations
More informationLocation matters. 3 techniques to incorporate geo-spatial effects in one's predictive model
Location matters. 3 techniques to incorporate geo-spatial effects in one's predictive model Xavier Conort xavier.conort@gear-analytics.com Motivation Location matters! Observed value at one location is
More informationAn Overview of Data Mining: Predictive Modeling for IR in the 21 st Century
An Overview of Data Mining: Predictive Modeling for IR in the 21 st Century Nora Galambos, PhD Senior Data Scientist Office of Institutional Research, Planning & Effectiveness Stony Brook University AIRPO
More informationCredibility and Pooling Applications to Group Life and Group Disability Insurance
Credibility and Pooling Applications to Group Life and Group Disability Insurance Presented by Paul L. Correia Consulting Actuary paul.correia@milliman.com (207) 771-1204 May 20, 2014 What I plan to cover
More informationData Mining - Evaluation of Classifiers
Data Mining - Evaluation of Classifiers Lecturer: JERZY STEFANOWSKI Institute of Computing Sciences Poznan University of Technology Poznan, Poland Lecture 4 SE Master Course 2008/2009 revised for 2010
More informationSTATISTICA Formula Guide: Logistic Regression. Table of Contents
: Table of Contents... 1 Overview of Model... 1 Dispersion... 2 Parameterization... 3 Sigma-Restricted Model... 3 Overparameterized Model... 4 Reference Coding... 4 Model Summary (Summary Tab)... 5 Summary
More informationBIDM Project. Predicting the contract type for IT/ITES outsourcing contracts
BIDM Project Predicting the contract type for IT/ITES outsourcing contracts N a n d i n i G o v i n d a r a j a n ( 6 1 2 1 0 5 5 6 ) The authors believe that data modelling can be used to predict if an
More informationBOOSTED REGRESSION TREES: A MODERN WAY TO ENHANCE ACTUARIAL MODELLING
BOOSTED REGRESSION TREES: A MODERN WAY TO ENHANCE ACTUARIAL MODELLING Xavier Conort xavier.conort@gear-analytics.com Session Number: TBR14 Insurance has always been a data business The industry has successfully
More informationSession 190 PD, Model Risk Management and Controls Moderator: Chad R. Runchey, FSA, MAAA
Session 190 PD, Model Risk Management and Controls Moderator: Chad R. Runchey, FSA, MAAA Presenters: Michael N. Failor, ASA, MAAA Michael A. McDonald, FSA, FCIA Chad R. Runchey, FSA, MAAA SOA 2014 Annual
More informationIntroduction to Predictive Modeling Using GLMs
Introduction to Predictive Modeling Using GLMs Dan Tevet, FCAS, MAAA, Liberty Mutual Insurance Group Anand Khare, FCAS, MAAA, CPCU, Milliman 1 Antitrust Notice The Casualty Actuarial Society is committed
More informationStochastic Analysis of Long-Term Multiple-Decrement Contracts
Stochastic Analysis of Long-Term Multiple-Decrement Contracts Matthew Clark, FSA, MAAA, and Chad Runchey, FSA, MAAA Ernst & Young LLP Published in the July 2008 issue of the Actuarial Practice Forum Copyright
More informationSTATISTICA. Clustering Techniques. Case Study: Defining Clusters of Shopping Center Patrons. and
Clustering Techniques and STATISTICA Case Study: Defining Clusters of Shopping Center Patrons STATISTICA Solutions for Business Intelligence, Data Mining, Quality Control, and Web-based Analytics Table
More informationCross Validation. Dr. Thomas Jensen Expedia.com
Cross Validation Dr. Thomas Jensen Expedia.com About Me PhD from ETH Used to be a statistician at Link, now Senior Business Analyst at Expedia Manage a database with 720,000 Hotels that are not on contract
More informationSession 8: The Latest on Practical Uses of Big Data and Predictive Analytics. Moderator: Phil Murphy
Session 8: The Latest on Practical Uses of Big Data and Predictive Analytics Moderator: Phil Murphy Presenters: Ron Schaber Tim Hill Derek Kueker Jean Marc Fix Chris Stehno PRACTICAL USES OF BIG DATA AND
More informationPredictive Analytics for Life Insurance: How Data and Advanced Analytics are Changing the Business of Life Insurance Seminar May 23, 2012
Predictive Analytics for Life Insurance: How Data and Advanced Analytics are Changing the Business of Life Insurance Seminar May 23, 2012 Session 2 How to Build a Risk Based Analytical Model for Life Insurance
More informationSession 42 PD, Predictive Analytics for Actuaries: Building an Effective Predictive Analytics Team. Moderator: Courtney Nashan
Session 42 PD, Predictive Analytics for Actuaries: Building an Effective Predictive Analytics Team Moderator: Courtney Nashan Presenters: Ian G. Duncan, FSA, FCIA, FIA, MAAA Andy Ferris, FSA, MAAA Christine
More informationPredictive Modeling for Life Insurers
Predictive Modeling for Life Insurers Application of Predictive Modeling Techniques in Measuring Policyholder Behavior in Variable Annuity Contracts April 30, 2010 Guillaume Briere-Giroux, FSA, MAAA, CFA
More informationData Mining Applications in Higher Education
Executive report Data Mining Applications in Higher Education Jing Luan, PhD Chief Planning and Research Officer, Cabrillo College Founder, Knowledge Discovery Laboratories Table of contents Introduction..............................................................2
More informationData Mining Algorithms Part 1. Dejan Sarka
Data Mining Algorithms Part 1 Dejan Sarka Join the conversation on Twitter: @DevWeek #DW2015 Instructor Bio Dejan Sarka (dsarka@solidq.com) 30 years of experience SQL Server MVP, MCT, 13 books 7+ courses
More informationBeating the NCAA Football Point Spread
Beating the NCAA Football Point Spread Brian Liu Mathematical & Computational Sciences Stanford University Patrick Lai Computer Science Department Stanford University December 10, 2010 1 Introduction Over
More informationAnti-Trust Notice. Agenda. Three-Level Pricing Architect. Personal Lines Pricing. Commercial Lines Pricing. Conclusions Q&A
Achieving Optimal Insurance Pricing through Class Plan Rating and Underwriting Driven Pricing 2011 CAS Spring Annual Meeting Palm Beach, Florida by Beth Sweeney, FCAS, MAAA American Family Insurance Group
More informationLearning Example. Machine learning and our focus. Another Example. An example: data (loan application) The data and the goal
Learning Example Chapter 18: Learning from Examples 22c:145 An emergency room in a hospital measures 17 variables (e.g., blood pressure, age, etc) of newly admitted patients. A decision is needed: whether
More informationDeriving Value from ORSA. Board Perspective
Deriving Value from ORSA Board Perspective April 2015 1 This paper has been produced by the Joint Own Risk Solvency Assessment (ORSA) Subcommittee of the Insurance Regulation Committee and the Enterprise
More informationSupervised Learning (Big Data Analytics)
Supervised Learning (Big Data Analytics) Vibhav Gogate Department of Computer Science The University of Texas at Dallas Practical advice Goal of Big Data Analytics Uncover patterns in Data. Can be used
More informationMachine Learning using MapReduce
Machine Learning using MapReduce What is Machine Learning Machine learning is a subfield of artificial intelligence concerned with techniques that allow computers to improve their outputs based on previous
More informationA Study to Predict No Show Probability for a Scheduled Appointment at Free Health Clinic
A Study to Predict No Show Probability for a Scheduled Appointment at Free Health Clinic Report prepared for Brandon Slama Department of Health Management and Informatics University of Missouri, Columbia
More informationMaximize Revenues on your Customer Loyalty Program using Predictive Analytics
Maximize Revenues on your Customer Loyalty Program using Predictive Analytics 27 th Feb 14 Free Webinar by Before we begin... www Q & A? Your Speakers @parikh_shachi Technical Analyst @tatvic Loves js
More informationSimple Predictive Analytics Curtis Seare
Using Excel to Solve Business Problems: Simple Predictive Analytics Curtis Seare Copyright: Vault Analytics July 2010 Contents Section I: Background Information Why use Predictive Analytics? How to use
More informationHow to use Big Data in Industry 4.0 implementations. LAURI ILISON, PhD Head of Big Data and Machine Learning
How to use Big Data in Industry 4.0 implementations LAURI ILISON, PhD Head of Big Data and Machine Learning Big Data definition? Big Data is about structured vs unstructured data Big Data is about Volume
More informationAzure Machine Learning, SQL Data Mining and R
Azure Machine Learning, SQL Data Mining and R Day-by-day Agenda Prerequisites No formal prerequisites. Basic knowledge of SQL Server Data Tools, Excel and any analytical experience helps. Best of all:
More informationGerry Hobbs, Department of Statistics, West Virginia University
Decision Trees as a Predictive Modeling Method Gerry Hobbs, Department of Statistics, West Virginia University Abstract Predictive modeling has become an important area of interest in tasks such as credit
More informationBanking Analytics Training Program
Training (BAT) is a set of courses and workshops developed by Cognitro Analytics team designed to assist banks in making smarter lending, marketing and credit decisions. Analyze Data, Discover Information,
More informationLecture 10: Regression Trees
Lecture 10: Regression Trees 36-350: Data Mining October 11, 2006 Reading: Textbook, sections 5.2 and 10.5. The next three lectures are going to be about a particular kind of nonlinear predictive model,
More informationData Mining Part 5. Prediction
Data Mining Part 5. Prediction 5.7 Spring 2010 Instructor: Dr. Masoud Yaghini Outline Introduction Linear Regression Other Regression Models References Introduction Introduction Numerical prediction is
More informationInsurance Analytics - analýza dat a prediktivní modelování v pojišťovnictví. Pavel Kříž. Seminář z aktuárských věd MFF 4.
Insurance Analytics - analýza dat a prediktivní modelování v pojišťovnictví Pavel Kříž Seminář z aktuárských věd MFF 4. dubna 2014 Summary 1. Application areas of Insurance Analytics 2. Insurance Analytics
More informationA Property and Casualty Insurance Predictive Modeling Process in SAS
Paper 11422-2016 A Property and Casualty Insurance Predictive Modeling Process in SAS Mei Najim, Sedgwick Claim Management Services ABSTRACT Predictive analytics is an area that has been developing rapidly
More information1 Choosing the right data mining techniques for the job (8 minutes,
CS490D Spring 2004 Final Solutions, May 3, 2004 Prof. Chris Clifton Time will be tight. If you spend more than the recommended time on any question, go on to the next one. If you can t answer it in the
More informationSession 114 PD, RGA Session Series Part 2: Reinventing Insurance. Moderator: Michael H. Choate, FSA, MAAA. Presenters: Kevin J Pledge FSA,FIA
Session 114 PD, RGA Session Series Part 2: Reinventing Insurance Moderator: Michael H. Choate, FSA, MAAA Presenters: Kevin J Pledge FSA,FIA Reinventing Insurance KEVIN PLEDGE FIA, FSA 13 Oct 2015 Session
More informationDecision Trees from large Databases: SLIQ
Decision Trees from large Databases: SLIQ C4.5 often iterates over the training set How often? If the training set does not fit into main memory, swapping makes C4.5 unpractical! SLIQ: Sort the values
More informationData Mining Part 5. Prediction
Data Mining Part 5. Prediction 5.1 Spring 2010 Instructor: Dr. Masoud Yaghini Outline Classification vs. Numeric Prediction Prediction Process Data Preparation Comparing Prediction Methods References Classification
More informationChapter 12 Discovering New Knowledge Data Mining
Chapter 12 Discovering New Knowledge Data Mining Becerra-Fernandez, et al. -- Knowledge Management 1/e -- 2004 Prentice Hall Additional material 2007 Dekai Wu Chapter Objectives Introduce the student to
More informationJean-Yves Rioux. Big Data and Analytics dramatic impacts in the Life Insurance Industry
Jean-Yves Rioux Big Data and Analytics dramatic impacts in the Life Insurance Industry 2 Agenda Drivers and barriers Data sources The process Life and health insurance applications Modeling/analyzing longevity
More information69 PD Underwriting Issues for Group Life and Disability Insurance. Moderator: Peter A. Heinrichs, FSA, MAAA
69 PD Underwriting Issues for Group Life and Disability Insurance Moderator: Peter A. Heinrichs, FSA, MAAA Presenters: Susan L. Ebertz Michael F. Vassar Mark R. Yoest, FSA, MAAA Underwriting Trends in
More informationSession 106 PD, Profitability Trends for Disability Insurance. Moderator: John R. Murphy, FSA, MAAA
Session 106 PD, Profitability Trends for Disability Insurance Moderator: John R. Murphy, FSA, MAAA Presenters: Scott D. Haglund, FSA, MAAA John R. Murphy, FSA, MAAA Robert F. Wade, FSA, MAAA 2015 Society
More informationIntroduction to Data Mining
Introduction to Data Mining 1 Why Data Mining? Explosive Growth of Data Data collection and data availability Automated data collection tools, Internet, smartphones, Major sources of abundant data Business:
More informationReFocus 2009. March 1-4, 2009. Las Vegas, NV
ReFocus 2009 March 1-4, 2009 Las Vegas, NV B5 Automated Underwriting and Straight- Through Processing: The Present and Future of Underwriting Technology Jym Barnes, Michael Keller, James McArdle, Dirk
More informationEfficiency in Software Development Projects
Efficiency in Software Development Projects Aneesh Chinubhai Dharmsinh Desai University aneeshchinubhai@gmail.com Abstract A number of different factors are thought to influence the efficiency of the software
More informationRole of Social Networking in Marketing using Data Mining
Role of Social Networking in Marketing using Data Mining Mrs. Saroj Junghare Astt. Professor, Department of Computer Science and Application St. Aloysius College, Jabalpur, Madhya Pradesh, India Abstract:
More informationData Mining + Business Intelligence. Integration, Design and Implementation
Data Mining + Business Intelligence Integration, Design and Implementation ABOUT ME Vijay Kotu Data, Business, Technology, Statistics BUSINESS INTELLIGENCE - Result Making data accessible Wider distribution
More informationData Mining Classification: Decision Trees
Data Mining Classification: Decision Trees Classification Decision Trees: what they are and how they work Hunt s (TDIDT) algorithm How to select the best split How to handle Inconsistent data Continuous
More informationApplied Data Mining Analysis: A Step-by-Step Introduction Using Real-World Data Sets
Applied Data Mining Analysis: A Step-by-Step Introduction Using Real-World Data Sets http://info.salford-systems.com/jsm-2015-ctw August 2015 Salford Systems Course Outline Demonstration of two classification
More informationIntroduction to Longitudinal Data Analysis
Introduction to Longitudinal Data Analysis Longitudinal Data Analysis Workshop Section 1 University of Georgia: Institute for Interdisciplinary Research in Education and Human Development Section 1: Introduction
More informationPentaho Data Mining Last Modified on January 22, 2007
Pentaho Data Mining Copyright 2007 Pentaho Corporation. Redistribution permitted. All trademarks are the property of their respective owners. For the latest information, please visit our web site at www.pentaho.org
More informationKnowledge Discovery and Data Mining
Knowledge Discovery and Data Mining Unit # 11 Sajjad Haider Fall 2013 1 Supervised Learning Process Data Collection/Preparation Data Cleaning Discretization Supervised/Unuspervised Identification of right
More informationAn Overview of Knowledge Discovery Database and Data mining Techniques
An Overview of Knowledge Discovery Database and Data mining Techniques Priyadharsini.C 1, Dr. Antony Selvadoss Thanamani 2 M.Phil, Department of Computer Science, NGM College, Pollachi, Coimbatore, Tamilnadu,
More informationEmployer Health Insurance Premium Prediction Elliott Lui
Employer Health Insurance Premium Prediction Elliott Lui 1 Introduction The US spends 15.2% of its GDP on health care, more than any other country, and the cost of health insurance is rising faster than
More informationRobust Outlier Detection Technique in Data Mining: A Univariate Approach
Robust Outlier Detection Technique in Data Mining: A Univariate Approach Singh Vijendra and Pathak Shivani Faculty of Engineering and Technology Mody Institute of Technology and Science Lakshmangarh, Sikar,
More informationRegression Modeling Strategies
Frank E. Harrell, Jr. Regression Modeling Strategies With Applications to Linear Models, Logistic Regression, and Survival Analysis With 141 Figures Springer Contents Preface Typographical Conventions
More informationDriving Insurance World through Science - 1 - Murli D. Buluswar Chief Science Officer
Driving Insurance World through Science - 1 - Murli D. Buluswar Chief Science Officer What is The Science Team s Mission? 2 What Gap Do We Aspire to Address? ü The insurance industry is data rich but ü
More information!"!!"#$$%&'()*+$(,%!"#$%$&'()*""%(+,'-*&./#-$&'(-&(0*".$#-$1"(2&."3$'45"
!"!!"#$$%&'()*+$(,%!"#$%$&'()*""%(+,'-*&./#-$&'(-&(0*".$#-$1"(2&."3$'45"!"#"$%&#'()*+',$$-.&#',/"-0%.12'32./4'5,5'6/%&)$).2&'7./&)8'5,5'9/2%.%3%&8':")08';:
More informationHow To Make A Credit Risk Model For A Bank Account
TRANSACTIONAL DATA MINING AT LLOYDS BANKING GROUP Csaba Főző csaba.fozo@lloydsbanking.com 15 October 2015 CONTENTS Introduction 04 Random Forest Methodology 06 Transactional Data Mining Project 17 Conclusions
More information