Text mining for insurance claim cost prediction
|
|
|
- Doris Tate
- 10 years ago
- Views:
Transcription
1 Text mining for insurance claim cost prediction Prepared by Inna Kolyshkina and Marcel van Rooyen Presented to the Institute of Actuaries of Australia XVth General Insurance Seminar October 2005 This paper has been prepared for the Institute of Actuaries of Australia s (Institute) XVth General Insurance Seminar The Institute Council wishes it to be understood that opinions put forward herein are not necessarily those of the Institute and the Council is not responsible for those opinions. PricewaterhouseCoopers 2005 The Institute will ensure that all reproductions of the paper acknowledge the Author/s as the author/s, and include the above copyright statement: The Institute of Actuaries of Australia Level 7 Challis House 4 Martin Place Sydney NSW Australia 2000 Telephone: Facsimile: [email protected] Website:
2 Abstract. The paper presents the findings of an industry-based study in the utility of text mining. The purpose of the study was to evaluate the impact of textual information in claims cost prediction. The industrial research setting was a large Australian insurance company. The data mining methodologies used in this research included text mining, and the application of the results from the text mining in subsequent predictive data mining models. The researchers used software of the leading commercial vendors. The research found commercially interesting utility in textual information for claim cost prediction, and also identified new risk management factors. Key words: text mining, predictive model, insurance claim prediction, risk management. Introduction Claims cost prediction is an important focus area for insurance companies. The reason is that proactive case management can significantly reduce the final claims pay-out value. The issue is particularly relevant, when considered that a small number of cases amount to a disproportionately big portion of total claims pay-out value. It follows that small improvements in claims payout value prediction, may bring significant financial benefits to insurance companies. There has been recognition for some time now that data about incidents contain information which allows for a proactive risk management approach (Feyer and Williamson 1998, p.1). The large size of insurance databases is making data mining an increasingly attractive tool for analysis compared to traditional analytical methods (Kolyshkina, Steinberg et al. 2003, p.493). Up to 80% of this data is in unstructured textual format (Feldman 2003, p.481). Realising the potential value of information resident in this textual data, there is growing interest by insurers in the application of new text mining techniques (Feyer, Stout et al. 2001). We refer to an example where text mining analysis of narrative fields about claims, resulted in beneficial claims management and fraud detection in the occupational injury insurance domain (Stout 1998). In the example, the benefits stemmed from information in the textual narrative data, which was not present in the existing coding system. Based on the above, the researchers formulated the relevant question, whether textual data can improve claim pay-out value prediction. If the answer was positive, the further question arises how to use the predictive Page 1
3 potency of textual data in such predictive models. The researchers therefore set out to investigate the utility of unstructured textual data in improving predicted pay-out value of insurance claims. The research was done using data provided by a large Australian insurance company. That insurer first wanted to asses the potential value that using text mining facilities could add to the organisation in increasing the precision of claim cost prediction; second to explore the possibilities and benefits of augmenting their existing incident coding system using free text; and thirdly to suggest how text mining could be used for improvement in other areas of the business. Our approach was to create a model identifying at the time of the incident report, whether the incident would result in a claim pay-out value within the top 10 percent by value, by the end of the next quarter. We assessed the model in terms of the predictive power of textual information on its own, and in terms of textual information adding predictive power to other, non-textual predictors. 1 Description of algorithms used The researchers used both SAS Enterprise Miner and SPSS Clementine text mining software for textual data preparation and text mining. The discovered concepts were similar irrespective of the software package used. Predictive feature selection and predictive modelling was done using both CART and TreeNet (Hastie, Tibshirani et al. 2001). The choice of TreeNet was because of its high precision in predictive modelling, its effectiveness in selecting predictors, and its resistance to overtraining. CART models are more easily interpretable than TreeNet models, therefore the researchers used CART in conjunction with TreeNet to assure understandability about the predictive models. 2 Data Description The first group of data comprised features about claimant demographics, claims pay-out value information, and codings about various aspects of the incident (e.g. about the body part injured). To facilitate the discussion, we name this first group of data TransData. The textual data contained unstructured free-type text fields of about 200 characters each. These fields described the incident and the resulting injury. We name this second group of textual data TextData. Page 2
4 Both data groups were identified by claim numbers. The data sets represented all claims reported between 30 September 2002 and 31 March 2004, which were still open at the time of the research. This was an 18 month data history, which provided approximately 56,000 records. The target variable for prediction was a binary indicator (yes/no) of whether or not that injury report had resulted in a claim pay-out value within the top 10 percent by the end of the quarter of the report. The quarter represents a three-month time window of investigation. Page 3
5 3 Description of analytical techniques In Figure 1 we present the research process flow: 1. Prepare TextData 2. Discover concepts (text mining) 3. Reduce concepts Prepared TransData 6. Predictive modelling with concepts only 5. Derive domain relevant concepts 4. Select predictive concepts 7. Evaluate results 8. Predictive modelling with features only 9. Predictive modelling with concepts and features 10. Compare results and conclude Figure 1: Research process flow We will discuss our research process under the following headings. These headings are the same as the labels of the elements of Figure 1. Page 4
6 3.1 Prepare TextData The process started at point one with the preparation of TextData for text mining. The preparation included routine activities of extracting data, and merging TextData and TransData by the unique claim identity. 3.2 Discover concepts Text mining is the discovery of concepts from unstructured data. Concepts are words or word combinations resident in the text. These concepts can then be mined in isolation, or in combination with other data. Within this context, text mining can be considered a way of pre-processing unstructured data for use in subsequent modelling. The researchers mined TextData at point two to discover concepts from it. The mining of TextData required not only considerable time and effort, but also expertise in the insurance domain, and in the used software packages.the mining process was characterised by iterative experimentation to find optimal algorithm settings. These settings included both language and mathematical weightings. 3.3 Reduce concepts About 8000 concepts were discovered in the preceding activity. Not only did it prove difficult to make sense of so many concepts, but it would also make subsequent analysis intractable. Further, concepts with a low frequency would not be relevant within our context. Therefore at point three the researchers filtered out those concepts which had a frequency of less than 50 in TextData. After filtering 860 concepts remained. The issue of the predictive value of concepts now needed resolution. We had prior knowledge about the predictive value of the data features in TransData from existing modelling by the insurer. 3.4 Select predictive concepts The researchers resolved the issue of concept predictability by using TreeNet at point four to identify the most predictive of the 860 concepts. We present the nine most predictive concepts from this step in Table 1. The first column of Table 1 lists the concept names which were selected by TreeNet, and the second column states each concept s relative predictive importance. Page 5
7 Table 1: Concept importance using TreeNet Concept Concept name importance LEG 100 LACERATED FRACTURE STRESS_STRESS EYE HERNIA TRUCK BURN LADDER Derive domain-relevant concepts The researchers depended on insurance domain expertise for deriving additional features at point five. This encompassed the grouping and combining of concepts e.g. to injure and to hurt were set as equivalent terms. Concept derivation was assisted by referring to other targets in addition to the predicted target. 3.6 Predictive modelling with concepts only At point six the researchers built one TreeNet and one CART predictive model for claims cost, using only the predictive concepts and the derived concepts. The CART model was built for interpretability of results. The purpose with building these two models was to discover any predictive potential of the textual concepts only. 3.7 Evaluate results At point seven the researchers evaluated these two models based on the concepts alone by referring to model topology, gains charts, and model accuracy. The TreeNet model was 75.7% precise on test data. We present the gains charts for the two models in Figure 2: Page 6
8 Figure 2: Gains chart TreeNet (left)and CART (right) From these CART and TreeNet gains charts we evaluated that the concepts had predictive potential. We show the reduced model topology from CART in Figure 3: Figure 3: Topology CART model using concepts only The first split is on the concept eye, followed by lacerated, stress, fracture, hernia, truck, and ladder. This topology made domain sense, since eye-related claims have a relatively low cost, as do laceration-related claims. Table 2: Concept importance using CART Concept name Concept importance GROUP_EYE 100 GROUP_LACERATED FOREIGNBODY GROUP_STRESS GROUP_FRACTURE HERNIA KNIFE GROUP_LADDER Page 7
9 GROUP_TRUCK In Table 2 we present the most important concepts as identified by this CART model. The concept names which start with GROUP_, are derived concepts from step five. The positive evaluation of the predictive results encouraged the researchers to further experiment about the predictability of the concepts in combination with TransData features. We describe that experimentation below. 3.8 Predictive modelling with features only At this point the researchers build one CART model and one TreeNet model, predicting claims pay-out value. We build these models to create points of comparative reference for the combined models which we will describe in point 9. In both models we used only TransData features i.e. there were no concepts in these models. The CART model (called C1) we built on data sets (or observations), which captured all the insurer's incident codes. Domain experts suggested that textual information may be particularly useful for improving prediction precision for those incident codes which cover many cases and have high pay-out value variance. We therefore build the TreeNet model (called T1) on data sets (or observations) which represented such an incident injury nature code (i.e. type of injury). 3.9 Predictive modelling with features and concepts At point nine the researchers build one more CART model and one more TreeNet model, also predicting claims pay-out. In these two models we combined TransData features with predictive concepts from TextData. The combined CART model (called C2) we built on the same data sets as C1 above. The combined TreeNet model (called T2) we build on the same data sets as T1 above. In the next section we will compare the results from these four models, and make inference about the contribution of the concepts to predictive accuracy about claims cost. Page 8
10 3.10 Compare results and conclude We present the gains charts of the C1 and C2 models in Figure 4: Figure 4: Gains chart CART models C1 left and C2 right Comparing the two gains charts shows about a 5% increase in predictive performance of C2 over C1. In Figure 5 we present the gains charts of TreeNet models T1 and T2: Figure 5: Gains charts TreeNet T1 left and T2 right The gains chart of T2 shows about a 10% improvement over the gains chart of T1. We note that this improvement is twice as much as the improvement over the full data set (C2 over C1). This is consistent with the idea that textual features add value to claims pay-out prediction in large code categories. We attribute the increase in predictive value of the combined models, to resolution which is added to the models by the addition of the concepts. In Figure 6 we display the additional resolution in the C2 CART model. Page 9
11 MECHINJC MECHINJC INJDSNAT MECHINJC MECHINJC INJDSNAT MECHINJC MECHI NJC INJDSNAT DISC INJDSNAT EYE SHOULDER TRUCK LEG MECHI NJC STRESS_STRESS MECHINJC STRAINEDRIGHTSHOULDE RESIDENT Text mining for insurance claim cost prediction Figure 6: Additional CART model resolution from concepts The binary splits outside the square area in Figure 6, are on features from TransData e.g. injury location (part of body), nature of the injury, and mechanism of the injury. The splits inside the square were on concepts from TextData. The concepts found in the square are not covered by the insurer s existing coding system. Examples of such concepts were truck, roof, and ladder. Such concepts can also be useful in identifying particular industry OH&S issues, which require proactive risk management strategies. From the above results, the researchers conclude that the combined models had increased predictive precision in claim cost prediction, compared to the models with TransData features only. The conclusion is valid for the full data set, as well as for the high frequency and variance data subset. Mining unstructured textual data therefore did improve the prediction of insurance claims cost. We also discovered new concepts which complement the insurer s existing coding system. The next section contains ideas about how text mining could be used for improving other areas of the insurer s business. 4 Potential applications of text mining by the insurance industry The researchers offer their impressions of some possible uses of text mining in the insurance industry; first, we perceive application in an insurer s capital management. Improved prediction about future insurance claims cost, should bring about improvements in the quantification of re-insurance needs. Such quantification about needs will be invaluable in negotiating favourable reinsurance premiums, and terms and conditions, with re-insurers. Further, Page 10
12 such quantification should assist with better planning about working capital requirements for that portion of their risk which is not re-insurable; a second potential application is where textual data from the incident investigation is available for analysis. Insurers could identify new risk factors, and previously unknown interactions between known risk factors. Such knowledge would enable insurers to develop industry-specific proactive risk management practices with their insured clients, with consequential financial benefits to both parties; a third potential application is where textual data is available from afterthe-event therapy sessions with victims. Here, text mining could be used to discover concepts about victims attitude about and perception of risk, leading up to the risk event (Dedobbeleer and Béland 1998). Knowing such psychographic factors will enable insurers to develop industryspecific, proactive behaviour management programs with their clients. Such programs could bring about paradigm shift in the approach to risk management; a further potential application is quality control of the application of incident codlings in reporting systems, and for development of new incident reporting and investigation business rules; further, in those cases where textual data is available from market research interviews with potential retail insurance customers, text mining can be used to discover consumers attitudes about, and needs for insurance. Such concepts can then be used for segmenting the retail market simultaneously for attitude and need, using conceptual clustering techniques. Such multi-dimensional market segmentation would enable insurance retailers to develop campaign offers which are much better matched to consumer needs and attitudes, than what is possible, for instance, with demographic segmentation approaches. Such campaign offers will result in better campaign response rates, and improved competitive advantage for insurers (Berry and Linoff 2004) Chapter 4; data mining has been proven as an effective tool in fraud detection and prevention (Phua, Lee et al. Submitted). Text mining is already being used in the insurance claims fraud discovery and prevention arena (Mailvaganam Accessed February 2005) (Ellingworth and Sullivan Accessed February 2005), and the researchers perceive an underutilised opportunity by the insurance industry. This entails mining unstructured textual data from claimants contact with the insurer. Such mining would be focused at discovering both known and new concepts from that data, upon which to base both proactive and remedial fraud management. Page 11
13 Future work The researchers identify a number of issues which require further research. The first is researching the value of text mining in predicting other targets of interest to the insurance industry. Such other targets could be number of days off work, cost of medical treatment, or exact claim pay-out amount. Further, research could extend the modelling time horizon about the target past the current three-month period, to for instance six months or even 12 months. This will enable insurers to accordingly extend their planning horizon about claims cost. Research about the utility of text mining for business rule development and fraud management could be valuable. Research is also required into some of the opportunities identified in the previous section to: quantify the dollar benefits from improved capital management which is realisable from text mining; discover new risk factors or interactions between risk factors; use text mining as a psychographic profiling and segmenting tool; and quantify the monetary benefits from marketing campaigns based upon this psychographic approach compared to traditional marketing approaches. Acknowledgements The researchers acknowledge the contributions to this research of Michael Playford and Anne-Marie Feyer, both PricewaterhouseCoopers partners, for offering help and valuable insights; Dominic Roe, consultant PricewaterhouseCoopers, and Bianca Zubac, student UNSW for their assistance in project facilitation and data analysis; SAS Institute and SPSS, both for providing text mining software for the project as well as for assistance and guidance in using text mining software. Page 12
14 References Berry, M. J. A. and G. S. Linoff, 2004, Data Mining Techniques for Marketing, Sales, and Customer Relationship Management, Indianapolis, Wiley. Dedobbeleer, N. and F. Béland, 1998, Is risk perception one of the dimensions of safety climate? Occupational Injury: Risk, Prevention and Intervention. A.- M. Feyer and A. Williamson. London, Taylor & Francis: Ellingworth, M. and D. Sullivan, accessed February 2005, 'Text Mining Improves Business Intelligence and Predictive Modeling in Insurance', Feldman, R., 2003, Mining Text Data. The Handbook of Data Mining, N. Ye. London, Lawrence Erlbaum Associates: Feyer, A.-M., N. Stout, et al, 2001, 'Use of narrative analysis for comparisons of the causes of fatal accidents in three countries: New Zealand, Australia, and the United States', in Injury Prevention 7: i15-i20. Feyer, A.-M. and A. Williamson, 1998, Introduction. Occupational Injury: Risk, Prevention and Intervention. A.-M. Feyer and A. Williamson. London, Taylor & Francis: 1-3. Hastie, T., R. Tibshirani, et al, 2001, The elements of statistical learning: Data mining, inference and prediction, New York, Springer-Verlag. Kolyshkina, I., D. Steinberg, et al, 2003, 'Using Data Mining for Modeling Insurance Risk and Comparison of Data Mining and Linear Modeling Approaches', in Intelligent and Other Computational Techniques in Insurance: Theory and Applications, A. F. Shapiro and L. C. Jain. London, World Scientific. Volume 6: Mailvaganam, H., accessed February 2005, 'Text Mining for Fraud Detection: Creating cost effective data mining solutions for fraud analysis', Phua, C., V. Lee, et al. (Submitted). 'A Comprehensive Survey of Data Mining-based Fraud Detection Research.' Submitted. Stout, N. (1998). 'Analysis of narrative text fields in occupational injury data', in Occupational Injury: Risk, Prevention and Intervention, A.-M. Feyer and A. Williamson. London, Taylor & Francis: Page 13
PERCEPTION OF BUILDING CONSTRUCTION WORKERS TOWARDS SAFETY, HEALTH AND ENVIRONMENT
Journal of Engineering Science and Technology Vol. 2, No. 3 (2007) 271-279 School of Engineering, Taylor s University College PERCEPTION OF BUILDING CONSTRUCTION WORKERS TOWARDS SAFETY, HEALTH AND ENVIRONMENT
What is Customer Relationship Management? Customer Relationship Management Analytics. Customer Life Cycle. Objectives of CRM. Three Types of CRM
Relationship Management Analytics What is Relationship Management? CRM is a strategy which utilises a combination of Week 13: Summary information technology policies processes, employees to develop profitable
The Science and Art of Market Segmentation Using PROC FASTCLUS Mark E. Thompson, Forefront Economics Inc, Beaverton, Oregon
The Science and Art of Market Segmentation Using PROC FASTCLUS Mark E. Thompson, Forefront Economics Inc, Beaverton, Oregon ABSTRACT Effective business development strategies often begin with market segmentation,
Nine Common Types of Data Mining Techniques Used in Predictive Analytics
1 Nine Common Types of Data Mining Techniques Used in Predictive Analytics By Laura Patterson, President, VisionEdge Marketing Predictive analytics enable you to develop mathematical models to help better
Data Mining and Marketing Intelligence
Data Mining and Marketing Intelligence Alberto Saccardi 1. Data Mining: a Simple Neologism or an Efficient Approach for the Marketing Intelligence? The streamlining of a marketing campaign, the creation
Get to Know the IBM SPSS Product Portfolio
IBM Software Business Analytics Product portfolio Get to Know the IBM SPSS Product Portfolio Offering integrated analytical capabilities that help organizations use data to drive improved outcomes 123
REINSURANCE PROFIT SHARE
REINSURANCE PROFIT SHARE Prepared by Damian Thornley Presented to the Institute of Actuaries of Australia Biennial Convention 23-26 September 2007 Christchurch, New Zealand This paper has been prepared
Hexaware E-book on Predictive Analytics
Hexaware E-book on Predictive Analytics Business Intelligence & Analytics Actionable Intelligence Enabled Published on : Feb 7, 2012 Hexaware E-book on Predictive Analytics What is Data mining? Data mining,
Mine Your Business A Novel Application of Association Rules for Insurance Claims Analytics
Mine Your Business A Novel Application of Association Rules for Insurance Claims Analytics Lucas Lau and Arun Tripathi, Ph.D. Abstract: This paper describes how a data mining technique known as Association
How to Get More Value from Your Survey Data
Technical report How to Get More Value from Your Survey Data Discover four advanced analysis techniques that make survey research more effective Table of contents Introduction..............................................................2
Data Mining Solutions for the Business Environment
Database Systems Journal vol. IV, no. 4/2013 21 Data Mining Solutions for the Business Environment Ruxandra PETRE University of Economic Studies, Bucharest, Romania [email protected] Over
DATA MINING TECHNOLOGY. Keywords: data mining, data warehouse, knowledge discovery, OLAP, OLAM.
DATA MINING TECHNOLOGY Georgiana Marin 1 Abstract In terms of data processing, classical statistical models are restrictive; it requires hypotheses, the knowledge and experience of specialists, equations,
Why Read This Guide On VBI?... 2. VBI Defined... 2. Who Uses VBI?... 3. How Can VBI Support Loss Prevention?... 3
Contents Why Read This Guide On VBI?... 2 VBI Defined... 2 Who Uses VBI?... 3 How Can VBI Support Loss Prevention?... 3 How Is VBI Helping Business Operations To Become More Effective?... 4 The Role of
TEXT ANALYTICS INTEGRATION
TEXT ANALYTICS INTEGRATION A TELECOMMUNICATIONS BEST PRACTICES CASE STUDY VISION COMMON ANALYTICAL ENVIRONMENT Structured Unstructured Analytical Mining Text Discovery Text Categorization Text Sentiment
Data Mining with SAS. Mathias Lanner [email protected]. Copyright 2010 SAS Institute Inc. All rights reserved.
Data Mining with SAS Mathias Lanner [email protected] Copyright 2010 SAS Institute Inc. All rights reserved. Agenda Data mining Introduction Data mining applications Data mining techniques SEMMA
Evaluating Data Mining Models: A Pattern Language
Evaluating Data Mining Models: A Pattern Language Jerffeson Souza Stan Matwin Nathalie Japkowicz School of Information Technology and Engineering University of Ottawa K1N 6N5, Canada {jsouza,stan,nat}@site.uottawa.ca
Predictive Analytics for Donor Management
IBM Software Business Analytics IBM SPSS Predictive Analytics Predictive Analytics for Donor Management Predictive Analytics for Donor Management Contents 2 Overview 3 The challenges of donor management
Trends in Large Common Law Personal Injury Claims
Trends in Large Common Law Personal Injury Claims Prepared by Gae Robinson and Gillian Harrex Presented to the Institute of Actuaries of Australia XIV General Insurance Seminar 2003 9-12 November 2003
Five Predictive Imperatives for Maximizing Customer Value
Executive Brief Five Predictive Imperatives for Maximizing Customer Value Applying Predictive Analytics to enhance customer relationship management Table of contents Executive summary...2 The five predictive
Five predictive imperatives for maximizing customer value
Five predictive imperatives for maximizing customer value Applying predictive analytics to enhance customer relationship management Contents: 1 Introduction 4 The five predictive imperatives 13 Products
Easily Identify Your Best Customers
IBM SPSS Statistics Easily Identify Your Best Customers Use IBM SPSS predictive analytics software to gain insight from your customer database Contents: 1 Introduction 2 Exploring customer data Where do
Role Description. Australian Museum
Role Description Business Systems Coordinator Cluster Division/Branch/Unit Location Department of Trade & Investment Australian Museum Digital, Online & ICT Sydney CBD Classification/Grade/Band Clerk Grade
Basic Steps for Developing a VAW Social Marketing Campaign
16 Basic Steps for Developing a VAW Social Marketing Campaign Facilitate. Educate. Collaborate. Page 2 of 6 The opinions expressed here are those of the authors and do not necessarily reflect the views
Data Mining for Fun and Profit
Data Mining for Fun and Profit Data mining is the extraction of implicit, previously unknown, and potentially useful information from data. - Ian H. Witten, Data Mining: Practical Machine Learning Tools
Text Analytics. A business guide
Text Analytics A business guide February 2014 Contents 3 The Business Value of Text Analytics 4 What is Text Analytics? 6 Text Analytics Methods 8 Unstructured Meets Structured Data 9 Business Application
How Organisations Are Using Data Mining Techniques To Gain a Competitive Advantage John Spooner SAS UK
How Organisations Are Using Data Mining Techniques To Gain a Competitive Advantage John Spooner SAS UK Agenda Analytics why now? The process around data and text mining Case Studies The Value of Information
CRM Forum Resources http://www.crm-forum.com
CRM Forum Resources http://www.crm-forum.com BEHAVIOURAL SEGMENTATION SYSTEMS - A Perspective Author: Brian Birkhead Copyright Brian Birkhead January 1999 Copyright Brian Birkhead, 1999. Supplied by The
CAPTURING THE VALUE OF UNSTRUCTURED DATA: INTRODUCTION TO TEXT MINING
CAPTURING THE VALUE OF UNSTRUCTURED DATA: INTRODUCTION TO TEXT MINING Mary-Elizabeth ( M-E ) Eddlestone Principal Systems Engineer, Analytics SAS Customer Loyalty, SAS Institute, Inc. Is there valuable
Safety Risk Predictive Analytics to improve safety performance
Safety Risk Predictive Analytics to improve safety performance How we can help you with your safety challenges July 2014 Safety: At the heart of it Improving health and safety Operational safety risk management
Data Mining Applications in Higher Education
Executive report Data Mining Applications in Higher Education Jing Luan, PhD Chief Planning and Research Officer, Cabrillo College Founder, Knowledge Discovery Laboratories Table of contents Introduction..............................................................2
Digging for Gold: Business Usage for Data Mining Kim Foster, CoreTech Consulting Group, Inc., King of Prussia, PA
Digging for Gold: Business Usage for Data Mining Kim Foster, CoreTech Consulting Group, Inc., King of Prussia, PA ABSTRACT Current trends in data mining allow the business community to take advantage of
Big Data, Socio- Psychological Theory, Algorithmic Text Analysis, and Predicting the Michigan Consumer Sentiment Index
Big Data, Socio- Psychological Theory, Algorithmic Text Analysis, and Predicting the Michigan Consumer Sentiment Index Rickard Nyman *, Paul Ormerod Centre for the Study of Decision Making Under Uncertainty,
W. Heath Rushing Adsurgo LLC. Harness the Power of Text Analytics: Unstructured Data Analysis for Healthcare. Session H-1 JTCC: October 23, 2015
W. Heath Rushing Adsurgo LLC Harness the Power of Text Analytics: Unstructured Data Analysis for Healthcare Session H-1 JTCC: October 23, 2015 Outline Demonstration: Recent article on cnn.com Introduction
Advanced Analytics. The Way Forward for Businesses. Dr. Sujatha R Upadhyaya
Advanced Analytics The Way Forward for Businesses Dr. Sujatha R Upadhyaya Nov 2009 Advanced Analytics Adding Value to Every Business In this tough and competitive market, businesses are fighting to gain
Data Mining Application in Direct Marketing: Identifying Hot Prospects for Banking Product
Data Mining Application in Direct Marketing: Identifying Hot Prospects for Banking Product Sagarika Prusty Web Data Mining (ECT 584),Spring 2013 DePaul University,Chicago [email protected] Keywords:
Data Mining Approaches to Modeling Insurance Risk. Dan Steinberg, Mikhail Golovnya, Scott Cardell. Salford Systems 2009
Data Mining Approaches to Modeling Insurance Risk Dan Steinberg, Mikhail Golovnya, Scott Cardell Salford Systems 2009 Overview of Topics Covered Examples in the Insurance Industry Predicting at the outset
Practical Data Science with Azure Machine Learning, SQL Data Mining, and R
Practical Data Science with Azure Machine Learning, SQL Data Mining, and R Overview This 4-day class is the first of the two data science courses taught by Rafal Lukawiecki. Some of the topics will be
Foundations of Business Intelligence: Databases and Information Management
Foundations of Business Intelligence: Databases and Information Management Problem: HP s numerous systems unable to deliver the information needed for a complete picture of business operations, lack of
Data Mining Algorithms and Techniques Research in CRM Systems
Data Mining Algorithms and Techniques Research in CRM Systems ADELA TUDOR, ADELA BARA, IULIANA BOTHA The Bucharest Academy of Economic Studies Bucharest ROMANIA {Adela_Lungu}@yahoo.com {Bara.Adela, Iuliana.Botha}@ie.ase.ro
Crime Pattern Analysis
Crime Pattern Analysis Megaputer Case Study in Text Mining Vijay Kollepara Sergei Ananyan www.megaputer.com Megaputer Intelligence 120 West Seventh Street, Suite 310 Bloomington, IN 47404 USA +1 812-330-01
Introduction to Data Mining
Introduction to Data Mining Jay Urbain Credits: Nazli Goharian & David Grossman @ IIT Outline Introduction Data Pre-processing Data Mining Algorithms Naïve Bayes Decision Tree Neural Network Association
Using Data Mining to Detect Insurance Fraud
IBM SPSS Modeler Using Data Mining to Detect Insurance Fraud Improve accuracy and minimize loss Highlights: Combine powerful analytical techniques with existing fraud detection and prevention efforts Build
From Cognitive Science to Data Mining: The first intelligence amplifier
From Cognitive Science to Data Mining: The first intelligence amplifier Tom Khabaza Abstract This paper gives a brief account of two hypotheses. First that data mining is a kind of intelligence amplifier,
DATA MINING TECHNIQUES AND APPLICATIONS
DATA MINING TECHNIQUES AND APPLICATIONS Mrs. Bharati M. Ramageri, Lecturer Modern Institute of Information Technology and Research, Department of Computer Application, Yamunanagar, Nigdi Pune, Maharashtra,
What is Data Mining? Data Mining (Knowledge discovery in database) Data mining: Basic steps. Mining tasks. Classification: YES, NO
What is Data Mining? Data Mining (Knowledge discovery in database) Data Mining: "The non trivial extraction of implicit, previously unknown, and potentially useful information from data" William J Frawley,
Why are Organizations Interested?
SAS Text Analytics Mary-Elizabeth ( M-E ) Eddlestone SAS Customer Loyalty [email protected] +1 (607) 256-7929 Why are Organizations Interested? Text Analytics 2009: User Perspectives on Solutions
TAC Claims Management Transformation The Journey Continues
TAC Claims Management Transformation The Journey Continues Prepared by Damian Poel and Natalie Pocock Presented to the Actuaries Institute Injury Schemes Seminar 10 12 November 2013 Gold Coast This paper
Text Mining in JMP with R Andrew T. Karl, Senior Management Consultant, Adsurgo LLC Heath Rushing, Principal Consultant and Co-Founder, Adsurgo LLC
Text Mining in JMP with R Andrew T. Karl, Senior Management Consultant, Adsurgo LLC Heath Rushing, Principal Consultant and Co-Founder, Adsurgo LLC 1. Introduction A popular rule of thumb suggests that
A STUDY OF DATA MINING ACTIVITIES FOR MARKET RESEARCH
205 A STUDY OF DATA MINING ACTIVITIES FOR MARKET RESEARCH ABSTRACT MR. HEMANT KUMAR*; DR. SARMISTHA SARMA** *Assistant Professor, Department of Information Technology (IT), Institute of Innovation in Technology
STATISTICA. Clustering Techniques. Case Study: Defining Clusters of Shopping Center Patrons. and
Clustering Techniques and STATISTICA Case Study: Defining Clusters of Shopping Center Patrons STATISTICA Solutions for Business Intelligence, Data Mining, Quality Control, and Web-based Analytics Table
STAGE 1 COMPETENCY STANDARD FOR ENGINEERING ASSOCIATE
STAGE 1 STANDARD FOR ENGINEERING ASSOCIATE ROLE DESCRIPTION THE MATURE ENGINEERING ASSOCIATE The following characterises the senior practice role that the mature, Engineering Associate may be expected
Methodology Framework for Analysis and Design of Business Intelligence Systems
Applied Mathematical Sciences, Vol. 7, 2013, no. 31, 1523-1528 HIKARI Ltd, www.m-hikari.com Methodology Framework for Analysis and Design of Business Intelligence Systems Martin Závodný Department of Information
Text Mining for Business Intelligence
Project Proposal (Draft) Text Mining for Business Intelligence By Abhinut Srimasorn (5322793399) Advisor Dr. Thanaruk Theeramunkong School of Information, Computer and Communication Technology, Sirindhorn
How To Analyze Claims Data
ACE CLAIMS MANAGEMENT ACE 4D: POWER OF PREDICTIVE ANALYTICS THE STATE-OF-THE-ART PROGRESSES Predictive data analytics is coming out of the shadows to change the course of claims management. A new approach,
Five Fundamental Data Quality Practices
Five Fundamental Data Quality Practices W H I T E PA P E R : DATA QUALITY & DATA INTEGRATION David Loshin WHITE PAPER: DATA QUALITY & DATA INTEGRATION Five Fundamental Data Quality Practices 2 INTRODUCTION
Insightful Analytics: Leveraging the data explosion for business optimisation. Top Ten Challenges for Investment Banks 2015
Insightful Analytics: Leveraging the data explosion for business optimisation 09 Top Ten Challenges for Investment Banks 2015 Insightful Analytics: Leveraging the data explosion for business optimisation
131-1. Adding New Level in KDD to Make the Web Usage Mining More Efficient. Abstract. 1. Introduction [1]. 1/10
1/10 131-1 Adding New Level in KDD to Make the Web Usage Mining More Efficient Mohammad Ala a AL_Hamami PHD Student, Lecturer m_ah_1@yahoocom Soukaena Hassan Hashem PHD Student, Lecturer soukaena_hassan@yahoocom
Navigating Big Data business analytics
mwd a d v i s o r s Navigating Big Data business analytics Helena Schwenk A special report prepared for Actuate May 2013 This report is the third in a series and focuses principally on explaining what
The Scientific Data Mining Process
Chapter 4 The Scientific Data Mining Process When I use a word, Humpty Dumpty said, in rather a scornful tone, it means just what I choose it to mean neither more nor less. Lewis Carroll [87, p. 214] In
Predictive Modeling in Workers Compensation 2008 CAS Ratemaking Seminar
Predictive Modeling in Workers Compensation 2008 CAS Ratemaking Seminar Prepared by Louise Francis, FCAS, MAAA Francis Analytics and Actuarial Data Mining, Inc. www.data-mines.com [email protected]
Potential Value of Data Mining for Customer Relationship Marketing in the Banking Industry
Advances in Natural and Applied Sciences, 3(1): 73-78, 2009 ISSN 1995-0772 2009, American Eurasian Network for Scientific Information This is a refereed journal and all articles are professionally screened
Database Marketing, Business Intelligence and Knowledge Discovery
Database Marketing, Business Intelligence and Knowledge Discovery Note: Using material from Tan / Steinbach / Kumar (2005) Introduction to Data Mining,, Addison Wesley; and Cios / Pedrycz / Swiniarski
Healthcare Measurement Analysis Using Data mining Techniques
www.ijecs.in International Journal Of Engineering And Computer Science ISSN:2319-7242 Volume 03 Issue 07 July, 2014 Page No. 7058-7064 Healthcare Measurement Analysis Using Data mining Techniques 1 Dr.A.Shaik
Business Intelligence and Decision Support Systems
Chapter 12 Business Intelligence and Decision Support Systems Information Technology For Management 7 th Edition Turban & Volonino Based on lecture slides by L. Beaubien, Providence College John Wiley
Recent Trends in the Australian Disability Income Protection Insurance Market
White Paper Recent Trends in the Australian Disability Income Protection Insurance Market Are There Lessons for Other Markets? FINEOS Copyright 2014 www.fineos.com Over the last two years the quarterly
Predictive Modeling for Workers Compensation Claims
Predictive Modeling for Workers Compensation Claims AASCIF Super Conference Kirsten C. Hernan Deloitte Consulting LLP October 4, 2012 NOTICE: THIS DOCUMENT IS PROPRIETARY AND CONFIDENTIAL This document
Retail analytics in the context of Segmentation, Targeting, Optimisation of the operations of convenience store franchises
Retail analytics in the context of Segmentation, Targeting, Optimisation of the operations of convenience store franchises Authors Dr. Inna Kolyshkina Head of Statistics, Advanced Analytics Synovate Aztec
Application of Predictive Model for Elementary Students with Special Needs in New Era University
Application of Predictive Model for Elementary Students with Special Needs in New Era University Jannelle ds. Ligao, Calvin Jon A. Lingat, Kristine Nicole P. Chiu, Cym Quiambao, Laurice Anne A. Iglesia
ANALYTICS. Acxiom Marketing Maturity Model CheckPoint. Are you where you want to be? Or do you need to advance your analytics capabilities?
ANALYTICS Analytics defined Analytics is the process of studying data to identify potential trends, evaluate decisions, or assess the performance of a tool, event, or scenario. The process should include
Big Data and Healthcare Payers WHITE PAPER
Knowledgent White Paper Series Big Data and Healthcare Payers WHITE PAPER Summary With the implementation of the Affordable Care Act, the transition to a more member-centric relationship model, and other
NATIONAL INSURANCE BROKERS ASSOCIATION OF AUSTRALIA (NIBA) Submission to WorkCover Western Australia. Legislative Review 2013
NATIONAL INSURANCE BROKERS ASSOCIATION OF AUSTRALIA (NIBA) ABOUT NIBA Submission to WorkCover Western Australia Legislative Review 2013 February 2014 NIBA is the peak body of the insurance broking profession
HMRC Tax Credits Error and Fraud Additional Capacity Trial. Customer Experience Survey Report on Findings. HM Revenue and Customs Research Report 306
HMRC Tax Credits Error and Fraud Additional Capacity Trial Customer Experience Survey Report on Findings HM Revenue and Customs Research Report 306 TNS BMRB February2014 Crown Copyright 2014 JN119315 Disclaimer
BOOSTING - A METHOD FOR IMPROVING THE ACCURACY OF PREDICTIVE MODEL
The Fifth International Conference on e-learning (elearning-2014), 22-23 September 2014, Belgrade, Serbia BOOSTING - A METHOD FOR IMPROVING THE ACCURACY OF PREDICTIVE MODEL SNJEŽANA MILINKOVIĆ University
Data Mining Techniques
15.564 Information Technology I Business Intelligence Outline Operational vs. Decision Support Systems What is Data Mining? Overview of Data Mining Techniques Overview of Data Mining Process Data Warehouses
Using Data Mining to Detect Insurance Fraud
IBM SPSS Modeler Using Data Mining to Detect Insurance Fraud Improve accuracy and minimize loss Highlights: combines powerful analytical techniques with existing fraud detection and prevention efforts
An ageing workforce and workers compensation what are the implications, in particular with an increasing national retirement age?
An ageing workforce and workers compensation what are the implications, in particular with an increasing national retirement age? Prepared by Andrew McInerney Presented to the Institute of Actuaries of
2015 Workshops for Professors
SAS Education Grow with us Offered by the SAS Global Academic Program Supporting teaching, learning and research in higher education 2015 Workshops for Professors 1 Workshops for Professors As the market
How Big Data Transforms Data Protection and Storage
I D C E X E C U T I V E B R I E F How Big Data Transforms Data Protection and Storage August 2012 Written by Carla Arend Sponsored by CommVault Introduction: How Big Data Transforms Storage Omøgade 8 P.O.Box
Azure Machine Learning, SQL Data Mining and R
Azure Machine Learning, SQL Data Mining and R Day-by-day Agenda Prerequisites No formal prerequisites. Basic knowledge of SQL Server Data Tools, Excel and any analytical experience helps. Best of all:
SRC Commission. Positive Performance Indicators. Measuring Safety, Rehabilitation and Compensation Performance
SRC Commission Positive Performance Indicators Measuring Safety, Rehabilitation and Compensation Performance A question for heads of corporate management Do you know if prevention and injury management
IBM SPSS Direct Marketing
IBM Software IBM SPSS Statistics 19 IBM SPSS Direct Marketing Understand your customers and improve marketing campaigns Highlights With IBM SPSS Direct Marketing, you can: Understand your customers in
TIETS34 Seminar: Data Mining on Biometric identification
TIETS34 Seminar: Data Mining on Biometric identification Youming Zhang Computer Science, School of Information Sciences, 33014 University of Tampere, Finland [email protected] Course Description Content
Third Party Distribution Delivery and Service Strategies
Third Party Distribution Delivery and Service Strategies Prepared by Sally Wijesundera and Tom Bayley Presented to the Institute of Actuaries of Australia XIV General Insurance Seminar 2003 9-12 November
C o p yr i g ht 2015, S A S I nstitute Inc. A l l r i g hts r eser v ed. INTRODUCTION TO SAS TEXT MINER
INTRODUCTION TO SAS TEXT MINER TODAY S AGENDA INTRODUCTION TO SAS TEXT MINER Define data mining Overview of SAS Enterprise Miner Describe text analytics and define text data mining Text Mining Process
DHL Data Mining Project. Customer Segmentation with Clustering
DHL Data Mining Project Customer Segmentation with Clustering Timothy TAN Chee Yong Aditya Hridaya MISRA Jeffery JI Jun Yao 3/30/2010 DHL Data Mining Project Table of Contents Introduction to DHL and the
DIGITS CENTER FOR DIGITAL INNOVATION, TECHNOLOGY, AND STRATEGY THOUGHT LEADERSHIP FOR THE DIGITAL AGE
DIGITS CENTER FOR DIGITAL INNOVATION, TECHNOLOGY, AND STRATEGY THOUGHT LEADERSHIP FOR THE DIGITAL AGE INTRODUCTION RESEARCH IN PRACTICE PAPER SERIES, FALL 2011. BUSINESS INTELLIGENCE AND PREDICTIVE ANALYTICS
ediscovery: The SAS Solution for ecrm
ediscovery: The SAS Solution for ecrm Alan Gormley E-Intelligence Program Manager SAS Europe, Middle East & Africa Infrastructure Intelligence Hits Today s Focus e-infrastructure 7/24 Response Time Secure
not possible or was possible at a high cost for collecting the data.
Data Mining and Knowledge Discovery Generating knowledge from data Knowledge Discovery Data Mining White Paper Organizations collect a vast amount of data in the process of carrying out their day-to-day
STATISTICA. Financial Institutions. Case Study: Credit Scoring. and
Financial Institutions and STATISTICA Case Study: Credit Scoring STATISTICA Solutions for Business Intelligence, Data Mining, Quality Control, and Web-based Analytics Table of Contents INTRODUCTION: WHAT
