Successfully Implementing Predictive Analytics in Direct Marketing
|
|
|
- Gregory Kelly
- 10 years ago
- Views:
Transcription
1 Successfully Implementing Predictive Analytics in Direct Marketing John Blackwell and Tracy DeCanio, The Nature Conservancy, Arlington, VA ABSTRACT Successfully Implementing Predictive Analytics in Direct Marketing An analysis of the benefits of modeling in direct marketing as well as a review of the dangers associated with common data mining mistakes. The presentation will review specific examples of how SAS data mining products have been successfully utilized at the one of the world's largest non-profit organizations to dramatically improve ROI. Specifically, this session will focus on 1) measuring the true value of predictive models and 2) preparing data for modeling and avoiding pitfalls and 3) building models and selecting the best for deployment and 4) the importance of industry expertise and questioning what the data are "saying" and 5) assessing the readiness of your organization in building analytics capacity in-house. INTRODUCTION As organizations scale back marketing budgets many are foregoing the expense associated with predictive analytics. However, the importance of analytics must be viewed in the context of being ready to bounce back when the economy rebounds as opposed to stressing that there can also be a short term benefit despite the current climate. While it is true that data-mining might not be able to completely reverse decreases in revenue, it can have immediate positive return on investment. This paper presents a case study for one of world's largest fundraising organizations who used analytics several years ago to mitigate the impact of a negative economic environment and continues to do so today in order to efficiently raise money. BACKGROUND The Nature Conservancy ( TNC ) is the world s largest environmental non-profit with a mission to preserve the plants, animals and natural communities that represent the diversity of life on Earth by protecting the lands and waters they need to survive. TNC is able to fund its mission primarily through fundraising from which it generates about five-hundred million dollars a year from its 3 million members. While data mining is used throughout the conservancy this paper will focus on its application in direct-mail campaigns. BEFORE ANALYTICS Prior to 2003, TNC s decisions about who to mail and how much to ask for were mostly done through gut-feel and rules of thumb that had often not been tested empirically. While the organization was mostly successful, events in 2003 including a weaker economy and negative press put downward pressure on TNC s ability to fundraise. Indeed, a membership base which had previously been growing steadily came to an abrupt halt. To maintain its reputation and continue its mission The Nature Conservancy needed to increase it revenues through increased efficiencies. The organization has met the challenge through making fact-based decisions with an emphasis on analytics. Today when TNC plans a direct mail solicitation they determine who to mail based on which donors are the most likely to respond with the largest gifts. The following SAS Enterprise Miner diagram shows a typical modeling development flow for a response: 1
2 Details of the value behind the ensemble model are discussed in more detail later in the paper. The score from the above model (using a binary target for response) is then multiplied by a predicted gift amount generated through a separate model. The resulting product is a prediction of gross revenue per mailing. The marketing team will then decide who to mail based on a chart summarizing the model results such as the sample below. The chart shows the donors ranked in deciles according to their model score and shows the predicted response, revenue per mailing sent, profit and the cost associated with raising each dollar. Model Rank Size of Group Resp. Rate Rev. Per Piece Profit 2 Cost Per $ Raised 1 96, % $7.46 $668,961 $ , % $2.67 $206,185 $ , % $1.70 $111,441 $ , % $1.24 $68,075 $ , % $0.98 $42,595 $ , % $0.75 $19,859 $ , % $0.55 $594 $ , % $0.41 ($11,857) $ , % $0.30 ($22,643) $ , % $0.20 ($32,891) $2.67
3 As illustrated above, the model is able to identify a significant portion of the available audience for any given mailing that is not profitable to mail (wherever the cost per dollar raised is greater than one dollar) in addition to identifying which donors will be the most profitable. MAKING THE CASE An initial hurdle in making the case for predictive modeling was to convince management that it would be beneficial not only to the organization but also to the membership population. Charitable giving is highly seasonal and while many people may prefer to give at the end of the year some are also more likely to give around tax time and others over the summer months. It was clear that we had to show that modeling could help to better understand the seasonality of the subsets of the membership population and could actually result in a decrease in the amount of direct mail sent without a corresponding decrease in revenue. For example a model could allow us to target our audience when they where ready to give as opposed to sending an expensive, steady stream of solicitations throughout the year. PROVING THE VALUE In their book Competing on Analytics, Davenport and Harris refer to the detour that some organizations need in order to prove the value of fact-based decisions before analytics can be fully implemented. At The Nature Conservancy there was much skepticism about the notion that a model could make better predictions than current business practices. For the first few campaigns that were modeled we randomly split the available audience into two segments and allowed the Marketing team to apply their usual selection criteria to the first half. For the second half, we used a model select the accounts to be mailed. At the completion of the campaign the results were analyzed and shown to the decision-makers. In all cases the modeled select significantly out-performed the standard select. The chart below shows how the modeling boosted revenue for monthly solicitation campaigns in the first year after the detour. The bars show dollar amount per piece sent and the line represents the percentage increase in total revenue per piece mailed associated with the model select. Model Standard Rev. Per Piece Increase $ % $ % $ % $ % $ % $ % $ % $ % $- 0% July September October November December January February April June While needing to prove the value of modeling took more time and resulted in foregone revenue, the results were definitive and allowed us to get the buy-in from management that we needed to expand the scope of analytics within the organization. 3
4 AVOIDING PITFALLS TOO MUCH DATA? The massive amounts of data that are now available about an organization s customers are one reason predictive modeling has been so effective. However, if not treated with caution, in some cases it can lead to a weaker model. Vendors that deal in customer data appends will often tout the hundreds of demographic and psychographic variables that can be appended to an individual s database record. In addition, the low cost of storage allows most companies to retain hundreds of data points associated with a customer relationship over time, which can be used in model development. However there is always the danger that correlations of marketing data in a development data set are merely noise and not predictors of future behavior. There are some well-known examples of chance correlations associated with political events. For example, from 1952 to 1976 when an American League team won the World Series a Republican would win the U.S. Presidency. A similar more recent example involving the Washington Redskins was true until the election of Examples such as the above are often cited as curiosities and are not necessarily taken seriously. However, because a correlation is plausible does not necessarily mean that it will provide any more value than the above. While data is the backbone of a strong predictive model in direct-marketing, it can also weaken models when the apparent correlation is not one that holds up. An article in the Economist on August 18, 2007 effectively sums up the problem: As the old saw has it, garbage in, garbage out. The difficulty comes when you do not know what garbage looks like. How do you know what variables are truly predictive and which are just correlated by chance in a development data set? While you can never say what is a true predictor with absolute certainty, there are several approaches that you can use to avoid using noise variables and to lessen their impact if they are selected by a modeling algorithm. The following approach can help to identify which variables are more likely than others to be meaningless and also to lessen the impact that the "garbage" has on a model. A CASE STUDY As illustration I created a dataset that contained a binary dependant variable and had about 50 potential independent variables taken from the marketing data base. For the purpose of providing an example, 100 random variables were appended to the actual data. The data were split into development, validation and test samples and an initial model was built only allowing for the random variables to be used as input variables. When run through a stepwise logistic regression and entropy-reduction decision tree model, each model selected between four and six variables. This example is merely to illustrate that even with high significance levels for variable selection, when the number of candidate variables is large enough some will be selected even in the event when there is nothing beyond noise. In the next scenario the same models were run using the random and the real input variables. In this case both models continue to select two of the random variables. However, the overall fit statistics on the test partition (as shown below) are strong and do not suggest any cause for concern. Model Test: Roc Index Test: Gini Coefficient Test: Kolmogorov-Smirnov Statistic Decision Tree Regression When re-running the model without the random variables we can see that the fit statistics on the test partition of the data set improve. In the case of the logistic regression the same non-random variables are selected, however, given the higher ROC index on the test data this suggests stronger coefficients. In the case of the decision tree, the random variables that had been selected were selected as the second to last split and removing them allowed the tree to grow differently, resulting in a stronger model. Since at any depth a decision tree will always look for the next most significant split without regard to the best possible combination of splits, if it is confused at one level by noise, predictive variables might never be selected. 4
5 Model Test: Roc Index Test: Gini Coefficient Test: Kolmogorov-Smirnov Statistic Decision Tree Regression The above examples serve to show that it is easy for noise variables to be selected by models, and the emphasis on any candidate predictor variable should be less about deciding whether the split is plausible and more about questioning whether the relationship is real. While it may be impractical to fully analyze every candidate variable for every model and still meet deadlines, one way to guard against noise is to perform in-depth analysis on new variables appended to the file as well as variables that are only selected by one particular modeling technique. Similar to the above, sometimes merely questioning the data and treating all relationships with scientific skepticism can be useful. A fairly well-known example of the pitfalls associated with not questioning what the data are saying is a look at the apparent correlation between SAT scores by state and the amount of money spent on education: The above chart shows that on the balance, the more spent on education the lower the test scores are. However, someone familiar with the subject would be able to explain that the correlation is not meaningful since the states that spend less on education are over-represented in states where students take the ACT as opposed to the SAT. In these states the students that take the SAT are often those that are applying to competitive east-coast schools that require the test, making the data useless as a display of the effect of money on education. The more familiar an analyst is with the industry and the data (s)he is analyzing, the less likely (s)he is to be fooled by an apparent variable that looks to be predictive as in the case above. In the SAT example perhaps it is counter-intuitive enough that most people would question it. However, if the data showed the trend reversed most people would probably never question the apparent relationship. Another effective approach to building strong models in spite of the preponderance of data is to lessen the impact of any one variable selected by a model by averaging or ensembling the scores of several models. Within any 5
6 individual model the emphasis is to make it satisfy a certain threshold using as few inputs as possible. However we have found that most of the strongest models that we have created use a large number of input variables and lessen the impact any one of them can have through averaging. For example, a simplified, yet illustrative model flow is shown below: This flow shows the development data being partitioned and fed through a neural network, regression and a decision tree node before the ensemble node averages the scores of all three individual models to create a new model. Since none of the models select the same set of independent variables the total number is larger, however, the averaging helps to ensure that no one variable has a particularly large impact on the final score. This is particularly true for variables with weaker correlations to the target since these are less likely to be selected by all three modeling techniques. And it is within these weaker yet important inputs where the noise data are most likely to be found. When the data set that combined the random and non-random variables was run through the model diagramed above the power of the ensemble model is significantly higher than any of the individual models. Random variables were still selected as predictors; however their influence was reduced. There are several reasons this could be the case including the fact that the different models treatment of the random variables largely cancel each other out when combined. The strength of ensembling is also true of the data set without the random variables. As displayed in the ROC chart below the green line represents the averaged model and the others show the individual models. 6
7 CONCLUSIONS While there are many pitfalls associated with modeling in direct marketing they can be avoided through a careful analysis of the data combined with modeling techniques that do not place too much significance on variables that may in fact be noise. Predictive analytics will almost always enable better business decisions. While customer or donor behavior may have changed due to the current economic climate, the value provided by a model will always be better than that driven by instinct or gut-feel. When organizations try and cut the expense associated with analytics it might be incumbent on the analysts to once again take the detour as described by Davenport and Harris to prove the value of analytics (even if this has been done previously) by setting up regular experiments to show the difference between the performance of modeled selections for mailings versus a non-modeled select. REFERENCES: Van den Poel Dirk, Larivière Bart (2004), Attrition Analysis for Financial Services Using Proportional Hazard Models, European Journal of Operational Research, 157 (1), Copas, JB (1983). Regression, prediction and shrinkage (with discussion). Journal of the Royal Statistical Society, B45, Derksen, S & Keselman, HJ (1992). Backward, forward and stepwise automated subset selection algorithms: Frequency of obtaining authentic and noise variables. British Journal of Mathematical and Statistical Psychology, 45, Hastie, T. and Tibshirani, R. (1996). Classification by pairwise coupling. Technical report, Stanford University and University of Toronto. ACKNOWLEDGMENTS SAS and all other SAS Institute Inc. product or service names are registered trademarks or trademarks of SAS Institute Inc. in the USA and other countries. indicates USA registration. CONTACT INFORMATION Your comments and questions are valued and encouraged. Contact the authors at: John Blackwell The Nature Conservancy 4245 N. Fairfax Drive Arlington, VA Work Phone: [email protected] Web: Tracy DeCanio The Nature Conservancy 4245 N. Fairfax Drive Arlington, VA Work Phone: [email protected] Web: 7
Gerry Hobbs, Department of Statistics, West Virginia University
Decision Trees as a Predictive Modeling Method Gerry Hobbs, Department of Statistics, West Virginia University Abstract Predictive modeling has become an important area of interest in tasks such as credit
Nine Common Types of Data Mining Techniques Used in Predictive Analytics
1 Nine Common Types of Data Mining Techniques Used in Predictive Analytics By Laura Patterson, President, VisionEdge Marketing Predictive analytics enable you to develop mathematical models to help better
Using Adaptive Random Trees (ART) for optimal scorecard segmentation
A FAIR ISAAC WHITE PAPER Using Adaptive Random Trees (ART) for optimal scorecard segmentation By Chris Ralph Analytic Science Director April 2006 Summary Segmented systems of models are widely recognized
A Property & Casualty Insurance Predictive Modeling Process in SAS
Paper AA-02-2015 A Property & Casualty Insurance Predictive Modeling Process in SAS 1.0 ABSTRACT Mei Najim, Sedgwick Claim Management Services, Chicago, Illinois Predictive analytics has been developing
Applied Data Mining Analysis: A Step-by-Step Introduction Using Real-World Data Sets
Applied Data Mining Analysis: A Step-by-Step Introduction Using Real-World Data Sets http://info.salford-systems.com/jsm-2015-ctw August 2015 Salford Systems Course Outline Demonstration of two classification
A Comparison of Decision Tree and Logistic Regression Model Xianzhe Chen, North Dakota State University, Fargo, ND
Paper D02-2009 A Comparison of Decision Tree and Logistic Regression Model Xianzhe Chen, North Dakota State University, Fargo, ND ABSTRACT This paper applies a decision tree model and logistic regression
Data Mining Applications in Higher Education
Executive report Data Mining Applications in Higher Education Jing Luan, PhD Chief Planning and Research Officer, Cabrillo College Founder, Knowledge Discovery Laboratories Table of contents Introduction..............................................................2
Easily Identify Your Best Customers
IBM SPSS Statistics Easily Identify Your Best Customers Use IBM SPSS predictive analytics software to gain insight from your customer database Contents: 1 Introduction 2 Exploring customer data Where do
Easily Identify the Right Customers
PASW Direct Marketing 18 Specifications Easily Identify the Right Customers You want your marketing programs to be as profitable as possible, and gaining insight into the information contained in your
IBM SPSS Direct Marketing
IBM Software IBM SPSS Statistics 19 IBM SPSS Direct Marketing Understand your customers and improve marketing campaigns Highlights With IBM SPSS Direct Marketing, you can: Understand your customers in
Marketing Strategies for Retail Customers Based on Predictive Behavior Models
Marketing Strategies for Retail Customers Based on Predictive Behavior Models Glenn Hofmann HSBC Salford Systems Data Mining 2005 New York, March 28 30 0 Objectives Inform about effective approach to direct
Data Mining: An Overview of Methods and Technologies for Increasing Profits in Direct Marketing. C. Olivia Rud, VP, Fleet Bank
Data Mining: An Overview of Methods and Technologies for Increasing Profits in Direct Marketing C. Olivia Rud, VP, Fleet Bank ABSTRACT Data Mining is a new term for the common practice of searching through
Modeling Lifetime Value in the Insurance Industry
Modeling Lifetime Value in the Insurance Industry C. Olivia Parr Rud, Executive Vice President, Data Square, LLC ABSTRACT Acquisition modeling for direct mail insurance has the unique challenge of targeting
By Peter Schoewe, Director of Analytics Mal Warwick Donordigital
Measuring YOur return On investment in Multichannel fundraising campaigns By Peter Schoewe, Director of Analytics Mal Warwick Integrated fundraising, advocacy and marketing in a multichannel nonprofit
Predictive Modeling of Titanic Survivors: a Learning Competition
SAS Analytics Day Predictive Modeling of Titanic Survivors: a Learning Competition Linda Schumacher Problem Introduction On April 15, 1912, the RMS Titanic sank resulting in the loss of 1502 out of 2224
A Property and Casualty Insurance Predictive Modeling Process in SAS
Paper 11422-2016 A Property and Casualty Insurance Predictive Modeling Process in SAS Mei Najim, Sedgwick Claim Management Services ABSTRACT Predictive analytics is an area that has been developing rapidly
STATISTICA. Financial Institutions. Case Study: Credit Scoring. and
Financial Institutions and STATISTICA Case Study: Credit Scoring STATISTICA Solutions for Business Intelligence, Data Mining, Quality Control, and Web-based Analytics Table of Contents INTRODUCTION: WHAT
An Overview of Data Mining: Predictive Modeling for IR in the 21 st Century
An Overview of Data Mining: Predictive Modeling for IR in the 21 st Century Nora Galambos, PhD Senior Data Scientist Office of Institutional Research, Planning & Effectiveness Stony Brook University AIRPO
Role of Customer Response Models in Customer Solicitation Center s Direct Marketing Campaign
Role of Customer Response Models in Customer Solicitation Center s Direct Marketing Campaign Arun K Mandapaka, Amit Singh Kushwah, Dr.Goutam Chakraborty Oklahoma State University, OK, USA ABSTRACT Direct
Paper AA-08-2015. Get the highest bangs for your marketing bucks using Incremental Response Models in SAS Enterprise Miner TM
Paper AA-08-2015 Get the highest bangs for your marketing bucks using Incremental Response Models in SAS Enterprise Miner TM Delali Agbenyegah, Alliance Data Systems, Columbus, Ohio 0.0 ABSTRACT Traditional
Data Mining Methods: Applications for Institutional Research
Data Mining Methods: Applications for Institutional Research Nora Galambos, PhD Office of Institutional Research, Planning & Effectiveness Stony Brook University NEAIR Annual Conference Philadelphia 2014
Big Data Analytics. Benchmarking SAS, R, and Mahout. Allison J. Ames, Ralph Abbey, Wayne Thompson. SAS Institute Inc., Cary, NC
Technical Paper (Last Revised On: May 6, 2013) Big Data Analytics Benchmarking SAS, R, and Mahout Allison J. Ames, Ralph Abbey, Wayne Thompson SAS Institute Inc., Cary, NC Accurate and Simple Analysis
Predicting Churn. A SAS White Paper
A SAS White Paper Table of Contents Introduction......................................................................... 1 The Price of Churn...................................................................
Leveraging Ensemble Models in SAS Enterprise Miner
ABSTRACT Paper SAS133-2014 Leveraging Ensemble Models in SAS Enterprise Miner Miguel Maldonado, Jared Dean, Wendy Czika, and Susan Haller SAS Institute Inc. Ensemble models combine two or more models to
Data Mining Applications in Fund Raising
Data Mining Applications in Fund Raising Nafisseh Heiat Data mining tools make it possible to apply mathematical models to the historical data to manipulate and discover new information. In this study,
Target Analytics Nonprofit Cooperative Database
Target Analytics Nonprofit Cooperative Database Solution Overview Target Analytics Nonprofit Cooperative Database Lists and Predictive Models for Effective Fundraising You know how challenging it is to
IBM SPSS Direct Marketing 23
IBM SPSS Direct Marketing 23 Note Before using this information and the product it supports, read the information in Notices on page 25. Product Information This edition applies to version 23, release
DECISION TREE ANALYSIS: PREDICTION OF SERIOUS TRAFFIC OFFENDING
DECISION TREE ANALYSIS: PREDICTION OF SERIOUS TRAFFIC OFFENDING ABSTRACT The objective was to predict whether an offender would commit a traffic offence involving death, using decision tree analysis. Four
IBM SPSS Direct Marketing 22
IBM SPSS Direct Marketing 22 Note Before using this information and the product it supports, read the information in Notices on page 25. Product Information This edition applies to version 22, release
Chapter 7: Data Mining
Chapter 7: Data Mining Overview Topics discussed: The Need for Data Mining and Business Value The Data Mining Process: Define Business Objectives Get Raw Data Identify Relevant Predictive Variables Gain
Optimizing Enrollment Management with Predictive Modeling
Optimizing Enrollment Management with Predictive Modeling Tips and Strategies for Getting Started with Predictive Analytics in Higher Education an ebook presented by Optimizing Enrollment with Predictive
How To Make A Credit Risk Model For A Bank Account
TRANSACTIONAL DATA MINING AT LLOYDS BANKING GROUP Csaba Főző [email protected] 15 October 2015 CONTENTS Introduction 04 Random Forest Methodology 06 Transactional Data Mining Project 17 Conclusions
Developing Credit Scorecards Using Credit Scoring for SAS Enterprise Miner TM 12.1
Developing Credit Scorecards Using Credit Scoring for SAS Enterprise Miner TM 12.1 SAS Documentation The correct bibliographic citation for this manual is as follows: SAS Institute Inc. 2012. Developing
Business Intelligence. Tutorial for Rapid Miner (Advanced Decision Tree and CRISP-DM Model with an example of Market Segmentation*)
Business Intelligence Professor Chen NAME: Due Date: Tutorial for Rapid Miner (Advanced Decision Tree and CRISP-DM Model with an example of Market Segmentation*) Tutorial Summary Objective: Richard would
Data Mining with SAS. Mathias Lanner [email protected]. Copyright 2010 SAS Institute Inc. All rights reserved.
Data Mining with SAS Mathias Lanner [email protected] Copyright 2010 SAS Institute Inc. All rights reserved. Agenda Data mining Introduction Data mining applications Data mining techniques SEMMA
Alex Vidras, David Tysinger. Merkle Inc.
Using PROC LOGISTIC, SAS MACROS and ODS Output to evaluate the consistency of independent variables during the development of logistic regression models. An example from the retail banking industry ABSTRACT
COPYRIGHT 2012 VERTICURL WHITEPAPER: TOP MISTAKES TO AVOID WHEN BUILDING A DEMAND CENTER
COPYRIGHT 2012 VERTICURL WHITEPAPER: TOP MISTAKES TO AVOID WHEN BUILDING A DEMAND CENTER For many B2B organizations, building a demand center is a no-brainer. Learn how to ensure you re successful by avoiding
Predictive Analytics for Donor Management
IBM Software Business Analytics IBM SPSS Predictive Analytics Predictive Analytics for Donor Management Predictive Analytics for Donor Management Contents 2 Overview 3 The challenges of donor management
NFP. Capitalizing on merge strategies to boost your return on donor marketing
NFP Capitalizing on merge strategies to boost your return on donor marketing April 2014 Contents Introduction 3 Improving your return with new merge strategies 4 The shrinking exchange universe 4 Determining
Application of SAS! Enterprise Miner in Credit Risk Analytics. Presented by Minakshi Srivastava, VP, Bank of America
Application of SAS! Enterprise Miner in Credit Risk Analytics Presented by Minakshi Srivastava, VP, Bank of America 1 Table of Contents Credit Risk Analytics Overview Journey from DATA to DECISIONS Exploratory
Hurwitz ValuePoint: Predixion
Predixion VICTORY INDEX CHALLENGER Marcia Kaufman COO and Principal Analyst Daniel Kirsch Principal Analyst The Hurwitz Victory Index Report Predixion is one of 10 advanced analytics vendors included in
Nature Conservancy of Canada. Position Profile. Vice President, Development & Marketing
Position Profile Vice President, Development & Marketing Contacts: President: Deborah Legrove Senior Consultant: Linda Samis, Senior Consultant Email: [email protected] Phone: 416.781.3618 1.0
Data Mining: An Introduction
Data Mining: An Introduction Michael J. A. Berry and Gordon A. Linoff. Data Mining Techniques for Marketing, Sales and Customer Support, 2nd Edition, 2004 Data mining What promotions should be targeted
Data Mining in CRM & Direct Marketing. Jun Du The University of Western Ontario [email protected]
Data Mining in CRM & Direct Marketing Jun Du The University of Western Ontario [email protected] Outline Why CRM & Marketing Goals in CRM & Marketing Models and Methodologies Case Study: Response Model Case
Working with telecommunications
Working with telecommunications Minimizing churn in the telecommunications industry Contents: 1 Churn analysis using data mining 2 Customer churn analysis with IBM SPSS Modeler 3 Types of analysis 3 Feature
analytics stone Automated Analytics and Predictive Modeling A White Paper by Stone Analytics
stone analytics Automated Analytics and Predictive Modeling A White Paper by Stone Analytics 3665 Ruffin Road, Suite 300 San Diego, CA 92123 (858) 503-7540 www.stoneanalytics.com Page 1 Automated Analytics
A Marketer s Guide to Analytics
A Marketer s Guide to Analytics Using Analytics to Make Smarter Marketing Decisions and Maximize Results WHITE PAPER SAS White Paper Table of Contents Executive Summary.... 1 The World of Marketing Is
KnowledgeSEEKER Marketing Edition
KnowledgeSEEKER Marketing Edition Predictive Analytics for Marketing The Easiest to Use Marketing Analytics Tool KnowledgeSEEKER Marketing Edition is a predictive analytics tool designed for marketers
A fast, powerful data mining workbench designed for small to midsize organizations
FACT SHEET SAS Desktop Data Mining for Midsize Business A fast, powerful data mining workbench designed for small to midsize organizations What does SAS Desktop Data Mining for Midsize Business do? Business
A Basic Guide to Modeling Techniques for All Direct Marketing Challenges
A Basic Guide to Modeling Techniques for All Direct Marketing Challenges Allison Cornia Database Marketing Manager Microsoft Corporation C. Olivia Rud Executive Vice President Data Square, LLC Overview
Methods for Interaction Detection in Predictive Modeling Using SAS Doug Thompson, PhD, Blue Cross Blue Shield of IL, NM, OK & TX, Chicago, IL
Paper SA01-2012 Methods for Interaction Detection in Predictive Modeling Using SAS Doug Thompson, PhD, Blue Cross Blue Shield of IL, NM, OK & TX, Chicago, IL ABSTRACT Analysts typically consider combinations
Better credit models benefit us all
Better credit models benefit us all Agenda Credit Scoring - Overview Random Forest - Overview Random Forest outperform logistic regression for credit scoring out of the box Interaction term hypothesis
Numerical Algorithms Group
Title: Summary: Using the Component Approach to Craft Customized Data Mining Solutions One definition of data mining is the non-trivial extraction of implicit, previously unknown and potentially useful
Reevaluating Policy and Claims Analytics: a Case of Non-Fleet Customers In Automobile Insurance Industry
Paper 1808-2014 Reevaluating Policy and Claims Analytics: a Case of Non-Fleet Customers In Automobile Insurance Industry Kittipong Trongsawad and Jongsawas Chongwatpol NIDA Business School, National Institute
Comparing the Results of Support Vector Machines with Traditional Data Mining Algorithms
Comparing the Results of Support Vector Machines with Traditional Data Mining Algorithms Scott Pion and Lutz Hamel Abstract This paper presents the results of a series of analyses performed on direct mail
Reducing the Costs of Employee Churn with Predictive Analytics
Reducing the Costs of Employee Churn with Predictive Analytics How Talent Analytics helped a large financial services firm save more than $4 million a year Employee churn can be massively expensive, and
Data Mining Algorithms Part 1. Dejan Sarka
Data Mining Algorithms Part 1 Dejan Sarka Join the conversation on Twitter: @DevWeek #DW2015 Instructor Bio Dejan Sarka ([email protected]) 30 years of experience SQL Server MVP, MCT, 13 books 7+ courses
Insurance Analytics - analýza dat a prediktivní modelování v pojišťovnictví. Pavel Kříž. Seminář z aktuárských věd MFF 4.
Insurance Analytics - analýza dat a prediktivní modelování v pojišťovnictví Pavel Kříž Seminář z aktuárských věd MFF 4. dubna 2014 Summary 1. Application areas of Insurance Analytics 2. Insurance Analytics
Identifying SPAM with Predictive Models
Identifying SPAM with Predictive Models Dan Steinberg and Mikhaylo Golovnya Salford Systems 1 Introduction The ECML-PKDD 2006 Discovery Challenge posed a topical problem for predictive modelers: how to
The Predictive Data Mining Revolution in Scorecards:
January 13, 2013 StatSoft White Paper The Predictive Data Mining Revolution in Scorecards: Accurate Risk Scoring via Ensemble Models Summary Predictive modeling methods, based on machine learning algorithms
THE HYBRID CART-LOGIT MODEL IN CLASSIFICATION AND DATA MINING. Dan Steinberg and N. Scott Cardell
THE HYBID CAT-LOGIT MODEL IN CLASSIFICATION AND DATA MINING Introduction Dan Steinberg and N. Scott Cardell Most data-mining projects involve classification problems assigning objects to classes whether
Benchmarking of different classes of models used for credit scoring
Benchmarking of different classes of models used for credit scoring We use this competition as an opportunity to compare the performance of different classes of predictive models. In particular we want
2014-15 Donor Management Systems Use and Satisfaction Study
2014-15 Donor Management Systems Use and Satisfaction Study Lehman Associates, LLC 2308 Mt. Vernon Ave, Suite 713 Alexandria, VA 22301 703-373-7550 www.lehmanconsulting.com / www.lehmanreports.com [email protected]
Data-Driven Decisions: Role of Operations Research in Business Analytics
Data-Driven Decisions: Role of Operations Research in Business Analytics Dr. Radhika Kulkarni Vice President, Advanced Analytics R&D SAS Institute April 11, 2011 Welcome to the World of Analytics! Lessons
KnowledgeSTUDIO HIGH-PERFORMANCE PREDICTIVE ANALYTICS USING ADVANCED MODELING TECHNIQUES
HIGH-PERFORMANCE PREDICTIVE ANALYTICS USING ADVANCED MODELING TECHNIQUES Translating data into business value requires the right data mining and modeling techniques which uncover important patterns within
M15_BERE8380_12_SE_C15.7.qxd 2/21/11 3:59 PM Page 1. 15.7 Analytics and Data Mining 1
M15_BERE8380_12_SE_C15.7.qxd 2/21/11 3:59 PM Page 1 15.7 Analytics and Data Mining 15.7 Analytics and Data Mining 1 Section 1.5 noted that advances in computing processing during the past 40 years have
Classification of Bad Accounts in Credit Card Industry
Classification of Bad Accounts in Credit Card Industry Chengwei Yuan December 12, 2014 Introduction Risk management is critical for a credit card company to survive in such competing industry. In addition
Enhancing Compliance with Predictive Analytics
Enhancing Compliance with Predictive Analytics FTA 2007 Revenue Estimation and Research Conference Reid Linn Tennessee Department of Revenue [email protected] Sifting through a Gold Mine of Tax Data
Better planning and forecasting with IBM Predictive Analytics
IBM Software Business Analytics SPSS Predictive Analytics Better planning and forecasting with IBM Predictive Analytics Using IBM Cognos TM1 with IBM SPSS Predictive Analytics to build better plans and
Virtual Site Event. Predictive Analytics: What Managers Need to Know. Presented by: Paul Arnest, MS, MBA, PMP February 11, 2015
Virtual Site Event Predictive Analytics: What Managers Need to Know Presented by: Paul Arnest, MS, MBA, PMP February 11, 2015 1 Ground Rules Virtual Site Ground Rules PMI Code of Conduct applies for this
!"!!"#$$%&'()*+$(,%!"#$%$&'()*""%(+,'-*&./#-$&'(-&(0*".$#-$1"(2&."3$'45"
!"!!"#$$%&'()*+$(,%!"#$%$&'()*""%(+,'-*&./#-$&'(-&(0*".$#-$1"(2&."3$'45"!"#"$%&#'()*+',$$-.&#',/"-0%.12'32./4'5,5'6/%&)$).2&'7./&)8'5,5'9/2%.%3%&8':")08';:
Public, Private and Hybrid Clouds
Public, Private and Hybrid Clouds When, Why and How They are Really Used Sponsored by: Research Summary 2013 Neovise, LLC. All Rights Reserved. [i] Table of Contents Table of Contents... 1 i Executive
BIDM Project. Predicting the contract type for IT/ITES outsourcing contracts
BIDM Project Predicting the contract type for IT/ITES outsourcing contracts N a n d i n i G o v i n d a r a j a n ( 6 1 2 1 0 5 5 6 ) The authors believe that data modelling can be used to predict if an
SUSTAINABILITY & EMPLOYEE ENGAGEMENT
SUSTAINABILITY & EMPLOYEE ENGAGEMENT The State of The Art By John Davies VP Senior Analyst, GreenBiz September 2014 Finding Common Ground with HR Sustainability professionals usually think of their challenges
IBM SPSS Direct Marketing 19
IBM SPSS Direct Marketing 19 Note: Before using this information and the product it supports, read the general information under Notices on p. 105. This document contains proprietary information of SPSS
Why Nonprofits Need Nonprofit Accounting Software
Why Nonprofits Need Nonprofit Accounting Software % CONTENTS Executive Summary... 3 The Benefits of a Nonprofit Accounting System... 4 Unique Regulations and Standards for Unique Solutions... 4 Supporting
Prediction of Stock Performance Using Analytical Techniques
136 JOURNAL OF EMERGING TECHNOLOGIES IN WEB INTELLIGENCE, VOL. 5, NO. 2, MAY 2013 Prediction of Stock Performance Using Analytical Techniques Carol Hargreaves Institute of Systems Science National University
Internet Gambling Behavioral Markers: Using the Power of SAS Enterprise Miner 12.1 to Predict High-Risk Internet Gamblers
Paper 1863-2014 Internet Gambling Behavioral Markers: Using the Power of SAS Enterprise Miner 12.1 to Predict High-Risk Internet Gamblers Sai Vijay Kishore Movva, Vandana Reddy and Dr. Goutam Chakraborty;
Reject Inference Methodologies in Credit Risk Modeling Derek Montrichard, Canadian Imperial Bank of Commerce, Toronto, Canada
Paper ST-160 Reject Inference Methodologies in Credit Risk Modeling Derek Montrichard, Canadian Imperial Bank of Commerce, Toronto, Canada ABSTRACT In the banking industry, scorecards are commonly built
A Hybrid Modeling Platform to meet Basel II Requirements in Banking Jeffery Morrision, SunTrust Bank, Inc.
A Hybrid Modeling Platform to meet Basel II Requirements in Banking Jeffery Morrision, SunTrust Bank, Inc. Introduction: The Basel Capital Accord, ready for implementation in force around 2006, sets out
Credit Risk Analysis Using Logistic Regression Modeling
Credit Risk Analysis Using Logistic Regression Modeling Introduction A loan officer at a bank wants to be able to identify characteristics that are indicative of people who are likely to default on loans,
Using Statistical Modeling to Increase
White Paper Using Statistical Modeling to Increase Donations Executive Summary Nonprofits increasingly rely on statistical modeling to help them target their best prospects and strengthen their fundraising
WebFOCUS RStat. RStat. Predict the Future and Make Effective Decisions Today. WebFOCUS RStat
Information Builders enables agile information solutions with business intelligence (BI) and integration technologies. WebFOCUS the most widely utilized business intelligence platform connects to any enterprise
Predictive Analytics Certificate Program
Information Technologies Programs Predictive Analytics Certificate Program Accelerate Your Career Offered in partnership with: University of California, Irvine Extension s professional certificate and
Model Validation Techniques
Model Validation Techniques Kevin Mahoney, FCAS kmahoney@ travelers.com CAS RPM Seminar March 17, 2010 Uses of Statistical Models in P/C Insurance Examples of Applications Determine expected loss cost
Business Intelligence Solutions for Gaming and Hospitality
Business Intelligence Solutions for Gaming and Hospitality Prepared by: Mario Perkins Qualex Consulting Services, Inc. Suzanne Fiero SAS Objective Summary 2 Objective Summary The rise in popularity and
