True-Lift Modeling: Mining for the Most Truly Responsive Customers and Prospects

Similar documents
Direct Marketing Profit Model. Bruce Lund, Marketing Associates, Detroit, Michigan and Wilmington, Delaware

Paper AA Get the highest bangs for your marketing bucks using Incremental Response Models in SAS Enterprise Miner TM

Data Mining Techniques Chapter 4: Data Mining Applications in Marketing and Customer Relationship Management

Eric Siegel, Ph.D. Program Chair. Predictive Analytics World. Predictive Analytics World

Nine Common Types of Data Mining Techniques Used in Predictive Analytics

Improve Marketing Campaign ROI using Uplift Modeling. Ryan Zhao

Stochastic Solutions

Customer Sensitivity to Credit Risk Decisions

Using Control Groups to Target on Predicted Lift:

Alignment and Preprocessing for Data Analysis

How To Perform An Ensemble Analysis

Finding Your Audience An analytic study to understand the impact of audience targeting and content channels on digital advertising yield

Better credit models benefit us all

MIC - Detecting Novel Associations in Large Data Sets. by Nico Güttler, Andreas Ströhlein and Matt Huska

Best Practice ROI Marketing

DIRECT MAIL: MEASURING VOLATILITY WITH CRYSTAL BALL

Data Mining: An Introduction

Driving Value From Big Data

Uncovering Hidden Profits: Direct Marketing Dmitry Krass. Custometrics Inc.

Application of SAS! Enterprise Miner in Credit Risk Analytics. Presented by Minakshi Srivastava, VP, Bank of America

Knowledge Discovery and Data Mining. Bootstrap review. Bagging Important Concepts. Notes. Lecture 19 - Bagging. Tom Kelsey. Notes

An Overview of Predictive Analytics for Practitioners. Dean Abbott, Abbott Analytics

Advanced Analytics for Call Center Operations

The Quantcast Display Play-By-Play. Unlocking the Value of Display Advertising

Justifying Marketing Automation and Pilot Program

A Property & Casualty Insurance Predictive Modeling Process in SAS

LEAD GENERATION METRICS

A PARADIGM FOR DEVELOPING BETTER MEASURES OF MARKETING CONSTRUCTS

Direct Marketing When There Are Voluntary Buyers

Data Mining: Overview. What is Data Mining?

A NURSING CARE PLAN RECOMMENDER SYSTEM USING A DATA MINING APPROACH

Marketing Strategies for Retail Customers Based on Predictive Behavior Models

IBM SPSS Direct Marketing 23

To order the book click on the link,

IBM SPSS Direct Marketing 22

Data Mining Techniques Chapter 6: Decision Trees

FASTSTATS PEOPLESTAGE

Improving the Performance of Data Mining Models with Data Preparation Using SAS Enterprise Miner Ricardo Galante, SAS Institute Brasil, São Paulo, SP

A Property and Casualty Insurance Predictive Modeling Process in SAS

Easily Identify the Right Customers

Market Assessment & Campaign SLA Calculator LOGO WE OPEN THE DOOR, SO YOU CAN CLOSE IT.

Alex Vidras, David Tysinger. Merkle Inc.


Experian TAPS SM Total Annual Plastic Spend

3-Step Competency Prioritization Sequence

SOCIAL NETWORK ANALYSIS EVALUATING THE CUSTOMER S INFLUENCE FACTOR OVER BUSINESS EVENTS

Data Mining: An Overview of Methods and Technologies for Increasing Profits in Direct Marketing. C. Olivia Rud, VP, Fleet Bank

KnowledgeSEEKER Marketing Edition

Stata for micro targeting using C++ and ODBC

Big Data Analytics of Multi-Relationship Online Social Network Based on Multi-Subnet Composited Complex Network

Graph Mining and Social Network Analysis

Channel ROI How we do it: a step-by-step approach

Targeted Marketing, KDD Cup and Customer Modeling

FastStats PeopleStage. Campaign Management & Automation Software

Performance Metrics for Graph Mining Tasks

Characterizing Task Usage Shapes in Google s Compute Clusters

Successfully Implementing Predictive Analytics in Direct Marketing

Factors to Describe Job Shop Scheduling Problem

ASSIGNMENT 4 PREDICTIVE MODELING AND GAINS CHARTS

Using Pragmatic Data Mining Approaches for Donor Acquisition Programs

Maximizing Return and Minimizing Cost with the Decision Management Systems

Data Mining is sometimes referred to as KDD and DM and KDD tend to be used as synonyms

CRM Analytics for Telecommunications

Applying Customer Attitudinal Segmentation to Improve Marketing Campaigns Wenhong Wang, Deluxe Corporation Mark Antiel, Deluxe Corporation

Survey Analysis: Data Mining versus Standard Statistical Analysis for Better Analysis of Survey Responses

Metric of the Month: Tickets per User per Month

How To Manage Marketing With A Cloud Based Software

Data Mining with SAS. Mathias Lanner Copyright 2010 SAS Institute Inc. All rights reserved.

How To Make A Credit Risk Model For A Bank Account

Predictive Analytics for Database Marketing

Supplementary PROCESS Documentation

Using Ensemble of Decision Trees to Forecast Travel Time

Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics.

1 Choosing the right data mining techniques for the job (8 minutes,

Driving Insurance World through Science Murli D. Buluswar Chief Science Officer

Cross-Validation. Synonyms Rotation estimation

Determining optimum insurance product portfolio through predictive analytics BADM Final Project Report

1. Suppose that a score on a final exam depends upon attendance and unobserved factors that affect exam performance (such as student ability).

Myth or Fact: The Diminishing Marginal Returns of Variable Creation in Data Mining Solutions

Data Mining Cluster Analysis: Advanced Concepts and Algorithms. Lecture Notes for Chapter 9. Introduction to Data Mining

MarkerView Software for Metabolomic and Biomarker Profiling Analysis

Modeling Lifetime Value in the Insurance Industry

Social Media Mining. Data Mining Essentials

How Organisations Are Using Data Mining Techniques To Gain a Competitive Advantage John Spooner SAS UK

Quantitative Methods for Finance

Transcription:

True-Lift Modeling: Mining for the Most Truly Responsive Customers and Prospects Kathleen Kane Jane Zheng Victor Lo 1 Alex Arias-Vargas Fidelity Investments 1 Also with Bentley University New York City October 19-20 2011

2

Stop spending direct marketing dollars on customers who would purchase anyway! True-lift modeling can identify: which customers will purchase without receiving a marketing contact which customers need a direct marketing nudge to make a purchase which customers have a negative reaction to marketing (and purchase less if contacted) This discussion will describe: the basic requirements needed to succeed with true-lift modeling scenarios where this modeling method is most applicable the pros and cons of various approaches to true-lift modeling 3

Outline Why do we need true-lift modeling? 10min What are the methods of true-lift modeling? 10min What is the context where true-lift modeling is most necessary & useful? 10min Questions 10min 4

What s wrong with this picture? Pct of Treatment Responders CUME Pct of Responders Random 100% 50% 0% 0% 50% 100% Pct of Treatment Group 14% 7% Incidence of Treatment Responders 4% 2% 1% 1% 0% 0% 0% 0% 1 2 3 4 5 6 7 8 9 10 Decile A successful response model A successful marketing campaign 5

Measuring response models by lift over control Treatment "Responders" Control "Responders" Treatment "Responders" Control "Responders" Treatment "Responders" Control "Responders" 1 Decile 10 LIFT 1 Decile 10 LIFT 1 Decile 10 LIFT 1 Decile 10 1 Decile 10 1 Decile 10 Simulated data 6

Why do we need true-lift modeling? Look-alike model = find people who will take Action A =P(A) Standard response model = find people who will take Action A after receiving a treatment =P(A Treatment) True-lift model = find people who will take Action A only after receiving a treatment =P(A Treatment) P(A no Treatment) Standard response models often behave more like Lookalike models than like True-lift models Why spend marketing $$$ on people who would do Action A anyway? 7

Why do we need true-lift modeling? Look-alike model = find people who will take Action A True-lift model = find people who will take Action A only after receiving a treatment When is a look-alike model good enough? When is a true-lift model needed? Ø Responders can only take Action A if they receive one unique marketing contact Ø Responders have many opportunities to take Action A Ø Single channel Ø Multiple channels Ø Single contact Ø Multiple contacts Ø No other way to take Action A 8

The True-Lift model objective Maximize the Treatment responders while minimizing the control responders True lift Treatment "Responders" Control "Responders" True lift Treatment "Responders" Control "Responders" 1 2 3 4 5 6 7 8 9 10 Decile A standard response model 1 2 3 4 5 6 7 8 9 10 Decile A True-Lift response model Simulated data 9

True-Lift model solutions A. Difference of two models: Treatment Control B. Two sequential models: Treatment Actual Control Prediction C. Binned & Averaged dependent variable 10

Solution A1: Difference of two models: Treatment - Control Model 1 predicts P(A Treatment) Dependent variable = Action A Model Population = Treatment Group Model 2 predicts P(A no Treatment) Dependent variable = Action A Model Population = Control Group Final prediction of lift = Model 1 Score Model 2 Score Pros: simple concept, familiar execution (x2) P(A no Treatment) Cons: indirectly models true-lift, the difference may be only noise, 2x the work, scales may not compare, 2x the error, variable reduction done on indirect dependent vars 11

Solution A2: Single combined model using Treatment interactions Model population = Treatment & Control together Dependent variable = Action A Independent variables are attributes x,y,z: Conceptually: Lift P(A no Treatment) P(Action A) = P(A not Treated) + P(A only if Treated ) = {some coefficients} * {x,y,z} + 0/1 treatment flag * {some coefficients } * {x,y,z} During model development, the interaction flag is 0 for control records and 1 for treatment records Final prediction of lift = difference of two scores = Prob(response if Treated) Prob(response if not Treated) = score with interaction flag set to 1 score with interaction flag set to 0 Pros: combined model minimizes compounded errors Cons: indirectly models true-lift; large number of independent terms; collinearity of terms; reduction needed; adding two model scores may compound errors Lo, Victor S.Y. The True Lift Model - A Novel Data Mining Approach to Response Modeling in Database Marketing. SIGKDD Explorations, Volume 4, Issue 2, Dec 2002, p.78-86. 12

Solution B: Two sequential models: Treatment actual Control prediction Model 1 predicts P(A no Treatment) Dependent variable = Action A Model Population = Control Group Model 2 predicts P(A Treatment) P(A no Treatment) Dependent variable = Action A Model 1 Score Model Population = Treatment Group Final prediction of lift = Model 2 Score P(A no Treatment) Pros: more directly models true-lift; identifies variables that are directly correlated with true-lift (some of which are drivers of lift) Cons: the Model 2 dependent variable contains Model 1 errors; 2x the work, Model 1 scores and Action A should (but might not) share the same scale 13

Solution C1: Binned & averaged dependent variable Model 1 predicts P(A no Treatment) Dependent variable = Action A Model Population = Control Group Create N bins for Treatment & Control population together, ranked by Model 1 score (control response ) Calculate dependent variable value for each BIN: Treatment response rate Control response rate [Could stop here, using the bin average lift as the predicted lift, or continue with]: Model 2 predicts actual average lift of each bin Dependent variable = Average lift within each bin Model Population = Treatment Group Final prediction of lift = Model 2 Score Pros: directly models true-lift; identifies variables that are directly correlated with true-lift (some of which are drivers of true-lift) Cons: 2X the work; the approach requires variation in average lift across bins (which might not exist); control response needs to be correlated to true-lift response 14 P(A) no Treatment

Solution C2: Solution A or B + binned & averaged dependent variable Complete Solution A or B first to rank-order observations by estimated lift Use Solution A/B model score to rank and bin the observations: create N bins for Treatment & Control population together, ranked by Solution A/B score Calculate dependent variable value for each BIN: Treatment response rate Control response rate [Could stop here, using the bin average lift as the predicted lift, or continue with]: Est P(true-lift) from Solution A or B Model 3 predicts actual average lift of each bin Dependent variable = Average lift within each bin Model Population = Treatment Group Final prediction of lift = Model 3 Score Pros: directly models true-lift; this approach is more likely to maximize the variation in average lift across bins; identifies variables that are directly correlated with lift (some of which are drivers of lift) Cons: 3X the work 15

Standard response model Solution A2: Single combined model with interactions Solution B: Depvar = Treatment actual Control prediction Solution C1: Ranked & binned by Control model Solution C2: Ranking & binned by Lift model 1.3% 0.5% 1 Decile 10 1.0% 1.0% 1 Decile 10 1.0% 0.8% 1 Decile 10 0.6% 1.1% 1 Decile 10 0.5% Treatment Control 1.2% 1 Decile 10 0.09% Simulated data True Lift 0.25% 1 Decile 10 0.55% - 0.38% 1 10 Decile 0.47% - 0.32% 1 Decile 10 0.35% - 0.30% 1 Decile 10 0.29% - 0.27% 1 Decile 10 100% 0% 100% 0% 100% 0% 100% 0% 100% 0% True Lift Gains Chart 0 100% 0 100% 0 100% 0 100% 0 100% 16

Other solutions, variations & applications Decision trees Clustering / K-nearest neighbor Bootstrapping Optimization Personalized medicine Other marketing situations (how to separate very similar groups who act differently) 17

Ideal conditions for true-lift modeling A randomized control group is withheld! Treatment does not cause all responses Response is not correlated to lift (i.e., response model is not good enough) Lift-to-noise ratio is large enough If overall lift is near 0, then you need pockets of both negative lift and positive lift Repeated campaigns, or at least test campaign precedes rollout 18

Stop spending direct marketing dollars on customers who would purchase anyway! True-lift modeling can identify: which customers will purchase without receiving a marketing contact which customers need a direct marketing nudge to make a purchase which customers have a negative reaction to marketing (and purchase less if contacted) This discussion will describe: the basic requirements needed to succeed with true-lift modeling scenarios where this modeling method is most applicable the pros and cons of various approaches to true-lift modeling 19

Glossary Response = taking the desired action (Action A); might have done Action A whether treated or not True-lift = taking the desired action (Action A) only in response to the Treatment; would not have done Action A if not treated (aka uplift, net lift, incremental lift) 20