Using survival analytics to estimate lifetime value

Similar documents
CoolaData Predictive Analytics

Product recommendations and promotions (couponing and discounts) Cross-sell and Upsell strategies

Acquiring new customers is 6x- 7x more expensive than retaining existing customers

Maximize Revenues on your Customer Loyalty Program using Predictive Analytics

RFM Analysis: The Key to Understanding Customer Buying Behavior

Improve Marketing Campaign ROI using Uplift Modeling. Ryan Zhao

An Introduction to Survival Analysis

Easily Identify Your Best Customers

Onward Reserve: 2.7x Revenue From Segmented Newsletters, 18x Conversions From Automated s.

Prescriptive Analytics. A business guide

CUSTOMER VALUE MANAGEMENT PRINCIPLES. Top 5 Recommendations for Using Customer Lifetime

THE OPTIMIZER HANDBOOK:

Frequency Matters. The keys to optimizing send frequency

FINDING THE GOOD IN BAD DEBT BEST PRACTICES FOR TELECOM AND CABLE OPERATORS LAURENT BENSOUSSAN STEPHAN PICARD

CUSTOMER RELATIONSHIP MANAGEMENT (CRM) CII Institute of Logistics

NPV Versus IRR. W.L. Silber We know that if the cost of capital is 18 percent we reject the project because the NPV

Customer Lifetime Value II

Multichannel Attribution

Simple Predictive Analytics Curtis Seare

The Definitive Guide to Lifetime Value THE DEFINITIVE GUIDE TO CUSTOMER LIFETIME VALUE

Chapter 6: The Information Function 129. CHAPTER 7 Test Calibration

Investment Decision Analysis

Keeping up with the KPIs 10 steps to help identify and monitor key performance indicators for your business

MBA 8130 FOUNDATIONS OF CORPORATION FINANCE FINAL EXAM VERSION A

How to Drive. Online Marketing through Web Analytics! Tips for leveraging Web Analytics to achieve the best ROI!

Introduction to Strategic Supply Chain Network Design Perspectives and Methodologies to Tackle the Most Challenging Supply Chain Network Dilemmas

Investment Appraisal

Marketing. Marketing Your Business. The Ground Work

CALCULATOR TUTORIAL. Because most students that use Understanding Healthcare Financial Management will be conducting time

DIRECT MAIL: MEASURING VOLATILITY WITH CRYSTAL BALL

Boost Your Direct Marketing Success with Prepaid Incentive Cards

Private annuity/trust

The Credit Card Model

Why Modern B2B Marketers Need Predictive Marketing

A Contact Center Crystal Ball:

WHITE PAPER: DATA DRIVEN MARKETING DECISIONS IN THE RETAIL INDUSTRY

Pricing I: Linear Demand

The Credit Card Model

Boost Your Direct Marketing Success with Prepaid Incentive Cards

Overview, Goals, & Introductions

Business Intelligence. Tutorial for Rapid Miner (Advanced Decision Tree and CRISP-DM Model with an example of Market Segmentation*)

How value attribution will shape the future of affiliate marketing

The Reinvestment Assumption Dallas Brozik, Marshall University

ADVANCED MARKETING ANALYTICS:

SWOT Analysis of E-Commerce

CE Entrepreneurship. Investment decision making

GUIDE TO THE. 12 Must-Have KPIs for Sales Enablement

The Price Is Right. Best Practices in Pricing of Telecom Services

Whitepaper. Power of Predictive Analytics. Published on: March 2010 Author: Sumant Sahoo

A Basic Guide to Modeling Techniques for All Direct Marketing Challenges

The lifetime value of a Land Line Phone Subscriber

Making Sense of the Mayhem: Machine Learning and March Madness

Ma r k e t i n g RO I :

Social Business Intelligence For Retail Industry

A supply chain analytics approach to product assortment optimization

Management Science Letters

Multiple Linear Regression in Data Mining

How Organisations Are Using Data Mining Techniques To Gain a Competitive Advantage John Spooner SAS UK

A Simple Guide to Churn Analysis

12/10/2012. Real-Time Analytics & Attribution. Client Case Study: Staples. Noah Powers Principal Solutions Architect, Customer Intelligence, SAS

Part 7. Capital Budgeting

Where s my real ROI? White Paper #1 February expert Services

Managing Customer Retention

PROFITABLE CUSTOMER ENGAGEMENT Concepts, Metrics & Strategies

Business Intelligence and Decision Support Systems

Paper P1 Performance Operations Post Exam Guide March 2011 Exam. General Comments

Business Process Discovery

Why Advisors Should Use Deferred-Income Annuities

How2Guide. How Marketers Can Tap into Customer Data to Improve Customer Profitability and Campaign Effectiveness

Business white paper. The road to strategic website design The Optimost Web Optimization Maturity Model

Reducing Customer Churn

TPPE17 Corporate Finance 1(5) SOLUTIONS RE-EXAMS 2014 II + III

Planning the Legacy of Your Faith

MANAGING RISK IN LIFE INSURANCE: A STUDY OF CHAGING RISK FACTORS OVER THE LAST 10 YEARS IN TAIWAN AND DRAWING LESSONS FOR ALL

INDIA S LEADING DIGITAL ADVERTISING AND ANALYTICS COMPANY USING DATA TO DRIVE YOUR DIGITAL SUCCESS

Click Here to Buy the Tutorial

BENCHMARKING PERFORMANCE AND EFFICIENCY OF YOUR BILLING PROCESS WHERE TO BEGIN

MARRYING BIG DATA WITH FASHION COMMERCE

Predictive analytics. The rise and value of predictive analytics in enterprise decision making

q-papers q-papers Return on Investment: Justifying CRM Investments Enterprise Measures Steve Schultz, Quaero

Bond Valuation. What is a bond?

The Cost of Capital, and a Note on Capitalization

Alteryx Sample Use Cases. Retail

Transcription:

Using survival analytics to estimate lifetime value Received: 30th April, 2015 Mike Grigsby has worked in marketing analytics for nearly 30 years, working at Sprint, Dell, HP and the Gap. He is now an analytic consultant at Targetbase, focusing on the retail industry. Mike has been a frequent speaker at trade/academic conferences. He teaches marketing analytics at UTD s graduate school. Mike has published in Marketing Insights, Canadian Journal of Marketing Research, Marketing Management and similar journals. His book, Marketing Analytics, was recently published by Kogan Page. E-mail: m666grigsby@yahoo.com Abstract Typically, lifetime value (LTV) is merely a calculation using descriptive/historical data. This calculation makes some rather heroic assumptions to project into the future but most importantly gives no insights into why a customer is, for example, lower valued, or how to make a customer higher valued. That is, descriptive techniques offer no insights into predicting, incentivizing or changing customer behavior. Using predictive techniques in this case survival analysis can give an indication into what causes purchases to happen. This means marketers get insights levers into how to increase LTV. This predictive modeling is strategically lucrative. This paper appeared in a different formant in Marketing Analytics, Kogan Page, June, 2015. KEYWORDS: lifetime value, LTV, predicting next purchase, time until purchase, survival modeling, retail analytics, predictive modeling, targeting, consumer behavior, financial implications. LIFETIME VALUE WITH DESCRIPTIVE ANALYSIS Lifetime value (LTV) is typically done as just a calculation, using past (historical) data. That is, it is only descriptive. While there are many versions of LTV (depending on data, industry, interest, etc.) the following is conceptually applied to all. LTV, via descriptive analysis: 1. Uses historical data to sum up each customer s total revenue. 2. This sum then has subtracted from it some costs: typically cost to serve, cost to market, maybe cost of goods sold, etc. 4. These cash flows are assumed to continue into the future and diminish over time (depending on durability, sales cycle, etc.), often decreasing arbitrarily by, say, 10 percent each year until they are effectively zero. 5. These (future, diminished) cash flows are then summed up in discounted (usually by weighted average cost of capital) to get their net present value (NPV). 6. This NPV is called LTV. This calculation is applied to each customer. 3. This net revenue is then converted into an annual average amount and depicted as a cash flow.

Thus, each customer has a value associated with it. The typical use is for marketers to find the high-valued customers (based on past purchases). These high-valued customers get most of the communications, promotions/discounts, marketing efforts, etc. Descriptive analysis is merely about targeting with already (on average) engaged, much like RFM. This seems to be a good starting point but, as is usual with descriptive analysis, contributes nothing about why. Why is one customer more valuable? Will they continue to be? Is it possible to extract additional value, but at what cost? Is it possible to garner more revenue from a lower-valued customer because they are more loyal or cost less to serve? What part of the marketing mix is each customer most sensitive to? LTV (as described above) gives no implications for strategy. The only strategy is to offer and promote to the highvalued customers. SURVIVAL MODELING: AN INTRODUCTION Survival modeling came into bio-statistics in the early 1970s, where the subject studied was an event, death. Survival analysis is about modeling the time until an event. In bio-statistics the event is typically death, but in marketing the event can be response, purchase, churn, etc. Due to the nature of survival analysis, there are a couple of characteristics that are endemic to this technique. The dependent variable is time-until-an-event, so time is built into the analysis. The second endemic thing to survival analysis is observations that are censored. A censored observation is either an observation that was lost to the study and there is no knowledge of having the event or not but we do know at some point in time that the observation has not had the event. In marketing it is common for the event to be a purchase. Imagine scoring a database of customers with time-untilpurchase. That is far more actionable than, from logistic regression, probability of purchase. What can be done about censored observations, that is, those that did not have the event? We could delete them. That would be simple, but depending on how many there are, it might discard a lot of data. Also, these might be the most interesting data of all, so deleting them is probably a bad idea. We could just give the maximum time until an event to all those that have not had the event. This would also be a bad idea, especially if a large portion of the sample is censored, as is often the case. (It can be shown that throwing away a lot of censored data will bias the results.) Thus, we need a technique that can deal with censored data. Survival modeling takes into account time until the event and includes information about those that have not had the event (yet). LIFETIME VALUE WITH PREDICTIVE ANALYSIS (USING SURVIVAL MODELING) How would LTV change using predictive analysis instead of descriptive analysis? First note that while LTV is a future-oriented metric, descriptive analysis uses historical (past) data and the entire metric is built on those, with assumptions about the future applied unilaterally to every customer. Prediction will specifically thrust LTV into the future (where it belongs) by using independent variables to estimate the next time until purchase. Since the major customer behavior driving LTV is timing and secondly, the amount and number of purchases a statistical technique needs to be used that predicts time until an event. (Ordinary regression predicting the LTV amount ignores timing.) Survival analysis is a technique designed specifically to study time-until-event problems. It has timing built into it, and thus a future view is already embedded in

the algorithm. This removes much of the arbitrariness of typical (descriptive) LTV calculations. So, what about using survival analysis to see which independent variables, say, bring in a purchase? This decreasing time until purchase tends to increase LTV. While survival analysis can predict the next time until purchase, the strategic value of survival analysis is in using the independent variables to change the timing of purchases. That is, descriptive analysis shows what happened; predictive analysis gives a glimpse of what might change in the future. Strategy using LTV dictates understanding the causes of customer value: why a customer purchases, what increases/decreases the time until purchase, probability of purchasing at future times, etc. Then, when these insights are learned, marketing levers (shown as independent variables) are exploited to extract additional value from each customer. This means knowing that the one customer is, say, sensitive to price and that offering a discount will tend to decrease their time until purchase. That is, they will purchase sooner (maybe purchase large amounts and maybe purchase more often) with a discount. Another customer prefers say, product X and product Y bundled together to increase the probability of purchase and this bundling decreases their time until purchase. This insight allows different strategies for different customer needs and sensitivities, etc. Survival analysis applied to each customer yields insights into understanding and incentivizing changes in behavior. This means that merely assuming the past behavior will continue into the future (as descriptive analysis does), with no idea Table 1: Before model implementation why, is no longer necessary. It is possible for descriptive and predictive analysis to give contradictory answers, which is why crawling might be detrimental to walking. If a firm can get a customer to purchase sooner, there is an increased change of adding purchases depending on the product. But even if the number of purchases is not increased, the firm getting revenue sooner will add to their financial value (time is money). Also, a business case can be created by showing the trade-off in giving up, say, margin but obtaining revenue faster. This means strategy can revolve around maximization of cost balanced against customer value. The idea is to model next time until purchase the baseline and see how to improve that. How is this carried out? A behaviorally based method would be to segment the customers (based on behavior) and apply a survival mode to each segment and score each individual customer. By behavior is typically meant purchasing (amount, timing, share of products, etc.) metrics and marcom (open and click, direct mail coupons, etc.) responses. AN EXAMPLE Let us use an example. Table 1 shows two customers from two different bevahioral segments. Customer XXX purchases every 88 days with an annual revenue of $43,958, costs of $7,296 for a net revenue of $36,662. Say the second year is exactly the same. So year 1 discounted at 9 percent is an NPV of $33,635 and year 2 discounted at 9 percent for two years is $30,857 for a total LTV of $64,492. Customer YYY has similar calculations for LTV of $87,898. The above (using descriptive analysis) would Customer Days between purchases Annual Purchases Costs LTV at 9% XXX 88 4.148 43,958 7,296 36,662 36,662 33,635 30,857 64,492 YYY 58 6.293 62,289 12,322 49,967 49,967 45,842 42,056 87,898

have the marketers targeting customer YYY with > $23,000 value over customer XXX. But do we know anything about why customer XXX is so much lower valued? Is there anything that can be done to make them higher valued? Applying a survival model to each segment outputs independent variables and shows their effect on the dependent variable. In this case the dependent variable is (average) time until purchase. Say the independent variables (which defined the behavioral segments) are things such as price discounts, product bundling, seasonal messages, adding additional direct mail catalogues, offering online exclusives, etc. The segmentation should separate customers based on behavior and the survival models should show how different levels of independent variables drive different strategies. Table 2 shows the results of survival modeling on the two different customers that come from two different segments. The independent variables are price discounts of 10 percent, product bundling, etc. The TTE is time until event shows what happens to time until purchase based on changing one of the independent variable. For example, for customer XXX, giving a price discount of 10 Table 2: Model Results Variables XXX YYY TTE TTE Price Discount 10% -14-2 Product bundling -4 12 Seasonal message 6 5 Five more catalogues 11-2 Online exclusive -11 3 percent on average decreases their time until they purchase by 14 days. Giving YYY a 10 percent discount decreases their time until purchase by only two days. This means XXX is far more sensitive to price then YYY which would not be known by descriptive analysis alone. Likewise, giving XXX more direct mail catalogues pushes out their TTE but pulls in YYY by two days. Note also that very little of the marketing levers affect YYY very much. We are already getting nearly all from YYY that we can no marketing effort does very much to impact the TTE. However, with XXX there are several things that can be done to bring in their purchases. Again, none of these would be known without survival modeling on each behavioral segment. Table 3 shows new LTV calculations on XXX after using survival modeling results. We decreased TTE by 24 days by using some combinations of discounts and bundling and online exclusives, etc. Note now that the LTV for XXX (after using predictive analysis) is greater than YYY. What survival analysis offers, in addition to marketing strategy levers, is a financial optimal scenario, particularly in terms of costs to market. That is, customer XXX responds to a discount. It is possible to calculate and test what is the (just) needed threshold of discounts to bring a purchase in by so many days with the estimated level of revenue. This ends up being a cost/benefit analysis that makes marketers think about strategy. This is the advantage or predictive analysis giving marketers strategic options. Table 3: After modeling implementation Customer Days between purchases Annual Purchases Costs LTV at 9% XXX 64 5.703 60,442 10,032 50,410 50,410 33,635 30,857 88,677 YYY 58 6.293 62,289 12,322 49,967 49,967 45,842 42,056 87,898

CONCLUSION While it took some time for survival analysis to move from bio-statistics into marketing, it has now become an important tool in marketing analytics. Much of this is because when an event happens it is typically more important to marketers than the probability of an event happening. Marketing is about choice and that choice happens in time. In terms of LTV, survival analysis has several advantages: 1. It is statistical, rather than merely mathematical. This means that it will not suffers as much from a recent change in a customer s purchasing behavior. That is, if a customer, say, severely decreases their purchasing in recent time periods, predictive analytics will catch/adjust for this far faster/better than descriptive analysis (which tends to use past/historical data). The converse is also true. 2. As mentioned, it is predictive. That is, LTV can be estimated by serval cause-and-effect (independent) variables. 3. Because it is predictive, these independent variables (especially marketing levers like price/discount offers, marcom vehicle/channel, product bundling, etc.) provide insight into how to change (increase) LTV. 4. Because of the above, a financial business case can be made. That is, if it, say, costs 5 percent more in discounts to increase a customer s LTV, the model can show that the increase in discounts brings about only a 2 percent increase LTV. 5. It allows the generation of a marketing strategy, rather than merely targeting high-valued customers. Because of the above, survival analysis should be incorporated into all marketing analysts toolboxes. The effort is worth it.