THE PREDICTIVE MODELLING PROCESS



Similar documents
KEEPING CUSTOMERS USING ANALYTICS

Past, present, and future Analytics at Loyalty NZ. V. Morder SUNZ 2014

DISCOVER MERCHANT PREDICTOR MODEL

Mathematics Online Instructional Materials Correlation to the 2009 Algebra I Standards of Learning and Curriculum Framework

not possible or was possible at a high cost for collecting the data.

Accurately and Efficiently Measuring Individual Account Credit Risk On Existing Portfolios

STATISTICA. Financial Institutions. Case Study: Credit Scoring. and

Cross Validation. Dr. Thomas Jensen Expedia.com

Experian s UK Credit Bureau Scores. Version 1.6

Determine If An Equation Represents a Function

Chapter. Break-even analysis (CVP analysis)

Data Mining Applications in Higher Education

Acquiring customers profitably. With Credit Bureau Scores

Creating the customer experience

Improving Demand Forecasting

Applied Data Mining Analysis: A Step-by-Step Introduction Using Real-World Data Sets

Data Mining Algorithms Part 1. Dejan Sarka

Section A. Index. Section A. Planning, Budgeting and Forecasting Section A.2 Forecasting techniques Page 1 of 11. EduPristine CMA - Part I

THE THREE "Rs" OF PREDICTIVE ANALYTICS

The Power of Predictive Analytics

How Organisations Are Using Data Mining Techniques To Gain a Competitive Advantage John Spooner SAS UK

4. Simple regression. QBUS6840 Predictive Analytics.

A Primer on Forecasting Business Performance

Using reporting and data mining techniques to improve knowledge of subscribers; applications to customer profiling and fraud management

IBM's Fraud and Abuse, Analytics and Management Solution

FORECASTING. Operations Management

The Top 9 Ways to Increase Your Customer Loyalty

Credit Risk Analysis Using Logistic Regression Modeling

An Introduction to Survival Analysis

analytics stone Automated Analytics and Predictive Modeling A White Paper by Stone Analytics

BANKING ON CUSTOMER BEHAVIOR

Easily Identify Your Best Customers

White Paper. Data Mining for Business

Algebra 1 Course Information

Factors Influencing Price/Earnings Multiple

TEXT ANALYTICS INTEGRATION

ADVANCED MARKETING ANALYTICS:

Callidus for Insurance

Digging for Gold: Business Usage for Data Mining Kim Foster, CoreTech Consulting Group, Inc., King of Prussia, PA

Solve Your Toughest Challenges with Data Mining

01 In any business, or, indeed, in life in general, hindsight is a beautiful thing. If only we could look into a

Whitepaper. Power of Predictive Analytics. Published on: March 2010 Author: Sumant Sahoo

How To Recruit For A Contact Center

Credit Scorecards for SME Finance The Process of Improving Risk Measurement and Management

Customer Lifecycle Management How Infogix Helps Enterprises Manage Opportunity and Risk throughout the Customer Lifecycle

A NEW METHOD FOR DEVELOPING THE MOST COST- EFFECTIVE REHABILITATION PROGRAMS FOR OUR AGEING SEWER NETWORKS

Telecommunications Overview. Enhance customer loyalty with customer-centric communications and interaction

CORRELATED TO THE SOUTH CAROLINA COLLEGE AND CAREER-READY FOUNDATIONS IN ALGEBRA

Solve your toughest challenges with data mining

QlikView for Telecommunications. Delivering Unprecedented Customer Intelligence

What is Modeling and Simulation and Software Engineering?

A Property & Casualty Insurance Predictive Modeling Process in SAS

INVENTORY MANAGEMENT, SERVICE LEVEL AND SAFETY STOCK

Asset Management Plan Overview

Application of SAS! Enterprise Miner in Credit Risk Analytics. Presented by Minakshi Srivastava, VP, Bank of America

Tennessee Department of Education. Task: Sally s Car Loan

MBA Jump Start Program

consumerlab Keeping Smartphone users loyal Assessing the impact of network performance on consumer loyalty to operators

Prescriptive Analytics. A business guide

Increasing marketing campaign profitability with Predictive Analytics

Credit Scoring: Data and Decisioning

Forecasting the first step in planning. Estimating the future demand for products and services and the necessary resources to produce these outputs

Guide to Predictive Lead Scoring

APPLICATIONS OF PERFORMANCE SCORING TO ACCOUNTS RECEIVABLE MANAGEMENT IN CONSUMER CREDIT John Y. Coffman * Gary G. Chandler

Breakeven, Leverage, and Elasticity

Definition 8.1 Two inequalities are equivalent if they have the same solution set. Add or Subtract the same value on both sides of the inequality.

Application of Predictive Analytics to Higher Degree Research Course Completion Times

OBJECTIVE ASSESSMENT OF FORECASTING ASSIGNMENTS USING SOME FUNCTION OF PREDICTION ERRORS

Gerard Mc Nulty Systems Optimisation Ltd BA.,B.A.I.,C.Eng.,F.I.E.I

Balance collections with retention for each customer. Decision Analytics for debt management in retail banking

Exploiting the Single Customer View to maximise the value of customer relationships

CoolaData Predictive Analytics

Advanced analytics at your hands

Lead Scoring. Five steps to getting started. wowanalytics. 730 Yale Avenue Swarthmore, PA

Indiana State Core Curriculum Standards updated 2009 Algebra I

This unit will lay the groundwork for later units where the students will extend this knowledge to quadratic and exponential functions.

Applying the Principles of Business Intelligence to Improve Collections Performance. A Decision Analytics briefing paper from Experian

Chapter 7: Data Mining

QUANTITATIVE METHODS FOR MANAGEMENT

Towers Watson pricing software

Three Ways to Improve Claims Management with Business Analytics

Data Mining for Business Analytics

Transcription:

THE PREDICTIVE MODELLING PROCESS Models are used extensively in business and have an important role to play in sound decision making. This paper is intended for people who need to understand the process for developing predictive models because they interface in some way with technical analysts. This could be as a business user who interacts with the analytics team, a person involved with the preparation of data sets or a user of the outputs from a modelling process. This paper does not instruct people on how to build models, but covers the steps involved and the practical issues to consider. WHAT IS A MODEL? A mathematical model is an expression of relationships between variables, frequently in the form of an equation. An equation is an expression that contains an equal sign (=). To many the concept of an equation, or model, seems more complex than it actually is. The model itself is not complex, complexity arises from the process and work required to generate a good model. Consider the following, R = P x Q, where R is the revenue, P is the price and Q is the quantity sold. This equation can be used to calculate revenue by multiplying the price by the quantity sold. The equation defines mathematically the relationship between the two variables Q and R. That is, given the price of a product we can calculate R for any value of Q. This is the basis for mathematical models. Equations (such as the one above) can be represented by a straight line on a graph and are referred to as linear equations. In real life relationships between variables are more complex, to represent this complexity in equations, variables are squared, rooted and cubed etc. These equations are referred to as non-linear since they cannot be represented by a straight line. This paper focuses on the approach required to develop sound predictive models. Predictive modelling is the process of using past data to determine statistically, what is likely to happen in the future on the assumption that past trends will continue to apply. One application is in the forecasting area: predicting what will happen in the short to medium term future, in order to take pre-emptive measures such as allocating resources. Another use of predictive modelling is to identify individuals who are most likely to respond positively to some intervention. They can then be THE PREDICTIVE MODELLING PROCESS SALLY CAREY DATAMINE LTD 1

preferentially targeted to achieve maximum effect. Applications in the marketing arena include customer acquisition, customer retention and cross-selling. A key aspect of predictive modelling is the application of regular feedback, updates to reflect current conditions and maximise efficiency to ensure a model that can accurately predict an outcome for a customer base. STEPS IN PREDICTIVE MODEL DEVELOPMENT PLAN BUILD IMPLEMENT Define the objective Build model Apply model Create data sets Calculate a score Rank customers Validate model Drive initiatives Figure 1: steps in predictive model development Define the objective A clear specific objective for the model is required. Each model is developed for a specific purpose and cannot be used effectively in another situation. For example a model that predicts home loan customer churn cannot be used to predict credit card churn. An example of a clearly defined model objective contains the event or action that the model is to predict and the period it is likely to happen. For example, the objective could be to predict customers that are likely to churn their credit card within the next month. THE PREDICTIVE MODELLING PROCESS SALLY CAREY DATAMINE LTD 2

Create data sets An important step in the modelling process is the creation of the dataset to use for the model development. Broadly the dataset covers behavioural, demographic, geographic and external eg competitor information or weather. Variables that are not included in the dataset will not form part of the prediction. Variables cover both static fields such as income and triggers such as change in spend. Both technical and business people need to be involved in the decisions regarding the contents of the dataset. Focus needs to be on the behavioural information as this is more powerful for predictions than demographic data. Consideration needs to be given to which customers to exclude from the model build process. Customers need to be excluded if they are going to impair the performance of the model. Potential exclusions include bad debt, staff and new customers as they have insufficient history. The actual exclusions applied relates to the specific purpose of the model. For example, if the model is to predict customers that are likely to become bad debts, bad debt customers would be included in the model! Build statistical model The model will be built using a sample from the data set created. This is the part that can be left with the technical analysts. The resulting model will contain a subset of the original list of variables considered for the model. This is ok, and happens because some of the variables considered for the model will be correlated with each other, for example floor number and height of building. Others will have been discarded as they add little or nothing to the model s predictive power. The building of statistical models is the domain of statisticians and the technical aspects are not covered in this paper. Calculate a score The model developed will be an equation that, when applied to the customer base, will allocate a score to each customer. The score represents a customers likelihood to do whatever the model is predicting. For example, predicting a customer s likelihood to churn within a month or predicting a customer s expected order value. THE PREDICTIVE MODELLING PROCESS SALLY CAREY DATAMINE LTD 3

Validate model Typically models are validated against a hold out group. This group contains customers that have not been included in the development of the model. As such, they represent a group of previously unseen customers that are representative of the customer base. To achieve an accurate prediction of lift the hold out group must not be made up of the customers that for one reason or another have been excluded from the model development process. Figure 2: shows that by targeting the top 20% identified by the model, almost 40% of the targets the model is aimed at are actually identified. If left to a random selection only 20% would occur in 20% of the base. Apply the model The model will be built on a subset of data. Once the model is complete and has been validated the model will be run over the customer base. THE PREDICTIVE MODELLING PROCESS SALLY CAREY DATAMINE LTD 4

Rank These scores allow a customer base to be ranked in order of the predicted score, such as from highest expected order value to lowest or from most likely to least likely to churn. In reality some customers will churn and some customers will not. Therefore, in absolute terms, these predictions will not be accurate! The ranked list provides a superb base upon which to vary the treatment, and therefore the level of service or marketing spend, to groups of customers. Other useful considerations Key issues that arise are how often to run the model across the base and score the customers. This will depend on the actions being taken as a result of the model and the speed of change within the customer base and market. For example, telecommunications is a faster moving industry than insurance. Once a model has been applied and actions are being taken there is a requirement for tracking and managing the interactions. For example, control files will be required to test initiatives and to test the model performance. The results from this will help determine how frequently the model needs to be refreshed. As a rule of thumb, a model needs to be reviewed, and possibly rebuilt, annually. THE PREDICTIVE MODELLING PROCESS SALLY CAREY DATAMINE LTD 5

MEASURING EFFECTIVENESS The following provides a framework for monitoring the effectiveness of the model and communications. It is recommended that this is discussed and implications on implementation addressed during the model build process. Customer Base High Score Customers Lower Score Customers 90% for programme 10% control 10% control group 2 Tests the effectiveness of the communications Tests the accuracy of the model BUSINESS USER KEY INVOLVEMENT To summarise, the following are the key times that the business user needs involvement in the modelling process Communicating and verifying the purpose of the model Contributing to the list of variables for consideration in the model Identifying exclusions from the model build Organising the implementation of model related communicating and testing Socialising and on-boarding the use of models within the business Being a sounding board and supporting the analysts with their endeavours! THE PREDICTIVE MODELLING PROCESS SALLY CAREY DATAMINE LTD 6

BRIEF AUTHOR BIOGRAPHY Sally Carey Sally is Director of Datamine Ltd, a New Zealand based analytics consultancy that moves its clients beyond guesswork. Sally has over 25 years of B to B and B to C marketing and using quantitative approaches for business decision making. Sally has an MBA from Bradford University (UK) and is a Fellow of the Institute of Direct & Digital Marketing (UK). Sally believes that extraordinary results are achieved by a combination of analysis and intuition, and have even been referred to by some clients as magic. Key words: analytics, predictive modelling, process, model, equation THE PREDICTIVE MODELLING PROCESS SALLY CAREY DATAMINE LTD 7