analytics stone Automated Analytics and Predictive Modeling A White Paper by Stone Analytics



Similar documents
Explode Six Direct Marketing Myths

Easily Identify Your Best Customers

IBM SPSS Direct Marketing 22

IBM SPSS Direct Marketing 23

Easily Identify the Right Customers

IBM SPSS Direct Marketing 19

Predictive Coding Defensibility

IBM SPSS Direct Marketing

White Paper. Thirsting for Insight? Quench It With 5 Data Management for Analytics Best Practices.

WHITEPAPER. Complying with the Red Flag Rules and FACT Act Address Discrepancy Rules

IBM SPSS Direct Marketing 20

Measuring Business Impact in Human Resources. A Link Consulting White Paper March 2014

9. 3 CUSTOMER RELATIONSHIP MANAGEMENT SYSTEMS

Digging for Gold: Business Usage for Data Mining Kim Foster, CoreTech Consulting Group, Inc., King of Prussia, PA

Customer Analytics. Turn Big Data into Big Value

A Management Report. Prepared by:

6 Steps to Data Blending for Spatial Analytics

Optimizing Enrollment Management with Predictive Modeling

Why Modern B2B Marketers Need Predictive Marketing

Increasing marketing campaign profitability with predictive analytics

Credit Scoring and Its Role In Lending A Guide for the Mortgage Professional

The 20-Minute PPC Work Week. Making the Most of Your PPC Account in Minimal Time. A WordStream Guide

The top 10 secrets to using data mining to succeed at CRM

Product recommendations and promotions (couponing and discounts) Cross-sell and Upsell strategies

WHITEPAPER. Creating and Deploying Predictive Strategies that Drive Customer Value in Marketing, Sales and Risk

Consulting Performance, Rewards & Talent. Measuring the Business Impact of Employee Selection Systems

Solve your toughest challenges with data mining

Feature Engineering Tips for Data Scientists

Boost Your Direct Marketing Success with Prepaid Incentive Cards

Benchmark Report. Customer Marketing: Sponsored By: 2014 Demand Metric Research Corporation. All Rights Reserved.

Get Better Business Results

White Paper HOW TO INCREASE YOUR COMPANY S VALUE WITH PREDICTIVE ANALYTICS. What is Predictive Analytics?

RFM Analysis: The Key to Understanding Customer Buying Behavior

Key Metrics for Determining ROI for Business Intelligence Implementations. Elliot King, Research Analyst August 2007

Five Ways Retailers Can Profit from Customer Intelligence

ElegantJ BI. White Paper. The Competitive Advantage of Business Intelligence (BI) Forecasting and Predictive Analysis

WHITE PAPER. The Five Fundamentals of a Successful FCR Program

birt Analytics data sheet Reduce the time from analysis to action

FutureFlowMedia. A New Weapon to Help Your Clients Reach Precisely-Target Audiences Hyper-Targeted Advertising

Applied Data Mining Analysis: A Step-by-Step Introduction Using Real-World Data Sets

The information contained herein is proprietary to Decision Software, Inc. and is provided solely for the benefit of our customers and potential

Marketing Strategies for Retail Customers Based on Predictive Behavior Models

Planning successful data mining projects

Prescriptive Analytics. A business guide

Real World Rules for Lead Scoring & Prioritization. What Really Defines a Hot Prospect?

THE PREDICTIVE MODELLING PROCESS

CONTENTS WHAT IS ONLINE MARKETING? WHY IT'S IMPORTANT HOW TO GET STARTED // TRADITIONAL MARKETING // TYPES OF ONLINE MARKETING

Target Analytics Nonprofit Cooperative Database

THE THREE "Rs" OF PREDICTIVE ANALYTICS

Foundations of Business Intelligence: Databases and Information Management

The 2-Tier Business Intelligence Imperative

Predictive analytics. The rise and value of predictive analytics in enterprise decision making

Business Information Services. Product overview

Boost Your Direct Marketing Success with Prepaid Incentive Cards

Get to Know the IBM SPSS Product Portfolio

Myths of Digital Marketing

Introduction to Integrated Marketing: Lead Scoring

W H I T E P A P E R. Reducing Server Total Cost of Ownership with VMware Virtualization Software

THE 20-MINUTE PPC WORK WEEK MAKING THE MOST OF YOUR PPC ACCOUNT IN MINIMAL TIME

Proven Best Practices for a Successful Credit Portfolio Conversion

E-Business: Use of Information Systems

COMMERCIAL BANK. Moody s Analytics Solutions for the Commercial Bank

Web Analytics and the Importance of a Multi-Modal Approach to Metrics

Managing the Next Best Activity Decision

Speech Analytics: Best Practices for Analytics- Enabled Quality Assurance. Author: DMG Consulting LLC

Solve Your Toughest Challenges with Data Mining

Insurance Marketing White Paper The benefits of implementing marketing automation into your marketing strategy

EASI Reseller Opportunities: Demographic Estimates and Forecasts; Life Stage Clusters; Major Merchandise Lines and Minor Store Groups

Using Tableau Software with Hortonworks Data Platform

Challenge. Ten key attributes of an effective contact database vendor BUYER S GUIDE TO SELECTING A CONTACT DATABASE VENDOR CONTENTS

STATISTICA. Financial Institutions. Case Study: Credit Scoring. and

What is Marketing Operations?

How to Develop Qualification Criteria

The expression better, faster, cheaper THE BUSINESS CASE FOR PROJECT PORTFOLIO MANAGEMENT

Demand Generation vs. Marketing Automation David M. Raab Raab Associates Inc.

WhitePaper. Private Cloud Computing Essentials

Your Complete CRM Handbook

Center for Effective Organizations

Whitepaper. Power of Predictive Analytics. Published on: March 2010 Author: Sumant Sahoo

Using Adaptive Random Trees (ART) for optimal scorecard segmentation

The ROI of Test Automation

Myth or Fact: The Diminishing Marginal Returns of Variable Creation in Data Mining Solutions

Marketing Primer

WHITEPAPER. How to Credit Score with Predictive Analytics

4.5% 2014 Digital Marketing Optimization Survey results > 4.5% Top lessons learned from the leaders

Data Products and Services. The one-stop-shop for all your business-to-consumer data requirements

Transcription:

stone analytics Automated Analytics and Predictive Modeling A White Paper by Stone Analytics 3665 Ruffin Road, Suite 300 San Diego, CA 92123 (858) 503-7540 www.stoneanalytics.com

Page 1 Automated Analytics and Predictive Modeling A White Paper by Stone Analytics Why Read This Paper? According to a recent IDC study, Business analytics implementations generated an average 5- year return on investment of 431%. The study goes on to say organizations that have successfully implemented and utilized analytic applications have realized returns ranging from 17% to more than 2000%, with a median ROI of 112%. These are some pretty extraordinary claims, and there is no doubt that individual results are sure to vary. At the same time, however, there is also no doubt that analytics has a great deal to offer the decision-making process and that it is rapidly becoming a tool employed by more and more businesses of every size and category. Summary Though long used by the Fortune 1000 and other large organizations, predictive analytics has been both difficult and expensive to employ. Until very recently, it required the time and effort of one or more highly-trained and highly-paid statisticians. To be reliable, it also requires large stores of data, the kind of data that in the past was kept, or purchased, by only the largest companies. With the advent of digital technology and the Internet, however, data of all kinds is becoming more and more abundant everyday. In addition, there are now statistical modules that automate the statistical process itself, making predictive modeling an easy and cost-effective addition to the decision-making process for small and medium sized companies. Decisionmakers at all levels now have the ability to apply advanced analytics to a much broader range of business and organizational tasks, including site selection, fund-raising, and target marketing. Introduction Analytics is defined by Webster s as the method of logical analysis. To a large extent, it is synonymous with statistical analysis, which has long been an essential tool in both scientific and financial settings. More recently, analytics has also become an essential tool for solving complex logistical problems, such as those associated with manufacturing and delivery schedules. In all of these areas, it is the ability to identify and apply the relationships between, available information and a question of interest whose answer is uncertain that gives analytics its unique value. In spite of what it has to offer, however, analytics has never been cheap or easy to employ. Until recently, it required one or more highly-trained and highly-paid statisticians. To justify the cost, there had to be vast sums of money involved. At the same time, even for those willing to pay the price, data an essential ingredient of any statistical analysis was not always easy to come by. Few organizations possessed the kind of data needed to perform an accurate analysis. That has changed. Companies of all sizes and description now gather and maintain data about their customers and clients. From gasoline to diapers, practically every purchase made in the United States is recorded in some electronic manner. Data of all kinds are now readily available data about income, homeownership, education, and a thousand other demographic categories. As a result, analytics is finding new applications, particularly in the area of predictive modeling,

Page 2 which can be used in tasks ranging from site selection and fraud detection to fund raising and target marketing. What is Predictive Modeling? Think about the processes you use every day to make judgments and decisions given past experiences and a variety of observations about the question at hand, you weigh the facts and make a determination. You are, in effect, modeling. Statistical modeling is simply the standardization of this process. Predictive models, in particular, seek to explain the relationships between a critical variable and a group of factors that help predict its outcome. One obvious example of an area in which this is true is target marketing. Target marketing refers to the practice of ranking potential customers based on the analysis of available customer information. By narrowing the marketing focus to the most promising customer leads, target marketing substantially reduces marketing costs associated with selling products and services. From telephone companies and credit card issuers battling churn, to supermarkets and cable companies looking to sell additional products or services to their existing customers, target marketing is no longer confined to book and record clubs. Companies from AT&T and Sprint to Citibank and Prudential have substantial target marketing programs in place. Another example where an important business decision has to be made on the basis of limited information is site selection. Each year, chain stores and franchises have to consider and choose from among hundreds of potential locations. It is simply not possible to visit and report on each one separately. A similar situation exists for retailers regarding staffing decisions during the holiday season. To meet manpower requirements, department stores and other retailers hire tens, even hundreds, of employees so many that it would be impossible for HR personnel to go through every application in detail. Still, while it may be limited, in each of these examples some information, or data, does exist. In terms of target marketing, it might be the Recency, Frequency, and Monetary Value of a customer s purchase history, otherwise known as RFM. For site selection it might be the demographics of the surrounding neighborhoods or proximity to public transit, while for personnel hiring it might be the education level or previous employment of the applicant. The information may be limited or incomplete, and in an ideal world, one would hold off on any decisions until a more complete body of data could be assembled and examined. In the real world, however, that luxury rarely exists. Regardless of the data, at a certain point, the decision has to be made the campaign has to be mailed, the site chosen, or the person hired. In each of these situations, the decision-maker must leverage the data at hand to the best advantage, and when the stakes are high enough, companies have long been willing to pay for a team of analysts in order to get this statistical advantage.

Page 3 The Modeling Process As mentioned above, in addition to data constraints, in the past there have been substantial cost constraints as well. Depending on the scope, modeling projects can cost anywhere from $25,000 to $100,000, and take weeks or even months to complete. Below are just some of the steps involved: Identify Target Variable: The analyst must select or, in many cases create, the target variable, which is the question that is being addressed. Data Exploration: The analyst examines the data, computing and analyzing various summary statistics regarding the different variables contained in the data set. Split Data Set: The analyst may randomly split the data into two sets, one of which will be used to build, or train, the model, and the other of which will be used to test the quality of the model once it is built. The Modeling Process: Percentage of time spent on each step Categorical Variable Preprocessing: Categorical variables are variables such as gender and marital status that possess no natural numerical order. These variables must be identified and handled separately from continuous numerical variables such as age and income. Data Cleansing: The data must be cleansed of missing values and outliers. Missing values are, quite literally, missing data. For example, there may be a number of records in a database for which the age is a null value. Handling missing data may be tricky, since the information may be unavailable for a number of different reasons. For example, if the data were collected via a survey, age might be missing because the respondent elected not to reveal his or her age. On the other hand, the response may have been originally present but was lost during processing Outliers, on the other hand, are data records so different from the rest that they can skew the results of calculations. For example, there might be a household income entry of $150,000 among a group where the rest of the records range

Page 4 from $30,000 to $50,000. As with missing values, the analyst must decide whether to include, exclude, or replace them with a more typical value for that variable. Variable Reduction: The variables in the data set are examined in terms of their correlation with the target variable. Those that are redundant or highly correlated with each other are removed. For example, a data set may include monthly and yearly household income. In some situations, including both could be problematic. Variable Standardization: The remaining variables are often re-scaled so that they can be more efficiently analyzed. Create Model: Capture variables and weights that yield the best estimate of the target variable using the training data taken from the original data set. Model Selection: Several competing models are often considered. Model Validation: Run the model using the test data taken from the original data set. Automated Analytics In the past, the above steps required the expertise not only of one or more analysts, but programmers as well. It is this exact process that Stone Analytics has automated in its Decision Science product line. With Decision Science, decision-makers of all levels can use advanced analytics to guide their critical business decisions at a fraction of the cost and time. Decision Science analytic engines are easy-to-use software tools that enable business professionals to build and implement powerful predictive models directly from their desktops, making it possible to apply analytics to a much broader range of business and organizational tasks. The Modeling Process with Decision Science By automating steps 3 through 7, as well as elements of steps 2 and 8, Automatic Analyst can cut modeling time by upwards of 50%. Decision Science enables users to make predictions of two types; (1) predictions about quantities, such as number of units sold annually, and (2) predictions about the outcome of Yes or No questions, such as whether or not a potential customer is likely to purchase your product. To do

Page 5 this, it takes data with a known outcome and constructs a statistical model that relates the way in which individuals previously responded with behavioral and demographic characteristics. For example, using the results from a previous marketing promotion, Decision Science produces a model that gauges response to the promotion with an individual s unique characteristics. The model could then be applied to future marketing campaigns to estimate the probability of each individual in a prospect list responding in a positive manner. Decision Science automatically ranks, or scores, the entire list, making it easy to quickly identify the top 10 or 20 percent of likely customers, or, just as easily, exclude the bottom third. The Campaign Scenario screen of the Decision Science scoring module enables you to perform various ROI calculations. Decision Science s proprietary Automatic Analyst technology automates much of the modeling process, enabling users with little or no statistical experience to perform statistical analysis quickly and easily. Data exploration and preprocessing, which typically takes 50 to 80% of an analyst s time during the modeling process, is handled automatically by the software. At the touch of a button, Automatic Analyst scans an entire data set and Automatically distinguishes between continuous numeric variables such as income and age, and categorical variables such as gender and marital status. Automatically handles problem data, such as missing values and outliers. Automatically partitions the data into random test and train subsets, to protect against sample bias in the data. Automatically examines the relationship between each potential variable and the question you are trying to answer to find the most predictive variables.

Page 6 Automatically uses these variables to build a statistical model that optimizes the decision you are about to make. Automatically evaluates the accuracy of the models it creates. Problem solving is one of the core functions of any successful business. The problem might be handling the return of a purchase or insuring repayment of a loan without default. Automated analytic modules such as Decision Science enable business managers to identify the critical behavioral patterns within their customer and client databases, so that instead of subjectivity and conjecture, they can now base their decisions on proven analytical methods. The Customer Valuation screen of the Decision Science valuation module enables you to sort prospective customers, store locations, or anything else by predicted value. About Stone Analytics Stone Analytics develops, markets, and supports analytical products that add value to any datarich application environment. Stone Analytics statistical modeling products are easy for software developers to integrate, and easy for customers to use. These tools bring point-and-click simplicity to the tasks of rank ordering, grouping, and similar everyday business decisionmaking tasks. Stone Analytics employs the latest statistical and information management techniques to provide customers with state-of-the-art solutions that add statistical modeling functionality to their applications. Stone Analytics supplements its products with a wide range of professional services customized according to customer needs. These include integration assistance, custom model building, and consulting in the areas of data management and analytics.