Data Mining Techniques Chapter 9: Market Basket Analysis and Association Rules

Similar documents
MBA Data Mining & Knowledge Discovery

IT and CRM A basic CRM model Data source & gathering system Database system Data warehouse Information delivery system Information users

The New Era of Education Big Data

Databases - Data Mining. (GF Royle, N Spadaccini ) Databases - Data Mining 1 / 25

Mining an Online Auctions Data Warehouse

Data Mining Techniques Chapter 6: Decision Trees

Market Basket Analysis and Mining Association Rules

HYPERMARKET OF TOMORROW J. DUBOC. Executive Managing Director, Hypermarkets

Overview, Goals, & Introductions

Collaborative CRM Workshop. 03 Segmentation and Scorecard

Using Data Mining and Machine Learning in Retail

Predictive Analytics Techniques: What to Use For Your Big Data. March 26, 2014 Fern Halper, PhD

Data Mining & Knowledge Discovery: Personalization and Profiling Technologies

Assessment Vocabulary Strategies. Advertiser. Describe three types of positions typically found in an advertising agency.

Role of Social Networking in Marketing using Data Mining

Understanding Web personalization with Web Usage Mining and its Application: Recommender System

Day 2: Machine Learning & Data Mining Basics. Beibei Li Carnegie Mellon University

CHAPTER THREE Market Segmentation and Strategic Targeting

Laboratory Module 8 Mining Frequent Itemsets Apriori Algorithm

Driving shopper engagement through digital technology

Data Mining Techniques in CRM

Key Speaker. Nicole Roach Marketing Coordinator CharityNet USA. HBIF Meeting 12-09

Data Mining is sometimes referred to as KDD and DM and KDD tend to be used as synonyms

Easily Identify Your Best Customers

Market Basket Analysis for a Supermarket based on Frequent Itemset Mining

Data Mining Techniques Chapter 4: Data Mining Applications in Marketing and Customer Relationship Management

The Changing Face of Mobile Customer Engagement

Introduction to Artificial Intelligence G51IAI. An Introduction to Data Mining

Loyalty Card Program Description

for Retail One solution connects retail end-to-end, driving growth and fostering customer relationships.

Data Mining Techniques

Data Mining Algorithms Part 1. Dejan Sarka

Developing Relevant Dining Visits with Oracle Advanced Analytics Olive Garden s transition toward tailoring guests experiences

One Size Doesn t Fit All: How a Segmentation Approach Can Help Guide CE Product Strategy

Foundations of Artificial Intelligence. Introduction to Data Mining

Best Practices: Advertising and Marketing

Association Rule Mining

Data Mining: Introduction. Lecture Notes for Chapter 1. Slides by Tan, Steinbach, Kumar adapted by Michael Hahsler

Leading Practices in Market Basket Analysis

!"!!"#$$%&'()*+$(,%!"#$%$&'()*""%(+,'-*&./#-$&'(-&(0*".$#-$1"(2&."3$'45"

Mine Your Business A Novel Application of Association Rules for Insurance Claims Analytics

What this topic is about. The purpose of marketing Different approaches to marketing Segmentation Niche &mass markets B2C & B2B marketing

World Class Direct Marketing Analytics

Digital Marketing: Strategies & Measurement

Know Your Buyer: A predictive approach to understand online buyers behavior By Sandip Pal Happiest Minds, Analytics Practice

Implementation of Data Mining Techniques to Perform Market Analysis

Marketing Intelligence. Direct Response. Loyalty Marketing. Online Marketing. 105 East 5th Street, 5th Floor Kansas City, Missouri 64106

ECM 210 Chapter 6 - E-commerce Marketing Concepts: Social, Mobile, Local

Bringing Digital into Store, now and in the future

Chapter 8 Customer Relationship Management Benefits of CRM Helps in improving customer retention and loyalty Helps in generating high customer

MapInfo Predictive Analytics Group

Protein Protein Interaction Networks

B2B opportunity predictiona Big Data and Advanced. Analytics Approach. Insert

an introduction to VISUALIZING DATA by joel laumans

Introduction. A. Bellaachia Page: 1

Building relationships with web design clients using web hosting

SHOPPING FOR FOOD. Shopping For Food

Collaborative CRM Workshop. 02 Partner Alignment & Project Objectives

Driving Results with. Dynamic Creative Optimization

Practical Data Science with Azure Machine Learning, SQL Data Mining, and R

Infor M3 Assortment Replenishment Planner

Quick Introduction of Data Mining Techniques

Lecture 6 - Data Mining Processes

Data Storage And Backup

Using Survey Research to Evaluate Communications Campaigns

Class 10. Data Mining and Artificial Intelligence. Data Mining. We are in the 21 st century So where are the robots?

Executing a Successful SPIF

Advanced Marketing (Master) Content Skills Learning Targets Assessment Resources & Technology CEQ: WHAT IS PROMOTION, PLACE AND PRICE?

Additional Revenue Opportunities: Flat Fee Offers, Pay-Per-Call, Marketing & Paid Placements

Azure Machine Learning, SQL Data Mining and R

Learning ZoneXpress P.O. Box 1022, Owatonna, MN

OPINION MINING IN PRODUCT REVIEW SYSTEM USING BIG DATA TECHNOLOGY HADOOP

Sample of Best Practices

Perils and Pitfalls of PPC Marketing

Introduction. Epos tills capture the sales information to ensure you make better decisions

Guide to Selling Google AdWords for Resellers Consultative, Solution Based Sales

A/B SPLIT TESTING GUIDE How to Best Test for Conversion Success

Retail Marketing Strategy

Be Strong! Get your calcium-rich foods!

Success with emarketing

Perspective. Best Practices Continuum Map for Newspaper Publishers. Abstract

CHAPTER VII SUMMARY OF MAJOR FINDINGS AND RECOMMENDATIONS

OPPORTUNITY STATEMENT:

Transcription:

Data Mining Techniques Chapter 9: Market Basket Analysis and Association Rules Market basket analysis.................................................... 2 Market basket data I..................................................... 3 Market basket data II..................................................... 4 Association rules........................................................ 5 How good is an association rule?............................................. 6 Building association rules.................................................. 7 Pizza example.......................................................... 8 Calculations........................................................... 9 Refinements........................................................... 10 Extensions............................................................ 11 1

Market basket analysis Undirected data mining technique (no target or response variable). Kind of problem: what merchandise are customers buying and when? Which products tend to be purchased together and which are amenable to promotion? suggest new store layouts; determine which products to put on special; indicate when to issue coupons; tie data to individual customers through a loyalty card or website registration. Other potential applications see p. 288. Pitfalls: rules can end up merely describing previous marketing promotions; product differentiation may not be apparent until long history. c Iain Pardoe, 2006 2 / 11 Market basket data I Point-of-sale transaction data. Three levels of market basket data: customers; orders (purchases, baskets, item sets); items. Track customers over time: average # orders per customer; average # (unique) items per order; proportion of customers purchasing a particular product; average # orders per customer including a particular product; average quantity ordered per customer including a particular product. c Iain Pardoe, 2006 3 / 11 2

Market basket data II Order characteristics. Item popularity: in a one-item order; in a multi-item order; amongst customers who are repeat purchasers; over time; over regions. Tracking marketing interventions. Clustering products by usage which product(s) in a purchase suggest the purchase of other particular products at the same time: association rules (handfuls of items); cluster analysis (larger sets). c Iain Pardoe, 2006 4 / 11 Association rules If a customer buys item A, we expect he/she will also buy item B. Actionable: useful rules with understandable, high-quality info, e.g., if Barbie then chocolate; might suggest more prominent product placement (e.g., beer and diapers see p. 298), product tie-ins and promotions, or particular ways to advertise products. Trivial: already known by managers, e.g., if maintenance contract then large appliance; may reflect past marketing or product bundling; exceptions may signal failures in business operations, data collection, or processing. Inexplicable: seem to have no explanation and do not suggest a course of action, e.g., if new hardware store then toilet bowl cleaners; may reflect over-fitting. c Iain Pardoe, 2006 5 / 11 3

How good is an association rule? IfI 1 (condition/antecedent) theni 2 (result/consequent) e.g., if OJ then milk (p. 299, beware mistakes). Support = proportion of transactions with I 1 & I 2, e.g., 1/5 = 20.0%. Confidence = (# transactions with I 1 & I 2 ) / (# transactions with I 1 ), e.g., 1/4 = 25.0%. Lift=confidence/proportionoftransactionswithI 2, e.g., (1/4) / (1/5) = 1.25: or, lift = (actual # transactions with I 1 & I 2 ) / (expected#transwithi 1 &I 2 ifno relationship) = #(I 1 &I 2 ) / (#I 1 #I 2 /N) = 1 / (4 1/5); excess = # (I 1 & I 2 ) (#I 1 #I 2 / N) = 1 (4 1/5) = 0.20. c Iain Pardoe, 2006 6 / 11 Building association rules Determine the item set: select right level of detail (start broad, repeat with finer detail to hone in); product hierarchies help to generalize items, e.g., frozen food dessert, vegetable, dinner; hybrid approach depending on price or frequency (analysis easier when roughly same number of transactions for each item); virtual items go beyond product hierarchy, e.g., designer labels, low-fat products, energy-saving options, payment method, day, demographic info, etc. Calculate counts/probabilities of items and combinations of items. Analyze support/confidence/lift to find actionable rules. c Iain Pardoe, 2006 7 / 11 4

Pizza example c Iain Pardoe, 2006 8 / 11 Calculations Pr(I 1&I 2) Pr(I 1&I 2) Pr(I 1)Pr(I 2) Pr(I 1 &I 2 ) Pr(I 1 ) Pr(I 1) Pr(I 2 ) Rule Support Confidence Lift If M then P 0.25 0.450 0.556 0.425 1.31 If P then M 0.25 0.425 0.588 0.450 1.31 If M then C 0.20 0.450 0.444 0.400 1.11 If C then M 0.20 0.400 0.500 0.450 1.11 If P then C 0.15 0.425 0.353 0.400 0.88 If C then P 0.15 0.400 0.375 0.425 0.88 If (M,P) then C 0.05 0.250 0.200 0.400 0.50 If (M,C) then P 0.05 0.200 0.250 0.425 0.59 If (P,C) then M 0.05 0.150 0.333 0.450 0.74 Best rule: if Pepperoni then Mushroom. Support 25% and confidence 58.8%. Lift 1.31 means when Pepperoni requested then Mushroom is 31% more likely to be requested than if Pepperoni & Mushroom were unrelated. c Iain Pardoe, 2006 9 / 11 5

Refinements Negative rules: when lift < 1 then negating the result produces a better rule, e.g.: Pr(I 1 &I 2 ) Pr(I 1 ) Pr(I1&I2) Pr(I Pr(I 1) Pr(I 2 ) 1&I 2) Pr(I 1)Pr(I 2) Rule Support Conf Lift If (P,C) then not M 0.10 0.150 0.667 0.550 1.21 If (M,P) then not C 0.20 0.250 0.800 0.600 1.33 Overcoming practical limits: find rules with 2 items, 3 items, etc. Pruning: reduce # of items and combinations of items considered at each step (e.g., minimum support threshold). Problems of large datasets: very computer intensive. c Iain Pardoe, 2006 10 / 11 Extensions Using association rules to compare stores, promotions at various times, geographic areas, urban vs. suburban, seasons, etc. (use virtual items). Dissociation rules, e.g., if I 1 and not I 2 then I 3 (use sparingly for only the most frequent items). Sequential analysis using association rules: requires identifying customers and tracking them over time. c Iain Pardoe, 2006 11 / 11 6