Decision Trees and other predictive models. Mathias Lanner SAS Institute

Similar documents
Agenda. Mathias Lanner Sas Institute. Predictive Modeling Applications. Predictive Modeling Training Data. Beslutsträd och andra prediktiva modeller

EXPLORING & MODELING USING INTERACTIVE DECISION TREES IN SAS ENTERPRISE MINER. Copyr i g ht 2013, SAS Ins titut e Inc. All rights res er ve d.

!"!!"#$$%&'()*+$(,%!"#$%$&'()*""%(+,'-*&./#-$&'(-&(0*".$#-$1"(2&."3$'45"

An Introduction to Data Mining. Big Data World. Related Fields and Disciplines. What is Data Mining? 2/12/2015

Data Mining Classification: Decision Trees

Nine Common Types of Data Mining Techniques Used in Predictive Analytics

Enhancing Compliance with Predictive Analytics

Lecture 10: Regression Trees

How To Make A Credit Risk Model For A Bank Account

Gerry Hobbs, Department of Statistics, West Virginia University

Data Mining Techniques Chapter 6: Decision Trees

Data mining techniques: decision trees

A fast, powerful data mining workbench designed for small to midsize organizations

Using multiple models: Bagging, Boosting, Ensembles, Forests

Applied Data Mining Analysis: A Step-by-Step Introduction Using Real-World Data Sets

Data Mining for Knowledge Management. Classification

Data Mining Using SAS Enterprise Miner Randall Matignon, Piedmont, CA

Data Mining with R. Decision Trees and Random Forests. Hugh Murrell

An Overview and Evaluation of Decision Tree Methodology

TOWARDS SIMPLE, EASY TO UNDERSTAND, AN INTERACTIVE DECISION TREE ALGORITHM

PLANET: Massively Parallel Learning of Tree Ensembles with MapReduce. Authors: B. Panda, J. S. Herbach, S. Basu, R. J. Bayardo.

ANALYSIS OF FEATURE SELECTION WITH CLASSFICATION: BREAST CANCER DATASETS

ASSIGNMENT 4 PREDICTIVE MODELING AND GAINS CHARTS

What is Data Mining? MS4424 Data Mining & Modelling. MS4424 Data Mining & Modelling. MS4424 Data Mining & Modelling. MS4424 Data Mining & Modelling

Data Mining Applications in Fund Raising

Classification and Prediction

Data Preprocessing. Week 2

A Property and Casualty Insurance Predictive Modeling Process in SAS

Knowledge Discovery and Data Mining

DECISION TREE INDUCTION FOR FINANCIAL FRAUD DETECTION USING ENSEMBLE LEARNING TECHNIQUES

Learning Example. Machine learning and our focus. Another Example. An example: data (loan application) The data and the goal

Statistical Data Mining. Practical Assignment 3 Discriminant Analysis and Decision Trees

SAS ENTERPRISE MINER 5.3

ENSEMBLE DECISION TREE CLASSIFIER FOR BREAST CANCER DATA

Improving the Performance of Data Mining Models with Data Preparation Using SAS Enterprise Miner Ricardo Galante, SAS Institute Brasil, São Paulo, SP

Internet Gambling Behavioral Markers: Using the Power of SAS Enterprise Miner 12.1 to Predict High-Risk Internet Gamblers

Data Mining with SAS. Mathias Lanner Copyright 2010 SAS Institute Inc. All rights reserved.

Predictive Modeling of Titanic Survivors: a Learning Competition

ENTERPRISE MINER UNIVERSITY OF AUCKLAND

CART 6.0 Feature Matrix

Survey Analysis: Data Mining versus Standard Statistical Analysis for Better Analysis of Survey Responses

COMP3420: Advanced Databases and Data Mining. Classification and prediction: Introduction and Decision Tree Induction

A Property & Casualty Insurance Predictive Modeling Process in SAS

M15_BERE8380_12_SE_C15.7.qxd 2/21/11 3:59 PM Page Analytics and Data Mining 1

Data Mining Algorithms Part 1. Dejan Sarka

Start-up Companies Predictive Models Analysis. Boyan Yankov, Kaloyan Haralampiev, Petko Ruskov

Specific Usage of Visual Data Analysis Techniques

Data Mining. Nonlinear Classification

Welcome. Data Mining: Updates in Technologies. Xindong Wu. Colorado School of Mines Golden, Colorado 80401, USA

A Comparison of Decision Tree and Logistic Regression Model Xianzhe Chen, North Dakota State University, Fargo, ND

Unit 4 DECISION ANALYSIS. Lesson 37. Decision Theory and Decision Trees. Learning objectives:

Clustering UE 141 Spring 2013

Supervised Learning (Big Data Analytics)

Index Contents Page No. Introduction . Data Mining & Knowledge Discovery

How Organisations Are Using Data Mining Techniques To Gain a Competitive Advantage John Spooner SAS UK

Better credit models benefit us all

Data Mining and Knowledge Discovery in Databases (KDD) State of the Art. Prof. Dr. T. Nouri Computer Science Department FHNW Switzerland

Smart Sell Re-quote project for an Insurance company.

Classification algorithm in Data mining: An Overview

Evaluation of Feature Selection Methods for Predictive Modeling Using Neural Networks in Credits Scoring

Data Mining Part 5. Prediction

Clustering. Data Mining. Abraham Otero. Data Mining. Agenda

Interactive Data Mining and Design of Experiments: the JMP Partition and Custom Design Platforms

Data Mining. 1 Introduction 2 Data Mining methods. Alfred Holl Data Mining 1

Principles of Data Mining by Hand&Mannila&Smyth

Predicting Student Performance by Using Data Mining Methods for Classification

Data Mining. SPSS Clementine Clementine Overview. Spring 2010 Instructor: Dr. Masoud Yaghini. Clementine

Data Mining Part 5. Prediction

Data Mining: Overview. What is Data Mining?

Classification/Decision Trees (II)

Decision Trees What Are They?

Didacticiel Études de cas

A NEW DECISION TREE METHOD FOR DATA MINING IN MEDICINE

Insurance Analytics - analýza dat a prediktivní modelování v pojišťovnictví. Pavel Kříž. Seminář z aktuárských věd MFF 4.

THE HYBRID CART-LOGIT MODEL IN CLASSIFICATION AND DATA MINING. Dan Steinberg and N. Scott Cardell

PharmaSUG2011 Paper HS03


OLAP and Data Mining. Data Warehousing and End-User Access Tools. Introducing OLAP. Introducing OLAP

Applying Data Mining Techniques Using SAS Enterprise Miner. Course Notes

Russell K. Anderson. Visual Data Mining THE VISMINER APPROACH

Efficiency in Software Development Projects

Feature vs. Classifier Fusion for Predictive Data Mining a Case Study in Pesticide Classification

Knowledge Discovery and Data Mining. Structured vs. Non-Structured Data

Why Ensembles Win Data Mining Competitions

Chapter 12 Discovering New Knowledge Data Mining

Data Mining Practical Machine Learning Tools and Techniques

Full and Complete Binary Trees

Data Mining Lab 5: Introduction to Neural Networks

APPLICATION PROGRAMMING: DATA MINING AND DATA WAREHOUSING

A Data Mining Tutorial

Using Data Mining to Detect Insurance Fraud

What is Data mining?

IDENTIFYING BANK FRAUDS USING CRISP-DM AND DECISION TREES

An Overview of Data Mining: Predictive Modeling for IR in the 21 st Century

Transcription:

Decision Trees and other predictive models Mathias Lanner SAS Institute

Agenda Introduction to Predictive Models Decision Trees Pruning Regression Neural Network Model Assessment 2

Predictive Modeling The Essence of Data Mining Most of the big payoff [in data mining] has been in predictive modeling. Herb Edelstein 3

Predictive Modeling Applications Database marketing Financial risk management Fraud detection Process monitoring Pattern detection 4

Predictive Modeling Training Data Training Data case 1: inputs case 2: inputs case 3: inputs case 4: inputs case 5: inputs Numeric or categorical values 5

Predictive Modeling Score Data Training Data case 1: inputs case 2: inputs case 3: inputs case 4: inputs case 5: inputs Score Data case 1: inputs case 2: inputs case 3: inputs case 4: inputs case 5: inputs????? Only input values known 6

Predictions Training Data case 1: inputs case 2: inputs case 3: inputs case 4: inputs case 5: inputs Predictions Score Data case 1: inputs? case 2: inputs? case 3: inputs? case 4: inputs? case 5: inputs? 7

Predictions Training Data case 1: inputs case 2: inputs case 3: inputs case 4: inputs case 5: inputs Predictions Score Data case 1: inputs case 2: inputs case 3: inputs case 4: inputs case 5: inputs????? 8

Predictive Modeling Essentials new case Predict new cases x 3 x 4 Select useful inputs Optimize complexity 9

Predictive Modeling Essentials new case Predict new cases x 3 x 4 Select useful inputs Optimize complexity 10

Three Prediction Types Training Data case 1: inputs case 2: inputs case 3: inputs case 4: inputs case 5: inputs Predictions Decisions Rankings Estimates 11

Decision Predictions Training Data case 1: inputs case 2: inputs case 3: inputs case 4: inputs case 5: inputs Decisions primary secondary tertiary primary secondary Trained model uses input measurements to make best decision for each case. 12

Ranking Predictions Training Data case 1: inputs case 2: inputs case 3: inputs case 4: inputs case 5: inputs Rankings 720 520 620 580 470 Trained model uses input measurements to optimally rank each case. 13

Estimate Predictions Training Data case 1: inputs case 2: inputs case 3: inputs case 4: inputs case 5: inputs Estimates 0.65 0.33 0.54 0.47 0.28 Trained model uses input measurements to optimally estimate value. 14

Model Essentials Predict Review new case Predict new cases Decide, rank, estimate x 3 x 4 Select useful inputs Optimize complexity 15

Model Essentials Select Review new case Predict new cases x 3 x 4 Select useful inputs Optimize complexity 16

Curse of Dimensionality 1 D 2 D 3 D 17

Input Selection Redundancy Irrelevancy 18

Model Essentials Select Review new case x 3 x 4 Predict new cases Select useful inputs Optimize complexity Decide, rank, estimate Eradicate redundancies irrelevancies 19

Model Essentials Optimize new case Predict new cases x 3 x 4 Select useful inputs Optimize complexity 20

Fool s Gold My model fits the training data perfectly... I ve struck it rich! 21

Model Complexity Too flexible Not flexible enough 22

23 Data Splitting

Training Data Role Training Data case 1: inputs case 2: inputs case 3: inputs case 4: inputs case 5: inputs Validation Data case 1: inputs case 2: inputs case 3: inputs case 4: inputs case 5: inputs Training data gives sequence of predictive models with increasing complexity. 24

Validation Data Role Training Data case 1: inputs case 2: inputs case 3: inputs case 4: inputs case 5: inputs Validation Data case 1: inputs case 2: inputs case 3: inputs case 4: inputs case 5: inputs 0.73 0.78 0.49 0.53 0.41 Validation data helps select best model from sequence 25

Validation Data Role Training Data case 1: inputs case 2: inputs case 3: inputs case 4: inputs case 5: inputs Validation Data case 1: inputs case 2: inputs case 3: inputs case 4: inputs case 5: inputs 0.73 0.78 0.49 0.53 0.41 0.73 0.78 0.49 0.53 0.41 Validation data helps select best model from Sequence. 26

Model Essentials Optimize new case x 3 x 4 Predict new cases Select useful inputs Optimize complexity Decide, rank, estimate Eradicate redundancies irrelevancies Tune models with validation data 27

Agenda Introduction to Predictive Models DECISION TREES Pruning Regression Neural Networks Model Assessment 28

Predictive Modeling Tools Primary Decision Tree Regression Neural Network Specialty Dmine Regression MBR AutoNeural Rule Induction DMNeural Multiple Model Ensemble Two Stage 29

Predictive Modeling Tools Primary Decision Tree Regression Neural Network Specialty Dmine Regression MBR AutoNeural Rule Induction DMNeural Multiple Model Ensemble Two Stage 30

Predictive Modeling Tools Primary Decision Tree Regression Neural Network Specialty Dmine Regression MBR AutoNeural Rule Induction DMNeural Multiple Model Ensemble Two Stage 31

Model Essentials Decision Trees new case Predict new cases Prediction rules x 3 x 4 Select useful inputs Split search Optimize complexity Pruning 32

Simple Prediction Illustration Analysis goal: Predict the color of a dot based on its location in a scatter plot. 1.0 0.9 0.8 0.7 0.6 0.5 0.4 0.3 0.2 0.1 0.0 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.0 33

Model Essentials Decision Trees new case Predict new cases Prediction rules x 3 x 4 Select useful inputs Split search Optimize complexity Pruning 34

Decision Tree Prediction Rules 40% leaf node root node <0.63 0.63 interior node <0.52 0.52 <0.51 0.51 60% 70% 55% 1.0 0.9 0.8 0.7 0.6 0.5 0.4 0.3 0.2 0.1 0.0 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.0 35

Decision Tree Prediction Rules new case 40% <0.63 0.63 <0.52 0.52 <0.51 0.51 60% 70% 55% 1.0 0.9 0.8 0.7 0.6 0.5 0.4 0.3 0.2 0.1 0.0 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.0 36

Decision Tree Prediction Rules new case Decision = Estimate = 0.70 40% <0.63 0.63 <0.52 0.52 <0.51 0.51 60% 70% 55% 1.0 0.9 0.8 0.7 0.6 0.5 0.4 0.3 0.2 0.1 0.0 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.0 37

Model Essentials Decision Trees new case Predict new cases Prediction rules x 3 x 4 Select useful inputs Split search Optimize complexity Pruning 38

Decision Tree Split Search left right Calculate the logworth of every partition on input. 1.0 0.9 0.8 0.7 0.6 0.5 0.4 0.3 0.2 0.1 0.0 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.0 39

Decision Tree Split Search left right 53% 42% 47% 58% max logworth( ) 0.95 Select the partition with maximum logworth. 1.0 0.9 0.8 0.7 0.6 0.5 0.4 0.3 0.2 0.1 0.0 0.52 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.0 40

Decision Tree Split Search left right 53% 42% 47% 58% Repeat for input. max logworth( ) 0.95 1.0 0.9 0.8 0.7 0.6 0.5 0.4 0.3 0.2 0.1 0.0 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.0 41

Decision Tree Split Search left right 53% 42% 47% 58% max logworth( ) 0.95 1.0 0.9 0.8 0.7 0.6 0.63 bottom top 0.5 0.4 54% 35% 46% 65% max logworth( ) 4.92 0.3 0.2 0.1 0.0 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.0 42

Decision Tree Split Search <0.63 0.63 Create partition rule from best partition across all inputs. 1.0 0.9 0.8 0.7 0.6 0.5 0.4 0.3 0.2 0.1 0.0 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.0 0.63 43

Decision Tree Split Search <0.63 0.63 Repeat process in each subset. 1.0 0.9 0.8 0.7 0.6 0.5 0.4 0.3 0.2 0.1 0.0 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.0 44

Decision Tree Split Search left right 61% 40% 39% 60% logworth( ) 5.72 Select the partition with maximum logworth on input. 1.0 0.9 0.8 0.7 0.6 0.5 0.4 0.3 0.2 0.1 0.0 0.52 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.0 45

Decision Tree Split Search left right 61% 40% 39% 60% bottom top 38% 55% 62% 45% Repeat for input. logworth( ) 5.72 logworth( ) -2.01 1.0 0.9 0.8 0.7 0.6 0.5 0.4 0.3 0.2 0.1 0.0 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.0 0.02 46

Decision Tree Split Search <0.52 0.52 <0.63 0.63 Create second partition rule. 1.0 0.9 0.8 0.7 0.6 0.5 0.4 0.3 0.2 0.1 0.0 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.0 47

Decision Tree Split Search Repeat to form maximal tree. 1.0 0.9 0.8 0.7 0.6 0.5 0.4 0.3 0.2 0.1 0.0 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.0 48

49 Demo Decision Tree