INTRODUCTION TO DATA MINING SAS ENTERPRISE MINER



Mary-Elizabeth ("M-E") Eddlestone
Principal Systems Engineer, Analytics
SAS Customer Loyalty, SAS Institute, Inc.

AGENDA
- Overview/Introduction to Data Mining and Predictive Modeling
- Building Models Using SAS Enterprise Miner
  - Walk through example
  - Essential steps: Sample, Explore, Modify, Model, Assess, Score
  - Show selection of tools, how to change their properties and surface results
- Building Automated Models using Excel or SAS Enterprise Guide (Rapid Predictive Modeler)

INTRODUCTION TO DATA MINING

DATA MINING GOALS INSIGHT AGILE or DYNAMIC PERSONALIZATION SPEED PRECISION IMPROVED PROFITABILITY Better Decisions

ANALYTICS INFERENTIAL
Inferential Statistics
Uses patterns in the sample data to draw inferences about the population represented, accounting for randomness.
- Answering yes/no questions about the data (hypothesis testing)
- Describing associations within the data (correlation)
- Modeling relationships within the data (regression)
Source: Wikipedia

ANALYTICS PREDICTIVE
Predictive Analytics
Encompasses a variety of techniques from statistics, modeling, machine learning, and data mining that analyze current and historical facts to make predictions about future, or otherwise unknown, events.
Includes:
- Data Mining
- Forecasting
Source: Wikipedia

ANALYTICS DATA MINING VERSUS FORECASTING
Both are predictive and both model past behavior.
DATA MINING
- Time independent
- Causal (relationship) focused
- Categorical, Continuous, Discrete
- Seldom weights more recent observations
FORECASTING
- Time dependent
- Interval oriented
- Continuity assumed
- Frequently weights more recent phenomena
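To make the point about weighting recent observations concrete, here is a minimal sketch in Python. It is not taken from the presentation or from any SAS product: the function names, the sample series, and the smoothing factor `alpha` are all illustrative assumptions. It contrasts an equally weighted average with simple exponential smoothing, one common forecasting technique that gives recent values more weight.

```python
def equal_weight_mean(values):
    """Every observation counts the same, regardless of when it occurred."""
    return sum(values) / len(values)


def exponential_smoothing(values, alpha=0.5):
    """Simple exponential smoothing: each new value pulls the level toward it,
    so recent observations carry more weight than older ones.
    alpha is an assumed smoothing factor between 0 and 1."""
    level = values[0]
    for value in values[1:]:
        level = alpha * value + (1 - alpha) * level
    return level


# Hypothetical series whose most recent value jumps from 10 to 20.
history = [10.0, 10.0, 10.0, 20.0]

print(equal_weight_mean(history))      # 12.5
print(exponential_smoothing(history))  # 15.0
```

The smoothed level reacts more strongly to the latest jump than the plain average does, which is the behavior the forecasting column describes.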

DATA MINING Descriptive Data Mining Predictive Data Mining

DATA MINING
Descriptive Data Mining
- Clustering (Segmentation)
- Associations and Sequences
Predictive Data Mining
- Classification: models to predict class membership
- Regression: models to predict a number

THE GOAL? SCORING!
Scoring is the act of applying what we've learned from data mining to new cases. Keep this goal in mind and use it to help formulate the questions and the data needed for data mining and scoring.

THE ULTIMATE GOAL? BETTER DECISIONS The ultimate goal of data mining is to improve decision making. As you formulate your problem, also keep in mind how and when model scores will be used.

EXAMPLE DEVELOPING A CLASSIFICATION MODEL
Models are developed using historical data in which the behavior is observed or known.
Information about each subject, in this case an individual, is used as inputs to the model to see how well the model can distinguish between the people who exhibit the behavior and those who do not. For example, age, gender, previous behaviors, etc.
(Figure callout: indicates the behavior was observed in this subject.)

EXAMPLE DATA

WHY? Consider a group of subjects whose relevant behavior is unknown. The same information is available for each of these subjects (age, gender, etc.) as is available for the individuals with known behavior. We would like to know which individuals are most likely to have the relevant behavior.

EXAMPLE NEW DATA?

SCORING
The output of a predictive classification model is typically an equation. Models are applied to new cases to calculate the predicted behavior through a process called scoring. Scoring, using the equation, calculates each subject's likelihood of having the relevant behavior. (It also calculates the likelihood of not having the behavior.)
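As a rough illustration of scoring with an equation, the following Python sketch applies a logistic-regression-style formula to new cases. The presentation does not give the equation, so the intercept, the coefficients, and the input names (`age`, `prior_purchases`) are hypothetical values chosen only for this example. This is not the score code that SAS Enterprise Miner generates.

```python
import math

# Hypothetical model equation. A real model would estimate these values
# from historical data in which the behavior is known.
INTERCEPT = -3.0
COEFFICIENTS = {"age": 0.04, "prior_purchases": 0.5}


def score(subject):
    """Apply the equation to one subject and return both likelihoods."""
    linear = INTERCEPT + sum(
        weight * subject[name] for name, weight in COEFFICIENTS.items()
    )
    p_event = 1.0 / (1.0 + math.exp(-linear))  # logistic function
    return {"p_event": p_event, "p_non_event": 1.0 - p_event}


# New cases whose behavior is unknown but whose inputs are available.
new_cases = [
    {"id": 1, "age": 25, "prior_purchases": 0},
    {"id": 2, "age": 50, "prior_purchases": 2},
]

scored = [{**case, **score(case)} for case in new_cases]
for row in scored:
    print(row["id"], round(row["p_event"], 3), round(row["p_non_event"], 3))
```

Each scored row carries the likelihood of the behavior and of its absence, so subjects can be ranked by `p_event` to find those most likely to exhibit the behavior.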

EXAMPLE SCORED DATA

THE ANALYTICS LIFECYCLE
Stages: Identify / Formulate Problem, Data Preparation, Data Exploration, Transform & Select, Build Model, Validate Model, Deploy Model, Evaluate / Monitor Results
BUSINESS MANAGER
- Domain Expert
- Makes Decisions
- Evaluates Processes and ROI
BUSINESS ANALYST
- Data Exploration
- Data Visualization
- Report Creation
IT SYSTEMS / MANAGEMENT
- Data Preparation
- Model Validation
- Model Deployment
- Model Monitoring
DATA MINER / STATISTICIAN
- Exploratory Analysis
- Descriptive Segmentation
- Predictive Modeling

THE ANALYTICS LIFECYCLE
Stages: Identify / Formulate Problem, Data Preparation, Data Exploration, Transform & Select, Build Model, Validate Model, Deploy Model, Evaluate / Monitor Results

MAIN TYPES OF DATA MARTS
- One-Row-per-Subject Data Mart
- Multiple-Row-per-Subject Data Mart
- Longitudinal Data Mart
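The difference between the first two layouts can be sketched in a few lines of Python. The table and column names (`customer_id`, `amount`) and the chosen aggregates are assumptions made for illustration, not something the presentation specifies. The sketch collapses a multiple-row-per-subject table of transactions into a one-row-per-subject table.

```python
from collections import defaultdict

# Multiple-row-per-subject layout: one row per transaction (hypothetical data).
transactions = [
    {"customer_id": "A", "amount": 20.0},
    {"customer_id": "B", "amount": 5.0},
    {"customer_id": "A", "amount": 30.0},
]


def to_one_row_per_subject(rows):
    """Aggregate transaction rows into a single summary row per customer."""
    totals = defaultdict(lambda: {"n_transactions": 0, "total_amount": 0.0})
    for row in rows:
        summary = totals[row["customer_id"]]
        summary["n_transactions"] += 1
        summary["total_amount"] += row["amount"]
    return [{"customer_id": cid, **agg} for cid, agg in sorted(totals.items())]


one_row_per_subject = to_one_row_per_subject(transactions)
for row in one_row_per_subject:
    print(row)
```

The aggregated table has exactly one row per customer, with the transaction history summarized into input columns.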

THE ANALYTICS LIFECYCLE
SAS Enterprise Miner focuses on these aspects of the process.
DATA MINER / STATISTICIAN
- Exploratory Analysis
- Descriptive Segmentation
- Predictive Modeling
Stages: Identify / Formulate Problem, Data Preparation, Data Exploration, Transform & Select, Build Model, Validate Model, Deploy Model, Evaluate / Monitor Results

SAS ENTERPRISE MINER

SAS ENTERPRISE MINER
- Organized and logical GUI for data mining success
- Unmatched suite of modeling techniques and methods
- Sophisticated set of data preparation, summarization and exploration tools
- Business-based model comparisons, reporting and management

SAS ENTERPRISE MINER
- Automated scoring process delivers faster results
- High-performance grid-enabled workbench
- Modern, distributable data mining system suited for large enterprises
- Open, extensible design for ultimate flexibility

WHAT IS SAS ENTERPRISE MINER?
SAS Enterprise Miner is a sophisticated graphical user interface, designed with the specific needs of data miners in mind.
SAS Enterprise Miner is a data miner's workbench that manages the process and provides a comprehensive set of tools to aid the data miner throughout the essential steps, known by the acronym SEMMA: Sample, Explore, Modify, Model, Assess.
SAS Enterprise Miner streamlines the data mining process to create highly accurate predictive and descriptive models based on analysis of vast amounts of data from across an enterprise.
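The SEMMA steps can be sketched outside the tool as a toy Python pipeline. This is only an illustration of the order of the steps under simplifying assumptions: the data set, the single input `x`, the mean imputation, and the one-threshold "model" are all hypothetical, and none of it reflects how SAS Enterprise Miner nodes are implemented.

```python
import random


def sample(rows, train_fraction=0.5, seed=0):
    """Sample: shuffle reproducibly, then split into training and validation."""
    shuffled = list(rows)
    random.Random(seed).shuffle(shuffled)
    cut = int(len(shuffled) * train_fraction)
    return shuffled[:cut], shuffled[cut:]


def explore(rows):
    """Explore: row count, event rate, and number of missing inputs."""
    return {
        "n": len(rows),
        "event_rate": sum(r["target"] for r in rows) / len(rows),
        "n_missing_x": sum(1 for r in rows if r["x"] is None),
    }


def modify(rows, fill_value):
    """Modify: replace missing inputs with a fill value."""
    return [{**r, "x": fill_value if r["x"] is None else r["x"]} for r in rows]


def accuracy(rows, threshold):
    """Share of rows where the rule 'x >= threshold' matches the target."""
    hits = sum(1 for r in rows if (r["x"] >= threshold) == (r["target"] == 1))
    return hits / len(rows)


def model(train):
    """Model: pick the cut-off on x that best separates the training data."""
    candidates = sorted({r["x"] for r in train})
    return max(candidates, key=lambda t: accuracy(train, t))


def assess(valid, threshold):
    """Assess: measure the fitted rule on data it has not seen."""
    return accuracy(valid, threshold)


# Hypothetical data: the target is 1 when x >= 5; one row has a missing input.
rows = [{"x": float(i), "target": 1 if i >= 5 else 0} for i in range(9)]
rows.append({"x": None, "target": 1})

summary = explore(rows)
train, valid = sample(rows)
known = [r["x"] for r in train if r["x"] is not None]
fill = sum(known) / len(known)  # mean of the non-missing training inputs
train, valid = modify(train, fill), modify(valid, fill)
threshold = model(train)
validation_accuracy = assess(valid, threshold)
print(summary, threshold, validation_accuracy)
```

Computing the fill value from the training partition only, and assessing on the validation partition, keeps the assessment honest about how the rule would behave on new cases.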

DATA MINING WITH SAS ENTERPRISE MINER

SAS ENTERPRISE MINER 7.1 AND 12.1 MODEL DEVELOPMENT PROCESS (SEMMA) Sample Explore Modify Model Assess Utility

SAS ENTERPRISE MINER

SAS ENTERPRISE MINER Use the desired tools to define a logical process (SEMMA) Sample Explore Modify Model Assess

SAS ENTERPRISE MINER Modify settings (properties) for the tools.

SAS ENTERPRISE MINER Run the flow and check results. Refine as needed.

DEMONSTRATION

AUTOMATED PREDICTIVE MODELING

SAS RAPID PREDICTIVE MODELER KEY DRIVERS (BUSINESS USERS)
- Need to generate numerous models to solve a variety of business problems in a credible manner
- Need models to be developed quickly using a self-service approach
- Do not want to always rely on analytic professionals (e.g., statisticians, modelers, or data miners)

SAS RAPID PREDICTIVE MODELER KEY DRIVERS (ANALYTIC PROFESSIONALS)
- Solving more complex issues on hand to gain incremental value
- Further customize or refine models for better results

RAPID PREDICTIVE MODELER

1. Open your data in SAS Enterprise Guide or Microsoft Excel
2. Use the Rapid Predictive Modeler task and modify settings
3. Review results

Microsoft Excel

SAS Enterprise Guide

RAPID PREDICTIVE MODELER BASIC

RAPID PREDICTIVE MODELER INTERMEDIATE

RAPID PREDICTIVE MODELER ADVANCED

RAPID PREDICTIVE MODELER: SAMPLE OUTPUT

DEMONSTRATION

IN CONCLUSION

SAS ENTERPRISE MINER BENEFITS
- Support the entire data mining process with a broad set of tools.
- Build more models faster with an easy-to-use graphical user interface.
- Enhance accuracy of predictions.
- Surface business information and easily share results through the unique model repository.

RESOURCES
- SAS Rapid Predictive Modeler Website: product brief, press release, brief product demo, etc.
- SAS Enterprise Miner Web Site
- SAS Enterprise Miner Technical Support Web Site
- SAS Enterprise Miner Technical Forum (Join Today!)
- SAS Enterprise Miner Training
- Rapid Predictive Modeling for Customer Intelligence: SAS Global Forum 2010 paper written by Wayne Thompson and David Duling, SAS Institute Inc., Cary, NC

POTENTIAL NEXT STEPS
- Work through the example in Getting Started with SAS Enterprise Miner. Both the data and the documentation are available on support.sas.com: http://support.sas.com/documentation/onlinedoc/miner/
- Contact SAS Technical Support if you get stuck. There is no charge for this; it is included in your SAS software license.

THANK YOU FOR USING SAS! www.sas.com