The Oracle Data Mining Machine Bundle: Zero to Predictive Analytics in Two Weeks Collaborate 15 IOUG



Similar documents
Blazing BI: the Analytic Options to the Oracle Database. ODTUG Kscope 2013

Exadata V2 + Oracle Data Mining 11g Release 2 Importing 3 rd Party (SAS) dm models

Starting Smart with Oracle Advanced Analytics

1 Copyright 2011, Oracle and/or its affiliates. All rights reserved.

Statistical Analysis of Gene Expression Data With Oracle & R (- data mining)

OLSUG Workshop Oracle Data Mining

Forecasting, Prediction Models, and Times Series Analysis with Oracle Business Intelligence and Analytics. Rittman Mead BI Forum 2013

Data Visualization for Mobile Devices with OBI 11g. Collaborate 14

Predictive Analytics for Better Business Intelligence

Oracle Data Mining In-Database Data Mining Made Easy!

SQL - the best analysis language for Big Data!

Seamless Access from Oracle Database to Your Big Data

Semantic and Data Mining Technologies. Simon See, Ph.D.,

Big Data Analytics with Oracle Advanced Analytics In-Database Option

Anomaly and Fraud Detection with Oracle Data Mining 11g Release 2

Budgeting and Planning with Microsoft Excel and Oracle OLAP

Oracle Advanced Analytics - Option to Oracle Database: Oracle R Enterprise and Oracle Data Mining. Data Warehouse Global Leaders Winter 2013

Oracle Advanced Analytics 12c & SQLDEV/Oracle Data Miner 4.0 New Features

extreme Datamining mit Oracle R Enterprise

Oracle BI New Features Dan Vlamis Vlamis Software Solutions

Oracle's In-Database Statistical Functions

Advanced Dashboard Design in OBI 11g. Collaborate 2013 Session 726

Data Visualization for Oracle Business Intelligence 11g. Oracle OpenWorld 2014

Oracle Advanced Analytics Oracle R Enterprise & Oracle Data Mining

Data Analysis with Various Oracle Business Intelligence and Analytic Tools

Oracle Exalytics Briefing

The Data Mining Process

WebFOCUS RStat. RStat. Predict the Future and Make Effective Decisions Today. WebFOCUS RStat

Oracle Software. Hardware. Training. Consulting. Mythics Complete.

Fraud and Anomaly Detection Using Oracle Advanced Analytic Option 12c

Predictive Analytics Powered by SAP HANA. Cary Bourgeois Principal Solution Advisor Platform and Analytics

Oracle Big Data SQL Architectural Deep Dive

Getting Started with Oracle Data Miner 11g R2. Brendan Tierney

Data Visualization for Oracle Business Intelligence 11g. BIWA Summit 2015

Using Map Views and Spatial Analytics in OBI 11g. BIWA Summit 2014

Prerequisites. Course Outline

Tax Fraud in Increasing

What Are They Thinking? With Oracle Application Express and Oracle Data Miner

STATISTICAL ANALYSIS WITH EXCEL COURSE OUTLINE

Practical Data Science with Azure Machine Learning, SQL Data Mining, and R

High Performance Predictive Analytics in R and Hadoop:

Data Mining + Business Intelligence. Integration, Design and Implementation

Oracle Data Mining Hands On Lab

Introducing Oracle Exalytics In-Memory Machine

Building Cubes and Analyzing Data using Oracle OLAP 11g

Azure Machine Learning, SQL Data Mining and R

Advanced Analytics for Call Center Operations

Data Mining with Oracle Database 11g Release 2

Well packaged sets of preinstalled, integrated, and optimized software on select hardware in the form of engineered systems and appliances

ORACLE OLAP. Oracle OLAP is embedded in the Oracle Database kernel and runs in the same database process

Oracle Database Public Cloud Services

Using OBIEE for Location-Aware Predictive Analytics

General Session Oracle Business Analytics Executive Briefing

Advanced In-Database Analytics

Intel RAID SSD Cache Controller RCS25ZB040

SUN ORACLE EXADATA STORAGE SERVER

Big Data and Predictive Analytics: Fiserv Data Mining Case Study [CON8631] Data Warehouse and Big Data

SQL Server 2005 Features Comparison

præsentation oktober 2011

Understanding the Benefits of IBM SPSS Statistics Server

OBI 11g Data Visualization Best Practices

How to Build MicroStrategy Projects on Top of Big Data Sources in the Cloud

Oracle Data Mining 11g Release 2

Lavastorm Analytic Library Predictive and Statistical Analytics Node Pack FAQs

<Insert Picture Here> Best Practices for Extreme Performance with Data Warehousing on Oracle Database

SUN HARDWARE FROM ORACLE: PRICING FOR EDUCATION

TURKISH ORACLE USER GROUP

News and trends in Data Warehouse Automation, Big Data and BI. Johan Hendrickx & Dirk Vermeiren

Oracle Data Mining. Concepts 11g Release 2 (11.2) E

Oracle BI EE Implementation on Netezza. Prepared by SureShot Strategies, Inc.

Safe Harbor Statement

Anomaly and Fraud Detection with Oracle Data Mining

WebLogic on Oracle Database Appliance: Combining High Availability and Simplicity

PrimaryIO Application Performance Acceleration Date: July 2015 Author: Tony Palmer, Senior Lab Analyst

An In-Depth Look at In-Memory Predictive Analytics for Developers

Oracle Exadata Database Machine for SAP Systems - Innovation Provided by SAP and Oracle for Joint Customers

Architecting for the next generation of Big Data Hortonworks HDP 2.0 on Red Hat Enterprise Linux 6 with OpenJDK 7

Predictive Modeling Techniques in Insurance

PSAM, NEC PCIe SSD Appliance for Microsoft SQL Server (Reference Architecture) September 11 th, 2014 NEC Corporation

Pentaho High-Performance Big Data Reference Configurations using Cisco Unified Computing System

MakeMyTrip CUSTOMER SUCCESS STORY

IBM SPSS Modeler 15 In-Database Mining Guide

Converged storage architecture for Oracle RAC based on NVMe SSDs and standard x86 servers

Hexaware E-book on Predictive Analytics

Data Mining - The Next Mining Boom?

Intelligent Government From Data to Decision. Robert Lindsley Oracle, Public Sector Technology Group

ORACLE EXALYTICS IN-MEMORY MACHINE X3-4

Drivers to support the growing business data demand for Performance Management solutions and BI Analytics

R Tools Evaluation. A review by Global BI / Local & Regional Capabilities. Telefónica CCDO May 2015

Managing Capacity Using VMware vcenter CapacityIQ TECHNICAL WHITE PAPER

Integrating Apache Spark with an Enterprise Data Warehouse

SUN ORACLE DATABASE MACHINE

Transcription:

The Oracle Data Mining Machine Bundle: Zero to Predictive Analytics in Two Weeks Collaborate 15 IOUG Presentation #730 Tim Vlamis and Dan Vlamis Vlamis Software Solutions 816-781-2880 www.vlamis.com

Presentation Agenda Introduction Overview of Data Mining What is the Oracle Data Mining Machine Bundle Oracle Data Miner Demo Getting started right with analytics projects

Vlamis Software Solutions Vlamis Software founded in 1992 in Kansas City, Missouri Developed more than 200 Oracle BI systems Specializes in ORACLE-based: Data Warehousing Business Intelligence Data Mining and Predictive Analytics Data Visualization Expert presenter at major Oracle conferences www.vlamis.com (blog, papers, newsletters, services) Co-authors of book Data Visualization for OBI 11g Co-author of book Oracle Essbase & Oracle OLAP Oracle University Partner Oracle Gold Partner

Tim Vlamis and Dan Vlamis Tim Vlamis 25+ years experience in business modeling and valuation, forecasting, and scenario analyses Instructor for Oracle University s Data Mining Techniques and Oracle R Enterprise Essentials Courses Professional Certified Marketer (PCM) from AMA Adjunct Professor of Business Benedictine College MBA Kellogg School of Management (Northwestern University) BA Economics Yale University Dan Vlamis Founded Vlamis Software Solutions in 1993 25+ years in business intelligence, dimensional modeling Oracle Ace Director Developer for IRI (expert in Oracle OLAP and related) BA Computer Science Brown University

Top Mistakes Bringing in experts who are incented to prove that predictive analytics is hard. Choosing a starting project involving a complex and poorly understood business process. Choosing a starting project based solely on interesting data. Setting expectations that the first project will reveal previously unknown insights worth millions. Designing an enormous project. Feeding the Golden Fleece Award mentality.

Think about results Results drive analytic projects. Present results quickly. Get executives used to iterative, developmental analytics projects. Build staff capabilities and set them up for success. Keep results flowing and avoid major transitional hurdles. Plan for success, plan for production.

Oracle Data Mining Oracle Data Mining is an option for the Enterprise Edition of the Oracle Database. A collection of APIs and specialized SQL functions. Includes a large number of specialized algorithms and built-in procedures. Makes use of many built-in capabilities of the Oracle Database ODM typically refers to Oracle Data Mining

11g Statistics & SQL Analytics (Free) Ranking functions rank, dense_rank, cume_dist, percent_rank, ntile Window Aggregate functions (moving and cumulative) Avg, sum, min, max, count, variance, stddev, first_value, last_value LAG/LEAD functions Direct inter-row reference using offsets Reporting Aggregate functions Sum, avg, min, max, variance, stddev, count, ratio_to_report Statistical Aggregates Correlation, linear regression family, covariance Linear regression Fitting of an ordinary-least-squares regression line to a set of number pairs. Frequently combined with the COVAR_POP, COVAR_SAMP, and CORR functions Descriptive Statistics DBMS_STAT_FUNCS: summarizes numerical columns of a table and returns count, min, max, range, mean, median, stats_mode, variance, standard deviation, quantile values, +/- n sigma values, top/bottom 5 values Correlations Pearson s correlation coefficients, Spearman's and Kendall's (both nonparametric). Cross Tabs Enhanced with % statistics: chi squared, phi coefficient, Cramer's V, contingency coefficient, Cohen's kappa Hypothesis Testing Student t-test, F-test, Binomial test, Wilcoxon Signed Ranks test, Chi-square, Mann Whitney test, Kolmogorov-Smirnov test, One-way ANOVA Distribution Fitting Kolmogorov-Smirnov Test, Anderson-Darling Test, Chi-Squared Test, Normal, Uniform, Weibull, Exponential

Oracle Data Miner Oracle Data Miner is a front end GUI for Oracle Data Mining. Extension for Oracle SQL Developer 3.x, 4.x a free utility program from Oracle that facilitates interaction with databases. Functions as an object oriented programing interface for designing data mining processes and procedures.

Oracle Data Mining Algorithms Problem Algorithm Applicability Classification Regression Anomaly Detection Attribute Importance Association Rules Clustering Feature Extraction A1 A2 A3 A4 A5 A6 A7 F1 F2 F3 F4 Logistic Regression (GLM) Decision Trees Naïve Bayes Support Vector Machine Linear Regression (GLM) Support Vector Machine One Class SVM Minimum Description Length Principal Component Analysis Apriori Hierarchical K-Means Hierarchical O-Cluster Nonnegative Matrix Factorization (NMF) Singular Value Decomposition (SVD) Classical statistical technique Popular / Rules / transparency Embedded app Wide / narrow data / text Classical statistical technique Wide / narrow data / text Unknown fraud cases or anomalies Attribute reduction Identify useful data Reduce data noise Market basket analysis Next Best Offer Product grouping Text mining Gene and protein analysis Text analysis Feature reduction

ODM Machine Bundle Overview Hardware Oracle Database Appliance Software Oracle Database 12c or 11g (with options) Oracle Advanced Analytics Option including Oracle Data Mining and Oracle R Enterprise Oracle SQL Developer: Data Miner Add-in (free download) Services Implementation and configuration from Vlamis Software Solutions (Oracle Gold Partner) Oracle University Oracle Data Mining Techniques course (taught by Vlamis Software Solutions) Market Basket Analysis Project performed on company data Time frame: 9 business days (less than 2 weeks)

Oracle Database Appliance A scalable engineered system Fast installation and configuration Enterprise level components

Oracle Database Appliance X4-2 Base Configuration Specifications More Processing Power 2 x 1RU x86 Servers. Each Server Contains: 2 x 12-core Intel Xeon Processors E5-2697 v2 processors 256 GB RAM (16 x 16 GB) 2 x 600 GB Boot Disks (mirrored) 4 x 10GbE Ports Redundant 10GbE Interconnect 1 x 2RU Storage Shelf Direct-Attached 4 x 2.5 200 GB SSDs for Redo 20 x 2.5 900 GB HDDs for Data 13 Copyright 2013, Oracle and/or its affiliates. All rights reserved.

Oracle Database Appliance X4-2 Storage Expansion Shelf Specifications Double Available Storage Capacity Support for Additional 1 x 2RU Storage Expansion Shelf Direct-Attached 4 x 2.5 200 GB SSDs for Redo 20 x 2.5 900 GB HDDs for Data 14 Copyright 2013, Oracle and/or its affiliates. All rights reserved.

Oracle Database Appliance X5-2 2 Servers, each with 36 Intel Xeon CPU cores 256 GB memory, expandable to 768 GB Redundant InfiniBand Interconnect 10 GbE Public Network Storage Shelf 800 GB flash log storage 1.6 TB flash cache storage 64 TB HDD Copyright 2015 Oracle and/or its affiliates. All rights reserved.

ODA X5-2 Storage Expansion Shelf Zero-Admin/Online Storage Expansion Double available storage capacity Additional 64 TB HDD, 128 TB total for DATA Additional 800 GB SSD, 1.6 TB total for REDO Additional 1.6 TB SSD, 3.2 TB total for FLASH Zero administration Automatically integrates when plugged in Data automatically distributes to new shelf Online expand storage Hot-plug storage expansion shelf No database downtime Copyright 2015 Oracle and/or its affiliates. All rights reserved.

Oracle Data Mining Training (2 days) Introduction Data Mining Concepts and Terminology The Data Mining Process Introducing Oracle Data Miner 11g Release 2 Using Classification Models Using Regression Models Using Clustering Models Performing Market Basket Analysis Performing Anomaly Detection Deploying Data Mining Results

Schedule Day 1: Two consultants meet with client team to review project plan, review data sources, identification of best data to start with, set technical objectives for project Day 2: Consultant One: Install ODA and configure to network (need support from client tech staff) Consultant Two: Conduct first day of ODM class with client team Day 3: Consultant One: Install new pluggable Database, SQL Developer Consultant Two: Conduct second day of ODM class with client team Day 4: Two consultants establish data plan for project with client and import data Day 5: Consultant One: Prepare tables for mining (add keys, new tables, transforms, etc.) Consultant Two: Document data plan Day 6: Consultant Two: Build market basket workflow Day 7: Consultant Two: Conduct market basket analyses Day 8: Consultant Two: Prepare presentation of findings from market basket analyses Day 9: Consultant Two: Deliver presentation with client

Methodology Implement a highly scalable infrastructure Establish a common foundational understanding of data mining Demonstrate the Value of Analytics by Completing a Market Basket Project Immediately

Scalable Infrastructure Design with growth in mind Maintain a common environment Avoid recoding for production

Establish a Shared Data Mining Foundation Create a common understanding of data mining techniques, language, processes. Everyone should know what others know of. Ask for and show patience with experts.

Deliver Fast Results Demonstrate value immediately Slow is smooth, smooth is fast Pick the right starting project Success builds support

Start with Oracle Cloud Database Bundle includes two months of Oracle Database EE High Performance Virtual Image Two week onsite activity time frame Vlamis configures cloud instance for OAA Same schedule as with ODA

Advantages of Oracle Cloud Database Highly scalable Secure Ideal for sandbox development projects Works well as a Proof of Concept project before hardware commitment Perfect for organizations who want to move to the cloud

Three Reasons Nothing is more scalable for analytics than Oracle Database 12c running on Oracle engineered system hardware. Nothing is more important for 21 st Century organizations than to add deep analytic capability and to generate returns from their operational, proprietary data sets Nothing is more important to a project s success than the key people involved

Oracle Test Drive Free to try out Oracle BI, Advanced Analytics and Big Data Go to www.vlamis.com/td Runs off of Amazon AWS Hands-on Labs based on Collaborate 2012 HOLs Test Drives for: Oracle BI Oracle Advanced Analytics Big Data Once signed up, you have private instance for 5 hours Available now

BIWA Summit 2016, Jan 26-28 Oracle HQ Conference Center Business Intelligence, Warehousing and Analytics and Spatial IOUG Special Interest Group www.biwasummit.org

Drawing for Free Book Add business card to basket or fill out card

Vlamis Collaborate Presentations Presenter Session Time Location Title Dan and Tim Vlamis OAUG Mon 12:45 1:45 PM South Seas D Data Visualization for Oracle Business Intelligence 11g Dan and Tim Vlamis OAUG Mon 3:15 4:15 PM Coral B Designing an Analytics Strategy for the 21st Century Dan and Tim Vlamis IOUG Tues 2:00 3:00 PM Jasmine E Forecasting, Prediction Models, and Time Series Analysis with Database Analytics and OBIEE Dan and Tim Vlamis IOUG Wed 3:15 4:15 PM Banyan D The Oracle Data Mining Machine Bundle: Zero to Predictive Analytics in Two Weeks Jon Clark IOUG Thurs 12:15 1:15 PM Reef F Using Cloud technology for Oracle Database and Oracle BI Sandboxes and Training Environments

Thank You! Thank You for Attending Session #730 Presenter Information Tim Vlamis, Consultant Dan Vlamis, President Vlamis Software Solutions, Inc. 816-781-2880 tvlamis@vlamis.com dvlamis@vlamis.com For more information go to www.vlamis.com