Transform Big Data into Bigger Insight with Oracle Exadata and Oracle Advanced Analytics

Similar documents
Oracle Advanced Analytics 12c & SQLDEV/Oracle Data Miner 4.0 New Features

Fraud and Anomaly Detection Using Oracle Advanced Analytic Option 12c

Oracle Advanced Analytics Oracle R Enterprise & Oracle Data Mining

Anomaly and Fraud Detection with Oracle Data Mining 11g Release 2

Starting Smart with Oracle Advanced Analytics

Tax Fraud in Increasing

Exadata V2 + Oracle Data Mining 11g Release 2 Importing 3 rd Party (SAS) dm models

Anomaly and Fraud Detection with Oracle Data Mining

Big Data and Predictive Analytics: Fiserv Data Mining Case Study [CON8631] Data Warehouse and Big Data

Getting Started with Oracle Data Miner 11g R2. Brendan Tierney

Anomaly and Fraud Detection with Oracle Data Mining 11g Release 2

The Data Mining Process

Advanced In-Database Analytics

Data Mining with Oracle Database 11g Release 2

Big Data Analytics with Oracle Advanced Analytics In-Database Option

1 Copyright 2011, Oracle and/or its affiliates. All rights reserved.

Oracle Data Mining 11g Release 2

Name: Srinivasan Govindaraj Title: Big Data Predictive Analytics

Ganzheitliches Datenmanagement

Data Analysis with Various Oracle Business Intelligence and Analytic Tools

Oracle Data Mining In-Database Data Mining Made Easy!

WebFOCUS RStat. RStat. Predict the Future and Make Effective Decisions Today. WebFOCUS RStat

What Are They Thinking? With Oracle Application Express and Oracle Data Miner

KnowledgeSTUDIO HIGH-PERFORMANCE PREDICTIVE ANALYTICS USING ADVANCED MODELING TECHNIQUES

Oracle Data Miner (Extension of SQL Developer 4.0)

ORACLE BUSINESS INTELLIGENCE, ORACLE DATABASE, AND EXADATA INTEGRATION

The Oracle Data Mining Machine Bundle: Zero to Predictive Analytics in Two Weeks Collaborate 15 IOUG

IBM SPSS Modeler 15 In-Database Mining Guide

Up Your R Game. James Taylor, Decision Management Solutions Bill Franks, Teradata

Oracle Big Data Building A Big Data Management System

ANALYTICS CENTER LEARNING PROGRAM

Big Data Are You Ready? Jorge Plascencia Solution Architect Manager

SAP Predictive Analytics: An Overview and Roadmap. Charles Gadalla, SESSION CODE: 603

Big Data, Cloud Computing, Spatial Databases Steven Hagan Vice President Server Technologies

Safe Harbor Statement

ETPL Extract, Transform, Predict and Load

Oracle Big Data Essentials

Big Data Analytics with Oracle Advanced Analytics

extreme Datamining mit Oracle R Enterprise

ORACLE TAX ANALYTICS. The Solution. Oracle Tax Data Model KEY FEATURES

Introducing Oracle Exalytics In-Memory Machine

Unified Batch & Stream Processing Platform

Extend your analytic capabilities with SAP Predictive Analysis

How To Create A Business Intelligence (Bi)

KnowledgeSEEKER Marketing Edition

Harnessing the power of advanced analytics with IBM Netezza

Data Mining + Business Intelligence. Integration, Design and Implementation

Big Data Analytics Scaling R to Enterprise Data user! 2013 Albacete Spain #user2013

Predictive Analytics Powered by SAP HANA. Cary Bourgeois Principal Solution Advisor Platform and Analytics

SAP Predictive Analysis: Strategy, Value Proposition

Operationalising Predictive Insights

EMC Greenplum Driving the Future of Data Warehousing and Analytics. Tools and Technologies for Big Data

Using OBIEE for Location-Aware Predictive Analytics

HIGH PERFORMANCE ANALYTICS FOR TERADATA

Oracle Big Data Handbook

Native Connectivity to Big Data Sources in MicroStrategy 10. Presented by: Raja Ganapathy

Introduction to Big Data Analytics p. 1 Big Data Overview p. 2 Data Structures p. 5 Analyst Perspective on Data Repositories p.

April 2016 JPoint Moscow, Russia. How to Apply Big Data Analytics and Machine Learning to Real Time Processing. Kai Wähner.

Big Data and Data Science: Behind the Buzz Words

Some vendors have a big presence in a particular industry; some are geared toward data scientists, others toward business users.

Greenplum Database. Getting Started with Big Data Analytics. Ofir Manor Pre Sales Technical Architect, EMC Greenplum

An Integrated Analytics & Big Data Infrastructure September 21, 2012 Robert Stackowiak, Vice President Data Systems Architecture Oracle Enterprise

IBM SPSS Modeler Professional

SEIZE THE DATA SEIZE THE DATA. 2015

Oracle s Big Data solutions. Roger Wullschleger. <Insert Picture Here>

Where is... How do I get to...

ORACLE OLAP. Oracle OLAP is embedded in the Oracle Database kernel and runs in the same database process

KnowledgeSEEKER POWERFUL SEGMENTATION, STRATEGY DESIGN AND VISUALIZATION SOFTWARE

Oracle Data Miner (Extension of SQL Developer 4.0)

Oracle Big Data Spatial & Graph Social Network Analysis - Case Study

Executive Summary... 2 Introduction Defining Big Data The Importance of Big Data... 4 Building a Big Data Platform...

Data Mining. SPSS Clementine Clementine Overview. Spring 2010 Instructor: Dr. Masoud Yaghini. Clementine

Make Better Decisions Through Predictive Intelligence

Big Data and Advanced Analytics Applications and Capabilities Steven Hagan, Vice President, Server Technologies

SAP Predictive Analysis: Strategy, Value Proposition

An In-Depth Look at In-Memory Predictive Analytics for Developers

Actian SQL in Hadoop Buyer s Guide

Operationalising Predictive Insights

BIG DATA: FROM HYPE TO REALITY. Leandro Ruiz Presales Partner for C&LA Teradata

Big Data and Analytics in Government

IBM SPSS Modeler Premium

The basic data mining algorithms introduced may be enhanced in a number of ways.

SAS Fraud Framework for Banking

Connecting Hadoop with Oracle Database

Achieve Better Insight and Prediction with Data Mining

Oracle9i Data Warehouse Review. Robert F. Edwards Dulcian, Inc.

ORACLE BUSINESS INTELLIGENCE FOUNDATION SUITE 11g WHAT S NEW

High Performance Predictive Analytics in R and Hadoop:

Data Mining Solutions for the Business Environment

Product recommendations and promotions (couponing and discounts) Cross-sell and Upsell strategies

UNLEASHING THE VALUE OF THE TERADATA UNIFIED DATA ARCHITECTURE WITH ALTERYX

BIG DATA TECHNOLOGY. Hadoop Ecosystem

Make Better Decisions Through Predictive Intelligence

How to use Big Data in Industry 4.0 implementations. LAURI ILISON, PhD Head of Big Data and Machine Learning

Real-Time Insight with Oracle Transactional Business Intelligence

Predictive Analytics

Oracle Business Intelligence Mobile

Graph Database Performance: An Oracle Perspective

Cisco Data Preparation

Oracle Big Data SQL Technical Update

Transcription:

Transform Big Data into Bigger Insight with Oracle Exadata and Oracle Advanced Analytics Charlie Berger, Senior Director, Product Mgt. OAA Marcos Arancibia, Product Manager, OAA Michael Bramley, Science Director R&D, dunnhunby

Oracle Big Data Solution Architecture Decide Oracle Real-Time Decisions Endeca Information Discovery Oracle BI Foundation Suite Oracle Event Processing Apache Flume Oracle GoldenGate Cloudera Hadoop Oracle NoSQL Database Oracle R Distribution Oracle Big Data Connectors Oracle Data Integrator Oracle Database Oracle Advanced Analytics Oracle Spatial & Graph Stream Acquire Organize Analyze 2

Oracle In-Database Analytics Statistical Functions Data Mining & Predictive Analytics Text Mining Text Search Graph Analysis Spatial Analysis Semantic Analysis In-Database MapReduce 3

Oracle Advanced Analytics Fastest Way to Deliver Scalable Enterprise-wide Predictive Analytics Key Features In-database data mining algorithms and open source R algorithms SQL, PL/SQL, R languages Scalable, parallel in-database execution Workflow GUI and IDEs Integrated component of Database Enables enterprise analytical applications 4

Oracle Advanced Analytics Performance and Scalability with Low Total Cost of Ownership Traditional Analytics Data Import Data Mining Model Scoring Data Prep. & Transformation Data Mining Model Building Data Prep & Transformation Data Extraction Hours, Days or Weeks Oracle Advanced Analytics avings Model Scoring Embedded Data Prep Model Building Data Preparation Secs, Mins or Hours Data remains in the Database Scalable, parallel Data Mining algorithms in SQL kernel Efficient execution of R open-source packages with in-database data preparation High-performance parallel scoring of Data Mining and R open-source models Fastest path from data to insights Integrated GUI for Predictive Analytics Database scoring engine Lowest TCO Eliminate data duplication Eliminate separate analytical servers 5

Oracle Advanced Analytics Architecture SQL Developer R Client OBIEE Applications Oracle Database Enterprise Edition Oracle Advanced Analytics Native SQL-PL/SQL Analytic Libraries plus high-performance R interface Scalable, Distributed, Parallel Execution Oracle R Distribution 6

Oracle Advanced Analytics Architecture SQL Developer R Client OBIEE Applications Oracle Database Enterprise Edition Oracle Advanced Analytics Native SQL-PL/SQL Analytic Libraries plus high-performance R interface Scalable, Distributed, Parallel Execution Oracle R Distribution 7

Oracle Advanced Analytics In-Database Data Mining Algorithms Classification Algorithms Logistic Regression (GLM) Decision Trees Naïve Bayes Support Vector Machines (SVM) Applicability Classical statistical technique Popular / Rules / transparency Embedded app Wide / narrow data / text Regression Linear Regression (GLM) Support Vector Machine (SVM) Classical statistical technique Wide / narrow data / text Anomaly Detection One Class SVM Unknown fraud cases or anomalies Attribute Importance A1 A2 A3 A4 A5 A6 A7 Minimum Description Length (MDL) Principal Components Analysis (PCA) Attribute reduction, Reduce data noise Association Rules Apriori Market basket analysis / Next Best Offer Clustering Hierarchical k-means Hierarchical O-Cluster Expectation-Maximization Clustering (EM) Product grouping / Text mining Gene and protein analysis Feature Extraction F1 F2 F3 F4 Nonnegative Matrix Factorization (NMF) Singular Value Decomposition (SVD) Text analysis / Feature reduction 8

Oracle Advanced Analytics Wide Range of In-Database Data Mining and Statistical Functions Data Understanding & Visualization Summary & Descriptive Statistics Cross tabulations Tests for Correlations (t-test, Pearson s, ANOVA) Histograms, scatter plots, box plots, bar charts R graphics: 3-D plots, link plots, special R graph types Selected Base SAS equivalents Data Selection, Preparation & Transformations Joins, Tables, Views, Data Selection, Data Filter, Join multiple databases Select, Filter, Rank, SQL time windows, Sample Re-coding, Missing values Aggregations Spatial data R to SQL transparency and push down In-Database Algorithms Classification Models Regression Models Clustering Anomaly Detection Associations / Market Basket Analysis Text Mining Most OAA algorithms support unstructured data (i.e. customer comments, email, abstracts, etc.) R Integration: Additional custom Oracle R packages with algorithms that run against Database and Hadoop (like Neural Networks and Stepwise Regression) Open-source R packages ability to run open source R CRAN packages * included in every Oracle Database 9

OAA SQL DM Fraud Example R begin dbms_data_mining.create_model('claimsmodel', 'CLASSIFICATION', 'CLAIMS', 'POLICYNUMBER', null, 'CLAIMS_SET'); end; / -- Top 5 most suspicious fraud policy holder claims select POLICYNUMBER, round(prediction_probability(claimsmodel, '0' using *)*100,2) prob_fraud from CLAIMS where PASTNUMBEROFCLAIMS in ('2to4', 'morethan4') order by prob_fraud desc fetch first 5 rows only; POLICYNUMBER PERCENT_FRAUD RNK ------------ ------------- ---------- 6532 64.78 1 2749 64.17 2 3440 63.22 3 654 63.1 4 12650 62.36 5 For Automated Monthly Application! Just add: Create View CLAIMS2_30 As Select * from CLAIMS2 Where mydate > SYSDATE 30 10 Copyright 2012, Oracle and/or its affiliates. All rights reserved. Insert Information Protection Policy Classification from Slide 13

Oracle Advanced Analytics Architecture SQL Developer R Client OBIEE Applications Oracle Database Enterprise Edition Oracle Advanced Analytics Native SQL-PL/SQL Analytic Libraries plus high-performance R interface Scalable, Distributed, Parallel Execution Oracle R Distribution 11 Copyright 2012, Oracle and/or its affiliates. All rights reserved. Insert Information Protection Policy Classification from Slide 13

Oracle Data Miner GUI SQL Developer 4.0 Extension Free OTN Download Easy to Use Oracle Data Miner GUI for data analysts Work flow paradigm Powerful Multiple algorithms & data transformations Runs 100% in-db Build, evaluate and apply models Automate and Deploy Save and share analytical workflows Generate SQL scripts for deployment 12

Oracle Advanced Analytics Architecture SQL Developer R Client OBIEE Applications Oracle Database Enterprise Edition Oracle Advanced Analytics Native SQL-PL/SQL Analytic Libraries plus high-performance R interface Scalable, Distributed, Parallel Execution Oracle R Distribution 13

Business Intelligence + Advanced Analytics Integration through SQL and R All predictions, insights and models are in the Database any BI tool can access and query using SQL OBIEE s integrated spatial mapping can be used to Map predictions OBIEE dashboards can launch parameterized R calculations that can return data or visualizations Any BI tool or application that supports SQL can take advantage Customer most likely to be HIGH and VERY HIGH value customer in the future Advanced R Statistical graphic output directly in the Dashboard 14

Business Intelligence + Advanced Analytics Integration through SQL and R All predictions, insights and models are in the Database any BI tool can access and query using SQL OBIEE s integrated spatial mapping can be used to Map predictions OBIEE dashboards can launch parameterized R calculations that can return data or visualizations Any BI tool or application that supports SQL can take advantage 15

Oracle Advanced Analytics Architecture SQL Developer R Client OBIEE Applications Oracle Database Enterprise Edition Oracle Advanced Analytics Native SQL-PL/SQL Analytic Libraries plus high-performance R interface Scalable, Distributed, Parallel Execution Oracle R Distribution 16

Enabling Predictive Applications Example Oracle Applications Using Oracle Advanced Analytics HCM Fusion Predictive Workforce employee turnover and performance prediction and What if? analysis CRM Fusion Sales Prediction Engine--prediction of sales opportunities, what to sell, amount, timing, etc. Supply Chain Management Spend Classification real-time flagging of noncompliance and anomalies in expense submissions Identity Management Oracle Adaptive Access Manager real-time security and fraud analytics Industry Data Models Communications Data Model implements churn prediction, segmentation, profiling, etc. Retail Data Model implements loyalty and market basket analysis Airline Data Model implements analysis frequent flyers, loyalty, etc. Oracle Fin. Services Analytic Applications Customer Insight, Enterprise Risk Management Enterprise Performance, Financial Crime and Compliance OFSAA CI Retail Customer Analytics Attrition Analysis- Mortgage Prepay, Savings Account Attrition, Term Deposit, Cards Survival analysis Customer Lifetime value Propensity Models- Credit Cards <-> Auto loans, Savings <-> Cards Retail Analytics Oracle Retail Customer Analytics shopping cart analysis and next best offers Customer Support Predictive Incident Monitoring (PIM) Customer Service offering for Database customers 17

Oracle Communications Industry Data Model Pre-Built Predictive Models Fastest Way to Deliver Scalable Enterprise-wide Predictive Analytics OAA s clustering and predictions available in-db for OBIEE Automatic Customer Segmentation, Churn Predictions, and Sentiment Analysis 18

OCDM Telco Churn Enhanced by SNA Analysis Social Network Analysis of Large Volumes of CDR Data Integrated with OCDM, OBIEE, and leverages Oracle Data Mining with specialized SNA code Identification of social network communities Predictive scores for churn and influence at a node level, as well as potential revenue/value at risk User interface targeted at business users and flexible ad-hoc reporting 19

Fusion HCM Predictive Workforce Fusion Human Capital Management Powered by OAA Oracle Advanced Analytics factory-installed predictive analytics Employees likely to leave, Top reasons, expected performance Real-time "What if?" analysis 20

Oracle Advanced Analytics Architecture SQL Developer R Client OBIEE Applications Oracle Database Enterprise Edition Oracle Advanced Analytics Native SQL-PL/SQL Analytic Libraries plus high-performance R interface Scalable, Distributed, Parallel Execution Oracle R Distribution 21

Why statisticians/data analysts use R R is a statistics language similar to Base SAS or SPSS statistics The R environment is.. Powerful Extensible Graphical Extensive statistics OOTB functionality with many knobs but smart defaults Ease to install and use Free 22 2012 Oracle All Rights Reserved

Oracle Strategy for R Provide high-performance, scalable R environment tightly integrated with Oracle RDBMS and Hadoop For R users Full access to Database and HDFS objects High performance and scalability for all R operations Scalable, Natively integrated machine learning algorithms Deploy R scripts and store R calculation results in Database or Hadoop For Database & Big Data developers Execute embedded R scripts containing any R algorithm or calculation Access stored R results in Database or Hadoop Retrieve R computation results in graphical formats like XML or PNG Integrate R results into BI Applications 23

Oracle Advanced Analytics: Database Integration Using the in-database Integration and Open-Source R Packages Client Interfaces Oracle Database Server R Client Interface Oracle Database Oracle R Enterprise packages: SQL, PL/SQL or R Σ(x) Advanced Analytics Option SQL Basic Statistics Transparency Data Mining algorithms Embedded R (x) Registered R Scripts called via SQL Parallel ExtProc Interconnect SQL Interfaces Any SQL & PL/SQL New SQL Query Node in ODM GUI Oracle R Distribution Enhanced linear algebra performance Parallel distributed analytic techniques that leverage R language constructs Custom R algorithms:neural/stepwise Access to open-source R packages 24

Oracle Advanced Analytics: Hadoop Integration Using the Hadoop-HDFS Integration, Custom and Open-Source R Packages Client Interfaces Hadoop Cluster R Client Interface Oracle R Connector for Hadoop packages: Hadoop MapReduce HIVE Transparency Layer Oracle R Enterprise packages: Transparency Embedded R SQL, PL/SQL, R R, Java Oracle Database Advanced Analytics Option Oracle R Distribution Big Data Connectors Oracle R Connector for Hadoop Σ(x) (x) Translation of R requests to Hadoop: HDFS Utilities: Data Movement and Statistics, pushing data to R, Data Sampling ORCH Utilities: Connect/Disconnect R Sessions HIVE Interfaces: Load table metadata and interface ORCH Custom R algorithms: Neural, GLM, kmeans,nmf,lmf Custom R Analytics are written once for a Mapper & Reducer framework, and are reused as is. I/O is then built for both the Database and Hadoop Parallel MapReduce Calls HDFS engine 25

Oracle Advanced Analytics Summary New Features Oracle Advanced Analytics 12c New SQL data mining algorithms (Expectation Maximization, PCA, Singular Vector Decomposition, Text Mining and other algorithm improvements) Predictive SQL Queries automatic build, apply within SQL query Oracle Data Miner/SQL Developer 4.0 (for Oracle Database 11g and 12c) New Graph node (box, scatter, bar, histograms) SQL Query node + integration of R scripts Automatic SQL script generation for deployment Oracle R Enterprise 1.4 (for Oracle Database 11g and 12c) Parallelized Neural Networks with ore.neural() against Database data Scoring Database tables with open-source R Models; in-database Sampling Support for Date and Time data types for Time Series Analysis Persist and Manage R Objects in-database; Improved integration with OBIEE 26

More Information on OAA Google: Oracle Advanced Analytics OTN: http://www.oracle.com/technetwork/database/options/advanced-analytics/index.html Oracle Demo Campgrounds Demo Pod OOW Exhibit Hall Hours (Mon-Wed) Moscone South, Left, Workstation ID: SL-063, Database, Data Warehousing OAA Hands on Labs: Big Data, Bigger Insights with Oracle Advanced Analytics and Oracle SQL Developer [HOL10074] Monday, Sep 23, 3:15 PM - 4:15 PM - Marriott Marquis - Salon 3/4 Make the Right Offers to Customers Using Oracle Advanced Analytics [HOL10075] Tuesday, Sep 24, 10:30 AM - 11:30 AM - Marriott Marquis - Salon 3/4 27

28

29