Revolution R Enterprise



Similar documents
High Performance Predictive Analytics in R and Hadoop:

Revolution R Enterprise: Efficient Predictive Analytics for Big Data

Decision Trees built in Hadoop plus more Big Data Analytics with Revolution R Enterprise

In-Database Analytics Deep Dive with Teradata and Revolution R

Using Microsoft R Server to Address Scalability Issues

R and Hadoop: Architectural Options. Bill Jacobs VP Product Marketing & Field CTO, Revolution

Find the Hidden Signal in Market Data Noise

R Tools Evaluation. A review by Global BI / Local & Regional Capabilities. Telefónica CCDO May 2015

SQL Server Everything built-in. Csom Gergely Microsoft Adat platform szakértő

SAS and Teradata Partnership

Investor Presentation. Second Quarter 2015

Up Your R Game. James Taylor, Decision Management Solutions Bill Franks, Teradata

HIGH PERFORMANCE ANALYTICS FOR TERADATA

Laurence Liew General Manager, APAC. Economics Is Driving Big Data Analytics to the Cloud

BIG DATA: FROM HYPE TO REALITY. Leandro Ruiz Presales Partner for C&LA Teradata

Delivering Value from Big Data with Revolution R Enterprise and Hadoop

INVESTOR PRESENTATION. First Quarter 2014

INVESTOR PRESENTATION. Third Quarter 2014

Harnessing the power of advanced analytics with IBM Netezza

Advanced Big Data Analytics with R and Hadoop

ADVANCED ANALYTICS AND FRAUD DETECTION THE RIGHT TECHNOLOGY FOR NOW AND THE FUTURE

Building and Deploying Customer Behavior Models

Luncheon Webinar Series May 13, 2013

Understanding Data Warehouse Needs Session #1568 Trends, Issues and Capabilities

Advanced In-Database Analytics

Focus on the business, not the business of data warehousing!

Mike Maxey. Senior Director Product Marketing Greenplum A Division of EMC. Copyright 2011 EMC Corporation. All rights reserved.

UNIFY YOUR (BIG) DATA

Integrated Big Data: Hadoop + DBMS + Discovery for SAS High Performance Analytics

Big Data & QlikView. Democratizing Big Data Analytics. David Freriks Principal Solution Architect

The Future of Data Management

APPROACHABLE ANALYTICS MAKING SENSE OF DATA

KnowledgeSEEKER Marketing Edition

2015 Ironside Group, Inc. 2

WebFOCUS RStat. RStat. Predict the Future and Make Effective Decisions Today. WebFOCUS RStat

Ganzheitliches Datenmanagement

Modern Data Architecture for Predictive Analytics

High-Performance Analytics

Extend your analytic capabilities with SAP Predictive Analysis

Big Data and Data Science: Behind the Buzz Words

IBM Data Warehousing and Analytics Portfolio Summary

KnowledgeSTUDIO HIGH-PERFORMANCE PREDICTIVE ANALYTICS USING ADVANCED MODELING TECHNIQUES

Data Warehouse as a Service. Lot 2 - Platform as a Service. Version: 1.1, Issue Date: 05/02/2014. Classification: Open

Teradata s Big Data Technology Strategy & Roadmap

HDP Hadoop From concept to deployment.

Lavastorm Analytic Library Predictive and Statistical Analytics Node Pack FAQs

High-Performance Business Analytics: SAS and IBM Netezza Data Warehouse Appliances

SAP Predictive Analytics: An Overview and Roadmap. Charles Gadalla, SESSION CODE: 603

Name: Srinivasan Govindaraj Title: Big Data Predictive Analytics

AGENDA. What is BIG DATA? What is Hadoop? Why Microsoft? The Microsoft BIG DATA story. Our BIG DATA Roadmap. Hadoop PDW

EMC Greenplum Driving the Future of Data Warehousing and Analytics. Tools and Technologies for Big Data

Cisco Data Preparation

Ramesh Bhashyam Teradata Fellow Teradata Corporation

EVERYTHING THAT MATTERS IN ADVANCED ANALYTICS

Data Integration Checklist

White Paper. Redefine Your Analytics Journey With Self-Service Data Discovery and Interactive Predictive Analytics

In-Database Analytics

IBM Netezza High Capacity Appliance

Introducing Oracle Exalytics In-Memory Machine

End to End Solution to Accelerate Data Warehouse Optimization. Franco Flore Alliance Sales Director - APJ

Next Generation Data Warehousing Appliances

Building Your Big Data Team

Architectures for Big Data Analytics A database perspective

WHITE PAPER. Harnessing the Power of Advanced Analytics How an appliance approach simplifies the use of advanced analytics

SEIZE THE DATA SEIZE THE DATA. 2015

HDP Enabling the Modern Data Architecture

Application of Predictive Analytics for Better Alignment of Business and IT

Modernizing Your Data Warehouse for Hadoop

Open Source in Financial Services: Meet the challenges of new business models and disruption

Artur Borycki. Director International Solutions Marketing

Using Tableau Software with Hortonworks Data Platform

BIG DATA ANALYTICS REFERENCE ARCHITECTURES AND CASE STUDIES

Advanced analytics at your hands

Greenplum Database. Getting Started with Big Data Analytics. Ofir Manor Pre Sales Technical Architect, EMC Greenplum

Oracle Big Data Strategy Simplified Infrastrcuture

Il mondo dei DB Cambia : Tecnologie e opportunita`

9.4 Intelligence. SAS Platform. Overview Second Edition. SAS Documentation

SAS and Oracle: Big Data and Cloud Partnering Innovation Targets the Third Platform

Oracle Advanced Analytics 12c & SQLDEV/Oracle Data Miner 4.0 New Features

An Integrated Analytics & Big Data Infrastructure September 21, 2012 Robert Stackowiak, Vice President Data Systems Architecture Oracle Enterprise

6 Steps to Faster Data Blending Using Your Data Warehouse

whitepaper Predictive Analytics with TIBCO Spotfire and TIBCO Enterprise Runtime for R

VIEWPOINT. High Performance Analytics. Industry Context and Trends

Tableau Visual Intelligence Platform Rapid Fire Analytics for Everyone Everywhere

Extending The Value of SAP with the SAP BusinessObjects Business Intelligence Platform Product Integration Roadmap

Architecting for Big Data Analytics and Beyond: A New Framework for Business Intelligence and Data Warehousing

ANALYTICS CENTER LEARNING PROGRAM

Predictive Analytics Powered by SAP HANA. Cary Bourgeois Principal Solution Advisor Platform and Analytics

WHAT S NEW IN SAS 9.4

Bringing the Power of SAS to Hadoop. White Paper

Data Refinery with Big Data Aspects

Predictive Analytics with TIBCO Spotfire and TIBCO Enterprise Runtime for R

Data Analysis with Various Oracle Business Intelligence and Analytic Tools

Netezza and Business Analytics Synergy

The Enterprise Data Hub and The Modern Information Architecture

Transcription:

Revolution R Enterprise Michele Chambers Chief Strategy Officer & VP Product Management @ Revolution Analytics Bill Franks Chief Analytics Officer @ Teradata

Agenda Emerging Big Data Analytic Patterns Teradata Overview Revolution R Enterprise Teradata + Revolution R Enterprise Q&A 2

Polling Question #1: Rate your current use/ knowledge of R: Actively Using R Now Are planning on using < 6 mo. Evaluation R now Not at all

Michele Chambers Chief Strategy Officer & VP Product Management Revolution Analytics @MCanalytics

1st Generation Predictive Analytics

Today s Challenge: Accelerating Business Cadence Changing Business Environment Fact Based Decisions Require More Data Need to Understand Tradeoffs and Best Course of Action Predictive Models Need to Continually Deliver Lift Reduced Shelf Life for Predictive Models Faster Time to Value Reduce Analytic Cycle Time Build & Deploy Models Faster Eliminate Time Consuming Data Movements Rapid Customer Facing Decisions Score More Frequently Need to Make Best Decision in Real Time 6

2nd Generation Predictive Analytics Big Data Machine Learning Quick to Fail Lift 7

Typical Challenges Our Customers Face Big Data Many new data sources Data variety & velocity Fine grain control Data movement, memory limits Complex Computation Experimentation Many small models Ensemble models Simulation Enterprise Readiness Heterogeneous landscape Write once, deploy anywhere Skill shortage Production support Production Efficiency Shorter model shelf life Volume of Models Long end-toend cycle time Pace of decision accelerated 8

Big Data Big Analytics is different

Polling Questions #2 What Analytics Challenges Are You Facing <Choose All That Apply> Data Too Large for Current Solutions Lack Timely Access To Needed Data Slow Model Development or Execution Inconsistent Solutions Across Platforms Acceptability of Open Source Cost of Solutions Too High None of the Above.

Data Analytics Integration Decision Analytic Reference Architecture Analytic Applications Middleware Analytic Tools Hadoop Data Warehouse Other Data Sources 11

Data Analytics Data & Analytics Integration Integration Decision Decision Architectural Approaches for Analytics Beside Architecture Inside Architecture Analytic Applications Analytic Applications Middleware Middleware Analytic Tools Analytic Tools Local Data Mart Hadoop Data Warehouse Other Data Sources Hadoop Data Warehouse Other Data Sources 12

Pros & Cons of Approaches Beside Architecture Inside Architecture Hybrid Architecture Analytic workflow tasks performed in a separate analytics environment outside of the source database Pros: Segregates analytic workload Cons: Doesn t leverage powerful production for transformations, introduces scoring latencies Analytics workflow tasks performed inside the source database with embedded analytics Pros: Eliminates data movement, reduces model latency, allows exploration of all data Cons: IT governance on production, potential new skills Some analytic workflow tasks performed inside the source database & others performed in a separate analytics environment Pros: Leverages strengths of each architecture Cons: Maintain multiple environments 13

Data Analytics Data & Analytics Data Analytics Building & Deploying Analytic Models Beside Architecture Data Sources 5 4 3 1 Analytic Tools Local 2 4 Data Mart Inside Architecture Analytic Tools 1 2 4 Hadoop, DW, Other Hybrid Architecture Analytic Tools Data Sources 4 3 1 1 2 Analytic Tools Local Data Mart Legend 1 Data prep / marshaling 2 Model build 3 Model recode / PMML 4 Model deploy 5 Update data 14

The Teradata Unified Data Architecture Any User, Any Data, Any Analysis TERADATA UNIFIED DATA ARCHITECTURE ERP MANAGE MOVE ACCESS Marketing Marketing Executives SCM CRM INTEGRATED DATA WAREHOUSE Applications Operational Systems Images Audio and Video DATA PLATFORM In-database & managed servers Business Intelligence Frontline Workers Customers Partners Machine Logs Text DISCOVERY PLATFORM Data Mining Engineers Data Scientists Web and Social Business Analysts' SOURCES ANALYTIC TOOLS USERS

Bill Franks Chief Analytics Officer Teradata @billfranksga

Teradata Highlights 2,400+ Customers in 77 Countries 10,000+ Employees A Top 10 Public U.S. Software Company Ranked 2nd Best Company in America Motley Fool World s Most Ethical Companies Ethisphere Institute Leader in Data Analytics Leader in Integrated Marketing Management Source: Gartner Magic Quadrant for Data Warehouse Database Management Systems (February 2013)

Teradata Portfolio Teradata is the global leader in analytic data platforms, business applications, and consulting services that help organizations become more competitive by increasing the value of their data and customer relationships. Data Warehousing Workload-specific Platforms Discovery Platforms Integrated Marketing Management Consulting/Support Services Partnerships with key analytic technology providers and system integrators

Debt<10% of Income Yes Good Credit Risks Yes NO Income>$40K Bad Credit Risks NO NO Debt=0% Yes Good Credit Risks Why Is Teradata Different? Server Based vs. In-Database Architectures Sample Data Desktop and Server Analytic Architecture Results Results In-Database Analytic Architecture In-DBS Request Exponential Performance Improvement

Raising The Bar With Teradata In-Database Analytics 2010 2011 2013 Expanding Portfolio Premier of Revolution Analytics parallel R in-dbs Aster Data Acquisition Pioneer of big data analytics Data Labs Simplified In-DBS self-service Analytics 2007 Partnership Centric Collaborated with partners to enable in-dbs processing 1998 Teradata Warehouse Miner Pioneer of in-dbs mining

Teradata Data Lab Self-Service Analytic Sandboxes With Governance Data Lab(s) inside system to easily join to production data Load experimental, untested data from external sources Rapid prototyping, exploratory and experimentation analysis An architecture design that enables governance Works within your current data warehouse environment Teradata Data Lab portlets for IT and business analyst SAS data csv data External Data Teradata Workload Management Data Labs Teradata Data Warehouse

The Teradata Unified Data Architecture Any User, Any Data, Any Analysis TERADATA UNIFIED DATA ARCHITECTURE ERP Marketing Marketing Executives SCM CRM INTEGRATED DATA WAREHOUSE Applications Operational Systems Images DATA PLATFORM Data Lab Business Intelligence Frontline Workers Audio and Video Data Mining Customers Partners Machine Logs Text DISCOVERY PLATFORM Math and Stats Engineers Data Scientists Web and Social Languages Business Analysts' SOURCES ANALYTIC TOOLS USERS

Teradata Workload-Specific Platforms Fit Broad Range of Customer Needs Data Mart Appliance Extreme Data Appliance Data Warehouse Appliance Active Enterprise Data Warehouse Appliance for Hadoop Aster Big Analytics Appliance SAS High- Performance Analytics Workloads Test / Development or Smaller Data Marts Analytical Archive, Deep Dive Analytics Strategic Intelligence, Decision Support System, Fast Scan Strategic / Operational Intelligence, Real-Time Update, Active workloads Appliance for Storing, Capturing and Refining Data. Hortonworks HDP 1.1 Discovery Platform for Big Data Analytics with embedded SQL MapReduce for new data types and sources Dedicated appliance for SAS high-performance analytic model development WE CAN COMPETE ON ANY WORKLOAD, AT ANY PRICE POINT, FOR ANY CUSTOMER

Revolution R Enterprise What is Revolution R Enterprise? How well does Revolution R Enterprise play in the Big Data zoo?

Revolution R Enterprise is. the only big data big analytics platform based on open source R the defacto statistical computing language for modern analytics High Performance, Scalable Analytics Portable Across Enterprise Platforms Easier to Build & Deploy Analytics 25

R is open source and drives analytic innovation but.has some limitations for Enterprises Big Data In-memory bound Hybrid memory & disk scalability Speed of Analysis Enterprise Readiness Analytic Breadth & Depth Commercial Viability Operates on bigger volumes & factors Single threaded Parallel threading Shrinks analysis time Community support Commercial support Delivers full service production support 5000+ innovative analytic packages Risk of deployment of open source Leverage open source packages plus Big Data ready packages Commercial license Supercharges R Eliminate risk with open source 26

Introducing Revolution R Enterprise (RRE) The Big Data Big Analytics Platform DevelopR DeployR ConnectR ScaleR DistributedR Big Data Big Analytics Ready Enterprise readiness High performance analytics Multi-platform architecture Data source integration Development tools Deployment tools 27

The Platform Step by Step: R Capabilities R+CRAN Open source R interpreter UPDATED R 3.0.2 Freely-available R algorithms Algorithms callable by RevoR Embeddable in R scripts 100% Compatible with existing R scripts, functions and packages RevoR Performance enhanced R interpreter Based on open source R Adds high-performance math Available On: Platform TM LSF TM Linux Microsoft HPC Clusters Windows & Linux Servers Windows & Linux Workstations NEW Teradata Database IBM Netezza IBM BigInsights TM Cloudera Hadoop Hortonworks Hadoop Intel Hadoop 28

The Platform Step by Step: Parallelization & Data Sourcing ScaleR Ready-to-Use high-performance big data big analytics Fully-parallelized analytics Data prep & data distillation Descriptive statistics & statistical tests Correlation & covariance matrices Predictive Models linear, logistic, GLM Machine learning Monte Carlo simulation NEW Tools for distributing customized algorithms across nodes ConnectR High-speed & direct connectors Available for: High-performance XDF SAS, SPSS, delimited & fixed format text data files Hadoop HDFS (text & XDF) Teradata Database & Aster EDWs and ADWs ODBC DistributedR Distributed computing framework Delivers portability across platforms Available on: Windows Servers Red Hat and NEW SuSE Linux Servers IBM Platform LSF Linux Microsoft HPC Clusters NEW Teradata Database NEW Cloudera Hadoop NEW Hortonworks Hadoop 29

The Platform Step by Step: Tools & Deployment DevelopR Integrated development environment for R Visual step-into debugger Available on: Windows DevelopR DeployR DeployR Web services software development kit for integration analytics via Java, JavaScript or.net APIs Integrates R Into application infrastructures Capabilities: Invokes R Scripts from web services calls RESTful interface for easy integration Works with web & mobile apps, leading BI & Visualization tools and business rules engines 30

Write Once. Deploy Anywhere. Hadoop Hortonworks Cloudera EDW NEW Teradata Database IBM Netezza ConnectR DeployR Clustered Systems Workstations & Servers IBM Platform LSF Microsoft HPC Desktop Server ScaleR In the Cloud Amazon AWS DistributedR DESIGNED FOR SCALE, PORTABILITY & PERFORMANCE 31

Data Analytics Integration Decision Revolution R Enterprise Propels Enterprises into the Future Analytic Applications Middleware Hadoop Data Warehouse Other Data Sources 32

Data Analytics Data & Analytics Integration Integration Decision Decision One Big Data Big Analytics Platform Multiple Architectures Beside Architecture Inside Architecture Analytic Applications Analytic Applications Middleware DeployR DeployR Middleware Analytic Tools DevelopR RevoR & DistributedR & ScaleR Analytic & DevelopR Tools ConnectR Local Data Mart RevoR & DistributedR & ScaleR Hadoop Data Warehouse Other Data Sources Hadoop Data Warehouse Other Data Sources 33

Data Analytics Data & Analytics Data Analytics Revolution R Enterprise Powers Write Once, Deploy Anywhere Beside Architecture Data Sources 5 4 3 1 Analytic Tools Local Data Mart 2 4 Bottom Line: Save Time, Save Money, Get Insights Faster Inside Analytic Tools Direct connectors access data without data movement Hadoop, DW, Other Architecture Push down analyzing data without movement 1 2 4 Use same R script on any platform without recoding Use right platform for the job! Hybrid Analytic Tools Architecture Data Sources 4 3 1 1 2 Analytic Tools Local Data Mart Legend 1 Data prep / marshaling 2 Model build 3 Model recode / PMML 4 Model deploy 5 Update data 34

The Power of Revolution R Enterprise Performance & Scalability V a l u e ScaleR ScaleR ScaleR ScaleR DistributedR DistributedR DistributedR RevoR Open Source Moves computation to data Moves computation to data Leverage CRAN Labor saving power Maximizes computation Powerful divide & conquer Effective memory utilization 3-50X faster Leverage latest innovation 35

RRE in Teradata Database Revolution R Enterprise: Web Services Desktop & Enterprise Applications ODBC Teradata Database Database Nodes Hybrid Storage 36

How Does It Work? RRE in Teradata Database Desktops & Servers Desktop & Enterprise Applications Revolution R Enterprise Web Services ODBC Teradata Database Parse Engine External Stored Procedure Database Nodes AMPs Table Operator Table Operator Table Operator Table Operator Table Operator Hybrid Storage Data Segments 37

Product Architecture & Feature Highlights Revolution R Enterprise ScaleR: High Performance Big Data Analytics Data Prep, Distillation & Descriptive Analytics R Data Step Descriptive Statistics Statistical Tests Sampling Data import Delimited, Fixed, SAS, SPSS, OBDC Variable creation & transformation Recode variables Factor variables Missing value handling Sort Merge Split Aggregate by category (means, sums) Min / Max Mean Median (approx.) Quantiles (approx.) Standard Deviation Variance Correlation Covariance Sum of Squares (cross product matrix for set variables) Pairwise Cross tabs Risk Ratio & Odds Ratio Cross-Tabulation of Data (standard tables & long form) Marginal Summaries of Cross Tabulations Chi Square Test Kendall Rank Correlation Fisher s Exact Test Student s t-test Subsample (observations & variables) Random Sampling 38

Product Architecture & Feature Highlights Revolution R Enterprise ScaleR: High Performance Big Data Analytics Statistical Modeling Machine Learning Predictive Models Data Visualization Variable Selection Cluster Analysis Sum of Squares (cross product matrix for set variables) Multiple Linear Regression Generalized Linear Models (GLM) - All exponential family distributions: binomial, Gaussian, inverse Gaussian, Poisson, Tweedie. Standard link functions including: cauchit, identity, log, logit, probit. User defined distributions & link functions. Covariance & Correlation Matrices Logistic Regression Classification & Regression Trees Predictions/scoring for models Residuals for all models Histogram Line Plot Scatter Plot Lorenz Curve ROC Curves (actual data and predicted values) NEW Tree Visualization Stepwise Regression (linear, NEW logistic and NEW GLM) Simulation Monte Carlo K-Means Classification Decision Trees NEW Decision Forests Deployment NEW PMML Export 39

R + Revolution R Enterprise Unequaled Big Data Big Analytics Deploy Analytics Web, Mobile, Data Visualization, BI Big Data Distributed Analytics Big Data Distributed Analytics Performance Enhanced R Performance Enhanced R 40

Integrating Enterprise Apps Revolution R Enterprise: Deploy R Web Services Desktop & Enterprise Applications ODBC Teradata Database Database Nodes Hybrid Storage 41

Decision Layer: Examples on-demand analytics On-demand sales forecasting Real-time social media sentiment analysis Leveraging the power of R from Microsoft tools 42

RRE 7.0: Alteryx Integration Comparable to SAS Enterprise Miner, IBM SPSS Modeler (aka Clementine) Easy way to create analytic models with visual based analytic flows R analytics Read any data source Create analytic flow 43

RRE 7.0: Tableau Accelerator Makes analytics consumable to end users Tableau integration is via.net (creates a Tableau document which is read by Tableau) 44

RRE 7.0: MS Excel Accelerator Makes analytics consumable to most pervasive BI tool Excel integration is.net R analytics R visualization 45

Leveraging Teradata + Revolution R Enterprise Teradata Database, DistributedR & ScaleR Scalable R-Based In-Database Transformation High-Performance, Parallel R-Based Data Transformation Augment Traditional ETL, ELT and SQL Methods Free R Users To Load and Transform Data Independently Fast, In-database Exploration & Model Development Vastly Accelerate Model Development, Validation and Re-Training Eliminate Data Movement Penalties & Duplication Costs Embrace In-database Machine Learning Next Generation Big Data Big Analytics Scalable To Meet All the Data All The Time Requirements Integrate the Enterprise With In-Database Scoring Make On-Demand Analytics a Reality For All Users 46

Polling Questions #3 How would you consider applying Teradata + Revolution R Enterprise? <click all that apply> Operations Risk Management Customer Analytics

How is RRE Used? Discovering Patterns with Big Data Customer segmentation Market basket analysis Social networking analysis Fraud detection Marketing attribution Sentiment analysis and much more Building Models Efficiently Credit risk Customer churn Propensity to buy Market risk Operational risk and much more Flexibly Deploying Models to Consumers Customer lifetime value Pricing optimization Recommendation engines and much more 48

Why Customers Buy Teradata & Revolution R Enterprise Powers R for the Enterprise Supercharge R with Teradata Scale Enabling Analytics Across the UDA Lower Cost & Risk of Big Analytics 49

www.revolutionanalytics.com 50

www.revolutionanalytics.com/products 51

http://www.teradata.com/partners/revolution-analytics/ 52

www.revolutionanalytics.com/contact-us 53

54

Thank you.