Cloudera & SAS Partnership Overview. Graham Pymm Cloudera Systems Engineer

Similar documents
Hadoop & SAS Data Loader for Hadoop

WHAT S NEW IN SAS 9.4

QUEST meeting Big Data Analytics

More Data in Less Time

Dell Cloudera Syncsort Data Warehouse Optimization ETL Offload

Interactive data analytics drive insights

Extend your analytic capabilities with SAP Predictive Analysis

The Future of Data Management

Financial, Telco, Retail, & Manufacturing: Hadoop Business Services for Industries

SAS and Teradata Partnership

locuz.com Big Data Services

Dell In-Memory Appliance for Cloudera Enterprise

Bringing the Power of SAS to Hadoop. White Paper

APPROACHABLE ANALYTICS MAKING SENSE OF DATA

Analytics With Hadoop. SAS and Cloudera Starter Services: Visual Analytics and Visual Statistics

Dell* In-Memory Appliance for Cloudera* Enterprise

SAS and Oracle: Big Data and Cloud Partnering Innovation Targets the Third Platform

Native Connectivity to Big Data Sources in MSTR 10

Collaborative Big Data Analytics. Copyright 2012 EMC Corporation. All rights reserved.

High-Performance Analytics

ANALYTICS MODERNIZATION TRENDS, APPROACHES, AND USE CASES. Copyright 2013, SAS Institute Inc. All rights reserved.

Oracle Big Data Handbook

Reimagining Business with SAP HANA Cloud Platform for the Internet of Things

Executive Summary... 2 Introduction Defining Big Data The Importance of Big Data... 4 Building a Big Data Platform...

News and trends in Data Warehouse Automation, Big Data and BI. Johan Hendrickx & Dirk Vermeiren

SAP Solution Brief SAP HANA. Transform Your Future with Better Business Insight Using Predictive Analytics

and Hadoop Technology

EMC Greenplum Driving the Future of Data Warehousing and Analytics. Tools and Technologies for Big Data

End to End Solution to Accelerate Data Warehouse Optimization. Franco Flore Alliance Sales Director - APJ

ORACLE DATA INTEGRATOR ENTERPRISE EDITION

Apigee Insights Increase marketing effectiveness and customer satisfaction with API-driven adaptive apps

Cisco Data Preparation

Safe Harbor Statement

AtScale Intelligence Platform

6.0, 6.5 and Beyond. The Future of Spotfire. Tobias Lehtipalo Sr. Director of Product Management

HADOOP SOLUTION USING EMC ISILON AND CLOUDERA ENTERPRISE Efficient, Flexible In-Place Hadoop Analytics

Forecast of Big Data Trends. Assoc. Prof. Dr. Thanachart Numnonda Executive Director IMC Institute 3 September 2014

SQLstream 4 Product Brief. CHANGING THE ECONOMICS OF BIG DATA SQLstream 4.0 product brief

SEIZE THE DATA SEIZE THE DATA. 2015

Comprehensive Analytics on the Hortonworks Data Platform

What's New in SAS Data Management

ORACLE DATA INTEGRATOR ENTERPRISE EDITION

Driving Growth in Insurance With a Big Data Architecture

The Enterprise Data Hub and The Modern Information Architecture

Build Your Competitive Edge in Big Data with Cisco. Rick Speyer Senior Global Marketing Manager Big Data Cisco Systems 6/25/2015

Big data: Unlocking strategic dimensions

Predictive Analytics

Platfora Big Data Analytics

Big Data and the Data Lake. February 2015

Confidently Anticipate and Drive Better Business Outcomes

Mike Maxey. Senior Director Product Marketing Greenplum A Division of EMC. Copyright 2011 EMC Corporation. All rights reserved.

Big Data Performance Growth on the Rise

Oracle Big Data Discovery (BDD) Hadoop Visualization

Hadoop Data Hubs and BI. Supporting the migration from siloed reporting and BI to centralized services with Hadoop

Welkom! Copyright 2014 Oracle and/or its affiliates. All rights reserved.

How To Handle Big Data With A Data Scientist

Unifying the Enterprise Data Hub and the Integrated Data Warehouse

IBM Analytics. Just the facts: Four critical concepts for planning the logical data warehouse

SAS ANALYTIC SOLUTIONS RUNNING ON A HADOOP CLUSTER USING YARN JAMES KOCHUBA. Copyright 2015, SAS Institute Inc. All rights reserved.

Introducing Oracle Exalytics In-Memory Machine

CDH AND BUSINESS CONTINUITY:

Information Architecture

The Future of Data Management with Hadoop and the Enterprise Data Hub

9.4 Intelligence. SAS Platform. Overview Second Edition. SAS Documentation

Are You Big Data Ready?

Oracle Big Data Essentials

Native Connectivity to Big Data Sources in MicroStrategy 10. Presented by: Raja Ganapathy

Effective Data Integration - where to begin. Bryte Systems

Best Practices for Building Mobile Web

White Paper. How Streaming Data Analytics Enables Real-Time Decisions

Capitalize on Big Data for Competitive Advantage with Bedrock TM, an integrated Management Platform for Hadoop Data Lakes

IBM Data Warehousing and Analytics Portfolio Summary

Oracle Advanced Analytics 12c & SQLDEV/Oracle Data Miner 4.0 New Features

Big Data solutions to support Intelligent Systems and Applications

How to avoid building a data swamp

Pentaho High-Performance Big Data Reference Configurations using Cisco Unified Computing System

Oracle Big Data Building A Big Data Management System

Agenda. Big Data. Dell Cloud Solutions A Dell Story Summary. Concepts Market Trends and Challenges Dell Solutions

STAR WARS AND THE ART OF DATA SCIENCE

Ganzheitliches Datenmanagement

Hadoop Evolution In Organizations. Mark Vervuurt Cluster Data Science & Analytics

Data Integration Checklist

Tax Fraud in Increasing

IBM Global Business Services Microsoft Dynamics CRM solutions from IBM

Accenture and SAP: Delivering Visual Data Discovery Solutions for Agility and Trust at Scale

Hadoop and Relational Database The Best of Both Worlds for Analytics Greg Battas Hewlett Packard

Data Virtualization Overview

Ten Things You Need to Know About Data Virtualization

SAP Predictive Analytics: An Overview and Roadmap. Charles Gadalla, SESSION CODE: 603

Greenplum Database. Getting Started with Big Data Analytics. Ofir Manor Pre Sales Technical Architect, EMC Greenplum

Name: Srinivasan Govindaraj Title: Big Data Predictive Analytics

High Performance Data Management Use of Standards in Commercial Product Development

Apache Hadoop in the Enterprise. Dr. Amr Awadallah,

Transcription:

Cloudera & Partnership Overview Graham Pymm Cloudera Systems Engineer 1

Strong Executive & Product Level Alignment Management: Formal Alliance forged in January 2013 CTO level commitment from both companies Technical: internal development teams have a Cloudera first policy and all internal work is performed on Cloudera clusters. Dedicated Cloudera resources at Cloudera HQ and HQ working with R&D has dedicated R&D resources optimize solutions for the Cloudera platform Release activities: Joint training courses in plan, provide education on Cloudera and content for analytics on big data Engineering schedule coordination ensure quick uptake of new releases from each side

Strong Go-To-Market Alignment More than 80% of deployments on are running Cloudera Master Reciprocal Services Agreement in place Joint Data Scientist Training course ensure best practices Visual Analytics and Cloudera Enterprise Data Hub Starter Service package offering Cloudera and Confidential

and Cloudera Benefits Seize New Opportunities from All of Your Data. Make more precise decisions by analyzing all structured and unstructured data sets. Drive compelling cusmer engagements improve revenue and service levels. Accelerate Time--Value. Visual Analytics simplify working with data. Cloudera Manager simplify big data system administration. In-memory data and analytics processing for faster performance. Reduce Costs, Risks and Uncertainty. provides scalable and cost-effective big data management. Largest community of trained developers, administrars and data scientists. Joint and Cloudera research and development ensures maximum cusmer value. Utilities usable by cusmers configure and install with Cloudera

Architectural elements of on CDH Hi-performance in-memory analytics on CDH Inmemory agents Inmemory agents CDH Inmemory agents Inmemory agents EP for data processing on EP using MR EP using MR CDH EP using MR EP using MR Data extract from Hive Impala CDH

nagement ffective big data management Dedicated Rack Dedicated Rack Cloudera Cluster Cloudera Cluster nagement commercial & Cloudera have Dedicated Rack Have Integrated Across Cloudera The Cluster f p Entire Analytics imum commercial have a cusmer value. have a ue. imum p have cusmer a value. Lifecycle Embedded process enables users perform data manipulation, variable transformation, ue. er a? Embedded process enables explorary Embedded users analysis, process perform statistical enables data users manipulation, modeling perform machine data variable learning manipulation, transformation, ty er a? techniques, variable integrated transformation, analytics complements modeling terprise y plements - analytics Data Hub complements explorary Embedded analysis, process explorary statistical (EDH) comparison enables modeling users analysis, and scoring perform statistical & machine - all data modeling learning inside manipulation, & techniques, machine the variable learning integrated environment. transformation, techniques, modeling integrated modeling DH) Wouter Kroon <Wouter.Kroon@sas.com> lps erprise elevate Data the Hub conversation (EDH) explorary comparison analysis, and scoring comparison statistical - all inside modeling and scoring the & machine - all inside environment. learning the techniques, n environment. n plements Wouter Kroon <Wouter.Kroon@sas.com> integrated modeling <Wouter.Kroon@sas.com> ersation lps DH) in the elevate organization. the conversation comparison Analytics and scoring - all inside the environment. n in Analytics Wouter Kroon <Wouter.Kroon@sas.com> rsation use the cases organization. for. Analytics Analytics use p. Positioning cases for the. integration products for integrate with Cloudera egration re - Positioning has foothold the integration products for products integrate for with integrate Cloudera with Cloudera p. re d over the entire Analytics Life cycle egration ra s ability has a foothold close or shorten MANAGE products over the for entire over Analytics integrate the entire Life with cycle Analytics Cloudera Life cycle d ra s r shorten ability close or shorten MANAGE MANAGE over the entire Analytics Life cycle r shorten alignment and integration MANAGE integration iar, alignment as no other and integration Manage Data Manage Data Explore Data Explore Data integration rs iar, doop tighter as no integration other with Manage /Access Data Manage EXPLORE /Access Data Visual Analytics Explore Data rs tighter integration with Visual Explore Analytics Data doop with EXPLORE Manage /Access EXPLORE Cloudera /Access Impala In-memory for Data /Access Cloudera Impala Visual Explore Analytics Data Visual Analytics with Data Quality Accelerar /Access DEPLOY Data Quality Accelerar* Cloudera Impala EXPLORE /Access /Access Cloudera Impala Visual Analytics DEPLOY DEPLOY In-database Code Data duct Management, Sales, & /Access Data Quality Accelerar* Cloudera Accelerar Impala In-database Quality Accelerar* Code Accelerar* Sales duct Management,, & Sales, MONITOR & In-database Code Accelerar* DEPLOY Data In-database Quality Accelerar* Code Accelerar* MONITOR MONITOR Develop Models ercial distributions Develop Models In-memory Develop for Models Sales, & In-database Code Accelerar* Develop Models ributions ercial distributions MONITOR In-memory In-memory Deploy & Monir loudera, Visual Analytics, Deploy Monir HPA Products Develop for Models HPA for Products DEVELOP Scoring Accelerar ccelerar loudera, for Visual Cloudera Analytics, Deploy DEVELOP MODELS Scoring & Monir Accelerar Visual Statistics Visual HPA Statistics* Products Analytics, Deploy & Monir ibutions In-memory HPA for Products DEVELOP ccelerar for Cloudera MODELS In-memory for Scoring Accelerar Visual Statistics* era Analytics, MODELS Deploy Scoring & Monir Accelerar Visual HPA Statistics* Products DEVELOP * Available August 2014 * Available August 2014 era age MODELS Scoring Accelerar * Available Visual August Statistics* 2014 age * Available August 2014 professional services (phase I s professional (phase I er markets). services (phase I Cloudera and Confidential s er (phase markets). I adoop have a r value. cs complements ub (EDH) conversation tion. Analytics adoop. he integration othold lose or shorten and integration er ration with ent, Sales, distributions isual Analytics, loudera rvices (phase I Embedded process enables users perform data manipulation, variable transformation, explorary analysis, statistical modeling & machine learning techniques, integrated modeling comparison and scoring - all inside the environment. n Wouter Kroon <Wouter.Kroon@sas.com> DEPLOY & MONITOR MANAGE DEVELOP MODELS EXPLORE products for integrate with Cloudera over the entire Analytics Life cycle Manage Data /Access /Access Cloudera Impala Data Quality Accelerar* In-database Code Accelerar* Deploy & Monir Scoring Accelerar Explore Data Visual Analytics Develop Models In-memory for HPA Products Visual Statistics* * Available August 2014

& Cloudera Product Touch-Points Access for & Access for Impala: Enables data access in Base solutions developed on Base can connect CDH: Enterprise Miner, Data Integration Studio, Text Miner, /Stat, /Graph, Forecast Studio, /ETL and many more Data Management: Data Integration using CDH as source or sink through Data Integration Studio Metadata Server support for CDH High Performance Analytics: On-cluster, high-performance machine learning/statistics for big data Visual Analytics: On-cluster, high-performance business intelligence for big data Data Loader High-performance model scoring and deployment with faster time results Run DS/DS2 code natively inside the cluster using Map Reduce Load existing dataset in the in-memory products