The Future of Big Data SAS Automotive Roundtable Los Angeles, CA 5 March 2015 Mike Olson Chief Strategy Officer, Cofounder @mikeolson



Transcription:


A New Platform for Pervasive Analytics
Multiple big data opportunities in one optimized, high-performance, multi-tenant platform.
Process: Ingest (Sqoop, Flume); Transform (MapReduce, Hive, Pig, Spark)
Discover: Analytic Database (Impala); Search (Solr)
Model: Machine Learning (SAS)
Serve: NoSQL Database (HBase); Streaming (Spark Streaming)
Security and Administration: YARN, Cloudera Manager, Cloudera Navigator
Unlimited Storage: HDFS, HBase
Batch, Interactive, and Real-Time. Leading performance and usability in one platform.
End-to-end analytic workflows
Access more data
Work with data in new ways
Enable new users

SAS and Cloudera: The Power to Know at Scale
SAS products: SAS High Performance Analytics Server, SAS Visual Analytics, SAS/Access
Know your customer: Offer optimization, Payment risk, Customer link analytics

The Pervasive Analytics Journey

Customer success across industries: Financial Services, Telecom, Healthcare & Life Sciences, Media & Technology, Retail & CP, Public Sector


Ask Bigger Questions: How can we anticipate maintenance associated with specific vehicles?
American multinational automaker captures every touchpoint to provide a seamless customer experience.

Automaker Streamlines the Customer Experience
American multinational automaker improves customer loyalty through proactive care.
The Challenge:
  Each vehicle comprises thousands or millions of components, many streaming machine data
  Want to build loyalty by minimizing maintenance issues
The Solution:
  Cloudera correlates manufacturing data with service and customer interaction data
  Predictive analytics & machine learning enable dynamic customer profiles & personalization

Ask Bigger Questions: How can we improve our support team's productivity?
NetApp AutoSupport processes 600,000+ phone-home transactions weekly to offer proactive customer support.

NetApp Delivers Proactive Support
NetApp AutoSupport meets stringent SLAs with 64X faster processing.
The Challenge:
  40% of phone-home data is transmitted within 18 hours each weekend, creating bottlenecks that affect SLAs
  Data storage footprint doubles every 16 months
  Queries take weeks; some don't run
The Solution:
  NetApp Open Solution for Hadoop
  Processes machine-generated data from 600K+ weekly transactions
  Supports 7TB/month data volume growth

Ask Bigger Questions: Which semiconductor chips will fail?
A semiconductor manufacturer uses predictive analytics to take preventative action on chips likely to fail.

Cloudera Enables Better Predictions
Semiconductor manufacturer can prevent chip failure with more accurate predictive yield models.
The Challenge:
  Want to capture more granular and historical data for more accurate predictive yield modeling
  Storing 9 months of data in a traditional RDBMS is expensive
The Solution:
  Dell Cloudera solution for Apache Hadoop
  53 nodes; plan to store up to 10 years (~10PB)
  Capturing & processing data from each phase of the manufacturing process

Ask Bigger Questions: Where will the next cyber attack attempt occur?
Multinational ecommerce firm prevents cyber attacks with real-time anomaly detection of log data from hundreds of sources.

Multinational ecommerce firm
Multinational ecommerce firm prevents cyber attacks with real-time anomaly detection.
The Challenge:
  Ingesting logs in many formats from 40,000 machines, hundreds of sources
  Need to find signals in the data for global threat management
  Symantec SIEM: costly, poor performance, doesn't scale
The Solution:
  Cloudera Enterprise: real-time log streaming, correlation, & analysis
  Splunk: data ingest & daily operational search

Thank you! Mike Olson @mikeolson mike.olson@cloudera.com