Comprehensive Analytics on the Hortonworks Data Platform

Transcription:

Comprehensive Analytics on the Hortonworks Data Platform. We do Hadoop. Page 1

Back to 2005 Page 3

Vertical Scaling Pages 4-6

Horizontal Scaling Pages 7-9

Self Healing System Page 10

Hadoop 1.0: MapReduce, HDFS (Hadoop Distributed File System), 1 ... N Page 11
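As a rough illustration of the MapReduce model named on the Hadoop 1.0 slide, the sketch below runs the map, shuffle, and reduce phases of a word count inside a single Python process. It does not use Hadoop or HDFS; the function names and the sample input splits are hypothetical and stand in for data blocks spread across nodes 1 to N.

```python
from collections import defaultdict
from typing import Dict, Iterable, Iterator, List, Tuple


def map_phase(line: str) -> Iterator[Tuple[str, int]]:
    """Emit a (word, 1) pair for every word in one input line."""
    for word in line.lower().split():
        yield word, 1


def shuffle(pairs: Iterable[Tuple[str, int]]) -> Dict[str, List[int]]:
    """Group all emitted values by key, as the framework does between phases."""
    groups: Dict[str, List[int]] = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key: str, values: List[int]) -> Tuple[str, int]:
    """Collapse all values for one key into a single count."""
    return key, sum(values)


def word_count(splits: List[List[str]]) -> Dict[str, int]:
    """Run map over every split, then shuffle, then reduce per key."""
    mapped = [pair for split in splits for line in split for pair in map_phase(line)]
    grouped = shuffle(mapped)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


if __name__ == "__main__":
    # Each inner list is a hypothetical input split that would live on a different node.
    print(word_count([["we do hadoop"], ["hadoop hadoop"]]))
```

In a real cluster each split would be mapped on the node that stores it, and only the shuffled intermediate pairs would cross the network.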

Hadoop 2.0 (architecture diagram)
SOURCES: Existing Systems (ERP, CRM, SCM), Clickstream, Web & Social, Geolocation, Sensor & Machine, Server Logs, Unstructured
Hadoop 2.0: Batch, Interactive, Real-Time, Partner ISV on YARN: Data Operating System and HDFS (Hadoop Distributed File System)
Batch links to MPP and EDW
ANALYTICS: Data Marts, Applications, Business Analytics, Visualization & Dashboards
Page 14

Hortonworks Data Platform 2.2
GOVERNANCE: Apache Falcon, Apache Sqoop, Apache Flume, Apache Kafka
BATCH, INTERACTIVE & REAL-TIME DATA ACCESS: Apache Pig, Apache Hive, Cascading, Apache HBase, Apache Accumulo, Apache Solr, Apache Spark, Apache Storm
SECURITY: Apache Ranger, Apache Knox, Apache Falcon
OPERATIONS: Apache Ambari, Apache ZooKeeper, Apache Oozie
YARN: Data Operating System (Cluster Resource Management)
HDFS (Hadoop Distributed File System)
Page 15

Hortonworks: Hadoop for the Enterprise. We Do Hadoop. Winter 2015, Version 1.1 Page 16

Who we are
- 2005: Apache Hadoop at Yahoo!
- 2011: Inception of Hortonworks
- 24 Developers and Architects
- 600+ Employees
- 900+ Partners
- 300+ Customers
- 100% Renewal Rate
- 30+ Migrations
- 5 out of 5 Support Score*
- 32,000 Nodes at Yahoo!
* The Forrester Wave: Big Data Hadoop Solutions, Q1 2014
Page 17

Why SAS? ANALYTICS IN-MEMORY HIGH-PERFORMANCE DATA MANAGEMENT BUSINESS INTELLIGENCE DATA VISUALIZATION Page 18

SAS is the only vendor that supports all of these methods:
- SAS can treat Hadoop just as any other data source, pulling data from Hadoop when it is most convenient.
- SAS can work with Hadoop, lifting data into a purpose-built advanced analytics in-memory environment.
- SAS can work directly in Hadoop, leveraging the distributed processing capabilities of Hadoop.
Page 19

SAS + from Hortonworks: DATA MOVEMENT
SAS accesses and extracts data from Hadoop to a SAS server for processing, and writes results back.
- Bridge to traditional SAS environments
- Hadoop treated as just another data source
- Performance limited to single pipe bandwidth
Page 20

SAS + with Hortonworks: DATA LIFT INTO MEMORY
SAS accesses and processes Hadoop data on SAS Servers while keeping the data and computations massively parallel.
- Supports advanced analytics via shared computing
- Allows the scaling of data storage and analytics separately
- Ideal when analytical rigor, sophistication and governance are required
Page 21

SAS + in Hadoop: SAS LOGIC
SAS processes data directly in the Hadoop cluster.
- SAS Embedded Process enables scalable SAS compute in Hadoop
- SAS compute is orchestrated via Hadoop technology (YARN)
- Data manipulation, data quality, and scoring support
- Ideal when all data is landing in Hadoop, and Hadoop is the proper place for processing
Page 22

About Rogers Media
- Great Brands
- Media advertising revenue a priority
- Audience Strategy the future
Figure: 2013 consolidated revenue by segment (%)
Page 24

AUDIENCE BUSINESS CHALLENGES
1. UNDERSTAND AUDIENCE: Having the largest volume of data sets and audience segments/profiles in Canada while leading the Canadian marketplace in privacy and governance
2. FIND AUDIENCE: Being leaders in identifying and targeting audiences across channels, platforms and devices
3. ENGAGE AUDIENCE: Driving engagement across platforms and formats
4. MEASURE AUDIENCE: Exceeding client expectations with transparent reporting and the most accurate attribution models

AUDIENCE PLATFORM: THE DATA LAKE
- Land massive click stream log files:
  - 100+ M records / day
  - 30 million unique IDs / month
- Cost effective / competitive
- Lean methodology
- Landed data always available if requirements should change
- Data definition on read
- Adoption of the Data Lake framework
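The "data definition on read" point can be sketched as follows. This is a minimal illustration, not the Rogers Media implementation: the log layout, field names, and both schemas are hypothetical. Raw click stream lines are kept exactly as landed, and a schema is applied only when the data is read, so a changed requirement needs a new schema rather than a reload.

```python
import csv
import io
from typing import Callable, Dict, Iterator, List

# Hypothetical raw click stream lines, stored untouched (tab-separated).
RAW_LOG = (
    "u1\t2015-01-05T10:00:00\t/home\n"
    "u2\t2015-01-05T10:00:03\t/sports\n"
    "u1\t2015-01-05T10:01:10\t/news\n"
)

# A schema maps an output column name to a function over the raw fields.
Schema = Dict[str, Callable[[List[str]], object]]

# First requirement: who visited which page.
SCHEMA_V1: Schema = {
    "user_id": lambda fields: fields[0],
    "path": lambda fields: fields[2],
}

# Later requirement: also report the hour of day, derived from the same raw data.
SCHEMA_V2: Schema = {
    "user_id": lambda fields: fields[0],
    "hour": lambda fields: int(fields[1][11:13]),
    "path": lambda fields: fields[2],
}


def read_with_schema(raw: str, schema: Schema) -> Iterator[Dict[str, object]]:
    """Parse the raw text and shape each record with the schema chosen at read time."""
    for fields in csv.reader(io.StringIO(raw), delimiter="\t"):
        yield {name: extract(fields) for name, extract in schema.items()}


if __name__ == "__main__":
    for row in read_with_schema(RAW_LOG, SCHEMA_V2):
        print(row)
```

Because the landed file never changes, both schemas can be used side by side over the same data.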

Summary: more data & better algorithms Page 27

Hortonworks Jumpstart Package: proposal for a simple production-ready Hadoop cluster in one week Page 28

Hadoop is a Platform Decision
- Adoption follows a consistent journey: data architecture efficiencies, new analytic apps, and ultimately a data lake.
- HDP subscription supports the entire lifecycle: world-class experience to ensure success from architecture to production to expansion.
- HDP: a completely open data platform. Platforms are ultimately defined by open communities.
- HDP: a centralized architecture built on YARN. Any application, any data, anywhere.
Page 29

Cautionary Statement Regarding Forward-Looking Statements
This presentation contains forward-looking statements involving risks and uncertainties. Such forward-looking statements in this presentation generally relate to future events, our ability to increase the number of support subscription customers, the growth in usage of the Hadoop framework, our ability to innovate and develop the various open source projects that will enhance the capabilities of the Hortonworks Data Platform, anticipated customer benefits and general business outlook. In some cases, you can identify forward-looking statements because they contain words such as "may," "will," "should," "expects," "plans," "anticipates," "could," "intends," "target," "projects," "contemplates," "believes," "estimates," "predicts," "potential" or "continue" or similar terms or expressions that concern our expectations, strategy, plans or intentions.
You should not rely upon forward-looking statements as predictions of future events. We have based the forward-looking statements contained in this presentation primarily on our current expectations and projections about future events and trends that we believe may affect our business, financial condition and prospects. We cannot assure you that the results, events and circumstances reflected in the forward-looking statements will be achieved or occur, and actual results, events, or circumstances could differ materially from those described in the forward-looking statements. The forward-looking statements made in this prospectus relate only to events as of the date on which the statements are made and we undertake no obligation to update any of the information in this presentation.
Page 30
Trademarks
Hortonworks is a trademark of Hortonworks, Inc. in the United States and other