Hortonworks & SAS. Analytics everywhere. Page 1. Hortonworks Inc. 2011 2014. All Rights Reserved



Similar documents
Comprehensive Analytics on the Hortonworks Data Platform

HDP Enabling the Modern Data Architecture

HDP Hadoop From concept to deployment.

SAP and Hortonworks Reference Architecture

Hadoop, the Data Lake, and a New World of Analytics

Hadoop s Advantages for! Machine! Learning and. Predictive! Analytics. Webinar will begin shortly. Presented by Hortonworks & Zementis

The Future of Data Management

Modern Data Architecture for Predictive Analytics

Hortonworks and ODP: Realizing the Future of Big Data, Now Manila, May 13, 2015

End to End Solution to Accelerate Data Warehouse Optimization. Franco Flore Alliance Sales Director - APJ

WHAT S NEW IN SAS 9.4

AGENDA. What is BIG DATA? What is Hadoop? Why Microsoft? The Microsoft BIG DATA story. Our BIG DATA Roadmap. Hadoop PDW

Hadoop & SAS Data Loader for Hadoop

ESS event: Big Data in Official Statistics. Antonino Virgillito, Istat

Apache Hadoop's Role in Your Big Data Architecture

Simplifying Big Data Analytics: Unifying Batch and Stream Processing. John Fanelli,! VP Product! In-Memory Compute Summit! June 30, 2015!!

Harnessing big data with Hortonworks Data Platform and Red Hat JBoss Data Virtualization

Using Tableau Software with Hortonworks Data Platform

Big Data & QlikView. Democratizing Big Data Analytics. David Freriks Principal Solution Architect

Big Data Realities Hadoop in the Enterprise Architecture

Session 0202: Big Data in action with SAP HANA and Hadoop Platforms Prasad Illapani Product Management & Strategy (SAP HANA & Big Data) SAP Labs LLC,

Data Refinery with Big Data Aspects

The Future of Big Data SAS Automotive Roundtable Los Angeles, CA 5 March 2015 Mike Olson Chief Strategy Officer,

Oracle Big Data SQL Technical Update

Managing Big Data with Hadoop & Vertica. A look at integration between the Cloudera distribution for Hadoop and the Vertica Analytic Database

Transforming the Telecoms Business using Big Data and Analytics

BIG DATA: FROM HYPE TO REALITY. Leandro Ruiz Presales Partner for C&LA Teradata

Oracle Database 12c Plug In. Switch On. Get SMART.

Il mondo dei DB Cambia : Tecnologie e opportunita`

The Future of Data Management with Hadoop and the Enterprise Data Hub

Datenverwaltung im Wandel - Building an Enterprise Data Hub with

Upcoming Announcements

How to use Big Data in Industry 4.0 implementations. LAURI ILISON, PhD Head of Big Data and Machine Learning

Evolution from Big Data to Smart Data

SOLVING REAL AND BIG (DATA) PROBLEMS USING HADOOP. Eva Andreasson Cloudera

A Tour of the Zoo the Hadoop Ecosystem Prafulla Wani

Luncheon Webinar Series May 13, 2013

Big Data: What You Should Know. Mark Child Research Manager - Software IDC CEMA

Manifest for Big Data Pig, Hive & Jaql

Aligning Your Strategic Initiatives with a Realistic Big Data Analytics Roadmap

An Oracle White Paper November Leveraging Massively Parallel Processing in an Oracle Environment for Big Data Analytics

Financial, Telco, Retail, & Manufacturing: Hadoop Business Services for Industries

A Modern Data Architecture with Apache Hadoop

Big Data Analytics. with EMC Greenplum and Hadoop. Big Data Analytics. Ofir Manor Pre Sales Technical Architect EMC Greenplum

Beyond Web Application Log Analysis using Apache TM Hadoop. A Whitepaper by Orzota, Inc.

Testing Big data is one of the biggest

BIG DATA TRENDS AND TECHNOLOGIES

Cost-Effective Business Intelligence with Red Hat and Open Source

Ganzheitliches Datenmanagement

The Potential of Big Data in the Cloud. Juan Madera Technology Consultant

White Paper. How Streaming Data Analytics Enables Real-Time Decisions

Bringing the Power of SAS to Hadoop. White Paper

Information Builders Mission & Value Proposition

Oracle Big Data Strategy Simplified Infrastrcuture

Next Gen Hadoop Gather around the campfire and I will tell you a good YARN

Integrating a Big Data Platform into Government:

In-Memory Analytics for Big Data

Apache Hadoop Patterns of Use

Mike Maxey. Senior Director Product Marketing Greenplum A Division of EMC. Copyright 2011 EMC Corporation. All rights reserved.

Modern Data Architecture for Retail with Apache Hadoop on Windows

NextGen Infrastructure for Big DATA Analytics.

How Companies are! Using Spark

Talend Real-Time Big Data Sandbox. Big Data Insights Cookbook

ANALYTICS IN BIG DATA ERA

Trafodion Operational SQL-on-Hadoop

KNIME & Avira, or how I ve learned to love Big Data

BIG DATA TECHNOLOGY. Hadoop Ecosystem

Tapping Into Hadoop and NoSQL Data Sources with MicroStrategy. Presented by: Jeffrey Zhang and Trishla Maru

How To Handle Big Data With A Data Scientist

The Big Data Paradigm Shift. Insight Through Automation

BIG DATA ANALYTICS REFERENCE ARCHITECTURES AND CASE STUDIES

Building your Big Data Architecture on Amazon Web Services

Architecting for Big Data Analytics and Beyond: A New Framework for Business Intelligence and Data Warehousing

Are You Ready for Big Data?

Constructing a Data Lake: Hadoop and Oracle Database United!

Big + Fast + Safe + Simple = Lowest Technical Risk

Dominik Wagenknecht Accenture

Modernizing Your Data Warehouse for Hadoop

How to make BIG DATA work for you. Faster results with Microsoft SQL Server PDW

Big Data Analytics Nokia

SQream Technologies Ltd - Confiden7al

Big Data Introduction

The 3 questions to ask yourself about BIG DATA

Navigating Big Data business analytics

So What s the Big Deal?

Data processing goes big

Big Data Technologies Compared June 2014

Large scale processing using Hadoop. Ján Vaňo

The Enterprise Data Hub and The Modern Information Architecture

The 4 Pillars of Technosoft s Big Data Practice

SQL Server 2012 PDW. Ryan Simpson Technical Solution Professional PDW Microsoft. Microsoft SQL Server 2012 Parallel Data Warehouse

Tap into Hadoop and Other No SQL Sources

Parallel Data Warehouse

Affordable, Scalable, Reliable OLTP in a Cloud and Big Data World: IBM DB2 purescale

Using RDBMS, NoSQL or Hadoop?

Transcription:

Hortonworks & SAS Analytics everywhere. Page 1

A change in focus. A shift in Advertising From mass branding A shift in Financial Services From Educated Investing A shift in Healthcare From mass treatment to 1x1 Targeting to Automated Algorithms to Designer Medicine allow organizations to shift interactions from Reactive Post Transaction A shift in Retail From static branding A shift in Telco From break then fix to Real-time Personalization to repair before break Proactive Pre Decision Page 2

We estimate that within 3 years 50% of the worlds data will reside on Hadoop.

Data is doubling in size every 2 3 years. Traditional or not? APPLICATIONS DATA SYSTEM Business Analy4cs RDBMS EDW MPP REPOSITORIES Custom Applica4ons Packaged Applica4ons 2.8 ZB in 2012 85% from New Data Types 15x Machine Data by 2020 40 ZB by 2020 Source: IDC SOURCES Exis4ng Sources (CRM, ERP, Clickstream, Logs)

OLTP, ERP, CRM Systems Unstructured documents, emails Hadoop stores and processes the data your customers currently do not or cannot. Server logs Sen>ment, Web Data Sensor. Machine Data 1: Cost profile. 2: Data Structure. Clickstream Geoloca>on

Hadoop enables scalable compute & storage with a compelling cost profile. Cloud Storage HADOOP NAS Engineered System MPP Fully-loaded Cost Per Raw TB of Data (Min Max Cost) SAN $0 $20,000 $40,000 $60,000 $80,000 $180,000

Hadoop enables scalable compute & storage for all data structures. Current Reality Apply schema on write Dependent on IT Repeatable Process: SQL Determine list of ques4ons Design solu4ons Collect structured data Ask ques4ons from list Detect addi4onal ques4ons Augment w/ Hadoop Apply schema on read Support range of access patterns to data stored in HDFS: polymorphic access Right Engine, Right Job Batch Interactive Real-time Inmemory HADOOP Iterate over structure Transform and Analyze

The Net Result: A modern data architecture capable of storing, processing, correlating, analysing, matching, aggregating, searching and exposing..all data & insights.

.when integrated with the right tools capable of delivering the right results Page 9

The Modern Data Architecture is a Plus +1. APPLICATIONS Base SAS Enterprise Miner OLTP, ERP, CRM Systems Unstructured documents, emails Server logs DATA SYSTEM RDBMS EDW MPP REPOSITORIES Governance & Integration Data Access Data Management Security Operations Sen>ment, Web Data Sensor. Machine Data SOURCES OLTP, ERP, CRM Documents, Emails Web Logs, Click Streams Social Networks Machine Generated Sensor Data Geo- loca>on Data Clickstream Geoloca>on Page 10

From... With... SAS accesses and extracts data from Hadoop to a SAS server for processing, and writes results back. SAS accesses and processes Hadoop data on SAS Servers while keeping the data and computations massively parallel. and In Hadoop Page 11 SAS processes data directly in the Hadoop cluster.

SAS + from Hadoop Data Management Base SAS Enterprise Miner SAS/ACCESS to Hadoop ANY?! ANY?! ANY?! ANY?! disk Page 12

Access to Hadoop Uses Existing SAS Interfaces Standard Libname syntax PROC HADOOP Datastep and Proc SQL translated to Hive Filename support Execute Pig Scripts and MapReduce Push-down of certain procedures Custom SerDe Page 13

SAS + with Hadoop SAS Rack architecture SAS Rack Enterprise Hadoop Page 14

SAS + in Hadoop in-memory analytics (and BI) Visual Analytics Visual Statistics In-memory Statistics for Hadoop Root Node MPI LASR LASR LASR LASR memory disk SASHDAT SASHDAT SASHDAT SASHDAT Page 15

Turns Big Data Into Real- time Customer Insights Telcos Rogers Media is a subsidiary of Rogers Communications, which owns Canada's largest publishing company. Has more than 70 consumer and business publications. Rogers Media Inc. also owns 54 radio stations, and several television properties including terrestrial television stations and cable television channels. Challenge: Unable to analyze huge amounts of data to optimize and improve real-time customer insights Understand audience: Having the largest volume of data sets, audience segments/profile in Canada while leading the Canadian marketplace in privacy and governance. Find Audience: Being leaders in identifying and targeting audiences across channels, platforms and devices. Engage Audience: Driving engagement across platforms and formats. Measure Audience: Exceeding client expectations with transparent reporting and accurate attribution models. Solution Rogers Media Audience Platform: Integration of all data collected across organizations Query all data in one location: Blend of online and offline data, subscription, ecommerce, loyalty programs, etc. Land massive click stream log files: 100+ M records / day 30 million unique IDs / month Use 100% of the data for Analysis and Visualization instead of smaller random samples (over sampling) Page 16

Resources Customer Video: Rogers Media discusses SAS and Hadoop Demos: SAS Visual Analytics, Ingest SAS to Hive Webinars: SAS and the Modern Data Architecture SAS and Hortonworks use cases www.hortonworks.com/sas Page 17

Thank you. Questions Page 18