Modernizing Your Data Warehouse for Hadoop



Similar documents
Please give me your feedback

Bringing Big Data to People

Big Data Processing: Past, Present and Future

HDP Hadoop From concept to deployment.

How to make BIG DATA work for you. Faster results with Microsoft SQL Server PDW

HDP Enabling the Modern Data Architecture

Microsoft Analytics Platform System. Solution Brief

Modern Data Warehousing

SQL Server 2012 PDW. Ryan Simpson Technical Solution Professional PDW Microsoft. Microsoft SQL Server 2012 Parallel Data Warehouse

Hortonworks and ODP: Realizing the Future of Big Data, Now Manila, May 13, 2015

Business Analytics In a Big Data World Ted Malone Solutions Architect Data Platform and Cloud Microsoft Federal

Microsoft technológie pre BigData. Ľubomír Goryl Solution Professional

A Modern Data Architecture with Apache Hadoop

Session 0202: Big Data in action with SAP HANA and Hadoop Platforms Prasad Illapani Product Management & Strategy (SAP HANA & Big Data) SAP Labs LLC,

The Microsoft Modern Data Warehouse

Upcoming Announcements

Microsoft Big Data. Solution Brief

BIG DATA TRENDS AND TECHNOLOGIES

The Inside Scoop on Hadoop

Azure Data Lake Analytics

Parallel Data Warehouse

SQL Server 2012 Parallel Data Warehouse. Solution Brief

SELLING PROJECTS ON THE MICROSOFT BUSINESS ANALYTICS PLATFORM

Collaborative Big Data Analytics. Copyright 2012 EMC Corporation. All rights reserved.

Comprehensive Analytics on the Hortonworks Data Platform

The Future of Data Management

Infomatics. Big-Data and Hadoop Developer Training with Oracle WDP

TAMING THE BIG CHALLENGE OF BIG DATA MICROSOFT HADOOP

The Digital Enterprise Demands a Modern Integration Approach. Nada daveiga, Sr. Dir. of Technical Sales Tony LaVasseur, Territory Leader

Data Security in Hadoop

Agenda. Modern Data Warehouse Big Data Application examples. Analytic Platform Systems. Integration of Hadoop and APS. Architecture Hadoop

Big Data Analytics. with EMC Greenplum and Hadoop. Big Data Analytics. Ofir Manor Pre Sales Technical Architect EMC Greenplum

Extending the Enterprise Data Warehouse with Hadoop Robert Lancaster. Nov 7, 2012

Investor Presentation. Second Quarter 2015

Big Data, Why All the Buzz? (Abridged) Anita Luthra, February 20, 2014

Forecast of Big Data Trends. Assoc. Prof. Dr. Thanachart Numnonda Executive Director IMC Institute 3 September 2014

Big Data Technologies Compared June 2014

GAIN BETTER INSIGHT FROM BIG DATA USING JBOSS DATA VIRTUALIZATION

AGENDA. What is BIG DATA? What is Hadoop? Why Microsoft? The Microsoft BIG DATA story. Our BIG DATA Roadmap. Hadoop PDW

The Future of Data Management with Hadoop and the Enterprise Data Hub

Hadoop in the Hybrid Cloud

Hadoop Introduction. Olivier Renault Solution Engineer - Hortonworks

Big Data Realities Hadoop in the Enterprise Architecture

Big Data on Microsoft Platform

The Role Polybase in the MDW. Brian Mitchell Microsoft Big Data Center of Expertise

BITKOM& NIK - Big Data Wo liegen die Chancen für den Mittelstand?

CREATING PACKAGED IP FOR BUSINESS ANALYTICS PROJECTS

Big Data and Industrial Internet

Are You Ready for Big Data?

End to End Solution to Accelerate Data Warehouse Optimization. Franco Flore Alliance Sales Director - APJ

Enabling Manufacturing Transformation in a Connected World. John Shewchuk Technical Fellow DX

Big Data: Making Sense of it all!

Designing Self-Service Business Intelligence and Big Data Solutions

Chukwa, Hadoop subproject, 37, 131 Cloud enabled big data, 4 Codd s 12 rules, 1 Column-oriented databases, 18, 52 Compression pattern, 83 84

Hadoop Beyond Hype: Complex Adaptive Systems Conference Nov 16, Viswa Sharma Solutions Architect Tata Consultancy Services

Hortonworks Data Platform for Hadoop and SAP HANA

Workshop on Hadoop with Big Data

Talend Big Data. Delivering instant value from all your data. Talend

Are You Ready for Big Data?

How To Extend An Enterprise Bio Solution

Deeper Insights across Data

Microsoft SQL Server 2012 with Hadoop

SOLVING REAL AND BIG (DATA) PROBLEMS USING HADOOP. Eva Andreasson Cloudera

Hadoop, the Data Lake, and a New World of Analytics

Course 20467: Designing Self-Service Business Intelligence and Big Data Solutions

Big Data Management and Security

Big Analytics in the Cloud. Matt Winkler PM, Big

HADOOP. Revised 10/19/2015

Big Data & QlikView. Democratizing Big Data Analytics. David Freriks Principal Solution Architect

Introduction to Big data. Why Big data? Case Studies. Introduction to Hadoop. Understanding Features of Hadoop. Hadoop Architecture.

Large scale processing using Hadoop. Ján Vaňo

Quickly Deploy Microsoft Private Cloud and SQL Server 2012 Data Warehouse on Hitachi Converged Solutions. September 25, 2013

SQLSaturday #399 Sacramento 25 July, Big Data Analytics with Excel

An Integrated Big Data & Analytics Infrastructure June 14, 2012 Robert Stackowiak, VP Oracle ESG Data Systems Architecture

How to Hadoop Without the Worry: Protecting Big Data at Scale

The Evolving Apache Hadoop Eco-System

Data Lake In Action: Real-time, Closed Looped Analytics On Hadoop

Intel HPC Distribution for Apache Hadoop* Software including Intel Enterprise Edition for Lustre* Software. SC13, November, 2013

Big Data and Advanced Analytics Applications and Capabilities Steven Hagan, Vice President, Server Technologies

Modern Data Architecture for Predictive Analytics

CIO Guide How to Use Hadoop with Your SAP Software Landscape

BIG DATA What it is and how to use?

Constructing a Data Lake: Hadoop and Oracle Database United!

Microsoft Big Data Solutions. Anar Taghiyev P-TSP

Integrating Hadoop. Into Business Intelligence & Data Warehousing. Philip Russom TDWI Research Director for Data Management, April

Polybase for SQL Server 2016

Information Builders Mission & Value Proposition

Structured data meets unstructured data in Azure and Hadoop

Big Data Big Data/Data Analytics & Software Development

#mstrworld. Tapping into Hadoop and NoSQL Data Sources in MicroStrategy. Presented by: Trishla Maru. #mstrworld

Big Data Too Big To Ignore

Oracle Database 12c Plug In. Switch On. Get SMART.

Hadoop implementation of MapReduce computational model. Ján Vaňo

SQL Server What s New? Christopher Speer. Technology Solution Specialist (SQL Server, BizTalk Server, Power BI, Azure) v-cspeer@microsoft.

Transcription:

Modernizing Your Data Warehouse for Hadoop Big data. Small data. All data. Audie Wright, DW & Big Data Specialist Audie.Wright@Microsoft.com O 425-538-0044, C 303-324-2860

Unlock Insights on Any Data Taking an End-toEnd Approach to BI and Analytics Modernizing Your Data Warehouse for Hadoop

The traditional data warehouse data warehousing has reached the most significant tipping point since its inception. The biggest, possibly most elaborate data management system in IT is changing. Gartner, The State of Data Warehousing in 2012

The traditional data warehouse 2 Real time data 1 Increasing 1 data Increasing data 3 New data sources volumes volumes and types 4 Cloud-born data

The modern data warehouse

Microsoft s modern data warehouse SQL Server 2014 Analytics Platform System Microsoft Azure HDInsight Data Platform

Scale out relational data to petabytes From terabytes to multi-petabytes Scale out technologies in Analytics Platform System APS / HDInsight APS / HDInsight APS / HDInsight APS / HDInsight APS / HDInsight APS / HDInsight APS 0TB 6PB

Scale Out non-relational data Scale out big data Scale out non-relational data in HDInsight (for Microsoft Azure or APS)

In-memory performance In-memory Columnstore for next-generation performance Columnstore index representation

Concurrency and mixed workloads Great performance for mixed workloads Query Results

Near real-time insights Real-time with complex event processing Event Sources Event Targets

What is big data? Petabytes Data complexity: variety and velocity

What is Hadoop? Distributed, scalable system on commodity HW Operational services Data services AMBARI OOZIE FALCON FLUME SQOOP HBASE PIG HIVE & HCATALOG Core Services LOAD & EXTRACT NFS WebHDFS MAP REDUCE YARN HDFS Hadoop Cluster Hadoop clusters provide scale-out storage and distributed data processing on commodity hardware compute & storage.......... compute & storage

IT infrastructure optimization Legal discovery Social network analysis Traffic flow optimization Web app optimization Churn analysis Natural resource exploration Weather forecasting Healthcare outcomes Fraud detection Life sciences research Advertising analysis Equipment monitoring Smart meter monitoring

Hadoop offerings on-premise and cloud Real-time with complex event processing Microsoft Azure

Integrate relational data and Hadoop Integrated query with PolyBase in SQL APS Select Result set Microsoft Azure HDInsight Analytics Platform System PolyBase Hortonworks (Windows, Linux), Cloudera Microsoft HDInsight

Microsoft s modern data warehouse SQL Server 2014 Analytics Platform System Microsoft Azure HDInsight Data Platform

Freedom of deployment options and hybrid solutions

Appliance vs. Reference Architecture Buying an a appliance Reference Architecture Order SKU from a list of configuration options Factory Order builds hardware & tests from a BOM Hardware vendor installs & connects Customer builds & configures Microsoft validates function & performance Installs software, drivers, firmware, Hands over the etc. keys to the customer Microsoft is the single point of contact for Customer support manages multiple support channels

! Sign up for a free architectural design session for APS with your Microsoft rep! Visit Analytics Platform System at http://www.microsoft.com/aps! Try HDInsight at http://www.windowsazure.com/bigdata! Try SQL Server for data warehousing in Microsoft Azure VMs at http://www.windowsazure.com! Try Hortonworks Data Platform for Windows at http://www.hortonworks.com/ products/hdp-windows/! Try SQL Server 2014 at http://www.microsoft.com/sql/ sql-server-2014.aspx

alias@microsoft.com

Growth Topology PDW Region Only Scale Unit Base Unit Base UnitExtension

Growth Topologies Hadoop Region Extend Min

About Analytics Platform System SQL Server Parallel Data Warehouse PolyBase Microsoft HDInsight

About Hortonworks Data Platform For Windows

About Microsoft Azure HDInsight Microsoft Azure

Microsoft Contributions to Hadoop 6,000+ Engineering hours Hive (Improve performance 40x with Stinger) Contributed FileSystem implementation for Microsoft Azure Storage HDFS permissions model mapped to Windows HDP 2.0 25,000+ Code line contributions Windows, a first class OS for Hadoop REEF for creation and execution of machine learning jobs 9

Hortonworks and Microsoft Engineering alignment Corporate alignment Field Alignment

ü Data sources Non-Relational Data

Microsoft Azure ü

44

PDW Customers

HDInsight Customers

.