Big Data Processing: Past, Present and Future



Similar documents
Modernizing Your Data Warehouse for Hadoop

Microsoft Analytics Platform System. Solution Brief

Please give me your feedback

The Inside Scoop on Hadoop

Bringing Big Data to People

Modern Data Warehousing

Microsoft technológie pre BigData. Ľubomír Goryl Solution Professional

SQL Server 2012 Parallel Data Warehouse. Solution Brief

How to make BIG DATA work for you. Faster results with Microsoft SQL Server PDW

TAMING THE BIG CHALLENGE OF BIG DATA MICROSOFT HADOOP

Big Data Technologies Compared June 2014

Parallel Data Warehouse

The Role Polybase in the MDW. Brian Mitchell Microsoft Big Data Center of Expertise

SELLING PROJECTS ON THE MICROSOFT BUSINESS ANALYTICS PLATFORM

The Microsoft Modern Data Warehouse

SQL Server 2012 PDW. Ryan Simpson Technical Solution Professional PDW Microsoft. Microsoft SQL Server 2012 Parallel Data Warehouse

James Serra Sr BI Architect

AGENDA. What is BIG DATA? What is Hadoop? Why Microsoft? The Microsoft BIG DATA story. Our BIG DATA Roadmap. Hadoop PDW

CREATING PACKAGED IP FOR BUSINESS ANALYTICS PROJECTS

Course 10977A: Updating Your SQL Server Skills to Microsoft SQL Server 2014

BIG DATA TRENDS AND TECHNOLOGIES

Information Architecture

MOC 20467B: Designing Business Intelligence Solutions with Microsoft SQL Server 2012

SQL Server PDW. Artur Vieira Premier Field Engineer

Register on projectbotticelli.com. Introduction to BI & Big Data DAX MDX Data Mining

INTRODUCTION TO CASSANDRA

Business Analytics In a Big Data World Ted Malone Solutions Architect Data Platform and Cloud Microsoft Federal

Architecting for Big Data Analytics and Beyond: A New Framework for Business Intelligence and Data Warehousing

Updating Your Skills to SQL Server 2016

Structured data meets unstructured data in Azure and Hadoop

Microsoft Big Data. Solution Brief

WINDOWS AZURE DATA MANAGEMENT

Azure Data Lake Analytics

Implementing a SQL Data Warehouse 2016

Updating Your SQL Server Skills to Microsoft SQL Server 2014

SQL 2016 and SQL Azure

SQL Server Point of View. Overview on Key Enhancements and Updates

Building a BI Solution in the Cloud

Copyright 2014, Neudesic. All rights reserved.

The Future of Data Management

Big Data and Industrial Internet

From Spark to Ignition:

Il mondo dei DB Cambia : Tecnologie e opportunita`

Harnessing the Power of the Microsoft Cloud for Deep Data Analytics

Forecast of Big Data Trends. Assoc. Prof. Dr. Thanachart Numnonda Executive Director IMC Institute 3 September 2014

Updating Your SQL Server Skills to Microsoft SQL Server 2014

Laurence Liew General Manager, APAC. Economics Is Driving Big Data Analytics to the Cloud

Big Data Analytics. with EMC Greenplum and Hadoop. Big Data Analytics. Ofir Manor Pre Sales Technical Architect EMC Greenplum

Datenverwaltung im Wandel - Building an Enterprise Data Hub with

Innovative technology for big data analytics

Agenda. Modern Data Warehouse Big Data Application examples. Analytic Platform Systems. Integration of Hadoop and APS. Architecture Hadoop

Copyright 2012, Oracle and/or its affiliates. All rights reserved.

Big Data Use Case. How Rackspace is using Private Cloud for Big Data. Bryan Thompson. May 8th, 2013

Performance Tuning and Optimizing SQL Databases 2016

Course Outline. Upgrading Your Skills to SQL Server 2016 Course 10986A: 5 days Instructor Led

How To Scale Out Of A Nosql Database

A Breakthrough Platform for Next-Generation Data Warehousing and Big Data Solutions

BIG DATA-AS-A-SERVICE

HYPER-CONVERGED INFRASTRUCTURE STRATEGIES

Deploy. Friction-free self-service BI solutions for everyone Scalable analytics on a modern architecture

10977B: Updating Your SQL Server Skills to Microsoft SQL Server 2014

Upgrading Your SQL Server Skills to Microsoft SQL Server 2014 va

Hadoop Introduction coreservlets.com and Dima May coreservlets.com and Dima May

Investor Presentation. Second Quarter 2015

Course 10977: Updating Your SQL Server Skills to Microsoft SQL Server 2014

Upgrading Your SQL Server Skills to Microsoft SQL Server 2014

Using Attunity Replicate with Greenplum Database Using Attunity Replicate for data migration and Change Data Capture to the Greenplum Database

Einsatzfelder von IBM PureData Systems und Ihre Vorteile.

Oracle BI EE Implementation on Netezza. Prepared by SureShot Strategies, Inc.

Dell* In-Memory Appliance for Cloudera* Enterprise

The Evolution of Microsoft SQL Server: The right time for Violin flash Memory Arrays

SQL Server Everything built-in. Csom Gergely Microsoft Adat platform szakértő

SQL Server What s New? Christopher Speer. Technology Solution Specialist (SQL Server, BizTalk Server, Power BI, Azure) v-cspeer@microsoft.

Data Refinery with Big Data Aspects

Hybrid Software Architectures for Big

Understanding Microsoft s BI Tools

How To Handle Big Data With A Data Scientist

QLIKVIEW INTEGRATION TION WITH AMAZON REDSHIFT John Park Partner Engineering

Microsoft Research Windows Azure for Research Training

MS 10977B Upgrading Your SQL Server Skills to Microsoft SQL Server 2014

Microsoft Big Data Solutions. Anar Taghiyev P-TSP

Implementing a Data Warehouse with Microsoft SQL Server 2012

Microsoft Research Microsoft Azure for Research Training

Luncheon Webinar Series May 13, 2013

Understanding NoSQL on Microsoft Azure

Architectures for Big Data Analytics A database perspective

News and trends in Data Warehouse Automation, Big Data and BI. Johan Hendrickx & Dirk Vermeiren

WINDOWS AZURE DATA MANAGEMENT AND BUSINESS ANALYTICS

SQL Server 2012 Performance White Paper

Microsoft SQL Server 2008 R2 Enterprise Edition and Microsoft SharePoint Server 2010

THE JOURNEY TO A DATA LAKE

Cost Savings Solutions for Year 5 True Ups

SQLSaturday #399 Sacramento 25 July, Big Data Analytics with Excel

How To Use Hp Vertica Ondemand

Big Data Analytics - Accelerated. stream-horizon.com

Big Data on Microsoft Platform

Transcription:

Big Data Processing: Past, Present and Future Orion Gebremedhin National Solutions Director BI & Big Data, Neudesic LLC. VTSP Microsoft Corp. Orion.Gebremedhin@Neudesic.COM B-orgebr@Microsoft.com @OrionGM

Big Data Processing: Past, Present and Future

Topics Covered History and Fundamentals of Big Data Processing SQL Server for Big Data, Past, Present and Future Summary

Characteristics of Big Data Big data is data that exceeds the processing capacity of conventional database systems. The data is too big, moves too fast, or doesn t fit the strictures of your database architectures. To gain value from this data, you must choose an alternative way to process it.

Characteristics of Big Data The Vs of Big Data Volume 40 Zettabytes (43 Trillion Gigabytes) of data will be created by 2020. 300 Times increase from 2005 Most companies in the U.S have at least 100Tb of data Velocity NYSE captures 1TB of trade information every day The average modern car has over 100 sensors Variety Nearly 420 Million wearable health monitors Over 4 Billion hours of video watched on YouTube everyday

History of Big Data A big data cluster is a highly interconnected platform built from a collection of commodity parts. *Disruptive Possibilities by Jeffrey Needham Copyright 2013

Scale Up vs. Scale Out Scale up (SMP) Scale out (MPP) + (n) Upgrade components or buy bigger server each time Add nodes to the cluster Multiprocessor system where processors share resources : Operating System (OS), memory, I/O devices and connected using a common bus Multiple processors, each processor using its own OS and memory and communicating with each other using some form of messaging interface

Notable milestones in Commodity hardware CDC 6600 by Control Data Corporation. "The 6600 CPU had multiple functional units which could operate simultaneously (i.e., in parallel), allowing the CPU to overlap instructions' execution times.. http://en.wikipedia.org/wiki/cdc_6600 A Beowulf cluster (1990s) is a computer cluster of what are normally identical, commodity-grade computers networked into a small local area network with libraries and programs installed which allow processing to be shared among them. http://en.wikipedia.org/wiki/beowulf_cluster

Some Applications of Big Data Big Data supercomputers are pattern explorers. Shopping Patterns Sensor and Intelligent devices Data analytics Social Network associations and suggestions Predictive analytics Crime investigation

SQL Server for Big Data

SQL Server Optimizations

Microsoft Analytics Platform System About Analytics Platform System! SQL Server Parallel Data Warehouse PolyBase!!!! Microsoft HDInsight!!!

APS Growth Topology Scale Unit Base Unit Base UnitExtension

Introducing the Microsoft Analytics Platform System Relational and nonrelational data in a single appliance Near real-time performance with In- Memory Columnstore Industry s lowest data warehouse appliance price per terabyte Enterprise-ready Hadoop Integrated querying across Hadoop and PDW using T- SQL Direct integration with Microsoft BI tools such as Microsoft Excel Ability to scale out to accommodate growing data Removal of data warehouse bottlenecks with MPP SQL Server Concurrency that fuels rapid adoption Value through a single appliance solution Value with flexible hardware options using commodity hardware

Deployment options and hybrid solutions

Connecting islands of data with PolyBase Selec t Resul t set Provides a single T-SQL query model for PDW and Hadoop with rich features of T-SQL, including joins without ETL Microsoft Azure HDInsight Hortonworks for Windows and Linux Cloudera SQL Server Parallel Data Warehouse PolyBase Microsoft HDInsight Uses the power of MPP to enhance query execution performance Supports Windows Azure HDInsight to enable new hybrid cloud scenarios Provides the ability to query non- Microsoft Hadoop distributions, such as Hortonworks and Cloudera

Microsoft s modern data warehouse SQL Server 2014 Analytics Platform System Microsoft Azure HDInsight Data Platform

Summary Understand your data growth to determine when to Scale-Out. Determine the right tool for the workload you have.

Questions? Questions and Discussion