Agenda. Modern Data Warehouse Big Data Application examples. Analytic Platform Systems. Integration of Hadoop and APS. Architecture Hadoop

Size: px
Start display at page:

Download "Agenda. Modern Data Warehouse Big Data Application examples. Analytic Platform Systems. Integration of Hadoop and APS. Architecture Hadoop"

Transcription

1 Microsoft Analytics Platform System The turnkey modern data warehouse appliance Stefan Cronjaeger June 2014

2 Agenda Modern Data Warehouse Big Data Application examples Analytic Platform Systems Architecture Hadoop Integration of Hadoop and APS APS with external Hadoop Clusters APS with Hadoop in the Cloud APS with integrated Hadoop

3 Data sources 3

4 Data sources Non-Relational Data 4

5 Big Data: Variety, Velocity, Volume and Analytics Web Sensor and machine log Social Media Business apps

6 Technologies to drive Big Data

7 What to do with the data Geo analysis Forecast Customer interaction Keywords & Sentiment Churn Customer segmentation Shopping basket & Recommendation Scoring & Outlier 7

8 Examples for sentimental analysis: Not only Marketing Browse blogs, Twitter, News articles, Newsgroups Extract key words, pairs of key words, sentiments Analyze and correlate Campaign supervision Political campaigns and keywords Marketing campaigns Trend analysis Quality assurance Analyse internal technical discussion groups Get early warning of possible technical issues Supply chain for fashion Look in fashion blogs and discussion groups Forecast demand of specific fashion articles 8

9 Structured data: Fraud detection in large amounts of financial data where to look Not all digits are equal! 130 years ago Simon Newcomb detected that more numbers started with the digit 1. Re-discovered by Benford The idea: Look into the numbers (e.g., balance sheet), look how the numbers are usually distributed and look for deviations Application: Tax fraud in balance sheets. Actually used by auditors Manipulated numbers in scientific publications Fraud in elections, election campaign financing, 9

10 An application of Benford s law Differences in number statistics for EU reporting of Social Data and Deficit data by country Bernhard Rauch, Max Göttsche, Gernot Brähler & Thomas Kronfeld (2014) Deficit versus social statistics: empirical evidence for the effectiveness of Benford s law, Applied Economics Letters, 21:3,

11 Data sources Non-relational data

12 Agenda Modern Data Warehouse Big Data Application examples Analytic Platform Systems Architecture Hadoop Integration of Hadoop and APS APS with external Hadoop Clusters APS with Hadoop in the Cloud APS with integrated Hadoop

13 About Analytics Platform System

14 PDW Logical Architecture Control Node (virtualized) Compute/Storage Nodes (virtualized) Database host Servers Direct Attached Storage Nodes Client Queries Control Host Node Virtualization spare All servers are virtualization hosts Running Windows Server 2012 Control and compute nodes are virtual All run SQL Server 2012 Control node spreads data and workload across compute nodes Data loads are in parallel and take advantage of the power of all nodes Fast Infiniband interconnection

15 Scalability: Massively Parallel and Shared nothing Add Capacity Smallest (0TB) To Largest (5PB) Add Capacity Start small with a few Terabyte warehouse Add capacity up to 5 Petabytes 0TB 5 PB Just grow by adding scale units An SMP system would have needed to be completely reconfigured

16 2 InfiniBandFDR 36 Port Switches 2 Ethernet Switches G Control Node DL360p Failover Node DL360p For customer use The Base Unit has approximate useable storage capacity of 75TB, based on 5:1 compression. 3 additional Scale Units can fit into 1 rack, for up to 300 TB of useable storage. 3 rd Scale Unit for 8 nodes 2 ProLiant DL360p Compute Nodes Storage Block (D6000), 70 drives 2 nd Scale Unit for 6 nodes 2 ProLiant DL360p Compute Nodes Storage Block (D6000), 70 drives 1 st Scale Unit for 4 nodes 2 ProLiant DL360p Compute Nodes Storage Block (D6000), 70 drives Base Unit for 2 nodes 2 ProLiant DL360p Compute Nodes Storage Block (D6000), 70 drives Multiple racks can be configured for more useable storage. The 1TB drives can be replaced with 2TB or 3TB drives, for double or triple capacity. However, multiple Scale Units will provide better performance compared to one Base Unit with larger hard drives. For example, 3 Scale Units with 1TB drives will perform much better than 1 Base Unit with 3TB drives. Backup Node and Landing Zone (ETL Storage) is not included. The customer can order whatever they want for backup purposes, and install it themselves.

17 Software Windows Server 2012: Control Node, Mgmt. Node and Compute Nodes run in virtualized Environment Workload Management Workload classes System Center 2012: Single user i/f for management of PDW, OS, BI, custom apps and private cloud xvelocity In-memory execution Clustered columnstore SQL Server 2012 inside Visual Studio Data Tools Powerview directly on PDW Big Data Integration Polybase: T-SQL query to Hadoop External tables on Hadoop

18 A multi-region/workload appliance

19 Microsoft What is Hadoop? HCatalog Oozie HBase/Cassandra/Couch/ MongoDB Hive Mahout R Cascading Pig Flume Sqoop Zookeeper Ambari HBase(column DB) Hadoop = MapReduce + HDFS Avro Distributed, scalable system on commodity hardware composed of: HDFS distributed file system MapReduce programming model Others: HBase, R, Pig, Hive, Flume, Mahout, Avro, Zookeeper

20 APS: Parallel Data Warehouse and HDInsight region Control Node Failover Node Hadoop Head Node Hadoop redundant Head Node For customer use Configurable: Minimum 1 PDW region Additional PDW scale units Additional HDI scale units Hadoop region Hadoop region PDW scale unit PDW region

21 HDI region overview In a nutshell, it s a HDI instance running on an appliance. HDInsight is Microsoft branded Hortonworks distro. An integrated appliance for running PDW region and HDI region PDW is offered as a stand-alone workload on the appliance. HDI is offered only as an add-on to PDW, as a scale unit Based on V2 hardware. H/A for the Head Node is provided via Windows Failover Clustering (WFC), Data Node H/A is provided via HDFS/MapRed mechanisms Security add-ons to address security issues which are not contained in standard Hadoop Support for multiple user accounts

22 Query Hadoop data with T-SQL using PolyBase Bringing the worlds or big data and the data warehouse together for users and IT Select Result set Windows Azure HDInsight Cloudera Hortonworks (Windows, Linux) SQL Server Parallel Data Warehouse PolyBase Microsoft HDInsight Single T-SQL query model for PDW and Hadoop with rich features of T-SQL including joins without ETL Leverages the power of MPP to enhance query execution performance Supports Windows Azure HDInsight to enable new hybrid cloud scenarios Query non-microsoft Hadoop distributions such as Hortonworks and Cloudera

23 Big data insights for any user Native Microsoft BI integration to create new insights with familiar tools Leverages high adoption of Excel, Power View, Power Pivot, and SSAS No IT intervention required Everyone else using Microsoft BI tools Allow any users to create new insights with familiar tools Analyze PDW and Hadoop data in the same view Power Users Data Scientists

24 Differentiation: Freedom of deployment options and hybrid solutions

25 APS Management Console 1 PDW and Appliance

26 Agenda Modern Data Warehouse Big Data Application examples Analytic Platform Systems Architecture Hadoop Integration of Hadoop and APS APS with external Hadoop Clusters APS with Hadoop in the Cloud APS with integrated Hadoop

27 Polybase Use Case Category 1 Integration with external Hadoop clusters

28 Listening to SQL customers ShinSeGae Investing into Online Shopping website ( Korea s Amazon ) o SQL Server 2012 PDW & HDP 1.3/HDP 2.0 on Linux What they want 1. We want perform complex data mining on customer purchase data basket analysis. 2. We want to understand the social media data (reviews/twitter) specifically around our products & stores. 3. We will use Hadoop to keep all of our data ~ envisioned to be around 480 TB. PDW will be the efficient analysis engine for the hot data. 4. PDW & Polybaseare much faster than Hive. 5. We re interested in using data mining cloud services in Azure (hybrid scenarios) Microsoft NDA - Material

29 Listening to SQL customers TeleCom Understanding network quality o SQL Server 2012 PDW & Cloudera 4.5 on Linux What they want 1. We collect millions of network records for quality assessment and capacity planning on a daily basis. 2. Hadoop will be used for storage and ETL of these network record files. 3. PDW for more operational analysis, ad-hoc analysis, operational reports. 4. We are using Polybasealong with Oozie-based orchestration for a seamless & automated integration. Microsoft NDA - Material

30 Solution Architecture Integration with external Hadoop cluster (1) Polybase for integrating with various Hadoop distributions Support of Hortonwork s HDP 1.x & 2.x (Windows Server and Linux) Support of Cloudera scdh 4.x (on Linux) Microsoft APS Polybase Your Apps PowerPivot & PowerView External Table Push-down computation w/ AU1 release Pushing computation where data resides (Hadoop as query execution & processing aid) Transparent for users no need to learn map/reduce Seamless query experience through external tables + simplified & parallelized ETL through T-SQL (CTAS for import & CETAS for export) APS control & data nodes External Data source Polybase/APS query engine External File Format Web Apps Social Apps Mobile Apps Sensor & RFID Integration with 3 rd party tool and Microsoft insights/bi layer Existing applications simply work External tables populated through application layer like regular tables SQL Server Security Model You decide who sees what type of data SQL Server permission model adapted for each Polybase object external table, data source, and file format Microsoft NDA - Material

31 T-SQL Examples Integration with external Hadoop cluster (2) Creating external table, data source, file format Your Apps PowerPivot & PowerView CREATE EXTERNAL DATA SOURCE HDP2.0 WITH (TYPE = Hadoop, LOCATION = hdfs://hdp:8020,job_tracker_location= HDP:50300 ); CREATE EXTERNAL FILE FORMAT MyRCFile WITH(FORMAT_TYPE = RCFile, SERDE_METHOD = LazyBinarySerDe ) Microsoft APS Polybase External Table CREATE EXTERNAL TABLE Clickstream(url varchar(50),event_date date) WITH (DATA_SOURCE = HDP2.0,LOCATION = /employees/ employee.txt, FILE_FORMAT = MyRCFile); External Data source Polybase/APS query engine External File Format Querying Hadoop data SELECT user_name FROM ClickStream cs, PDW_User u WHERE cs.user_ip = u.user_ip and cs.url= ; APS control & data nodes Web Apps Social Apps Mobile Apps Sensor & RFID Persistently exporting & importing CREATE EXTERNAL TABLE Web_Sales WITH (LOCATION='/TPCDS/web_sales/, DATA_SOURCE = HDP2.0, FILE_FORMAT = MyRCFile) AS SELECT u.* FROM PDW_User CREATE TABLE PDW_Sales WITH DISTRIBUTION = Hash (id) AS SELECT FROM Web_Sales Microsoft NDA - Material

32 Solution Architecture (Details) ShinSeGae 2. Unstructured/semi-structured text data - External Polybase tables D, E, F Text (Board/SNS/ Internal Text ) Weather.. 1. Web log data(160gb/daily) External Polybase tables A, B, C Complex Event Processing (Storm) Message Queues (KAFKA, Open source) Tracking Log Servers SSG.com (renewal) Online Shopping Mall Recommendation engine & personalized advertising 3. Company s External Polybase tables G, H, I Mails Campaign HDP 1.3 on Linux (5-10 servers) raw/cold data Analytic information (right customer targeting) Polybase Queries 10 GB Ethernet APS/PDW Operational Data Store EDW Recent/hot data stored in PDW EIS OLAP (Tabular) DATA Mining Visualization (Silverlight) BI analyst Microsoft NDA - Material

33 Solution Architecture (Details) Telcom Capturing Network logs (>300 GB/per day) External Polybasetables A, B, C Usage of Hive s Metadata stores HCatalog Polybase Queries APS/PDW Network quality analysis High-frequency Event Processing (Network logs) Cloudera s CDH 4 on HP (18+ servers) raw/cold data (Petabyte of network logs) Infiniband Operational Data Store EDW Hot operational PDW data Capacity Planning Visualization (PowerPivot) BI analyst/planner/ Decision-maker Oozie Workflows Remote procedure calls via stored procedures to trigger Polybase queries Microsoft NDA - Material

34 Polybase Use Case Category 2 Integration with Microsoft Azure

35 Listening to SQL customers (5) Government Bridging the gap between cloud & onprem ocurrent POC -SQL Server 2012 PDW & HDInsightAzure What they want 1. HDInsight/Hadoop in the cloud to store and massage our raw data (XML files) generated by our web-application. 2. PDW to keep the data on-prem (legal requirement) and to have an efficient query engine for analysis purposes. 3. Polybase is a great way of accessing our files in the cloud via simple T-SQL. 4. With this solution, we can allow web users to quickly ask questions while the heavy, more complex business analysis is accomplished by PDW users. Microsoft NDA - Material

36 Solution Architecture Hybrid Scenarios Microsoft Azure Your Apps Azure HDInsight Polybase as key integrative feature Integration with external Hadoop, HDInsight region & Azure Storage Data aging strategies Aging of cold data to Azure Storage APS & HDInsight region for hot & warm data Azure Storage Public Internet Azure Express Route Queryhot data & cold aged data APS as modern cloud end-point for Azure Seamless querying of hot & cold data through APS APS as gateway allowing users to query all on-prem data via PowerBI and T-SQL examples On-premises or private cloud Your Apps Microsoft or 3 rd party Applications Microsoft APS Polybase APS control & data nodes CREATE EXTERNAL DATA SOURCE WASB WITH (TYPE = Hadoop, LOCATION = wasbs://dailylogs@myaccount.blob.core.windows.net ); CREATE EXTERNAL TABLE clickstream_hdinsights (url varchar(50), event_date date) WITH (DATA_SOURCE = WASB, LOCATION = /input/ log1.txt,file_format = MyDelimitedText); SELECT FROM clickstream_hdinsights, PDW_Table Microsoft NDA - Material

37 Solution Architecture (Details) Government HDI tools for data transformation Web apps- Generating tons of smaller XML files (~7KB each) Web Application for Tax Filing (einvoice) Other Web Feeds Transforming to large text files ~ 10 GBs each (External WASB Tables) HDI on Azure Azure Blob Storage cheap data store alternative to Hadoop onprem solution Public Internet or Azure Express Route Polybase Queries APS/PDW Operational Data Store EDW PDW/APS for fast query response & data processing of hot data Microsoft BI stack IBM Cognos Microsoft NDA - Material

38 Polybase Use Case Category 3 Unified Appliance with PDW and HDInsight region

39 Listening to SQL customers (6) Beverage & Vending Machines What are you drinking? Why is the machine down? o POC - SQL Server/APS with PDW & HDI region What they want 1. We want a complete solution stack we do not have Hadoop experts in-house and don t have the money to get it. 2. We want to store all raw data coming from vending machines into Hadoop degree of all our data structured customer data & unstructured data coming from vending machines. 4. Predicate maintenance of machines. Microsoft NDA - Material

40 Solution Architecture Unified APS appliance Your Apps External Table PowerPivot & PowerView Distributed & replicated table Unified appliance Multi-workload support with PDW and HDInsight region HDInsight powered by HDP bits No need to deal with multiple support teams ( better together ) Seamless & performing query experience through Polybase External tables can be used for HDI data PDW data nodes connected via high-speed network (Infiniband) to Hadoop data nodes Unified Microsoft APS with PDW & HDI region Simplified management & monitoring One consistent monitoring experience through appliance management tools T-SQL examples APS control & data nodes Web Apps HDI name & data nodes Social Apps Mobile Apps Sensor & RFID CREATE EXTERNAL DATA SOURCE HDI_R WITH (TYPE = Hadoop, LOCATION = 'hdfs://htukia-c-hhn01:8020,job_tracker_location ='HTUKIA-C HHN01:50300' CREATE EXTERNAL TABLE HDI_Region (url varchar(50), event_date date) WITH (DATA_SOURCE = WASB, LOCATION = /input/ log1.txt,file_format = MyDelimitedText); SELECT FROM clickstream_hdinsights, PDW_Table Microsoft NDA - Material

41 Solution Architecture (Details) Internal Microsoft Data Scientist Data scientist group 1 - using chaing of Hive queries & PowerQueryvia HiveODBC Hive & PowerQuery via Hive ODBC Analyzing ~3 TB Web Traffic msn.com Log files Microsoft servers Log files Secure Gateway & AD Integration HDI region 1 scale unit HDI region System Center & AdminConsole Polybase Queries Infiniband PDW region Full Rack PDW Data scientist group 2 -Using Polybasefor existing tooling (T-SQL, BI tools), performing processing of complex analytical queries & consistent management experience PowerQuery/PowerV iew/powermap Analytical queries via SSDT APS with PDW & HDI region Microsoft NDA - Material

42 Microsoft Digital Crime Unit Part of Microsoft LCA (Legal and Corporate Affairs) mandated to help protect Internet DCU s Challenge: To effectively combat digital crime requires the collection of huge amounts of data from multiple sources. DCU needs to be able to: Process 10s of TBs daily and house PBs of data historically (accessible as needed) House 100s of terabytes from multiple sources that is easily queryable. Use leading edge business intelligence and visualization tools.

43 Corporate Security Officers DCU Big Data Solution DCU Investigators and Analysts Predictive Analytics Embedded BI SQL Azure Azure MSFT SQL Stream Insight Data Sources Sinkholes, Passive DNS, Files, 3 rd Party Security Info. 500 TB SAN Storage PowerView HP Business S Decision S R Appliance S Hadoop 30 Node Cluster On Windows Excel with PowerPivot SSIS SharePoint, SSRS, SSAS, PowerView, PowerPoint HP EDW Appliance MSFT PDW

44 Microsoft Digital Crime Unit Data Source for BI Drop Extract Load Transform Data Source for BI Source for BI Hadoop SSIS PDW SSAS Microsoft BI Microsoft Digital Crime Unit currently being implemented) Part of Microsoft LCA (Legal and Corporate Affairs) mandated to help protect the Internet To effectively combat digital crime requires the collection of huge amounts of data from multiple sources. Process 10s of TBs daily and house PBs of data historically (accessible as needed) House 100s of terabytes from multiple sources that is easily queryable. Use leading edge business intelligence and visualization tools. 30 Node Hadoop on Windows Server Control Rack and 10 Node PDW Data Rack HP BDA (Business Decision Appliance) upgraded to SQL 2012 BI Voyage currently implementing PDW and BI portions of the project.

45 Why 2 Storage Platforms? HADOOP Parallel Data Warehouse Storage Capacity in the Petabytes Storage Capacity in the 100s of Terabytes Simplified Load, just drop unstructured or semi-structured files ETL process more complex to transform data in to reporting optimized DB structures No optimization of queries Structures can be optimized for common query patterns. Queried by IT professionals Queried by business analysts Complex and slow to query multiple sources at once Hadoop is DCU s Centralized Data Warehouse. Simple load and high capacity make it optimal for storing huge volumes of data. Optimized for fast query against key data from multiple sources. PDW is DCU s Data Mart platform. Easily accessible, intuitive data structures, and blazing fast for querying data.

46 APS Differentiators Part of a product family: From SQL server standalone to Cloud service offerings TCO: Very low, especially when looking on the whole bundle: ETL (SSIS), PDW, Data marts (SQL server) and Analytics (SSAS, SSRS) Appliance: Much lower effort for DBAs Microsoft product stack integration SSIS, SSAS, SSRS, PowerPivot, System Center, integration with Cloud services Linear Scaling via Shared Nothing xvelocity: Column Store and In-Memory execution Polybase: Integration with Big Data and Hadoop HDInsight integrated: fast Infiniband interconnect, management and security Microsoft exhibits one of the best value propositions on the market with a low cost and a highly favorable price/performance ratio - Gartner, February 2012

47

48 Columnstore Up to 100x faster queries Up to 15x more compression Updatable clustered columnstore vs. table with customary indexing 48 Parallel query execution Query Results Store data in columnar format for massive compression Load data into or out of memory for nextgeneration performance Updateable and clustered for real-time trickle loading

49 Concurrency that fuels rapid adoption Great performance with mixed workloads ETL/ELT with SSIS, DQS, MDS Analytics Platform System SQL Server SMP ERP CRM LOB APPS ETL/ELT with DWLoader PDW SSRS / SSAS Hadoop / Big Data PolyBase BI Tools Ad hoc queries HDInsight

50 MEC, a global media agency, uses SQL Server PDW with in-memory technology to cut query time helping marketers unlock the value of their data. SQL Server Analytics Platform System gives us massively parallel advantages. Whereas it would take up to four hours to run queries scaling across multiple nodes, now it takes just minutes.

51 Value through a single flexible appliance solution Why Analytics Platform System when I have SQL Server? Single appliance solution PDW Reduce the data center footprint Lower energy costs and usage Accelerate time to value and insights with no forklift required for scaling out PolyBase HDInsight Simplify management with built in System Center Reduce tuning efforts while retaining high performance

52 Value through a single flexible appliance solution Why Analytics Platform System when I have SQL Server? Your choice of hardware PDW Integrated support plan with a single Microsoft contact Co-engineered with HP, Dell and Quanta best practices PolyBase HDInsight Pre-configured, built, tuned software and hardware Leading performance with commodity hardware

53 CROSSMARK needed faster and more detailed insight into terabytes of information about product supply and demand. They deployed a turnkey business intelligence solution from Microsoft and HP that is based on the Microsoft SQL Server Parallel Data Warehouse. People can instantly create their own reports with SQL Server Power View and PowerPivot for Excel and they can build those reports 50 percent to many times faster compared with the previous system.

Please give me your feedback

Please give me your feedback Please give me your feedback Session BB4089 Speaker Claude Lorenson, Ph. D and Wendy Harms Use the mobile app to complete a session survey 1. Access My schedule 2. Click on this session 3. Go to Rate &

More information

Microsoft Analytics Platform System. Solution Brief

Microsoft Analytics Platform System. Solution Brief Microsoft Analytics Platform System Solution Brief Contents 4 Introduction 4 Microsoft Analytics Platform System 5 Enterprise-ready Big Data 7 Next-generation performance at scale 10 Engineered for optimal

More information

Bringing Big Data to People

Bringing Big Data to People Bringing Big Data to People Microsoft s modern data platform SQL Server 2014 Analytics Platform System Microsoft Azure HDInsight Data Platform Everyone should have access to the data they need. Process

More information

Microsoft technológie pre BigData. Ľubomír Goryl Solution Professional

Microsoft technológie pre BigData. Ľubomír Goryl Solution Professional Microsoft technológie pre BigData Ľubomír Goryl Solution Professional Tradičný prístup Breaking points of traditional approach Breaking points of traditional approach Breaking points of traditional approach

More information

The Role Polybase in the MDW. Brian Mitchell Microsoft Big Data Center of Expertise

The Role Polybase in the MDW. Brian Mitchell Microsoft Big Data Center of Expertise The Role Polybase in the MDW Brian Mitchell Microsoft Big Data Center of Expertise Program Polybase Basics Polybase Scenarios Hadoop for Staging Ambient data from Hadoop Export Dimensions to Hadoop Hadoop

More information

Modernizing Your Data Warehouse for Hadoop

Modernizing Your Data Warehouse for Hadoop Modernizing Your Data Warehouse for Hadoop Big data. Small data. All data. Audie Wright, DW & Big Data Specialist Audie.Wright@Microsoft.com O 425-538-0044, C 303-324-2860 Unlock Insights on Any Data Taking

More information

Parallel Data Warehouse

Parallel Data Warehouse MICROSOFT S ANALYTICS SOLUTIONS WITH PARALLEL DATA WAREHOUSE Parallel Data Warehouse Stefan Cronjaeger Microsoft May 2013 AGENDA PDW overview Columnstore and Big Data Business Intellignece Project Ability

More information

Modern Data Warehousing

Modern Data Warehousing Modern Data Warehousing Cem Kubilay Microsoft CEE, Turkey & Israel Time is FY15 Gartner Survey April 2014 Piloting on premise 15% 10% 4% 14% 57% 2014 5% think Hadoop will replace existing DW solution (2013:

More information

SQL Server 2012 PDW. Ryan Simpson Technical Solution Professional PDW Microsoft. Microsoft SQL Server 2012 Parallel Data Warehouse

SQL Server 2012 PDW. Ryan Simpson Technical Solution Professional PDW Microsoft. Microsoft SQL Server 2012 Parallel Data Warehouse SQL Server 2012 PDW Ryan Simpson Technical Solution Professional PDW Microsoft Microsoft SQL Server 2012 Parallel Data Warehouse Massively Parallel Processing Platform Delivers Big Data HDFS Delivers Scale

More information

How to make BIG DATA work for you. Faster results with Microsoft SQL Server PDW

How to make BIG DATA work for you. Faster results with Microsoft SQL Server PDW How to make BIG DATA work for you. Faster results with Microsoft SQL Server PDW Roger Breu PDW Solution Specialist Microsoft Western Europe Marcus Gullberg PDW Partner Account Manager Microsoft Sweden

More information

SQL Server 2012 Parallel Data Warehouse. Solution Brief

SQL Server 2012 Parallel Data Warehouse. Solution Brief SQL Server 2012 Parallel Data Warehouse Solution Brief Published February 22, 2013 Contents Introduction... 1 Microsoft Platform: Windows Server and SQL Server... 2 SQL Server 2012 Parallel Data Warehouse...

More information

Big Data Technologies Compared June 2014

Big Data Technologies Compared June 2014 Big Data Technologies Compared June 2014 Agenda What is Big Data Big Data Technology Comparison Summary Other Big Data Technologies Questions 2 What is Big Data by Example The SKA Telescope is a new development

More information

Big Data Processing: Past, Present and Future

Big Data Processing: Past, Present and Future Big Data Processing: Past, Present and Future Orion Gebremedhin National Solutions Director BI & Big Data, Neudesic LLC. VTSP Microsoft Corp. Orion.Gebremedhin@Neudesic.COM B-orgebr@Microsoft.com @OrionGM

More information

AGENDA. What is BIG DATA? What is Hadoop? Why Microsoft? The Microsoft BIG DATA story. Our BIG DATA Roadmap. Hadoop PDW

AGENDA. What is BIG DATA? What is Hadoop? Why Microsoft? The Microsoft BIG DATA story. Our BIG DATA Roadmap. Hadoop PDW AGENDA What is BIG DATA? What is Hadoop? Why Microsoft? The Microsoft BIG DATA story Hadoop PDW Our BIG DATA Roadmap BIG DATA? Volume 59% growth in annual WW information 1.2M Zetabytes (10 21 bytes) this

More information

Structured data meets unstructured data in Azure and Hadoop

Structured data meets unstructured data in Azure and Hadoop 1 Structured data meets unstructured data in Azure and Hadoop Sameer Parve, Blesson John sameerpa@microsoft.com Blessonj@Microsoft.com PFE SQL Server/Analytics Platform System October 30 th 2014 Agenda

More information

HDP Hadoop From concept to deployment.

HDP Hadoop From concept to deployment. HDP Hadoop From concept to deployment. Ankur Gupta Senior Solutions Engineer Rackspace: Page 41 27 th Jan 2015 Where are you in your Hadoop Journey? A. Researching our options B. Currently evaluating some

More information

SELLING PROJECTS ON THE MICROSOFT BUSINESS ANALYTICS PLATFORM

SELLING PROJECTS ON THE MICROSOFT BUSINESS ANALYTICS PLATFORM David Chappell SELLING PROJECTS ON THE MICROSOFT BUSINESS ANALYTICS PLATFORM A PERSPECTIVE FOR SYSTEMS INTEGRATORS Sponsored by Microsoft Corporation Copyright 2014 Chappell & Associates Contents Business

More information

How To Create A Fact Table On Hadoop (Hadoop) On A Microsoft Powerbook 2.5.1 (Powerbook) On An Ipa 2.2 (Powerpoint) On Microsoft Microsoft 2.3

How To Create A Fact Table On Hadoop (Hadoop) On A Microsoft Powerbook 2.5.1 (Powerbook) On An Ipa 2.2 (Powerpoint) On Microsoft Microsoft 2.3 學 習 門 檻 太 高, 把 人 變 成 7x24 系 統 IT 需 要 藉 由 人 工 化 的 方 式 重 置 資 料 到 DW Learn MapReduce Prior manual IT moving HDFS into Warehouse/Data Mart before Analysis 感 應 器 HDInsight (Hadoop) SQL Server 2012 PDW SQL Server

More information

Comprehensive Analytics on the Hortonworks Data Platform

Comprehensive Analytics on the Hortonworks Data Platform Comprehensive Analytics on the Hortonworks Data Platform We do Hadoop. Page 1 Page 2 Back to 2005 Page 3 Vertical Scaling Page 4 Vertical Scaling Page 5 Vertical Scaling Page 6 Horizontal Scaling Page

More information

The Future of Data Management

The Future of Data Management The Future of Data Management with Hadoop and the Enterprise Data Hub Amr Awadallah (@awadallah) Cofounder and CTO Cloudera Snapshot Founded 2008, by former employees of Employees Today ~ 800 World Class

More information

HDP Enabling the Modern Data Architecture

HDP Enabling the Modern Data Architecture HDP Enabling the Modern Data Architecture Herb Cunitz President, Hortonworks Page 1 Hortonworks enables adoption of Apache Hadoop through HDP (Hortonworks Data Platform) Founded in 2011 Original 24 architects,

More information

SQL Server 2014 Faster Insights from any Data Level 300

SQL Server 2014 Faster Insights from any Data Level 300 SQL Server 2014 Faster Insights from any Data Level 300 Data Explorer Preview for Excel Enable self-service data discovery, query, transformation, and mashup experiences for information workers through

More information

TE's Analytics on Hadoop and SAP HANA Using SAP Vora

TE's Analytics on Hadoop and SAP HANA Using SAP Vora TE's Analytics on Hadoop and SAP HANA Using SAP Vora Naveen Narra Senior Manager TE Connectivity Santha Kumar Rajendran Enterprise Data Architect TE Balaji Krishna - Director, SAP HANA Product Mgmt. -

More information

A Breakthrough Platform for Next-Generation Data Warehousing and Big Data Solutions

A Breakthrough Platform for Next-Generation Data Warehousing and Big Data Solutions A Breakthrough Platform for Next-Generation Data Warehousing and Big Data Solutions Writers: Barbara Kess and Dan Kogan Reviewers: Murshed Zaman, Henk van der Valk, John Hoang, Rick Byham Published: October

More information

The Future of Data Management with Hadoop and the Enterprise Data Hub

The Future of Data Management with Hadoop and the Enterprise Data Hub The Future of Data Management with Hadoop and the Enterprise Data Hub Amr Awadallah Cofounder & CTO, Cloudera, Inc. Twitter: @awadallah 1 2 Cloudera Snapshot Founded 2008, by former employees of Employees

More information

The Inside Scoop on Hadoop

The Inside Scoop on Hadoop The Inside Scoop on Hadoop Orion Gebremedhin National Solutions Director BI & Big Data, Neudesic LLC. VTSP Microsoft Corp. Orion.Gebremedhin@Neudesic.COM B-orgebr@Microsoft.com @OrionGM The Inside Scoop

More information

Microsoft Big Data. Solution Brief

Microsoft Big Data. Solution Brief Microsoft Big Data Solution Brief Contents Introduction... 2 The Microsoft Big Data Solution... 3 Key Benefits... 3 Immersive Insight, Wherever You Are... 3 Connecting with the World s Data... 3 Any Data,

More information

BIG DATA TRENDS AND TECHNOLOGIES

BIG DATA TRENDS AND TECHNOLOGIES BIG DATA TRENDS AND TECHNOLOGIES THE WORLD OF DATA IS CHANGING Cloud WHAT IS BIG DATA? Big data are datasets that grow so large that they become awkward to work with using onhand database management tools.

More information

The Microsoft Modern Data Warehouse

The Microsoft Modern Data Warehouse The Microsoft Modern Data Warehouse Contents 4 Executive summary 4 The traditional data warehouse 5 Key trends breaking the traditional data warehouse 6 Increasing data volumes 6 Real-time data 7 New sources

More information

GAIN BETTER INSIGHT FROM BIG DATA USING JBOSS DATA VIRTUALIZATION

GAIN BETTER INSIGHT FROM BIG DATA USING JBOSS DATA VIRTUALIZATION GAIN BETTER INSIGHT FROM BIG DATA USING JBOSS DATA VIRTUALIZATION Syed Rasheed Solution Manager Red Hat Corp. Kenny Peeples Technical Manager Red Hat Corp. Kimberly Palko Product Manager Red Hat Corp.

More information

Whitepaper: Solution Overview - Breakthrough Insight. Published: March 7, 2012. Applies to: Microsoft SQL Server 2012. Summary:

Whitepaper: Solution Overview - Breakthrough Insight. Published: March 7, 2012. Applies to: Microsoft SQL Server 2012. Summary: Whitepaper: Solution Overview - Breakthrough Insight Published: March 7, 2012 Applies to: Microsoft SQL Server 2012 Summary: Today s Business Intelligence (BI) platform must adapt to a whole new scope,

More information

James Serra Sr BI Architect JamesSerra3@gmail.com http://jamesserra.com/

James Serra Sr BI Architect JamesSerra3@gmail.com http://jamesserra.com/ James Serra Sr BI Architect JamesSerra3@gmail.com http://jamesserra.com/ Our Focus: Microsoft Pure-Play Data Warehousing & Business Intelligence Partner Our Customers: Our Reputation: "B.I. Voyage came

More information

Luncheon Webinar Series May 13, 2013

Luncheon Webinar Series May 13, 2013 Luncheon Webinar Series May 13, 2013 InfoSphere DataStage is Big Data Integration Sponsored By: Presented by : Tony Curcio, InfoSphere Product Management 0 InfoSphere DataStage is Big Data Integration

More information

Session 0202: Big Data in action with SAP HANA and Hadoop Platforms Prasad Illapani Product Management & Strategy (SAP HANA & Big Data) SAP Labs LLC,

Session 0202: Big Data in action with SAP HANA and Hadoop Platforms Prasad Illapani Product Management & Strategy (SAP HANA & Big Data) SAP Labs LLC, Session 0202: Big Data in action with SAP HANA and Hadoop Platforms Prasad Illapani Product Management & Strategy (SAP HANA & Big Data) SAP Labs LLC, Bellevue, WA Legal disclaimer The information in this

More information

Building a BI Solution in the Cloud

Building a BI Solution in the Cloud Building a BI Solution in the Cloud Stacia Varga, Principal Consultant Email: stacia@datainspirations.com Twitter: @_StaciaV_ 2 SQLSaturday #467 Sponsors Stacia (Misner) Varga Over 30 years of IT experience,

More information

Tap into Hadoop and Other No SQL Sources

Tap into Hadoop and Other No SQL Sources Tap into Hadoop and Other No SQL Sources Presented by: Trishla Maru What is Big Data really? The Three Vs of Big Data According to Gartner Volume Volume Orders of magnitude bigger than conventional data

More information

Big Data Analytics. with EMC Greenplum and Hadoop. Big Data Analytics. Ofir Manor Pre Sales Technical Architect EMC Greenplum

Big Data Analytics. with EMC Greenplum and Hadoop. Big Data Analytics. Ofir Manor Pre Sales Technical Architect EMC Greenplum Big Data Analytics with EMC Greenplum and Hadoop Big Data Analytics with EMC Greenplum and Hadoop Ofir Manor Pre Sales Technical Architect EMC Greenplum 1 Big Data and the Data Warehouse Potential All

More information

#TalendSandbox for Big Data

#TalendSandbox for Big Data Evalua&on von Apache Hadoop mit der #TalendSandbox for Big Data Julien Clarysse @whatdoesdatado @talend 2015 Talend Inc. 1 Connecting the Data-Driven Enterprise 2 Talend Overview Founded in 2006 BRAND

More information

Polybase for SQL Server 2016

Polybase for SQL Server 2016 Polybase for SQL Server 2016 Lukasz Grala Architect Data Platform & BI Solutions MVP SQL Server Łukasz Grala MVP SQL Server MCT MCSE Architekt i trener - Data Platform & Business Intelligence Solutions

More information

Aligning Your Strategic Initiatives with a Realistic Big Data Analytics Roadmap

Aligning Your Strategic Initiatives with a Realistic Big Data Analytics Roadmap Aligning Your Strategic Initiatives with a Realistic Big Data Analytics Roadmap 3 key strategic advantages, and a realistic roadmap for what you really need, and when 2012, Cognizant Topics to be discussed

More information

IBM Big Data Platform

IBM Big Data Platform IBM Big Data Platform Turning big data into smarter decisions Stefan Söderlund. IBM kundarkitekt, Försvarsmakten Sesam vår-seminarie Big Data, Bigga byte kräver Pigga Hertz! May 16, 2013 By 2015, 80% of

More information

Big Data on Microsoft Platform

Big Data on Microsoft Platform Big Data on Microsoft Platform Prepared by GJ Srinivas Corporate TEG - Microsoft Page 1 Contents 1. What is Big Data?...3 2. Characteristics of Big Data...3 3. Enter Hadoop...3 4. Microsoft Big Data Solutions...4

More information

Tapping Into Hadoop and NoSQL Data Sources with MicroStrategy. Presented by: Jeffrey Zhang and Trishla Maru

Tapping Into Hadoop and NoSQL Data Sources with MicroStrategy. Presented by: Jeffrey Zhang and Trishla Maru Tapping Into Hadoop and NoSQL Data Sources with MicroStrategy Presented by: Jeffrey Zhang and Trishla Maru Agenda Big Data Overview All About Hadoop What is Hadoop? How does MicroStrategy connects to Hadoop?

More information

Architecting for Big Data Analytics and Beyond: A New Framework for Business Intelligence and Data Warehousing

Architecting for Big Data Analytics and Beyond: A New Framework for Business Intelligence and Data Warehousing Architecting for Big Data Analytics and Beyond: A New Framework for Business Intelligence and Data Warehousing Wayne W. Eckerson Director of Research, TechTarget Founder, BI Leadership Forum Business Analytics

More information

5 Keys to Unlocking the Big Data Analytics Puzzle. Anurag Tandon Director, Product Marketing March 26, 2014

5 Keys to Unlocking the Big Data Analytics Puzzle. Anurag Tandon Director, Product Marketing March 26, 2014 5 Keys to Unlocking the Big Data Analytics Puzzle Anurag Tandon Director, Product Marketing March 26, 2014 1 A Little About Us A global footprint. A proven innovator. A leader in enterprise analytics for

More information

Designing Self-Service Business Intelligence and Big Data Solutions

Designing Self-Service Business Intelligence and Big Data Solutions This five-day instructor-led course teaches students how to implement self-service Business Intelligence (BI) and Big Data analysis solutions using the Microsoft data platform. The course discusses the

More information

Azure Data Lake Analytics

Azure Data Lake Analytics Azure Data Lake Analytics Compose and orchestrate data services at scale Fully managed service to support orchestration of data movement and processing Connect to relational or non-relational data

More information

Understanding Microsoft s BI Tools

Understanding Microsoft s BI Tools Understanding Microsoft s BI Tools The purpose of this document is to provide a high level understanding of what tools Microsoft has to support the concepts of data warehousing, business intelligence,

More information

Introducing Oracle Exalytics In-Memory Machine

Introducing Oracle Exalytics In-Memory Machine Introducing Oracle Exalytics In-Memory Machine Jon Ainsworth Director of Business Development Oracle EMEA Business Analytics 1 Copyright 2011, Oracle and/or its affiliates. All rights Agenda Topics Oracle

More information

A Tour of the Zoo the Hadoop Ecosystem Prafulla Wani

A Tour of the Zoo the Hadoop Ecosystem Prafulla Wani A Tour of the Zoo the Hadoop Ecosystem Prafulla Wani Technical Architect - Big Data Syntel Agenda Welcome to the Zoo! Evolution Timeline Traditional BI/DW Architecture Where Hadoop Fits In 2 Welcome to

More information

Einsatzfelder von IBM PureData Systems und Ihre Vorteile.

Einsatzfelder von IBM PureData Systems und Ihre Vorteile. Einsatzfelder von IBM PureData Systems und Ihre Vorteile demirkaya@de.ibm.com Agenda Information technology challenges PureSystems and PureData introduction PureData for Transactions PureData for Analytics

More information

How To Extend An Enterprise Bio Solution

How To Extend An Enterprise Bio Solution Course 20467C: Designing Self-Service Business Intelligence and Big Data Solutions Module 1: Introduction to Self-Service Business Intelligence This module introduces self-service BI. Extending Enterprise

More information

CREATING PACKAGED IP FOR BUSINESS ANALYTICS PROJECTS

CREATING PACKAGED IP FOR BUSINESS ANALYTICS PROJECTS CREATING PACKAGED IP FOR BUSINESS ANALYTICS PROJECTS A PERSPECTIVE FOR SYSTEMS INTEGRATORS Sponsored by Microsoft Corporation 1/ What is Packaged IP? Categorizing the Options 2/ Why Offer Packaged IP?

More information

Course MS20467C Designing Self-Service Business Intelligence and Big Data Solutions

Course MS20467C Designing Self-Service Business Intelligence and Big Data Solutions 3 Riverchase Office Plaza Hoover, Alabama 35244 Phone: 205.989.4944 Fax: 855.317.2187 E-Mail: rwhitney@discoveritt.com Web: www.discoveritt.com Course MS20467C Designing Self-Service Business Intelligence

More information

SQL Server 2014. What s New? Christopher Speer. Technology Solution Specialist (SQL Server, BizTalk Server, Power BI, Azure) v-cspeer@microsoft.

SQL Server 2014. What s New? Christopher Speer. Technology Solution Specialist (SQL Server, BizTalk Server, Power BI, Azure) v-cspeer@microsoft. SQL Server 2014 What s New? Christopher Speer Technology Solution Specialist (SQL Server, BizTalk Server, Power BI, Azure) v-cspeer@microsoft.com The evolution of the Microsoft data platform What s New

More information

BIG DATA: FROM HYPE TO REALITY. Leandro Ruiz Presales Partner for C&LA Teradata

BIG DATA: FROM HYPE TO REALITY. Leandro Ruiz Presales Partner for C&LA Teradata BIG DATA: FROM HYPE TO REALITY Leandro Ruiz Presales Partner for C&LA Teradata Evolution in The Use of Information Action s ACTIVATING MAKE it happen! Insights OPERATIONALIZING WHAT IS happening now? PREDICTING

More information

SQL Server 2016 New Features!

SQL Server 2016 New Features! SQL Server 2016 New Features! Improvements on Always On Availability Groups: Standard Edition will come with AGs support with one db per group synchronous or asynchronous, not readable (HA/DR only). Improved

More information

#mstrworld. Tapping into Hadoop and NoSQL Data Sources in MicroStrategy. Presented by: Trishla Maru. #mstrworld

#mstrworld. Tapping into Hadoop and NoSQL Data Sources in MicroStrategy. Presented by: Trishla Maru. #mstrworld Tapping into Hadoop and NoSQL Data Sources in MicroStrategy Presented by: Trishla Maru Agenda Big Data Overview All About Hadoop What is Hadoop? How does MicroStrategy connects to Hadoop? Customer Case

More information

Big Data & QlikView. Democratizing Big Data Analytics. David Freriks Principal Solution Architect

Big Data & QlikView. Democratizing Big Data Analytics. David Freriks Principal Solution Architect Big Data & QlikView Democratizing Big Data Analytics David Freriks Principal Solution Architect TDWI Vancouver Agenda What really is Big Data? How do we separate hype from reality? How does that relate

More information

Extend your analytic capabilities with SAP Predictive Analysis

Extend your analytic capabilities with SAP Predictive Analysis September 9 11, 2013 Anaheim, California Extend your analytic capabilities with SAP Predictive Analysis Charles Gadalla Learning Points Advanced analytics strategy at SAP Simplifying predictive analytics

More information

Dell In-Memory Appliance for Cloudera Enterprise

Dell In-Memory Appliance for Cloudera Enterprise Dell In-Memory Appliance for Cloudera Enterprise Hadoop Overview, Customer Evolution and Dell In-Memory Product Details Author: Armando Acosta Hadoop Product Manager/Subject Matter Expert Armando_Acosta@Dell.com/

More information

Collaborative Big Data Analytics. Copyright 2012 EMC Corporation. All rights reserved.

Collaborative Big Data Analytics. Copyright 2012 EMC Corporation. All rights reserved. Collaborative Big Data Analytics 1 Big Data Is Less About Size, And More About Freedom TechCrunch!!!!!!!!! Total data: bigger than big data 451 Group Findings: Big Data Is More Extreme Than Volume Gartner!!!!!!!!!!!!!!!

More information

Big Data: Making Sense of it all!

Big Data: Making Sense of it all! Big Data: Making Sense of it all! Jamie Engesser E-mail : jamie@hortonworks.com Page 1 Data Driven Business? Facts not Intuition! Data driven decisions are better decisions its as simple as that. Using

More information

Talend Big Data. Delivering instant value from all your data. Talend 2014 1

Talend Big Data. Delivering instant value from all your data. Talend 2014 1 Talend Big Data Delivering instant value from all your data Talend 2014 1 I may say that this is the greatest factor: the way in which the expedition is equipped. Roald Amundsen race to the south pole,

More information

Harnessing the Power of the Microsoft Cloud for Deep Data Analytics

Harnessing the Power of the Microsoft Cloud for Deep Data Analytics 1 Harnessing the Power of the Microsoft Cloud for Deep Data Analytics Today's Focus How you can operate your business more efficiently and effectively by tapping into Cloud based data analytics solutions

More information

Hortonworks & SAS. Analytics everywhere. Page 1. Hortonworks Inc. 2011 2014. All Rights Reserved

Hortonworks & SAS. Analytics everywhere. Page 1. Hortonworks Inc. 2011 2014. All Rights Reserved Hortonworks & SAS Analytics everywhere. Page 1 A change in focus. A shift in Advertising From mass branding A shift in Financial Services From Educated Investing A shift in Healthcare From mass treatment

More information

Microsoft SQL Server 2012 with Hadoop

Microsoft SQL Server 2012 with Hadoop Microsoft SQL Server 2012 with Hadoop Debarchan Sarkar Chapter No. 1 "Introduction to Big Data and Hadoop" In this package, you will find: A Biography of the author of the book A preview chapter from the

More information

Oracle Big Data SQL Technical Update

Oracle Big Data SQL Technical Update Oracle Big Data SQL Technical Update Jean-Pierre Dijcks Oracle Redwood City, CA, USA Keywords: Big Data, Hadoop, NoSQL Databases, Relational Databases, SQL, Security, Performance Introduction This technical

More information

Are You Ready for Big Data?

Are You Ready for Big Data? Are You Ready for Big Data? Jim Gallo National Director, Business Analytics April 10, 2013 Agenda What is Big Data? How do you leverage Big Data in your company? How do you prepare for a Big Data initiative?

More information

Course 10977A: Updating Your SQL Server Skills to Microsoft SQL Server 2014

Course 10977A: Updating Your SQL Server Skills to Microsoft SQL Server 2014 www.etidaho.com (208) 327-0768 Course 10977A: Updating Your SQL Server Skills to Microsoft SQL Server 2014 5 Days About this Course This five day instructor led course teaches students how to use the enhancements

More information

BITKOM& NIK - Big Data Wo liegen die Chancen für den Mittelstand?

BITKOM& NIK - Big Data Wo liegen die Chancen für den Mittelstand? BITKOM& NIK - Big Data Wo liegen die Chancen für den Mittelstand? The Big Data Buzz big data is a collection of data sets so large and complex that it becomes difficult to process using on-hand database

More information

Are You Ready for Big Data?

Are You Ready for Big Data? Are You Ready for Big Data? Jim Gallo National Director, Business Analytics February 11, 2013 Agenda What is Big Data? How do you leverage Big Data in your company? How do you prepare for a Big Data initiative?

More information

How to use Big Data in Industry 4.0 implementations. LAURI ILISON, PhD Head of Big Data and Machine Learning

How to use Big Data in Industry 4.0 implementations. LAURI ILISON, PhD Head of Big Data and Machine Learning How to use Big Data in Industry 4.0 implementations LAURI ILISON, PhD Head of Big Data and Machine Learning Big Data definition? Big Data is about structured vs unstructured data Big Data is about Volume

More information

So What s the Big Deal?

So What s the Big Deal? So What s the Big Deal? Presentation Agenda Introduction What is Big Data? So What is the Big Deal? Big Data Technologies Identifying Big Data Opportunities Conducting a Big Data Proof of Concept Big Data

More information

BIG DATA ANALYTICS REFERENCE ARCHITECTURES AND CASE STUDIES

BIG DATA ANALYTICS REFERENCE ARCHITECTURES AND CASE STUDIES BIG DATA ANALYTICS REFERENCE ARCHITECTURES AND CASE STUDIES Relational vs. Non-Relational Architecture Relational Non-Relational Rational Predictable Traditional Agile Flexible Modern 2 Agenda Big Data

More information

End to End Solution to Accelerate Data Warehouse Optimization. Franco Flore Alliance Sales Director - APJ

End to End Solution to Accelerate Data Warehouse Optimization. Franco Flore Alliance Sales Director - APJ End to End Solution to Accelerate Data Warehouse Optimization Franco Flore Alliance Sales Director - APJ Big Data Is Driving Key Business Initiatives Increase profitability, innovation, customer satisfaction,

More information

BIG DATA TECHNOLOGY. Hadoop Ecosystem

BIG DATA TECHNOLOGY. Hadoop Ecosystem BIG DATA TECHNOLOGY Hadoop Ecosystem Agenda Background What is Big Data Solution Objective Introduction to Hadoop Hadoop Ecosystem Hybrid EDW Model Predictive Analysis using Hadoop Conclusion What is Big

More information

An Integrated Analytics & Big Data Infrastructure September 21, 2012 Robert Stackowiak, Vice President Data Systems Architecture Oracle Enterprise

An Integrated Analytics & Big Data Infrastructure September 21, 2012 Robert Stackowiak, Vice President Data Systems Architecture Oracle Enterprise An Integrated Analytics & Big Data Infrastructure September 21, 2012 Robert Stackowiak, Vice President Data Systems Architecture Oracle Enterprise Solutions Group The following is intended to outline our

More information

A Modern Data Architecture with Apache Hadoop

A Modern Data Architecture with Apache Hadoop Modern Data Architecture with Apache Hadoop Talend Big Data Presented by Hortonworks and Talend Executive Summary Apache Hadoop didn t disrupt the datacenter, the data did. Shortly after Corporate IT functions

More information

Upcoming Announcements

Upcoming Announcements Enterprise Hadoop Enterprise Hadoop Jeff Markham Technical Director, APAC jmarkham@hortonworks.com Page 1 Upcoming Announcements April 2 Hortonworks Platform 2.1 A continued focus on innovation within

More information

Big Data Realities Hadoop in the Enterprise Architecture

Big Data Realities Hadoop in the Enterprise Architecture Big Data Realities Hadoop in the Enterprise Architecture Paul Phillips Director, EMEA, Hortonworks pphillips@hortonworks.com +44 (0)777 444 3857 Hortonworks Inc. 2012 Page 1 Agenda The Growth of Enterprise

More information

Big Data Analytics. Copyright 2011 EMC Corporation. All rights reserved.

Big Data Analytics. Copyright 2011 EMC Corporation. All rights reserved. Big Data Analytics 1 Priority Discussion Topics What are the most compelling business drivers behind big data analytics? Do you have or expect to have data scientists on your staff, and what will be their

More information

The Enterprise Data Hub and The Modern Information Architecture

The Enterprise Data Hub and The Modern Information Architecture The Enterprise Data Hub and The Modern Information Architecture Dr. Amr Awadallah CTO & Co-Founder, Cloudera Twitter: @awadallah 1 2013 Cloudera, Inc. All rights reserved. Cloudera Overview The Leader

More information

Modern Data Architecture for Predictive Analytics

Modern Data Architecture for Predictive Analytics Modern Data Architecture for Predictive Analytics David Smith VP Marketing and Community - Revolution Analytics John Kreisa VP Strategic Marketing- Hortonworks Hortonworks Inc. 2013 Page 1 Your Presenters

More information

BIG DATA What it is and how to use?

BIG DATA What it is and how to use? BIG DATA What it is and how to use? Lauri Ilison, PhD Data Scientist 21.11.2014 Big Data definition? There is no clear definition for BIG DATA BIG DATA is more of a concept than precise term 1 21.11.14

More information

Hadoop Beyond Hype: Complex Adaptive Systems Conference Nov 16, 2012. Viswa Sharma Solutions Architect Tata Consultancy Services

Hadoop Beyond Hype: Complex Adaptive Systems Conference Nov 16, 2012. Viswa Sharma Solutions Architect Tata Consultancy Services Hadoop Beyond Hype: Complex Adaptive Systems Conference Nov 16, 2012 Viswa Sharma Solutions Architect Tata Consultancy Services 1 Agenda What is Hadoop Why Hadoop? The Net Generation is here Sizing the

More information

How To Handle Big Data With A Data Scientist

How To Handle Big Data With A Data Scientist III Big Data Technologies Today, new technologies make it possible to realize value from Big Data. Big data technologies can replace highly customized, expensive legacy systems with a standard solution

More information

TAMING THE BIG CHALLENGE OF BIG DATA MICROSOFT HADOOP

TAMING THE BIG CHALLENGE OF BIG DATA MICROSOFT HADOOP Pythian White Paper TAMING THE BIG CHALLENGE OF BIG DATA MICROSOFT HADOOP ABSTRACT As companies increasingly rely on big data to steer decisions, they also find themselves looking for ways to simplify

More information

Cost-Effective Business Intelligence with Red Hat and Open Source

Cost-Effective Business Intelligence with Red Hat and Open Source Cost-Effective Business Intelligence with Red Hat and Open Source Sherman Wood Director, Business Intelligence, Jaspersoft September 3, 2009 1 Agenda Introductions Quick survey What is BI?: reporting,

More information

SAP and Hortonworks Reference Architecture

SAP and Hortonworks Reference Architecture SAP and Hortonworks Reference Architecture Hortonworks. We Do Hadoop. June Page 1 2014 Hortonworks Inc. 2011 2014. All Rights Reserved A Modern Data Architecture With SAP DATA SYSTEMS APPLICATIO NS Statistical

More information

Course 20467: Designing Self-Service Business Intelligence and Big Data Solutions

Course 20467: Designing Self-Service Business Intelligence and Big Data Solutions Course 20467: Designing Self-Service Business Intelligence and Big Data Solutions Type:Course Audience(s):IT Professionals Technology:Microsoft SQL Server Level:300 This Revision:C Delivery method: Instructor-led

More information

Business Analytics In a Big Data World Ted Malone Solutions Architect Data Platform and Cloud Microsoft Federal

Business Analytics In a Big Data World Ted Malone Solutions Architect Data Platform and Cloud Microsoft Federal Business Analytics In a Big Data World Ted Malone Solutions Architect Data Platform and Cloud Microsoft Federal Information has gone from scarce to super-abundant. That brings huge new benefits. The Economist

More information

Integrating Hadoop. Into Business Intelligence & Data Warehousing. Philip Russom TDWI Research Director for Data Management, April 9 2013

Integrating Hadoop. Into Business Intelligence & Data Warehousing. Philip Russom TDWI Research Director for Data Management, April 9 2013 Integrating Hadoop Into Business Intelligence & Data Warehousing Philip Russom TDWI Research Director for Data Management, April 9 2013 TDWI would like to thank the following companies for sponsoring the

More information

SQL Server Parallel Data Warehouse: Architecture Overview. José Blakeley Database Systems Group, Microsoft Corporation

SQL Server Parallel Data Warehouse: Architecture Overview. José Blakeley Database Systems Group, Microsoft Corporation SQL Server Parallel Data Warehouse: Architecture Overview José Blakeley Database Systems Group, Microsoft Corporation Outline Motivation MPP DBMS system architecture HW and SW Key components Query processing

More information

Advanced In-Database Analytics

Advanced In-Database Analytics Advanced In-Database Analytics Tallinn, Sept. 25th, 2012 Mikko-Pekka Bertling, BDM Greenplum EMEA 1 That sounds complicated? 2 Who can tell me how best to solve this 3 What are the main mathematical functions??

More information

Executive Summary... 2 Introduction... 3. Defining Big Data... 3. The Importance of Big Data... 4 Building a Big Data Platform...

Executive Summary... 2 Introduction... 3. Defining Big Data... 3. The Importance of Big Data... 4 Building a Big Data Platform... Executive Summary... 2 Introduction... 3 Defining Big Data... 3 The Importance of Big Data... 4 Building a Big Data Platform... 5 Infrastructure Requirements... 5 Solution Spectrum... 6 Oracle s Big Data

More information

Deeper Insights across Data

Deeper Insights across Data Deeper Insights across Data Technical White Paper Published: June 2015 Applies to: SQL Server 2016 Summary: Data warehousing, analytics, and business intelligence must adapt to a whole new scope, scale,

More information

SQLSaturday #399 Sacramento 25 July, 2015. Big Data Analytics with Excel

SQLSaturday #399 Sacramento 25 July, 2015. Big Data Analytics with Excel SQLSaturday #399 Sacramento 25 July, 2015 Big Data Analytics with Excel Presenter Introduction Peter Myers Independent BI Expert Bitwise Solutions BBus, SQL Server MCSE, SQL Server MVP since 2007 Experienced

More information

Capitalize on Big Data for Competitive Advantage with Bedrock TM, an integrated Management Platform for Hadoop Data Lakes

Capitalize on Big Data for Competitive Advantage with Bedrock TM, an integrated Management Platform for Hadoop Data Lakes Capitalize on Big Data for Competitive Advantage with Bedrock TM, an integrated Management Platform for Hadoop Data Lakes Highly competitive enterprises are increasingly finding ways to maximize and accelerate

More information