EMC CUSTOMER UPDATE. 31 mei 2011 Fort Voordorp. Bart Sjerps. Greenplum Data Warehouse. Copyright 2011 EMC Corporation. All rights reserved.

Size: px
Start display at page:

Download "EMC CUSTOMER UPDATE. 31 mei 2011 Fort Voordorp. Bart Sjerps. Greenplum Data Warehouse. Copyright 2011 EMC Corporation. All rights reserved."

Transcription

1 EMC CUSTOMER UPDATE 31 mei 2011 Fort Voordorp Bart Sjerps Greenplum Data Warehouse 1

2 Introduction & Agenda What is Data warehousing? And what s Business Intelligence? Evolution in the Data Warehouse Business purpose Classic DWH architecture Present and future challenges EMC Solution Greenplum 2

3 What is a Data Warehouse? A Datawarehouse is not Vendor and consultant proclamations aside, a data warehouse is not: A project With a specific end date A product you buy from a vendor Like an ODS (such as SCT s) A canned warehouse supplied by istrategy Cognos ReportNet A database schema or instance Like Oracle Or SQL Server A cut-down version of your live transactional database According to Ralph Kimball and Joe Caserta, a data warehouse is: A system that extracts, cleans, conforms, and delivers source data into a dimensional data store and then supports and implements querying and analysis for the purpose of decision making. Another def.: The union of all the enterprise s data marts Aside: The Kimball model is not without some critics: E.g., Bill Inmon 3

4 History: From reports to advanced analytics Early days: run a simple report against the OLTP Database Run heavy batch reports against OLTP Database Dayly, weekly, monthly, year-end, adhoc Run custom queries against OLTP Database (using standard reporting tools) First use of what later became Business Intelligence, getting (market) knowledge from large amounts of information Offload databases for reporting and querying only Implemented as 1:1 copies, or custom designed databases (the first pure Data Warehouses) Need for Extract, Transform, Load tools (ETL) Evolved into OLAP (Online Analytical Processing); specialized methods for running Analytics This required special reporting tools as well Note: Running Batch and reporting on OLTP kills OLTP response time and performance 4

5 Classic vs. Next-gen business intelligence Old-style Datawarehousing: New style Datawarehousing: Frequently run reports/batch Built by programmers, optimized for performance and minimizing resource usage, requires huge developer and DBA efforts This is achieved by classic tuning such as using table indexes, partitioning, SQL optimization Very efficient but only for predictable queries Ad-hoc queries against OLTP data Can kill OLTP service levels, therefore this is often offloaded against prod database copy Optimizing using tricks such as materialized views Classic tuning fails (because it s unpredictable) DWH misused for pieces of business process Now mission-critical! Consider HA / DR / Compliancy Does not replace classic DWH! Get as much data from as many sources as possible Web, data feeds, legacy systems, smart electronics, etc etc Clean it up and modify it for analytics using ETL tools This is very resource intensive and typically requires long processing times Loading in the DWH can be problematic Classic DB systems again use workarounds for speeding it up Data needs to be as up-to-date as possible (less than 24 hours old) Build multi-dimensional databases That can have holes with missing data Build specialized data-marts Optimized by purpose Contains sub-set of all data 5

6 DW/BI purpose Risk Management Credit risk, Operational risk, Market risk Financial Analysis Customer credit, Cash inflow, Key financial ratios/performance Fraud Detection Internal fraud, External fraud Compliance Data integration, Reporting, Audit Customer Intelligence Behavior analysis, Spend & Value analysis, Portfolio analysis, Scorecard and Rating applications Performance Management Analysis of business methodologies, metrics, processes and systems 6

7 Classic Data Warehouse Architecture Data Sources CRM ERP Supply Chain Web Financial Systems Call Center Extract, Transform, Load (ETL) Fast Loading Staging Staging DWH Elastic Scale Data Mart Data Mart Data Mart Data Mart Reports ETL Data Warehouse Data Analysis 7

8 Datamart Sprawl EDW ~10 % of data Data is everywhere and growing 44X data growth by s of data marts Shadow databases Critical business insight is outside EDW Centralized legacy systems are expensive Data marts and personal databases ~90% of data System expansion is slow and process heavy Proprietary HW systems lag behind open systems innovation Traditional solutions cannot scale to meet the DW/BI challenges 8

9 Business Intelligence Challenges (1) Related to Infrastructure Higher service levels DWH not allowed to be down for a few days Need for backup/recovery/dr No SPOF, high-availability architecture Don t forget security, auditing, compliancy, data leakage prevention, customer privacy considerations (think Facebook and Google) Massive growth According to research firms, unstructured data will be biggest growth factor for companies Business Intelligence is #2 Soon we will see datawarehouses 100 s of Terabytes in size (And the first Petabyte customers) Business people want to store more and more in the DWH 9

10 Business Intelligence Challenges (2) Related to Infrastructure Loading time DWH needs to have up-to-date info Load times of multiple days is simply no longer acceptable 24H is max (for the whole process, not just loading) Long term, drive to real-time (ouch!) Scan time (how long does it take to run a query) More data More impatient end users More ad-hoc queries Cannot optimize this anymore with classic SQL tuning and database tricks & magic 10

11 Business Intelligence Challenges (3) Related to Infrastructure And finally New paradigms Multi-dimensional OLAP databases In-memory statistical calculations Needs to load a data subset in memory real quick Web users accessing BI data Of course, through web applications Massive scale-up in # of parallel transactions 11

12 What Takes Up the Most Time? You may be surprised to learn what DW step takes the most time Try guessing which: Hardware 30 Physical database setup Database design ETL OLAP setup st Qtr 2nd Qtr 3rd Qtr 4th Qtr Hardware East Database ETL West Schemas North OLAP tools Acc. to Kimball & Caserta, ETL will eat up 70% of the time. Other analysts give estimates ranging from 50% to 80%. The most often underestimated part of the warehouse project! 12

13 Data In. Decisions Out. 13

14 Database sizes over the years: -1996: 11 TB, in a Teradata Database -1999: 130 TB, in a Teradata Database - May 2008: 2 PB, Yahoo 14

15 It s Time for a Change... Yesterday s Data Warehouse and Analytic Infrastructure Proprietary Expensive Centralized, Monolithic Process-Heavy Batch Summarized Slow The Greenplum Future Commodity Cost-Effective Distributed Self-Service Real-Time Deep Agile

16 Greenplum True market disruption 20 Terabytes 20 kw, 8 Racks $20M 70 Terabytes 20 kw, 6 Racks $7M 100 Terabytes 12 kw, 2 Racks $1.8M 16

17 Industry Recognition: 2009 Gartner Magic Quadrant Gartner: Strengths Scale Mixed workloads Cloud ready Self service Low cost Concerns Company size Fixed by EMC R&D budget Fixed by EMC 17 *2007 was our first year on the MQ Source: Gartner (January 2010)

18 Greenplum Database Data in. Decisions out. Fastest Data Loading Advanced analytics Data in Scatter/Gather Streaming for the world s fastest data loading Eliminate data load bottlenecks Clean and integrate new data Several loading options ranging from bulk load updates to microbatching for near real-time processing In Database Analytics Optimized for fast query execution and linear scalability Move processing closer to data Shared nothing MPP scale-out architecture Computing is automatically optimized and distributed across resources Provides the best concurrent multi-workload performance Decisions out Unified data access for greater insight and value from data Enable parallel analysis across the enterprise Open platform with broad language support Certified enterprise connectivity and integration with most BI, ETL and management products 18

19 Greenplum Database Architecture Overview 19 19

20 Deployment models Greenplum Community Edition Free downloadable Limited to 2 segment servers All software is enabled Greenplum Software Only I.e. run on Vsphere / Vblock Or on standard (Intel) servers Greenplum DCD Appliance Pre-configured, tested, supported, plug & play Huge bandwidth DCD Appliance hybrid DAS / SAN 20

21 Architecture of Greenplum DCA Flexible framework for processing large datasets Process large datasets with support for both SQL and MapReduce etc SQL MapReduce UDF s: R,Java,C,Python,Perl ODBC, JDBC, OLEDB BI/ETL Tools Master servers optimize queries for the most efficient query execution Master Master Interconnect for continuous pipelining of data processing Segment servers process queries close to the data in parallel Segment Segment Segment Segment Segment MPP Scatter/Gather streaming for fast loading of data 21

22 Architecture Based on PostgreSQL (open source) database 15+ years of development Feature-rich, mission critical-ready Greenplum adds features on top of PostgreSQL Very low development cost (compared to traditional RDBMS vendors) Linear Scale-out Parallel loading Not depending on classic (OLTP) RDBMS tricks Special indexes, materialized views, 22

23 Greenplum vs OLTP DB architecture Greenplum Shared nothing master aggregates data Scale out only limited by master servers Optimized for Analytical processing As few indexes as possible (you don t know what you need anyway) Divide & Conquer Limited locking mechanisms -> no OLTP 23 OLTP Shared everything Cache fusion Scale out limited to ~ 4 nodes (if lucky) Optimized for transaction processing (OLTP) Highly optimized indexes - Extensively tuned for known queries Distribute workload (if possible) Full locking -> OLTP optimized

24 Greenplum DCA Instant Price/Performance Leadership EMC Greenplum DCA (1 rack) Oracle Exadata X2-8 (Full Rack) Netezza TwinFin 12 (Full Rack) Teradata 2580 (Full Rack) Architecture MPP Shared-Nothing MPP Shared-Disk MPP Shared-Nothing MPP Shared-Nothing Servers 16 Cores Scan (GB/s) w/o compression 192 Intel E5670 (2.93 GHz) 2 DB 14 Storage 128 DB 112 Storage Intel X Intel E5460 (3.16GHz) + 96 FPGA (2.26 GHz) + 96 FPGA 32 Intel Nehalem (2.66 GHz) 24 GB/Sec 25 GB/Sec 10 GB/Sec 10 GB/Sec Load (TB/Hr) >10TB/Hr 5TB/Hr 2 TB/Hr TBD Capacity (TB) Usable w/o Compression Capacity (TB) Usable w/ Compression 36 TB (600GB) 28 TB (600 GB) 32 TB 15 TB (450 GB) 144 TB 112 TB 128 TB 20 TB Largest Multi-Rack Configuration Max DB Cores Multi-rack Configuration 24 racks 8 racks 10 racks 10 racks

25 Greenplum 4.0: Self-Healing Fault-Tolerance Master Servers Network interconnect Segment Servers Master Servers 1. Segment server fails 2. Mirror segments take over, with no loss of service 3. Segment server is restored or replaced 4. Mirror segments restore primary via differential recovery (while online) Greenplum Database 4.0 enhances fault tolerance using a self-healing physical block replication architecture In Sync Key benefits of this architecture are: Resync ing Fast differential recovery and catch-up (always read-write) Change Tracking Improved write performance and reduced network load 25

26 Greenplum Data Computing Appliance Performance, Scalability, Reliability and Reduced TCO for DW/BI Environments Extreme Performance Optimized for fast query execution and unmatched data loading Rapidly Deployable Purpose-build data warehousing appliance Highly Available Self healing and fully redundant Elastic Scalability Expand capacity and performance online 26 Reduced TCO Consolidate Data Marts for lower costs Private Cloud Ready Data and computing are automatically optimized and distributed Advanced Backup and DR Leverage industry-leading Data Domain backup and recovery

27 Greenplum DCA Available Configurations Half rack (GP100) and Full rack (GP1000) GP100 GP1000 Expansion Bus 8 X Segment Servers Interconnect Bus 2X Master Servers 8 X Segment Servers 27

28 Greenplum DCA Specifications Half rack (GP100) and Full rack (GP1000) GP100 - Half Rack GP Full Rack Master Servers 2 Master Servers 2 Master Servers Segment Servers 8 Segment Servers 16 Segment Servers Memory per Server 48 GB 48 GB Total Memory 384 GB 768 GB Segment HDD s (SAS) Useable Capacity (uncompressed) Useable Capacity (compressed) 18 TB 36 TB 72 TB 144 TB Scan Rate 12 GB/Sec 24 GB/Sec Data Load Rate 5TB/Hour 10TB/Hour 28

29 Customers by Industry Financial Services Telco Media & Internet Retail Gov t & Health/Ins. 29

30 Overall DCD Partner Network and growing Hardware Vendors BI / ETL Tools Solutions & OEM Consultants And Resellers 30

31 Enabling Business Intelligence Through Virtual Enterprise Data Warehousing 31

Advanced In-Database Analytics

Advanced In-Database Analytics Advanced In-Database Analytics Tallinn, Sept. 25th, 2012 Mikko-Pekka Bertling, BDM Greenplum EMEA 1 That sounds complicated? 2 Who can tell me how best to solve this 3 What are the main mathematical functions??

More information

Big Data Analytics. with EMC Greenplum and Hadoop. Big Data Analytics. Ofir Manor Pre Sales Technical Architect EMC Greenplum

Big Data Analytics. with EMC Greenplum and Hadoop. Big Data Analytics. Ofir Manor Pre Sales Technical Architect EMC Greenplum Big Data Analytics with EMC Greenplum and Hadoop Big Data Analytics with EMC Greenplum and Hadoop Ofir Manor Pre Sales Technical Architect EMC Greenplum 1 Big Data and the Data Warehouse Potential All

More information

EMC/Greenplum Driving the Future of Data Warehousing and Analytics

EMC/Greenplum Driving the Future of Data Warehousing and Analytics EMC/Greenplum Driving the Future of Data Warehousing and Analytics EMC 2010 Forum Series 1 Greenplum Becomes the Foundation of EMC s Data Computing Division E M C A CQ U I R E S G R E E N P L U M Greenplum,

More information

Emerging Technologies Shaping the Future of Data Warehouses & Business Intelligence

Emerging Technologies Shaping the Future of Data Warehouses & Business Intelligence Emerging Technologies Shaping the Future of Data Warehouses & Business Intelligence Appliances and DW Architectures John O Brien President and Executive Architect Zukeran Technologies 1 TDWI 1 Agenda What

More information

2009 Oracle Corporation 1

2009 Oracle Corporation 1 The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into any contract. It is not a commitment to deliver any material,

More information

Inge Os Sales Consulting Manager Oracle Norway

Inge Os Sales Consulting Manager Oracle Norway Inge Os Sales Consulting Manager Oracle Norway Agenda Oracle Fusion Middelware Oracle Database 11GR2 Oracle Database Machine Oracle & Sun Agenda Oracle Fusion Middelware Oracle Database 11GR2 Oracle Database

More information

Main Memory Data Warehouses

Main Memory Data Warehouses Main Memory Data Warehouses Robert Wrembel Poznan University of Technology Institute of Computing Science Robert.Wrembel@cs.put.poznan.pl www.cs.put.poznan.pl/rwrembel Lecture outline Teradata Data Warehouse

More information

BIG DATA APPLIANCES. July 23, TDWI. R Sathyanarayana. Enterprise Information Management & Analytics Practice EMC Consulting

BIG DATA APPLIANCES. July 23, TDWI. R Sathyanarayana. Enterprise Information Management & Analytics Practice EMC Consulting BIG DATA APPLIANCES July 23, TDWI R Sathyanarayana Enterprise Information Management & Analytics Practice EMC Consulting 1 Big data are datasets that grow so large that they become awkward to work with

More information

SAP Real-time Data Platform. April 2013

SAP Real-time Data Platform. April 2013 SAP Real-time Data Platform April 2013 Agenda Introduction SAP Real Time Data Platform Overview SAP Sybase ASE SAP Sybase IQ SAP EIM Questions and Answers 2012 SAP AG. All rights reserved. 2 Introduction

More information

EMC GREENPLUM DATABASE

EMC GREENPLUM DATABASE EMC GREENPLUM DATABASE Driving the future of data warehousing and analytics Essentials A shared-nothing, massively parallel processing (MPP) architecture supports extreme performance on commodity infrastructure

More information

Data Warehousing. Jens Teubner, TU Dortmund jens.teubner@cs.tu-dortmund.de. Winter 2015/16. Jens Teubner Data Warehousing Winter 2015/16 1

Data Warehousing. Jens Teubner, TU Dortmund jens.teubner@cs.tu-dortmund.de. Winter 2015/16. Jens Teubner Data Warehousing Winter 2015/16 1 Jens Teubner Data Warehousing Winter 2015/16 1 Data Warehousing Jens Teubner, TU Dortmund jens.teubner@cs.tu-dortmund.de Winter 2015/16 Jens Teubner Data Warehousing Winter 2015/16 13 Part II Overview

More information

Using Attunity Replicate with Greenplum Database Using Attunity Replicate for data migration and Change Data Capture to the Greenplum Database

Using Attunity Replicate with Greenplum Database Using Attunity Replicate for data migration and Change Data Capture to the Greenplum Database White Paper Using Attunity Replicate with Greenplum Database Using Attunity Replicate for data migration and Change Data Capture to the Greenplum Database Abstract This white paper explores the technology

More information

W H I T E P A P E R. Deriving Intelligence from Large Data Using Hadoop and Applying Analytics. Abstract

W H I T E P A P E R. Deriving Intelligence from Large Data Using Hadoop and Applying Analytics. Abstract W H I T E P A P E R Deriving Intelligence from Large Data Using Hadoop and Applying Analytics Abstract This white paper is focused on discussing the challenges facing large scale data processing and the

More information

News and trends in Data Warehouse Automation, Big Data and BI. Johan Hendrickx & Dirk Vermeiren

News and trends in Data Warehouse Automation, Big Data and BI. Johan Hendrickx & Dirk Vermeiren News and trends in Data Warehouse Automation, Big Data and BI Johan Hendrickx & Dirk Vermeiren Extreme Agility from Source to Analysis DWH Appliances & DWH Automation Typical Architecture 3 What Business

More information

Big Data Technologies Compared June 2014

Big Data Technologies Compared June 2014 Big Data Technologies Compared June 2014 Agenda What is Big Data Big Data Technology Comparison Summary Other Big Data Technologies Questions 2 What is Big Data by Example The SKA Telescope is a new development

More information

EMC BACKUP MEETS BIG DATA

EMC BACKUP MEETS BIG DATA EMC BACKUP MEETS BIG DATA Strategies To Protect Greenplum, Isilon And Teradata Systems 1 Agenda Big Data: Overview, Backup and Recovery EMC Big Data Backup Strategy EMC Backup and Recovery Solutions for

More information

Greenplum Database. Getting Started with Big Data Analytics. Ofir Manor Pre Sales Technical Architect, EMC Greenplum

Greenplum Database. Getting Started with Big Data Analytics. Ofir Manor Pre Sales Technical Architect, EMC Greenplum Greenplum Database Getting Started with Big Data Analytics Ofir Manor Pre Sales Technical Architect, EMC Greenplum 1 Agenda Introduction to Greenplum Greenplum Database Architecture Flexible Database Configuration

More information

Managing Big Data with Hadoop & Vertica. A look at integration between the Cloudera distribution for Hadoop and the Vertica Analytic Database

Managing Big Data with Hadoop & Vertica. A look at integration between the Cloudera distribution for Hadoop and the Vertica Analytic Database Managing Big Data with Hadoop & Vertica A look at integration between the Cloudera distribution for Hadoop and the Vertica Analytic Database Copyright Vertica Systems, Inc. October 2009 Cloudera and Vertica

More information

OBIEE 11g Analytics Using EMC Greenplum Database

OBIEE 11g Analytics Using EMC Greenplum Database White Paper OBIEE 11g Analytics Using EMC Greenplum Database - An Integration guide for OBIEE 11g Windows Users Abstract This white paper explains how OBIEE Analytics Business Intelligence Tool can be

More information

Introducing Oracle Exalytics In-Memory Machine

Introducing Oracle Exalytics In-Memory Machine Introducing Oracle Exalytics In-Memory Machine Jon Ainsworth Director of Business Development Oracle EMEA Business Analytics 1 Copyright 2011, Oracle and/or its affiliates. All rights Agenda Topics Oracle

More information

Overview: X5 Generation Database Machines

Overview: X5 Generation Database Machines Overview: X5 Generation Database Machines Spend Less by Doing More Spend Less by Paying Less Rob Kolb Exadata X5-2 Exadata X4-8 SuperCluster T5-8 SuperCluster M6-32 Big Memory Machine Oracle Exadata Database

More information

MASSIVEDATANEWS. Load and Go: Fast Data Loading with the Greenplum Data Computing Appliance (DCA)

MASSIVEDATANEWS. Load and Go: Fast Data Loading with the Greenplum Data Computing Appliance (DCA) Greenplum Data Computing Appliance (DCA) Introduction: Why Fast and Flexible Data Loading Matters Data loading is the beginning of the entire analytics process. Everything starts by getting data into the

More information

Oracle Exadata Database Machine for SAP Systems - Innovation Provided by SAP and Oracle for Joint Customers

Oracle Exadata Database Machine for SAP Systems - Innovation Provided by SAP and Oracle for Joint Customers Oracle Exadata Database Machine for SAP Systems - Innovation Provided by SAP and Oracle for Joint Customers Masood Ahmed EMEA Infrastructure Solutions Oracle/SAP Relationship Overview First SAP R/3 release

More information

Architecting for Big Data Analytics and Beyond: A New Framework for Business Intelligence and Data Warehousing

Architecting for Big Data Analytics and Beyond: A New Framework for Business Intelligence and Data Warehousing Architecting for Big Data Analytics and Beyond: A New Framework for Business Intelligence and Data Warehousing Wayne W. Eckerson Director of Research, TechTarget Founder, BI Leadership Forum Business Analytics

More information

Poslovni slučajevi upotrebe IBM Netezze

Poslovni slučajevi upotrebe IBM Netezze Poslovni slučajevi upotrebe IBM Netezze data at the Speed and with Simplicity businesses need 25. ožujak 2015. vedran.travica@hr.ibm.com Agenda A. IBM PureData for Analytics Netezza B. Scenarij 1.: Novi

More information

Oracle Exadata: The World s Fastest Database Machine Exadata Database Machine Architecture

Oracle Exadata: The World s Fastest Database Machine Exadata Database Machine Architecture Oracle Exadata: The World s Fastest Database Machine Exadata Database Machine Architecture Ron Weiss, Exadata Product Management Exadata Database Machine Best Platform to Run the

More information

Parallel Data Warehouse

Parallel Data Warehouse MICROSOFT S ANALYTICS SOLUTIONS WITH PARALLEL DATA WAREHOUSE Parallel Data Warehouse Stefan Cronjaeger Microsoft May 2013 AGENDA PDW overview Columnstore and Big Data Business Intellignece Project Ability

More information

Applied Business Intelligence. Iakovos Motakis, Ph.D. Director, DW & Decision Support Systems Intrasoft SA

Applied Business Intelligence. Iakovos Motakis, Ph.D. Director, DW & Decision Support Systems Intrasoft SA Applied Business Intelligence Iakovos Motakis, Ph.D. Director, DW & Decision Support Systems Intrasoft SA Agenda Business Drivers and Perspectives Technology & Analytical Applications Trends Challenges

More information

Netezza and Business Analytics Synergy

Netezza and Business Analytics Synergy Netezza Business Partner Update: November 17, 2011 Netezza and Business Analytics Synergy Shimon Nir, IBM Agenda Business Analytics / Netezza Synergy Overview Netezza overview Enabling the Business with

More information

Investor Presentation. Second Quarter 2015

Investor Presentation. Second Quarter 2015 Investor Presentation Second Quarter 2015 Note to Investors Certain non-gaap financial information regarding operating results may be discussed during this presentation. Reconciliations of the differences

More information

QlikView Business Discovery Platform. Algol Consulting Srl

QlikView Business Discovery Platform. Algol Consulting Srl QlikView Business Discovery Platform Algol Consulting Srl Business Discovery Applications Application vs. Platform Application Designed to help people perform an activity Platform Provides infrastructure

More information

The Enterprise Data Hub and The Modern Information Architecture

The Enterprise Data Hub and The Modern Information Architecture The Enterprise Data Hub and The Modern Information Architecture Dr. Amr Awadallah CTO & Co-Founder, Cloudera Twitter: @awadallah 1 2013 Cloudera, Inc. All rights reserved. Cloudera Overview The Leader

More information

IBM Netezza High Capacity Appliance

IBM Netezza High Capacity Appliance IBM Netezza High Capacity Appliance Petascale Data Archival, Analysis and Disaster Recovery Solutions IBM Netezza High Capacity Appliance Highlights: Allows querying and analysis of deep archival data

More information

Big Data & QlikView. Democratizing Big Data Analytics. David Freriks Principal Solution Architect

Big Data & QlikView. Democratizing Big Data Analytics. David Freriks Principal Solution Architect Big Data & QlikView Democratizing Big Data Analytics David Freriks Principal Solution Architect TDWI Vancouver Agenda What really is Big Data? How do we separate hype from reality? How does that relate

More information

How to make BIG DATA work for you. Faster results with Microsoft SQL Server PDW

How to make BIG DATA work for you. Faster results with Microsoft SQL Server PDW How to make BIG DATA work for you. Faster results with Microsoft SQL Server PDW Roger Breu PDW Solution Specialist Microsoft Western Europe Marcus Gullberg PDW Partner Account Manager Microsoft Sweden

More information

James Serra Sr BI Architect JamesSerra3@gmail.com http://jamesserra.com/

James Serra Sr BI Architect JamesSerra3@gmail.com http://jamesserra.com/ James Serra Sr BI Architect JamesSerra3@gmail.com http://jamesserra.com/ Our Focus: Microsoft Pure-Play Data Warehousing & Business Intelligence Partner Our Customers: Our Reputation: "B.I. Voyage came

More information

How To Build An Exadata Database Machine X2-8 Full Rack For A Large Database Server

How To Build An Exadata Database Machine X2-8 Full Rack For A Large Database Server Oracle Exadata Database Machine Overview Exadata Database Machine Best Platform to Run the Oracle Database Best Machine for Data Warehousing Best Machine for OLTP Best Machine for

More information

Cost-Effective Business Intelligence with Red Hat and Open Source

Cost-Effective Business Intelligence with Red Hat and Open Source Cost-Effective Business Intelligence with Red Hat and Open Source Sherman Wood Director, Business Intelligence, Jaspersoft September 3, 2009 1 Agenda Introductions Quick survey What is BI?: reporting,

More information

An Integrated Analytics & Big Data Infrastructure September 21, 2012 Robert Stackowiak, Vice President Data Systems Architecture Oracle Enterprise

An Integrated Analytics & Big Data Infrastructure September 21, 2012 Robert Stackowiak, Vice President Data Systems Architecture Oracle Enterprise An Integrated Analytics & Big Data Infrastructure September 21, 2012 Robert Stackowiak, Vice President Data Systems Architecture Oracle Enterprise Solutions Group The following is intended to outline our

More information

BIG DATA CAN DRIVE THE BUSINESS AND IT TO EVOLVE AND ADAPT RALPH KIMBALL BUSSUM 2014

BIG DATA CAN DRIVE THE BUSINESS AND IT TO EVOLVE AND ADAPT RALPH KIMBALL BUSSUM 2014 BIG DATA CAN DRIVE THE BUSINESS AND IT TO EVOLVE AND ADAPT RALPH KIMBALL BUSSUM 2014 Ralph Kimball Associates 2014 The Data Warehouse Mission Identify all possible enterprise data assets Select those assets

More information

Integrated Grid Solutions. and Greenplum

Integrated Grid Solutions. and Greenplum EMC Perspective Integrated Grid Solutions from SAS, EMC Isilon and Greenplum Introduction Intensifying competitive pressure and vast growth in the capabilities of analytic computing platforms are driving

More information

SAP HANA In-Memory in Virtualized Data Centers. Arne Arnold, SAP HANA Product Management January 2013

SAP HANA In-Memory in Virtualized Data Centers. Arne Arnold, SAP HANA Product Management January 2013 SAP HANA In-Memory in Virtualized Data Centers Arne Arnold, SAP HANA Product Management January 2013 Agenda Virtualization In-Memory Pros & Cons with In-Memory Computing Virtualized SAP HANA Platform What

More information

Oracle Database In-Memory The Next Big Thing

Oracle Database In-Memory The Next Big Thing Oracle Database In-Memory The Next Big Thing Maria Colgan Master Product Manager #DBIM12c Why is Oracle do this Oracle Database In-Memory Goals Real Time Analytics Accelerate Mixed Workload OLTP No Changes

More information

Collaborative Big Data Analytics. Copyright 2012 EMC Corporation. All rights reserved.

Collaborative Big Data Analytics. Copyright 2012 EMC Corporation. All rights reserved. Collaborative Big Data Analytics 1 Big Data Is Less About Size, And More About Freedom TechCrunch!!!!!!!!! Total data: bigger than big data 451 Group Findings: Big Data Is More Extreme Than Volume Gartner!!!!!!!!!!!!!!!

More information

I/O Considerations in Big Data Analytics

I/O Considerations in Big Data Analytics Library of Congress I/O Considerations in Big Data Analytics 26 September 2011 Marshall Presser Federal Field CTO EMC, Data Computing Division 1 Paradigms in Big Data Structured (relational) data Very

More information

The HP Neoview data warehousing platform for business intelligence Die clevere Alternative

The HP Neoview data warehousing platform for business intelligence Die clevere Alternative The HP Neoview data warehousing platform for business intelligence Die clevere Alternative Ronald Wulff EMEA, BI Solution Architect HP Software - Neoview 2006 Hewlett-Packard Development Company, L.P.

More information

NextGen Infrastructure for Big DATA Analytics.

NextGen Infrastructure for Big DATA Analytics. NextGen Infrastructure for Big DATA Analytics. So What is Big Data? Data that exceeds the processing capacity of conven4onal database systems. The data is too big, moves too fast, or doesn t fit the structures

More information

Einsatzfelder von IBM PureData Systems und Ihre Vorteile.

Einsatzfelder von IBM PureData Systems und Ihre Vorteile. Einsatzfelder von IBM PureData Systems und Ihre Vorteile demirkaya@de.ibm.com Agenda Information technology challenges PureSystems and PureData introduction PureData for Transactions PureData for Analytics

More information

Preview of Oracle Database 12c In-Memory Option. Copyright 2013, Oracle and/or its affiliates. All rights reserved.

Preview of Oracle Database 12c In-Memory Option. Copyright 2013, Oracle and/or its affiliates. All rights reserved. Preview of Oracle Database 12c In-Memory Option 1 The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into any

More information

<Insert Picture Here> Oracle Exadata Database Machine Overview

<Insert Picture Here> Oracle Exadata Database Machine Overview Oracle Exadata Database Machine Overview Exadata Database Machine Best Platform to Run the Oracle Database Best Machine for Data Warehousing Best Machine for OLTP Best Machine for

More information

Datenverwaltung im Wandel - Building an Enterprise Data Hub with

Datenverwaltung im Wandel - Building an Enterprise Data Hub with Datenverwaltung im Wandel - Building an Enterprise Data Hub with Cloudera Bernard Doering Regional Director, Central EMEA, Cloudera Cloudera Your Hadoop Experts Founded 2008, by former employees of Employees

More information

BIG DATA TRENDS AND TECHNOLOGIES

BIG DATA TRENDS AND TECHNOLOGIES BIG DATA TRENDS AND TECHNOLOGIES THE WORLD OF DATA IS CHANGING Cloud WHAT IS BIG DATA? Big data are datasets that grow so large that they become awkward to work with using onhand database management tools.

More information

SAS and Oracle: Big Data and Cloud Partnering Innovation Targets the Third Platform

SAS and Oracle: Big Data and Cloud Partnering Innovation Targets the Third Platform SAS and Oracle: Big Data and Cloud Partnering Innovation Targets the Third Platform David Lawler, Oracle Senior Vice President, Product Management and Strategy Paul Kent, SAS Vice President, Big Data What

More information

Big Data and Its Impact on the Data Warehousing Architecture

Big Data and Its Impact on the Data Warehousing Architecture Big Data and Its Impact on the Data Warehousing Architecture Sponsored by SAP Speaker: Wayne Eckerson, Director of Research, TechTarget Wayne Eckerson: Hi my name is Wayne Eckerson, I am Director of Research

More information

How Transactional Analytics is Changing the Future of Business A look at the options, use cases, and anti-patterns

How Transactional Analytics is Changing the Future of Business A look at the options, use cases, and anti-patterns How Transactional Analytics is Changing the Future of Business A look at the options, use cases, and anti-patterns Table of Contents Abstract... 3 Introduction... 3 Definition... 3 The Expanding Digitization

More information

SQL Server PDW. Artur Vieira Premier Field Engineer

SQL Server PDW. Artur Vieira Premier Field Engineer SQL Server PDW Artur Vieira Premier Field Engineer Agenda 1 Introduction to MPP and PDW 2 PDW Architecture and Components 3 Data Structures 4 PDW Tools Data Load / Data Output / Administrative Console

More information

ORACLE BUSINESS INTELLIGENCE, ORACLE DATABASE, AND EXADATA INTEGRATION

ORACLE BUSINESS INTELLIGENCE, ORACLE DATABASE, AND EXADATA INTEGRATION ORACLE BUSINESS INTELLIGENCE, ORACLE DATABASE, AND EXADATA INTEGRATION EXECUTIVE SUMMARY Oracle business intelligence solutions are complete, open, and integrated. Key components of Oracle business intelligence

More information

HADOOP SOLUTION USING EMC ISILON AND CLOUDERA ENTERPRISE Efficient, Flexible In-Place Hadoop Analytics

HADOOP SOLUTION USING EMC ISILON AND CLOUDERA ENTERPRISE Efficient, Flexible In-Place Hadoop Analytics HADOOP SOLUTION USING EMC ISILON AND CLOUDERA ENTERPRISE Efficient, Flexible In-Place Hadoop Analytics ESSENTIALS EMC ISILON Use the industry's first and only scale-out NAS solution with native Hadoop

More information

Introduction to Database as a Service

Introduction to Database as a Service Introduction to Database as a Service Exadata Platform Revised 8/1/13 Database as a Service (DBaaS) Starts With The Business Needs Establish an IT delivery model that reduces costs, meets demand, and fulfills

More information

Next-Generation Cloud Analytics with Amazon Redshift

Next-Generation Cloud Analytics with Amazon Redshift Next-Generation Cloud Analytics with Amazon Redshift What s inside Introduction Why Amazon Redshift is Great for Analytics Cloud Data Warehousing Strategies for Relational Databases Analyzing Fast, Transactional

More information

Next Generation Data Warehousing Appliances 23.10.2014

Next Generation Data Warehousing Appliances 23.10.2014 Next Generation Data Warehousing Appliances 23.10.2014 Presentert av: Espen Jorde, Executive Advisor Bjørn Runar Nes, CTO/Chief Architect Bjørn Runar Nes Espen Jorde 2 3.12.2014 Agenda Affecto s new Data

More information

Dell Microsoft Business Intelligence and Data Warehousing Reference Configuration Performance Results Phase III

Dell Microsoft Business Intelligence and Data Warehousing Reference Configuration Performance Results Phase III White Paper Dell Microsoft Business Intelligence and Data Warehousing Reference Configuration Performance Results Phase III Performance of Microsoft SQL Server 2008 BI and D/W Solutions on Dell PowerEdge

More information

AGENDA. What is BIG DATA? What is Hadoop? Why Microsoft? The Microsoft BIG DATA story. Our BIG DATA Roadmap. Hadoop PDW

AGENDA. What is BIG DATA? What is Hadoop? Why Microsoft? The Microsoft BIG DATA story. Our BIG DATA Roadmap. Hadoop PDW AGENDA What is BIG DATA? What is Hadoop? Why Microsoft? The Microsoft BIG DATA story Hadoop PDW Our BIG DATA Roadmap BIG DATA? Volume 59% growth in annual WW information 1.2M Zetabytes (10 21 bytes) this

More information

Oracle s Big Data solutions. Roger Wullschleger. <Insert Picture Here>

Oracle s Big Data solutions. Roger Wullschleger. <Insert Picture Here> s Big Data solutions Roger Wullschleger DBTA Workshop on Big Data, Cloud Data Management and NoSQL 10. October 2012, Stade de Suisse, Berne 1 The following is intended to outline

More information

Introduction to Oracle Business Intelligence Standard Edition One. Mike Donohue Senior Manager, Product Management Oracle Business Intelligence

Introduction to Oracle Business Intelligence Standard Edition One. Mike Donohue Senior Manager, Product Management Oracle Business Intelligence Introduction to Oracle Business Intelligence Standard Edition One Mike Donohue Senior Manager, Product Management Oracle Business Intelligence The following is intended to outline our general product direction.

More information

The Future of Data Management

The Future of Data Management The Future of Data Management with Hadoop and the Enterprise Data Hub Amr Awadallah (@awadallah) Cofounder and CTO Cloudera Snapshot Founded 2008, by former employees of Employees Today ~ 800 World Class

More information

SAP HANA Reinventing Real-Time Businesses through Innovation, Value & Simplicity. Eduardo Rodrigues October 2013

SAP HANA Reinventing Real-Time Businesses through Innovation, Value & Simplicity. Eduardo Rodrigues October 2013 Reinventing Real-Time Businesses through Innovation, Value & Simplicity Eduardo Rodrigues October 2013 Agenda The Existing Data Management Conundrum Innovations Transformational Impact at Customers Summary

More information

Database Decisions: Performance, manageability and availability considerations in choosing a database

Database Decisions: Performance, manageability and availability considerations in choosing a database Database Decisions: Performance, manageability and availability considerations in choosing a database Reviewing offerings from Oracle, IBM and Microsoft 2012 Oracle and TechTarget Table of Contents Defining

More information

Tap into Hadoop and Other No SQL Sources

Tap into Hadoop and Other No SQL Sources Tap into Hadoop and Other No SQL Sources Presented by: Trishla Maru What is Big Data really? The Three Vs of Big Data According to Gartner Volume Volume Orders of magnitude bigger than conventional data

More information

SQL Server 2012 Parallel Data Warehouse. Solution Brief

SQL Server 2012 Parallel Data Warehouse. Solution Brief SQL Server 2012 Parallel Data Warehouse Solution Brief Published February 22, 2013 Contents Introduction... 1 Microsoft Platform: Windows Server and SQL Server... 2 SQL Server 2012 Parallel Data Warehouse...

More information

Microsoft Analytics Platform System. Solution Brief

Microsoft Analytics Platform System. Solution Brief Microsoft Analytics Platform System Solution Brief Contents 4 Introduction 4 Microsoft Analytics Platform System 5 Enterprise-ready Big Data 7 Next-generation performance at scale 10 Engineered for optimal

More information

How To Use Exadata

How To Use Exadata Exadata V2 - Oracle Exadata Database Machine Robert Pastijn Platform Technology Services (PTS) Product Development 2010 Oracle Corporation Exadata V2 Goals Ideal Database Platform

More information

Green Migration from Oracle

Green Migration from Oracle Green Migration from Oracle Greenplum Migration Approach Strong Experiences on Oracle Migration Automate all tasks DDL Migration Data Migration PL-SQL and SQL Scripts Migration Data Quality Tests ETL and

More information

Please give me your feedback

Please give me your feedback Please give me your feedback Session BB4089 Speaker Claude Lorenson, Ph. D and Wendy Harms Use the mobile app to complete a session survey 1. Access My schedule 2. Click on this session 3. Go to Rate &

More information

5 Keys to Unlocking the Big Data Analytics Puzzle. Anurag Tandon Director, Product Marketing March 26, 2014

5 Keys to Unlocking the Big Data Analytics Puzzle. Anurag Tandon Director, Product Marketing March 26, 2014 5 Keys to Unlocking the Big Data Analytics Puzzle Anurag Tandon Director, Product Marketing March 26, 2014 1 A Little About Us A global footprint. A proven innovator. A leader in enterprise analytics for

More information

Evolving Solutions Disruptive Technology Series Modern Data Warehouse

Evolving Solutions Disruptive Technology Series Modern Data Warehouse Evolving Solutions Disruptive Technology Series Modern Data Warehouse Presenter Kumar Kannankutty Big Data Platform Technical Sales Leader Host - Michael Downs, Solution Architect, Evolving Solutions www.evolvingsol.com

More information

Data Warehousing on System z BWDB2UG Presentation 9/12/07 v

Data Warehousing on System z BWDB2UG Presentation 9/12/07 v Data Warehousing on System z BWDB2UG Presentation 9/12/07 v John Partridge, Sr. Software Engineer BI on Z Swat Team Agenda Understanding Warehousing Terminology Data Warehousing (DW) Market Directions

More information

How to Enhance Traditional BI Architecture to Leverage Big Data

How to Enhance Traditional BI Architecture to Leverage Big Data B I G D ATA How to Enhance Traditional BI Architecture to Leverage Big Data Contents Executive Summary... 1 Traditional BI - DataStack 2.0 Architecture... 2 Benefits of Traditional BI - DataStack 2.0...

More information

Driving Peak Performance. 2013 IBM Corporation

Driving Peak Performance. 2013 IBM Corporation Driving Peak Performance 1 Session 2: Driving Peak Performance Abstract We know you want the fastest performance possible for your deployments, and yet that relies on many choices across data storage,

More information

Distributed Architecture of Oracle Database In-memory

Distributed Architecture of Oracle Database In-memory Distributed Architecture of Oracle Database In-memory Niloy Mukherjee, Shasank Chavan, Maria Colgan, Dinesh Das, Mike Gleeson, Sanket Hase, Allison Holloway, Hui Jin, Jesse Kamp, Kartik Kulkarni, Tirthankar

More information

Virtual Data Warehouse Appliances

Virtual Data Warehouse Appliances infrastructure (WX 2 and blade server Kognitio provides solutions to business problems that require acquisition, rationalization and analysis of large and/or complex data The Kognitio Technology and Data

More information

III JORNADAS DE DATA MINING

III JORNADAS DE DATA MINING III JORNADAS DE DATA MINING EN EL MARCO DE LA MAESTRÍA EN DATA MINING DE LA UNIVERSIDAD AUSTRAL PRESENTACIÓN TECNOLÓGICA IBM Alan Schcolnik, Cognos Technical Sales Team Leader, IBM Software Group. IAE

More information

Safe Harbor Statement

Safe Harbor Statement Safe Harbor Statement "Safe Harbor" Statement: Statements in this presentation relating to Oracle's future plans, expectations, beliefs, intentions and prospects are "forward-looking statements" and are

More information

Toronto 26 th SAP BI. Leap Forward with SAP

Toronto 26 th SAP BI. Leap Forward with SAP Toronto 26 th SAP BI Leap Forward with SAP Business Intelligence SAP BI 4.0 and SAP BW Operational BI with SAP ERP SAP HANA and BI Operational vs Decision making reporting Verify the evolution of the KPIs,

More information

In-Memory Analytics for Big Data

In-Memory Analytics for Big Data In-Memory Analytics for Big Data Game-changing technology for faster, better insights WHITE PAPER SAS White Paper Table of Contents Introduction: A New Breed of Analytics... 1 SAS In-Memory Overview...

More information

Innovative technology for big data analytics

Innovative technology for big data analytics Technical white paper Innovative technology for big data analytics The HP Vertica Analytics Platform database provides price/performance, scalability, availability, and ease of administration Table of

More information

<Insert Picture Here> Best Practices for Extreme Performance with Data Warehousing on Oracle Database

<Insert Picture Here> Best Practices for Extreme Performance with Data Warehousing on Oracle Database 1 Best Practices for Extreme Performance with Data Warehousing on Oracle Database Rekha Balwada Principal Product Manager Agenda Parallel Execution Workload Management on Data Warehouse

More information

Microsoft s SQL Server Parallel Data Warehouse Provides High Performance and Great Value

Microsoft s SQL Server Parallel Data Warehouse Provides High Performance and Great Value Microsoft s SQL Server Parallel Data Warehouse Provides High Performance and Great Value Published by: Value Prism Consulting Sponsored by: Microsoft Corporation Publish date: March 2013 Abstract: Data

More information

SUN ORACLE EXADATA STORAGE SERVER

SUN ORACLE EXADATA STORAGE SERVER SUN ORACLE EXADATA STORAGE SERVER KEY FEATURES AND BENEFITS FEATURES 12 x 3.5 inch SAS or SATA disks 384 GB of Exadata Smart Flash Cache 2 Intel 2.53 Ghz quad-core processors 24 GB memory Dual InfiniBand

More information

<Insert Picture Here> Oracle BI Standard Edition One The Right BI Foundation for the Emerging Enterprise

<Insert Picture Here> Oracle BI Standard Edition One The Right BI Foundation for the Emerging Enterprise Oracle BI Standard Edition One The Right BI Foundation for the Emerging Enterprise Business Intelligence is the #1 Priority the most important technology in 2007 is business intelligence

More information

Moving Large Data at a Blinding Speed for Critical Business Intelligence. A competitive advantage

Moving Large Data at a Blinding Speed for Critical Business Intelligence. A competitive advantage Moving Large Data at a Blinding Speed for Critical Business Intelligence A competitive advantage Intelligent Data In Real Time How do you detect and stop a Money Laundering transaction just about to take

More information

Aligning Your Strategic Initiatives with a Realistic Big Data Analytics Roadmap

Aligning Your Strategic Initiatives with a Realistic Big Data Analytics Roadmap Aligning Your Strategic Initiatives with a Realistic Big Data Analytics Roadmap 3 key strategic advantages, and a realistic roadmap for what you really need, and when 2012, Cognizant Topics to be discussed

More information

2010 Ingres Corporation. Interactive BI for Large Data Volumes Silicon India BI Conference, 2011, Mumbai Vivek Bhatnagar, Ingres Corporation

2010 Ingres Corporation. Interactive BI for Large Data Volumes Silicon India BI Conference, 2011, Mumbai Vivek Bhatnagar, Ingres Corporation Interactive BI for Large Data Volumes Silicon India BI Conference, 2011, Mumbai Vivek Bhatnagar, Ingres Corporation Agenda Need for Fast Data Analysis & The Data Explosion Challenge Approaches Used Till

More information

Splice Machine: SQL-on-Hadoop Evaluation Guide www.splicemachine.com

Splice Machine: SQL-on-Hadoop Evaluation Guide www.splicemachine.com REPORT Splice Machine: SQL-on-Hadoop Evaluation Guide www.splicemachine.com The content of this evaluation guide, including the ideas and concepts contained within, are the property of Splice Machine,

More information

SUN ORACLE DATABASE MACHINE

SUN ORACLE DATABASE MACHINE SUN ORACLE DATABASE MACHINE FEATURES AND FACTS FEATURES From 1 to 8 database servers From 1 to 14 Sun Oracle Exadata Storage Servers Each Exadata Storage Server includes 384 GB of Exadata Smart Flash Cache

More information

In-memory computing with SAP HANA

In-memory computing with SAP HANA In-memory computing with SAP HANA June 2015 Amit Satoor, SAP @asatoor 2015 SAP SE or an SAP affiliate company. All rights reserved. 1 Hyperconnectivity across people, business, and devices give rise to

More information

2015 Ironside Group, Inc. 2

2015 Ironside Group, Inc. 2 2015 Ironside Group, Inc. 2 Introduction to Ironside What is Cloud, Really? Why Cloud for Data Warehousing? Intro to IBM PureData for Analytics (IPDA) IBM PureData for Analytics on Cloud Intro to IBM dashdb

More information

Introduction to Datawarehousing

Introduction to Datawarehousing DIPARTIMENTO DI INGEGNERIA INFORMATICA AUTOMATICA E GESTIONALE ANTONIO RUBERTI Master of Science in Engineering in Computer Science (MSE-CS) Seminars in Software and Services for the Information Society

More information

Oracle Database 12c Plug In. Switch On. Get SMART.

Oracle Database 12c Plug In. Switch On. Get SMART. Oracle Database 12c Plug In. Switch On. Get SMART. Duncan Harvey Head of Core Technology, Oracle EMEA March 2015 Safe Harbor Statement The following is intended to outline our general product direction.

More information

Hadoop and Relational Database The Best of Both Worlds for Analytics Greg Battas Hewlett Packard

Hadoop and Relational Database The Best of Both Worlds for Analytics Greg Battas Hewlett Packard Hadoop and Relational base The Best of Both Worlds for Analytics Greg Battas Hewlett Packard The Evolution of Analytics Mainframe EDW Proprietary MPP Unix SMP MPP Appliance Hadoop? Questions Is Hadoop

More information