BIG DATA APPLIANCES. July 23, TDWI. R Sathyanarayana. Enterprise Information Management & Analytics Practice EMC Consulting



Similar documents
Big Data and Its Impact on the Data Warehousing Architecture

Architecting for Big Data Analytics and Beyond: A New Framework for Business Intelligence and Data Warehousing

Real Life Performance of In-Memory Database Systems for BI

EMC/Greenplum Driving the Future of Data Warehousing and Analytics

Main Memory Data Warehouses

IBM Informix Warehouse Accelerator (IWA)

The Pros and Cons of Data Warehouse Appliances

Cost-Effective Business Intelligence with Red Hat and Open Source

The Forrester Wave : Enterprise Data Warehouse, Q4 2013

SAP Real-time Data Platform. April 2013

Data Warehouse Appliances: The Next Wave of IT Delivery. Private Cloud (Revocable Access and Support) Applications Appliance. (License/Maintenance)

P4.1 Reference Architectures for Enterprise Big Data Use Cases Romeo Kienzler, Data Scientist, Advisory Architect, IBM Germany, Austria, Switzerland

Evolving Data Warehouse Architectures

Whitepaper. 4 Steps to Successfully Evaluating Business Analytics Software.

Big Data Technologies Compared June 2014

IBM Cognos 10: Enhancing query processing performance for IBM Netezza appliances

Il mondo dei DB Cambia : Tecnologie e opportunita`

Bringing Big Data into the Enterprise

Open Source Business Intelligence Intro

Investor Presentation. Second Quarter 2015

Focus on the business, not the business of data warehousing!

Oracle Database 12c. Andy Mendelsohn. Senior Vice President, Oracle Database Server Technologies

Performance and Scalability Overview

Emerging Technologies Shaping the Future of Data Warehouses & Business Intelligence

Business Performance without limits how in memory. computing changes everything

Advanced In-Database Analytics

Big Data Analytics. with EMC Greenplum and Hadoop. Big Data Analytics. Ofir Manor Pre Sales Technical Architect EMC Greenplum

INVESTOR PRESENTATION. First Quarter 2014

white paper Sybase IQ, A Competitive Analysis

VIEWPOINT. High Performance Analytics. Industry Context and Trends

Modernizing Your Data Warehouse for Hadoop

Performance and Scalability Overview

Introducing Oracle Exalytics In-Memory Machine

Magic Quadrant for Data Warehouse Database Management Systems

Datalogix. Using IBM Netezza data warehouse appliances to drive online sales with offline data. Overview. IBM Software Information Management

Big Data solutions to support Intelligent Systems and Applications

Virtual Data Warehouse Appliances

How to make BIG DATA work for you. Faster results with Microsoft SQL Server PDW

Microsoft s SQL Server Parallel Data Warehouse Provides High Performance and Great Value

Aaron Werman.

OWB Users, Enter The New ODI World

Understanding the Value of In-Memory in the IT Landscape

Safe Harbor Statement

IBM Data Warehousing and Analytics Portfolio Summary

SQL Server 2012 Parallel Data Warehouse. Solution Brief

SAP Analytics Roadmap for Small and Midsize Companies. Kevin Chan, Director, Solutions SAP

IBM AND NEXT GENERATION ARCHITECTURE FOR BIG DATA & ANALYTICS!

Composite Software Data Virtualization Turbocharge Analytics with Big Data and Data Virtualization

End to End Solution to Accelerate Data Warehouse Optimization. Franco Flore Alliance Sales Director - APJ

HP Oracle Database Platform / Exadata Appliance Extreme Data Warehousing

Big Data Database Revenue and Market Forecast,

2009 Oracle Corporation 1

Offload Enterprise Data Warehouse (EDW) to Big Data Lake. Ample White Paper

BIG DATA CAN DRIVE THE BUSINESS AND IT TO EVOLVE AND ADAPT RALPH KIMBALL BUSSUM 2014

A Whole New World. Big Data Technologies Big Discovery Big Insights Endless Possibilities

High performance analytics. Benchmark results.

A Platform for Big Data Analytics on Distributed Scale-out Storage System

Parallel Data Warehouse

Big Data and Your Data Warehouse Philip Russom

Infrastructure Matters: POWER8 vs. Xeon x86

Hadoop Market - Global Industry Analysis, Size, Share, Growth, Trends, and Forecast,

IBM Netezza High-performance business intelligence and advanced analytics for the enterprise. The analytics conundrum

Big Data Market Size and Vendor Revenues

IBM BigInsights for Apache Hadoop

Breaking News! Big Data is Solved. What Is In-Memory Computing and What Does It Mean to U.S. Leaders? EXECUTIVE WHITE PAPER

Why EMC for SAP HANA. EMC is the #1 Storage Vendor for SAP (IDC Storage User Demand Study, Fall 2011)

Oracle Big Data SQL Technical Update

Proact whitepaper on Big Data

A Tour of the Zoo the Hadoop Ecosystem Prafulla Wani

QlikView Business Discovery Platform. Algol Consulting Srl

Tap into Big Data at the Speed of Business

High-Performance Business Analytics: SAS and IBM Netezza Data Warehouse Appliances

In-Memory Analytics for Big Data

Modern Data Warehousing

I/O Considerations in Big Data Analytics

Data Doesn t Communicate Itself Using Visualization to Tell Better Stories

There s no way around it: learning about Big Data means

Integrating Hadoop. Into Business Intelligence & Data Warehousing. Philip Russom TDWI Research Director for Data Management, April

Microsoft Analytics Platform System. Solution Brief

IBM InfoSphere BigInsights Enterprise Edition

SAP Lumira 1.28 Product Availability Matrix (PAM)

News and trends in Data Warehouse Automation, Big Data and BI. Johan Hendrickx & Dirk Vermeiren

Transcription:

BIG DATA APPLIANCES July 23, TDWI R Sathyanarayana Enterprise Information Management & Analytics Practice EMC Consulting 1

Big data are datasets that grow so large that they become awkward to work with using on-hand database management tools. Difficulties include capture, storage, search, sharing, analytics, and visualizing. This trend continues because of the benefits of working with larger and larger datasets allowing analysts to "spot business trends, prevent diseases, combat crime 2

What's all the fuss about 'Big Data'? Internet scale Analyze enormous volumes of data with cost efficiency and response time Diversity Integrate cross platform data Transform fundamentally what is currently possible Speed Derive insights from most recent data 3

Picture This. Rich Visualization + Analytics DATA WAREHOUSE (ENTERPRISE/CROSS ENTERPRISE) 4

Appliances : Setting the context As IT organizations build up massive numbers of databases to deal with the explosion of data, the ability to make real-time decisions on new questions (BI) that involve enormous amounts of information (DW) will need to be a core competency for many organizations. Due to this shift, DW/BI customers need a solution that can provide extreme predictable performance, scale-out architecture for Big Data analytics and an enterprise-proven feature set all at the lowest TCO. 5

Under Works : Definition of Datawarehouse Appliance Original Definition : Hardware + Software built and supported by a single vendor Partial Technology Stack : May or may not bundle with other vendors hardware and / or operating system 6

Architectural considerations are evolving Whole Technology Stack (Appliances or Bundles) Partial Technology Stack Miscellaneous Data Warehouse Appliances (DWAs) Early DWAs: Netezza (now IBM) Teradata New DWAs: EMC Greenplum DCA XtremeData dbx Very Early Products: Sequent, White Cross Relational Databases: Aster MapReduce DWA (now Teradata) Greenplum (EMC DCA) Kickfire (now Teradata) Kognitio (SaaS) Columnar Databases: ParAccel, Sybase IQ, Vertica (now HP) General Data Appliance: Dataupia Sartori Server Oracle Exadata (storage) Add-On Accelerators: Cognos Now! SAP BI Accelerator Teradata Accelerate Related Products Server/Database Bundles: IBM Balanced Warehouse Oracle Database Machine SQL server Fast Track Sybase Analytic Appliance Open-Source Databases: Infobright Ingres Postgres MySQL Commodity/Standardized Hardware: Dell, HP, IBM Cisco, EMC AMD, Intel Open-Source Op Sys: LINUX (various vendors) TDWI Research 7

Trends: Consolidation in the industry Focus is shifting in multiple areas: 1.From whole technology stack to pieces of it 2.From hardware to software 3.From proprietary to commodity hardware 4.From new vendors to infrastructure providers 5.From single to mixed workloads 6.From data marts to EDW 8

Major differences between the DW appliances Column vs. Row Storage Polymorphic Storage (Both Column and Row) Proprietary and Commodity Hardware In-Memory Processing Relationship with Existing Architecture Shared Nothing Architecture 9

The Challenges in Today s Data Warehousing Environments Sources of data and the amount of data to analyze is growing exponentially Stale data exists because DW solutions cannot ingest the vast amounts of data fast enough Lack of performance for advanced analytics and complex queries The number of users and the concurrency of users is increasing rapidly 10

Top questions from architectural viewpoints Proprietary / Non-proprietary, standardized hw Is this a shared nothing / share disk/ share everything architecture? Capability in database analytics? Scale linear scalability? What are your ingestion rates? (higher rate, better real time reporting) Can I balance my workloads? How about mixed workloads? What are my DW management overheads? 11

Considerations of DW Solution Easily scales to analyze growing amounts of data Rapidly ingests large amounts of data from sources Provides high performance in database analytics Supports high user concurrency securely, reliably Handle multiple workloads 12