EMC CUSTOMER UPDATE. 31 mei 2011 Fort Voordorp. Bart Sjerps. Greenplum Data Warehouse. Copyright 2011 EMC Corporation. All rights reserved.
|
|
- Martin Wade
- 8 years ago
- Views:
Transcription
1 EMC CUSTOMER UPDATE 31 mei 2011 Fort Voordorp Bart Sjerps Greenplum Data Warehouse 1
2 Introduction & Agenda What is Data warehousing? And what s Business Intelligence? Evolution in the Data Warehouse Business purpose Classic DWH architecture Present and future challenges EMC Solution Greenplum 2
3 What is a Data Warehouse? A Datawarehouse is not Vendor and consultant proclamations aside, a data warehouse is not: A project With a specific end date A product you buy from a vendor Like an ODS (such as SCT s) A canned warehouse supplied by istrategy Cognos ReportNet A database schema or instance Like Oracle Or SQL Server A cut-down version of your live transactional database According to Ralph Kimball and Joe Caserta, a data warehouse is: A system that extracts, cleans, conforms, and delivers source data into a dimensional data store and then supports and implements querying and analysis for the purpose of decision making. Another def.: The union of all the enterprise s data marts Aside: The Kimball model is not without some critics: E.g., Bill Inmon 3
4 History: From reports to advanced analytics Early days: run a simple report against the OLTP Database Run heavy batch reports against OLTP Database Dayly, weekly, monthly, year-end, adhoc Run custom queries against OLTP Database (using standard reporting tools) First use of what later became Business Intelligence, getting (market) knowledge from large amounts of information Offload databases for reporting and querying only Implemented as 1:1 copies, or custom designed databases (the first pure Data Warehouses) Need for Extract, Transform, Load tools (ETL) Evolved into OLAP (Online Analytical Processing); specialized methods for running Analytics This required special reporting tools as well Note: Running Batch and reporting on OLTP kills OLTP response time and performance 4
5 Classic vs. Next-gen business intelligence Old-style Datawarehousing: New style Datawarehousing: Frequently run reports/batch Built by programmers, optimized for performance and minimizing resource usage, requires huge developer and DBA efforts This is achieved by classic tuning such as using table indexes, partitioning, SQL optimization Very efficient but only for predictable queries Ad-hoc queries against OLTP data Can kill OLTP service levels, therefore this is often offloaded against prod database copy Optimizing using tricks such as materialized views Classic tuning fails (because it s unpredictable) DWH misused for pieces of business process Now mission-critical! Consider HA / DR / Compliancy Does not replace classic DWH! Get as much data from as many sources as possible Web, data feeds, legacy systems, smart electronics, etc etc Clean it up and modify it for analytics using ETL tools This is very resource intensive and typically requires long processing times Loading in the DWH can be problematic Classic DB systems again use workarounds for speeding it up Data needs to be as up-to-date as possible (less than 24 hours old) Build multi-dimensional databases That can have holes with missing data Build specialized data-marts Optimized by purpose Contains sub-set of all data 5
6 DW/BI purpose Risk Management Credit risk, Operational risk, Market risk Financial Analysis Customer credit, Cash inflow, Key financial ratios/performance Fraud Detection Internal fraud, External fraud Compliance Data integration, Reporting, Audit Customer Intelligence Behavior analysis, Spend & Value analysis, Portfolio analysis, Scorecard and Rating applications Performance Management Analysis of business methodologies, metrics, processes and systems 6
7 Classic Data Warehouse Architecture Data Sources CRM ERP Supply Chain Web Financial Systems Call Center Extract, Transform, Load (ETL) Fast Loading Staging Staging DWH Elastic Scale Data Mart Data Mart Data Mart Data Mart Reports ETL Data Warehouse Data Analysis 7
8 Datamart Sprawl EDW ~10 % of data Data is everywhere and growing 44X data growth by s of data marts Shadow databases Critical business insight is outside EDW Centralized legacy systems are expensive Data marts and personal databases ~90% of data System expansion is slow and process heavy Proprietary HW systems lag behind open systems innovation Traditional solutions cannot scale to meet the DW/BI challenges 8
9 Business Intelligence Challenges (1) Related to Infrastructure Higher service levels DWH not allowed to be down for a few days Need for backup/recovery/dr No SPOF, high-availability architecture Don t forget security, auditing, compliancy, data leakage prevention, customer privacy considerations (think Facebook and Google) Massive growth According to research firms, unstructured data will be biggest growth factor for companies Business Intelligence is #2 Soon we will see datawarehouses 100 s of Terabytes in size (And the first Petabyte customers) Business people want to store more and more in the DWH 9
10 Business Intelligence Challenges (2) Related to Infrastructure Loading time DWH needs to have up-to-date info Load times of multiple days is simply no longer acceptable 24H is max (for the whole process, not just loading) Long term, drive to real-time (ouch!) Scan time (how long does it take to run a query) More data More impatient end users More ad-hoc queries Cannot optimize this anymore with classic SQL tuning and database tricks & magic 10
11 Business Intelligence Challenges (3) Related to Infrastructure And finally New paradigms Multi-dimensional OLAP databases In-memory statistical calculations Needs to load a data subset in memory real quick Web users accessing BI data Of course, through web applications Massive scale-up in # of parallel transactions 11
12 What Takes Up the Most Time? You may be surprised to learn what DW step takes the most time Try guessing which: Hardware 30 Physical database setup Database design ETL OLAP setup st Qtr 2nd Qtr 3rd Qtr 4th Qtr Hardware East Database ETL West Schemas North OLAP tools Acc. to Kimball & Caserta, ETL will eat up 70% of the time. Other analysts give estimates ranging from 50% to 80%. The most often underestimated part of the warehouse project! 12
13 Data In. Decisions Out. 13
14 Database sizes over the years: -1996: 11 TB, in a Teradata Database -1999: 130 TB, in a Teradata Database - May 2008: 2 PB, Yahoo 14
15 It s Time for a Change... Yesterday s Data Warehouse and Analytic Infrastructure Proprietary Expensive Centralized, Monolithic Process-Heavy Batch Summarized Slow The Greenplum Future Commodity Cost-Effective Distributed Self-Service Real-Time Deep Agile
16 Greenplum True market disruption 20 Terabytes 20 kw, 8 Racks $20M 70 Terabytes 20 kw, 6 Racks $7M 100 Terabytes 12 kw, 2 Racks $1.8M 16
17 Industry Recognition: 2009 Gartner Magic Quadrant Gartner: Strengths Scale Mixed workloads Cloud ready Self service Low cost Concerns Company size Fixed by EMC R&D budget Fixed by EMC 17 *2007 was our first year on the MQ Source: Gartner (January 2010)
18 Greenplum Database Data in. Decisions out. Fastest Data Loading Advanced analytics Data in Scatter/Gather Streaming for the world s fastest data loading Eliminate data load bottlenecks Clean and integrate new data Several loading options ranging from bulk load updates to microbatching for near real-time processing In Database Analytics Optimized for fast query execution and linear scalability Move processing closer to data Shared nothing MPP scale-out architecture Computing is automatically optimized and distributed across resources Provides the best concurrent multi-workload performance Decisions out Unified data access for greater insight and value from data Enable parallel analysis across the enterprise Open platform with broad language support Certified enterprise connectivity and integration with most BI, ETL and management products 18
19 Greenplum Database Architecture Overview 19 19
20 Deployment models Greenplum Community Edition Free downloadable Limited to 2 segment servers All software is enabled Greenplum Software Only I.e. run on Vsphere / Vblock Or on standard (Intel) servers Greenplum DCD Appliance Pre-configured, tested, supported, plug & play Huge bandwidth DCD Appliance hybrid DAS / SAN 20
21 Architecture of Greenplum DCA Flexible framework for processing large datasets Process large datasets with support for both SQL and MapReduce etc SQL MapReduce UDF s: R,Java,C,Python,Perl ODBC, JDBC, OLEDB BI/ETL Tools Master servers optimize queries for the most efficient query execution Master Master Interconnect for continuous pipelining of data processing Segment servers process queries close to the data in parallel Segment Segment Segment Segment Segment MPP Scatter/Gather streaming for fast loading of data 21
22 Architecture Based on PostgreSQL (open source) database 15+ years of development Feature-rich, mission critical-ready Greenplum adds features on top of PostgreSQL Very low development cost (compared to traditional RDBMS vendors) Linear Scale-out Parallel loading Not depending on classic (OLTP) RDBMS tricks Special indexes, materialized views, 22
23 Greenplum vs OLTP DB architecture Greenplum Shared nothing master aggregates data Scale out only limited by master servers Optimized for Analytical processing As few indexes as possible (you don t know what you need anyway) Divide & Conquer Limited locking mechanisms -> no OLTP 23 OLTP Shared everything Cache fusion Scale out limited to ~ 4 nodes (if lucky) Optimized for transaction processing (OLTP) Highly optimized indexes - Extensively tuned for known queries Distribute workload (if possible) Full locking -> OLTP optimized
24 Greenplum DCA Instant Price/Performance Leadership EMC Greenplum DCA (1 rack) Oracle Exadata X2-8 (Full Rack) Netezza TwinFin 12 (Full Rack) Teradata 2580 (Full Rack) Architecture MPP Shared-Nothing MPP Shared-Disk MPP Shared-Nothing MPP Shared-Nothing Servers 16 Cores Scan (GB/s) w/o compression 192 Intel E5670 (2.93 GHz) 2 DB 14 Storage 128 DB 112 Storage Intel X Intel E5460 (3.16GHz) + 96 FPGA (2.26 GHz) + 96 FPGA 32 Intel Nehalem (2.66 GHz) 24 GB/Sec 25 GB/Sec 10 GB/Sec 10 GB/Sec Load (TB/Hr) >10TB/Hr 5TB/Hr 2 TB/Hr TBD Capacity (TB) Usable w/o Compression Capacity (TB) Usable w/ Compression 36 TB (600GB) 28 TB (600 GB) 32 TB 15 TB (450 GB) 144 TB 112 TB 128 TB 20 TB Largest Multi-Rack Configuration Max DB Cores Multi-rack Configuration 24 racks 8 racks 10 racks 10 racks
25 Greenplum 4.0: Self-Healing Fault-Tolerance Master Servers Network interconnect Segment Servers Master Servers 1. Segment server fails 2. Mirror segments take over, with no loss of service 3. Segment server is restored or replaced 4. Mirror segments restore primary via differential recovery (while online) Greenplum Database 4.0 enhances fault tolerance using a self-healing physical block replication architecture In Sync Key benefits of this architecture are: Resync ing Fast differential recovery and catch-up (always read-write) Change Tracking Improved write performance and reduced network load 25
26 Greenplum Data Computing Appliance Performance, Scalability, Reliability and Reduced TCO for DW/BI Environments Extreme Performance Optimized for fast query execution and unmatched data loading Rapidly Deployable Purpose-build data warehousing appliance Highly Available Self healing and fully redundant Elastic Scalability Expand capacity and performance online 26 Reduced TCO Consolidate Data Marts for lower costs Private Cloud Ready Data and computing are automatically optimized and distributed Advanced Backup and DR Leverage industry-leading Data Domain backup and recovery
27 Greenplum DCA Available Configurations Half rack (GP100) and Full rack (GP1000) GP100 GP1000 Expansion Bus 8 X Segment Servers Interconnect Bus 2X Master Servers 8 X Segment Servers 27
28 Greenplum DCA Specifications Half rack (GP100) and Full rack (GP1000) GP100 - Half Rack GP Full Rack Master Servers 2 Master Servers 2 Master Servers Segment Servers 8 Segment Servers 16 Segment Servers Memory per Server 48 GB 48 GB Total Memory 384 GB 768 GB Segment HDD s (SAS) Useable Capacity (uncompressed) Useable Capacity (compressed) 18 TB 36 TB 72 TB 144 TB Scan Rate 12 GB/Sec 24 GB/Sec Data Load Rate 5TB/Hour 10TB/Hour 28
29 Customers by Industry Financial Services Telco Media & Internet Retail Gov t & Health/Ins. 29
30 Overall DCD Partner Network and growing Hardware Vendors BI / ETL Tools Solutions & OEM Consultants And Resellers 30
31 Enabling Business Intelligence Through Virtual Enterprise Data Warehousing 31
Advanced In-Database Analytics
Advanced In-Database Analytics Tallinn, Sept. 25th, 2012 Mikko-Pekka Bertling, BDM Greenplum EMEA 1 That sounds complicated? 2 Who can tell me how best to solve this 3 What are the main mathematical functions??
More informationBig Data Analytics. with EMC Greenplum and Hadoop. Big Data Analytics. Ofir Manor Pre Sales Technical Architect EMC Greenplum
Big Data Analytics with EMC Greenplum and Hadoop Big Data Analytics with EMC Greenplum and Hadoop Ofir Manor Pre Sales Technical Architect EMC Greenplum 1 Big Data and the Data Warehouse Potential All
More informationEMC/Greenplum Driving the Future of Data Warehousing and Analytics
EMC/Greenplum Driving the Future of Data Warehousing and Analytics EMC 2010 Forum Series 1 Greenplum Becomes the Foundation of EMC s Data Computing Division E M C A CQ U I R E S G R E E N P L U M Greenplum,
More informationEmerging Technologies Shaping the Future of Data Warehouses & Business Intelligence
Emerging Technologies Shaping the Future of Data Warehouses & Business Intelligence Appliances and DW Architectures John O Brien President and Executive Architect Zukeran Technologies 1 TDWI 1 Agenda What
More information2009 Oracle Corporation 1
The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into any contract. It is not a commitment to deliver any material,
More informationInge Os Sales Consulting Manager Oracle Norway
Inge Os Sales Consulting Manager Oracle Norway Agenda Oracle Fusion Middelware Oracle Database 11GR2 Oracle Database Machine Oracle & Sun Agenda Oracle Fusion Middelware Oracle Database 11GR2 Oracle Database
More informationMain Memory Data Warehouses
Main Memory Data Warehouses Robert Wrembel Poznan University of Technology Institute of Computing Science Robert.Wrembel@cs.put.poznan.pl www.cs.put.poznan.pl/rwrembel Lecture outline Teradata Data Warehouse
More informationBIG DATA APPLIANCES. July 23, TDWI. R Sathyanarayana. Enterprise Information Management & Analytics Practice EMC Consulting
BIG DATA APPLIANCES July 23, TDWI R Sathyanarayana Enterprise Information Management & Analytics Practice EMC Consulting 1 Big data are datasets that grow so large that they become awkward to work with
More informationSAP Real-time Data Platform. April 2013
SAP Real-time Data Platform April 2013 Agenda Introduction SAP Real Time Data Platform Overview SAP Sybase ASE SAP Sybase IQ SAP EIM Questions and Answers 2012 SAP AG. All rights reserved. 2 Introduction
More informationEMC GREENPLUM DATABASE
EMC GREENPLUM DATABASE Driving the future of data warehousing and analytics Essentials A shared-nothing, massively parallel processing (MPP) architecture supports extreme performance on commodity infrastructure
More informationData Warehousing. Jens Teubner, TU Dortmund jens.teubner@cs.tu-dortmund.de. Winter 2015/16. Jens Teubner Data Warehousing Winter 2015/16 1
Jens Teubner Data Warehousing Winter 2015/16 1 Data Warehousing Jens Teubner, TU Dortmund jens.teubner@cs.tu-dortmund.de Winter 2015/16 Jens Teubner Data Warehousing Winter 2015/16 13 Part II Overview
More informationUsing Attunity Replicate with Greenplum Database Using Attunity Replicate for data migration and Change Data Capture to the Greenplum Database
White Paper Using Attunity Replicate with Greenplum Database Using Attunity Replicate for data migration and Change Data Capture to the Greenplum Database Abstract This white paper explores the technology
More informationW H I T E P A P E R. Deriving Intelligence from Large Data Using Hadoop and Applying Analytics. Abstract
W H I T E P A P E R Deriving Intelligence from Large Data Using Hadoop and Applying Analytics Abstract This white paper is focused on discussing the challenges facing large scale data processing and the
More informationNews and trends in Data Warehouse Automation, Big Data and BI. Johan Hendrickx & Dirk Vermeiren
News and trends in Data Warehouse Automation, Big Data and BI Johan Hendrickx & Dirk Vermeiren Extreme Agility from Source to Analysis DWH Appliances & DWH Automation Typical Architecture 3 What Business
More informationBig Data Technologies Compared June 2014
Big Data Technologies Compared June 2014 Agenda What is Big Data Big Data Technology Comparison Summary Other Big Data Technologies Questions 2 What is Big Data by Example The SKA Telescope is a new development
More informationEMC BACKUP MEETS BIG DATA
EMC BACKUP MEETS BIG DATA Strategies To Protect Greenplum, Isilon And Teradata Systems 1 Agenda Big Data: Overview, Backup and Recovery EMC Big Data Backup Strategy EMC Backup and Recovery Solutions for
More informationGreenplum Database. Getting Started with Big Data Analytics. Ofir Manor Pre Sales Technical Architect, EMC Greenplum
Greenplum Database Getting Started with Big Data Analytics Ofir Manor Pre Sales Technical Architect, EMC Greenplum 1 Agenda Introduction to Greenplum Greenplum Database Architecture Flexible Database Configuration
More informationManaging Big Data with Hadoop & Vertica. A look at integration between the Cloudera distribution for Hadoop and the Vertica Analytic Database
Managing Big Data with Hadoop & Vertica A look at integration between the Cloudera distribution for Hadoop and the Vertica Analytic Database Copyright Vertica Systems, Inc. October 2009 Cloudera and Vertica
More informationOBIEE 11g Analytics Using EMC Greenplum Database
White Paper OBIEE 11g Analytics Using EMC Greenplum Database - An Integration guide for OBIEE 11g Windows Users Abstract This white paper explains how OBIEE Analytics Business Intelligence Tool can be
More informationIntroducing Oracle Exalytics In-Memory Machine
Introducing Oracle Exalytics In-Memory Machine Jon Ainsworth Director of Business Development Oracle EMEA Business Analytics 1 Copyright 2011, Oracle and/or its affiliates. All rights Agenda Topics Oracle
More informationOverview: X5 Generation Database Machines
Overview: X5 Generation Database Machines Spend Less by Doing More Spend Less by Paying Less Rob Kolb Exadata X5-2 Exadata X4-8 SuperCluster T5-8 SuperCluster M6-32 Big Memory Machine Oracle Exadata Database
More informationMASSIVEDATANEWS. Load and Go: Fast Data Loading with the Greenplum Data Computing Appliance (DCA)
Greenplum Data Computing Appliance (DCA) Introduction: Why Fast and Flexible Data Loading Matters Data loading is the beginning of the entire analytics process. Everything starts by getting data into the
More informationOracle Exadata Database Machine for SAP Systems - Innovation Provided by SAP and Oracle for Joint Customers
Oracle Exadata Database Machine for SAP Systems - Innovation Provided by SAP and Oracle for Joint Customers Masood Ahmed EMEA Infrastructure Solutions Oracle/SAP Relationship Overview First SAP R/3 release
More informationArchitecting for Big Data Analytics and Beyond: A New Framework for Business Intelligence and Data Warehousing
Architecting for Big Data Analytics and Beyond: A New Framework for Business Intelligence and Data Warehousing Wayne W. Eckerson Director of Research, TechTarget Founder, BI Leadership Forum Business Analytics
More informationPoslovni slučajevi upotrebe IBM Netezze
Poslovni slučajevi upotrebe IBM Netezze data at the Speed and with Simplicity businesses need 25. ožujak 2015. vedran.travica@hr.ibm.com Agenda A. IBM PureData for Analytics Netezza B. Scenarij 1.: Novi
More informationOracle Exadata: The World s Fastest Database Machine Exadata Database Machine Architecture
Oracle Exadata: The World s Fastest Database Machine Exadata Database Machine Architecture Ron Weiss, Exadata Product Management Exadata Database Machine Best Platform to Run the
More informationParallel Data Warehouse
MICROSOFT S ANALYTICS SOLUTIONS WITH PARALLEL DATA WAREHOUSE Parallel Data Warehouse Stefan Cronjaeger Microsoft May 2013 AGENDA PDW overview Columnstore and Big Data Business Intellignece Project Ability
More informationApplied Business Intelligence. Iakovos Motakis, Ph.D. Director, DW & Decision Support Systems Intrasoft SA
Applied Business Intelligence Iakovos Motakis, Ph.D. Director, DW & Decision Support Systems Intrasoft SA Agenda Business Drivers and Perspectives Technology & Analytical Applications Trends Challenges
More informationNetezza and Business Analytics Synergy
Netezza Business Partner Update: November 17, 2011 Netezza and Business Analytics Synergy Shimon Nir, IBM Agenda Business Analytics / Netezza Synergy Overview Netezza overview Enabling the Business with
More informationInvestor Presentation. Second Quarter 2015
Investor Presentation Second Quarter 2015 Note to Investors Certain non-gaap financial information regarding operating results may be discussed during this presentation. Reconciliations of the differences
More informationQlikView Business Discovery Platform. Algol Consulting Srl
QlikView Business Discovery Platform Algol Consulting Srl Business Discovery Applications Application vs. Platform Application Designed to help people perform an activity Platform Provides infrastructure
More informationThe Enterprise Data Hub and The Modern Information Architecture
The Enterprise Data Hub and The Modern Information Architecture Dr. Amr Awadallah CTO & Co-Founder, Cloudera Twitter: @awadallah 1 2013 Cloudera, Inc. All rights reserved. Cloudera Overview The Leader
More informationIBM Netezza High Capacity Appliance
IBM Netezza High Capacity Appliance Petascale Data Archival, Analysis and Disaster Recovery Solutions IBM Netezza High Capacity Appliance Highlights: Allows querying and analysis of deep archival data
More informationBig Data & QlikView. Democratizing Big Data Analytics. David Freriks Principal Solution Architect
Big Data & QlikView Democratizing Big Data Analytics David Freriks Principal Solution Architect TDWI Vancouver Agenda What really is Big Data? How do we separate hype from reality? How does that relate
More informationHow to make BIG DATA work for you. Faster results with Microsoft SQL Server PDW
How to make BIG DATA work for you. Faster results with Microsoft SQL Server PDW Roger Breu PDW Solution Specialist Microsoft Western Europe Marcus Gullberg PDW Partner Account Manager Microsoft Sweden
More informationJames Serra Sr BI Architect JamesSerra3@gmail.com http://jamesserra.com/
James Serra Sr BI Architect JamesSerra3@gmail.com http://jamesserra.com/ Our Focus: Microsoft Pure-Play Data Warehousing & Business Intelligence Partner Our Customers: Our Reputation: "B.I. Voyage came
More informationHow To Build An Exadata Database Machine X2-8 Full Rack For A Large Database Server
Oracle Exadata Database Machine Overview Exadata Database Machine Best Platform to Run the Oracle Database Best Machine for Data Warehousing Best Machine for OLTP Best Machine for
More informationCost-Effective Business Intelligence with Red Hat and Open Source
Cost-Effective Business Intelligence with Red Hat and Open Source Sherman Wood Director, Business Intelligence, Jaspersoft September 3, 2009 1 Agenda Introductions Quick survey What is BI?: reporting,
More informationAn Integrated Analytics & Big Data Infrastructure September 21, 2012 Robert Stackowiak, Vice President Data Systems Architecture Oracle Enterprise
An Integrated Analytics & Big Data Infrastructure September 21, 2012 Robert Stackowiak, Vice President Data Systems Architecture Oracle Enterprise Solutions Group The following is intended to outline our
More informationBIG DATA CAN DRIVE THE BUSINESS AND IT TO EVOLVE AND ADAPT RALPH KIMBALL BUSSUM 2014
BIG DATA CAN DRIVE THE BUSINESS AND IT TO EVOLVE AND ADAPT RALPH KIMBALL BUSSUM 2014 Ralph Kimball Associates 2014 The Data Warehouse Mission Identify all possible enterprise data assets Select those assets
More informationIntegrated Grid Solutions. and Greenplum
EMC Perspective Integrated Grid Solutions from SAS, EMC Isilon and Greenplum Introduction Intensifying competitive pressure and vast growth in the capabilities of analytic computing platforms are driving
More informationSAP HANA In-Memory in Virtualized Data Centers. Arne Arnold, SAP HANA Product Management January 2013
SAP HANA In-Memory in Virtualized Data Centers Arne Arnold, SAP HANA Product Management January 2013 Agenda Virtualization In-Memory Pros & Cons with In-Memory Computing Virtualized SAP HANA Platform What
More informationOracle Database In-Memory The Next Big Thing
Oracle Database In-Memory The Next Big Thing Maria Colgan Master Product Manager #DBIM12c Why is Oracle do this Oracle Database In-Memory Goals Real Time Analytics Accelerate Mixed Workload OLTP No Changes
More informationCollaborative Big Data Analytics. Copyright 2012 EMC Corporation. All rights reserved.
Collaborative Big Data Analytics 1 Big Data Is Less About Size, And More About Freedom TechCrunch!!!!!!!!! Total data: bigger than big data 451 Group Findings: Big Data Is More Extreme Than Volume Gartner!!!!!!!!!!!!!!!
More informationI/O Considerations in Big Data Analytics
Library of Congress I/O Considerations in Big Data Analytics 26 September 2011 Marshall Presser Federal Field CTO EMC, Data Computing Division 1 Paradigms in Big Data Structured (relational) data Very
More informationThe HP Neoview data warehousing platform for business intelligence Die clevere Alternative
The HP Neoview data warehousing platform for business intelligence Die clevere Alternative Ronald Wulff EMEA, BI Solution Architect HP Software - Neoview 2006 Hewlett-Packard Development Company, L.P.
More informationNextGen Infrastructure for Big DATA Analytics.
NextGen Infrastructure for Big DATA Analytics. So What is Big Data? Data that exceeds the processing capacity of conven4onal database systems. The data is too big, moves too fast, or doesn t fit the structures
More informationEinsatzfelder von IBM PureData Systems und Ihre Vorteile.
Einsatzfelder von IBM PureData Systems und Ihre Vorteile demirkaya@de.ibm.com Agenda Information technology challenges PureSystems and PureData introduction PureData for Transactions PureData for Analytics
More informationPreview of Oracle Database 12c In-Memory Option. Copyright 2013, Oracle and/or its affiliates. All rights reserved.
Preview of Oracle Database 12c In-Memory Option 1 The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into any
More information<Insert Picture Here> Oracle Exadata Database Machine Overview
Oracle Exadata Database Machine Overview Exadata Database Machine Best Platform to Run the Oracle Database Best Machine for Data Warehousing Best Machine for OLTP Best Machine for
More informationDatenverwaltung im Wandel - Building an Enterprise Data Hub with
Datenverwaltung im Wandel - Building an Enterprise Data Hub with Cloudera Bernard Doering Regional Director, Central EMEA, Cloudera Cloudera Your Hadoop Experts Founded 2008, by former employees of Employees
More informationBIG DATA TRENDS AND TECHNOLOGIES
BIG DATA TRENDS AND TECHNOLOGIES THE WORLD OF DATA IS CHANGING Cloud WHAT IS BIG DATA? Big data are datasets that grow so large that they become awkward to work with using onhand database management tools.
More informationSAS and Oracle: Big Data and Cloud Partnering Innovation Targets the Third Platform
SAS and Oracle: Big Data and Cloud Partnering Innovation Targets the Third Platform David Lawler, Oracle Senior Vice President, Product Management and Strategy Paul Kent, SAS Vice President, Big Data What
More informationBig Data and Its Impact on the Data Warehousing Architecture
Big Data and Its Impact on the Data Warehousing Architecture Sponsored by SAP Speaker: Wayne Eckerson, Director of Research, TechTarget Wayne Eckerson: Hi my name is Wayne Eckerson, I am Director of Research
More informationHow Transactional Analytics is Changing the Future of Business A look at the options, use cases, and anti-patterns
How Transactional Analytics is Changing the Future of Business A look at the options, use cases, and anti-patterns Table of Contents Abstract... 3 Introduction... 3 Definition... 3 The Expanding Digitization
More informationSQL Server PDW. Artur Vieira Premier Field Engineer
SQL Server PDW Artur Vieira Premier Field Engineer Agenda 1 Introduction to MPP and PDW 2 PDW Architecture and Components 3 Data Structures 4 PDW Tools Data Load / Data Output / Administrative Console
More informationORACLE BUSINESS INTELLIGENCE, ORACLE DATABASE, AND EXADATA INTEGRATION
ORACLE BUSINESS INTELLIGENCE, ORACLE DATABASE, AND EXADATA INTEGRATION EXECUTIVE SUMMARY Oracle business intelligence solutions are complete, open, and integrated. Key components of Oracle business intelligence
More informationHADOOP SOLUTION USING EMC ISILON AND CLOUDERA ENTERPRISE Efficient, Flexible In-Place Hadoop Analytics
HADOOP SOLUTION USING EMC ISILON AND CLOUDERA ENTERPRISE Efficient, Flexible In-Place Hadoop Analytics ESSENTIALS EMC ISILON Use the industry's first and only scale-out NAS solution with native Hadoop
More informationIntroduction to Database as a Service
Introduction to Database as a Service Exadata Platform Revised 8/1/13 Database as a Service (DBaaS) Starts With The Business Needs Establish an IT delivery model that reduces costs, meets demand, and fulfills
More informationNext-Generation Cloud Analytics with Amazon Redshift
Next-Generation Cloud Analytics with Amazon Redshift What s inside Introduction Why Amazon Redshift is Great for Analytics Cloud Data Warehousing Strategies for Relational Databases Analyzing Fast, Transactional
More informationNext Generation Data Warehousing Appliances 23.10.2014
Next Generation Data Warehousing Appliances 23.10.2014 Presentert av: Espen Jorde, Executive Advisor Bjørn Runar Nes, CTO/Chief Architect Bjørn Runar Nes Espen Jorde 2 3.12.2014 Agenda Affecto s new Data
More informationDell Microsoft Business Intelligence and Data Warehousing Reference Configuration Performance Results Phase III
White Paper Dell Microsoft Business Intelligence and Data Warehousing Reference Configuration Performance Results Phase III Performance of Microsoft SQL Server 2008 BI and D/W Solutions on Dell PowerEdge
More informationAGENDA. What is BIG DATA? What is Hadoop? Why Microsoft? The Microsoft BIG DATA story. Our BIG DATA Roadmap. Hadoop PDW
AGENDA What is BIG DATA? What is Hadoop? Why Microsoft? The Microsoft BIG DATA story Hadoop PDW Our BIG DATA Roadmap BIG DATA? Volume 59% growth in annual WW information 1.2M Zetabytes (10 21 bytes) this
More informationOracle s Big Data solutions. Roger Wullschleger. <Insert Picture Here>
s Big Data solutions Roger Wullschleger DBTA Workshop on Big Data, Cloud Data Management and NoSQL 10. October 2012, Stade de Suisse, Berne 1 The following is intended to outline
More informationIntroduction to Oracle Business Intelligence Standard Edition One. Mike Donohue Senior Manager, Product Management Oracle Business Intelligence
Introduction to Oracle Business Intelligence Standard Edition One Mike Donohue Senior Manager, Product Management Oracle Business Intelligence The following is intended to outline our general product direction.
More informationThe Future of Data Management
The Future of Data Management with Hadoop and the Enterprise Data Hub Amr Awadallah (@awadallah) Cofounder and CTO Cloudera Snapshot Founded 2008, by former employees of Employees Today ~ 800 World Class
More informationSAP HANA Reinventing Real-Time Businesses through Innovation, Value & Simplicity. Eduardo Rodrigues October 2013
Reinventing Real-Time Businesses through Innovation, Value & Simplicity Eduardo Rodrigues October 2013 Agenda The Existing Data Management Conundrum Innovations Transformational Impact at Customers Summary
More informationDatabase Decisions: Performance, manageability and availability considerations in choosing a database
Database Decisions: Performance, manageability and availability considerations in choosing a database Reviewing offerings from Oracle, IBM and Microsoft 2012 Oracle and TechTarget Table of Contents Defining
More informationTap into Hadoop and Other No SQL Sources
Tap into Hadoop and Other No SQL Sources Presented by: Trishla Maru What is Big Data really? The Three Vs of Big Data According to Gartner Volume Volume Orders of magnitude bigger than conventional data
More informationSQL Server 2012 Parallel Data Warehouse. Solution Brief
SQL Server 2012 Parallel Data Warehouse Solution Brief Published February 22, 2013 Contents Introduction... 1 Microsoft Platform: Windows Server and SQL Server... 2 SQL Server 2012 Parallel Data Warehouse...
More informationMicrosoft Analytics Platform System. Solution Brief
Microsoft Analytics Platform System Solution Brief Contents 4 Introduction 4 Microsoft Analytics Platform System 5 Enterprise-ready Big Data 7 Next-generation performance at scale 10 Engineered for optimal
More informationHow To Use Exadata
Exadata V2 - Oracle Exadata Database Machine Robert Pastijn Platform Technology Services (PTS) Product Development 2010 Oracle Corporation Exadata V2 Goals Ideal Database Platform
More informationGreen Migration from Oracle
Green Migration from Oracle Greenplum Migration Approach Strong Experiences on Oracle Migration Automate all tasks DDL Migration Data Migration PL-SQL and SQL Scripts Migration Data Quality Tests ETL and
More informationPlease give me your feedback
Please give me your feedback Session BB4089 Speaker Claude Lorenson, Ph. D and Wendy Harms Use the mobile app to complete a session survey 1. Access My schedule 2. Click on this session 3. Go to Rate &
More information5 Keys to Unlocking the Big Data Analytics Puzzle. Anurag Tandon Director, Product Marketing March 26, 2014
5 Keys to Unlocking the Big Data Analytics Puzzle Anurag Tandon Director, Product Marketing March 26, 2014 1 A Little About Us A global footprint. A proven innovator. A leader in enterprise analytics for
More informationEvolving Solutions Disruptive Technology Series Modern Data Warehouse
Evolving Solutions Disruptive Technology Series Modern Data Warehouse Presenter Kumar Kannankutty Big Data Platform Technical Sales Leader Host - Michael Downs, Solution Architect, Evolving Solutions www.evolvingsol.com
More informationData Warehousing on System z BWDB2UG Presentation 9/12/07 v
Data Warehousing on System z BWDB2UG Presentation 9/12/07 v John Partridge, Sr. Software Engineer BI on Z Swat Team Agenda Understanding Warehousing Terminology Data Warehousing (DW) Market Directions
More informationHow to Enhance Traditional BI Architecture to Leverage Big Data
B I G D ATA How to Enhance Traditional BI Architecture to Leverage Big Data Contents Executive Summary... 1 Traditional BI - DataStack 2.0 Architecture... 2 Benefits of Traditional BI - DataStack 2.0...
More informationDriving Peak Performance. 2013 IBM Corporation
Driving Peak Performance 1 Session 2: Driving Peak Performance Abstract We know you want the fastest performance possible for your deployments, and yet that relies on many choices across data storage,
More informationDistributed Architecture of Oracle Database In-memory
Distributed Architecture of Oracle Database In-memory Niloy Mukherjee, Shasank Chavan, Maria Colgan, Dinesh Das, Mike Gleeson, Sanket Hase, Allison Holloway, Hui Jin, Jesse Kamp, Kartik Kulkarni, Tirthankar
More informationVirtual Data Warehouse Appliances
infrastructure (WX 2 and blade server Kognitio provides solutions to business problems that require acquisition, rationalization and analysis of large and/or complex data The Kognitio Technology and Data
More informationIII JORNADAS DE DATA MINING
III JORNADAS DE DATA MINING EN EL MARCO DE LA MAESTRÍA EN DATA MINING DE LA UNIVERSIDAD AUSTRAL PRESENTACIÓN TECNOLÓGICA IBM Alan Schcolnik, Cognos Technical Sales Team Leader, IBM Software Group. IAE
More informationSafe Harbor Statement
Safe Harbor Statement "Safe Harbor" Statement: Statements in this presentation relating to Oracle's future plans, expectations, beliefs, intentions and prospects are "forward-looking statements" and are
More informationToronto 26 th SAP BI. Leap Forward with SAP
Toronto 26 th SAP BI Leap Forward with SAP Business Intelligence SAP BI 4.0 and SAP BW Operational BI with SAP ERP SAP HANA and BI Operational vs Decision making reporting Verify the evolution of the KPIs,
More informationIn-Memory Analytics for Big Data
In-Memory Analytics for Big Data Game-changing technology for faster, better insights WHITE PAPER SAS White Paper Table of Contents Introduction: A New Breed of Analytics... 1 SAS In-Memory Overview...
More informationInnovative technology for big data analytics
Technical white paper Innovative technology for big data analytics The HP Vertica Analytics Platform database provides price/performance, scalability, availability, and ease of administration Table of
More information<Insert Picture Here> Best Practices for Extreme Performance with Data Warehousing on Oracle Database
1 Best Practices for Extreme Performance with Data Warehousing on Oracle Database Rekha Balwada Principal Product Manager Agenda Parallel Execution Workload Management on Data Warehouse
More informationMicrosoft s SQL Server Parallel Data Warehouse Provides High Performance and Great Value
Microsoft s SQL Server Parallel Data Warehouse Provides High Performance and Great Value Published by: Value Prism Consulting Sponsored by: Microsoft Corporation Publish date: March 2013 Abstract: Data
More informationSUN ORACLE EXADATA STORAGE SERVER
SUN ORACLE EXADATA STORAGE SERVER KEY FEATURES AND BENEFITS FEATURES 12 x 3.5 inch SAS or SATA disks 384 GB of Exadata Smart Flash Cache 2 Intel 2.53 Ghz quad-core processors 24 GB memory Dual InfiniBand
More information<Insert Picture Here> Oracle BI Standard Edition One The Right BI Foundation for the Emerging Enterprise
Oracle BI Standard Edition One The Right BI Foundation for the Emerging Enterprise Business Intelligence is the #1 Priority the most important technology in 2007 is business intelligence
More informationMoving Large Data at a Blinding Speed for Critical Business Intelligence. A competitive advantage
Moving Large Data at a Blinding Speed for Critical Business Intelligence A competitive advantage Intelligent Data In Real Time How do you detect and stop a Money Laundering transaction just about to take
More informationAligning Your Strategic Initiatives with a Realistic Big Data Analytics Roadmap
Aligning Your Strategic Initiatives with a Realistic Big Data Analytics Roadmap 3 key strategic advantages, and a realistic roadmap for what you really need, and when 2012, Cognizant Topics to be discussed
More information2010 Ingres Corporation. Interactive BI for Large Data Volumes Silicon India BI Conference, 2011, Mumbai Vivek Bhatnagar, Ingres Corporation
Interactive BI for Large Data Volumes Silicon India BI Conference, 2011, Mumbai Vivek Bhatnagar, Ingres Corporation Agenda Need for Fast Data Analysis & The Data Explosion Challenge Approaches Used Till
More informationSplice Machine: SQL-on-Hadoop Evaluation Guide www.splicemachine.com
REPORT Splice Machine: SQL-on-Hadoop Evaluation Guide www.splicemachine.com The content of this evaluation guide, including the ideas and concepts contained within, are the property of Splice Machine,
More informationSUN ORACLE DATABASE MACHINE
SUN ORACLE DATABASE MACHINE FEATURES AND FACTS FEATURES From 1 to 8 database servers From 1 to 14 Sun Oracle Exadata Storage Servers Each Exadata Storage Server includes 384 GB of Exadata Smart Flash Cache
More informationIn-memory computing with SAP HANA
In-memory computing with SAP HANA June 2015 Amit Satoor, SAP @asatoor 2015 SAP SE or an SAP affiliate company. All rights reserved. 1 Hyperconnectivity across people, business, and devices give rise to
More information2015 Ironside Group, Inc. 2
2015 Ironside Group, Inc. 2 Introduction to Ironside What is Cloud, Really? Why Cloud for Data Warehousing? Intro to IBM PureData for Analytics (IPDA) IBM PureData for Analytics on Cloud Intro to IBM dashdb
More informationIntroduction to Datawarehousing
DIPARTIMENTO DI INGEGNERIA INFORMATICA AUTOMATICA E GESTIONALE ANTONIO RUBERTI Master of Science in Engineering in Computer Science (MSE-CS) Seminars in Software and Services for the Information Society
More informationOracle Database 12c Plug In. Switch On. Get SMART.
Oracle Database 12c Plug In. Switch On. Get SMART. Duncan Harvey Head of Core Technology, Oracle EMEA March 2015 Safe Harbor Statement The following is intended to outline our general product direction.
More informationHadoop and Relational Database The Best of Both Worlds for Analytics Greg Battas Hewlett Packard
Hadoop and Relational base The Best of Both Worlds for Analytics Greg Battas Hewlett Packard The Evolution of Analytics Mainframe EDW Proprietary MPP Unix SMP MPP Appliance Hadoop? Questions Is Hadoop
More information