2010 Ingres Corporation. Interactive BI for Large Data Volumes Silicon India BI Conference, 2011, Mumbai Vivek Bhatnagar, Ingres Corporation



Similar documents
Emerging Innovations In Analytical Databases TDWI India Chapter Meeting, July 23, 2011, Hyderabad

BIG DATA APPLIANCES. July 23, TDWI. R Sathyanarayana. Enterprise Information Management & Analytics Practice EMC Consulting

Actian Vector in Hadoop

Architecting for Big Data Analytics and Beyond: A New Framework for Business Intelligence and Data Warehousing

Big Data and Its Impact on the Data Warehousing Architecture

Performance and Scalability Overview

Imagine business analytics at the speed of thought

IBM Informix Warehouse Accelerator (IWA)

SAP Real-time Data Platform. April 2013

Cost-Effective Business Intelligence with Red Hat and Open Source

Main Memory Data Warehouses

Preview of Oracle Database 12c In-Memory Option. Copyright 2013, Oracle and/or its affiliates. All rights reserved.

Performance and Scalability Overview

Big Data Technologies Compared June 2014

Lowering the Total Cost of Ownership (TCO) of Data Warehousing

Enterprise Edition Analytic Data Warehouse Technology White Paper

Oracle BI EE Implementation on Netezza. Prepared by SureShot Strategies, Inc.

Innovative technology for big data analytics

News and trends in Data Warehouse Automation, Big Data and BI. Johan Hendrickx & Dirk Vermeiren

Five Technology Trends for Improved Business Intelligence Performance

Introducing Oracle Exalytics In-Memory Machine

Real Life Performance of In-Memory Database Systems for BI

Open Source Business Intelligence Intro

EMC/Greenplum Driving the Future of Data Warehousing and Analytics

IBM Data Retrieval Technologies: RDBMS, BLU, IBM Netezza, and Hadoop

Oracle BI Suite Enterprise Edition

In-Memory Data Management for Enterprise Applications

<Insert Picture Here> Best Practices for Extreme Performance with Data Warehousing on Oracle Database

Focus on the business, not the business of data warehousing!

INFORMATICA WORLD TOUR August 2012

SQL Server 2012 Parallel Data Warehouse. Solution Brief

IBM Data Warehousing and Analytics Portfolio Summary

The Technology Evaluator s Cheat Sheets. Business Intelligence & Analy:cs

Microsoft Business Intelligence solution. What makes Microsoft BI difference

2015 Ironside Group, Inc. 2

Stay Tuned for Today s Session! NAVIGATING THE DATABASE UNIVERSE"

Infrastructure Matters: POWER8 vs. Xeon x86

Driving Peak Performance IBM Corporation

The HP Neoview data warehousing platform for business intelligence Die clevere Alternative

Exadata Database Machine

How to Build a High-Performance Data Warehouse By David J. DeWitt, Ph.D.; Samuel Madden, Ph.D.; and Michael Stonebraker, Ph.D.

Parallel Data Warehouse

SQream Technologies Ltd - Confiden7al

Determine the Right Analytic Database: A Survey of New Data Technologies

Netezza and Business Analytics Synergy

Microsoft Analytics Platform System. Solution Brief

Five Best Practices for Maximizing Big Data ROI

HP Oracle Database Platform / Exadata Appliance Extreme Data Warehousing

Business Analytics: The Big Leap Forward RUN BETTER

Why DBMSs Matter More than Ever in the Big Data Era

EMC CUSTOMER UPDATE. 31 mei 2011 Fort Voordorp. Bart Sjerps. Greenplum Data Warehouse. Copyright 2011 EMC Corporation. All rights reserved.

Hadoop and Relational Database The Best of Both Worlds for Analytics Greg Battas Hewlett Packard

Intro to BI. Mul0- dimensional Analysis

SAP HANA SAP s In-Memory Database. Dr. Martin Kittel, SAP HANA Development January 16, 2013

IBM DB2 Near-Line Storage Solution for SAP NetWeaver BW

Tiber Solutions. Understanding the Current & Future Landscape of BI and Data Storage. Jim Hadley

Near-line Storage with CBW NLS

One-Size-Fits-All: A DBMS Idea Whose Time has Come and Gone. Michael Stonebraker December, 2008

An Accenture Point of View. Oracle Exalytics brings speed and unparalleled flexibility to business analytics

Whitepaper. 4 Steps to Successfully Evaluating Business Analytics Software.

SQL Server 2012 Performance White Paper

Using Attunity Replicate with Greenplum Database Using Attunity Replicate for data migration and Change Data Capture to the Greenplum Database

HP Vertica at MIT Sloan Sports Analytics Conference March 1, 2013 Will Cairns, Senior Data Scientist, HP Vertica

Hadoop s Entry into the Traditional Analytical DBMS Market. Daniel Abadi Yale University August 3 rd, 2010

Smarter Computing. Erich Amrehn Distinguished Engineer Chief Architect Smarter Computing technical enablement IBM Corporation

Composite Software Data Virtualization Turbocharge Analytics with Big Data and Data Virtualization

Using Big Data for Smarter Decision Making. Colin White, BI Research July 2011 Sponsored by IBM

The Evolution of Microsoft SQL Server: The right time for Violin flash Memory Arrays

QlikView Business Discovery Platform. Algol Consulting Srl

2009 Oracle Corporation 1

Les journées SQL Server 2013

In-Memory Analytics: A comparison between Oracle TimesTen and Oracle Essbase

Rethinking SIMD Vectorization for In-Memory Databases

Whitepaper. Innovations in Business Intelligence Database Technology.

High performance ETL Benchmark

IBM Netezza High-performance business intelligence and advanced analytics for the enterprise. The analytics conundrum

Optimize Oracle Business Intelligence Analytics with Oracle 12c In-Memory Database Option

SAP Analytics Roadmap for Small and Midsize Companies. Kevin Chan, Director, Solutions SAP

Reporting trends and pain points of current and new customers IBM Corporation

Performance Verbesserung von SAP BW mit SQL Server Columnstore

Cognos Performance Troubleshooting

Next Generation Data Warehouse and In-Memory Analytics

IBM Cognos 10: Enhancing query processing performance for IBM Netezza appliances

White Paper. IBM POWER8: Performance and Cost Advantages in Business Intelligence Systems. 89 Fifth Avenue, 7th Floor. New York, NY 10003

Business Performance without limits how in memory. computing changes everything

In-Memory Analytics for Big Data

In-memory computing with SAP HANA

SAS and Oracle: Big Data and Cloud Partnering Innovation Targets the Third Platform

Emerging Technologies Shaping the Future of Data Warehouses & Business Intelligence

"The performance driven Enterprise" Emerging trends in Enterprise BI Platforms

The Cubetree Storage Organization

Do More, Faster with IBM Cognos Business Intelligence Your hardware matters on the server and the client side

Columnstore Indexes for Fast Data Warehouse Query Processing in SQL Server 11.0

Big Data Processing: Past, Present and Future

The data warehouse DBMS market is heating

Integrating Salesforce Using Talend Integration Cloud

Fact Sheet In-Memory Analysis

Building your Big Data Architecture on Amazon Web Services

Streamline SAP HANA with Nearline Storage Solutions by PBS and IBM Elke Hartmann-Bakan, IBM Germany Dr. Klaus Zimmer, PBS Software DMM127

Transcription:

Interactive BI for Large Data Volumes Silicon India BI Conference, 2011, Mumbai Vivek Bhatnagar, Ingres Corporation

Agenda Need for Fast Data Analysis & The Data Explosion Challenge Approaches Used Till Date New Breakthrough - Harnessing The Power of The Modern Micro-processors Quick Intro - Ingres VectorWise Market Offerings Today: Gartner DW MQ 2011

Today s Challenges in BI Make informed decisions faster Analysis in seconds not minutes, minutes not hours Data explosion Collecting more information, less resources >44x growth in next 10 years Existing Tools: Too Slow Too complex Too expensive Analytical databases designed in 80s & 90s do not take advantage of today s modern hardware

Need for Fast Data Analysis Make the right decision TIMELY Boost sales: beat the competition Improve efficiencies: lower costs Maximize profits: enable earlier investments Examples Risk management: reduce risk before exposure results in losses Customer churn: act before the customer switches Fraud detection: minimize losses due to fraud

Data Explosion Challenges the Business Corporate Data Time Data continues to grow at explosive rate By 2020 we will see: 44x increase in digital information 67x increase in the files containing this information 35 trillion gigabytes of digital information in the world (a stack of DVDs reaching ½ way to Mars) Source: IDC, The Digital Universe Are You Ready May 2010

Resources to Manage Data Explosion are Constraints Data growth: 44x Human Resources to manage data growth: 40% Resources to manage data can t grow at same rate as data Time Physical data center Human Resources Budget Energy cost Expertise Impending regulation Businesses require technology breakthroughs to manage the data explosion

Challenges with Current State of BI Business Wants Faster Answers Database Too Slow Application Traditional Database Data Growth Exponential Analytical Bottleneck Complex, Risky & Expensive More Hardware OLAP Cubes Indices Rollups Hardware Star Schemas Tuning DB only use a fraction of CPU Capability Confidential 2010 Ingres Corporation

Approaches for Achieving Data Analytics Performance (1980s 2000s) Optimizations for parallel processing and minimal data retrieval Parallel Processing Column store with compression Proprietary hardware MPP Column Decompress Buffer Manager Disk RAM Data Warehouse Appliances Acceptable performance has been achieved by using more hardware or by intelligently lowering the volume of data to be processed However, none of these approaches leverages the performance features of the new modern processors ie taking the most out of each modern commodity CPU

Modern CPU Instruction Capabilities SIMD Traditional CPU processing: Single Instruction, Single Data (SISD) Modern CPU processing capabilities: Single Instruction, Multiple Data (SIMD) Out-of-order execution Chip multi-threading Large L2/L3 caches Streaming SIMD Extensions for efficient SIMD processing

Traditional Scalar Processing vs Vector Processing Traditional Scalar Processing Vector Processing One operation performed on one element at a time 1 x 1 = 1 2 x 2 = 4 3 x 3 = 9 4 x 4 = 16 5 x 5 = 25 6 x 6 = 36 7 x 7 = 49 8 x 8 = 64 Many V s 1 1 x 1 2 x 2 3 x 3 4 x 4 5 x 5 6 x 6 7 x 7 8 x 8 = 1 4 9 16 25 36 49 64 One operation performed on a set of data at a time No overheads Large overheads n x n = n 2 n x n n 2 Process even 1GB per second

Today s Database Facts Most database software originally developed in 1970s/80s Assumes SISD processing Does not use SIMD and other performance features due to complexity Moore s Law has applied to CPU processing power since 1970s CPU performance doubled at least every 2 years Recently thanks to advanced performance features Has database performance gained 2x every 2 years?

New Emerging Approach: Ingres VectorWise On Chip Vectorised Computing/Columnar Database Time / Cycles to Process 2-20 150-250 Millions On Chip Computing DISK 40-400MB RAM 2-3GB Data Processed CHIP 10GB >100X Faster to process data on chip cache than in RAM Automatic optimization of full storage hierarchy Vector Processing Core level parallel processing 100X eg SIMD, Superscalar 2 nd Gen Column Store Limit I/O to specific columns Efficient real time updates Smarter Compression Maximizes throughput via automatically optimized on-chip compression

Ingres VectorWise The World s Fastest Database Ingres VectorWise is a breakthrough Analytical Database Faster 10x - 70x faster Analytics & reporting Handle terabyte-scale data with just a single server Delivers results in seconds not minutes minutes not hours Simple Use your existing queries & SQL statements Eliminate the Buzz Words- Cubes, roll ups, indexes Deliver IT reporting projects faster with lower cost & risk Smarter Revolutionary use of modern chip technology Get more from low cost commodity hardware Database speeds increase with Moore s Law

Ingres VectorWise The World s Fastest Database Faster Insight Faster performance The World s Fastest Database Faster & deeper insight More Agile BI VectorWise could do any query we wanted in under 1 second In a pathological case, Postgres took over 13min Dan Sanders, Advanced E-commerce Research Systems Game-changing technology Don Feinberg, Gartner Group

Gartner MQ 2011 for Data-warehouse DBMSs Increasing domination of non traditional RDBMSs for data-warehousing: MPP/Appliances: Teradata/Asterdata, Netezza (IBM), Greenplum, DataAllegro Columnar/MPP: Sybase IQ, Vertica (HP), 1010 Data, Infobright, Asterdata (Teradata), ParAccel On-Chip Vectorised Computing/Columnar : Ingres VectorWise Source: Gartner Magic Quadrant for Data Warehouse Database Management Systems, Jan 2011 15

Any Questions? Reduced Costs Greater Innovation