Czy moŝna pogodzić działalność naukową i komercyjną? Na przykładzie historii silnika bazodanowego stworzonego przez firmę Infobright Inc.

Size: px
Start display at page:

Download "Czy moŝna pogodzić działalność naukową i komercyjną? Na przykładzie historii silnika bazodanowego stworzonego przez firmę Infobright Inc."

Transcription

1 Czy moŝna pogodzić działalność naukową i komercyjną? Na przykładzie historii silnika bazodanowego stworzonego przez firmę Infobright Inc. Dominik Ślęzak

2 O czym będzie Infobright dzisiaj Historia rozwoju Szczegóły technologiczne Jak się w tym nie pogubić? 2

3 Infobright Technology Innovation First commercial open source data warehouse (analytical database engine) Community & Enterprise Editions Lowest cost technology in the market Simplest & easiest to build & manage Powerful & scalable MySQL Technology unlike anything in the industry Cool Vendor in Data Management and Integration High Performance Data Warehousing Partner of the Year STARTUP TO WATCH Partner of the Year 2009 Infobright: Economic Data Warehouse Choice Strong Momentum & Mature Product Release 3.x generally available +100 customers in 10 Countries +40 Partners on 6 continents A vibrant open source community +1 million visitors 20,000 downloads 3

4 Infobright: Designed for Analytics Best Fit True Analytics Good Fit Standard Analytics Poor Fit OLTP Analytics Environment Questions about the data Aggregates: sums, counts... Some requests for data, optimized by Knowledge Grid Primarily data retrieval & mixed read/write activity (but we handle mixed read/writes in analytics) Analytical Analytical and Transaction applications ad hoc queries standard queries & operational reporting Sample Queries How many new customers were identified recently? What was the largest sale made for item A? Report the sales figures from California for X campaign for Jan 2008 SELECT * Go get the data Notes Knowledge Grid is used heavily to resolve the query almost no I/O Knowledge Grid is used to reduce I/O times by optimizing data access It does not use the advantages of a columnar database

5 Why Customers Buy Infobright (1) Fast query response with no tuning Customer s Test Alternative Infobright Analytical queries 2+ hours with MySQL <10 seconds Query (AND Left Join) 26.4 seconds in Oracle.02 seconds Oracle query set 10 seconds 15 minutes seconds BI report 7 hours in Informix 17 seconds Data load 11 hours in MySQL ISAM 11 minutes Fast and consistent data load speed as database grows. Up to 300GB/hour on a single server 5

6 Online Marketing Online Marketing Need Data Volumes online marketing data generates staggering volumes of information that needs to be captured and analyzed Real-time Response needed for rapid response to campaigns and specific targeting Deep Ad Hoc Analysis differentiation through deeper analytics across more granular data Low Maintenance insane growth rates, tight resourcing means a solution with as little maintenance and cost as possible Infobright s Solution Support for storing up to 50 TBs of data compressed, with up to 40:1 compression, providing the lowest hardware footprint Blazing response times for deep business analytic queries on TBs of granular clickstream data Lowest total cost of ownership in data warehousing simple to implement & manage, requiring little administration Customer experience, including: 6

7 Telecommunications Telecommunications Need Data Volumes are simply staggering; call center analysis, call data records (CDR), customer billing data, network data incl. alarms, alerts and events Avoiding Data Silos storing vast amounts of data and ensuring timely access typically results in multiple development projects, creating unconnected silos of data Highly Complex Ad Hoc Analysis vast cross-section of TELCO applications means widely differing requirements & highly complex, ad hoc queries 7 Infobright s Solution Support for storing up to 50 TBs of data compressed, with up to 40:1 compression, providing the lowest hardware footprint Keeps data together while providing fast response times on TBs of varied data Complex business analytics without the maintenance overhead provides the lowest TCO in data warehousing Customer experience, including:

8 Financial Services Financial Services Need Response Times Capital markets firms depend on the speed at which they can analyze and process trades Risk Management Must prevent risk and identify fraud in real-time to maintain the business Regulatory Compliance Compliance with rising regulatory requirements, directly impacts their ability to stay in business: Trading compliance (Reg NMS, MiFID) Corporate governance (Sarbanes-Oxley, Basel II) Infobright s Solution Fast analytic response times regardless of TBs of ever-increasing trade and tick data volumes Detailed business analytics to spot inconsistencies across huge volumes of live and historical data, prevents losses of money and reputation Reporting accuracy regardless of rising volumes helps maintain compliance Customer experience, including: 8

9 Dominik (back to 2005) 9

10 Challenges ( ) Data tables are incomparably larger than on this illustration but the users want the query speed to stay within minutes Data may grow very quickly but the users expect the data warehouse to adjust to the new data very quickly as well Types of queries are hard to predict and the users expect that arbitrary queries will be executed reasonably quickly O u tlo o k T em p. H u m id. W in d S p o rt? 1 S u n n y H o t H ig h W e ak N o 2 S u n n y H o t H ig h S tro n g N o 3 O v ercast H o t H ig h W e ak Y es 4 R ain M ild H ig h W e ak Y es 5 R ain C o ld N o rm al W e ak Y es 6 R ain C o ld N o rm al S tro n g N o 7 O v ercast C o ld N o rm al S tro n g Y es 8 S u n n y M ild H ig h W e ak N o 9 S u n n y C o ld N o rm al W e ak Y es 1 0 R ain M ild N o rm al W e ak Y es 1 1 S u n n y M ild N o rm al S tro n g Y es 1 2 O v ercast M ild H ig h S tro n g Y es 1 3 O v ercast H o t N o rm al W e ak Y es 1 4 R ain M ild H ig h S tro n g N o Current data warehouse solutions (MPP, indices, sorting, partitioning, etc.) are not flexible and often require special hardware 10

11 Column vs. Row-Oriented EMP_ID FNAME LNAME SALARY 1 Moe Howard Curly Joe Larry Fine 9000 Row Oriented (1,Moe,Howard,10000; 2,Curly, Joe,12000; 3,Larry,Fine,9000;) Works well if all the columns are needed for every query. Efficient for transactional processing if all the data for the row is available Column Oriented (1,2,3; Moe,Curly,Larry; Howard,Joe,Fine; 10000,12000,9000;) Works well with aggregate results (sum, count, avg, min, max,... ) Only relevant columns need to be touched Allows for very efficient compression 11

12 Data Packs and Compression 64K 64K 64K 64K Data Packs Each data pack contains 65,536 data values Compression is applied to each individual data pack The compression algorithm varies depending on data type and distribution Patent Pending Compression Algorithms Compression Results vary depending on data distribution among data packs A typical overall compression ratio seen in the field is around 10:1 Some customers have seen results have been as high as 40:1 12

13 Knowledge Grid Data Pack Nodes (DPN) A separate DPN is created for every data pack created in the database to store basic statistical information Histograms Histograms are created for every Data Pack that contains numeric data and creates 1024 MIN-MAX intervals. Character Maps (CMAPs) Every Data Pack that contains text creates a matrix that records the occurrence of every possible ASCII character This metadata layer is ~1% of the compressed volume. For example, a 1TB (raw) database would have Knowledge Grid of less than 1 GB. 13

14 A Simple Query using Knowledge Grid SELECT count (*) FROM employees WHERE salary > AND age < 65 AND job = Shipping AND city = TORONTO ; salary age job city All packs ignored 1. Find the Data Packs with salary > Find the Data Packs that contain age < Find the Data Packs that have job = Shipping 4. Find the Data Packs that have City = Toronto 5. Now we eliminate all rows that have been flagged as irrelevant. 6. Finally we have identified the data pack that needs to be decompressed Only this pack will be decompressed All packs ignored All packs ignored 14 Completely Irrelevant Suspect All values match

15 SELECT MAX(A) FROM T WHERE B>15; E E I/E I/E I/S/R denotes irrelevant/suspect/relevant; E exact computation (decompression) 15

16 HashBlockJoin (advanced usage of metadata) 16

17 MySQL Pluggable Storage Engine Architecture 17

18 Infobright and MySQL ( ) Integration provides a simple path to highly scalable data warehousing for MySQL users No new management interface to learn MySQL integration enables seamless connectivity to BI tools and MySQL drivers for ODBC, JDBC, C/C++,.NET, Perl, Python, PHP, Ruby, Tcl, etc. 18

19 19

20 Comparison of ICE and IEE ( ) Features Technical Support Warranty and Indemnification Forums and/or one-time 4-hr support pack No Silver, Gold, Platinum Included Mixed Read/Writes No Supported Infobright Loader Data Load Types Up to 50 GB/hr Text only Multi-threaded, Up to 300GB/hr Text Binary (up to 100% faster) MySQL Loader No Supported Temp Tables No Supported Platform Support (keeps changing) 64-bit Intel and AMD RHEL 5, CentOS 5, Debian 32-bit, Ubuntu 8.04, Fedora 9, Windows XP bit Intel and AMD Solaris 10, Windows Server, RHEL 5, RHEL AS, CentOS 5, Debian...

21 IEE Annual Subscriptions Subscription Hours of Service Response Time SLA # Named Client Contacts Support Access Methods Services * Two year PrePay Charge/TB One Year Annual Charge/TB Silver 9am-5pm EDT (no phone support) Severity 1=4 hr Severity 2=8 hr 1 Web Self-Service Knowledge Base Training and consulting services at published rate $12,950 $15,950 Gold 8am-6pm (ET for NA, CET Europe) Severity 1=2 hr Severity 2=6 hr 3 Phone Web Self-Service Knowledge Base Health Check Service included 10% Discount training and other consulting services Sev 1 hot fixes $15,950 $18,950 Platinum 7 x 24 x 365 Severity 1=1 hr Severity 2=4 hr 5 Phone Web Self-Service Knowledge Base Health Check Service included 20% discount training and other consulting services Sev 1 and 2 hot fixes $18,950 $21,950 Perpetual Perpetual (Same as Gold Support Level) 18% of Purchase Price not to increase by more then 10% per Yr. $40,000 * Two year subscription pricing requires two year prepay

22 Why Customers Buy Infobright (2) Less cost Capacity-based subscription model Less hardware required Less work Simple schemas and standard SQL Reduced monitoring requirements Reduced maintenance requirements Open source Fast implementation Open source community Broad platform support 22

23

24 Dominik (in 2010) 24

25 Dominik (in 2010) 25

26 Approximate SQL In such areas as, e.g., Business Intelligence and Web Analytics, there is an ongoing debate whether the answers to SQL statements have to be always exact. The same question occurs in the case of SQL-based machine learning algorithms, which are often based on heuristics, randomness and inexactness anyway. Motivation for SQL approximations is related also to such aspects as complexity of queries and data sources, dynamically changing data with a limited access, and huge data sets with a need to monitor convergence of query execution in time, regardless of whether the final answers are to be exact or not. 26

27 Data Granulation 27

28 SQL-based Machine Learning 28

29 Community Inspirations New Objectives New Schemas New Volumes New Queries New Statistics Count Distinct Self-Joins Decision Trees Contingencies New Data Types SQL Extensions Feature Extraction Data Compression 29

30 DZIĘKUJĘ ZA UWAGĘ!!!

High Performance Log Analytics: Database Considerations

High Performance Log Analytics: Database Considerations High Performance Log Analytics: Database Considerations "Once companies start down the path of log collection and management, organizations often discover there are things going on in their networks that

More information

Enterprise Edition Analytic Data Warehouse Technology White Paper

Enterprise Edition Analytic Data Warehouse Technology White Paper Enterprise Edition Analytic Data Warehouse Technology White Paper August 2008 Infobright 47 Colborne Lane, Suite 403 Toronto, Ontario M5E 1P8 Canada www.infobright.com info@infobright.com Table of Contents

More information

Big Data & the LAMP Stack: How to Boost Performance

Big Data & the LAMP Stack: How to Boost Performance Big Data & the LAMP Stack: How to Boost Performance Jeff Kibler, Infobright Community Manager Kevin Schroeder, Zend Technologies, Technology Evangelist Agenda The Machine-Generated Data Problem Benchmark:

More information

Analytic Applications With PHP and a Columnar Database

Analytic Applications With PHP and a Columnar Database AnalyticApplicationsWithPHPandaColumnarDatabase No matter where you look these days, PHP continues to gain strong use both inside and outside of the enterprise. Although developers have many choices when

More information

Data Integrity & Scalability The Value of Accuracy. Data Quality in Big Data

Data Integrity & Scalability The Value of Accuracy. Data Quality in Big Data Data Integrity & Scalability The Value of Accuracy Data Quality in Big Data Data Quality in the news 2 And some more examples... 3 High Quality Information as competitive differentiator Business today...

More information

Integrating Apache Spark with an Enterprise Data Warehouse

Integrating Apache Spark with an Enterprise Data Warehouse Integrating Apache Spark with an Enterprise Warehouse Dr. Michael Wurst, IBM Corporation Architect Spark/R/Python base Integration, In-base Analytics Dr. Toni Bollinger, IBM Corporation Senior Software

More information

Moving Large Data at a Blinding Speed for Critical Business Intelligence. A competitive advantage

Moving Large Data at a Blinding Speed for Critical Business Intelligence. A competitive advantage Moving Large Data at a Blinding Speed for Critical Business Intelligence A competitive advantage Intelligent Data In Real Time How do you detect and stop a Money Laundering transaction just about to take

More information

Inge Os Sales Consulting Manager Oracle Norway

Inge Os Sales Consulting Manager Oracle Norway Inge Os Sales Consulting Manager Oracle Norway Agenda Oracle Fusion Middelware Oracle Database 11GR2 Oracle Database Machine Oracle & Sun Agenda Oracle Fusion Middelware Oracle Database 11GR2 Oracle Database

More information

JDSU Partners with Infobright to Help the World s Largest Communications Service Providers Ensure the Highest Quality of Service

JDSU Partners with Infobright to Help the World s Largest Communications Service Providers Ensure the Highest Quality of Service JDSU Partners with Infobright to Help the World s Largest Communications Service Providers Ensure the Highest Quality of Service Overview JDSU (NASDAQ: JDSU; and TSX: JDU) innovates and markets diverse

More information

Microsoft SQL Server to Infobright Database Migration Guide

Microsoft SQL Server to Infobright Database Migration Guide Microsoft SQL Server to Infobright Database Migration Guide Infobright 47 Colborne Street, Suite 403 Toronto, Ontario M5E 1P8 Canada www.infobright.com www.infobright.org Approaches to Migrating Databases

More information

<Insert Picture Here> Best Practices for Extreme Performance with Data Warehousing on Oracle Database

<Insert Picture Here> Best Practices for Extreme Performance with Data Warehousing on Oracle Database 1 Best Practices for Extreme Performance with Data Warehousing on Oracle Database Rekha Balwada Principal Product Manager Agenda Parallel Execution Workload Management on Data Warehouse

More information

MySQL és Hadoop mint Big Data platform (SQL + NoSQL = MySQL Cluster?!)

MySQL és Hadoop mint Big Data platform (SQL + NoSQL = MySQL Cluster?!) MySQL és Hadoop mint Big Data platform (SQL + NoSQL = MySQL Cluster?!) Erdélyi Ernő, Component Soft Kft. erno@component.hu www.component.hu 2013 (c) Component Soft Ltd Leading Hadoop Vendor Copyright 2013,

More information

Optimize Oracle Business Intelligence Analytics with Oracle 12c In-Memory Database Option

Optimize Oracle Business Intelligence Analytics with Oracle 12c In-Memory Database Option Optimize Oracle Business Intelligence Analytics with Oracle 12c In-Memory Database Option Kai Yu, Senior Principal Architect Dell Oracle Solutions Engineering Dell, Inc. Abstract: By adding the In-Memory

More information

2009 Oracle Corporation 1

2009 Oracle Corporation 1 The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into any contract. It is not a commitment to deliver any material,

More information

Innovative technology for big data analytics

Innovative technology for big data analytics Technical white paper Innovative technology for big data analytics The HP Vertica Analytics Platform database provides price/performance, scalability, availability, and ease of administration Table of

More information

EMC/Greenplum Driving the Future of Data Warehousing and Analytics

EMC/Greenplum Driving the Future of Data Warehousing and Analytics EMC/Greenplum Driving the Future of Data Warehousing and Analytics EMC 2010 Forum Series 1 Greenplum Becomes the Foundation of EMC s Data Computing Division E M C A CQ U I R E S G R E E N P L U M Greenplum,

More information

Microsoft SQL Server 2008 R2 Enterprise Edition and Microsoft SharePoint Server 2010

Microsoft SQL Server 2008 R2 Enterprise Edition and Microsoft SharePoint Server 2010 Microsoft SQL Server 2008 R2 Enterprise Edition and Microsoft SharePoint Server 2010 Better Together Writer: Bill Baer, Technical Product Manager, SharePoint Product Group Technical Reviewers: Steve Peschka,

More information

Exadata Database Machine

Exadata Database Machine Database Machine Extreme Extraordinary Exciting By Craig Moir of MyDBA March 2011 Exadata & Exalogic What is it? It is Hardware and Software engineered to work together It is Extreme Performance Application-to-Disk

More information

Data Warehousing and Analytics Infrastructure at Facebook. Ashish Thusoo & Dhruba Borthakur athusoo,dhruba@facebook.com

Data Warehousing and Analytics Infrastructure at Facebook. Ashish Thusoo & Dhruba Borthakur athusoo,dhruba@facebook.com Data Warehousing and Analytics Infrastructure at Facebook Ashish Thusoo & Dhruba Borthakur athusoo,dhruba@facebook.com Overview Challenges in a Fast Growing & Dynamic Environment Data Flow Architecture,

More information

Customized Report- Big Data

Customized Report- Big Data GINeVRA Digital Research Hub Customized Report- Big Data 1 2014. All Rights Reserved. Agenda Context Challenges and opportunities Solutions Market Case studies Recommendations 2 2014. All Rights Reserved.

More information

Oracle BI EE Implementation on Netezza. Prepared by SureShot Strategies, Inc.

Oracle BI EE Implementation on Netezza. Prepared by SureShot Strategies, Inc. Oracle BI EE Implementation on Netezza Prepared by SureShot Strategies, Inc. The goal of this paper is to give an insight to Netezza architecture and implementation experience to strategize Oracle BI EE

More information

News and trends in Data Warehouse Automation, Big Data and BI. Johan Hendrickx & Dirk Vermeiren

News and trends in Data Warehouse Automation, Big Data and BI. Johan Hendrickx & Dirk Vermeiren News and trends in Data Warehouse Automation, Big Data and BI Johan Hendrickx & Dirk Vermeiren Extreme Agility from Source to Analysis DWH Appliances & DWH Automation Typical Architecture 3 What Business

More information

Maximum performance, minimal risk for data warehousing

Maximum performance, minimal risk for data warehousing SYSTEM X SERVERS SOLUTION BRIEF Maximum performance, minimal risk for data warehousing Microsoft Data Warehouse Fast Track for SQL Server 2014 on System x3850 X6 (95TB) The rapid growth of technology has

More information

Data Warehouse: Introduction

Data Warehouse: Introduction Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of base and data mining group,

More information

BIG DATA APPLIANCES. July 23, TDWI. R Sathyanarayana. Enterprise Information Management & Analytics Practice EMC Consulting

BIG DATA APPLIANCES. July 23, TDWI. R Sathyanarayana. Enterprise Information Management & Analytics Practice EMC Consulting BIG DATA APPLIANCES July 23, TDWI R Sathyanarayana Enterprise Information Management & Analytics Practice EMC Consulting 1 Big data are datasets that grow so large that they become awkward to work with

More information

Cost-Effective Business Intelligence with Red Hat and Open Source

Cost-Effective Business Intelligence with Red Hat and Open Source Cost-Effective Business Intelligence with Red Hat and Open Source Sherman Wood Director, Business Intelligence, Jaspersoft September 3, 2009 1 Agenda Introductions Quick survey What is BI?: reporting,

More information

Preview of Oracle Database 12c In-Memory Option. Copyright 2013, Oracle and/or its affiliates. All rights reserved.

Preview of Oracle Database 12c In-Memory Option. Copyright 2013, Oracle and/or its affiliates. All rights reserved. Preview of Oracle Database 12c In-Memory Option 1 The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into any

More information

Oracle Database In-Memory The Next Big Thing

Oracle Database In-Memory The Next Big Thing Oracle Database In-Memory The Next Big Thing Maria Colgan Master Product Manager #DBIM12c Why is Oracle do this Oracle Database In-Memory Goals Real Time Analytics Accelerate Mixed Workload OLTP No Changes

More information

<Insert Picture Here> Oracle Database Directions Fred Louis Principal Sales Consultant Ohio Valley Region

<Insert Picture Here> Oracle Database Directions Fred Louis Principal Sales Consultant Ohio Valley Region Oracle Database Directions Fred Louis Principal Sales Consultant Ohio Valley Region 1977 Oracle Database 30 Years of Sustained Innovation Database Vault Transparent Data Encryption

More information

Fact Sheet In-Memory Analysis

Fact Sheet In-Memory Analysis Fact Sheet In-Memory Analysis 1 Copyright Yellowfin International 2010 Contents In Memory Overview...3 Benefits...3 Agile development & rapid delivery...3 Data types supported by the In-Memory Database...4

More information

QLIKVIEW INTEGRATION TION WITH AMAZON REDSHIFT John Park Partner Engineering

QLIKVIEW INTEGRATION TION WITH AMAZON REDSHIFT John Park Partner Engineering QLIKVIEW INTEGRATION TION WITH AMAZON REDSHIFT John Park Partner Engineering June 2014 Page 1 Contents Introduction... 3 About Amazon Web Services (AWS)... 3 About Amazon Redshift... 3 QlikView on AWS...

More information

Information management software solutions White paper. Powerful data warehousing performance with IBM Red Brick Warehouse

Information management software solutions White paper. Powerful data warehousing performance with IBM Red Brick Warehouse Information management software solutions White paper Powerful data warehousing performance with IBM Red Brick Warehouse April 2004 Page 1 Contents 1 Data warehousing for the masses 2 Single step load

More information

Advanced In-Database Analytics

Advanced In-Database Analytics Advanced In-Database Analytics Tallinn, Sept. 25th, 2012 Mikko-Pekka Bertling, BDM Greenplum EMEA 1 That sounds complicated? 2 Who can tell me how best to solve this 3 What are the main mathematical functions??

More information

ICOM 6005 Database Management Systems Design. Dr. Manuel Rodríguez Martínez Electrical and Computer Engineering Department Lecture 2 August 23, 2001

ICOM 6005 Database Management Systems Design. Dr. Manuel Rodríguez Martínez Electrical and Computer Engineering Department Lecture 2 August 23, 2001 ICOM 6005 Database Management Systems Design Dr. Manuel Rodríguez Martínez Electrical and Computer Engineering Department Lecture 2 August 23, 2001 Readings Read Chapter 1 of text book ICOM 6005 Dr. Manuel

More information

Performance and Scalability Overview

Performance and Scalability Overview Performance and Scalability Overview This guide provides an overview of some of the performance and scalability capabilities of the Pentaho Business Analytics platform. PENTAHO PERFORMANCE ENGINEERING

More information

Introducing Oracle Exalytics In-Memory Machine

Introducing Oracle Exalytics In-Memory Machine Introducing Oracle Exalytics In-Memory Machine Jon Ainsworth Director of Business Development Oracle EMEA Business Analytics 1 Copyright 2011, Oracle and/or its affiliates. All rights Agenda Topics Oracle

More information

SQL Server 2005 Features Comparison

SQL Server 2005 Features Comparison Page 1 of 10 Quick Links Home Worldwide Search Microsoft.com for: Go : Home Product Information How to Buy Editions Learning Downloads Support Partners Technologies Solutions Community Previous Versions

More information

Vectorwise 3.0 Fast Answers from Hadoop. Technical white paper

Vectorwise 3.0 Fast Answers from Hadoop. Technical white paper Vectorwise 3.0 Fast Answers from Hadoop Technical white paper 1 Contents Executive Overview 2 Introduction 2 Analyzing Big Data 3 Vectorwise and Hadoop Environments 4 Vectorwise Hadoop Connector 4 Performance

More information

Gaining the Performance Edge Using a Column-Oriented Database Management System

Gaining the Performance Edge Using a Column-Oriented Database Management System Analytics in the Federal Government White paper series on how to achieve efficiency, responsiveness and transparency. Gaining the Performance Edge Using a Column-Oriented Database Management System by

More information

Introduction to Database as a Service

Introduction to Database as a Service Introduction to Database as a Service Exadata Platform Revised 8/1/13 Database as a Service (DBaaS) Starts With The Business Needs Establish an IT delivery model that reduces costs, meets demand, and fulfills

More information

SAP HANA SAP s In-Memory Database. Dr. Martin Kittel, SAP HANA Development January 16, 2013

SAP HANA SAP s In-Memory Database. Dr. Martin Kittel, SAP HANA Development January 16, 2013 SAP HANA SAP s In-Memory Database Dr. Martin Kittel, SAP HANA Development January 16, 2013 Disclaimer This presentation outlines our general product direction and should not be relied on in making a purchase

More information

Lowering the Total Cost of Ownership (TCO) of Data Warehousing

Lowering the Total Cost of Ownership (TCO) of Data Warehousing Ownership (TCO) of Data If Gordon Moore s law of performance improvement and cost reduction applies to processing power, why hasn t it worked for data warehousing? Kognitio provides solutions to business

More information

Performance and Scalability Overview

Performance and Scalability Overview Performance and Scalability Overview This guide provides an overview of some of the performance and scalability capabilities of the Pentaho Business Analytics Platform. Contents Pentaho Scalability and

More information

ORACLE BUSINESS INTELLIGENCE, ORACLE DATABASE, AND EXADATA INTEGRATION

ORACLE BUSINESS INTELLIGENCE, ORACLE DATABASE, AND EXADATA INTEGRATION ORACLE BUSINESS INTELLIGENCE, ORACLE DATABASE, AND EXADATA INTEGRATION EXECUTIVE SUMMARY Oracle business intelligence solutions are complete, open, and integrated. Key components of Oracle business intelligence

More information

How To Use Hp Vertica Ondemand

How To Use Hp Vertica Ondemand Data sheet HP Vertica OnDemand Enterprise-class Big Data analytics in the cloud Enterprise-class Big Data analytics for any size organization Vertica OnDemand Organizations today are experiencing a greater

More information

Why Big Data in the Cloud?

Why Big Data in the Cloud? Have 40 Why Big Data in the Cloud? Colin White, BI Research January 2014 Sponsored by Treasure Data TABLE OF CONTENTS Introduction The Importance of Big Data The Role of Cloud Computing Using Big Data

More information

An Oracle White Paper March 2014. Best Practices for Implementing a Data Warehouse on the Oracle Exadata Database Machine

An Oracle White Paper March 2014. Best Practices for Implementing a Data Warehouse on the Oracle Exadata Database Machine An Oracle White Paper March 2014 Best Practices for Implementing a Data Warehouse on the Oracle Exadata Database Machine Introduction... 1! Data Models for a Data Warehouse... 2! Physical Model Implementing

More information

BIG DATA: FROM HYPE TO REALITY. Leandro Ruiz Presales Partner for C&LA Teradata

BIG DATA: FROM HYPE TO REALITY. Leandro Ruiz Presales Partner for C&LA Teradata BIG DATA: FROM HYPE TO REALITY Leandro Ruiz Presales Partner for C&LA Teradata Evolution in The Use of Information Action s ACTIVATING MAKE it happen! Insights OPERATIONALIZING WHAT IS happening now? PREDICTING

More information

Cisco Data Preparation

Cisco Data Preparation Data Sheet Cisco Data Preparation Unleash your business analysts to develop the insights that drive better business outcomes, sooner, from all your data. As self-service business intelligence (BI) and

More information

Bringing Big Data into the Enterprise

Bringing Big Data into the Enterprise Bringing Big Data into the Enterprise Overview When evaluating Big Data applications in enterprise computing, one often-asked question is how does Big Data compare to the Enterprise Data Warehouse (EDW)?

More information

The Cubetree Storage Organization

The Cubetree Storage Organization The Cubetree Storage Organization Nick Roussopoulos & Yannis Kotidis Advanced Communication Technology, Inc. Silver Spring, MD 20905 Tel: 301-384-3759 Fax: 301-384-3679 {nick,kotidis}@act-us.com 1. Introduction

More information

Data Analytics The New Growth Opportunity for Software Developers

Data Analytics The New Growth Opportunity for Software Developers Data Analytics The New Growth Opportunity for Software Developers How the Vertica Analytic Database is powering the new wave of commercial software, SaaS and appliance-based applications and creating new

More information

PostgreSQL Business Intelligence & Performance Simon Riggs CTO, 2ndQuadrant PostgreSQL Major Contributor

PostgreSQL Business Intelligence & Performance Simon Riggs CTO, 2ndQuadrant PostgreSQL Major Contributor PostgreSQL Business Intelligence & Performance Simon Riggs CTO, 2ndQuadrant PostgreSQL Major Contributor The research leading to these results has received funding from the European Union's Seventh Framework

More information

How to Choose Between Hadoop, NoSQL and RDBMS

How to Choose Between Hadoop, NoSQL and RDBMS How to Choose Between Hadoop, NoSQL and RDBMS Keywords: Jean-Pierre Dijcks Oracle Redwood City, CA, USA Big Data, Hadoop, NoSQL Database, Relational Database, SQL, Security, Performance Introduction A

More information

Apache Hadoop in the Enterprise. Dr. Amr Awadallah, CTO/Founder @awadallah, aaa@cloudera.com

Apache Hadoop in the Enterprise. Dr. Amr Awadallah, CTO/Founder @awadallah, aaa@cloudera.com Apache Hadoop in the Enterprise Dr. Amr Awadallah, CTO/Founder @awadallah, aaa@cloudera.com Cloudera The Leader in Big Data Management Powered by Apache Hadoop The Leading Open Source Distribution of Apache

More information

Oracle9i Data Warehouse Review. Robert F. Edwards Dulcian, Inc.

Oracle9i Data Warehouse Review. Robert F. Edwards Dulcian, Inc. Oracle9i Data Warehouse Review Robert F. Edwards Dulcian, Inc. Agenda Oracle9i Server OLAP Server Analytical SQL Data Mining ETL Warehouse Builder 3i Oracle 9i Server Overview 9i Server = Data Warehouse

More information

Using MySQL for Big Data Advantage Integrate for Insight Sastry Vedantam sastry.vedantam@oracle.com

Using MySQL for Big Data Advantage Integrate for Insight Sastry Vedantam sastry.vedantam@oracle.com Using MySQL for Big Data Advantage Integrate for Insight Sastry Vedantam sastry.vedantam@oracle.com Agenda The rise of Big Data & Hadoop MySQL in the Big Data Lifecycle MySQL Solutions for Big Data Q&A

More information

Scaling Your Data to the Cloud

Scaling Your Data to the Cloud ZBDB Scaling Your Data to the Cloud Technical Overview White Paper POWERED BY Overview ZBDB Zettabyte Database is a new, fully managed data warehouse on the cloud, from SQream Technologies. By building

More information

Summary of Alma-OSF s Evaluation of MongoDB for Monitoring Data Heiko Sommer June 13, 2013

Summary of Alma-OSF s Evaluation of MongoDB for Monitoring Data Heiko Sommer June 13, 2013 Summary of Alma-OSF s Evaluation of MongoDB for Monitoring Data Heiko Sommer June 13, 2013 Heavily based on the presentation by Tzu-Chiang Shen, Leonel Peña ALMA Integrated Computing Team Coordination

More information

SQL Server Parallel Data Warehouse: Architecture Overview. José Blakeley Database Systems Group, Microsoft Corporation

SQL Server Parallel Data Warehouse: Architecture Overview. José Blakeley Database Systems Group, Microsoft Corporation SQL Server Parallel Data Warehouse: Architecture Overview José Blakeley Database Systems Group, Microsoft Corporation Outline Motivation MPP DBMS system architecture HW and SW Key components Query processing

More information

Accelerate Business Advantage with Dynamic Warehousing

Accelerate Business Advantage with Dynamic Warehousing Accelerate Business Advantage with Dynamic Warehousing Mark McConnell Marketing Executive, Information Management IBM Asia Pacific 2007 IBM Corporation Is Information Technology delivering? Source: IBM

More information

Microsoft Analytics Platform System. Solution Brief

Microsoft Analytics Platform System. Solution Brief Microsoft Analytics Platform System Solution Brief Contents 4 Introduction 4 Microsoft Analytics Platform System 5 Enterprise-ready Big Data 7 Next-generation performance at scale 10 Engineered for optimal

More information

Using Tableau Software with Hortonworks Data Platform

Using Tableau Software with Hortonworks Data Platform Using Tableau Software with Hortonworks Data Platform September 2013 2013 Hortonworks Inc. http:// Modern businesses need to manage vast amounts of data, and in many cases they have accumulated this data

More information

UNIVERSE DESIGN BEST PRACTICES. Roxanne Pittman, InfoSol May 8, 2014

UNIVERSE DESIGN BEST PRACTICES. Roxanne Pittman, InfoSol May 8, 2014 UNIVERSE DESIGN BEST PRACTICES Roxanne Pittman, InfoSol May 8, 2014 SEVEN PRINCIPLES OF UNIVERSAL DESIGN BY THE CENTER FOR UNIVERSAL DESIGN (CUD) NORTH CAROLINA STATE UNIVERSITY 1. Equitable use. The design

More information

Oracle Database - Engineered for Innovation. Sedat Zencirci Teknoloji Satış Danışmanlığı Direktörü Türkiye ve Orta Asya

Oracle Database - Engineered for Innovation. Sedat Zencirci Teknoloji Satış Danışmanlığı Direktörü Türkiye ve Orta Asya Oracle Database - Engineered for Innovation Sedat Zencirci Teknoloji Satış Danışmanlığı Direktörü Türkiye ve Orta Asya Oracle Database 11g Release 2 Shipping since September 2009 11.2.0.3 Patch Set now

More information

How To Handle Big Data With A Data Scientist

How To Handle Big Data With A Data Scientist III Big Data Technologies Today, new technologies make it possible to realize value from Big Data. Big data technologies can replace highly customized, expensive legacy systems with a standard solution

More information

Virtuoso and Database Scalability

Virtuoso and Database Scalability Virtuoso and Database Scalability By Orri Erling Table of Contents Abstract Metrics Results Transaction Throughput Initializing 40 warehouses Serial Read Test Conditions Analysis Working Set Effect of

More information

Big Data Analytics Platform @ Nokia

Big Data Analytics Platform @ Nokia Big Data Analytics Platform @ Nokia 1 Selecting the Right Tool for the Right Workload Yekesa Kosuru Nokia Location & Commerce Strata + Hadoop World NY - Oct 25, 2012 Agenda Big Data Analytics Platform

More information

Tableau Metadata Model

Tableau Metadata Model Tableau Metadata Model Author: Marc Reuter Senior Director, Strategic Solutions, Tableau Software March 2012 p2 Most Business Intelligence platforms fall into one of two metadata camps: either model the

More information

SQL Server 2012 Performance White Paper

SQL Server 2012 Performance White Paper Published: April 2012 Applies to: SQL Server 2012 Copyright The information contained in this document represents the current view of Microsoft Corporation on the issues discussed as of the date of publication.

More information

Retail POS Data Analytics Using MS Bi Tools. Business Intelligence White Paper

Retail POS Data Analytics Using MS Bi Tools. Business Intelligence White Paper Retail POS Data Analytics Using MS Bi Tools Business Intelligence White Paper Introduction Overview There is no doubt that businesses today are driven by data. Companies, big or small, take so much of

More information

Big Data and Its Impact on the Data Warehousing Architecture

Big Data and Its Impact on the Data Warehousing Architecture Big Data and Its Impact on the Data Warehousing Architecture Sponsored by SAP Speaker: Wayne Eckerson, Director of Research, TechTarget Wayne Eckerson: Hi my name is Wayne Eckerson, I am Director of Research

More information

Big Data Analytics with IBM Cognos BI Dynamic Query IBM Redbooks Solution Guide

Big Data Analytics with IBM Cognos BI Dynamic Query IBM Redbooks Solution Guide Big Data Analytics with IBM Cognos BI Dynamic Query IBM Redbooks Solution Guide IBM Cognos Business Intelligence (BI) helps you make better and smarter business decisions faster. Advanced visualization

More information

Analytics in the Cloud. Peter Sirota, GM Elastic MapReduce

Analytics in the Cloud. Peter Sirota, GM Elastic MapReduce Analytics in the Cloud Peter Sirota, GM Elastic MapReduce Data-Driven Decision Making Data is the new raw material for any business on par with capital, people, and labor. What is Big Data? Terabytes of

More information

Oracle Big Data, In-memory, and Exadata - One Database Engine to Rule Them All Dr.-Ing. Holger Friedrich

Oracle Big Data, In-memory, and Exadata - One Database Engine to Rule Them All Dr.-Ing. Holger Friedrich Oracle Big Data, In-memory, and Exadata - One Database Engine to Rule Them All Dr.-Ing. Holger Friedrich Agenda Introduction Old Times Exadata Big Data Oracle In-Memory Headquarters Conclusions 2 sumit

More information

Transforming the Economics of Data Warehousing with Cloud Computing

Transforming the Economics of Data Warehousing with Cloud Computing Transforming the Economics of Data Warehousing with Cloud Computing How new frontiers in on-demand computing and DBMS technology will transform business. Copyright Vertica Systems Inc. November, 2008 Table

More information

Efficient Iceberg Query Evaluation for Structured Data using Bitmap Indices

Efficient Iceberg Query Evaluation for Structured Data using Bitmap Indices Proc. of Int. Conf. on Advances in Computer Science, AETACS Efficient Iceberg Query Evaluation for Structured Data using Bitmap Indices Ms.Archana G.Narawade a, Mrs.Vaishali Kolhe b a PG student, D.Y.Patil

More information

Virtual Data Warehouse Appliances

Virtual Data Warehouse Appliances infrastructure (WX 2 and blade server Kognitio provides solutions to business problems that require acquisition, rationalization and analysis of large and/or complex data The Kognitio Technology and Data

More information

How In-Memory Data Grids Can Analyze Fast-Changing Data in Real Time

How In-Memory Data Grids Can Analyze Fast-Changing Data in Real Time SCALEOUT SOFTWARE How In-Memory Data Grids Can Analyze Fast-Changing Data in Real Time by Dr. William Bain and Dr. Mikhail Sobolev, ScaleOut Software, Inc. 2012 ScaleOut Software, Inc. 12/27/2012 T wenty-first

More information

SQL Server 2012 Parallel Data Warehouse. Solution Brief

SQL Server 2012 Parallel Data Warehouse. Solution Brief SQL Server 2012 Parallel Data Warehouse Solution Brief Published February 22, 2013 Contents Introduction... 1 Microsoft Platform: Windows Server and SQL Server... 2 SQL Server 2012 Parallel Data Warehouse...

More information

III JORNADAS DE DATA MINING

III JORNADAS DE DATA MINING III JORNADAS DE DATA MINING EN EL MARCO DE LA MAESTRÍA EN DATA MINING DE LA UNIVERSIDAD AUSTRAL PRESENTACIÓN TECNOLÓGICA IBM Alan Schcolnik, Cognos Technical Sales Team Leader, IBM Software Group. IAE

More information

<Insert Picture Here> Oracle BI Standard Edition One The Right BI Foundation for the Emerging Enterprise

<Insert Picture Here> Oracle BI Standard Edition One The Right BI Foundation for the Emerging Enterprise Oracle BI Standard Edition One The Right BI Foundation for the Emerging Enterprise Business Intelligence is the #1 Priority the most important technology in 2007 is business intelligence

More information

AtScale Intelligence Platform

AtScale Intelligence Platform AtScale Intelligence Platform PUT THE POWER OF HADOOP IN THE HANDS OF BUSINESS USERS. Connect your BI tools directly to Hadoop without compromising scale, performance, or control. TURN HADOOP INTO A HIGH-PERFORMANCE

More information

An Oracle White Paper June 2012. High Performance Connectors for Load and Access of Data from Hadoop to Oracle Database

An Oracle White Paper June 2012. High Performance Connectors for Load and Access of Data from Hadoop to Oracle Database An Oracle White Paper June 2012 High Performance Connectors for Load and Access of Data from Hadoop to Oracle Database Executive Overview... 1 Introduction... 1 Oracle Loader for Hadoop... 2 Oracle Direct

More information

ENTERPRISE EDITION ORACLE DATA SHEET KEY FEATURES AND BENEFITS ORACLE DATA INTEGRATOR

ENTERPRISE EDITION ORACLE DATA SHEET KEY FEATURES AND BENEFITS ORACLE DATA INTEGRATOR ORACLE DATA INTEGRATOR ENTERPRISE EDITION KEY FEATURES AND BENEFITS ORACLE DATA INTEGRATOR ENTERPRISE EDITION OFFERS LEADING PERFORMANCE, IMPROVED PRODUCTIVITY, FLEXIBILITY AND LOWEST TOTAL COST OF OWNERSHIP

More information

Netezza and Business Analytics Synergy

Netezza and Business Analytics Synergy Netezza Business Partner Update: November 17, 2011 Netezza and Business Analytics Synergy Shimon Nir, IBM Agenda Business Analytics / Netezza Synergy Overview Netezza overview Enabling the Business with

More information

Understanding the Benefits of IBM SPSS Statistics Server

Understanding the Benefits of IBM SPSS Statistics Server IBM SPSS Statistics Server Understanding the Benefits of IBM SPSS Statistics Server Contents: 1 Introduction 2 Performance 101: Understanding the drivers of better performance 3 Why performance is faster

More information

W H I T E P A P E R. Deriving Intelligence from Large Data Using Hadoop and Applying Analytics. Abstract

W H I T E P A P E R. Deriving Intelligence from Large Data Using Hadoop and Applying Analytics. Abstract W H I T E P A P E R Deriving Intelligence from Large Data Using Hadoop and Applying Analytics Abstract This white paper is focused on discussing the challenges facing large scale data processing and the

More information

Tableau Visual Intelligence Platform Rapid Fire Analytics for Everyone Everywhere

Tableau Visual Intelligence Platform Rapid Fire Analytics for Everyone Everywhere Tableau Visual Intelligence Platform Rapid Fire Analytics for Everyone Everywhere Agenda 1. Introductions & Objectives 2. Tableau Overview 3. Tableau Products 4. Tableau Architecture 5. Why Tableau? 6.

More information

The Next Wave of Data Management. Is Big Data The New Normal?

The Next Wave of Data Management. Is Big Data The New Normal? The Next Wave of Data Management Is Big Data The New Normal? Table of Contents Introduction 3 Separating Reality and Hype 3 Why Are Firms Making IT Investments In Big Data? 4 Trends In Data Management

More information

Minimize cost and risk for data warehousing

Minimize cost and risk for data warehousing SYSTEM X SERVERS SOLUTION BRIEF Minimize cost and risk for data warehousing Microsoft Data Warehouse Fast Track for SQL Server 2014 on System x3850 X6 (55TB) Highlights Improve time to value for your data

More information

SUN ORACLE EXADATA STORAGE SERVER

SUN ORACLE EXADATA STORAGE SERVER SUN ORACLE EXADATA STORAGE SERVER KEY FEATURES AND BENEFITS FEATURES 12 x 3.5 inch SAS or SATA disks 384 GB of Exadata Smart Flash Cache 2 Intel 2.53 Ghz quad-core processors 24 GB memory Dual InfiniBand

More information

ANALYTICS BUILT FOR INTERNET OF THINGS

ANALYTICS BUILT FOR INTERNET OF THINGS ANALYTICS BUILT FOR INTERNET OF THINGS Big Data Reporting is Out, Actionable Insights are In In recent years, it has become clear that data in itself has little relevance, it is the analysis of it that

More information

Who am I? Copyright 2014, Oracle and/or its affiliates. All rights reserved. 3

Who am I? Copyright 2014, Oracle and/or its affiliates. All rights reserved. 3 Oracle Database In-Memory Power the Real-Time Enterprise Saurabh K. Gupta Principal Technologist, Database Product Management Who am I? Principal Technologist, Database Product Management at Oracle Author

More information

Business Intelligence

Business Intelligence Business Intelligence Data Mining and Data Warehousing Dominik Ślęzak slezak@infobright.com www.infobright.com Research Interests Data Warehouses, Knowledge Discovery, Rough Sets Machine Intelligence,

More information

Elastic Application Platform for Market Data Real-Time Analytics. for E-Commerce

Elastic Application Platform for Market Data Real-Time Analytics. for E-Commerce Elastic Application Platform for Market Data Real-Time Analytics Can you deliver real-time pricing, on high-speed market data, for real-time critical for E-Commerce decisions? Market Data Analytics applications

More information

How To Make Data Streaming A Real Time Intelligence

How To Make Data Streaming A Real Time Intelligence REAL-TIME OPERATIONAL INTELLIGENCE Competitive advantage from unstructured, high-velocity log and machine Big Data 2 SQLstream: Our s-streaming products unlock the value of high-velocity unstructured log

More information

DATABASE. Pervasive PSQL Performance. Key Performance Features of Pervasive PSQL. Pervasive PSQL White Paper

DATABASE. Pervasive PSQL Performance. Key Performance Features of Pervasive PSQL. Pervasive PSQL White Paper DATABASE Pervasive PSQL Performance Key Performance Features of Pervasive PSQL Pervasive PSQL White Paper June 2008 Table of Contents Introduction... 3 Per f o r m a n c e Ba s i c s: Mo r e Me m o r y,

More information

SAP Analytics Roadmap for Small and Midsize Companies. Kevin Chan, Director, Solutions Management @ SAP

SAP Analytics Roadmap for Small and Midsize Companies. Kevin Chan, Director, Solutions Management @ SAP SAP Analytics Roadmap for Small and Midsize Companies Kevin Chan, Director, Solutions Management @ SAP A WORLD OF ACCELERATING CHANGE An emerging middle class growing to 5B Data doubling every 18 months

More information

IBM Netezza High Capacity Appliance

IBM Netezza High Capacity Appliance IBM Netezza High Capacity Appliance Petascale Data Archival, Analysis and Disaster Recovery Solutions IBM Netezza High Capacity Appliance Highlights: Allows querying and analysis of deep archival data

More information