Big Data & the LAMP Stack: How to Boost Performance

Size: px
Start display at page:

Download "Big Data & the LAMP Stack: How to Boost Performance"

Transcription

1 Big Data & the LAMP Stack: How to Boost Performance Jeff Kibler, Infobright Community Manager Kevin Schroeder, Zend Technologies, Technology Evangelist

2 Agenda The Machine-Generated Data Problem Benchmark: Methodology & Results The Infobright Approach Zend Technology

3 The Machine-Generated Data Problem

4 The Machine-Generated Data Problem Machine-generated data is the future of data management. Curt Monash, DBMS2 Machine-generated/hybrid data Weblogs Computer, network events CDRs Financial trades Sensors, RFID etc Online game data Human-generated data - input from most conventional transactions Purchase/sale Inventory Manufacturing Employment status change Rate of Growth

5 Current Database Technology: Hitting the Wall Today s database technology requires huge effort and massive hardware How Analytic Performance Issues are Typically Addressed by Pace of Data Growth Tune or upgrade existing databases 66% 75% Upgrade server hardware/processors 54% 70% Upgrade/expand storage systems 33% 60% Archive older data on other systems 30% 44% Upgrade networking infrastructure Don't Know / Unsure 4% 7% 21% 32% High Growth Low Growth 0% 20% 40% 60% 80% 100% Source: KEEPING UP WITH EVER-EXPANDING ENTERPRISE DATA By Joseph McKendrick, Research Analyst; Unisphere Research October 2010

6 What if you could achieve much higher performance analytics with no database tuning and much less hardware?

7 Benchmark: Methodology & Results

8 Benchmark Objective Evaluate Analytic Performance of Infobright/Zend versus MySQL/PHP Parameters Evaluated Loading Time Compression Monitoring Ease Query Resolution Results Software Evaluated 1. Infobright Enterprise Edition Zend Server MySQL PHP 5

9 Configurations Hardware MySQL CentOS5 64bit w/ 16GB Ram and 16 cores Indexed with Star Schema PHP 5 Infobright EE Single, flat table; no indexing Zend Server 5.0.4

10 Machine-generated Data Used National Climatic Data Center (NCDC) Sensor Data Global Daily Weather Reports 1929 to 2011 Publicly available - ftp://ftp.ncdc.noaa.gov/pub/data/gsod/ World s Largest Archive of Climate Data

11 Data Load Time and Compression Bulk Loaders: LOAD DATA INFILE Load Time Raw Size Compression DB Size Infobright 35 Min GB 20.5 : GB MySQL 100 Min GB 0.92 : GB Infobright continues to excel as data size grows to the terabyte scale

12 Monitoring During Test, Extensive Use of: Query Execution Warnings - Severe Slow Request Execution Error/Server/Access Logs - Fatal PHP Errors (Development) Code Tracing I found ZendServer to be extremely easy to use and to configure. Its functionality allows for easy debugging and monitoring of some tough queries we threw at the db. Plus, Zend Framework helped me get the code up and running very quickly. Alba Rico, Benchmark Lead

13 Queries - Baseline Baseline: Nine Non-Analytic, Simplistic Queries MyISAM (MySQL) Baseline Queries Brighthouse (Infobright) 1 SELECT COUNT(*) FROM reports; SELECT COUNT(*) FROM reports; 2 SELECT station, COUNT(*) FROM reports GROUP BY station; SELECT Station, COUNT(*) FROM reports GROUP BY station; 3 SELECT COUNT(DISTINCT station) FROM stations; SELECT COUNT(DISTINCT station) FROM reports; 4 SELECT COUNT(DISTINCT fips_id) FROM stations; SELECT COUNT(DISTINCT fips_id) FROM reports; 5 SELECT COUNT(DISTINCT country_name) FROM countries; SELECT COUNT(DISTINCT country_name) FROM reports; 6 SELECT DISTINCT full_date FROM reports; SELECT DISTINCT full_date FROM reports; 7 SELECT DISTINCT full_date, COUNT(*) FROM reports GROUP BY full_date ORDER BY full_date DESC; SELECT DISTINCT full_date, COUNT(*) FROM reports GROUP BY full_date ORDER BY full_date DESC 8 SELECT MAX(max_temp) FROM reports; SELECT MAX(max_temp) FROM reports; 9 SELECT COUNT(tornado) FROM reports WHERE tornado = TRUE; SELECT COUNT(tornado) FROM reports WHERE tornado = TRUE;

14 Queries - Analytic Simple #1: Reports a descending list of snow depths from all stations who reported a snow depth amount on a given date. Medium #1: Reports a list of station names (and the station's country) with the average precipitation and total precipitation calculated from all days that fall within a given range of dates as well as having reported snow on the same day. Only stations located above the 5000ft elevation mark are considered in this query. Medium #2: Reports the maximum max temperature, maximum precipitation, maximum wind speed, maximum wind gust, maximum standard deviation of the maximum temperature, and the total number of tornadoes reported for the state of Illinois for a given date range. Complex #1:Reports the country or countries with the maximum standard deviation in temperature for counties between the and latitude between a given range of dates. Complex #2: Reports the station, country name, precipitation total, and a description of the precipitation report for all reports between the given date where the country name begins with United'. Complex #3: Reports the station, country name, precipitation total, and a description of the precipitation report for all reports between the given date where the country name contains island. Similar to Q5, except the pattern matching on the country name is done on the end of the string versus the front.

15 Time (seconds) Query Performance IB/Zend and MySQL/PHP Performance Base #1 Base #2 Base #3 Base #4 Base #5 Base #6 Base #7 Base #8 Base #9 Simple #1 Infobright/Zend MySQL/PHP Med. #1 Med. #2 Comp. #1 Comp. #2 Comp. #3 Baseline 3-5: MySQL/PHP pulling from dimension table; Infobright/Zend pulling from fact. Baseline speeds can be drastically improved with addition of simple dimension tables in Infobright DB.

16 Conclusions Infobright + Zend = Powerful LAMP Pair Expedite Development Capabilities Monitor LAMP Performance 20X Stronger Data Compression* 280% Faster Data Loads* Superior Analytic Query Performance * Results from this benchmark

17 The Infobright Approach

18 What Infobright Delivers High performance with much less work and lower cost Faster queries without the work Fast load / High compression Low cost Fast time to production No indexes No projections or cubes No data partitioning Faster ad-hoc analytics Multi-machine Distributed Load Processor 10:1 to 40:1+ compression Less storage and servers Low-cost subscriptions 90% less administration Download in minutes Minimal configuration Implement in days

19 Column vs. Row Orientation - Use Cases ID Job Dept City # # # # # # 1 Shipping Operations Toronto 2 Receiving Operations Toronto 3 Accounting Finance Boston Row Oriented works if All the columns are needed Transactional processing is required Column-Based Storage 1 Shipping Operations Toronto 2 Receiving Operations Toronto 3 Accounting Finance Boston Column Oriented works if Only relevant columns are needed Reports are aggregates (sum, count, average, etc.) Benefits Very efficient compression Faster results for analytical queries Reading column takes similar CPU resources as reading a row

20 Data Packs and Compression 64K 64K 64K 64K Patent-Pending Compression Algorithms Data Packs Each data pack contains 65,536 data values Compression is applied to each individual data pack The compression algorithm varies depending on data type and distribution Compression Results vary depending on the distribution of data among data packs A typical overall compression ratio seen in the field is 10:1 Some customers have seen results of 40:1 and higher For example, 1TB of raw data compressed 10 to 1 would only require 100GB of disk capacity

21 Intelligence Not Hardware Creates information (metadata) about the data upon load, automatically Stores it in the Knowledge Grid (KG) KG is loaded into memory Less than 1% of compressed data size Uses the metadata when processing a query to eliminate / reduce need to access data The less data that needs to be accessed, the faster the response Sub-second responses when answered by the KG Architecture Benefits No need to partition data, create/maintain indexes, projections or tune for performance Ad-hoc queries are as fast as static queries, so users have total flexibility

22 The Knowledge Grid Knowledge Grid applies to the whole table Knowledge Nodes built for each Data Pack Information about the data DP1 DP2 DP3 DP4 DP5 DP6 Column A Column B Global knowledge String and character data Numeric data Distributions Built during LOAD Dynamic knowledge Knowledge Nodes answer the query directly, or Identify only required Data Packs, minimizing decompression, and Predict required data in advance based on workload Built per query E.g. for aggregates, joins

23 Granular Engine 1. Query received 2. Engine iterates on Knowledge Grid 3. Each pass eliminates Data Packs 4. If any Data Packs are needed to resolve query, only those are decompressed Query Q: How are my sales doing this year? Knowledge Grid 1% Results Compressed Data

24 Infobright Customer Performance Statistics Fast query response with no tuning or indexes Analytic Queries Alternative 1 Month Report (15MM events) Alternative Oracle Query Set Alternative 2+ hours with MySQL <10 seconds 43 min with SQL Server 23 seconds 10 seconds 15 minutes seconds Alternative BI Report Alternative Data Load 7 hours in Informix 17 seconds 11 hours in MySQL ISAM 11 minutes

25 Infobright and MySQL Complementary Technologies Infobright is architected on MySQL, the world s most popular open source database Provides a simple scalability path for MySQL users and OEMs No new management interface to learn MySQL integration enables seamless connectivity to BI tools and MySQL drivers for ODBC, JDBC, C/C++,.NET, Perl, Python, PHP, Ruby, Tcl, etc.

26 Analytic Use Cases Online Analytics Vertical Log / Events OEM/ Embedded Marketing/ Advertising Mobile analytics Behavioral analysis Web analytics Telecom: Network performance / management CDR analytics Capital markets Enterprise network and systems analysis Security (SIEM) High performance queries/reporting within ISV / SaaS application Stock analytics

27 Zend Technology

28 The Zend Solution CLOUD SERVICES LAYER Cloud Services SimpleCloud Web Services FRAMEWORK SERVICES LAYER Eclipse-based Auth Security RIA MVC Mobile Data Access Zend Framework Integration Session Clustering RUNTIME SERVICES LAYER Application Monitoring Cluster Management Zend Server Integration Configuration Management Application Deployment Enterprise PHP OPERATING SYSTEM

29 Monitoring

30 Get Started Zend Technologies Zend.com Download Zend Server and Zend Studio Watch recorded webinars Get training Infobrightinfobright.org Infobright.org Download ICE (Infobright Community Edition) Join the forums and learn from the experts! Infobright.com Download a free trial of Infobright Enterprise Edition, IEE Questions Jeff.kibler@infobright.com Kevin@zend.com Social Media twitter.com/zend twitter.com/infobright

High Performance Log Analytics: Database Considerations

High Performance Log Analytics: Database Considerations High Performance Log Analytics: Database Considerations "Once companies start down the path of log collection and management, organizations often discover there are things going on in their networks that

More information

Enterprise Edition Analytic Data Warehouse Technology White Paper

Enterprise Edition Analytic Data Warehouse Technology White Paper Enterprise Edition Analytic Data Warehouse Technology White Paper August 2008 Infobright 47 Colborne Lane, Suite 403 Toronto, Ontario M5E 1P8 Canada www.infobright.com info@infobright.com Table of Contents

More information

Analytic Applications With PHP and a Columnar Database

Analytic Applications With PHP and a Columnar Database AnalyticApplicationsWithPHPandaColumnarDatabase No matter where you look these days, PHP continues to gain strong use both inside and outside of the enterprise. Although developers have many choices when

More information

JDSU Partners with Infobright to Help the World s Largest Communications Service Providers Ensure the Highest Quality of Service

JDSU Partners with Infobright to Help the World s Largest Communications Service Providers Ensure the Highest Quality of Service JDSU Partners with Infobright to Help the World s Largest Communications Service Providers Ensure the Highest Quality of Service Overview JDSU (NASDAQ: JDSU; and TSX: JDU) innovates and markets diverse

More information

Data Integrity & Scalability The Value of Accuracy. Data Quality in Big Data

Data Integrity & Scalability The Value of Accuracy. Data Quality in Big Data Data Integrity & Scalability The Value of Accuracy Data Quality in Big Data Data Quality in the news 2 And some more examples... 3 High Quality Information as competitive differentiator Business today...

More information

Big Data & Cloud Computing. Faysal Shaarani

Big Data & Cloud Computing. Faysal Shaarani Big Data & Cloud Computing Faysal Shaarani Agenda Business Trends in Data What is Big Data? Traditional Computing Vs. Cloud Computing Snowflake Architecture for the Cloud Business Trends in Data Critical

More information

Top 10 Performance Tips for OBI-EE

Top 10 Performance Tips for OBI-EE Top 10 Performance Tips for OBI-EE Narasimha Rao Madhuvarsu L V Bharath Terala October 2011 Apps Associates LLC Boston New York Atlanta Germany India Premier IT Professional Service and Solution Provider

More information

Czy moŝna pogodzić działalność naukową i komercyjną? Na przykładzie historii silnika bazodanowego stworzonego przez firmę Infobright Inc.

Czy moŝna pogodzić działalność naukową i komercyjną? Na przykładzie historii silnika bazodanowego stworzonego przez firmę Infobright Inc. Czy moŝna pogodzić działalność naukową i komercyjną? Na przykładzie historii silnika bazodanowego stworzonego przez firmę Infobright Inc. Dominik Ślęzak O czym będzie Infobright dzisiaj Historia rozwoju

More information

Integrating Apache Spark with an Enterprise Data Warehouse

Integrating Apache Spark with an Enterprise Data Warehouse Integrating Apache Spark with an Enterprise Warehouse Dr. Michael Wurst, IBM Corporation Architect Spark/R/Python base Integration, In-base Analytics Dr. Toni Bollinger, IBM Corporation Senior Software

More information

Why DBMSs Matter More than Ever in the Big Data Era

Why DBMSs Matter More than Ever in the Big Data Era E-PAPER FEBRUARY 2014 Why DBMSs Matter More than Ever in the Big Data Era Having the right database infrastructure can make or break big data analytics projects. TW_1401138 Big data has become big news

More information

Transforming the Economics of Data Warehousing with Cloud Computing

Transforming the Economics of Data Warehousing with Cloud Computing Transforming the Economics of Data Warehousing with Cloud Computing How new frontiers in on-demand computing and DBMS technology will transform business. Copyright Vertica Systems Inc. November, 2008 Table

More information

Preview of Oracle Database 12c In-Memory Option. Copyright 2013, Oracle and/or its affiliates. All rights reserved.

Preview of Oracle Database 12c In-Memory Option. Copyright 2013, Oracle and/or its affiliates. All rights reserved. Preview of Oracle Database 12c In-Memory Option 1 The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into any

More information

Cost-Effective Business Intelligence with Red Hat and Open Source

Cost-Effective Business Intelligence with Red Hat and Open Source Cost-Effective Business Intelligence with Red Hat and Open Source Sherman Wood Director, Business Intelligence, Jaspersoft September 3, 2009 1 Agenda Introductions Quick survey What is BI?: reporting,

More information

Oracle BI EE Implementation on Netezza. Prepared by SureShot Strategies, Inc.

Oracle BI EE Implementation on Netezza. Prepared by SureShot Strategies, Inc. Oracle BI EE Implementation on Netezza Prepared by SureShot Strategies, Inc. The goal of this paper is to give an insight to Netezza architecture and implementation experience to strategize Oracle BI EE

More information

Scaling Your Data to the Cloud

Scaling Your Data to the Cloud ZBDB Scaling Your Data to the Cloud Technical Overview White Paper POWERED BY Overview ZBDB Zettabyte Database is a new, fully managed data warehouse on the cloud, from SQream Technologies. By building

More information

SAP HANA SAP s In-Memory Database. Dr. Martin Kittel, SAP HANA Development January 16, 2013

SAP HANA SAP s In-Memory Database. Dr. Martin Kittel, SAP HANA Development January 16, 2013 SAP HANA SAP s In-Memory Database Dr. Martin Kittel, SAP HANA Development January 16, 2013 Disclaimer This presentation outlines our general product direction and should not be relied on in making a purchase

More information

ICONICS Choosing the Correct Edition of MS SQL Server

ICONICS Choosing the Correct Edition of MS SQL Server Description: This application note aims to assist you in choosing the right edition of Microsoft SQL server for your ICONICS applications. OS Requirement: XP Win 2000, XP Pro, Server 2003, Vista, Server

More information

Oracle Database Public Cloud Services

Oracle Database Public Cloud Services Oracle Database Public Cloud Services A Strategy and Technology Overview Bob Zeolla Principal Sales Consultant Oracle Education & Research November 23, 2015 Safe Harbor Statement The following is intended

More information

Data processing goes big

Data processing goes big Test report: Integration Big Data Edition Data processing goes big Dr. Götz Güttich Integration is a powerful set of tools to access, transform, move and synchronize data. With more than 450 connectors,

More information

Enterprise Performance Tuning: Best Practices with SQL Server 2008 Analysis Services. By Ajay Goyal Consultant Scalability Experts, Inc.

Enterprise Performance Tuning: Best Practices with SQL Server 2008 Analysis Services. By Ajay Goyal Consultant Scalability Experts, Inc. Enterprise Performance Tuning: Best Practices with SQL Server 2008 Analysis Services By Ajay Goyal Consultant Scalability Experts, Inc. June 2009 Recommendations presented in this document should be thoroughly

More information

Using MySQL for Big Data Advantage Integrate for Insight Sastry Vedantam sastry.vedantam@oracle.com

Using MySQL for Big Data Advantage Integrate for Insight Sastry Vedantam sastry.vedantam@oracle.com Using MySQL for Big Data Advantage Integrate for Insight Sastry Vedantam sastry.vedantam@oracle.com Agenda The rise of Big Data & Hadoop MySQL in the Big Data Lifecycle MySQL Solutions for Big Data Q&A

More information

AtScale Intelligence Platform

AtScale Intelligence Platform AtScale Intelligence Platform PUT THE POWER OF HADOOP IN THE HANDS OF BUSINESS USERS. Connect your BI tools directly to Hadoop without compromising scale, performance, or control. TURN HADOOP INTO A HIGH-PERFORMANCE

More information

ORACLE BUSINESS INTELLIGENCE, ORACLE DATABASE, AND EXADATA INTEGRATION

ORACLE BUSINESS INTELLIGENCE, ORACLE DATABASE, AND EXADATA INTEGRATION ORACLE BUSINESS INTELLIGENCE, ORACLE DATABASE, AND EXADATA INTEGRATION EXECUTIVE SUMMARY Oracle business intelligence solutions are complete, open, and integrated. Key components of Oracle business intelligence

More information

How To Handle Big Data With A Data Scientist

How To Handle Big Data With A Data Scientist III Big Data Technologies Today, new technologies make it possible to realize value from Big Data. Big data technologies can replace highly customized, expensive legacy systems with a standard solution

More information

Big Data Analytics Platform @ Nokia

Big Data Analytics Platform @ Nokia Big Data Analytics Platform @ Nokia 1 Selecting the Right Tool for the Right Workload Yekesa Kosuru Nokia Location & Commerce Strata + Hadoop World NY - Oct 25, 2012 Agenda Big Data Analytics Platform

More information

SQL Server 2012 Performance White Paper

SQL Server 2012 Performance White Paper Published: April 2012 Applies to: SQL Server 2012 Copyright The information contained in this document represents the current view of Microsoft Corporation on the issues discussed as of the date of publication.

More information

Oracle BI Suite Enterprise Edition

Oracle BI Suite Enterprise Edition Oracle BI Suite Enterprise Edition Optimising BI EE using Oracle OLAP and Essbase Antony Heljula Technical Architect Peak Indicators Limited Agenda Overview When Do You Need a Cube Engine? Example Problem

More information

<Insert Picture Here> Oracle and/or Hadoop And what you need to know

<Insert Picture Here> Oracle and/or Hadoop And what you need to know Oracle and/or Hadoop And what you need to know Jean-Pierre Dijcks Data Warehouse Product Management Agenda Business Context An overview of Hadoop and/or MapReduce Choices, choices,

More information

Fact Sheet In-Memory Analysis

Fact Sheet In-Memory Analysis Fact Sheet In-Memory Analysis 1 Copyright Yellowfin International 2010 Contents In Memory Overview...3 Benefits...3 Agile development & rapid delivery...3 Data types supported by the In-Memory Database...4

More information

Managing Big Data with Hadoop & Vertica. A look at integration between the Cloudera distribution for Hadoop and the Vertica Analytic Database

Managing Big Data with Hadoop & Vertica. A look at integration between the Cloudera distribution for Hadoop and the Vertica Analytic Database Managing Big Data with Hadoop & Vertica A look at integration between the Cloudera distribution for Hadoop and the Vertica Analytic Database Copyright Vertica Systems, Inc. October 2009 Cloudera and Vertica

More information

QLIKVIEW INTEGRATION TION WITH AMAZON REDSHIFT John Park Partner Engineering

QLIKVIEW INTEGRATION TION WITH AMAZON REDSHIFT John Park Partner Engineering QLIKVIEW INTEGRATION TION WITH AMAZON REDSHIFT John Park Partner Engineering June 2014 Page 1 Contents Introduction... 3 About Amazon Web Services (AWS)... 3 About Amazon Redshift... 3 QlikView on AWS...

More information

SQL Server 2005 Features Comparison

SQL Server 2005 Features Comparison Page 1 of 10 Quick Links Home Worldwide Search Microsoft.com for: Go : Home Product Information How to Buy Editions Learning Downloads Support Partners Technologies Solutions Community Previous Versions

More information

Emerging Technologies Shaping the Future of Data Warehouses & Business Intelligence

Emerging Technologies Shaping the Future of Data Warehouses & Business Intelligence Emerging Technologies Shaping the Future of Data Warehouses & Business Intelligence Appliances and DW Architectures John O Brien President and Executive Architect Zukeran Technologies 1 TDWI 1 Agenda What

More information

Optimize Oracle Business Intelligence Analytics with Oracle 12c In-Memory Database Option

Optimize Oracle Business Intelligence Analytics with Oracle 12c In-Memory Database Option Optimize Oracle Business Intelligence Analytics with Oracle 12c In-Memory Database Option Kai Yu, Senior Principal Architect Dell Oracle Solutions Engineering Dell, Inc. Abstract: By adding the In-Memory

More information

Vectorwise 3.0 Fast Answers from Hadoop. Technical white paper

Vectorwise 3.0 Fast Answers from Hadoop. Technical white paper Vectorwise 3.0 Fast Answers from Hadoop Technical white paper 1 Contents Executive Overview 2 Introduction 2 Analyzing Big Data 3 Vectorwise and Hadoop Environments 4 Vectorwise Hadoop Connector 4 Performance

More information

Enterprise and Standard Feature Compare

Enterprise and Standard Feature Compare www.blytheco.com Enterprise and Standard Feature Compare SQL Server 2008 Enterprise SQL Server 2008 Enterprise is a comprehensive data platform for running mission critical online transaction processing

More information

<Insert Picture Here> Extending Hyperion BI with the Oracle BI Server

<Insert Picture Here> Extending Hyperion BI with the Oracle BI Server Extending Hyperion BI with the Oracle BI Server Mark Ostroff Sr. BI Solutions Consultant Agenda Hyperion BI versus Hyperion BI with OBI Server Benefits of using Hyperion BI with the

More information

Parallel Data Warehouse

Parallel Data Warehouse MICROSOFT S ANALYTICS SOLUTIONS WITH PARALLEL DATA WAREHOUSE Parallel Data Warehouse Stefan Cronjaeger Microsoft May 2013 AGENDA PDW overview Columnstore and Big Data Business Intellignece Project Ability

More information

Driving Peak Performance. 2013 IBM Corporation

Driving Peak Performance. 2013 IBM Corporation Driving Peak Performance 1 Session 2: Driving Peak Performance Abstract We know you want the fastest performance possible for your deployments, and yet that relies on many choices across data storage,

More information

2009 Oracle Corporation 1

2009 Oracle Corporation 1 The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into any contract. It is not a commitment to deliver any material,

More information

College of Engineering, Technology, and Computer Science

College of Engineering, Technology, and Computer Science College of Engineering, Technology, and Computer Science Design and Implementation of Cloud-based Data Warehousing In partial fulfillment of the requirements for the Degree of Master of Science in Technology

More information

Zynga Analytics Leveraging Big Data to Make Games More Fun and Social

Zynga Analytics Leveraging Big Data to Make Games More Fun and Social Connecting the World Through Games Zynga Analytics Leveraging Big Data to Make Games More Fun and Social Daniel McCaffrey General Manager, Platform and Analytics Engineering World s leading social game

More information

Performance and Scalability Overview

Performance and Scalability Overview Performance and Scalability Overview This guide provides an overview of some of the performance and scalability capabilities of the Pentaho Business Analytics Platform. Contents Pentaho Scalability and

More information

An Oracle White Paper June 2012. High Performance Connectors for Load and Access of Data from Hadoop to Oracle Database

An Oracle White Paper June 2012. High Performance Connectors for Load and Access of Data from Hadoop to Oracle Database An Oracle White Paper June 2012 High Performance Connectors for Load and Access of Data from Hadoop to Oracle Database Executive Overview... 1 Introduction... 1 Oracle Loader for Hadoop... 2 Oracle Direct

More information

ORACLE OLAP. Oracle OLAP is embedded in the Oracle Database kernel and runs in the same database process

ORACLE OLAP. Oracle OLAP is embedded in the Oracle Database kernel and runs in the same database process ORACLE OLAP KEY FEATURES AND BENEFITS FAST ANSWERS TO TOUGH QUESTIONS EASILY KEY FEATURES & BENEFITS World class analytic engine Superior query performance Simple SQL access to advanced analytics Enhanced

More information

Data Analytics The New Growth Opportunity for Software Developers

Data Analytics The New Growth Opportunity for Software Developers Data Analytics The New Growth Opportunity for Software Developers How the Vertica Analytic Database is powering the new wave of commercial software, SaaS and appliance-based applications and creating new

More information

Well packaged sets of preinstalled, integrated, and optimized software on select hardware in the form of engineered systems and appliances

Well packaged sets of preinstalled, integrated, and optimized software on select hardware in the form of engineered systems and appliances INSIGHT Oracle's All- Out Assault on the Big Data Market: Offering Hadoop, R, Cubes, and Scalable IMDB in Familiar Packages Carl W. Olofson IDC OPINION Global Headquarters: 5 Speen Street Framingham, MA

More information

Performance and Scalability Overview

Performance and Scalability Overview Performance and Scalability Overview This guide provides an overview of some of the performance and scalability capabilities of the Pentaho Business Analytics platform. PENTAHO PERFORMANCE ENGINEERING

More information

Inge Os Sales Consulting Manager Oracle Norway

Inge Os Sales Consulting Manager Oracle Norway Inge Os Sales Consulting Manager Oracle Norway Agenda Oracle Fusion Middelware Oracle Database 11GR2 Oracle Database Machine Oracle & Sun Agenda Oracle Fusion Middelware Oracle Database 11GR2 Oracle Database

More information

SQL Server Business Intelligence on HP ProLiant DL785 Server

SQL Server Business Intelligence on HP ProLiant DL785 Server SQL Server Business Intelligence on HP ProLiant DL785 Server By Ajay Goyal www.scalabilityexperts.com Mike Fitzner Hewlett Packard www.hp.com Recommendations presented in this document should be thoroughly

More information

So What s the Big Deal?

So What s the Big Deal? So What s the Big Deal? Presentation Agenda Introduction What is Big Data? So What is the Big Deal? Big Data Technologies Identifying Big Data Opportunities Conducting a Big Data Proof of Concept Big Data

More information

A Novel Cloud Based Elastic Framework for Big Data Preprocessing

A Novel Cloud Based Elastic Framework for Big Data Preprocessing School of Systems Engineering A Novel Cloud Based Elastic Framework for Big Data Preprocessing Omer Dawelbeit and Rachel McCrindle October 21, 2014 University of Reading 2008 www.reading.ac.uk Overview

More information

hmetrix Revolutionizing Healthcare Analytics with Vertica & Tableau

hmetrix Revolutionizing Healthcare Analytics with Vertica & Tableau Powered by Vertica Solution Series in conjunction with: hmetrix Revolutionizing Healthcare Analytics with Vertica & Tableau The cost of healthcare in the US continues to escalate. Consumers, employers,

More information

Oracle Database Cloud Service Rick Greenwald, Director, Product Management, Database Cloud

Oracle Database Cloud Service Rick Greenwald, Director, Product Management, Database Cloud Oracle Database Cloud Service Rick Greenwald, Director, Product Management, Database Cloud Agenda Oracle Cloud Database Service Overview Cloud taxonomy What is the Database Cloud Service? Architecture

More information

Two-Phase Data Warehouse Optimized for Data Mining

Two-Phase Data Warehouse Optimized for Data Mining Two-Phase Data Warehouse Optimized for Data Mining Balázs Rácz András Lukács Csaba István Sidló András A. Benczúr Data Mining and Web Search Research Group Computer and Automation Research Institute Hungarian

More information

Using distributed technologies to analyze Big Data

Using distributed technologies to analyze Big Data Using distributed technologies to analyze Big Data Abhijit Sharma Innovation Lab BMC Software 1 Data Explosion in Data Center Performance / Time Series Data Incoming data rates ~Millions of data points/

More information

Introducing Oracle Exalytics In-Memory Machine

Introducing Oracle Exalytics In-Memory Machine Introducing Oracle Exalytics In-Memory Machine Jon Ainsworth Director of Business Development Oracle EMEA Business Analytics 1 Copyright 2011, Oracle and/or its affiliates. All rights Agenda Topics Oracle

More information

Jun Liu, Senior Software Engineer Bianny Bian, Engineering Manager SSG/STO/PAC

Jun Liu, Senior Software Engineer Bianny Bian, Engineering Manager SSG/STO/PAC Jun Liu, Senior Software Engineer Bianny Bian, Engineering Manager SSG/STO/PAC Agenda Quick Overview of Impala Design Challenges of an Impala Deployment Case Study: Use Simulation-Based Approach to Design

More information

SAP Analytics Roadmap for Small and Midsize Companies. Kevin Chan, Director, Solutions Management @ SAP

SAP Analytics Roadmap for Small and Midsize Companies. Kevin Chan, Director, Solutions Management @ SAP SAP Analytics Roadmap for Small and Midsize Companies Kevin Chan, Director, Solutions Management @ SAP A WORLD OF ACCELERATING CHANGE An emerging middle class growing to 5B Data doubling every 18 months

More information

SQream Technologies Ltd - Confiden7al

SQream Technologies Ltd - Confiden7al SQream Technologies Ltd - Confiden7al 1 Ge#ng Big Data Done On a GPU- Based Database Ori Netzer VP Product 26- Mar- 14 Analy7cs Performance - 3 TB, 18 Billion records SQream Database 400x More Cost Efficient!

More information

Microsoft Analytics Platform System. Solution Brief

Microsoft Analytics Platform System. Solution Brief Microsoft Analytics Platform System Solution Brief Contents 4 Introduction 4 Microsoft Analytics Platform System 5 Enterprise-ready Big Data 7 Next-generation performance at scale 10 Engineered for optimal

More information

BIG DATA CAN DRIVE THE BUSINESS AND IT TO EVOLVE AND ADAPT RALPH KIMBALL BUSSUM 2014

BIG DATA CAN DRIVE THE BUSINESS AND IT TO EVOLVE AND ADAPT RALPH KIMBALL BUSSUM 2014 BIG DATA CAN DRIVE THE BUSINESS AND IT TO EVOLVE AND ADAPT RALPH KIMBALL BUSSUM 2014 Ralph Kimball Associates 2014 The Data Warehouse Mission Identify all possible enterprise data assets Select those assets

More information

MySQL Enterprise Monitor

MySQL Enterprise Monitor MySQL Enterprise Monitor Lynn Ferrante Principal Sales Consultant 1 Program Agenda MySQL Enterprise Monitor Overview Architecture Roles Demo 2 Overview 3 MySQL Enterprise Edition Highest Levels of Security,

More information

UQC103S1 UFCE47-20-1. Systems Development. uqc103s/ufce47-20-1 PHP-mySQL 1

UQC103S1 UFCE47-20-1. Systems Development. uqc103s/ufce47-20-1 PHP-mySQL 1 UQC103S1 UFCE47-20-1 Systems Development uqc103s/ufce47-20-1 PHP-mySQL 1 Who? Email: uqc103s1@uwe.ac.uk Web Site www.cems.uwe.ac.uk/~jedawson www.cems.uwe.ac.uk/~jtwebb/uqc103s1/ uqc103s/ufce47-20-1 PHP-mySQL

More information

Microsoft SQL Server 2008 R2 Enterprise Edition and Microsoft SharePoint Server 2010

Microsoft SQL Server 2008 R2 Enterprise Edition and Microsoft SharePoint Server 2010 Microsoft SQL Server 2008 R2 Enterprise Edition and Microsoft SharePoint Server 2010 Better Together Writer: Bill Baer, Technical Product Manager, SharePoint Product Group Technical Reviewers: Steve Peschka,

More information

<Insert Picture Here> Enhancing the Performance and Analytic Content of the Data Warehouse Using Oracle OLAP Option

<Insert Picture Here> Enhancing the Performance and Analytic Content of the Data Warehouse Using Oracle OLAP Option Enhancing the Performance and Analytic Content of the Data Warehouse Using Oracle OLAP Option The following is intended to outline our general product direction. It is intended for

More information

News and trends in Data Warehouse Automation, Big Data and BI. Johan Hendrickx & Dirk Vermeiren

News and trends in Data Warehouse Automation, Big Data and BI. Johan Hendrickx & Dirk Vermeiren News and trends in Data Warehouse Automation, Big Data and BI Johan Hendrickx & Dirk Vermeiren Extreme Agility from Source to Analysis DWH Appliances & DWH Automation Typical Architecture 3 What Business

More information

Introduction to Database as a Service

Introduction to Database as a Service Introduction to Database as a Service Exadata Platform Revised 8/1/13 Database as a Service (DBaaS) Starts With The Business Needs Establish an IT delivery model that reduces costs, meets demand, and fulfills

More information

Overview: X5 Generation Database Machines

Overview: X5 Generation Database Machines Overview: X5 Generation Database Machines Spend Less by Doing More Spend Less by Paying Less Rob Kolb Exadata X5-2 Exadata X4-8 SuperCluster T5-8 SuperCluster M6-32 Big Memory Machine Oracle Exadata Database

More information

Why Big Data in the Cloud?

Why Big Data in the Cloud? Have 40 Why Big Data in the Cloud? Colin White, BI Research January 2014 Sponsored by Treasure Data TABLE OF CONTENTS Introduction The Importance of Big Data The Role of Cloud Computing Using Big Data

More information

Oracle Database - Engineered for Innovation. Sedat Zencirci Teknoloji Satış Danışmanlığı Direktörü Türkiye ve Orta Asya

Oracle Database - Engineered for Innovation. Sedat Zencirci Teknoloji Satış Danışmanlığı Direktörü Türkiye ve Orta Asya Oracle Database - Engineered for Innovation Sedat Zencirci Teknoloji Satış Danışmanlığı Direktörü Türkiye ve Orta Asya Oracle Database 11g Release 2 Shipping since September 2009 11.2.0.3 Patch Set now

More information

Application Performance Management for Enterprise Applications

Application Performance Management for Enterprise Applications Application Performance Management for Enterprise Applications White Paper from ManageEngine Web: Email: appmanager-support@manageengine.com Table of Contents 1. Introduction 2. Types of applications used

More information

Sawmill Log Analyzer Best Practices!! Page 1 of 6. Sawmill Log Analyzer Best Practices

Sawmill Log Analyzer Best Practices!! Page 1 of 6. Sawmill Log Analyzer Best Practices Sawmill Log Analyzer Best Practices!! Page 1 of 6 Sawmill Log Analyzer Best Practices! Sawmill Log Analyzer Best Practices!! Page 2 of 6 This document describes best practices for the Sawmill universal

More information

Scaling Objectivity Database Performance with Panasas Scale-Out NAS Storage

Scaling Objectivity Database Performance with Panasas Scale-Out NAS Storage White Paper Scaling Objectivity Database Performance with Panasas Scale-Out NAS Storage A Benchmark Report August 211 Background Objectivity/DB uses a powerful distributed processing architecture to manage

More information

Bryan Tuft Sr. Sales Consultant Global Embedded Business Unit bryan.tuft@oracle.com

Bryan Tuft Sr. Sales Consultant Global Embedded Business Unit bryan.tuft@oracle.com Bryan Tuft Sr. Sales Consultant Global Embedded Business Unit bryan.tuft@oracle.com Agenda Oracle Approach Embedded Databases TimesTen In-Memory Database Snapshots Q&A Real-Time Infrastructure Challenges

More information

SAP HANA. SAP HANA Performance Efficient Speed and Scale-Out for Real-Time Business Intelligence

SAP HANA. SAP HANA Performance Efficient Speed and Scale-Out for Real-Time Business Intelligence SAP HANA SAP HANA Performance Efficient Speed and Scale-Out for Real-Time Business Intelligence SAP HANA Performance Table of Contents 3 Introduction 4 The Test Environment Database Schema Test Data System

More information

Big Data Use Case. How Rackspace is using Private Cloud for Big Data. Bryan Thompson. May 8th, 2013

Big Data Use Case. How Rackspace is using Private Cloud for Big Data. Bryan Thompson. May 8th, 2013 Big Data Use Case How Rackspace is using Private Cloud for Big Data Bryan Thompson May 8th, 2013 Our Big Data Problem Consolidate all monitoring data for reporting and analytical purposes. Every device

More information

White Paper. Optimizing the Performance Of MySQL Cluster

White Paper. Optimizing the Performance Of MySQL Cluster White Paper Optimizing the Performance Of MySQL Cluster Table of Contents Introduction and Background Information... 2 Optimal Applications for MySQL Cluster... 3 Identifying the Performance Issues.....

More information

ITG Software Engineering

ITG Software Engineering IBM WebSphere Administration 8.5 Course ID: Page 1 Last Updated 12/15/2014 WebSphere Administration 8.5 Course Overview: This 5 Day course will cover the administration and configuration of WebSphere 8.5.

More information

<Insert Picture Here> Oracle Database Directions Fred Louis Principal Sales Consultant Ohio Valley Region

<Insert Picture Here> Oracle Database Directions Fred Louis Principal Sales Consultant Ohio Valley Region Oracle Database Directions Fred Louis Principal Sales Consultant Ohio Valley Region 1977 Oracle Database 30 Years of Sustained Innovation Database Vault Transparent Data Encryption

More information

Scalability and Performance Report - Analyzer 2007

Scalability and Performance Report - Analyzer 2007 - Analyzer 2007 Executive Summary Strategy Companion s Analyzer 2007 is enterprise Business Intelligence (BI) software that is designed and engineered to scale to the requirements of large global deployments.

More information

CitusDB Architecture for Real-Time Big Data

CitusDB Architecture for Real-Time Big Data CitusDB Architecture for Real-Time Big Data CitusDB Highlights Empowers real-time Big Data using PostgreSQL Scales out PostgreSQL to support up to hundreds of terabytes of data Fast parallel processing

More information

Moving From Hadoop to Spark

Moving From Hadoop to Spark + Moving From Hadoop to Spark Sujee Maniyam Founder / Principal @ www.elephantscale.com sujee@elephantscale.com Bay Area ACM meetup (2015-02-23) + HI, Featured in Hadoop Weekly #109 + About Me : Sujee

More information

Oracle MulBtenant Customer Success Stories

Oracle MulBtenant Customer Success Stories Oracle MulBtenant Customer Success Stories Mul1tenant Customer Sessions at Customer Session Venue Title SAS Cigna CON6328 Mon 2:45pm SAS SoluBons OnDemand: A MulBtenant Cloud Offering CON6379 Mon 5:15pm

More information

Data Warehouse: Introduction

Data Warehouse: Introduction Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of base and data mining group,

More information

Apache Kylin Introduction Dec 8, 2014 @ApacheKylin

Apache Kylin Introduction Dec 8, 2014 @ApacheKylin Apache Kylin Introduction Dec 8, 2014 @ApacheKylin Luke Han Sr. Product Manager lukhan@ebay.com @lukehq Yang Li Architect & Tech Leader yangli9@ebay.com Agenda What s Apache Kylin? Tech Highlights Performance

More information

Database Scalability and Oracle 12c

Database Scalability and Oracle 12c Database Scalability and Oracle 12c Marcelle Kratochvil CTO Piction ACE Director All Data/Any Data marcelle@piction.com Warning I will be covering topics and saying things that will cause a rethink in

More information

In-Memory Analytics: A comparison between Oracle TimesTen and Oracle Essbase

In-Memory Analytics: A comparison between Oracle TimesTen and Oracle Essbase In-Memory Analytics: A comparison between Oracle TimesTen and Oracle Essbase Agenda Introduction Why In-Memory? Options for In-Memory in Oracle Products - Times Ten - Essbase Comparison - Essbase Vs Times

More information

Hortonworks & SAS. Analytics everywhere. Page 1. Hortonworks Inc. 2011 2014. All Rights Reserved

Hortonworks & SAS. Analytics everywhere. Page 1. Hortonworks Inc. 2011 2014. All Rights Reserved Hortonworks & SAS Analytics everywhere. Page 1 A change in focus. A shift in Advertising From mass branding A shift in Financial Services From Educated Investing A shift in Healthcare From mass treatment

More information

Performance And Scalability In Oracle9i And SQL Server 2000

Performance And Scalability In Oracle9i And SQL Server 2000 Performance And Scalability In Oracle9i And SQL Server 2000 Presented By : Phathisile Sibanda Supervisor : John Ebden 1 Presentation Overview Project Objectives Motivation -Why performance & Scalability

More information

Netezza and Business Analytics Synergy

Netezza and Business Analytics Synergy Netezza Business Partner Update: November 17, 2011 Netezza and Business Analytics Synergy Shimon Nir, IBM Agenda Business Analytics / Netezza Synergy Overview Netezza overview Enabling the Business with

More information

PLATFORA INTERACTIVE, IN-MEMORY BUSINESS INTELLIGENCE FOR HADOOP

PLATFORA INTERACTIVE, IN-MEMORY BUSINESS INTELLIGENCE FOR HADOOP PLATFORA INTERACTIVE, IN-MEMORY BUSINESS INTELLIGENCE FOR HADOOP Your business is swimming in data, and your business analysts want to use it to answer the questions of today and tomorrow. YOU LOOK TO

More information

Enterprise Architectures for Large Tiled Basemap Projects. Tommy Fauvell

Enterprise Architectures for Large Tiled Basemap Projects. Tommy Fauvell Enterprise Architectures for Large Tiled Basemap Projects Tommy Fauvell Tommy Fauvell Senior Technical Analyst Esri Professional Services Washington D.C Regional Office Project Technical Lead: - Responsible

More information

X3 Intelligence Reporting

X3 Intelligence Reporting X3 Intelligence Reporting Frequently asked questions Sage X3 Intelligence Reporting customers Contents 1. Overview of Sage Intelligence Reporting 3 2. Comparison of Sage Intelligence Reporting and Sage

More information

PHP on IBM i: What s New with Zend Server 5 for IBM i

PHP on IBM i: What s New with Zend Server 5 for IBM i PHP on IBM i: What s New with Zend Server 5 for IBM i Mike Pavlak Solutions Consultant mike.p@zend.com (815) 722 3454 Function Junction Audience Used PHP in Zend Core/Platform New to Zend PHP Looking to

More information

The Data Access Handbook

The Data Access Handbook The Data Access Handbook Achieving Optimal Database Application Performance and Scalability John Goodson and Robert A. Steward PRENTICE HALL Upper Saddle River, NJ Boston Indianapolis San Francisco New

More information

In-Memory Data Management for Enterprise Applications

In-Memory Data Management for Enterprise Applications In-Memory Data Management for Enterprise Applications Jens Krueger Senior Researcher and Chair Representative Research Group of Prof. Hasso Plattner Hasso Plattner Institute for Software Engineering University

More information

MySQL Enterprise Edition Most secure, scalable MySQL Database, Online Backup, Development/Monitoring Tools, backed by Oracle Premier Lifetime Support

MySQL Enterprise Edition Most secure, scalable MySQL Database, Online Backup, Development/Monitoring Tools, backed by Oracle Premier Lifetime Support MySQL Enterprise Edition Most secure, scalable MySQL Database, Online Backup, Development/Monitoring Tools, backed by Oracle Premier Lifetime Support Elevator Pitch With 12 millions of active installs,

More information

About Me: Brent Ozar. Perfmon and Profiler 101

About Me: Brent Ozar. Perfmon and Profiler 101 Perfmon and Profiler 101 2008 Quest Software, Inc. ALL RIGHTS RESERVED. About Me: Brent Ozar SQL Server Expert for Quest Software Former SQL DBA Managed >80tb SAN, VMware Dot-com-crash experience Specializes

More information