Czy moŝna pogodzić działalność naukową i komercyjną? Na przykładzie historii silnika bazodanowego stworzonego przez firmę Infobright Inc.
|
|
- Noah Boone
- 8 years ago
- Views:
Transcription
1 Czy moŝna pogodzić działalność naukową i komercyjną? Na przykładzie historii silnika bazodanowego stworzonego przez firmę Infobright Inc. Dominik Ślęzak
2 O czym będzie Infobright dzisiaj Historia rozwoju Szczegóły technologiczne Jak się w tym nie pogubić? 2
3 Infobright Technology Innovation First commercial open source data warehouse (analytical database engine) Community & Enterprise Editions Lowest cost technology in the market Simplest & easiest to build & manage Powerful & scalable MySQL Technology unlike anything in the industry Cool Vendor in Data Management and Integration High Performance Data Warehousing Partner of the Year STARTUP TO WATCH Partner of the Year 2009 Infobright: Economic Data Warehouse Choice Strong Momentum & Mature Product Release 3.x generally available +100 customers in 10 Countries +40 Partners on 6 continents A vibrant open source community +1 million visitors 20,000 downloads 3
4 Infobright: Designed for Analytics Best Fit True Analytics Good Fit Standard Analytics Poor Fit OLTP Analytics Environment Questions about the data Aggregates: sums, counts... Some requests for data, optimized by Knowledge Grid Primarily data retrieval & mixed read/write activity (but we handle mixed read/writes in analytics) Analytical Analytical and Transaction applications ad hoc queries standard queries & operational reporting Sample Queries How many new customers were identified recently? What was the largest sale made for item A? Report the sales figures from California for X campaign for Jan 2008 SELECT * Go get the data Notes Knowledge Grid is used heavily to resolve the query almost no I/O Knowledge Grid is used to reduce I/O times by optimizing data access It does not use the advantages of a columnar database
5 Why Customers Buy Infobright (1) Fast query response with no tuning Customer s Test Alternative Infobright Analytical queries 2+ hours with MySQL <10 seconds Query (AND Left Join) 26.4 seconds in Oracle.02 seconds Oracle query set 10 seconds 15 minutes seconds BI report 7 hours in Informix 17 seconds Data load 11 hours in MySQL ISAM 11 minutes Fast and consistent data load speed as database grows. Up to 300GB/hour on a single server 5
6 Online Marketing Online Marketing Need Data Volumes online marketing data generates staggering volumes of information that needs to be captured and analyzed Real-time Response needed for rapid response to campaigns and specific targeting Deep Ad Hoc Analysis differentiation through deeper analytics across more granular data Low Maintenance insane growth rates, tight resourcing means a solution with as little maintenance and cost as possible Infobright s Solution Support for storing up to 50 TBs of data compressed, with up to 40:1 compression, providing the lowest hardware footprint Blazing response times for deep business analytic queries on TBs of granular clickstream data Lowest total cost of ownership in data warehousing simple to implement & manage, requiring little administration Customer experience, including: 6
7 Telecommunications Telecommunications Need Data Volumes are simply staggering; call center analysis, call data records (CDR), customer billing data, network data incl. alarms, alerts and events Avoiding Data Silos storing vast amounts of data and ensuring timely access typically results in multiple development projects, creating unconnected silos of data Highly Complex Ad Hoc Analysis vast cross-section of TELCO applications means widely differing requirements & highly complex, ad hoc queries 7 Infobright s Solution Support for storing up to 50 TBs of data compressed, with up to 40:1 compression, providing the lowest hardware footprint Keeps data together while providing fast response times on TBs of varied data Complex business analytics without the maintenance overhead provides the lowest TCO in data warehousing Customer experience, including:
8 Financial Services Financial Services Need Response Times Capital markets firms depend on the speed at which they can analyze and process trades Risk Management Must prevent risk and identify fraud in real-time to maintain the business Regulatory Compliance Compliance with rising regulatory requirements, directly impacts their ability to stay in business: Trading compliance (Reg NMS, MiFID) Corporate governance (Sarbanes-Oxley, Basel II) Infobright s Solution Fast analytic response times regardless of TBs of ever-increasing trade and tick data volumes Detailed business analytics to spot inconsistencies across huge volumes of live and historical data, prevents losses of money and reputation Reporting accuracy regardless of rising volumes helps maintain compliance Customer experience, including: 8
9 Dominik (back to 2005) 9
10 Challenges ( ) Data tables are incomparably larger than on this illustration but the users want the query speed to stay within minutes Data may grow very quickly but the users expect the data warehouse to adjust to the new data very quickly as well Types of queries are hard to predict and the users expect that arbitrary queries will be executed reasonably quickly O u tlo o k T em p. H u m id. W in d S p o rt? 1 S u n n y H o t H ig h W e ak N o 2 S u n n y H o t H ig h S tro n g N o 3 O v ercast H o t H ig h W e ak Y es 4 R ain M ild H ig h W e ak Y es 5 R ain C o ld N o rm al W e ak Y es 6 R ain C o ld N o rm al S tro n g N o 7 O v ercast C o ld N o rm al S tro n g Y es 8 S u n n y M ild H ig h W e ak N o 9 S u n n y C o ld N o rm al W e ak Y es 1 0 R ain M ild N o rm al W e ak Y es 1 1 S u n n y M ild N o rm al S tro n g Y es 1 2 O v ercast M ild H ig h S tro n g Y es 1 3 O v ercast H o t N o rm al W e ak Y es 1 4 R ain M ild H ig h S tro n g N o Current data warehouse solutions (MPP, indices, sorting, partitioning, etc.) are not flexible and often require special hardware 10
11 Column vs. Row-Oriented EMP_ID FNAME LNAME SALARY 1 Moe Howard Curly Joe Larry Fine 9000 Row Oriented (1,Moe,Howard,10000; 2,Curly, Joe,12000; 3,Larry,Fine,9000;) Works well if all the columns are needed for every query. Efficient for transactional processing if all the data for the row is available Column Oriented (1,2,3; Moe,Curly,Larry; Howard,Joe,Fine; 10000,12000,9000;) Works well with aggregate results (sum, count, avg, min, max,... ) Only relevant columns need to be touched Allows for very efficient compression 11
12 Data Packs and Compression 64K 64K 64K 64K Data Packs Each data pack contains 65,536 data values Compression is applied to each individual data pack The compression algorithm varies depending on data type and distribution Patent Pending Compression Algorithms Compression Results vary depending on data distribution among data packs A typical overall compression ratio seen in the field is around 10:1 Some customers have seen results have been as high as 40:1 12
13 Knowledge Grid Data Pack Nodes (DPN) A separate DPN is created for every data pack created in the database to store basic statistical information Histograms Histograms are created for every Data Pack that contains numeric data and creates 1024 MIN-MAX intervals. Character Maps (CMAPs) Every Data Pack that contains text creates a matrix that records the occurrence of every possible ASCII character This metadata layer is ~1% of the compressed volume. For example, a 1TB (raw) database would have Knowledge Grid of less than 1 GB. 13
14 A Simple Query using Knowledge Grid SELECT count (*) FROM employees WHERE salary > AND age < 65 AND job = Shipping AND city = TORONTO ; salary age job city All packs ignored 1. Find the Data Packs with salary > Find the Data Packs that contain age < Find the Data Packs that have job = Shipping 4. Find the Data Packs that have City = Toronto 5. Now we eliminate all rows that have been flagged as irrelevant. 6. Finally we have identified the data pack that needs to be decompressed Only this pack will be decompressed All packs ignored All packs ignored 14 Completely Irrelevant Suspect All values match
15 SELECT MAX(A) FROM T WHERE B>15; E E I/E I/E I/S/R denotes irrelevant/suspect/relevant; E exact computation (decompression) 15
16 HashBlockJoin (advanced usage of metadata) 16
17 MySQL Pluggable Storage Engine Architecture 17
18 Infobright and MySQL ( ) Integration provides a simple path to highly scalable data warehousing for MySQL users No new management interface to learn MySQL integration enables seamless connectivity to BI tools and MySQL drivers for ODBC, JDBC, C/C++,.NET, Perl, Python, PHP, Ruby, Tcl, etc. 18
19 19
20 Comparison of ICE and IEE ( ) Features Technical Support Warranty and Indemnification Forums and/or one-time 4-hr support pack No Silver, Gold, Platinum Included Mixed Read/Writes No Supported Infobright Loader Data Load Types Up to 50 GB/hr Text only Multi-threaded, Up to 300GB/hr Text Binary (up to 100% faster) MySQL Loader No Supported Temp Tables No Supported Platform Support (keeps changing) 64-bit Intel and AMD RHEL 5, CentOS 5, Debian 32-bit, Ubuntu 8.04, Fedora 9, Windows XP bit Intel and AMD Solaris 10, Windows Server, RHEL 5, RHEL AS, CentOS 5, Debian...
21 IEE Annual Subscriptions Subscription Hours of Service Response Time SLA # Named Client Contacts Support Access Methods Services * Two year PrePay Charge/TB One Year Annual Charge/TB Silver 9am-5pm EDT (no phone support) Severity 1=4 hr Severity 2=8 hr 1 Web Self-Service Knowledge Base Training and consulting services at published rate $12,950 $15,950 Gold 8am-6pm (ET for NA, CET Europe) Severity 1=2 hr Severity 2=6 hr 3 Phone Web Self-Service Knowledge Base Health Check Service included 10% Discount training and other consulting services Sev 1 hot fixes $15,950 $18,950 Platinum 7 x 24 x 365 Severity 1=1 hr Severity 2=4 hr 5 Phone Web Self-Service Knowledge Base Health Check Service included 20% discount training and other consulting services Sev 1 and 2 hot fixes $18,950 $21,950 Perpetual Perpetual (Same as Gold Support Level) 18% of Purchase Price not to increase by more then 10% per Yr. $40,000 * Two year subscription pricing requires two year prepay
22 Why Customers Buy Infobright (2) Less cost Capacity-based subscription model Less hardware required Less work Simple schemas and standard SQL Reduced monitoring requirements Reduced maintenance requirements Open source Fast implementation Open source community Broad platform support 22
23
24 Dominik (in 2010) 24
25 Dominik (in 2010) 25
26 Approximate SQL In such areas as, e.g., Business Intelligence and Web Analytics, there is an ongoing debate whether the answers to SQL statements have to be always exact. The same question occurs in the case of SQL-based machine learning algorithms, which are often based on heuristics, randomness and inexactness anyway. Motivation for SQL approximations is related also to such aspects as complexity of queries and data sources, dynamically changing data with a limited access, and huge data sets with a need to monitor convergence of query execution in time, regardless of whether the final answers are to be exact or not. 26
27 Data Granulation 27
28 SQL-based Machine Learning 28
29 Community Inspirations New Objectives New Schemas New Volumes New Queries New Statistics Count Distinct Self-Joins Decision Trees Contingencies New Data Types SQL Extensions Feature Extraction Data Compression 29
30 DZIĘKUJĘ ZA UWAGĘ!!!
High Performance Log Analytics: Database Considerations
High Performance Log Analytics: Database Considerations "Once companies start down the path of log collection and management, organizations often discover there are things going on in their networks that
More informationEnterprise Edition Analytic Data Warehouse Technology White Paper
Enterprise Edition Analytic Data Warehouse Technology White Paper August 2008 Infobright 47 Colborne Lane, Suite 403 Toronto, Ontario M5E 1P8 Canada www.infobright.com info@infobright.com Table of Contents
More informationBig Data & the LAMP Stack: How to Boost Performance
Big Data & the LAMP Stack: How to Boost Performance Jeff Kibler, Infobright Community Manager Kevin Schroeder, Zend Technologies, Technology Evangelist Agenda The Machine-Generated Data Problem Benchmark:
More informationAnalytic Applications With PHP and a Columnar Database
AnalyticApplicationsWithPHPandaColumnarDatabase No matter where you look these days, PHP continues to gain strong use both inside and outside of the enterprise. Although developers have many choices when
More informationData Integrity & Scalability The Value of Accuracy. Data Quality in Big Data
Data Integrity & Scalability The Value of Accuracy Data Quality in Big Data Data Quality in the news 2 And some more examples... 3 High Quality Information as competitive differentiator Business today...
More informationIntegrating Apache Spark with an Enterprise Data Warehouse
Integrating Apache Spark with an Enterprise Warehouse Dr. Michael Wurst, IBM Corporation Architect Spark/R/Python base Integration, In-base Analytics Dr. Toni Bollinger, IBM Corporation Senior Software
More informationMoving Large Data at a Blinding Speed for Critical Business Intelligence. A competitive advantage
Moving Large Data at a Blinding Speed for Critical Business Intelligence A competitive advantage Intelligent Data In Real Time How do you detect and stop a Money Laundering transaction just about to take
More informationInge Os Sales Consulting Manager Oracle Norway
Inge Os Sales Consulting Manager Oracle Norway Agenda Oracle Fusion Middelware Oracle Database 11GR2 Oracle Database Machine Oracle & Sun Agenda Oracle Fusion Middelware Oracle Database 11GR2 Oracle Database
More informationJDSU Partners with Infobright to Help the World s Largest Communications Service Providers Ensure the Highest Quality of Service
JDSU Partners with Infobright to Help the World s Largest Communications Service Providers Ensure the Highest Quality of Service Overview JDSU (NASDAQ: JDSU; and TSX: JDU) innovates and markets diverse
More informationMicrosoft SQL Server to Infobright Database Migration Guide
Microsoft SQL Server to Infobright Database Migration Guide Infobright 47 Colborne Street, Suite 403 Toronto, Ontario M5E 1P8 Canada www.infobright.com www.infobright.org Approaches to Migrating Databases
More information<Insert Picture Here> Best Practices for Extreme Performance with Data Warehousing on Oracle Database
1 Best Practices for Extreme Performance with Data Warehousing on Oracle Database Rekha Balwada Principal Product Manager Agenda Parallel Execution Workload Management on Data Warehouse
More informationMySQL és Hadoop mint Big Data platform (SQL + NoSQL = MySQL Cluster?!)
MySQL és Hadoop mint Big Data platform (SQL + NoSQL = MySQL Cluster?!) Erdélyi Ernő, Component Soft Kft. erno@component.hu www.component.hu 2013 (c) Component Soft Ltd Leading Hadoop Vendor Copyright 2013,
More informationOptimize Oracle Business Intelligence Analytics with Oracle 12c In-Memory Database Option
Optimize Oracle Business Intelligence Analytics with Oracle 12c In-Memory Database Option Kai Yu, Senior Principal Architect Dell Oracle Solutions Engineering Dell, Inc. Abstract: By adding the In-Memory
More information2009 Oracle Corporation 1
The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into any contract. It is not a commitment to deliver any material,
More informationInnovative technology for big data analytics
Technical white paper Innovative technology for big data analytics The HP Vertica Analytics Platform database provides price/performance, scalability, availability, and ease of administration Table of
More informationEMC/Greenplum Driving the Future of Data Warehousing and Analytics
EMC/Greenplum Driving the Future of Data Warehousing and Analytics EMC 2010 Forum Series 1 Greenplum Becomes the Foundation of EMC s Data Computing Division E M C A CQ U I R E S G R E E N P L U M Greenplum,
More informationMicrosoft SQL Server 2008 R2 Enterprise Edition and Microsoft SharePoint Server 2010
Microsoft SQL Server 2008 R2 Enterprise Edition and Microsoft SharePoint Server 2010 Better Together Writer: Bill Baer, Technical Product Manager, SharePoint Product Group Technical Reviewers: Steve Peschka,
More informationExadata Database Machine
Database Machine Extreme Extraordinary Exciting By Craig Moir of MyDBA March 2011 Exadata & Exalogic What is it? It is Hardware and Software engineered to work together It is Extreme Performance Application-to-Disk
More informationData Warehousing and Analytics Infrastructure at Facebook. Ashish Thusoo & Dhruba Borthakur athusoo,dhruba@facebook.com
Data Warehousing and Analytics Infrastructure at Facebook Ashish Thusoo & Dhruba Borthakur athusoo,dhruba@facebook.com Overview Challenges in a Fast Growing & Dynamic Environment Data Flow Architecture,
More informationCustomized Report- Big Data
GINeVRA Digital Research Hub Customized Report- Big Data 1 2014. All Rights Reserved. Agenda Context Challenges and opportunities Solutions Market Case studies Recommendations 2 2014. All Rights Reserved.
More informationOracle BI EE Implementation on Netezza. Prepared by SureShot Strategies, Inc.
Oracle BI EE Implementation on Netezza Prepared by SureShot Strategies, Inc. The goal of this paper is to give an insight to Netezza architecture and implementation experience to strategize Oracle BI EE
More informationNews and trends in Data Warehouse Automation, Big Data and BI. Johan Hendrickx & Dirk Vermeiren
News and trends in Data Warehouse Automation, Big Data and BI Johan Hendrickx & Dirk Vermeiren Extreme Agility from Source to Analysis DWH Appliances & DWH Automation Typical Architecture 3 What Business
More informationMaximum performance, minimal risk for data warehousing
SYSTEM X SERVERS SOLUTION BRIEF Maximum performance, minimal risk for data warehousing Microsoft Data Warehouse Fast Track for SQL Server 2014 on System x3850 X6 (95TB) The rapid growth of technology has
More informationData Warehouse: Introduction
Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of base and data mining group,
More informationBIG DATA APPLIANCES. July 23, TDWI. R Sathyanarayana. Enterprise Information Management & Analytics Practice EMC Consulting
BIG DATA APPLIANCES July 23, TDWI R Sathyanarayana Enterprise Information Management & Analytics Practice EMC Consulting 1 Big data are datasets that grow so large that they become awkward to work with
More informationCost-Effective Business Intelligence with Red Hat and Open Source
Cost-Effective Business Intelligence with Red Hat and Open Source Sherman Wood Director, Business Intelligence, Jaspersoft September 3, 2009 1 Agenda Introductions Quick survey What is BI?: reporting,
More informationPreview of Oracle Database 12c In-Memory Option. Copyright 2013, Oracle and/or its affiliates. All rights reserved.
Preview of Oracle Database 12c In-Memory Option 1 The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into any
More informationOracle Database In-Memory The Next Big Thing
Oracle Database In-Memory The Next Big Thing Maria Colgan Master Product Manager #DBIM12c Why is Oracle do this Oracle Database In-Memory Goals Real Time Analytics Accelerate Mixed Workload OLTP No Changes
More information<Insert Picture Here> Oracle Database Directions Fred Louis Principal Sales Consultant Ohio Valley Region
Oracle Database Directions Fred Louis Principal Sales Consultant Ohio Valley Region 1977 Oracle Database 30 Years of Sustained Innovation Database Vault Transparent Data Encryption
More informationFact Sheet In-Memory Analysis
Fact Sheet In-Memory Analysis 1 Copyright Yellowfin International 2010 Contents In Memory Overview...3 Benefits...3 Agile development & rapid delivery...3 Data types supported by the In-Memory Database...4
More informationQLIKVIEW INTEGRATION TION WITH AMAZON REDSHIFT John Park Partner Engineering
QLIKVIEW INTEGRATION TION WITH AMAZON REDSHIFT John Park Partner Engineering June 2014 Page 1 Contents Introduction... 3 About Amazon Web Services (AWS)... 3 About Amazon Redshift... 3 QlikView on AWS...
More informationInformation management software solutions White paper. Powerful data warehousing performance with IBM Red Brick Warehouse
Information management software solutions White paper Powerful data warehousing performance with IBM Red Brick Warehouse April 2004 Page 1 Contents 1 Data warehousing for the masses 2 Single step load
More informationAdvanced In-Database Analytics
Advanced In-Database Analytics Tallinn, Sept. 25th, 2012 Mikko-Pekka Bertling, BDM Greenplum EMEA 1 That sounds complicated? 2 Who can tell me how best to solve this 3 What are the main mathematical functions??
More informationICOM 6005 Database Management Systems Design. Dr. Manuel Rodríguez Martínez Electrical and Computer Engineering Department Lecture 2 August 23, 2001
ICOM 6005 Database Management Systems Design Dr. Manuel Rodríguez Martínez Electrical and Computer Engineering Department Lecture 2 August 23, 2001 Readings Read Chapter 1 of text book ICOM 6005 Dr. Manuel
More informationPerformance and Scalability Overview
Performance and Scalability Overview This guide provides an overview of some of the performance and scalability capabilities of the Pentaho Business Analytics platform. PENTAHO PERFORMANCE ENGINEERING
More informationIntroducing Oracle Exalytics In-Memory Machine
Introducing Oracle Exalytics In-Memory Machine Jon Ainsworth Director of Business Development Oracle EMEA Business Analytics 1 Copyright 2011, Oracle and/or its affiliates. All rights Agenda Topics Oracle
More informationSQL Server 2005 Features Comparison
Page 1 of 10 Quick Links Home Worldwide Search Microsoft.com for: Go : Home Product Information How to Buy Editions Learning Downloads Support Partners Technologies Solutions Community Previous Versions
More informationVectorwise 3.0 Fast Answers from Hadoop. Technical white paper
Vectorwise 3.0 Fast Answers from Hadoop Technical white paper 1 Contents Executive Overview 2 Introduction 2 Analyzing Big Data 3 Vectorwise and Hadoop Environments 4 Vectorwise Hadoop Connector 4 Performance
More informationGaining the Performance Edge Using a Column-Oriented Database Management System
Analytics in the Federal Government White paper series on how to achieve efficiency, responsiveness and transparency. Gaining the Performance Edge Using a Column-Oriented Database Management System by
More informationIntroduction to Database as a Service
Introduction to Database as a Service Exadata Platform Revised 8/1/13 Database as a Service (DBaaS) Starts With The Business Needs Establish an IT delivery model that reduces costs, meets demand, and fulfills
More informationSAP HANA SAP s In-Memory Database. Dr. Martin Kittel, SAP HANA Development January 16, 2013
SAP HANA SAP s In-Memory Database Dr. Martin Kittel, SAP HANA Development January 16, 2013 Disclaimer This presentation outlines our general product direction and should not be relied on in making a purchase
More informationLowering the Total Cost of Ownership (TCO) of Data Warehousing
Ownership (TCO) of Data If Gordon Moore s law of performance improvement and cost reduction applies to processing power, why hasn t it worked for data warehousing? Kognitio provides solutions to business
More informationPerformance and Scalability Overview
Performance and Scalability Overview This guide provides an overview of some of the performance and scalability capabilities of the Pentaho Business Analytics Platform. Contents Pentaho Scalability and
More informationORACLE BUSINESS INTELLIGENCE, ORACLE DATABASE, AND EXADATA INTEGRATION
ORACLE BUSINESS INTELLIGENCE, ORACLE DATABASE, AND EXADATA INTEGRATION EXECUTIVE SUMMARY Oracle business intelligence solutions are complete, open, and integrated. Key components of Oracle business intelligence
More informationHow To Use Hp Vertica Ondemand
Data sheet HP Vertica OnDemand Enterprise-class Big Data analytics in the cloud Enterprise-class Big Data analytics for any size organization Vertica OnDemand Organizations today are experiencing a greater
More informationWhy Big Data in the Cloud?
Have 40 Why Big Data in the Cloud? Colin White, BI Research January 2014 Sponsored by Treasure Data TABLE OF CONTENTS Introduction The Importance of Big Data The Role of Cloud Computing Using Big Data
More informationAn Oracle White Paper March 2014. Best Practices for Implementing a Data Warehouse on the Oracle Exadata Database Machine
An Oracle White Paper March 2014 Best Practices for Implementing a Data Warehouse on the Oracle Exadata Database Machine Introduction... 1! Data Models for a Data Warehouse... 2! Physical Model Implementing
More informationBIG DATA: FROM HYPE TO REALITY. Leandro Ruiz Presales Partner for C&LA Teradata
BIG DATA: FROM HYPE TO REALITY Leandro Ruiz Presales Partner for C&LA Teradata Evolution in The Use of Information Action s ACTIVATING MAKE it happen! Insights OPERATIONALIZING WHAT IS happening now? PREDICTING
More informationCisco Data Preparation
Data Sheet Cisco Data Preparation Unleash your business analysts to develop the insights that drive better business outcomes, sooner, from all your data. As self-service business intelligence (BI) and
More informationBringing Big Data into the Enterprise
Bringing Big Data into the Enterprise Overview When evaluating Big Data applications in enterprise computing, one often-asked question is how does Big Data compare to the Enterprise Data Warehouse (EDW)?
More informationThe Cubetree Storage Organization
The Cubetree Storage Organization Nick Roussopoulos & Yannis Kotidis Advanced Communication Technology, Inc. Silver Spring, MD 20905 Tel: 301-384-3759 Fax: 301-384-3679 {nick,kotidis}@act-us.com 1. Introduction
More informationData Analytics The New Growth Opportunity for Software Developers
Data Analytics The New Growth Opportunity for Software Developers How the Vertica Analytic Database is powering the new wave of commercial software, SaaS and appliance-based applications and creating new
More informationPostgreSQL Business Intelligence & Performance Simon Riggs CTO, 2ndQuadrant PostgreSQL Major Contributor
PostgreSQL Business Intelligence & Performance Simon Riggs CTO, 2ndQuadrant PostgreSQL Major Contributor The research leading to these results has received funding from the European Union's Seventh Framework
More informationHow to Choose Between Hadoop, NoSQL and RDBMS
How to Choose Between Hadoop, NoSQL and RDBMS Keywords: Jean-Pierre Dijcks Oracle Redwood City, CA, USA Big Data, Hadoop, NoSQL Database, Relational Database, SQL, Security, Performance Introduction A
More informationApache Hadoop in the Enterprise. Dr. Amr Awadallah, CTO/Founder @awadallah, aaa@cloudera.com
Apache Hadoop in the Enterprise Dr. Amr Awadallah, CTO/Founder @awadallah, aaa@cloudera.com Cloudera The Leader in Big Data Management Powered by Apache Hadoop The Leading Open Source Distribution of Apache
More informationOracle9i Data Warehouse Review. Robert F. Edwards Dulcian, Inc.
Oracle9i Data Warehouse Review Robert F. Edwards Dulcian, Inc. Agenda Oracle9i Server OLAP Server Analytical SQL Data Mining ETL Warehouse Builder 3i Oracle 9i Server Overview 9i Server = Data Warehouse
More informationUsing MySQL for Big Data Advantage Integrate for Insight Sastry Vedantam sastry.vedantam@oracle.com
Using MySQL for Big Data Advantage Integrate for Insight Sastry Vedantam sastry.vedantam@oracle.com Agenda The rise of Big Data & Hadoop MySQL in the Big Data Lifecycle MySQL Solutions for Big Data Q&A
More informationScaling Your Data to the Cloud
ZBDB Scaling Your Data to the Cloud Technical Overview White Paper POWERED BY Overview ZBDB Zettabyte Database is a new, fully managed data warehouse on the cloud, from SQream Technologies. By building
More informationSummary of Alma-OSF s Evaluation of MongoDB for Monitoring Data Heiko Sommer June 13, 2013
Summary of Alma-OSF s Evaluation of MongoDB for Monitoring Data Heiko Sommer June 13, 2013 Heavily based on the presentation by Tzu-Chiang Shen, Leonel Peña ALMA Integrated Computing Team Coordination
More informationSQL Server Parallel Data Warehouse: Architecture Overview. José Blakeley Database Systems Group, Microsoft Corporation
SQL Server Parallel Data Warehouse: Architecture Overview José Blakeley Database Systems Group, Microsoft Corporation Outline Motivation MPP DBMS system architecture HW and SW Key components Query processing
More informationAccelerate Business Advantage with Dynamic Warehousing
Accelerate Business Advantage with Dynamic Warehousing Mark McConnell Marketing Executive, Information Management IBM Asia Pacific 2007 IBM Corporation Is Information Technology delivering? Source: IBM
More informationMicrosoft Analytics Platform System. Solution Brief
Microsoft Analytics Platform System Solution Brief Contents 4 Introduction 4 Microsoft Analytics Platform System 5 Enterprise-ready Big Data 7 Next-generation performance at scale 10 Engineered for optimal
More informationUsing Tableau Software with Hortonworks Data Platform
Using Tableau Software with Hortonworks Data Platform September 2013 2013 Hortonworks Inc. http:// Modern businesses need to manage vast amounts of data, and in many cases they have accumulated this data
More informationUNIVERSE DESIGN BEST PRACTICES. Roxanne Pittman, InfoSol May 8, 2014
UNIVERSE DESIGN BEST PRACTICES Roxanne Pittman, InfoSol May 8, 2014 SEVEN PRINCIPLES OF UNIVERSAL DESIGN BY THE CENTER FOR UNIVERSAL DESIGN (CUD) NORTH CAROLINA STATE UNIVERSITY 1. Equitable use. The design
More informationOracle Database - Engineered for Innovation. Sedat Zencirci Teknoloji Satış Danışmanlığı Direktörü Türkiye ve Orta Asya
Oracle Database - Engineered for Innovation Sedat Zencirci Teknoloji Satış Danışmanlığı Direktörü Türkiye ve Orta Asya Oracle Database 11g Release 2 Shipping since September 2009 11.2.0.3 Patch Set now
More informationHow To Handle Big Data With A Data Scientist
III Big Data Technologies Today, new technologies make it possible to realize value from Big Data. Big data technologies can replace highly customized, expensive legacy systems with a standard solution
More informationVirtuoso and Database Scalability
Virtuoso and Database Scalability By Orri Erling Table of Contents Abstract Metrics Results Transaction Throughput Initializing 40 warehouses Serial Read Test Conditions Analysis Working Set Effect of
More informationBig Data Analytics Platform @ Nokia
Big Data Analytics Platform @ Nokia 1 Selecting the Right Tool for the Right Workload Yekesa Kosuru Nokia Location & Commerce Strata + Hadoop World NY - Oct 25, 2012 Agenda Big Data Analytics Platform
More informationTableau Metadata Model
Tableau Metadata Model Author: Marc Reuter Senior Director, Strategic Solutions, Tableau Software March 2012 p2 Most Business Intelligence platforms fall into one of two metadata camps: either model the
More informationSQL Server 2012 Performance White Paper
Published: April 2012 Applies to: SQL Server 2012 Copyright The information contained in this document represents the current view of Microsoft Corporation on the issues discussed as of the date of publication.
More informationRetail POS Data Analytics Using MS Bi Tools. Business Intelligence White Paper
Retail POS Data Analytics Using MS Bi Tools Business Intelligence White Paper Introduction Overview There is no doubt that businesses today are driven by data. Companies, big or small, take so much of
More informationBig Data and Its Impact on the Data Warehousing Architecture
Big Data and Its Impact on the Data Warehousing Architecture Sponsored by SAP Speaker: Wayne Eckerson, Director of Research, TechTarget Wayne Eckerson: Hi my name is Wayne Eckerson, I am Director of Research
More informationBig Data Analytics with IBM Cognos BI Dynamic Query IBM Redbooks Solution Guide
Big Data Analytics with IBM Cognos BI Dynamic Query IBM Redbooks Solution Guide IBM Cognos Business Intelligence (BI) helps you make better and smarter business decisions faster. Advanced visualization
More informationAnalytics in the Cloud. Peter Sirota, GM Elastic MapReduce
Analytics in the Cloud Peter Sirota, GM Elastic MapReduce Data-Driven Decision Making Data is the new raw material for any business on par with capital, people, and labor. What is Big Data? Terabytes of
More informationOracle Big Data, In-memory, and Exadata - One Database Engine to Rule Them All Dr.-Ing. Holger Friedrich
Oracle Big Data, In-memory, and Exadata - One Database Engine to Rule Them All Dr.-Ing. Holger Friedrich Agenda Introduction Old Times Exadata Big Data Oracle In-Memory Headquarters Conclusions 2 sumit
More informationTransforming the Economics of Data Warehousing with Cloud Computing
Transforming the Economics of Data Warehousing with Cloud Computing How new frontiers in on-demand computing and DBMS technology will transform business. Copyright Vertica Systems Inc. November, 2008 Table
More informationEfficient Iceberg Query Evaluation for Structured Data using Bitmap Indices
Proc. of Int. Conf. on Advances in Computer Science, AETACS Efficient Iceberg Query Evaluation for Structured Data using Bitmap Indices Ms.Archana G.Narawade a, Mrs.Vaishali Kolhe b a PG student, D.Y.Patil
More informationVirtual Data Warehouse Appliances
infrastructure (WX 2 and blade server Kognitio provides solutions to business problems that require acquisition, rationalization and analysis of large and/or complex data The Kognitio Technology and Data
More informationHow In-Memory Data Grids Can Analyze Fast-Changing Data in Real Time
SCALEOUT SOFTWARE How In-Memory Data Grids Can Analyze Fast-Changing Data in Real Time by Dr. William Bain and Dr. Mikhail Sobolev, ScaleOut Software, Inc. 2012 ScaleOut Software, Inc. 12/27/2012 T wenty-first
More informationSQL Server 2012 Parallel Data Warehouse. Solution Brief
SQL Server 2012 Parallel Data Warehouse Solution Brief Published February 22, 2013 Contents Introduction... 1 Microsoft Platform: Windows Server and SQL Server... 2 SQL Server 2012 Parallel Data Warehouse...
More informationIII JORNADAS DE DATA MINING
III JORNADAS DE DATA MINING EN EL MARCO DE LA MAESTRÍA EN DATA MINING DE LA UNIVERSIDAD AUSTRAL PRESENTACIÓN TECNOLÓGICA IBM Alan Schcolnik, Cognos Technical Sales Team Leader, IBM Software Group. IAE
More information<Insert Picture Here> Oracle BI Standard Edition One The Right BI Foundation for the Emerging Enterprise
Oracle BI Standard Edition One The Right BI Foundation for the Emerging Enterprise Business Intelligence is the #1 Priority the most important technology in 2007 is business intelligence
More informationAtScale Intelligence Platform
AtScale Intelligence Platform PUT THE POWER OF HADOOP IN THE HANDS OF BUSINESS USERS. Connect your BI tools directly to Hadoop without compromising scale, performance, or control. TURN HADOOP INTO A HIGH-PERFORMANCE
More informationAn Oracle White Paper June 2012. High Performance Connectors for Load and Access of Data from Hadoop to Oracle Database
An Oracle White Paper June 2012 High Performance Connectors for Load and Access of Data from Hadoop to Oracle Database Executive Overview... 1 Introduction... 1 Oracle Loader for Hadoop... 2 Oracle Direct
More informationENTERPRISE EDITION ORACLE DATA SHEET KEY FEATURES AND BENEFITS ORACLE DATA INTEGRATOR
ORACLE DATA INTEGRATOR ENTERPRISE EDITION KEY FEATURES AND BENEFITS ORACLE DATA INTEGRATOR ENTERPRISE EDITION OFFERS LEADING PERFORMANCE, IMPROVED PRODUCTIVITY, FLEXIBILITY AND LOWEST TOTAL COST OF OWNERSHIP
More informationNetezza and Business Analytics Synergy
Netezza Business Partner Update: November 17, 2011 Netezza and Business Analytics Synergy Shimon Nir, IBM Agenda Business Analytics / Netezza Synergy Overview Netezza overview Enabling the Business with
More informationUnderstanding the Benefits of IBM SPSS Statistics Server
IBM SPSS Statistics Server Understanding the Benefits of IBM SPSS Statistics Server Contents: 1 Introduction 2 Performance 101: Understanding the drivers of better performance 3 Why performance is faster
More informationW H I T E P A P E R. Deriving Intelligence from Large Data Using Hadoop and Applying Analytics. Abstract
W H I T E P A P E R Deriving Intelligence from Large Data Using Hadoop and Applying Analytics Abstract This white paper is focused on discussing the challenges facing large scale data processing and the
More informationTableau Visual Intelligence Platform Rapid Fire Analytics for Everyone Everywhere
Tableau Visual Intelligence Platform Rapid Fire Analytics for Everyone Everywhere Agenda 1. Introductions & Objectives 2. Tableau Overview 3. Tableau Products 4. Tableau Architecture 5. Why Tableau? 6.
More informationThe Next Wave of Data Management. Is Big Data The New Normal?
The Next Wave of Data Management Is Big Data The New Normal? Table of Contents Introduction 3 Separating Reality and Hype 3 Why Are Firms Making IT Investments In Big Data? 4 Trends In Data Management
More informationMinimize cost and risk for data warehousing
SYSTEM X SERVERS SOLUTION BRIEF Minimize cost and risk for data warehousing Microsoft Data Warehouse Fast Track for SQL Server 2014 on System x3850 X6 (55TB) Highlights Improve time to value for your data
More informationSUN ORACLE EXADATA STORAGE SERVER
SUN ORACLE EXADATA STORAGE SERVER KEY FEATURES AND BENEFITS FEATURES 12 x 3.5 inch SAS or SATA disks 384 GB of Exadata Smart Flash Cache 2 Intel 2.53 Ghz quad-core processors 24 GB memory Dual InfiniBand
More informationANALYTICS BUILT FOR INTERNET OF THINGS
ANALYTICS BUILT FOR INTERNET OF THINGS Big Data Reporting is Out, Actionable Insights are In In recent years, it has become clear that data in itself has little relevance, it is the analysis of it that
More informationWho am I? Copyright 2014, Oracle and/or its affiliates. All rights reserved. 3
Oracle Database In-Memory Power the Real-Time Enterprise Saurabh K. Gupta Principal Technologist, Database Product Management Who am I? Principal Technologist, Database Product Management at Oracle Author
More informationBusiness Intelligence
Business Intelligence Data Mining and Data Warehousing Dominik Ślęzak slezak@infobright.com www.infobright.com Research Interests Data Warehouses, Knowledge Discovery, Rough Sets Machine Intelligence,
More informationElastic Application Platform for Market Data Real-Time Analytics. for E-Commerce
Elastic Application Platform for Market Data Real-Time Analytics Can you deliver real-time pricing, on high-speed market data, for real-time critical for E-Commerce decisions? Market Data Analytics applications
More informationHow To Make Data Streaming A Real Time Intelligence
REAL-TIME OPERATIONAL INTELLIGENCE Competitive advantage from unstructured, high-velocity log and machine Big Data 2 SQLstream: Our s-streaming products unlock the value of high-velocity unstructured log
More informationDATABASE. Pervasive PSQL Performance. Key Performance Features of Pervasive PSQL. Pervasive PSQL White Paper
DATABASE Pervasive PSQL Performance Key Performance Features of Pervasive PSQL Pervasive PSQL White Paper June 2008 Table of Contents Introduction... 3 Per f o r m a n c e Ba s i c s: Mo r e Me m o r y,
More informationSAP Analytics Roadmap for Small and Midsize Companies. Kevin Chan, Director, Solutions Management @ SAP
SAP Analytics Roadmap for Small and Midsize Companies Kevin Chan, Director, Solutions Management @ SAP A WORLD OF ACCELERATING CHANGE An emerging middle class growing to 5B Data doubling every 18 months
More informationIBM Netezza High Capacity Appliance
IBM Netezza High Capacity Appliance Petascale Data Archival, Analysis and Disaster Recovery Solutions IBM Netezza High Capacity Appliance Highlights: Allows querying and analysis of deep archival data
More information