Next Generation Data Warehousing Appliances 23.10.2014

Similar documents
IBM Netezza High-performance business intelligence and advanced analytics for the enterprise. The analytics conundrum

Netezza and Business Analytics Synergy

Evolving Solutions Disruptive Technology Series Modern Data Warehouse

High-Performance Business Analytics: SAS and IBM Netezza Data Warehouse Appliances

IBM PureData Systems. Robert Božič 2013 IBM Corporation

IBM Netezza High Capacity Appliance

Emerging Technologies Shaping the Future of Data Warehouses & Business Intelligence

Collaborative Big Data Analytics. Copyright 2012 EMC Corporation. All rights reserved.

Harnessing the power of advanced analytics with IBM Netezza

PureSystems: Changing The Economics And Experience Of IT

Einsatzfelder von IBM PureData Systems und Ihre Vorteile.

2015 Ironside Group, Inc. 2

Driving Peak Performance IBM Corporation

BIG Data Analytics Move to Competitive Advantage

Green Migration from Oracle

Big Data Analytics. with EMC Greenplum and Hadoop. Big Data Analytics. Ofir Manor Pre Sales Technical Architect EMC Greenplum

Integrating Netezza into your existing IT landscape

SAP Analytics Roadmap for Small and Midsize Companies. Kevin Chan, Director, Solutions SAP

Mike Maxey. Senior Director Product Marketing Greenplum A Division of EMC. Copyright 2011 EMC Corporation. All rights reserved.

What s New with Informatica Data Services & PowerCenter Data Virtualization Edition

Using Attunity Replicate with Greenplum Database Using Attunity Replicate for data migration and Change Data Capture to the Greenplum Database

IBM Data Warehousing and Analytics Portfolio Summary

End to End Solution to Accelerate Data Warehouse Optimization. Franco Flore Alliance Sales Director - APJ

James Serra Sr BI Architect

Bringing Big Data into the Enterprise

IBM Analytics. Just the facts: Four critical concepts for planning the logical data warehouse

MDM for the Enterprise: Complementing and extending your Active Data Warehousing strategy. Satish Krishnaswamy VP MDM Solutions - Teradata

Cloud Integration and the Big Data Journey - Common Use-Case Patterns

Main Memory Data Warehouses

EMC/Greenplum Driving the Future of Data Warehousing and Analytics

The Pros and Cons of Data Warehouse Appliances

Introducing Oracle Exalytics In-Memory Machine

SAP Real-time Data Platform. April 2013

Oracle Database In-Memory The Next Big Thing

SAP HANA PLATFORM Top Ten Questions for Choosing In-Memory Databases. Start Here

A Whole New World. Big Data Technologies Big Discovery Big Insights Endless Possibilities

Aligning Your Strategic Initiatives with a Realistic Big Data Analytics Roadmap

Applied Business Intelligence. Iakovos Motakis, Ph.D. Director, DW & Decision Support Systems Intrasoft SA

Virtual Data Warehouse Appliances

PostgreSQL Business Intelligence & Performance Simon Riggs CTO, 2ndQuadrant PostgreSQL Major Contributor

Big Data and Its Impact on the Data Warehousing Architecture

Evolving Data Warehouse Architectures

EMC GREENPLUM DATABASE

News and trends in Data Warehouse Automation, Big Data and BI. Johan Hendrickx & Dirk Vermeiren

Informatica Data Replication: Maximize Return on Data in Real Time Chai Pydimukkala Principal Product Manager Informatica

How to Enhance Traditional BI Architecture to Leverage Big Data

Preview of Oracle Database 12c In-Memory Option. Copyright 2013, Oracle and/or its affiliates. All rights reserved.

QlikView Business Discovery Platform. Algol Consulting Srl

Big Data and Trusted Information

Big Data & Cloud Computing. Faysal Shaarani

IBM AND NEXT GENERATION ARCHITECTURE FOR BIG DATA & ANALYTICS!

Ten Things You Need to Know About Data Virtualization

IBM WebSphere DataStage Online training from Yes-M Systems

Dell* In-Memory Appliance for Cloudera* Enterprise

IBM Netezza Analytics

Affordable, Scalable, Reliable OLTP in a Cloud and Big Data World: IBM DB2 purescale

Cisco IT Hadoop Journey

Greenplum Database. Getting Started with Big Data Analytics. Ofir Manor Pre Sales Technical Architect, EMC Greenplum

Sybase IQ Supercharges Predictive Analytics

Migrating Discoverer to OBIEE Lessons Learned. Presented By Presented By Naren Thota Infosemantics, Inc.

Advanced In-Database Analytics

How To Use Hp Vertica Ondemand

Oracle BI EE Implementation on Netezza. Prepared by SureShot Strategies, Inc.

WHITE PAPER. Harnessing the Power of Advanced Analytics How an appliance approach simplifies the use of advanced analytics

SAS and Oracle: Big Data and Cloud Partnering Innovation Targets the Third Platform

Decoding the Big Data Deluge a Virtual Approach. Dan Luongo, Global Lead, Field Solution Engineering Data Virtualization Business Unit, Cisco

Scaling Your Data to the Cloud

Salesforce.com and MicroStrategy. A functional overview and recommendation for analysis and application development

Why DBMSs Matter More than Ever in the Big Data Era

Microsoft Analytics Platform System. Solution Brief

IBM Cognos 10: Enhancing query processing performance for IBM Netezza appliances

Bringing Big Data to People

White Paper. Unified Data Integration Across Big Data Platforms

Unified Data Integration Across Big Data Platforms

Structure of the presentation

Cost-Effective Business Intelligence with Red Hat and Open Source

White Paper - GPU-Based SQL Database. SQream Technologies. SQream DB GPU-Based SQL Database Technical Overview White Paper

SQL Maestro and the ELT Paradigm Shift

ENABLING OPERATIONAL BI

How To Handle Big Data With A Data Scientist

Oracle Big Data Discovery Unlock Potential in Big Data Reservoir

Dell Cloudera Syncsort Data Warehouse Optimization ETL Offload

Database Performance with In-Memory Solutions

SQL Server 2012 Parallel Data Warehouse. Solution Brief

Self Service Business Intelligence - how to bring Oracle and DB2 z/os data together

Informatica and the Vibe Virtual Data Machine

Focus on the business, not the business of data warehousing!

Transcription:

Next Generation Data Warehousing Appliances 23.10.2014 Presentert av: Espen Jorde, Executive Advisor Bjørn Runar Nes, CTO/Chief Architect

Bjørn Runar Nes Espen Jorde 2 3.12.2014

Agenda Affecto s new Data Warehouse architecture - Pains and gains DW/BI/BA Appliance - Why - What does it do - How does it solve your issues Appliance customer stories

Tear down the Data Warehouse 100 times faster response 50% less operational costs At least 30% shorter projects

Best practice until now System Y System X System Z Data Sources Data ETL Integration Stage Layer Enterprise Layer DM DM DM Ad-hoc Analysis Visual Storytelling Reporting Performance Management

Typical Business Intelligence Challenges Quality and Risk Business work-around Temporary solutions Manual workload Quality issues Performance Poor query performance Long data load window Refresh rate too rare Solution Cost Too complex solutions Non-integrated tools Lack of documentation Outdated architecture and legacy solutions Time to Market Long project delivery time Large backlog Heavy maintenance Technical debt

Affecto s Reference model Data Virtualization Analytical Sandbox Analytical Modeling #2 System Y System X Streaming Real-time MDM Cloud Appliance(s) Stage Data ELT Stage Layer Integration,, ELT Enterprise ELT Layer Layer Big Data Hadoop #3 Integrated Development Environment #3 Cache DM DM VDM #1 Ad-hoc Analysis Visual Storytelling Reporting Performance Management Real-time Analysis

Agenda Affecto s new Data Warehouse architecture - Pains and gains DW/BI/BA Appliance - Why - What does it do - How does it solve your issues Appliance customer stories

What is an appliance? Something: Specialized Built for a purpose Complete solution Easy to use Standardized interface Reasonably prized

Technology Is the Driving Force Shaping the Future

11 Rapid and accelerating pace of change - Those who lag behind will quickly disappear

Why do you need higher performance?

Typical Business Intelligence Challenges Quality and Risk Business work-around Temporary solutions Manual workload Quality issues Performance Poor query performance Long data load window Refresh rate too rare Solution Cost Too complex solutions Non-integrated tools Lack of documentation Outdated architecture and legacy solutions Time to Market Long project delivery time Large backlog Heavy maintenance Technical debt

Traditional Data Warehouse Complexity

Data Warehousing Simplified

Typical Business Intelligence Challenges Quality and Risk Business work-around Temporary solutions Manual workload Quality issues Performance Poor query performance Long data load window Refresh rate too rare Solution Cost Too complex solutions Non-integrated tools Lack of documentation Outdated architecture and legacy solutions Time to Market Long project delivery time Large backlog Heavy maintenance Technical debt

Inside the IBM PureData System for Analytics Optimized Hardware + Software Hardware accelerated AMPP Purpose-built for high performance analytics Requires no tuning Disk Enclosures User data, mirror, swap partitions High speed data streaming Snippet Blades SMP Hosts SQL Compiler Query Plan Optimize Admin Hardware-based query acceleration with FPGAs Blistering fast results Complex analytics executed as the data streams from disk

Typical data load improvements Acceptable throughput using ODBC (ETL) - 2-4x High throughput using Direct Loader (ETL) - 10-75x Extreme throughput using SQL Push-Down (ELT) - 30-200x (approaching 1.5 mill trans/sec on a small appliance) 18

Query performance Mid size tables 10-100x query improvement Queries on large data volumes 100-1000x improvements 19

Sweet spot Loading HUGE tables Playing around with HUGE tables - Adding columns - Changing data ELT Querying on large volumes of detailed data In-database Analytics (R, SPSS, SAS, Phyton, m.fl.) In-database Geospatial 20

PureData Impact Drive Productivity with In-Database Analytics Reduced Effort Before PureData With PureData Simpler No data movement Easy to Govern Accurate - No sampling Lower infrastructure cost Faster In-Db scoring Improved Analyst productivity

Typical Business Intelligence Challenges Quality and Risk Business work-around Temporary solutions Manual workload Quality issues Performance Poor query performance Long data load window Refresh rate too rare Solution Cost Too complex solutions Non-integrated tools Lack of documentation Outdated architecture and legacy solutions Time to Market Long project delivery time Large backlog Heavy maintenance Technical debt

Time to market? - Appliance not the main solution, but - Simplified data modelling - Ease of creating new databases - Ease of duplicating data - Decreased time used on development and testing due to improved performance - Fast load time makes several iterations of POC s more feasible

Agenda Affecto s new Data Warehouse architecture - Pains and gains DW/BI/BA Appliance - Why - What does it do - How does it solve your issues Appliance customer stories

25 3.12.2014 Kilde: Kristian Ramsrud, GOBI 2014

26 3.12.2014 Kilde: Kristian Ramsrud, GOBI 2014

27 3.12.2014 Kilde: Kristian Ramsrud, GOBI 2014

28 3.12.2014 Kilde: Kristian Ramsrud, GOBI 2014

Appliance demands October 2013

Norsk Tipping - Goals A flexible DWH which is easily loaded during the available time period. A scalable solution enabling growth without tuning and refactoring. A DWH providing good response times to end users without using aggregates. Thereby reducing the number of scheduled standard reports and moving towards self-service BI. Data that are easily accessible for the business users and analysts. A DWH where data quality issues can be corrected automatically after the problem has been identified and solved in the source system (easy to implement ETLs that can correct errors). A DWH requiring little effort to operate (DBA, system administration ) At the end of the day: Better decision support Shorter time to market Customer focused development and adaptability.

Norsk Tipping - Requirements Minimal effort to operate. Minimal effort (migration) to get started and see gains, thereby creating room for removal of complexity, refactoring etc. Gradual migration must be possible. NT choose when to switch source/target for the different jobs. Minimal effort to convert today s Oracle relational database to the new format. New environment must support several parallel test and production instances. Backup and restore must be easy. We need good failover solutions. We must be able to access tables from e.g Toad. We want to keep ETL developed in Informatica PowerCenter. Possible to do import/export db objects to/from systems in a standard format. Must support mixed workload, inserts simultaneously as analytical queries run. Must support external workload scheduling. Must cope with parallel execution of jobs. Must be easy to test, both manually and automatically.

Is Converting its Data Warehouse from Oracle to IBM PureData for Analytics Powered by Netezza

What is the main trend evolving? - Consider the many new architectures that boost performance. If your EDW is still on an SMP platform, make migration to MPP a priority. Consider distributing your data warehouse architecture, especially to offload a workload to a standalone platform that performs well with that workload. - When possible, take analytic algorithms to the data, instead of data to the algorithm (as is the DW tradition)

So, what now? Gartner: By 2015, 15% of organizations will modernize their strategies for IM capability and exhibit 20% higher financial performance than their peers. We all will have to change our data warehouse strategies. Are you going to move while you have control, take action now reaping the benefits early? or Wait and see until the circumstances force you to fight your way out of the problems?

3.12.2014 35

Thanks! bjorn.runar.nes@affecto.com espen.jorde@affecto.com www.affecto.com

Tear down the Data Warehouse 100 times faster response 50% less operational costs At least 30% shorter projects