Netezza S's. Robert Hartevelt 31 October 2012. 2012 IBM Corporation. 2012 IBM Corporation. 2012 IBM Corporation



Similar documents
Evolving Solutions Disruptive Technology Series Modern Data Warehouse

Netezza and Business Analytics Synergy

Ubrzajte svoj Data Warehouse 100 puta i više

PureSystems: Changing The Economics And Experience Of IT

Next Generation Data Warehousing Appliances

IBM Netezza High Capacity Appliance

High-Performance Business Analytics: SAS and IBM Netezza Data Warehouse Appliances

IBM Data Warehousing and Analytics Portfolio Summary

IBM Netezza High-performance business intelligence and advanced analytics for the enterprise. The analytics conundrum

IBM PureData Systems. Robert Božič 2013 IBM Corporation

Main Memory Data Warehouses

Big Data Analytics. with EMC Greenplum and Hadoop. Big Data Analytics. Ofir Manor Pre Sales Technical Architect EMC Greenplum

Danny Bryant City of Atlanta

Building your Big Data Architecture on Amazon Web Services

2010 Ingres Corporation. Interactive BI for Large Data Volumes Silicon India BI Conference, 2011, Mumbai Vivek Bhatnagar, Ingres Corporation

An Oracle White Paper June High Performance Connectors for Load and Access of Data from Hadoop to Oracle Database

Emerging Technologies Shaping the Future of Data Warehouses & Business Intelligence

Optimizing the Performance of the Oracle BI Applications using Oracle Datawarehousing Features and Oracle DAC

Poslovni slučajevi upotrebe IBM Netezze

2009 Oracle Corporation 1

Oracle BI EE Implementation on Netezza. Prepared by SureShot Strategies, Inc.

Harnessing the power of advanced analytics with IBM Netezza

Maximum performance, minimal risk for data warehousing

Eliminating Required Waste and Non-Value Added Processing in Data Warehousing a Six Sigma LEAN Perspective

Storage in Microsoft Azure Wat moet ik daarmee? Bert

Advanced In-Database Analytics

Front cover IBM PureData System for Analytics Architecture: A Platform for High Performance Data Warehousing and Analytics

Cursusrooster Technology NEDERLAND. Nederland maart juni Learn Oracle From Oracle

Optimizing Performance. Training Division New Delhi

IBM Analytics. Just the facts: Four critical concepts for planning the logical data warehouse

Einsatzfelder von IBM PureData Systems und Ihre Vorteile.

Solutions for Communications with IBM Netezza Network Analytics Accelerator

SQL Server 2012 Parallel Data Warehouse. Solution Brief

GRIDS IN DATA WAREHOUSING

EMC/Greenplum Driving the Future of Data Warehousing and Analytics

News and trends in Data Warehouse Automation, Big Data and BI. Johan Hendrickx & Dirk Vermeiren

Oracle Architecture, Concepts & Facilities

Big Data & QlikView. Democratizing Big Data Analytics. David Freriks Principal Solution Architect

James Serra Sr BI Architect

Customer Insight Appliance. Enabling retailers to understand and serve their customer

Oracle Database - Engineered for Innovation. Sedat Zencirci Teknoloji Satış Danışmanlığı Direktörü Türkiye ve Orta Asya

SQL Server PDW. Artur Vieira Premier Field Engineer

End to End Solution to Accelerate Data Warehouse Optimization. Franco Flore Alliance Sales Director - APJ

The HP Neoview data warehousing platform for business intelligence Die clevere Alternative

Minimize cost and risk for data warehousing

Dell* In-Memory Appliance for Cloudera* Enterprise

Emerging Innovations In Analytical Databases TDWI India Chapter Meeting, July 23, 2011, Hyderabad

IBM Cognos 10: Enhancing query processing performance for IBM Netezza appliances

Innovative technology for big data analytics

The Evolution of Microsoft SQL Server: The right time for Violin flash Memory Arrays

Optimizing Storage for Better TCO in Oracle Environments. Part 1: Management INFOSTOR. Executive Brief

Comprehending the Tradeoffs between Deploying Oracle Database on RAID 5 and RAID 10 Storage Configurations. Database Solutions Engineering

IBM DB2 specific SAP NetWeaver Business Warehouse Near-Line Storage Solution

Exploitation of Predictive Analytics on System z

Oracle DBA Course Contents

A financial software company

IBM System x reference architecture solutions for big data

2015 Ironside Group, Inc. 2

SAP Real-time Data Platform. April 2013

Introduction to the PureData for Analytics System (PDA) + Details on the N3001 Family

Big Data Technologies Compared June 2014

SAP HANA SAP s In-Memory Database. Dr. Martin Kittel, SAP HANA Development January 16, 2013

IBM AND NEXT GENERATION ARCHITECTURE FOR BIG DATA & ANALYTICS!

Cray: Enabling Real-Time Discovery in Big Data

Scaling Your Data to the Cloud

Lowering the Total Cost of Ownership (TCO) of Data Warehousing

Datalogix. Using IBM Netezza data warehouse appliances to drive online sales with offline data. Overview. IBM Software Information Management

Interactive data analytics drive insights

Introducing Oracle Exalytics In-Memory Machine

Instant-On Enterprise

NoSQL Performance Test In-Memory Performance Comparison of SequoiaDB, Cassandra, and MongoDB

An Oracle White Paper May Exadata Smart Flash Cache and the Oracle Exadata Database Machine

Flash Memory Arrays Enabling the Virtualized Data Center. July 2010

IBM PureData System for Operational Analytics

How To Use Exadata

Virtual Data Warehouse Appliances

Using Attunity Replicate with Greenplum Database Using Attunity Replicate for data migration and Change Data Capture to the Greenplum Database

Cost/Benefit Case for IBM PureData System for Analytics

IOS110. Virtualization 5/27/2014 1

HP Vertica. Echtzeit-Analyse extremer Datenmengen und Einbindung von Hadoop. Helmut Schmitt Sales Manager DACH

Sawmill Log Analyzer Best Practices!! Page 1 of 6. Sawmill Log Analyzer Best Practices

Big Data Analytics - Accelerated. stream-horizon.com

Five Best Practices for Maximizing Big Data ROI

Big Data Analytics: Today's Gold Rush November 20, 2013

Oracle Exadata and IBM Netezza Data Warehouse Appliance compared

Data Warehouse in the Cloud Marketing or Reality? Alexei Khalyako Sr. Program Manager Windows Azure Customer Advisory Team

Sybase IQ Supercharges Predictive Analytics

Integrated Grid Solutions. and Greenplum

EMC CUSTOMER UPDATE. 31 mei 2011 Fort Voordorp. Bart Sjerps. Greenplum Data Warehouse. Copyright 2011 EMC Corporation. All rights reserved.

SAP Analytics Roadmap for Small and Midsize Companies. Kevin Chan, Director, Solutions SAP

WHITE PAPER. Harnessing the Power of Advanced Analytics How an appliance approach simplifies the use of advanced analytics

Five Technology Trends for Improved Business Intelligence Performance

NL VMUG UserCon March

SQL Server 2012 Gives You More Advanced Features (Out-Of-The-Box)

Administering a Microsoft SQL Server 2000 Database

White Paper - GPU-Based SQL Database. SQream Technologies. SQream DB GPU-Based SQL Database Technical Overview White Paper

APRIL 22, 2014 SLIDE 1 INTRODUCTION TO VSPHERE WITH VCENTER OPERATIONS MANAGER (VSOM)

Transcription:

Netezza S's Robert Hartevelt 31 October 2012

Netezza S's Introduction to Netezza 10 minutes Q&A Speed & Smart 30 minutes NL Customer experiences Simplicity & Scalable

Netezza S's Introduction to Netezza

Why workload optimized systems for Analytics? days for a single query constant tuning - Nearly 70% of data warehouses experience performance-constrained issues of various types. specialized resources required Gartner 2010 Magic Quadrant months to deploy 4

Traditional data warehouses are just too complex They are based on databases optimized for transaction processing NOT to meet the demands of advanced analytics on big data. Too complex an infrastructure Too complicated to deploy Too much tuning required Too inefficient at analytics Too many people needed to maintain Too costly to operate Too long to get answers 5 5

Netezza s revolutionary approach The Appliance Simpler, faster, more accessible analytics This is what Netezza has done in the data warehousing market: It has totally changed the way we think about data warehousing. - Philip Howard, Bloor Research 6

Appliance Dedicated Device Optimized for purpose Complete solution Fast installation Very easy operation Standard interfaces Low cost 7

The IBM Netezza Appliance Revolutionizing Analytics Purpose-built analytics engine Integrated database, server & storage Standard interfaces Low total cost of ownership Speed: 10-100x faster than traditional systems Simplicity: Minimal administration and tuning Scalability: Peta-scale user data capacity Smart: High-performance advanced analytics

Netezza S's Introduction to Netezza So What? Purpose build Appliance

Netezza S's Introduction to Netezza Speed & Smart

The IBM Netezza Architecture Optimized Hardware + Software Purpose-built for high performance analytics Requires no tuning Streaming Data Hardware-based query acceleration Blistering fast results True MPP All processors fully utilized Maximum speed and efficiency Deep Analytics Complex analytics executed in-database Deeper insights

Massively Parallel Processing Architecture Divide and conquer MPP Shared Nothing concept Divides the work in smaller tasks A big task is sliced vertically into a series of smaller tasks The smaller tasks run independently The work is automatically balanced among the tasks to minimize the time to complete Each task is assigner the same amount of physical resources Communication between is made only at the beginning and end of the task Benefits A large task completes in a short elapsed time Maximizes use of resources Points of Attention Complexity on administration and management Communication bottlenecks 14

The IBM Netezza AMPP Architecture FPGA Memory CPU Advanced Analytics FPGA Memory CPU Host Hosts ETL BI FPGA Memory CPU Loader Disk Enclosures S-Blades Network Fabric Applications IBM Netezza Appliance

Netezza S's Introduction to Netezza So What? Purpose build out of the box Performance Speed & Smart

Netezza S's Introduction to Netezza Speed & Smart Simplicity & Scalable

Appliance Simplicity No indexes and tuning No storage administration No dbspace/tablespace sizing and configuration No redo/physical/logical log sizing and configuration No page/block sizing and configuration for tables No extent sizing and configuration for tables No Temp space allocation and monitoring No RAID level decisions for dbspaces No logical volume creations of files No integration of OS kernel recommendations No maintenance of OS recommended patch levels No JAD sessions to configure host/network/storage No software installation

Traditional Complexity... Netezza Simplicity ORACLE Indexes CREATE INDEX "MRDWDDM"."RDWF_DDM_ROOMS_SOLD_IDX1" ON "RDWF_DDM_ROOMS_SOLD" ("ID_PROPERTY", "ID_DATE_STAY", "CD_ROOM_POOL", "CD_RATE_PGM", "CD_RATE_TYPE", "CD_MARKET_SEGMENT" ) PCTFREE 10 INITRANS 6 MAXTRANS 255 STORAGE( FREELISTS 10) TABLESPACE "DDM_DATAMART_INDEX_L" NOLOGGING PARALLEL ( DEGREE 4 INSTANCES 1) LOCAL(PARTITION "PART1" PCTFREE 10 INITRANS 6 MAXTRANS 255 STORAGE(INITIAL 4194304 NEXT 4259840 MINEXTENTS 1 MAXEXTENTS 100000 PCTINCREASE 0 FREELISTS 10 FREELIST GROUPS 1 BUFFER_POOL DEFAULT) TABLESPACE "DDM_DATAMART_INDEX_L" NOLOGGING, PARTITION "PART2" PCTFREE 10 INITRANS 6 MAXTRANS 255 STORAGE(INITIAL 4194304 NEXT 4259840 MINEXTENTS 1 MAXEXTENTS 100000 PCTINCREASE 0 FREELISTS 10 FREELIST GROUPS 1 BUFFER_POOL DEFAULT) TABLESPACE "DDM_DATAMART_INDEX_L" NOLOGGING, ORACLE Bitmap index PARTITION "PART3" PCTFREE 10 INITRANS 6 MAXTRANS 255 STORAGE(INITIAL 4194304 NEXT 4259840 MINEXTENTS 1 MAXEXTENTS 100000 PCTINCREASE 0 CREATE BITMAP INDEX "CRDBO"."SNAPSHOT_MONTH_IDX13" ON FREELISTS 10 FREELIST GROUPS 1 BUFFER_POOL DEFAULT) TABLESPACE "SNAPSHOT_OPPTY_MONTH_HIST" ("SNAPSHOT_YEAR" ) PCTFREE 10 INITRANS 2 "DDM_DATAMART_INDEX_L" NOLOGGING, PARTITION "PART4" PCTFREE 10 INITRANS 6 MAXTRANS 255 STORAGE(INITIAL 4194304 NEXT 4194304 MINEXTENTS 2 MAXEXTENTS MAXTRANS 255 STORAGE(INITIAL 4194304 NEXT 4259840 MINEXTENTS 1 MAXEXTENTS 2147483645 PCTINCREASE 0 FREELISTS 1 FREELIST GROUPS 1 BUFFER_POOL 100000 PCTINCREASE 0 FREELISTS 10 FREELIST GROUPS 1 BUFFER_POOL DEFAULT) DEFAULT) TABLESPACE "SFA_DATAMART_INDEX" NOLOGGING ; TABLESPACE "DDM_DATAMART_INDEX_L" NOLOGGING, PARTITION "PART5" PCTFREE 10 ORACLE Table Clusters INITRANS 6 MAXTRANS 255 STORAGE(INITIAL 4194304 NEXT 4259840 MINEXTENTS 1 MAXEXTENTS 100000 PCTINCREASE 0 FREELISTS 10 FREELIST GROUPS 1 BUFFER_POOL DEFAULT) TABLESPACE "DDM_DATAMART_INDEX_L" NOLOGGING, PARTITION "PART6" CREATE CLUSTER "MRDW"."CT_INTRMDRY_CAL" ("ID_YEAR_CAL" NUMBER(4, 0), PCTFREE 10 INITRANS 6 MAXTRANS 255 STORAGE(INITIAL 4194304 NEXT 4259840 "ID_MONTH_CAL" NUMBER(2, 0), "ID_PROPERTY" NUMBER(5, 0)) SIZE 16384 MINEXTENTS 1 MAXEXTENTS 100000 PCTINCREASE 0 FREELISTS 10 FREELIST GROUPS PCTFREE 10 PCTUSED 90 INITRANS 3 MAXTRANS 255 STORAGE(INITIAL 1 BUFFER_POOL DEFAULT) TABLESPACE "DDM_DATAMART_INDEX_L" NOLOGGING ) ; 83886080 NEXT 41943040 MINEXTENTS 1 MAXEXTENTS 1017 PCTINCREASE 0 FREELISTS 4 FREELIST GROUPS 1 BUFFER_POOL RECYCLE) TABLESPACE "TSS_FACT" ; Netezza No indexes CREATE TABLE MRDWDDM.RDWF_DDM_ROOMS_SOLD ( ID_PROPERTY numeric(5, 0) NOT NULL, ID_DATE_STAY integer NOT NULL, CD_ROOM_POOL CHAR(4) NOT NULL, CD_RATE_PGM CHAR(4) NOT NULL, CD_RATE_TYPE CHAR(1) NOT NULL, CD_MARKET_SEGMENT CHAR(2) NOT NULL, ID_CONFO_NUM_ORIG integer NOT NULL, ID_CONFO_NUM_CUR integer NOT NULL, ID_DATE_CREATE integer NOT NULL, ID_DATE_ARRIVAL integer NOT NULL, ID_DATE_DEPART integer NOT NULL, QY_ROOMS integer NOT NULL, CU_REV_PROJ_NET_LOCAL numeric(21, 3) NOT NULL, CU_REV_PROJ_NET_USD numeric(21, 3) NOT NULL, QY_DAYS_STAY_CUR smallint NOT NULL, CD_BOOK_SOURCE CHAR(1) NOT NULL) distribute on random; No Physical Tuning/Admin Stripe data randomly, or by Columns

IBM Netezza Models 1 10......... Capacity Compression = User Data space = Effective User Data Space 21

Netezza S's Introduction to Netezza So What? Purpose build Meet BI demands out of the box Speed & Smart Simple gives new opportunities Simplicity & Scalable

Netezza S's Introduction to Netezza Speed & Smart NL Customer experiences Simplicity & Scalable

Netezza4ING NetInsight/Netezza Implementation 2012 F Deden 31-10-2012 BANKING INVESTMENTS LIFE INSURANCE RETIREMENT SERVICES 24

NetInsight Overview NetInsight is ING s main Web Analytics tool from IBM NetInsight is used to measure daily clicks on ING portal Increasing importance of Web Analytics for: Customized Campaigns Fraud detection Analyzing purposes 25

NetInsight Oracle History First NetInsight Oracle release December 2010 No State of the Art hardware Oracle 10 Increasing number of clicks Issues Slow performance Long running night batch No pre-generated reports One active profile Second NetInsight Oracle release December 2011 State of the Art hardware Oracle 11g Increasing number of clicks Issues Recurring Unique constraint error Unacceptable slow ad-hoc reporting times (up to 12 hours) Backup sizing and time to restore too large Deleting of historical data with NetInsight functionality not possible One active profile 26

NetInsight Netezza POC Proof of Concept with Netezza from February 2012 to March 2012 Followed by Project, ending 1 August 2012 Acceptance criteria Processing the internet clicks with sufficient throughput to have a manageable batch window. Webanalytics standard and ad-hoc reporting with sufficient performance to make it usable. Demonstrate that Unique constraint doesn t appear in this configuration Deleting a day of data can be done within 2 hours Demonstrate that data can be deleted within 4 hours Demonstrate that data can be backed up and restored The Unique constraint error doesn t appear in the POC setup. It can perform a (daily) batch load (against requirements of 2013 (50 million clicks) with a throughput of 10 million clicks per hour. It can perform the Reporting Requirements. Hardware Blade P70, 8 CPU s, 64 Gbyte memory, no extra storage 27

POC/Project Results 28

Plus Retail Challenge Ruud Stroet, Architect Plus Retail De vraag vanuit de Analisten: laat mij queries uitvoeren op de beschikbare datamarts De vraag vanuit de Ondernemers: standaard rapportage kunnen opvragen De uitdaging: Analisten zullen zware queries uitvoeren waarbij het risico op een instabiele performance voor de Ondernemersrapportage het gevolg kan zijn. Hierop moeten we als IT organisatie voorbereid zijn. 29

Plus Retail Action Ruud Stroet, Architect Plus Retail We hebben samen met onze partner AXI gekeken welke stappen genomen moeten worden om de uitdaging te kunnen pareren. De vraagstelling op een Datamart betreft een andere invalshoek dan we gewend zijn. Na een zoektocht vond ik een interessante technologie waarbij data opgedeeld wordt in behapbare blokken. Deze opdeling van data wordt toegekend aan een Netezza TwinFin, welke in staat is om middels ANSI standaard instructies snel uit te kunnen laten voeren. 30

Plus Retail Result Testen zijn nog in volle gang op dit moment. Maar zien er nu al veel belovend uit. We kijken nu ook naar SPSS omdat we de mogelijkheid hebben gecreeerd om door onze hele dataset te lezen op bon-niveau. Hiermee hopen we nog meer inzichten te verkrijgen uit onze data. 31

Netezza S's Introduction to Netezza So What? Purpose build Meet BI demands out of the box Speed & Smart Simple gives new opportunities Done before NL Customer experiences Simplicity & Scalable

Netezza S's Introduction to Netezza Q&A Speed & Smart NL Customer experiences Simplicity & Scalable

Questions

Netezza S's Introduction to Netezza Q&A So What? Purpose build Meet BI demands out of the box Speed & Smart Simple gives new opportunities Done before You can found me in the Netezza boot NL Customer experiences Simplicity & Scalable

Page 36 36

37