Integrating Netezza into your existing IT landscape



Similar documents
Luncheon Webinar Series May 13, 2013

Beyond the Single View with IBM InfoSphere

Service Oriented Data Management

IBM InfoSphere Discovery: The Power of Smarter Data Discovery

Klarna Tech Talk: Mind the Data! Jeff Pollock InfoSphere Information Integration & Governance

Data warehouse and Business Intelligence Collateral

IBM AND NEXT GENERATION ARCHITECTURE FOR BIG DATA & ANALYTICS!

The Impact of PaaS on Business Transformation

Data Virtualization for Agile Business Intelligence Systems and Virtual MDM. To View This Presentation as a Video Click Here

ENTERPRISE EDITION ORACLE DATA SHEET KEY FEATURES AND BENEFITS ORACLE DATA INTEGRATOR

Data Integration Checklist

MDM and Data Warehousing Complement Each Other

Industry models for insurance. The IBM Insurance Application Architecture: A blueprint for success

III JORNADAS DE DATA MINING

Integrated Data Management: Discovering what you may not know

Quickly Deploy Microsoft Private Cloud and SQL Server 2012 Data Warehouse on Hitachi Converged Solutions. September 25, 2013

Building Confidence in Big Data Innovations in Information Integration & Governance for Big Data

Advanced In-Database Analytics

ORACLE DATA INTEGRATOR ENTERPRISE EDITION

EII - ETL - EAI What, Why, and How!

IBM InfoSphere Optim Test Data Management

Applied Business Intelligence. Iakovos Motakis, Ph.D. Director, DW & Decision Support Systems Intrasoft SA

<Insert Picture Here> Oracle Data Integration 11g Overview Tim Sawyer

A Whole New World. Big Data Technologies Big Discovery Big Insights Endless Possibilities

Lection 3-4 WAREHOUSING

Integrating SAP and non-sap data for comprehensive Business Intelligence

Cloud First Does Not Have to Mean Cloud Exclusively. Digital Government Institute s Cloud Computing & Data Center Conference, September 2014

IBM Data Warehousing and Analytics Portfolio Summary

ORACLE DATA INTEGRATOR ENTERPRISE EDITION

Agile BI With SQL Server 2012

Chapter 6 Basics of Data Integration. Fundamentals of Business Analytics RN Prasad and Seema Acharya

Data Integration Alternatives & Best Practices

1 Copyright 2011, Oracle and/or its affiliates. All rights reserved. Insert Information Protection Policy Classification from Slide 7

SQL Server 2012 Gives You More Advanced Features (Out-Of-The-Box)

Accelerating the path to SAP BW powered by SAP HANA

Einsatzfelder von IBM PureData Systems und Ihre Vorteile.

ORACLE FINANCIAL SERVICES ANALYTICAL APPLICATIONS INFRASTRUCTURE

Ten Things You Need to Know About Data Virtualization

Oracle Business Intelligence 11g Business Dashboard Management

IT FUSION CONFERENCE. Build a Better Foundation for Business

Data Virtualization. Paul Moxon Denodo Technologies. Alberta Data Architecture Community January 22 nd, Denodo Technologies

MDM for the Enterprise: Complementing and extending your Active Data Warehousing strategy. Satish Krishnaswamy VP MDM Solutions - Teradata

Business Intelligence

BUSINESSOBJECTS DATA INTEGRATOR

Practical Considerations for Real-Time Business Intelligence. Donovan Schneider Yahoo! September 11, 2006

Emerging Technologies Shaping the Future of Data Warehouses & Business Intelligence

JOURNAL OF OBJECT TECHNOLOGY

Information Architecture

Data Integration Overview

Unified Data Integration Across Big Data Platforms

White Paper. Unified Data Integration Across Big Data Platforms

Industry models for financial markets. The IBM Financial Markets Industry Models: Greater insight for greater value

Analance Data Integration Technical Whitepaper

SAP Sybase Replication Server What s New in SP100. Bill Zhang, Product Management, SAP HANA Lisa Spagnolie, Director of Product Marketing

2009 Oracle Corporation 1

ORACLE DATA INTEGRATOR ENTEPRISE EDITION FOR BUSINESS INTELLIGENCE

Oracle Database 12c Plug In. Switch On. Get SMART.

Getting it Right: How to Find the Right BI Package for the Right Situation Norma Waugh. RMOUG Training Days February 15-17, 2011

W H I T E P A P E R B u s i n e s s I n t e l l i g e n c e S o lutions from the Microsoft and Teradata Partnership

Providing real-time, built-in analytics with S/4HANA. Jürgen Thielemans, SAP Enterprise Architect SAP Belgium&Luxembourg

Big Data, Integration and Governance: Ask the Experts

Data Warehousing. Jens Teubner, TU Dortmund Winter 2015/16. Jens Teubner Data Warehousing Winter 2015/16 1

Next Generation Data Warehousing Appliances

OPERATIONAL DATA STORE

ORACLE BUSINESS INTELLIGENCE SUITE ENTERPRISE EDITION PLUS

Introducing Oracle Data Integrator and Oracle GoldenGate Marco Ragogna

An Oracle White Paper March Best Practices for Real-Time Data Warehousing

Using Business Intelligence to Achieve Sustainable Performance

Business Intelligence and Healthcare

ORACLE BUSINESS INTELLIGENCE SUITE ENTERPRISE EDITION PLUS

What s New with Informatica Data Services & PowerCenter Data Virtualization Edition

IBM Software Integrating and governing big data

Business Intelligence In SAP Environments

Analance Data Integration Technical Whitepaper

Innovative technology for big data analytics

Cloud Ready Data: Speeding Your Journey to the Cloud

FIFTH EDITION. Oracle Essentials. Rick Greenwald, Robert Stackowiak, and. Jonathan Stern O'REILLY" Tokyo. Koln Sebastopol. Cambridge Farnham.

ORACLE TAX ANALYTICS. The Solution. Oracle Tax Data Model KEY FEATURES

NEWLY EMERGING BEST PRACTICES FOR BIG DATA

SAS Enterprise Data Integration Server - A Complete Solution Designed To Meet the Full Spectrum of Enterprise Data Integration Needs

Business Intelligence

2015 Ironside Group, Inc. 2

Course 10977A: Updating Your SQL Server Skills to Microsoft SQL Server 2014

Welcome. Changes and Choices

Introducing Oracle Exalytics In-Memory Machine

EMC/Greenplum Driving the Future of Data Warehousing and Analytics

Big data management with IBM General Parallel File System

Master Your Data and Your Business Using Informatica MDM. Ravi Shankar Sr. Director, MDM Product Marketing

Course DSS. Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization

High-Performance Business Analytics: SAS and IBM Netezza Data Warehouse Appliances

Data Warehouse: Introduction

<Insert Picture Here> Oracle BI Standard Edition One The Right BI Foundation for the Emerging Enterprise

Data virtualization: Delivering on-demand access to information throughout the enterprise

Real-time Data Replication

Patrick Firouzian, ebay

Gradient An EII Solution From Infosys

Transcription:

Marco Lehmann Technical Sales Professional Integrating Netezza into your existing IT landscape 2011 IBM Corporation

Agenda How to integrate your existing data into Netezza appliance? 4 Steps for creating a trusted Warehouse. Enabling Netezza for multiple site desaster recovery

How to integrate your existing data into Netezza appliance?

Information Management How will you integrate your silos of information with this appliance? FRONT LINE / BI APPLICATIONS OLAP InfoSphere Information Server IBM Netezza INFORMATION INTEGRATION / DATA QUALITY / ETL / REAL-TIME SOURCE SYSTEMS, DATA MARTS, SILOS 4 M CO N MO M A AT D A ET

Key hurdles in creating & deploying a Warehouse and Business Intelligence environment 5 Defining Business Analytics and Its Impact on Organizational Decision-Making, February 2009, ComputerWorld

Information Server integrates siloed data with Netezza for trusted decisions Network Integration Data Quality Netezza Data Warehousing Solutions Channels Divisional customer Billing Billing analysis Buyers Integration ETL Data Quality Advanced Discovery Reusable Mapping & Blueprinting Comprehensive Data Lineage Application Connectivity Packs Integrated Change Data Capture Scalable transformation & data delivery Source /Target analysis Enterprise class scalability Flexible and accurate matching with Probabilistic matching engine Customizable out-of-box integrated rule sets 6

3 Reasons why you need Information Server + Netezza 1. Lower total cost of ownership Reusable components for scalable development efficiencies Support for automated comprehensive data analysis High performance parallel engine for rapid iterations Deliver self service for business users by increasing transparency in data understanding and provenance 2. Accelerate deployment & time to value Highly visual & automated development environment Best practices & methodologies ensure project success Exhaustive pre-built connectivity User centric tooling for Business and IT Collaboration 3. Increase LOB trust & confidence Automated data quality monitoring and cleansing Consistent understanding of enterprise vocabulary with business term definition and documentation Know where information comes from with data lineage 7

4 Steps for creating a trusted Warehouse.

Information Management So how does Information Server with Netezza actually work? Understand Cleanse Transform & Deliver FRONT LINE / BI APPLICATIONS OLAP InfoSphere Information Server IBM Netezza INFORMATION INTEGRATION / DATA QUALITY / ETL / REAL-TIME SOURCE SYSTEMS, DATA MARTS, SILOS 9 M CO N MO M A AT D A ET

Understand the Process to build a Data Warehouse Understand Cleanse Transform & Deliver Increase project success rates: Initiate warehouse integration projects from trusted blueprints Reusable IBM best practices and methodology Reference architectures Tunable for your environment Task management IBM expertise built right into an automated data warehouse template for guiding your project Vision, Execution, Completion 10

Understand Data Sources, Lineage and Business Terms Understand Cleanse Transform & Deliver Speed time to deployment Industry s most advanced data discovery capabilities accelerate technical discovery by 10x Column analysis Fully automated primary-foreign key discovery Cross-source overlap analysis Cross-source transformation discovery Prototyping of data consolidation Modeling tools and industry data models accelerate modeling of the data warehouse Lower development, enhancement and audit costs Achieve consistent understanding of enterprise business vocabulary Transparency into data lineage and change management enables self service and faster, more cost effective updates Cross-tool impact analysis Business and technical data lineage 11

Workflow 1 Start from consistent blueprint, leveraging best practices 2 4 6 Create new transformation rules & document Document KPIs and associated business terms 3 Create or Modify Data Model 5 Identify where KPI s exist, relationship and level of quality 7 Report & Govern on metadata assets Assess & monitor your data quality over time

Cleanse Data going into the Warehouse Understand Cleanse Transform & Deliver Increase trust in your data warehouse information: Best in class data cleansing capabilities help load data warehouse with clean data upfront Name and address cleansing Global name recognition Probabilistic and deterministic algorithms Ongoing Data Quality monitoring Monitor data quality and data relationship quality with customized rules Exception management for reviewing and management the data exception lifecycle 13

Transform and Deliver data into your Netezza in a timely fashion Understand Cleanse Transform & Deliver Deliver even faster time to value: Increase developer efficiency Top down design Highly visual dev environment Enhanced collaboration through design asset reuse Iterate quickly and often High performance delivery options with flexible deployments Support for multiple delivery styles: ETL, ELT, Change Data Capture, SOA integration etc. High performance, parallel engine Rapidly integrates your existing and future environments Exhaustive pre-built connectivity Pre-integrated with Netezza Build once and scale with your hardware requirements 14

So Why Information Server? InfoSphere Information Server Integrating and transforming data and content to deliver accurate, consistent, timely and complete information to your Netezza Warehouse appliance Information Server is better Single integration platform with unified metadata The Difference True parallel framework - Design once, Deploy Anywhere Information Server is faster Automated Data Discovery Data integration planning & methodologies Information Server does not introduce risk The power of one vendor, one team Long history of proven warehouse deployments with Netezza 15 Information Server is extremely cost effective Lower cost, simplified packaging & deployment Supports warehouse growth

Enabling Netezza for multiple site desaster recovery and high availability

Common Requirements for Replication and Data Distribution High availability and continuous access to DW and BI applications Disaster recovery solution Scalable infrastructure supporting: Growing user population Higher levels of concurrency Wider geographic coverage for distributed users Data transformation between heterogeneous systems and data models (ODS, OLAP, OLTP, EDW) 17

IBM Netezza Replication Phase 1 Architecture Geographically wide asynchronous replication focused on: Disaster recovery Reporting scalability Replication method: SQL statement-level replication: Replication of load files SQL statements replay for DML/DDL (inserts, updates, deletes) /DCL The advantages of this approach: Low bandwidth requirements Minimal-to-none performance impact on production queries

Phase 1 Architecture: Geographically Wide, Asynchronous Replication Target BI & EDW Applications PTS BI & EDW Applications Master LAN Files Loads WAN ETL System LAN PTS Target BI & EDW Applications Trickle Feed Updates PTS LAN PTS Persistent Transport System

Persistent Transport System (PTS) External server collocated with every node in replication cluster PTS has three major purposes: Move data/files from one node to another Send control messages from one node to another Act as a persistent store for recovery from failures Automatic copy of data and sync from master to target PTS management software distributed to the server directly by Netezza host

Phase 1: Additional Features and Considerations Supported platforms: Netezza TwinFin (now known as IBM Netezza 1000) Remote site initialization using Truck mode Using full backup from the master to initialize targets nzsql command line interface for management and monitoring Replication granularity: a single database Manual selection of new master in case of failure Client-based load balancing across replication cluster WAN bandwidth dictated by latency and load requirements

Phase 1 Use Case: Disaster Recovery ETL and micro-batch loads into production data warehouse Remote site used for disaster recovery; optionally for test & dev Failover to remote data center manually controlled process Users able to access data during maintenance windows During temporary power outage no automated fail-over is required

Beyond Phase 1: The Future for Replication and Distribution Bi-directional replication Support multiple writers (masters), in separate databases with no conflict resolution Local High Availability Cluster Queryable archive and uniform data access Provide DR and archival facilities while maintaining access to the data Replication between IBM Netezza 1000 and IBM Netezza High Capacity Appliance Data transformation between heterogeneous systems and data models (ODS, OLAP, OLTP, EDW) into and from IBM Netezza Utilizing IBM s InfoSphere Change Data Capture (CDC) Improved and integrated user interface

IBM Netezza Roadshow am 1. Dezember 2011 im KochWerk in Frankfurt am Main. ibm.com/software/de/data/netezza/ 25 Marco Lehmann marco.lehmann@de.ibm.com Telefon: +4915115162301