An Oracle White Paper February 2009. Real-time Data Warehousing with ODI-EE Changed Data Capture



Similar documents
An Oracle White Paper March Best Practices for Real-Time Data Warehousing

An Oracle White Paper February Oracle Data Integrator 12c Architecture Overview

ENTERPRISE EDITION ORACLE DATA SHEET KEY FEATURES AND BENEFITS ORACLE DATA INTEGRATOR

Oracle Data Integrator and Oracle Warehouse Builder Statement of Direction

ORACLE DATA INTEGRATOR ENTEPRISE EDITION FOR BUSINESS INTELLIGENCE

An Oracle White Paper October Oracle Data Integrator 12c New Features Overview

Oracle Data Integrator and Oracle Warehouse Builder Statement of Direction

An Oracle White Paper July Introducing the Oracle Home User in Oracle Database 12c for Microsoft Windows

Oracle Data Integrator Technical Overview. An Oracle White Paper Updated December 2006

An Oracle White Paper November Oracle Business Intelligence Standard Edition One 11g

Oracle Business Intelligence Enterprise Edition Plus and Microsoft Office SharePoint Server. An Oracle White Paper October 2008

Oracle Fusion Middleware

Oracle Data Integrator 12c (ODI12c) - Powering Big Data and Real-Time Business Analytics. An Oracle White Paper October 2013

An Oracle White Paper March Managing Metadata with Oracle Data Integrator

An Oracle White Paper June Integration Technologies for Primavera Solutions

Comprehensive Data Quality with Oracle Data Integrator. An Oracle White Paper Updated December 2007

Oracle Identity Management: Integration with Windows. An Oracle White Paper December. 2004

Oracle SQL Developer Migration. An Oracle White Paper September 2008

Oracle Primavera Gateway

Express Implementation for Electric Utilities

ORACLE DATA INTEGRATOR ENTERPRISE EDITION

Manage Oracle Database Users and Roles Centrally in Active Directory or Sun Directory. Overview August 2008

Oracle Role Manager. An Oracle White Paper Updated June 2009

An Oracle Communications White Paper December Serialized Asset Lifecycle Management and Property Accountability

An Oracle White Paper November Leveraging Massively Parallel Processing in an Oracle Environment for Big Data Analytics

Oracle SQL Developer Migration

ORACLE DATA INTEGRATOR ENTERPRISE EDITION

Oracle Identity Management Concepts and Architecture. An Oracle White Paper December 2003

MANAGING A SMOOTH MARKETING AUTOMATION SOFTWARE IMPLEMENTATION

Achieving Sarbanes-Oxley Compliance with Oracle Identity Management. An Oracle White Paper September 2005

Integrating Tutor and UPK Content: A Complete User Documentation Solution. An Oracle White Paper April 2008

Ensuring Web Service Quality for Service-Oriented Architectures. An Oracle White Paper June 2008

One View Report Samples Warehouse Management

An Oracle White Paper August Automatic Data Optimization with Oracle Database 12c

Oracle FLEXCUBE Direct Banking Release Retail Credit Card User Manual. Part No. E

Oracle Easy Connect Naming. An Oracle White Paper October 2007

March Oracle Business Intelligence Discoverer Statement of Direction

An Oracle White Paper November Upgrade Best Practices - Using the Oracle Upgrade Factory for Siebel Customer Relationship Management

One View Report Samples Financials

An Oracle White Paper. Using Oracle GoldenGate to Achieve Operational Reporting for Oracle Applications

The Role of Data Integration in Public, Private, and Hybrid Clouds

Oracle FLEXCUBE Direct Banking Release Corporate E-Factoring User Manual. Part No. E

Monitoring and Diagnosing Production Applications Using Oracle Application Diagnostics for Java. An Oracle White Paper December 2007

Highmark Unifies Identity Data With Oracle Virtual Directory. An Oracle White Paper January 2009

Maximum Availability Architecture. Oracle Best Practices For High Availability. Backup and Recovery Scenarios for Oracle WebLogic Server: 10.

10 Questions to Ask Your On-Demand Contact Center Provider. An Oracle White Paper September 2006

An Oracle White Paper May Distributed Development Using Oracle Secure Global Desktop

Oracle Enterprise Single Sign-on Technical Guide An Oracle White Paper June 2009

Business Intelligence and Service Oriented Architectures. An Oracle White Paper May 2007

Oracle Real-Time Scheduler Benchmark

An Oracle White Paper February Integration with Oracle Fusion Financials Cloud Service

Oracle Communications Network Discovery Overview. Updated June 2007

An Oracle White Paper March Oracle s Single Server Solution for VDI

An Oracle White Paper Released Sept 2008

ORACLE BUSINESS INTELLIGENCE SUITE ENTERPRISE EDITION PLUS

Migration Best Practices for OpenSSO 8 and SAM 7.1 deployments O R A C L E W H I T E P A P E R M A R C H 2015

An Oracle White Paper November Oracle Primavera P6 EPPM Integrations with Web Services and Events

An Oracle White Paper June Oracle Database Firewall 5.0 Sizing Best Practices

ORACLE BUSINESS INTELLIGENCE SUITE ENTERPRISE EDITION PLUS

Oracle Cloud Platform. For Application Development

An Oracle White Paper June Security and the Oracle Database Cloud Service

The Case for a Stand-alone Rating Engine for Insurance. An Oracle Brief April 2009

For Midsize Organizations. Oracle Product Brief Oracle Business Intelligence Standard Edition One

Oracle Business Intelligence ADF Custom Visualizations and Integration. An Oracle White Paper November 2012

Oracle Insurance General Agent Hardware and Software Requirements. Version 8.0

April Oracle Higher Education Investment Executive Brief

Technical Upgrade Considerations for JD Edwards World Customers. An Oracle White Paper February 2013

Migrating Non-Oracle Databases and their Applications to Oracle Database 12c O R A C L E W H I T E P A P E R D E C E M B E R

Oracle Directory Services Integration with Database Enterprise User Security O R A C L E W H I T E P A P E R F E B R U A R Y

An Oracle White Paper July Accelerating Database Infrastructure Using Oracle Real Application Clusters 11g R2 and QLogic FabricCache Adapters

Lowering E-Discovery Costs Through Enterprise Records and Retention Management. An Oracle White Paper March 2007

Oracle FLEXCUBE Direct Banking Release Retail Mutual Funds User Manual. Part No. E

Oracle Net Services for Oracle10g. An Oracle White Paper May 2005

An Oracle White Paper April Oracle Fusion Talent Management Overview

Driving Down the High Cost of Storage. Pillar Axiom 600

An Oracle White Paper October Frequently Asked Questions for Oracle Forms 11g

Top Ten Reasons for Deploying Oracle Virtual Networking in Your Data Center

Oracle Application Development Framework Overview

AGILE PRODUCT LIFECYCLE MANAGEMENT INTEGRATION PACK FOR JD EDWARDS ENTERPRISEONE: DESIGN TO RELEASE

Oracle Insurance Policy Administration System Quality Assurance Testing Methodology. An Oracle White Paper August 2008

An Oracle White Paper May Oracle Audit Vault and Database Firewall 12.1 Sizing Best Practices

An Oracle White Paper September Directory Services Integration with Database Enterprise User Security

ORACLE RAPID PLANNING

An Oracle White Paper January Access Certification: Addressing & Building on a Critical Security Control

An Oracle White Paper September Oracle Team Productivity Center

Oracle BI Publisher Enterprise Cluster Deployment. An Oracle White Paper August 2007

ORACLE MANUFACTURING EXECUTION SYSTEM FOR PROCESS MANUFACTURING

An Oracle White Paper March Integrating Microsoft SharePoint Server With Oracle Virtual Directory

Oracle Identity Analytics Architecture. An Oracle White Paper July 2010

Oracle Fusion Human Capital Management Overview and Frequently Asked Questions

June, 2015 Oracle s Siebel CRM Statement of Direction Client Platform Support

An Oracle White Paper February Oracle Data Integrator Performance Guide

Transcription:

An Oracle White Paper February 2009 Real-time Data Warehousing with ODI-EE Changed Data Capture

Executive Overview Today s integration project teams face the daunting challenge of deploying integrations that fully meet functional, performance, and quality specifications on time and within budget. Further, these processes must be maintainable over time, and the completed work should be reusable for further, broader integration initiatives. Traditional Extract, Transform, Load (ETL) tools closely intermix data transformation rules with the integration process procedures, requiring the development of both data transformations and data flow. ODI-EE takes a different approach to integration by clearly separating the declarative rules (the what ) from the actual implementation (the how ). With ODI-EE, declarative rules describing mappings and transformations are defined graphically, through a drag-and-drop interface, and stored independently from the implementation. ODI-EE automatically generates the data flow, which can be fine-tuned if required. This declarative design approach dramatically reduces the development and maintenance workload by up to five times over traditional ETL tools. This technical brief describes ODI-EE s use of Changed Data Capture to track changes in source data changes that are typically performed by other applications. When running integration interfaces, ODI-EE can reduce the volume of source data processed in the flow by extracting only the changed data. Then, only those changes are moved to target systems, tremendously improving the efficiency of data integration processes. Reducing the volume of source data is useful in many fields, such as data synchronization and replication. And it is essential when setting up an event-driven architecture for integration. Introduction The conventional approach to data integration involves extracting all data from the source system and then integrating the entire set possibly using an incremental strategy in the target system. This approach, which is suitable in most cases, can be inefficient when the integration process requires real-time data integration. In such situations, the amount of data involved makes data integration impossible in the given timeframes. 1

Reducing the volume of the source data to only changed data is a major issue in many fields, such as data synchronization and replication. It is an absolute requirement when implementing event-driven integration processes. Basic solutions, such as filtering records according to a timestamp column or changed flag, are possible, but they might require modifications in the applications. In addition, they usually do not sufficiently ensure that all changes are taken into account. ODI-EE s Changed Data Capture identifies and captures data as it is being inserted, updated, or deleted from datastores, and it makes the changed data available for integration processes. Today Oracle includes Oracle Data Integrator and Oracle Warehouse Builder Enterprise ETL, formerly an add-on option to Oracle Database, as the two components of ODI-EE. Going forward, these products will merge into a single unified data integration technology platform. This strategy fully preserves any existing development investments of all Oracle data integration customers and will provide a seamless, easy upgrade path from the current components to the unified platform. Customers can safely choose either component as the basis for implementations today. For further advice on the implementation choices, please contact your Oracle sales representative. Publish-and-Subscribe Model Changed Data Capture uses a publish-and-subscribe model. This model works in three steps: 1. An identified subscriber, usually an integration process, subscribes to changes that might occur in a datastore. Multiple subscribers can subscribe to these changes. 2. The Changed Data Capture framework captures changes in the datastore and then publishes them for the subscriber. 3. The subscriber an integration process can process the tracked changes at any time and consume these events. Once consumed, events are no longer available for this subscriber. ODI-EE processes datastore changes in two ways: Regularly in batches (pull mode) for example, processes new orders from the Web site every five minutes and loads them into the operational datastore (ODS) 2

In real time (push mode) as the changes occur for example, when a product is changed in the enterprise resource planning (ERP) system, immediately updates the on-line catalog Figure 1: The Changed Data Capture (CDC) framework uses a publish-and-subscribe architecture. IMPLEMENTING CHANGED DATA CAPTURE Methods for Tracking Changes ODI-EE provides two methods for tracking changes from source datastores to the Changed Data Capture framework: triggers and relational database management system (RDBMS) log mining. The triggers method creates triggers on the source tables to track changes as data is inserted, updated, or deleted. This method can be implemented on most RDBMS, but it can have an impact on the transactional performance of the source systems. The second method involves mining the RDBMS logs, which are the internal change history of the database engine. This method has no effect on the system s transactional performance; it is database-specific. This method is supported out-of-the-box for Oracle Database (through the Log Miner feature) and IBM DB2. The Changed Data Capture framework used to manage changes is generic and open. The change tracking method can be customized, and any third-party change provider can be used to load the framework with changes. Processing the Changes ODI-EE employs a powerful declarative design approach, Extract-Load, Transform (E-LT), which separates the rules from the implementation details. Its out-of-the-box integration interfaces use and process the tracked changes. Developers define the declarative rules for the captured changes within the integration processes in the ODI-EE Designer graphical user interface without having to code. With the ODI-EE 3

Designer, customers declaratively specify set-based maps between sources and targets, and then the system automatically generates the data flow from the set-based maps. The technical processes required for processing the changes captured are implemented in ODI- EE s Knowledge Modules. Knowledge Modules are scripted modules that contain database and application-specific patterns. The runtime then interprets these modules and optimizes the instructions for targets. Ensuring Data Consistency Changes frequently involve several datastores at one time. For example, when an order is created, updated, or deleted, it involves both the orders table and the order lines table. When processing a new order line, the new order to which this line is related must be taken into account. ODI-EE provides a mode of tracking changes, called Consistent Set Changed Data Capture, for this purpose. This mode allows you to process sets of changes that guarantee data consistency. Conclusion Integrating data and applications throughout the enterprise, and presenting a unified view of them, is a complex proposition. Not only are there broad disparities in data structures and application functionality, but there are also fundamental differences in integration architectures. Some integration needs are data oriented, especially those involving large data volumes. Other integration projects lend themselves to an event-oriented architecture for asynchronous or synchronous integration. Changes tracked by Changed Data Capture constitute data events. The ability to track these events and process them regularly in batches or in real time is key to the success of an eventdriven integration architecture. ODI-EE provides rapid implementation and maintenance for all types of integration projects. 4

Improve Data Integration with Changed Data Capture February 2009 Oracle Corporation World Headquarters 500 Oracle Parkway Redwood Shores, CA 94065 U.S.A. Worldwide Inquiries: Phone: +1.650.506.7000 Fax: +1.650.506.7200 oracle.com Copyright 2009, Oracle and/or its affiliates. All rights reserved. This document is provided for information purposes only and the contents hereof are subject to change without notice. This document is not warranted to be error-free, nor subject to any other warranties or conditions, whether expressed orally or implied in law, including implied warranties and conditions of merchantability or fitness for a particular purpose. We specifically disclaim any liability with respect to this document and no contractual obligations are formed either directly or indirectly by this document. This document may not be reproduced or transmitted in any form or by any means, electronic or mechanical, for any purpose, without our prior written permission. Oracle is a registered trademark of Oracle Corporation and/or its affiliates. Other names may be trademarks of their respective owners. 0109