Bridging the Big Data Divide with Oracle Data Integration. Milomir Vojvodic, Business Development Manager, EMEA DIS



Similar documents
Introducing Oracle Data Integrator and Oracle GoldenGate Marco Ragogna

Data Integration Overview

<Insert Picture Here> Oracle Data Integration 11g Overview Tim Sawyer

1 Copyright 2011, Oracle and/or its affiliates. All rights reserved. Insert Information Protection Policy Classification from Slide 7

Oracle Data Integrator Technical Overview. An Oracle White Paper Updated December 2006

Data Integration Checklist

An Oracle White Paper March Best Practices for Real-Time Data Warehousing

Service Oriented Data Management

ORACLE DATA INTEGRATOR ENTERPRISE EDITION

Oracle Data Integration: CON7920 Making the Move to Oracle Data Integrator

OWB Users, Enter The New ODI World

Architecting for Big Data Analytics and Beyond: A New Framework for Business Intelligence and Data Warehousing

Introducing Oracle Exalytics In-Memory Machine

Oracle Data Integration Solutions

Oracle s Big Data solutions. Roger Wullschleger. <Insert Picture Here>

SAS Enterprise Data Integration Server - A Complete Solution Designed To Meet the Full Spectrum of Enterprise Data Integration Needs

Introduction to Oracle Business Intelligence Standard Edition One. Mike Donohue Senior Manager, Product Management Oracle Business Intelligence

An Integrated Analytics & Big Data Infrastructure September 21, 2012 Robert Stackowiak, Vice President Data Systems Architecture Oracle Enterprise

<Insert Picture Here> Real-Time Data Integration for BI and Data Warehousing

News and trends in Data Warehouse Automation, Big Data and BI. Johan Hendrickx & Dirk Vermeiren

ENTERPRISE EDITION ORACLE DATA SHEET KEY FEATURES AND BENEFITS ORACLE DATA INTEGRATOR

Architecting for the Internet of Things & Big Data

Oracle Big Data Essentials

ORACLE DATA INTEGRATOR ENTERPRISE EDITION

Oracle Data Integration: CON7926 Oracle Data Integration: A Crucial Ingredient for Cloud Integration

End to End Solution to Accelerate Data Warehouse Optimization. Franco Flore Alliance Sales Director - APJ

ORACLE BUSINESS INTELLIGENCE SUITE ENTERPRISE EDITION PLUS

Oracle Corporation

ORACLE BUSINESS INTELLIGENCE SUITE ENTERPRISE EDITION PLUS

Oracle Data Integration Real Time Access to Real Time Information

<Insert Picture Here> Real-time database replication

Copyright 2013, Oracle and/or its affiliates. All rights reserved.

<Insert Picture Here> Operational Reporting for Oracle Applications with Oracle GoldenGate

Big Data Are You Ready? Jorge Plascencia Solution Architect Manager

<Insert Picture Here> Extending Hyperion BI with the Oracle BI Server

Oracle Big Data SQL Technical Update

Big Data Big Data/Data Analytics & Software Development

What does SAS Data Management do? Why is SAS Data Management important? For whom is SAS Data Management designed? Key Benefits

<Insert Picture Here> Oracle BI Standard Edition One The Right BI Foundation for the Emerging Enterprise

Data Virtualization for Agile Business Intelligence Systems and Virtual MDM. To View This Presentation as a Video Click Here

An Integrated Big Data & Analytics Infrastructure June 14, 2012 Robert Stackowiak, VP Oracle ESG Data Systems Architecture

Real-time Data Replication

<Insert Picture Here> Oracle GoldenGate and Oracle Data Integrator

Oracle Data Integrator 12c (ODI12c) - Powering Big Data and Real-Time Business Analytics. An Oracle White Paper October 2013

Extraction Transformation Loading ETL Get data out of sources and load into the DW

Exploring the Synergistic Relationships Between BPC, BW and HANA

Big Data, Cloud Computing, Spatial Databases Steven Hagan Vice President Server Technologies

Implementing a SQL Data Warehouse 2016

How To Use An Orgo

Oracle BI Applications (BI Apps) is a prebuilt business intelligence solution.

ORACLE DATA INTEGRATOR ENTEPRISE EDITION FOR BUSINESS INTELLIGENCE

Oracle Database - Engineered for Innovation. Sedat Zencirci Teknoloji Satış Danışmanlığı Direktörü Türkiye ve Orta Asya

Luncheon Webinar Series May 13, 2013

Data Modeling for Big Data

Executive Summary... 2 Introduction Defining Big Data The Importance of Big Data... 4 Building a Big Data Platform...

Decoding the Big Data Deluge a Virtual Approach. Dan Luongo, Global Lead, Field Solution Engineering Data Virtualization Business Unit, Cisco

IT FUSION CONFERENCE. Build a Better Foundation for Business

Big Data and Advanced Analytics Applications and Capabilities Steven Hagan, Vice President, Server Technologies

Getting Value from Big Data with Analytics

Integrating Netezza into your existing IT landscape

Informatica Data Replication: Maximize Return on Data in Real Time Chai Pydimukkala Principal Product Manager Informatica

Testing 3Vs (Volume, Variety and Velocity) of Big Data

COURSE 20463C: IMPLEMENTING A DATA WAREHOUSE WITH MICROSOFT SQL SERVER

Implementing a Data Warehouse with Microsoft SQL Server

Il mondo dei DB Cambia : Tecnologie e opportunita`

Big Data Analytics Nokia

Endeca Introduction to Big Data Analytics

Oracle Database 12c Plug In. Switch On. Get SMART.

More Data in Less Time

Getting it Right: How to Find the Right BI Package for the Right Situation Norma Waugh. RMOUG Training Days February 15-17, 2011

FIFTH EDITION. Oracle Essentials. Rick Greenwald, Robert Stackowiak, and. Jonathan Stern O'REILLY" Tokyo. Koln Sebastopol. Cambridge Farnham.

Oracle Business Activity Monitoring 11g New Features

How I Transitioned from an E-Business Suite Development to an Oracle Business Intelligence Developer

MDM and Data Warehousing Complement Each Other

Oracle Big Data Strategy Simplified Infrastrcuture

QLIKVIEW DEPLOYMENT FOR BIG DATA ANALYTICS AT KING.COM

Oracle Data Integrator for Big Data. Alex Kotopoulis Senior Principal Product Manager

Oracle BI Application: Demonstrating the Functionality & Ease of use. Geoffrey Francis Naailah Gora

Well packaged sets of preinstalled, integrated, and optimized software on select hardware in the form of engineered systems and appliances

Using OBIEE for Location-Aware Predictive Analytics

Course Outline: Course: Implementing a Data Warehouse with Microsoft SQL Server 2012 Learning Method: Instructor-led Classroom Learning

Master Data Management and Universal Customer Master Overview

Oracle Exalytics Briefing

Implementing a Data Warehouse with Microsoft SQL Server

Business Intelligence for Big Data

SAS BI Course Content; Introduction to DWH / BI Concepts

Beyond High Availability Replication s Changing Role

BIG DATA CAN DRIVE THE BUSINESS AND IT TO EVOLVE AND ADAPT RALPH KIMBALL BUSSUM 2014

GoldenGate and ODI - A Perfect Match for Real-Time Data Warehousing

Ganzheitliches Datenmanagement

Implement a Data Warehouse with Microsoft SQL Server 20463C; 5 days

An Oracle White Paper October Oracle Data Integrator 12c New Features Overview

XpoLog Competitive Comparison Sheet

Automated Data Ingestion. Bernhard Disselhoff Enterprise Sales Engineer

Decision Ready Data: Power Your Analytics with Great Data. Murthy Mathiprakasam

Transcription:

Bridging the Big Data Divide with Oracle Data Integration Milomir Vojvodic, Business Development Manager, EMEA DIS

Diverse Data Sets Information Architectures Today: Decisions based on transactional data transactions, applications, structured Data Information Architectures Today: Decisions based on all your data Video and Images Documents Social Data Machine-Generated Data

Architecture Oracle Data Principles Integration Solutions and for Big Best Data Practices

Unstructured Semistructured Management Security, Governance Structured Integrated Architecture Capture Store/Process Integrate Organize Analyze Govern Master & Ref Data Transaction Data Machine Generated Social Media Text, Image Video, Audio DBMS (OLTP) Hadoop Cluster w MapReduce Key-Value Data Store DB Replication ETL/ELT CDC Real-Time Message- Based ODS Data Warehouse Data Marts Streaming (CEP Engine) Reporting & Dashboards Alerting EPM BI Applications Text Analytics and Search In-Database Analytics Advanced Analytics Visual Discovery

Integrate Big Data with DW and Transactional Data Stores Oracle Big Data Appliance Oracle Exadata Oracle Exalytics Stream Acquire Organize Analyze & Visualize Load from big data processing into your data warehouse for further analysis Access your customer information while you process through your big data in order to look for patterns

Oracle Data Integration Solutions Legacy Sources Application Sources Relational and Non-Relational Oracle Enterprise Data Quality Oracle Data Integrator Oracle GoldenGate Complete and best-of-breed approach to address enterprise integration Maximum performance with lower cost of ownership, ease of use, and reliability. Certified for leading technologies to deliver fast time to value Oracle customers report: 80% lower TCO Five times higher performance 70% reduction in development costs

Architecture DB Replica and Principles CDC within and Data Best Integration Practices Layer

What is Oracle GoldenGate? OGG Source DB Target DB

What is Oracle GoldenGate? First OGG Differentiator Accessing directly transaction logs OGG Source DB Target DB Second OGG Differentiator Moving only committed transactions

Oracle DIS Use Cases - OGG Migrations&Consolidations OGG Zero Downtime Migrations & Upgrades New DB/HW/OS/APP OGG Active/Active DB Deployment Fully Active Distributed DB OGG ADG Disaster Recovery Reporting Database Reporting Database and/or DR database OGG DW Synchronization Data Warehouse

Millions OGG is Log Based Replica OR TIME REQUIRED FOR THE END OF DAY PROCEDURE Daily load time can Hours reach 5 days with the current HW 150 100 50 0 Year1 Year2 Year3 Year4 Year5 Currently during the End Of Day utilizes the Server CPU by 40-50% and the IO by 90%. Probably the IO is the bottleneck. NO OF CPUs REQUIRED FOR SAME PERFORMANCE* No Of Required CPUs 120 100 80 60 40 20 0 Required No. CPUs can be Disaster Recovery doubled Test and Development Primary Site Year1 Year2 Year3 Year4 Year5 ESTIMATED COSTS FOR SERVER AND LICENSE** Estimated Cost of Purchase in USD $3 $2 $2 $1 $1 $- Oracle License Costs Costs can be doubled Year1 Year2 Year3 Year4 Year5

OGG Moves Only Committed Transactions OR Begin, TX 1 Insert, TX 1 Begin, TX 2 Update, TX 1 Begin, TX 2 Insert, TX 2 Pump Checkpoint Begin, TX 2 Insert, TX 2 Delivery Checkpoint Insert, TX 2 Commit, TX 2 Begin, TX 3 Capture Checkpoint Commit, TX 2 Begin, TX 3 Insert, TX 3 Commit, TX 2 Insert, TX 3 Commit, TX 3 Begin, TX 4 Commit, TX 3 Delete, TX 4

Architecture ETL and Data Principles Quality within and Data Best Integration Practices Layer

ODI is centralizing all ETL Development Custom Reporting Data Silos Data Migration Batch Scripts Analytics Data Marts Data Replication Custom Packaged Applications Data Hubs Business Intelligence Data Warehousing SQL Data Federation Java Enterprise Performance Data Access OLTP & ODS Systems Data Warehouse, Data Mart Oracle PeopleSoft, Siebel, SAP Custom Apps Files Excel XML OLAP

ODI is centralizing all ETL Development Custom Reporting Analytics Packaged Applications Business Intelligence Enterprise Performance Oracle Data Integrator OLTP & ODS Systems Data Warehouse, Data Mart Oracle PeopleSoft, Siebel, SAP Custom Apps Files Excel XML OLAP

Why is ODI different? Second ODI Differentiator ODI Declarative Design and ODI Knowledge Modules for reusing already written down level SQL code ODI E-LT Staging Server First ODI Differentiator Transformations using the power of the Target Database no staging server OGG ODI ODI Knowledge Modules Reverse Engineer Metadata Reverse CDC Sources SAP/R3 Siebel Journalize Read from CDC Source Log Miner Journal ize Sample out-of-the-box Knowledge Modules Benefits DB2 Journals SQL Server Triggers DB2 Exp/Imp Load From Sources to Staging Load Oracle DBLink JMS Queues Oracle SQL*Load er Check Constraints before Load Staging Tables Check Sybase Check Check MS Excel Type II SCD Integrate Transform and Move to Targets Service Expose Data and Transformati W W on Services S S W S Integrate Services Target Tables Error Tables TPump/ Multiload Oracle Merge Siebel EIM Schema Oracle Web Services DB2 Web Services ODI Declarative Design ODI Declarative Design 1 2 Define Automatically What Generate You Want Dataflow Define How : Built - in Templates Data Warehouse

Oracle DIS Use Cases ODI and EDQ Migrations&Consolidations EDQ OGG ODI Zero Downtime Migrations & Upgrades New DB/HW/OS/APP OGG Active/Active High Availability Fully Active Distributed DB OGG ADG Query Off-Loading and Disaster Recovery Reporting Database and/or DR database OGG ODI BI&DW Synchronization and Loading EDQ Data Warehouse

Why Do We Need Data Quality? Inconsistent formats Abbreviations (often ambiguous) Attributes non-standard, missing or invalid Customer ID Customer Name Address 1 Address 2 City State Zip Country Birth Date Gender AD23298 Mr Peter Mayhew 9407 Main St Fairfax VA 22031-4001 USA 02/23/61 M VS38611 Dr Ellen Van Der Heijde 144 E Grove St Kingston PA 18704 US 07/12/57 DC18223 Jalila Abdul-Alim (Do Not Call) 4548 Pennsylvania Ave Apt 205 Kansas City MO 64111-3349 USA 02/23/63 F CO9387A Tayside Computers Inc. 4912 E 41st N Idaho Falls ID 83401 USA 31/03/2007 N/A TZ35019 Mr Zachary P Jahn 98-1731 Ipuala Loop Aiea Hawaii 96701 1710 United States 06/12/86 Male CB27843 Mrs Edith Y Baba Junior Baba Real Est. Corp. 209 Stony Point Trl Webster NY USA 11/17/1971 M OX80306 Andrew & Mary Baxter 14 Oxbridge Way Milfrod NH 03055-4614 US 05/28/67 F JP70210 Mr RJ & Mrs FB MacDonald 57 Hadleigh Close Westlea Swindon SN5 9BZ MA - USA - Y RD48107 Mr Andy Baxter 14 Oxbridge Wy Milford NH 3056 USA 01/01/01 M Widespread duplication (often hard to spot) Compound Names Embedded Additional Information Mixed Business & Personal Names Multiple Names Mis-Fielded Data Erroneous Data International Date Formats Default or Dummy Data 19 2011 Oracle Corporation

Why Do We Need Data Quality? 10hp motor 115V Yoke mount MOT-10,115V, 48YZ,YOKE mtr, ac(115) 10 horsepower 115volts This 10hp yoke mounted motor is rated for 115V with a 5 year warranty 10 Caballos, Motor, 115 Voltios TEAO HP = 10.0 1725RPM 115V 48YZ YOKE MTR Motor, TEAO, 1725 RPM, 48YZ, 15 Voltios, Montaje de Yugo, hp = 10 Item Motor Classification 26101600 Power 10 horsepower Voltage 115 Mounting Yoke Product data is much more variable and unpredictable than other data types 20

Oracle Enterprise Data Quality Profile, Audit, Transform, Parse, Cleanse, Standardize, Match within One Unified Solution

EDQ Address Verification 300 Berry #1210 SF California Parse Validate Latitude 37.775837 Longitude -122.39557 PremiseNumber ThoroughfareName SubPremise Locality 300 Berry #1210 SF 300 Berry St Unit 1210 San Francisco Step 1 Extract pieces of the address Step 2 Check the pieces against the information in the Global Knowledge Repository to complete and find the correct abbreviations AdministrativeArea California CA Step 3 Change character set transliterate - if necessary PostCode 94158-1670 Step 4 Find Location 2012 Oracle Corporation Proprietary and Confidential 22

Architecture Oracle Data Principles Integrator and for Big Best Data Practices 2012 Oracle Corporation Proprietary and Confidential 23

ODI for Big Data Heterogeneous Integration to Hadoop Environments Transforms Via MapReduce Supports Hadoop standards Easy to configure UI for generating MapReduce Oracle Data Integrator Loads

ODI for Big Data to Oracle Optimized Integration to Oracle Exadata Oracle Big Data Connectors Transforms Via MapReduce Oracle Data Integrator Activates Oracle Loader for Hadoop Loads Hadoop Cluster Oracle Database, Oracle Exadata Oracle Big Data Appliance

Oracle Data Integrator for Big Data Putting Together the Unique Advantages Simplifies creation of Hadoop and MapReduce code to boost productivity Integrates big data heterogeneously via industry standards: Hadoop, MapReduce, Hive, NoSQL, HDFS Unifies integration tooling across unstructured/semi-structured and structured data Optimizes loading of big data to Oracle Exadata using Oracle Big Data Connectors Engineered for running on and integrating with Oracle Big Data Appliance via Big Data Connectors