JBoss Enterprise Data Services Platform in the Enterprise



Similar documents
<Insert Picture Here> Extending Hyperion BI with the Oracle BI Server

SOA REFERENCE ARCHITECTURE: SERVICE TIER

The IBM Cognos Platform

JOURNAL OF OBJECT TECHNOLOGY

Data Integration Checklist

Oracle SOA Suite: The Evaluation from 10g to 11g

MDM and Data Warehousing Complement Each Other

Service Oriented Data Management

ORACLE BUSINESS INTELLIGENCE SUITE ENTERPRISE EDITION PLUS

TRANSFORM BIG DATA INTO ACTIONABLE INFORMATION

ORACLE BUSINESS INTELLIGENCE SUITE ENTERPRISE EDITION PLUS

White paper. Planning for SaaS Integration

IBM Software Group. IBM WebSphere Process Integration Technical Overview

Service Oriented Architecture and the DBA Kathy Komer Aetna Inc. New England DB2 Users Group. Tuesday June 12 1:00-2:15

Knowledgent White Paper Series. Developing an MDM Strategy WHITE PAPER. Key Components for Success

Oracle Warehouse Builder 10g

Accelerate Data Loading for Big Data Analytics Attunity Click-2-Load for HP Vertica

By Makesh Kannaiyan 8/27/2011 1

Efficient Data Access and Data Integration Using Information Objects Mica J. Block

replication solution Using CDC and Data Replication October 19, 2010

Informatica PowerCenter Data Virtualization Edition

JBoss Data Services. Enabling Data as a Service with. Gnanaguru Sattanathan Twitter:@gnanagurus Website: bushorn.com

BUSINESSOBJECTS DATA INTEGRATOR

Salesforce.com and MicroStrategy. A functional overview and recommendation for analysis and application development

BUSINESSOBJECTS DATA INTEGRATOR

What's New in SAS Data Management

Integrating Netezza into your existing IT landscape

August 2014 San Antonio Texas The Power of Embedded Analytics with SAP BusinessObjects

Jitterbit Technical Overview : Microsoft Dynamics CRM

ENTERPRISE EDITION ORACLE DATA SHEET KEY FEATURES AND BENEFITS ORACLE DATA INTEGRATOR

EII - ETL - EAI What, Why, and How!

Chapter 6 Basics of Data Integration. Fundamentals of Business Analytics RN Prasad and Seema Acharya

Oracle SOA Suite Then and Now:

Organisaties groot en klein, beginnen zich meer en meer te realiseren dat inzicht in (real-time) data helpt

Independent process platform

Integrating Ingres in the Information System: An Open Source Approach

Introduction to TIBCO MDM

Oracle Business Activity Monitoring 11g New Features

Data Virtualization for Agile Business Intelligence Systems and Virtual MDM. To View This Presentation as a Video Click Here

A standards-based approach to application integration

ASYST Intelligence South Africa A Decision Inc. Company

Data Virtualization A Potential Antidote for Big Data Growing Pains

SQL Server 2012 Gives You More Advanced Features (Out-Of-The-Box)

Jitterbit Technical Overview : Salesforce

enterprise professional expertise distilled

XpoLog Competitive Comparison Sheet

Gradient An EII Solution From Infosys

What s New with Informatica Data Services & PowerCenter Data Virtualization Edition

<Insert Picture Here> Oracle BI Standard Edition One The Right BI Foundation for the Emerging Enterprise

High-Volume Data Warehousing in Centerprise. Product Datasheet

LEVERAGING ORACLE DRM TO INTEGRATE ORACLE EBS CUSTOMER DATA WITH SALESFORCE CERVELLO WHITEPAPER

Cost Savings THINK ORACLE BI. THINK KPI. THINK ORACLE BI. THINK KPI. THINK ORACLE BI. THINK KPI.

Integrating data in the Information System An Open Source approach

Deploy. Friction-free self-service BI solutions for everyone Scalable analytics on a modern architecture

SAP Data Services 4.X. An Enterprise Information management Solution

Increase Agility and Reduce Costs with a Logical Data Warehouse. February 2014

AV-005: Administering and Implementing a Data Warehouse with SQL Server 2014

Migrating Discoverer to OBIEE Lessons Learned. Presented By Presented By Naren Thota Infosemantics, Inc.

FROM DATA STORE TO DATA SERVICES - DEVELOPING SCALABLE DATA ARCHITECTURE AT SURS. Summary

MicroStrategy Course Catalog

ORACLE DATA INTEGRATOR ENTERPRISE EDITION

whitepaper The Evolutionary Steps to Master Data Management

IBM Tivoli Directory Integrator

The ESB and Microsoft BI

Paper DM10 SAS & Clinical Data Repository Karthikeyan Chidambaram

Business Intelligence and Service Oriented Architectures. An Oracle White Paper May 2007

ORACLE DATA INTEGRATOR ENTEPRISE EDITION FOR BUSINESS INTELLIGENCE

SOA REFERENCE ARCHITECTURE: WEB TIER

EAI vs. ETL: Drawing Boundaries for Data Integration

ORACLE TAX ANALYTICS. The Solution. Oracle Tax Data Model KEY FEATURES

Module 1: Getting Started with Databases and Transact-SQL in SQL Server 2008

Sisense. Product Highlights.

Marathon Information Management Program

A roadmap to enterprise data integration.

Semarchy Convergence for Data Integration The Data Integration Platform for Evolutionary MDM

SAS Enterprise Data Integration Server - A Complete Solution Designed To Meet the Full Spectrum of Enterprise Data Integration Needs

ActiveVOS Server Architecture. March 2009

Writing Queries Using Microsoft SQL Server 2008 Transact-SQL

SOA Planning Guide The Value Enablement Group, LLC. All rights reserved.

<Insert Picture Here> Oracle SQL Developer 3.0: Overview and New Features

Introduction to Oracle Business Intelligence Standard Edition One. Mike Donohue Senior Manager, Product Management Oracle Business Intelligence

AquaLogic Service Bus

The Data Integration Company. Enterprise Data Integration. Maximizing the Business Value of Your Enterprise Data

Automated Data Ingestion. Bernhard Disselhoff Enterprise Sales Engineer

Building Your EDI Modernization Roadmap

Pervasive Software + NetSuite = Seamless Cloud Business Processes

Informatica Data Replication: Maximize Return on Data in Real Time Chai Pydimukkala Principal Product Manager Informatica

Data Virtualization Usage Patterns for Business Intelligence/ Data Warehouse Architectures

Data Warehousing & Business Intelligence

Klarna Tech Talk: Mind the Data! Jeff Pollock InfoSphere Information Integration & Governance

Creating an Enterprise Reporting Bus with SAP BusinessObjects

ORACLE DATA INTEGRATOR ENTERPRISE EDITION

1 Copyright 2011, Oracle and/or its affiliates. All rights reserved.

XpoLog Center Suite Data Sheet

POLAR IT SERVICES. Business Intelligence Project Methodology

IBM Tivoli Storage Manager Version Introduction to Data Protection Solutions IBM

Integration in Action using JBoss Middleware. Ashokraj Natarajan - Cognizant

Enterprise Data Integration

BIM the way we do it. Data Virtualization. How to get your Business Intelligence answers today

Transcription:

JBoss Enterprise Data Services Platform in the Enterprise Mani Subramanyam, Genentech Michael Walker, Red Hat 1

Agenda Introduction About Genentech Challenges Integration Services DSP Selection Criteria EII Architecture at Genentech Use Cases Single view of Employee BI/OI Reporting against Cloud Future Work Q&A 2

About Genentech Leading bio-technology company, founded 1976 Wholly-owned subsidiary of Roche as of March 2009 Develops drugs for significant, unmet medical needs Oncology Immunology Tissue growth and repair Merger with Roche brings 31 new products to portfolio Annual revenue of >$11B 40% annual revenue growth >10,000 employees Fortune 100 Best Companies to Work for for the 9 th consecutive year 3

Challenges Rapid growth - Annual revenue growth @ ~ 40% Large number of business software applications and enterprise data sources No single system of record Roche integration Ad-hoc and point-to-point application integrations were implemented to meet short term requirements W e b - I W e b - I C r y s t a l O L A P - I O L A P - I B E x B O U n i v e r s e B O U n i v e r s e D W S A P R / 3 O r a c l e B r i o S A P B W E T L 4

IT Strategy: Integration Services Cross-functional group that drives integration technology adoption and reuse using: Enterprise Information Integration (EII) Enterprise Service Bus (ESB) Business-to-Business (B2B) Extract-Transform-Load (ETL) Enterprise Data Mashup (EDM) SMEs from each field drive decision-making Leverage existing technology investments and past successes 5

Enterprise Information Integration via MetaMatrix Data Services Platform Provides a single point of access to combine and query data from one or more data sources in real time. Speeds application development by simplifying access to distributed data. Transforms data structure and semantics through data virtualization. Consolidates data into a single view without the need for more data. Centralized access control, auditing through robust security infrastructure. Creates services that provision data to business process in an SOA. Enterprise-proven flexible, scalable, high-performance. 6

Business Benefits of EII Analytical Reporting across multiple data sources Integrated view from heterogeneous sources Combine and use data in various formats XML and relational Application and relational Expose to users as relational, or web services Flat Files Application Retirement Insulate Commercial reports from location of data in Cloud Insulate applications from location of Employee data in Locator or LDAP Reduced development and maintenance of mapping rules 7

Evaluation Criteria Architectural Fit Can the vendor retrieve data from majority of our sources Data Security Interface with LDAP for authentication and role management Supports data entitlements authorization at table/column/row granularity Maintainability Repository versioning and concurrent access control Ease of use Ability to provide data dictionary like descriptions to end users Performance Minimal overhead over direct access to data sources Tuning and administrative capability Data caching 8

ICC Technology Decision Matrix ESB ETL B2B Real-time integration using message bus Publish/Subscribe messaging model Batch integration with small data packets Used for data movement in more than pub/sub Operational data store extraction to analytics repository is more than for operational data stores External partner integration EII Real-time data access Virtual data aggregation from multiple systems Synchronous request/response Access to external sources SQL or Web Services client interface EDM Web Automation: Data extractions from websites to optimize B2B relationships Index and Search: powerful crawling, indexing and search of unstructured contents Unstructured content data source (documents, email ) 9

Data Services Platform: Where It Fits 10

How It Works Create logical views of physical data sources using GUI Application-specific services are defined, and mapped to logical view No programming required 11

Enterprise Designer: How It Works Defined by Models, Not Code Transformations Select Join/Aggregate Filter Functions Text/String Numeric Decode User-Defined 12

From Design Time to Runtime 13

MetaMatrix DSP Physical Architecture 6 MMX Environments SND, DEV, SQA, UAT, TRN, PRD 2 x 2-CPU Virtual Machines in Cluster Failover and High Availability Disaster Recovery Uniform Security Infrastructure LDAP Authentication Data Roles Proactive/Reactive Monitoring Capacity and Regression testing 14

MetaMatrix DSP Use Cases at Genentech 15

MetaMatrix DSP Use Cases at Genentech Master Data Management 360 View of Employee (GI) 360 View of Customers Reporting and Analytics Operational Intelligence Reporting (Commercial) Business Intelligence Reporting (Commercial) Security Integration Secure Access Layer to Data Warehouses (Commercial) 16

MetaMatrix DSP Data Sources at Genentech Relational Databases Data Warehouses Operational Data Stores Oracle, SQLServer, PeopleSoft, FileMaker Pro LAN and WAN LDAP Directory Server Salesforce.com Cloud Platform OSISoft PI System Historian DB Flat Files 17

MetaMatrix DSP Clients at Genentech 18 Web Applications MetaMatrix Hibernate Dialect Perl/CGI scripts ESB iphone OI/BI Reporting Business Objects Crystal Reports Analytics Tools Spotfire BRIO

Use Case: gpeople 19 Oracle database containing outdated employee information at Genentech Enterprise LDAP is the actual system of record for employee information Real-time data access requirement Many applications use the outdated Locator database (total # unknown) Audit requirement Additional data source requirements: Cost Center Hierarchy (SAP) Manager Hierarchy External Employee LDAP branch LDAP Change History

gpeople Project Goals Shut down legacy DB Shift many applications to Enterprise LDAP as source for employee information Build a reusable, SQL-based interface to LDAPbased employee data Combine LDAP data with additional sources Define reusable Data Access policies Track history and support for delta load for clients 20

gpeople Data Models 21

gpeople Physical Architecture 22

gpeople in Production 25+ applications using gpeople VDB Extensive re-use Enhanced as downstream demands increase Multiple virtual views Public view for more commonly used elements Private view is for HR restricted access elements More granular security for Private views Application specific views Support real-time and cached access Cache Views to reduce the load on LDAP server Monitored through Enterprise Foglight monitoring tool Referenced in other VDBs 23

gpeople: Lessons Learned Best use case for providing SQL interface to non-sql data Largest number of clients compared to any other VDBs Tracking History of changes Should have modeled all attributes at once, instead of per project Better approach on defining data access roles upfront Limitation of using Cluster on some integration technology specifically ESB 24

gpeople: Future Work Multi-valued attribute support to extract Consolidate security model Publish to JMS Queue Roche Integration Multi-source binding Active Directory data source Caching LDAP Groups Data Cleansing of gpeople data gpeople History enhancements 25

Use Case: BI/OI Reporting Against the Cloud Salesforce Cloud Platform migration Host Enterprise Patient, Case, Sales data Move off legacy systems, data warehouses Reduce cost, maintenance Business Units track various metrics, business processes 200+ reports Metrics, Brand Scorecard, Operational information Complex aggregations and process tracking Ad-hoc analytics tools 26

Cloud Reporting: Challenges SFDC reporting does not meet business need Limited support for aggregation, joins, pre-calculated data Basic reporting and formatting only Information must be organized, secured, and filtered using private data in real-time Not available in the cloud: Role Management (authorization) Territory alignment data (filtering) LDAP (authentication) Metrics must be available immediately at system/product launch Data warehouse/ods initial overhead too high Information must be encrypted 27 HIPAA-compliance

BI/OI Reporting: Challenges Need to securely integrate public cloud data with multiple private systems Uniform, reusable SQL interface needed Need blend of real-time and pre-computed data 28

BI/OI Reporting Physical Architecture Provide uniform, reusable interface to heterogeneous data sources Mask underlying interfaces to source systems Combine real-time access with cached aggregate data for reporting 29

Salesforce Connector Creates Relational Models Imports SFDC object metadata Tabular representation Translates SQL to SOQL Masks underlying XML/Web Services interface Supports DML Insert, Update, Delete Supports SFDC-specific functions in SQL statements Includes, excludes Supports custom objects 30

Salesforce Connector M e t a M a t r i x S e r v e r R S Q L e s u l t s F e d e r a t e d Q u e r y E n g i n e S Q L s u p p o r t e d R e s u l t s S O Q L C S F D C o n n e c t o r S O Q L R e s u l t s S O Q L S W e b e r v i c e App issues SQL: SELECT SUM(Reports) FROM Sales where Division = customer support ; Query Engine translates SQL into SQL Supported, based on SOQL-supported capabilities. SELECT SUM(Reports) FROM Supervisor where Division = customer support ; Salesforce Connector manages SOQL request/response, returns results Query Engine applies additional SQL to form complete result (SUM) 31

BI/OI Reporting Data Models 32

Cloud Reporting: Lessons Learned and Future Work Incremental Refresh option for SFDC data Notifications, Outbound Messaging Via ESB integration GetDeleted(), getupdated() API calls Not recommended Managing schema changes in Salesforce Release procedure Delta detection and impact analysis 33

MetaMatrix DSP at Genentech: Future Work MetaMatrix DSP Upgrade to version 6.0 RSS Aggregation Publish to JMS Queue Email Connector google Connector Delta Detection for Materialization Process Integration with Active Directory (Roche) Integrate with ODS Infrastructure upgrade to support global access 34

35

Tips and Tricks Multi-Source Binding Materialization table name ending with ST e.g. gppeople_hist Text File Descriptors accessing NAS Share Host controller to run as windows domain account Not local account Trailing Space in the where clause filter value e.g. city = 'South San Francisco ' Query Plan and Access for UNION query Correlated sub-queries 36