Integrating Business and Technical Metadata to Support Better Data Quality

Similar documents
Disparate Data, Disparate Systems, Disparate User Groups (How to Architect The Enterprise Business Needs) Robert Schork, General Dynamics IT

Practical meta data solutions for the large data warehouse

Business Intelligence for the Chief Data Officer

Essential Elements of a Master Data Management Architecture

Creating a Corporate Integrated Data Environment through Stewardship

What Business and Process Analysts Need to Know About BPM Suites

The people factor in Data Management

Quest for a Business Rules Management Environment (BRME) in the Internal Revenue Service

IPL Service Definition - Master Data Management Service

Data Governance Maturity Model Guiding Questions for each Component-Dimension

How to Run a Successful Big Data POC in 6 Weeks

US Department of Education Federal Student Aid Integration Leadership Support Contractor January 25, 2007

Service Oriented Architecture and the DBA Kathy Komer Aetna Inc. New England DB2 Users Group. Tuesday June 12 1:00-2:15

Request for Information Page 1 of 9 Data Management Applications & Services

THOMAS R FINNERAN PRINCIPAL CONSULTANT - IDENNEDY PROJECT

SNOW SOFTWARE. Fredrik Spolén Country Manager Sales Director. Norway Denmark Finland

Challenges in Metadata Integration: BMO Financial Group Case Study

Knowledgent White Paper Series. Developing an MDM Strategy WHITE PAPER. Key Components for Success

Salesforce Certified Data Architecture and Management Designer. Study Guide. Summer 16 TRAINING & CERTIFICATION

Data Governance Primer. A PPDM Workshop. March 2015

Turning Data into Knowledge: Creating and Implementing a Meta Data Strategy

Introduction to Glossary Business

MicroStrategy Course Catalog

Explore the Possibilities

Building Your EDI Modernization Roadmap

CA Repository for z/os r7.2

Business Data Authority: A data organization for strategic advantage

Metadata Application Understanding Software Migration

Information Governance for Financial Institutions

Data Governance: The Cornerstone of MDM

Cisco Unified Security Metrics: Measuring Your Organization s Security Health

Enabling Better Business Intelligence and Information Architecture With SAP Sybase PowerDesigner Software

Certified Information Professional 2016 Update Outline

Independent Insight for Service Oriented Practice. An SOA Roadmap. John C. Butler Chief Architect. A CBDI Partner Company.

James Serra Data Warehouse/BI/MDM Architect JamesSerra.com

Avis Fleet Services United Road Towing Arizona

Information Management & Data Governance

Master Your Data and Your Business Using Informatica MDM. Ravi Shankar Sr. Director, MDM Product Marketing

Creating the Golden Record

21st Century Tax Systems: COTS or Service Oriented Architectures. August 3, 2009

Proven BPM Implementation Best Practices. William Brown Doug Sheeran

Enabling Data Quality

MDM and Data Warehousing Complement Each Other

Metadata Repositories in Health Care. Discussion Paper

CA Repository for Distributed. Systems r2.3. Benefits. Overview. The CA Advantage

Cloud Computing for Architects

Master Data Governance & SAP Information Steward Integration. Jens Sauer, SAP Switzerland September 11 th, 2013

A Holistic Framework for Enterprise Data Management DAMA NCR

An RCG White Paper The Data Governance Maturity Model

Bringing agility to Business Intelligence Metadata as key to Agile Data Warehousing. 1 P a g e.

Advanced Analytic Dashboards at Lands End. Brenda Olson and John Kruk April 2004

JOURNAL OF OBJECT TECHNOLOGY

Company Overview. Financial Snapshot. Leading Global ERP Provider Global Midmarket Divisions / Subsidiaries of Global 1000

Reliable Business Data Implementing A Successful Data Governance Strategy with Enterprise Modeling Standards

ETPL Extract, Transform, Predict and Load

Setting up an Effective Enterprise Architecture capability. Simon Townson Principal Enterprise Architect SAP

Using BPM to Implement Services in Telecom and Media. Practical Advice and Insights from the Real World

Case Study EPA. Agency-Wide Governance of Reusable Components

Enterprise Data Management

Technical Layer (Technical Interoperability) Information Layer (Information Interoperability. Business Layer (Business Process Interoperability)

A Design Technique: Data Integration Modeling

Master Data Management

Business Intelligence (BI) Data Store Project Discussion / Draft Outline for Requirements Document

Enterprise Information Management Capability Maturity Survey for Higher Education Institutions

Enabling Better Business Intelligence and Information Architecture With SAP PowerDesigner Software

ManageEngine ServiceDesk Plus - MSP Training Agenda

SAP BusinessObjects Information Steward

October 8, User Conference. Ronald Layne Manager, Data Quality and Data Governance

Paisley Enterprise GRC Audit Profile. Linda Bergs

Best Practices in Enterprise Data Governance

AV-20 Best Practices for Effective Document and Knowledge Management

Structure of the presentation

Logical Modeling for an Enterprise MDM Initiative

SOA REFERENCE ARCHITECTURE: WEB TIER

White Paper What Solutions Architects Should Know About The TOGAF ADM

B.Sc (Computer Science) Database Management Systems UNIT-V

AN OVERVIEW OF THE SALLIE MAE DATA GOVERNANCE PROGRAM

Before getting started, we need to make sure we. Business Intelligence Project Management 101: Managing BI Projects Within the PMI Process Group

Ganzheitliches Datenmanagement

EMA CMDB Assessment Service

A Process for Evaluating and Selecting a Development Environment. Jim Odrowski ComponentWave, Inc. jodrowski@componentwave.com

MDM AS A METHODOLOGY

Vermont Enterprise Architecture Framework (VEAF) Master Data Management Design

Semantic Integration in Enterprise Information Management

The Role of the BI Competency Center in Maximizing Organizational Performance

Data Integration Alternatives & Best Practices

Luncheon Webinar Series July 29, 2010

Repository-Centric Enterprise Architecture

Best Practices for Managing and Monitoring SAS Data Management Solutions. Gregory S. Nelson

Data Warehouse Overview. Srini Rengarajan

AVS SYSTEMS, INC

THOMAS RAVN PRACTICE DIRECTOR An Effective Approach to Master Data Management. March 4 th 2010, Reykjavik

Process Remixes - Mixing Legacy with Process Orchestration

Buying vs. Building Business Analytics. A decision resource for technology and product teams

Business Intelligence Project Management 101

Service Oriented Architecture (SOA) Architecture, Governance, Standards and Technologies

White Paper. Contract Document Management with SharePoint. Conceive. Build. Succeed.

Gradient An EII Solution From Infosys

SOLUTION BRIEF CA ERwin Modeling. How can I understand, manage and govern complex data assets and improve business agility?

Data Governance Data & Metadata Standards. Antonio Amorin

Transcription:

Integrating Business and Technical Metadata to Support Better Data Quality ABSTRACT How can integrating Business and Technical Metadata help in your efforts to gain better quality of your Enterprise wide information? How can it help to consolidate the multiple silos of information that your Company has and continues to produce? How can you link all of that information to a Data Steward or Business Owner who has the answers? At what Level (Enterprise or LOB) do those Business metadata elements get associated to the Technical Metadata? And how about Business Terms or Business Data Assets; does one description fit your enterprise? How about information that is outside of the ETL world; how will you integrate that metadata. Now, once these issues have been resolved, who is going to load/update these elements on an ongoing basis? And don t forget the ever growing demands for compliance and business decision support system information. See how Citigroup's Center of Technical Excellence (CTE) group within the Enterprise-wide Citi Architecture and Technology Engineering (CATE) organization, used an extended ETL Metadata Repository to create an Enterprise-wide Metadata Repository. Complete with Business terms, Grouping, Data Models and Databases, Data Governance (Business Ownership and Data Stewardship) workflow process and approvals, as well as Source to Target traceability of Metadata Elements, and auditing aspects for compliance issues. Although this presentation is made for the CIO, Enterprise Architect level, Bob s technical presentation style also appeals to the Data professional level and everyone will come out of the presentation with something useful. BIOGRAPHY Robert Schork Citi Architecture and Technology Engineering (CATE) Center for Technical Excellence (CTE) group Citi Bob is a team leader with the Citi Architecture and Technology Engineering (CATE) Center for Technical Excellence (CTE) group at Citi. Bob has over 25 years of IT experience and over 15 yrs of metadata management experience. He has installed both Platinum (CA) and Rochade (ASG) metadata repositories and has modeled Enterprise Data Warehouses and other enterprise architectures. He has also installed several packages which include tool evaluations, data classification, data mapping and API accesses from upstream and to downstream systems. Bob has also created several metamodels as well as developed the scanners for loading and extraction processes for reporting. Bob has performed metadata analysis of several Fortune 500 corporations. He has a background as a BA, Developer, DBA, Data Analyst as well as a Data Administrator. He has implemented several Metadata solutions, including the creation of Standards and Procedures to help the enterprise processes function properly. He currently works in the group that is responsible for 151

setting enterprise-wide architectural, data, and metadata standards for Citi (formally called Citigroup). Bob is also a board member of the Metadata Professional Organization (MPO) and DAMA-NJ and has presented at several Data Management conferences and FIMA events. 152

Integrating Business Metadata into Technical Metadata How Citi Extended an ETL Repository Enterprise-wide to support Information Stewardship and Data Quality MIT IQIS Conference2009 Who am I? Over 25 yrs of IT experience (including consulting) Over 15 yrs of Metadata experience including Metadata Analysis and Design for Fortune 100 corporations Implemented ASG Rochade and CA Platinum Repositories including the maintenance and reporting processes Created several Homegrown Metadata Repositories, Scanners and Governance processes. Current Board member of the DAMA NJ Chapter, and the Metadata Professional Organization (MPO) Former member of the IBM Data Governance Council and MeetTheBoss Presenter at several Metadata and FIMA conferences Former BA, Developer, DBA, Data Modeler, Data Architect 2 153

Current Assignment TEKsystems is the leading technical staffing and services company in North America. With 26 years of experience, we know the industry and the importance of deeply understanding each customer s business situation. We also know the right team is essential for a project to succeed. Our proven staffing and project delivery processes enable us to support your critical engagements with the best experts in the market those who are thoroughly qualified to achieve the results you need. 3 What is Metadata? Definition: Business Metadata Data about Data (Useless) Metadata is the definition and usage of the information about the data of your organization. Technical Metadata Operational Metadata 3 Types Business Technical Operational Few really understand it, but it is not Pie in the Sky Can not do Data Integration or true Data Quality without it. 4 154

AGENDA Citi s Dilemma and Possible Information Stewardship Solutions Information Stewardship Getting Business Buy-In Extensions and Integration points Metadata Governance Future Directions 5 Information Stewardship Definition: The willingness to be accountable for a set of business information (Data Assets) for the well-being of the larger organization by operating in service, rather than in control of those around us. Larry English Why establish this first: Does control of other groups generally work? Can you define who is accountable for your corporations Data? Does the information exist? If so, where is it? If you find the information, what does it mean? How does what you are doing, benefit their organization? Can the information be integrated? Can this support Data Profiling? 6 155

Dilemma at Citi Citi had some decisions to make Goal was to capture Business Metadata and associate that with the existing Technical and Operational Metadata. Need to incorporate that throughout the Enterprise that has several distinct Business Sectors or LOB. Should we build, buy, or use an existing system? A Metadata Repository was tried before and failed. Why? Could the Ab Initio ETL metadata repository be extended? How can we capture and incorporate Data Governance (Business Ownership and Data Stewardship) into the overall strategy. How can we sell this to the Business Customers Can the existing resources (people and tools) be utilized? 7 First Step Decisions Find a vehicle to capture Business Metadata in various forms. Define a process to associate that Business Metadata with the existing ETL Metadata elements. Architect distinct Enterprise and Business Sectors separations of Business Term Metadata. Implement a POC to determine if the ETL Repository was extensible, easily loadable, and reportable. Determine if the existing Business Process Manager workflow engine could communicate and support Business Ownership and Data Stewardship approvals. Unexpected Twist. Develop loading methods for each different metadata type. Sell it to a skeptical Sector base who has been burned before. 8 156

Metadata Analysis Train those who do not know metadata about metadata Define the scope of the Metadata elements What type of metadata to capture What vehicle to capture that metadata Test the scope The Business Users DO NOT define the repository scope Group the like metadata elements Test it for validity Create the API accesses (Loading and Reporting) Create the metadata Governance for each element Update the SDLC Auditing functions need to be established for compliance 9 Citi Data Stores 10 157

The Problem with ETL Repositories They are not a full Enterprise Repository, but they have advantages. Their enterprise view is limited to the metadata elements within their knowledge. Good meta modeling skills are needed for extension and integration. Metadata outside the ETL tool is not generally accounted for. You must create the scanning API accesses to load and integrate the outside metadata elements into the matching ETL Repository elements. Most are very weak at Business Metadata capturing and usually have an enterprise view of those terms. Problem with understanding and implementing the various complexities of Business Metadata to Technical and Operational metadata. In order to report on your extended metadata elements you may need to extend the reporting GUI that already exists. 11 AGENDA Information Stewardship Citi s Dilemma and Possible Solutions Getting Business Buy-In Extensions and Integration points Metadata Governance Future Directions 12 158

Management Reactions Some Management Reaction Buy Another tool? Who will lead it? What does the Business Customer want? 13 Citi Sector Management Reaction 14 159

Management View on Tool Purchases 15 Who Would Lead This Endeavor? 16 160

What does the Business want? 17 What does the Business really want? TIPS Find out where their pain points are. Is there a manual process that a Repository can help with? What is their biggest time consumers? FORGET what the Vendors/Consultants have told them. Where do they see their groups 5 years from now? TIPS FOR YOU Partner with them; don t do all the work Be Realistic: I don t know yet is a valid answer. Look for opportunities they may not know about. Promote and Sell to other groups. 18 161

The Team The Development Engineering group of our Enterprise Architecture group stepped up and said it can be done. The Center of Technical Excellence (CTE) group took the lead. CTE Team consisted of the following: 1. Project Manager 2. Metadata Architect 3. Sr. ETL Architect 4. Jr. ETL Architect Extend the ETL Metamodel. Develop a plan to load and report on Metadata Get the Vendor involved. 19 Citi Business Metadata Needs ACTION OPTIONS Integration between the Business and Technical metadata. Traceability of Data Elements throughout their applications. Wanted to know who owns what. (Information Stewardship) Other considerations that surfaced: Data Redaction/Obfuscation Information Security Classifications Data Governance email metadata change notifications Impact analysis for major initiatives 20 162

Key Repository Highlights Data Asset Inventory (DAI) stores consistent consolidated views of Enterprise and Business Sector Data Assets linked to technical metadata Includes Data Models, Databases and ETL information DAI incorporates Standard Templates and Scanners to load all metadata types. DAI stores, displays and provides at a glance, the contact information for the Data Governance workflow showing the Business Owners, Data Stewards and integrates that information with each metadata element (entity/table level). Allows easy reporting of metadata information via a Web Portal interface. 21 Metadata Flow Publisher Layer Integration Layer 22 163

AGENDA Information Stewardship Citi s Dilemma and Possible Solutions Getting Business Buy-In Extensions and Integration points Metadata Governance Future Directions 23 Levels and Type of Metadata The Metadata Repository currently supports the following major asset types: Enterprise Level Data Assets (Business Terms) Business Sector Level Data Assets Logical and Physical Technical Metadata Entity/Table/File Level information: Business Ownership and Data Stewardship Metadata Data Profiling Results and Valid Values Attribute/Column/Field level information: Data Obfuscation/Redaction Information Information Security Classification information Physical Operational Metadata with some parameters 24 164

Extended Metamodel Components 25 Overall Architecture Components DATABASES and Flat Files DATA MODELS Enterprise Model Library Data Profiling Tool MANUAL METADATA Catalog Results SOA Design Time DAI Repository XML Business Process Tool Legend: Green: Vendor Product Yellow: Document Blue: Created Process Lt. Blue: Manual process Business Intelligence Tool ETL 26 165

AGENDA Information Stewardship Citi s Dilemma and Possible Solutions Getting Business Buy-In Extensions and Integration points Metadata Governance Future Directions 27 Repository Loading Process 28 166

Data Governance Process 29 Features and Benefits Features Metadata element search capabilities Contains Data Model information including graphical document of the Data Model Lists all the Data Assets that stakeholder is entitled Contains links to Business Terms, Logical and Physical elements Contains all audit trails and versions of the data element Initiates the approval process notification to the Owners and Stewards. Benefits Lists all Enterprise Data Assets with its associations Enables control over Sector Business Data Assets Contains traceability information and its associations Aids in compliance tracking Can hold XML document information Can store and integrate Data Model elements outside of the Data Models Can hold and attach documents to any Metadata elements 30 167

AGENDA Information Stewardship Citi s Dilemma and Possible Solutions Getting Business Buy-In Extensions and Integration points Metadata Governance Future Directions 31 Future Architecture 32 168

Enterprise Data Asset 33 Business Sector Data Assets 34 169

Business Data Asset Links 35 Change Requests 36 170

Traceability 37 Data Profiling Results 38 171

Other Considerations Metadata must be reported on the Production Level Modify SDLC procedures to enhance metadata capturing Must integrate all local ETL instances into the Global DAI. Consolidate Reference Data to create Enterprise Entities Provide Educational Services of Data Governance Procedures Implement Data Profiling and Data Quality processes for on boarding applications DO NOT let other areas dictate to you what should be in the Repository. They will want to use it as an application. SELL, SELL, SELL BUT, Keep in mind 39 Pitfalls NEVER TRY TO TEACH A PIG TO TALK IT WILL WASTE YOUR TIME AND ANNOYS THE PIG! 40 172

Questions Contact Information Bob Schork Formerly of Citigroup, Warren, NJ Currently Consultant for TEKsystems, Charlotte, NC Assigned to Bank of America as a Metadata Architect robert.schork@bankofamerica.com bobschork@hotmail.com 41 173