Talend Metadata Manager. Reduce Risk and Friction in your Information Supply Chain



Similar documents
Enabling Better Business Intelligence and Information Architecture With SAP Sybase PowerDesigner Software

A Model-based Software Architecture for XML Data and Metadata Integration in Data Warehouse Systems

Enabling Better Business Intelligence and Information Architecture With SAP PowerDesigner Software

What's New in SAS Data Management

Data Integration Checklist

Data processing goes big

Managing Third Party Databases and Building Your Data Warehouse

Knowledgent White Paper Series. Developing an MDM Strategy WHITE PAPER. Key Components for Success

Bringing agility to Business Intelligence Metadata as key to Agile Data Warehousing. 1 P a g e.

Course Outline. Module 1: Introduction to Data Warehousing

Implementing a Data Warehouse with Microsoft SQL Server 2012 MOC 10777

SQL Server 2012 Business Intelligence Boot Camp

Information Management Metamodel

Business Benefits From Microsoft SQL Server Business Intelligence Solutions How Can Business Intelligence Help You? PTR Associates Limited

Java Metadata Interface and Data Warehousing

Course duration: 45 Hrs Class duration: 1-1.5hrs

Creating an Enterprise Reporting Bus with SAP BusinessObjects

Common Warehouse Metamodel (CWM): Extending UML for Data Warehousing and Business Intelligence

Federated, Generic Configuration Management for Engineering Data

Cúram Business Intelligence Reporting Developer Guide

Course 10777A: Implementing a Data Warehouse with Microsoft SQL Server 2012

Oracle Warehouse Builder 10g

SAP Data Services 4.X. An Enterprise Information management Solution

Model-Driven Data Warehousing

Implementing a Data Warehouse with Microsoft SQL Server 2012

Table of Contents Chapter 1 - Getting Started with Oracle Data Relationship Management (DRM) 1

Whitepaper Data Governance Roadmap for IT Executives Valeh Nazemoff

Data Mart/Warehouse: Progress and Vision

Implementing a Data Warehouse with Microsoft SQL Server 2012

Making SAP Information Steward a Key Part of Your Data Governance Strategy

QAD Business Intelligence Release Notes

POLAR IT SERVICES. Business Intelligence Project Methodology

Lost in Space? Methodology for a Guided Drill-Through Analysis Out of the Wormhole

Data Governance and CA ERwin Active Model Templates

An Oracle White Paper March Managing Metadata with Oracle Data Integrator

Oracle Data Integrator 12c: Integration and Administration

Microsoft. Course 20463C: Implementing a Data Warehouse with Microsoft SQL Server

Big Data for Investment Research Management

ORACLE HEALTHCARE ANALYTICS DATA INTEGRATION

ER/Studio Enterprise Portal User Guide

Oracle Data Integrator 11g: Integration and Administration

An Oracle White Paper February Oracle Data Integrator 12c Architecture Overview

In principle, SAP BW architecture can be divided into three layers:

How To Use Noetix

Data Warehouse (DW) Maturity Assessment Questionnaire

Introduction to Glossary Business

SAS BI Course Content; Introduction to DWH / BI Concepts

Enterprise Resource Planning Analysis of Business Intelligence & Emergence of Mining Objects

Three Fundamental Techniques To Maximize the Value of Your Enterprise Data

Data Warehouse and Business Intelligence Testing: Challenges, Best Practices & the Solution

Metadata Management for Data Warehouse Projects

PRODUCTS What s New: BusinessObjects XI Releases 1 and 2

COURSE 20463C: IMPLEMENTING A DATA WAREHOUSE WITH MICROSOFT SQL SERVER

Implementing a Data Warehouse with Microsoft SQL Server

MDM and Data Warehousing Complement Each Other

<no narration for this slide>

Paper DM10 SAS & Clinical Data Repository Karthikeyan Chidambaram

WHITE PAPER DATA GOVERNANCE ENTERPRISE MODEL MANAGEMENT

SOLUTION BRIEF CA ERwin Modeling. How can I understand, manage and govern complex data assets and improve business agility?

Chapter 6 Basics of Data Integration. Fundamentals of Business Analytics RN Prasad and Seema Acharya

DSS based on Data Warehouse

Metadata Repositories in Health Care. Discussion Paper

ER/Studio 8.0 New Features Guide

NEW FEATURES ORACLE ESSBASE STUDIO

Data warehouse and Business Intelligence Collateral

Oracle Data Integrator for Big Data. Alex Kotopoulis Senior Principal Product Manager

Beta: Implementing a Data Warehouse with Microsoft SQL Server 2012

TRENDS IN THE DEVELOPMENT OF BUSINESS INTELLIGENCE SYSTEMS

Rational Reporting. Module 3: IBM Rational Insight and IBM Cognos Data Manager

BUSINESSOBJECTS DATA INTEGRATOR

Data Integration and ETL with Oracle Warehouse Builder NEW

Enterprise Service Bus

Business User driven Scorecards to measure Data Quality using SAP BusinessObjects Information Steward

Implementing a Data Warehouse with Microsoft SQL Server

Implementing a Data Warehouse with Microsoft SQL Server MOC 20463

COURSE OUTLINE MOC 20463: IMPLEMENTING A DATA WAREHOUSE WITH MICROSOFT SQL SERVER

IBM WebSphere DataStage Online training from Yes-M Systems

Course Outline: Course: Implementing a Data Warehouse with Microsoft SQL Server 2012 Learning Method: Instructor-led Classroom Learning

Appendix A. Functional Requirements: Document Management

Introduction to Oracle Business Intelligence Standard Edition One. Mike Donohue Senior Manager, Product Management Oracle Business Intelligence

SAS Enterprise Data Integration Server - A Complete Solution Designed To Meet the Full Spectrum of Enterprise Data Integration Needs

Dynamic Decision-Making Web Services Using SAS Stored Processes and SAS Business Rules Manager

Conventional BI Solutions Are No Longer Sufficient

Implement a Data Warehouse with Microsoft SQL Server 20463C; 5 days

Analysis of the Specifics for a Business Rules Engine Based Projects

Salesforce Certified Data Architecture and Management Designer. Study Guide. Summer 16 TRAINING & CERTIFICATION

An Oracle White Paper October Oracle Data Integrator 12c New Features Overview

Data Integration and ETL with Oracle Warehouse Builder: Part 1

Online Courses. Version 9 Comprehensive Series. What's New Series

Integrating data in the Information System An Open Source approach

Data Modeling in the Age of Big Data

Databases in Organizations

JOURNAL OF OBJECT TECHNOLOGY

OWB Users, Enter The New ODI World

Oracle BI 10g: Analytics Overview

D83167 Oracle Data Integrator 12c: Integration and Administration

SEARCH The National Consortium for Justice Information and Statistics. Model-driven Development of NIEM Information Exchange Package Documentation

Model Driven Interoperability through Semantic Annotations using SoaML and ODM

BusinessObjects XI R2 Product Documentation Roadmap

Transcription:

Talend Metadata Manager Reduce Risk and Friction in your Information Supply Chain

Talend Metadata Manager Talend Metadata Manager provides a comprehensive set of capabilities for all facets of metadata management. At the heart of Talend Metadata Manager is a repository which contains repository objects, such as models and mappings that are organized into folders. Models can be harvested from Talend Data Integration models, Data Modeling tools, Data Warehouses, external metadata repositories for relational databases (RDBMS), and Data Integration and Business Intelligence tools. A particular type of repository object called Configuration, can connect metadata stitching models and mappings together to represent an Enterprise Architecture, including full support for data flow lineage and impact analysis, as well as semantic lineage definitions. Talend Metadata Manager consists of four major components: Metadata Bridge (metadata import) Metadata Manager Data Governance Metadata Authoring with Forward Engineering (metadata export) 800 Bridge Parkway, Suite 200, Redwood City, California 94065 US Page 2

Metadata Bridge Metadata is everywhere. Data warehousing, business intelligence, CASE and ETL tools all have their own repositories. Just about every application has its own data dictionary. XML carries the metadata with it in the message or document, and enterprise application integration environments have their own repositories and metadata mapping and integration facilities. In order to succeed, one must have a good enterprise repository integration environment that can integrate the different format of metadata from all tools. The Talend Metadata Manager repository bridges the technical and non-technical aspects of metadata, while simultaneously addressing the chasm between the different metadata source and target systems that constitute any modern information management environment. The Metadata Bridge imports all metadata via bridges (metadata import components), including Extract, Transformation and Load (ETL)/ Data Integration tools, Business Intelligence tools, Data Modeling tools, databases, most all metadata exchange standards, and numerous data formats including XML. Importing metadata from Talend Studio with Talend Metadata Manager 800 Bridge Parkway, Suite 200, Redwood City, California 94065 US Page 3

Metadata Manager (MM) Version and Configuration Management Not only must the repository be able to import on demand in any format and to any tool or import metadata many times as needed, it must be able to manage the versions created by this continuous activity. It must also be fundamental to the repository organization for administrators to then organize, publish and selectively present the information in appropriate configurations of metadata, as is required for the correct and precise answers to a wide range of cuts across this metadata. Talend Metadata Manager was designed from the ground up with version and configuration management as a key capability. Metadata Comparison All metadata is represented by an integrated metamodel in Talend Metadata Manager. This feature provides comparisons across metadata from data source formats supported, including design tools, databases, etc., not simply among versions of a given model. Comparing models or model versions with Talend Metadata Manager 800 Bridge Parkway, Suite 200, Redwood City, California 94065 US Page 4

Data Mapping Specifications Once imported, metadata can be mapped in a myriad of ways to any other metadata within Talend Metadata Manager. This ability is critical to the success of any metadata management solution. In particular, you can define data flow mappings describing data movement type relationships, e.g. when a database is read and the results written to another database, as well as semantic mappings which identify semantic relationships between elements, oftentimes conceptual or logical in nature, such as for a data dictionary or conceptual model such as a UML model. Metadata Stitching Metadata stitching is fundamental to the correct and automated analysis of the data flow and semantic lineage of metadata in the repository. It also supports version management across the constant rate of updates and changes in a repository. Talend Metadata Manager keeps complete versions of all imported metadata in self-contained models, which are then related via stitching s (simple connection mappings). In this way, version management and configuration management is not only entirely clean and isolated from the definition and maintenance of mappings, it also automatically supports updates and changes into the future. Getting a high level view of information flows across systems with metadata stitching 800 Bridge Parkway, Suite 200, Redwood City, California 94065 US Page 5

In this way, the enterprise architecture is correctly modeled, and data flow lineage is completely and accurately derivable. Lineage and Impact Analysis The different roles and their needs with respect to data and related metadata Once metadata is managed, metadata is then available for detailed technical and business analysis. Talend Metadata Manager supports full technical and business level lineage and impact analysis providing you new insight across all the connected metadata sources. Business User Lineage Reporting analysis is the typical use case, with questions such as: Given an item on a report, what data entry system fields impact these results? Why are the numbers on this report the way they are? How do I change the system data to correct the results of this report? Data lineage with Talend Metadata Manager 800 Bridge Parkway, Suite 200, Redwood City, California 94065 US Page 6

Technical User Impact Analysis Of high interest to the technical user are questions like: If I must change these elements (data type, code sets, etc.) in my operational data store, what is the downstream impact? This new ETL process is populating my staging warehouse in new ways, how does this impact the OLAP model in my reporting services? Technical User Lineage Reverse lineage type questions may also be asked by more technical users, such as: How many systems are required to determine the dimensions for this portion of the OLAP model? A business report use case is asking the lineage for particular values on a report, so where does the data come from and how is it manipulated? Business Users Impact Analysis Finally, business users may ask the forward lineage or impact analysis questions, such as: If I make a change to this field, what reports will be impacted? How is this identity information merged with the personnel system information on these other reports? Impact analysis with Talend Metadata Manager 800 Bridge Parkway, Suite 200, Redwood City, California 94065 US Page 7

Data Governance (DG) Critical to the development and management of a complete data architecture is a Business Glossary. Talend Metadata Manager provides an ISO 11179-based Business Glossary to capture, define, maintain and implement an enterprise Business Glossary of terminology, data definitions, code sets, domains, validation rules, etc. In addition, semantic mappings describe how elements in a source Model (more conceptual like the Business Glossary) define elements in a destination Model (closer to an implementation or representation). The Business Glossary helps an enterprise reach agreement between all stakeholders on their business assets (e.g. terms) and how they relate to data assets (e.g. database tables) and technology assets (e.g. ETL mappings). The Business Glossary can be used to document logical/physical data entities and attributes across IT collaboratively. Again, it involves tracing dependencies between business and technical assets. In Talend Metadata Manager, a Business Glossary is a self-contained collection of categories and the terms sub-categories contained within each category. In turn, the terms may be semantically mapped to objects throughout the rest of the repository, such as tables and columns in a data model. Once mapped, one may perform semantic lineage traces such as definition lookups and term semantic usage across any configurations containing the Business Glossary, mappings and mapped objects. Authoring the common business terms used in the organization with the Business Glossary 800 Bridge Parkway, Suite 200, Redwood City, California 94065 US Page 8

Bootstrapping a Business Glossary Building a Business Glossary can be as simple as dragging in an existing well-documented data model, via import from other sources (a CSV file format), or can be populated directly via the user interface during the process of classifying objects in other data store models. In general, a combination of such methods are employed in conjunction with one another. Workflow In order to ensure that the Business Glossary is accurate, up-to-date, available to all who need access to it, and integrated properly with the rest of the metadata in the repository, Talend Metadata Manager also provides a robust collection of Data Governance tools and methodologies. The Business Glossary provides a very flexible workflow and publication process that can address both basic and complex needs. In addition, one may maintain any number of business glossaries, each with different workflow and publication characteristics. The Business Glossary may be part of your lineage. It will appear in the repository panel and when you open a Business Glossary, you will be presented with a different UI than other (imported) Models. Workflow-driven search criteria are available allowing one to efficiently organize terms and identify what actions are required at any given time. When working with individual terms, which are at some point in the workflow process, workflow transition buttons prompt you with possible actions. Semantic Mapping A Semantic Mapping describes how elements in a source model (more conceptual) define elements in a destination model (closer to an implementation or representation). Put another way, elements in the destination model are representations or implementations of the associated element in the source model. They are three primary uses for semantic mapping: Data Standardization and Compliance Multi Level Modeling of semantic relationships from conceptual to logical, and to physical data model with a few sub cases Business Glossary term classification 800 Bridge Parkway, Suite 200, Redwood City, California 94065 US Page 9

Metadata Authoring (MA) with Forward Engineering (Metadata Export) Note: The following features only come with Talend Metadata Manager with Authoring. RDBMS and Big Data Documenter and Physical Data Modeler The Talend Metadata Manager Data Documenter allows users to document existing data stores, like databases, big data sources, and imported models, and publish the resulting documented data stores to the enterprise. The Data Documenter offers a different approach than traditional data modeling tools: The Business Glossary-driven Data Documenter methodology allows for immediate reuse and creation of terms and naming standards on the fly, fast tracking the data store documentation process ensuring complete semantic synchronization among your data models and data governance environment. Web-enabled Data Documenter offers better access to users than desktop tools Data Modeling and diagramming capabilities of the Data Documenter are similar to conventional data modeling tools. Full integration (import/export) to most popular data modeling tools is provided. Visualizing Data Models with Talend Metadata Manager 800 Bridge Parkway, Suite 200, Redwood City, California 94065 US Page 10 WP208-EN

Logical Data Modeler Talend Metadata Manager provides a completely web-enabled logical data modeling environment for producing logical and conceptual models: The Business Glossary-driven methodology allows for immediate reuse (creating of entities, attributes and domains) and creation of terms and naming standards on the fly, fast tracking the modeling process and ensuring complete semantic synchronization among your models and data governance environment. The Web-enabled modeler offers better access to users than desktop tools. The Data Modeling capabilities are competitive with conventional data modeling tools. Full integration (import/export) with most popular data modeling tools is provided. Data Mapping Designer Data Mapping Designs represents data integration process designs containing all the necessary data movement design details, such as lookups, filters, joins and transformation expressions. These Data Mapping Designs are complete enough that they may be forward engineered into Talend Data Integration using the Metadata Bridge. In this way, Talend Metadata Manager provides a completely web-based data mapping design tool that can reuse and be synchronized with all other metadata artifacts in the repository and your complete data governance environment. Defining the mappings directly in Talend Metadata Manager 800 Bridge Parkway, Suite 200, Redwood City, California 94065 US Page 11 WP208-EN

Visualizing the end to end information flows with Talend Metadata Manager 800 Bridge Parkway, Suite 200, Redwood City, California 94065 US Page 12 WP208-EN