Data Governance, Data Architecture, and Metadata Essentials Enabling Data Reuse Across the Enterprise



Similar documents
Data Governance. Data Governance, Data Architecture, and Metadata Essentials Enabling Data Reuse Across the Enterprise

Data Governance, Data Architecture, and Metadata Essentials

Improve Information Governance Through Clarity and Collaboration

R49 Using SAP Payment Engine for payment transactions. Process Diagram

Price and Revenue Management - Manual Price Changes. SAP Best Practices for Retail

Multi Channel Sales Order Management: Mail Order. SAP Best Practices for Retail

Enabling Better Business Intelligence and Information Architecture With SAP PowerDesigner Software

Simplify Complex Architectures and See the Potential Impact of New Technologies

Master Data Governance Find Out How SAP Business Suite powered by SAP HANA Delivers Business Value in Real Time

Integrated Finance, Risk, and Profitability Management for Insurance

Elevate Your Customer Engagement Strategy with Cloud Services

Partner Certification to Operate SAP Solutions and SAP Software Environments

Discover, Cleanse, and Integrate Enterprise Data with SAP Data Services Software

FA7 - Time Management: Attendances/Absences/Overtime/Hajj Leave. Process Diagram

Three Fundamental Techniques To Maximize the Value of Your Enterprise Data

Optimize Application Performance and Enhance the Customer Experience

Cost-Effective Data Management and a Simplified Data Warehouse

Measure Your Data and Achieve Information Governance Excellence

GR5 Access Request. Process Diagram

K88 - Additional Business Operations for Loans. Process Diagram

K75 SAP Payment Engine for Credit transfer (SWIFT & SEPA) Process Diagram

Create and Distribute Rich Media for Optimized, Omnichannel Customer Engagement

Unlock the Value of Your Microsoft and SAP Software Investments

PSM-PPM Integration SAP Product Structure Management

Quality Data in Record Time with SAP Information Steward Accelerator

SAP Customer Quote Program SAP MaxAttention Highlight Quotes

GSK Vaccines: Easing Compliance with SAP Process Control

How to Deliver a Coordinated Customer Experience across Every Channel

Transform Audit Practices and Move Beyond Assurance

Use Your Contact Center to Build a Better Customer Experience

SM250 IT Service Management Configuration

How To Use An Automotive Consulting Solution In Ansap

Challenges in the Effective Use of Master Data Management Techniques WHITE PAPER

Kennametal: Gaining Transparency in IT and Business with SAP Enterprise Support

Automotive Consulting Solution. CHEP - EDI- Container Data

Content Management for SAP Business Suite powered by SAP HANA

Managing Customer Relationships with SAP Business One

Downport to SAP GUI for documents Access Control Management

SuccessFactors Global Human Capital Management (HCM) Academy and Admin Training Schedule (Q3 Q4 2014)

Resource Management for the Oil and Gas Industry

SAP Audit Management A Preview

B2B E-Commerce Solutions Empower Wholesale Distributors

Formulate Winning Sales and Operations Strategies Through Integrated Planning

ABB: Independently Streamlining Its Organizational Setup with SAP Landscape Transformation

Sync, Share, and Store Information Across Devices Effectively and Securely

Getting Smart About Revenue Recognition and Lease Accounting

Driving Excellence in Implementation and Beyond The Underlying Quality Principles

SAP Solution Manager: The IT Solution from SAP for IT Service Management and More

Simplify Field Service Management with SAP Solutions

Enabling Better Business Intelligence and Information Architecture With SAP Sybase PowerDesigner Software

Textile Rubber and Chemical Company: Driving Cost Savings with SAP Business Suite powered by SAP HANA

Cyber Governance Preparing for the Inevitable Perimeter Breach

Preserving and Growing Value Through Enterprise Risk Management

Busting 7 Myths about Master Data Management

The Business Case for Using Big Data in Healthcare

SAP S/4HANA Embedded Analytics

Keep Enterprise Assets Productive with Effective Master Data Governance

In-Store Merchandise and Inventory Management. SAP Best Practices for Retail

Start Anywhere and Go Everywhere with Cloud Services for HR

Application Test Management and Quality Assurance

Integration capabilities of SAP S/4HANA to SAP Cloud Solutions

Managing Procurement with SAP Business One

Transform HR into a Best-Run Business Best People and Talent: Gain a Trusted Partner in the Business Transformation Services Group

Effecting Data Quality Improvement through Data Virtualization

Integration Capabilities of SAP S/4HANA to SAP Cloud Solutions

Drive Retail Sales and Enhance Loyalty by Streamlining Your Contact Center

Run Better in Weeks to Address Current and Future Business Needs

Aditro: Increasing Contact Center Efficiency for Improved Customer Satisfaction

Transform Your Bank in Measurable Steps

Visualization Starter Pack from SAP Overview Enabling Self-Service Data Exploration and Visualization

Province of North Brabant: Enhancing Efficiency by Integrating Geographic Information into SAP ERP

Effortless Customer Service with SAP Cloud for Service

Enterprise Information Management Services Managing Your Company Data Along Its Lifecycle

La Trobe University: A Model for Success Amid the Rapidly Shifting Dynamics of Higher Education

Enabling Competitive Advantage in Retail with Sales Insights

Information Technology Meets Operational Technology in the Internet of Things

Software Requirements

Keolis: Accelerating Recruitment with SAP Cloud for HR Solutions

Increasing Demand Insight and Forecast Accuracy with Demand Sensing and Shaping. Ganesh Wadawadigi, Ph.D. VP, Supply Chain Solutions, SAP

An Enterprise Resource Planning Solution (ERP) for Mining Companies Driving Operational Excellence and Sustainable Growth

Period End Closing Financial Accounting. SAP Best Practices

Bekaert: Reorganizing Sales and Distribution Units in SAP ERP and SAP BW

Adopt New SAP Software and Technology Early and Win

SAP HANA Live & SAP BW Data Integration A Case Study

Streamline Processes and Gain Business Insights in the Cloud

Run SAP Risk Management for Enterprise Risks in Life Sciences for Fast Business Value

Simplify Invoice Processing for Complex, Project-Based Spend

Data Integration Alternatives Managing Value and Quality

SAP Working Capital Analytics Overview. SAP Business Suite Application Innovation January 2014

Design the Future of Your Human Resources with SuccessFactors Solutions

SAP BusinessObjects Cloud

DMM301 Benefits and Patterns of a Logical Data Warehouse with SAP BW on SAP HANA

Data Integration Alternatives Managing Value and Quality

Your Intelligent POS Solution: User-Friendly with Expert Analysis

Outperform Financial Objectives and Enable Regulatory Compliance

Intelligent Business Operations powered by SAP HANA Infuse real-time intelligence into your business processes

Build Better Social Relationships and Realize Better Results

Leverage the Internet of Things to Transform Maintenance and Service Operations

Transcription:

Data Governance Data Governance, Data Architecture, and Metadata Essentials Enabling Data Reuse Across the Enterprise

2

Table of Contents 4 Why Business Success Requires Data Governance Data Repurposing and Data Integration Challenges for Centralization and Repurposing 7 Data Architecture and Data Governance Data Modeling and Architecture Standards Maintaining Relevant Enterprise Metadata Data Requirements Analysis Considerations for Effectively Governing Data Architecture About SAP PowerDesigner To Learn More

Why Business Success Requires Data Governance Data governance standards are critical for successful enterprise data repurposing. Ineffective oversight of data representations, semantics, and models introduces severe risks to key business applications. When developed independently, applications can have ill-defined business terms and multiple models for common data concepts, often compromising the value of consolidated data sets. Yet the huge demand for data repurposing means that data managed in an enterprise data warehouse must be trustworthy. This paper examines the root causes of data centralization failure and then reviews straightforward best practices that can help avoid such failures but are typically ignored when systems are designed in an ad hoc, organic manner (as in most organizations). Instituting data governance best practices will reduce the risks and increase trust in organizational information. These include: Data architecture and data modeling standards Enterprise metadata management Comprehensive data requirements analysis Implementing these best practices requires the integration of processes and technology, specifically data requirements management, metadata management, and data modeling. However, these tools are employed most effectively when knowledge captured within any part of the technology can be shared across the entire application development lifecycle. When the tools and techniques provide a line of sight during the design phase from the requirements through to the implementation and the transition into production, a link can be made from concept to data instance. In this way, all system impacts can be identified for any adjustments or changes in semantics or structure at all levels of data precision. Employing data management tools that are inherently engineered to provide visibility both across the data architecture and along the system lifecycle effectively supports the integration of data governance policies and practices to enhance data reuse. 4

DATA REPURPOSING AND DATA INTEGRATION In the early days of computing, application programs and their underlying data sets were developed to address specific business application needs within specific areas of the organization. Sales department applications differed from those developed to support fulfillment, finance, back-office processing, and various other lines of business. In contrast, today there is a growing trend toward data repurposing, in which selected business applications discover and ultimately reuse for their own purposes data sets that were created or acquired to meet a different business application s requirements. We are familiar with the grand enterprise-level examples enterprise resource planning (ERP), customer relationship management (CRM), business intelligence and analytics, and even reporting via data warehousing which all rely on data integrated and consolidated from across a collection of source applications. In turn, business consumers of many additional applications expect to benefit from the unified views of common business concepts incorporated and managed within master data environments. Yet the expansion of the scope of use of repurposed data sets exposes challenges and conflicts that can potentially wreak havoc on the intended results of the consuming applications. For the most part, siloed data models and applications have been designed in a vacuum, with little concern for interoperability across the line-of-business boundary. The data sets were mostly developed to support specific transactional or operational needs, and therefore they have been engineered to satisfy immediate requirements without any consideration for longer-term downstream consumption. And although the same business terms have been used, the absence of rigor in enforcing naming standards or providing clear definitions means that differences in structure, format, and meaning have crept into the data. When data sets are used for their original purpose, these variances in structure and semantics are largely irrelevant. But the by-product of data repurposing is the magnification of these structural and semantic differences. The result is that ungoverned consolidation will expose increasing complexity and difficulty in successful reuse of data for alternate purposes such as ERP, CRM, and master data management. CHALLENGES FOR CENTRALIZATION AND REPURPOSING The desire to repurpose data must be contrasted with the challenges in effectively centralizing the data sets to support the cumulative business needs. With data sets that are under consideration for centralization, even slight structural and semantic variances can inadvertently introduce inconsistencies for downstream consumers, especially after a series of data transformations are applied to force data sets to merge into an often hastily engineered target representation. Although incomplete attributes or variance in which values are perceived to be accurate can be the culprits, more often the issues of inconsistency emerge as the by-product of the absence of historical standards (and lack of governance) for the ways that different stakeholders model their core data concepts. So while organically developed applications are likely to share representations of the same concepts, their siloed development often leads to structural differences at various levels of precision (for example, data element versus table structure), as well as semantic differences at the many levels of precision. Structural Modeling Precision at the Data Element Level Even commonly used attributes are subject to structural variation. Consider a data concept with a well-defined standard: the North American Numbering Plan (NANP), which is the standard for representing telephone numbers used in the United States, Canada, and a number of other countries. The standard specifies a telephone number structure +1-NPA-NXX-xxxx, in which the NPA refers to an area code, the NXX is a central office exchange code, and the xxxx is the subscriber (or line) number. However, consider the many ways that telephone numbers are presented, using a variety of special characters (including parentheses, hyphens, periods, commas, and spaces). The underlying data elements are structured many different ways; just a few examples are shown in the following table. 5

Data Element Representations for a NANP Telephone Number Data Element Representation Example Comments CHAR(12) 301-754-6350 NPA, NXX, and subscriber number all separated by hyphens CHAR(10) 3017546350 All punctuation removed NUMERIC(10) 3017546350 Numeric representation VARCHAR(15) +1-301-754-6350 Allows for prefixed +1- VARCHAR(20) (301) 754-6350 x101 Allows extensions and alternate representations Even these few examples demonstrate differences that require parsing, standardization, and resolution of structure (especially when there are embedded extension numbers) when attempting to repurpose data sets. And this is but one example using data values that are already subject to existing standards. Consider the challenges with data elements whose values are not expected to conform to a defined standard. Structural Modeling Precision at the Table or Relationship Level As a means for establishing contact, telephone numbers factor in structural modeling precision issues as well. Early data tables and files used for batch customer transaction processing may have been designed to capture one or two telephone numbers the customer s home telephone number and possibly an office telephone number. But files structured with column space to hold only two numbers cannot capture the many possible telephone numbers that today could be associated with an individual, including mobile numbers, voice over IP (VOIP) numbers, virtual office numbers, and fax numbers, as well as many other contact mechanisms. Later, system designers will have dissociated the data attributes associated with contact mechanisms into related tables linked via foreign keys. These structural differences introduce the need for more complex rules and transformations in order to reuse data from different sources. Semantic Differences The potential complexity of the structural variation is dwarfed by the challenges of variant semantics. Just consider the many meanings for commonly used business terms. For example, to the sales organization, a customer is a party who exchanges value in return for products or services. But to the customer service organization, a customer is a party entitled to customer support services. In the situation where evaluation products are provided at no charge to interested prospects, there is a qualitative difference between sales customers and service customers, even though they are both referred to as customers. 6

Data Architecture and Data Governance The challenges introduced by the absence of governance in legacy system designs, coupled with the growing interest in repurposing data from across (and even from outside) the enterprise, have a clear message: moving forward, modeling and metadata management cannot be performed in a vacuum. Rather, oversight at the organizational level must be imposed to establish standard practices for enterprise data design, modeling, sharing, and reuse. This suggests the need for specific policies for data governance associated with different aspects of data architecture, with the intention of establishing a high level of maturity and capability, namely: Data architecture and data modeling standards to reduce variation in structure Enterprise metadata management (from a horizontal perspective) to standardize semantics and to provide visibility of use from concept to instantiation Comprehensive data requirements analysis to capture all prospective data consumer requirements DATA MODELING AND ARCHITECTURE STANDARDS The high likelihood of data reuse will influence system developers to approach their designs in a way that anticipates downstream data consumption. Standardizing the representative models will reduce the effort for subsequent extraction and consolidation, and this means increased oversight of any newly developed data models. Instituting organizational standards, along with the data governance processes overseeing observance of these standards, is the first step to resolving the challenges inherent in wholesale data consolidation. Governing data architecture combines the definition of policies for observing data element standards and data modeling guidelines with the processes to ensure that those standards are observed. This may run the gamut from rudimentary policies defining data element naming conventions, normalizing structures for common data themes, and defining schemas and canonical models for data exchange, to establishing protocols for enterprise data modeling. It can also involve instituting processes for data model review and acceptance by the members of a data governance board. Defining selected data governance policies only addresses one piece of the puzzle. When data policies are defined, there must be processes and procedures to develop business applications while meeting business objectives within a data governance framework. 7

MAINTAINING RELEVANT ENTERPRISE METADATA The flip side of defining organizational data element and modeling standards involves communicating the details of those standards and then managing compliance with them. One effective way to accomplish both of these goals uses metadata management methods. When the data management practitioners within the organization understand the ramifications of slight variations, they strive to attain a high level of metadata maturity. This means that a metadata management strategy is clearly defined and communicated to all developers and consumers, and there are centralized tools and techniques integrated as part of the enterprise development framework. A single metadata repository accessible across the organization can be used to document data element concepts, their instantiations, and any structural variances. Business terms can be mapped to data element concepts, which are then linked to their assorted instantiations across the application infrastructure. This provides a virtual line of sight between business concept and application use. Where the conceptual data elements are touched by more than one business application, the metadata analysts can review the usage map for those elements and analyze the impact of adjustments to any underlying or dependent data element definitions. DATA REQUIREMENTS ANALYSIS We are conditioned to consider the business application that either creates or initially acquires the data as the primary consumer. But while primary use of a data set can be defined as first in order, it can also be defined as first or highest in rank of importance. Increased data repurposing means that the originating application may not be the most important use of the data. If the alternate uses are high in rank of importance, they are also primary consumers. Therefore, it is critical to ensure that measured levels of structural and semantic consistency are sufficient to meet the business needs of the collected downstream data consumers, which means thinking differently about soliciting and documenting data requirements across the organization. Usually, data requirements are a by-product of the functional requirements implied by the needs of the business process whose application is being designed. In turn, those data requirements are only defined to meet an acute functional need but do not address how the data sets are potentially used by other business processes. However, as more data sets are subjected to centralization and repurposing, there is a corresponding need to adjust the system development process so that enterprise requirements are properly captured and incorporated into the data architecture. Yet again, data governance policies can help direct an approach to soliciting, capturing, and documenting data requirements that can be directly linked to the ways the underlying models will be designed and implemented. Guiding the ways that system designers engage the general community of potential data consumers will ensure that organizational information requirements are captured and managed. This reduces the need for downstream data extractions and transformations while improving general information usability. Instituting good data quality practices and governing those practices with the right tools and techniques essentially reduces structural and semantic inconsistency. 8

CONSIDERATIONS FOR EFFECTIVELY GOVERNING DATA ARCHITECTURE As the rates of data volume growth continue to rapidly increase, technical advisors suggest that unifying our views of enterprise information via enterprise data warehouses, enterprise resource planning, or master data management will increase value along the different value-driver dimensions such as increased growth, decreased expenditures, and reduction in risk. Yet we have shown that the traditional approach to data repurposing, consolidation, and reuse itself entails a number of intrinsic risks. To avoid these risks, you must institute data governance. But defining selected data governance policies only addresses one piece of the puzzle. When data policies are defined, there must be processes and procedures to back them up, with corresponding methods to develop business applications while meeting business objectives within a data governance framework. This suggests considering tools and techniques to oversee approaches to organizational data architecture that support enterprise information management and governance goals, including: Processes for defining and then approving data policies Communicating data policies and associated guidance across line-of-business boundaries Unifying the collection of data requirements for all key data concepts Documenting structural data standards Harmonizing business term definitions and semantics Unifying business models and developing standard data models for persistence and sharing Active monitoring that data requirements are being observed Assessing the requirements from across the line-ofbusiness landscape and ensuring consistent observance of those requirements through the design, development, and implementation phases of the system Data governance policies can help direct an approach to soliciting, capturing, and documenting data requirements that can be directly linked to the ways the underlying models will be designed and implemented. 9

Even if data governance practices are defined, there must be data management tools and techniques to ensure a line of sight during the design phases from requirements through implementation and transition into production, including data requirements management, metadata management, and data modeling. More important, though, these tools must support the sharing and exchange of knowledge throughout the development lifecycle. Data expectations captured during the requirementsgathering stage must be connected to the associated data elements and data models that are used by the developed business application. At the same time, during all system development lifecycle phases, the data requirements must remain visible, maintaining the link from concept to data instance so that all system impacts can be identified for any changes in definition or structure at any level of data precision. While disparate tools may support some canonical representation for sharing metadata, attempting to cobble these tools together may not only introduce additional system development complexity, but it may actually lead to a chaotic environment in which many of the data governance practices are wasted. Employing data management tools that are inherently engineered to provide visibility both across the data architecture and along the system lifecycle effectively supports the integration of data governance policies and practices to enhance data reuse across the enterprise. These become the critical criteria when evaluating tools to support governed data management. 10

ABOUT SAP POWERDESIGNER SAP PowerDesigner software, an industry-leading modeling and metadata management tool, offers a model-driven approach to empower and align business and IT. Implementing data governance best practices requires a tool that allows all levels of data to be captured, articulated, and shared. Today s complex IT environments require tools and processes to manage the flow of information between all phases of the IT organization and business community. TO LEARN MORE To find out more about best practices for data governance and how a modeling tool like SAP PowerDesigner can help you achieve a single version of the truth, please contact your SAP representative or visit us online at www.sap.com/solutions /technology/database/model-driven-architecture/index.epx. The true impact analysis offered by SAP PowerDesigner reduces the time, risk, and cost associated with changes within the business intelligence environment by: Establishing a single version of the truth for key information assets Providing consistent information, when and where needed, to improve decision making Enforcing governance and accountability for key information assets in support of compliance Enabling information to be shared and exchanged, with appropriate safeguards Supporting efficiency, collaboration, and transparency needs SAP PowerDesigner provides an integrated modeling solution that encompasses: Data modeling Business process modeling Applications modeling Business requirements modeling Metadata management Support for enterprise architecture frameworks Implementing best practices for data governance requires the integration of processes and technology, specifically data requirements management, metadata management, and data modeling. 11

www.sap.com/contactsap CMP22173 (14/01) No part of this publication may be reproduced or transmitted in any form or for any purpose without the express permission of SAP AG or an SAP affiliate company. SAP and other SAP products and services mentioned herein as well as their respective logos are trademarks or registered trademarks of SAP AG (or an SAP affiliate company) in Germany and other countries. Please see http://www.sap.com/corporate-en/legal/copyright/index.epx#trademark for additional trademark information and notices. Some software products marketed by SAP AG and its distributors contain proprietary software components of other software vendors. National product specifications may vary. These materials are provided by SAP AG or an SAP affiliate company for informational purposes only, without representation or warranty of any kind, and SAP AG or its affiliated companies shall not be liable for errors or omissions with respect to the materials. The only warranties for SAP AG or SAP affiliate company products and services are those that are set forth in the express warranty statements accompanying such products and services, if any. Nothing herein should be construed as constituting an additional warranty. In particular, SAP AG or its affiliated companies have no obligation to pursue any course of business outlined in this document or any related presentation, or to develop or release any functionality mentioned therein. This document, or any related presentation, and SAP AG s or its affiliated companies strategy and possible future developments, products, and/ or platform directions and functionality are all subject to change and may be changed by SAP AG or its affiliated companies at any time for any reason without notice. The information in this document is not a commitment, promise, or legal obligation to deliver any material, code, or functionality. All forward-looking statements are subject to various risks and uncertainties that could cause actual results to differ materially from expectations. Readers are cautioned not to place undue reliance on these forward-looking statements, which speak only as of their dates, and they should not be relied upon in making purchasing decisions.