Implementing Data Governance at Grifols: Best Practices and



Similar documents
Business User driven Scorecards to measure Data Quality using SAP BusinessObjects Information Steward

Making SAP Information Steward a Key Part of Your Data Governance Strategy

Data Integrator: Object Naming Conventions

Measure Your Data and Achieve Information Governance Excellence

A Look at Self Service BI with SAP Lumira Natasha Kishinevsky Dunn Solutions Group SESSION CODE: 1405

Overcoming Bad Design! Michael Simpson Catch Intelligence SESSION CODE: 0807

... Foreword Preface... 19

Using SAP Master Data Technologies to Enable Key Business Capabilities in Johnson & Johnson Consumer

SAP BusinessObjects Information Steward

DATA GOVERNANCE AND DATA QUALITY

SAP Agile Data Preparation

Whitepaper Data Governance Roadmap for IT Executives Valeh Nazemoff

Whitepaper. Data Warehouse/BI Testing Offering YOUR SUCCESS IS OUR FOCUS. Published on: January 2009 Author: BIBA PRACTICE

Preferred Strategies: Business Intelligence for JD Edwards

Data Quality Assessment. Approach

ENSURING A SUCCESSFUL SAP DATA MIGRATION

Course Outline: Course: Implementing a Data Warehouse with Microsoft SQL Server 2012 Learning Method: Instructor-led Classroom Learning

Information Governance Workshop. David Zanotta, Ph.D. Vice President, Global Data Management & Governance - PMO

Implementing a Data Warehouse with Microsoft SQL Server MOC 20463

COURSE OUTLINE MOC 20463: IMPLEMENTING A DATA WAREHOUSE WITH MICROSOFT SQL SERVER

Implementing a Data Warehouse with Microsoft SQL Server 2012 MOC 10777

SQL Server 2012 Business Intelligence Boot Camp

Building a Data Quality Scorecard for Operational Data Governance

Master Data Governance & SAP Information Steward Integration. Jens Sauer, SAP Switzerland September 11 th, 2013

James Serra Data Warehouse/BI/MDM Architect JamesSerra.com

EAI vs. ETL: Drawing Boundaries for Data Integration

What s New with Informatica Data Services & PowerCenter Data Virtualization Edition

SAP Crystal Reports & SAP HANA: Integration & Roadmap Kenneth Li SAP SESSION CODE: 0401

MDM and Data Warehousing Complement Each Other

Implementing a Data Warehouse with Microsoft SQL Server

Contents. visualintegrator The Data Creator for Analytical Applications. Executive Summary. Operational Scenario

Making SAP Information Steward a Key Part of Your Data Governance Strategy

COURSE 20463C: IMPLEMENTING A DATA WAREHOUSE WITH MICROSOFT SQL SERVER

Course Outline. Module 1: Introduction to Data Warehousing

Implementing a Data Warehouse with Microsoft SQL Server

ASYST Intelligence South Africa A Decision Inc. Company

Bringing agility to Business Intelligence Metadata as key to Agile Data Warehousing. 1 P a g e.

SAP Master Data Governance for Enterprise Asset Management. Dean Fitt Solution Manager, Asset Management Solutions, SAP SE Stavanger, 21 October 2015

Big Data and Big Data Governance

Getting started with a data quality program

Agenda. SAP BusinessObjects 2012 / Slide 2 Private and Confidential

Data Management Roadmap

Enabling Data Quality

Assessing and implementing a Data Governance program in an organization

Data Governance Maturity Model Guiding Questions for each Component-Dimension

Task definition PROJECT SCENARIOS. The comprehensive approach to data integration

Enterprise Data Quality

Course 20463:Implementing a Data Warehouse with Microsoft SQL Server

US Department of Education Federal Student Aid Integration Leadership Support Contractor January 25, 2007

Data Migration in SAP environments

SharePoint 2013 for Business Process Automation

Product to Customer. through MDM. Presented by Luminita Vollmer, MBA, CDMP, CBIP

DATA GOVERNANCE AT UPMC. A Summary of UPMC s Data Governance Program Foundation, Roles, and Services

Data Warehouse and Business Intelligence Testing: Challenges, Best Practices & the Solution

Microsoft. Course 20463C: Implementing a Data Warehouse with Microsoft SQL Server

What's New in SAS Data Management

Implement a Data Warehouse with Microsoft SQL Server 20463C; 5 days

BUSINESS INTELLIGENCE COMPETENCY CENTER (BICC) HELPING ORGANIZATIONS EFFECTIVELY MANAGE ENTERPRISE DATA

SAP BO 4.1 Online Training

Master Data Management and Data Warehousing. Zahra Mansoori

Course 10777A: Implementing a Data Warehouse with Microsoft SQL Server 2012

Big Data for Investment Research Management

THOMAS RAVN PRACTICE DIRECTOR An Effective Approach to Master Data Management. March 4 th 2010, Reykjavik

HROUG. The future of Business Intelligence & Enterprise Performance Management. Rovinj October 18, 2007

Implementing a Data Warehouse with Microsoft SQL Server 2012

Cisco IT Hadoop Journey

Oracle BI Application: Demonstrating the Functionality & Ease of use. Geoffrey Francis Naailah Gora

Information Management & Data Governance

Session 805 -End-to-End SAP Lumira: Desktop to On-Premise, Cloud, and Mobile

<Insert Picture Here> Master Data Management

Master Data Management

Implementing a Data Governance Initiative

SAS Data Management Technologies Supporting a Data Governance Process. Dave Smith, SAS UK & I

Data Integration Checklist

Beta: Implementing a Data Warehouse with Microsoft SQL Server 2012

Talend Metadata Manager. Reduce Risk and Friction in your Information Supply Chain

The following is intended to outline our general product direction. It is intended for informational purposes only, and may not be incorporated into

Data Governance in a Siloed Organization

Business Intelligence

Implementing a Data Warehouse with Microsoft SQL Server 2012

Predictive Analytics for Procurement Lead Time Forecasting at Lockheed Martin Space Systems

Master Data Management (MDM) in the Public Sector

Trends In Data Quality And Business Process Alignment

Accelerating the path to SAP BW powered by SAP HANA

Managing Third Party Databases and Building Your Data Warehouse

EMC PERSPECTIVE Enterprise Data Management

Dambaru Jena Senior Principal Hewlett-Packard (HP)

SAP Data Services Hacks Auto Generating Data Migration Jobs Shobhit Acharya Session# 3507

By Makesh Kannaiyan 8/27/2011 1

Data Governance. Unlocking Value and Controlling Risk. Data Governance.

Transcription:

Implementing Data Governance at Grifols: Best Practices and Lessons Learned Praneeth Padmanabhuni, Grifols Inc. Richard Hauser, Decision First Technologies SESSION CODE: 0204

LEARNING POINTS Discuss how SAP Information Steward can assist in establishing a Data Governance program Enable power users in the business to own data processing and be responsible for data quality Remove manual steps to automate data processing as much as possible Extend the out-of-the-box visualizations available in Information Steward scorecards by utilizing repository metadata Involve data stewards directly in de-duplication efforts via Match Review Tasks in Information Steward

Who Is Grifols? International healthcare company based in Barcelona, Spain with offices the Raleigh, NC as well as Los Angeles Develop and distribute life-saving protein therapies derived from human plasma Have experienced rapid growth over the past few years as a result of mergers and acquisitions

Challenges as a Growing Company 70+ files to collect data from on a monthly basis 70+ varying degrees of data quality!!! Only want to count sales at the closest point to an actual consumer Data warehouse had previously been outsourced, but volumes had reached a point where insourcing became a more attractive option Data cleansing was being performed manually via Excel files, but using a tool to process large volumes became a necessity

Decision First Technologies Who we are Atlanta-based SAP Business Objects specialists Partnered with SAP 7x Business Objects Partner of the Year SAP Business Objects, SAP EIM, and SAP HANA experts What we do Strategize and implement Data Governance solutions BI Nirvana 90 day Business Intelligence on HANA Full lifecycle data warehouse implementations Data visualizations and standard reporting

Data Governance Defined Core business process that ensures data is treated as a corporate asset and is formally managed throughout the enterprise Marriage of the following programs: Data Quality Information Management policies Security Business process management Risk management

Information Steward Information Steward was chosen to be used as the tool to help implement initial DG policies Integrates nicely with DS, which was already in use Gives visibility to data quality issues Easy for business users to pick-up and run with Not a fully blown master data solution, more of an MDM-lite

Challenges at time of enlisting DFT Cluttered ETL environment Many manual steps needed for weekly processes Data issues popping up weeks after loading of flat files Users not trustworthy of account master data

Solutions Put Forward Implement best practices in ETL environment Multiple developer repositories, central repositories, and best practices naming conventions Combine and automate common ETL jobs to the fullest extent possible Give visibility to data quality by developing an Information Steward scorecard Improve the customer account matching process and utilize DS cleansing transforms to build user trust in the data warehouse

ETL Coding Best Practices Multiple repos and landscapes Previously just PRD One repository per developer Fully fleshed-out DEV, QA, and PRD to properly test Central repo for each environment Allows for versioning and rollback in event of unintended consequences Moving objects to central forces developers to fully understand the impacts they are having to all objects Naming standards Objects properly named, data that is being sourced from or written to, initial/delta load, number in sequence if applicable E.g. DF_ACCOUNT_MASTER_INT_D

ETL Automation Combine objects into jobs, workflows, etc Went from 15 steps down to 3-4 depending on data Code objects for reusability, not one-off executions Standardize variables across all jobs and conform to a template job format Job Execution Table, Job Start Script Give power users authority to process data when ready by allowing them to run certain DS jobs that they are responsible for

DQ Visualizations Needed a way to assess DQ before it became an issue IS Data Insight was the best solution for our purposes Same data validation rules could be applied to all distributors Limit the data being analyzed to only most recent month Built an event-based process chain in the CMC to seamlessly integrate this step into the normal weekly ETL jobs

Original Sales Staging Process

New Sales Staging process with DQ

DQ Reporting Enhancements Extract data from appropriate tables/views in the IS repository database every time new DQ data is available Historical scores are readily available from the following database views: MMB_DATA_GROUP Contains project names, among many other things MMB_KEY_DATA_DOMAIN Key Data Domain descriptions MMB_KEY_DATA_DOMAIN_SCORE Historical scores for every active quality object MMB_DOMAIN_VALUE Quality dimension descriptions

MMB_KEY_DATA_DOMAIN_SCORE Contains scores for KDDs, QDs, Rules, Bindings, by key data domain, which is attached to a scorecard Column to select score type is KEY_DATA_DOMAIN_SCORE_TYPE_CD TOTL = Key Data Domain Score KDDQ = Quality Dimension Score KDDR = Rule Score KDDB = Rule Binding Score

Information Steward Repo Joins MMB_DATA_GROUP.DATA_GROUP_ID = MMB_KEY_DATA_DOMAIN.PROJECT_ID (Project description) MMB_KEY_DATA_DOMAIN.KEY_DATA_DOMAIN_ID = MMB_KEY_DATA_DOMAIN_SCORE.KEY_DATA_DO MAIN_ID where score_type_cd = TOTL (for KDD scores) MMB_KEY_DATA_DOMAIN_SCORE.SCORE_ID = MMB_DOMAIN_VALUE.DOMAIN_VALUE_ID where score_type_cd = KDDQ ( for Quality Dimension scores)

Automated DQ Chain

Automated DQ Chain

Scorecard

Scorecard Drilldown

DQ Webi Report

DQ Webi Report Drilldown

Account Master Cleanup Requirements Needed to prove to the business that account master data was trustworthy Too many overmatch and undermatch scenarios existed in the old account master Could not start from scratch because internal data had been matched to an external data source by a third party Needed the cleanup effort to have data steward input for uncertain matches Little impact as possible on all current processes

Account Master Cleanup, Step 1 Identify overmatch scenarios, i.e. accounts that had been incorrectly matched together Run all current accounts with their children through a data quality match transform Break key is on Data Warehouse ID Child can only match to their parent, not to other parent accounts Pass all potential overmatches to a review task in Information Steward for data steward input Use data steward s input to determine how to handle the record Leave alone or create a new account master

Account Master Overmatch Cleanup

Account Master Cleanup, Step 2 Improve the current delta matching logic that was part of the sales weekly data warehouse load Should see a gradual decrease in number of new accounts created over time 3K per week initially New children accounts must be matched first against existing account masters, only after that can they be considered a match with each other Account master data was frozen for one month to accomplish this task Short enough timeline to not have a critical impact on business decisions

Account Delta Process

Account Master Cleanup, Step 3 Identify undermatched accounts Accounts that should be merged together but haven t been for whatever reason Run all existing account master records through a DS match dataflow to determine if they should be merged into one If a potential match is found between 2 or more accounts, pass this match group along to an IS Match Review task for data steward review Utilize data stewardship results to determine a winning account master and deprecate the others in the group

Account Master Undermatch Cleanup

VISION FOR THE FUTURE Ultimately would like to associate Salesforce.com CRM data with actual sales data coming from distributors Provides backward-looking analysis of sales rep performance Capability to start performing some predictive analysis Find more ideal customers Identify prototypical customers Focus on these accounts to grow business Foundation is now in place to be in compliance with Sunshine Act when it goes into effect

RETURN ON INVESTMENT THUS FAR Yearly savings resulting from initial DW project: $441.5K Savings resulting from reduced time to process weekly records: $13,000/month or $156,000/year Customer targeting and predictive analytics is next No upper bound on revenue potential

BEST PRACTICES Involve the business often to showcase improvements and ask for further suggestions Necessary for all DG/DQ projects Keep history of IS Match Review results OK to leave in same table in 4.1, issues have been found in early versions of 4.2 Just fine to move to another table if too confusing Have separate Reviewer and Approver roles for Match Review tasks Easy to get fatigued when going through hundreds or thousands of records Also a good idea to allow a few days to pass between review and approval

KEY LEARNING SAP Information Steward can assist in establishing a Data Governance program and gaining momentum within your organization Empower your power users to own data processing and be responsible for data quality. Actively involve business users in all steps of the process Eliminate manual intervention to automate data processing as much as possible. This is where a large portion of ROI can be found

Questions? Praneeth Padmanabhuni praneeth.padmanabhuni@grifols.com Rich Hauser richard.hauser@decisionfirst.com

FOLLOW US Follow the ASUGNews team: Follow the ASUGNews team: Tom Wailgum: @twailgum & Courtney Bjorlin: @cbjorlin For all things SAP

THANK YOU FOR PARTICIPATING Please provide feedback on this session by completing a short survey via the event mobile application. SESSION CODE: 0204 For ongoing education on this area of focus, visit www.asug.com