Data Quality for Data Stewards by Arkady Maydanchik and Dave Wells



Similar documents
Chapter 6 Basics of Data Integration. Fundamentals of Business Analytics RN Prasad and Seema Acharya

10426: Large Scale Project Accounting Data Migration in E-Business Suite

Understanding and managing data: The benefits of data governance and stewardship

Proactive DATA QUALITY MANAGEMENT. Reactive DISCIPLINE. Quality is not an act, it is a habit. Aristotle PLAN CONTROL IMPROVE

Business Performance & Data Quality Metrics. David Loshin Knowledge Integrity, Inc. loshin@knowledge-integrity.com (301)

M6P. TDWI Data Warehouse Automation: Better, Faster, Cheaper You Can Have It All. Mark Peco

MDM and Data Warehousing Complement Each Other

Making Business Intelligence Easy. Whitepaper Measuring data quality for successful Master Data Management

Chapter 5. Learning Objectives. DW Development and ETL

Framework for Data warehouse architectural components

US Department of Education Federal Student Aid Integration Leadership Support Contractor January 25, 2007

Building a Successful Data Quality Management Program WHITE PAPER

whitepaper The Evolutionary Steps to Master Data Management

dxhub Denologix MDM Solution Page 1

Data Integration and ETL with Oracle Warehouse Builder NEW

CHAPTER SIX DATA. Business Intelligence The McGraw-Hill Companies, All Rights Reserved

Measure Your Data and Achieve Information Governance Excellence

TDWI Data Integration Techniques: ETL & Alternatives for Data Consolidation

INTELLIGENT DEFECT ANALYSIS, FRAMEWORK FOR INTEGRATED DATA MANAGEMENT

HYPERION MASTER DATA MANAGEMENT SOLUTIONS FOR IT

DISCIPLINE DATA GOVERNANCE GOVERN PLAN IMPLEMENT

Knowledgent White Paper Series. Developing an MDM Strategy WHITE PAPER. Key Components for Success

Learning Objectives Lean Six Sigma Black Belt Course

What's New in SAS Data Management

I/A Series Software FoxDPM.com Dynamic Performance Monitor.com

Data Warehouse Overview. Srini Rengarajan

Business Intelligence In SAP Environments

... Foreword Preface... 19

Root Cause Analysis for IT Incidents Investigation

META DATA QUALITY CONTROL ARCHITECTURE IN DATA WAREHOUSING

Operational Excellence for Data Quality

Service Modelling & Service Architecture:

Comprehensive Data Quality with Oracle Data Integrator. An Oracle White Paper Updated December 2007

How To Get More Value From The Microsoft Dmdm Data Management Module (Dmm)

Master Data Management and Data Warehousing. Zahra Mansoori

Understanding and Evaluating the BI Platform by Cindi Howson

Data Integration for the Real Time Enterprise

Unit 1: Introduction to Quality Management

ORACLE FINANCIAL SERVICES ANALYTICAL APPLICATIONS INFRASTRUCTURE

Master Data Management. Zahra Mansoori

Selection Requirements for Business Activity Monitoring Tools

How to Create a Business Focused Data Quality Assessment. Dylan Jones, Editor/Community Manager editor@dataqualitypro.com

Enterprise Data Quality

Practical Considerations for Real-Time Business Intelligence. Donovan Schneider Yahoo! September 11, 2006

Data Discovery, Analytics, and the Enterprise Data Hub

Applied Business Intelligence. Iakovos Motakis, Ph.D. Director, DW & Decision Support Systems Intrasoft SA

BIG DATA COURSE 1 DATA QUALITY STRATEGIES - CUSTOMIZED TRAINING OUTLINE. Prepared by:

Integrating SAP and non-sap data for comprehensive Business Intelligence

IBM AND NEXT GENERATION ARCHITECTURE FOR BIG DATA & ANALYTICS!

ORACLE HYPERION DATA RELATIONSHIP MANAGEMENT

Lection 3-4 WAREHOUSING

ASYST Intelligence South Africa A Decision Inc. Company

Implementing Oracle BI Applications during an ERP Upgrade

Copyright 2007 Ramez Elmasri and Shamkant B. Navathe. Slide 29-1

Customer Analysis - Customer analysis is done by analyzing the customer's buying preferences, buying time, budget cycles, etc.

Microsoft Axapta Financial Management consists of several individually packaged offerings: Microsoft Axapta Financials I and Financials II

Week 13: Data Warehousing. Warehousing

Data Governance. David Loshin Knowledge Integrity, inc. (301)

INFORMATION MANAGEMENT. Transform Your Information into a Strategic Asset

M Designing and Implementing OLAP Solutions Using Microsoft SQL Server Day Course

Business Intelligence (BI) Data Store Project Discussion / Draft Outline for Requirements Document

Finding Business Rules in COBOL Systems

DATA GOVERNANCE AND DATA QUALITY

DATA GOVERNANCE AT UPMC. A Summary of UPMC s Data Governance Program Foundation, Roles, and Services

Best Practices in Leveraging a Staging Area for SaaS-to-Enterprise Integration

Building a Data Warehouse

Data warehouse and Business Intelligence Collateral

An Enterprise Data Hub, the Next Gen Operational Data Store

A Service-oriented Architecture for Business Intelligence

Master Data Management

Implementing a Data Warehouse with Microsoft SQL Server 2014

Business Intelligence Maturity Model. Wayne Eckerson Director of Research The Data Warehousing Institute

The Data Quality Continuum*

BENEFITS OF AUTOMATING DATA WAREHOUSING

Implementing Oracle BI Applications during an ERP Upgrade

In-Guest Monitoring With Microsoft System Center

Data Quality Assessment. Approach

Chapter 10 Practical Database Design Methodology and Use of UML Diagrams

MicroStrategy Course Catalog

Realizing the True Power of Insurance Data: An Integrated Approach to Legacy Replacement and Business Intelligence

Data Quality Where did it all go wrong? Ed Wrazen, Trillium Software

SQL Server 2012 Business Intelligence Boot Camp

Headstrong: SAP Solution Helps Streamline and Accelerate Financial Services Application Development

Semarchy Convergence for MDM The Next Generation Evolutionary MDM Platform

Cloud First Does Not Have to Mean Cloud Exclusively. Digital Government Institute s Cloud Computing & Data Center Conference, September 2014

Enabling Data Quality

Cisco IT Technology Tutorial Overview of ITIL at Cisco

Implementing a SQL Data Warehouse 2016

Jagir Singh, Greeshma, P Singh University of Northern Virginia. Abstract

Semarchy Convergence for Data Integration The Data Integration Platform for Evolutionary MDM

Aspen InfoPlus.21. Family

Transcription:

Data Quality for Data Stewards by Arkady Maydanchik and Dave Wells All rights reserved. Reproduction in whole or part prohibited except by written permission. Product and company names mentioned herein may be trademarks of their respective companies.

Module 0. About the Course (7 min) Module 1. Quality Basics (43 min) Quality Defined o What is Quality? o What isn t Quality o Quality vs. Qualities o Some Examples Quality and Defects o Defect Defined o Sources of Defects o Responding to Defects o Process Improvements Quality Economics o Cost of Quality o Cost of Defects o Cost of Quality Management Quality Management Practices o Quality Control o Quality Assurance o Quality Improvement Cycles Quality Management Methodologies o Statistical Process Control o SPC Control Charts o Total Quality Management o Six Sigma o Six Sigma Performance Module 2. Data Quality Basics (52 min) Data Quality Defined o Defect Free o Conforming to Specifications o Suited to Purpose o Meet Customer Expectations o Defining Data Quality o Qualities of Data Dimensions of Data Quality o Data Quality is Multidimensional o Content and Quality o Structure and Quality o Time and Quality o Business and Quality o Usage and Quality o Presentation and Quality Data Quality Projects o Common Data Quality Projects o Assessment Projects o Data Cleansing Projects o Process Improvement Projects Building Data Quality Page 2

o Data Quality and System Architecture o Data Quality and IT Projects o Application Development Projects o Data Conversion and Migration Projects o Data Warehousing Projects o Business Analytics Projects Data Quality Programs Data Quality Profession Module 3. Common Causes of Data Quality Problems (29 min) Introduction Manual Data Entry Data Integration o Initial Data Conversion o System Consolidations o Batch Feeds o Real-Time Interfaces Data Manipulation o Data Processing o Data Cleansing o Data Purging Data Decay o Changes Not Captured o System Upgrades o New Data Uses o Loss of Expertise o Process Automation Module 4. Introduction to Data Quality Assessment (59 min) Why Assess Data Quality o Role of Assessment in Data Quality Programs o Assessment Objectives Business Value of Data Quality Assessment o Planning Data Cleansing o Improving Data Collection Processes o Improving Data-Driven Business Processes o Supporting Data Migration and Data Integration Data Quality Assessment Approaches o Data Validation against Trusted Sources o Sample Data Validation o Rule-Driven Approach Project Team Project Steps o Defining the Project Scope o Loading the Data Staging Area o Gathering General Metadata o Data Profiling o Designing Data Quality Rules o Implementing Data Quality Rules o Fine-Tuning Data Quality Rules Page 3

o Building Data Quality Scorecard Data Quality Rules Overview o Attribute Level View of Data o Attribute Domain Constraints o Entity Level View of Data o Relational Integrity Constraints o Subject Level View of Data o Attribute Dependency Constraints o Advanced Business Rules o Time Dependent View of Data Building Data Quality Scorecard o School Report Card Example o Data Quality Scorecard Recurrent Data Quality Assessment Module 5. Introduction to Root Cause Analysis (53min) The Nature of Cause and Effect o Correlation o Coincidence o Confounding Variables o Contributing Variables o Influence o Complexity and Complications Cause and Effect Misconceptions o Simple vs. Complex o Linear vs. Circular o Close vs. Distant o Immediate vs. Delayed o Ordered vs. Non-ordered The Purpose of RCA o Root Cause Analysis Defined o Root Cause Defined o Root Cause Criteria o Root Cause Criteria Examples The Process of RCA o Five Steps of RCA o The Problem Description o Data Gathering o Casual Modeling o Root Cause Identification o Recommendations A First Look at Cause-Effect Models o Five-Why s o Fishbone Diagrams o Causal Loop Models Verifying Cause-Effect Conclusions o A Short Reasoning Exercise o Exercise Review o Nonsensical Thinking o Nonsense and Distortion Page 4

o Seeing What We Want To See o Logical Fallacies o Rethinking Cause and Effect o Systems Thinking o Bias of Facts and Logic o Evidence and Assumptions o Lateral Thinking Practical Application o Beyond RCA o Implement Solutions o Measure and Monitor o Watch for Side Effects o Correct Existing Issues Module 6. Ensuring Data Quality in Data Integration (41 min) Data Integration Basics o Data Integration Defined o Common Applications o Data Integration Approaches o Comparison of Integration Approaches Data Quality Perspective o Pros and Cons of Data Federation o Data Federation Quality Solution o Pros and Cons of Data Propagation o Data Propagation Quality Solution Data Propagation Overview o Propagation Frequency and Data Latency o Real-Time Interfaces o Batch Interfaces o Using Information Bus o Using Staging Area o Interface Spider Web Challenge o Information Integration Hub Batch Interfaces o Causes of Data Quality Problems o Data Quality Managmenet o Batch Monitors o Change Monitors o Error Monitors Page 5