Enterprise Data Quality Dashboards and Alerts: Holistic Data Quality



Similar documents
Enterprise Data Quality Dashboards and Alerts: Holistic Data Quality. Jay Zaidi Fannie Mae

Data Governance Demystified - Lessons From The Trenches

Building a Data Quality Scorecard for Operational Data Governance

Five Fundamental Data Quality Practices

Analance Data Integration Technical Whitepaper

JOURNAL OF OBJECT TECHNOLOGY

Business Performance & Data Quality Metrics. David Loshin Knowledge Integrity, Inc. loshin@knowledge-integrity.com (301)

Enabling Data Quality

Enterprise Data Governance

Information Governance Workshop. David Zanotta, Ph.D. Vice President, Global Data Management & Governance - PMO

DATA GOVERNANCE AND DATA QUALITY

Analance Data Integration Technical Whitepaper

What is Security Intelligence?

Master Data Management

POLAR IT SERVICES. Business Intelligence Project Methodology

Data Governance. David Loshin Knowledge Integrity, inc. (301)

Operationalizing Data Governance through Data Policy Management

Proactive DATA QUALITY MANAGEMENT. Reactive DISCIPLINE. Quality is not an act, it is a habit. Aristotle PLAN CONTROL IMPROVE

Enterprise Data Management

Data Warehouse Overview. Srini Rengarajan

Service Oriented Architecture and the DBA Kathy Komer Aetna Inc. New England DB2 Users Group. Tuesday June 12 1:00-2:15

BIG DATA THE NEW OPPORTUNITY

HYPERION MASTER DATA MANAGEMENT SOLUTIONS FOR IT

Q1 Labs Corporate Overview

MDM and Data Warehousing Complement Each Other

Data Integrity and Integration: How it can compliment your WebFOCUS project. Vincent Deeney Solutions Architect

Enterprise Data Quality

Making Business Intelligence Easy. Whitepaper Measuring data quality for successful Master Data Management

Data Quality Assessment. Approach

dxhub Denologix MDM Solution Page 1

Data Integration Checklist

SAP BusinessObjects Information Steward

BANKING ON CUSTOMER BEHAVIOR

Whitepaper Data Governance Roadmap for IT Executives Valeh Nazemoff

Management Update: The Cornerstones of Business Intelligence Excellence

Implement a unified approach to service quality management.

How to Create a Business Focused Data Quality Assessment. Dylan Jones, Editor/Community Manager editor@dataqualitypro.com

A Guide Through the BPM Maze

... Foreword Preface... 19

END-TO-END BANKING SOLUTIONS

Flexible Business Process Management enabled by SOA Full support of BPM life cycle Closing the gap between Business & IT

Building a Successful Data Quality Management Program WHITE PAPER

Master Data Management. Zahra Mansoori

Washington State s Use of the IBM Data Governance Unified Process Best Practices

BIG DATA COURSE 1 DATA QUALITY STRATEGIES - CUSTOMIZED TRAINING OUTLINE. Prepared by:

By Makesh Kannaiyan 8/27/2011 1

Data Virtualization A Potential Antidote for Big Data Growing Pains

Improving data governance; how can health informatics practitioners help gain stakeholder support?

Consulting Solutions Disaster Recovery. Yucem Cagdar

DataFlux Data Management Studio

Master big data to optimize the oil and gas lifecycle

Hadoop Data Hubs and BI. Supporting the migration from siloed reporting and BI to centralized services with Hadoop

Master Data Management

Logical Modeling for an Enterprise MDM Initiative

Submitted to: Service Definition Document for BI / MI Data Services

Master Data Management: More than a single view of the enterprise? Tony Fisher President and CEO

Operational Excellence for Data Quality

Why Data Governance - 1 -

Become a hunter: fi nding the true value of SIEM.

US Department of Education Federal Student Aid Integration Leadership Support Contractor January 25, 2007

Enterprise Information Management

Big Data Governance. ISACA Chapter Annual Conference Sarova Whitesands Hotel, Mombasa 29th - 31st July, Prof. Ddembe Williams KCA University

Secure Data Transmission Solutions for the Management and Control of Big Data

Defending against modern cyber threats

Oracle Data Integrator 12c: Integration and Administration

DATA QUALITY MATURITY

Oracle Data Integrator 11g: Integration and Administration

Architecting for the Internet of Things & Big Data

Enterprise Data Governance

Chapter 6 Basics of Data Integration. Fundamentals of Business Analytics RN Prasad and Seema Acharya

The Power of Risk, Compliance & Security Management in SAP S/4HANA

Italian Enterprises Adopt Big Data Solutions. Forrester Consulting

IRMAC SAS INFORMATION MANAGEMENT, TRANSFORMING AN ANALYTICS CULTURE. Copyright 2012, SAS Institute Inc. All rights reserved.

Service Oriented Data Management

Proven Testing Techniques in Large Data Warehousing Projects

Information Quality for Business Intelligence. Projects

SAS Data Management Technologies Supporting a Data Governance Process. Dave Smith, SAS UK & I

Big Data for Investment Research Management

Course Outline. Module 1: Introduction to Data Warehousing

Build a Streamlined Data Refinery. An enterprise solution for blended data that is governed, analytics-ready, and on-demand

Seeking Data Quality. Using Agile Methods to Test a Data Warehouse

FROM DATA STORE TO DATA SERVICES - DEVELOPING SCALABLE DATA ARCHITECTURE AT SURS. Summary

A Roadmap to Intelligent Business By Mike Ferguson Intelligent Business Strategies

Vertical Data Warehouse Solutions for Financial Services

Framework for Data warehouse architectural components

Agile Business Intelligence Data Lake Architecture

Aligning Your Strategic Initiatives with a Realistic Big Data Analytics Roadmap

Enterprise Solutions. Data Warehouse & Business Intelligence Chapter-8

Westernacher Consulting

A Vision for Operational Analytics as the Enabler for Business Focused Hybrid Cloud Operations

The following is intended to outline our general product direction. It is intended for informational purposes only, and may not be incorporated into

VMware vcenter Log Insight Delivers Immediate Value to IT Operations. The Value of VMware vcenter Log Insight : The Customer Perspective

Transcription:

Enterprise Data Quality Dashboards and Alerts: Holistic Data Quality Jay Zaidi Bonnie O Neil (Fannie Mae) Data Governance Winter Conference Ft. Lauderdale, Florida November 16-18, 2011

Agenda 1 Introduction 2 Data Quality Challenges and Opportunities 3 Holistic Data Quality (HDQ) 4 Enterprise Data Quality Solutions Architecture 5 Enterprise Data Quality Dashboard Example Page 2

Meet the Authors Jay Zaidi Enterprise Data Quality Program Lead, Fannie Mae 15+ years in Enterprise Data Management and Solution Architecture Specialized in Financial Services and Healthcare domains Page 3

Meet the Authors Bonnie O Neil Technical Data Architect, Fannie Mae 20+ years as a Data Architect Author: 3 books Most recent: Business Metadata Author, over 50 articles & white papers Page 4

Data Quality Management Challenges and Opportunities Data Silos Holistic Data Quality (HDQ) Data Volumes and Velocity Data Optimization and Scalability Complex Data Architectures Simplify Data Architecture Real Time Enterprise Requirements Real Time Data Quality Monitoring Lack of of Accountability Strong Data Governance Reactive Mode Proactive Data Quality Controls Lack of of Straight Through Processing Automated controls and monitoring Structured and Unstructured Data (email, video, logs, system events etc) Leverage Big Data Solutions High level of maturity in Data Quality Management is required to address operational challenges. Page 5

The Data Quality Maturity Journey STEP ONE STEP TWO STEP THREE FOUNDATION & FRAMEWORK CONSTRUCTING THE RAILROAD EXECUTION DQ Use Cases Solution Architecture Industry Tool Selection Consistent DQ Definitions Tool Deployment Reporting Capabilities Training & Communication Change Management Awareness Proactive DQ Controls DQ Continuous Improvement Robust data quality management is required to support Regulatory Compliance, Risk Management, Accounting, Financial reporting and other business functions. Page 6

The Data Architecture Spaghetti Department Two Operational Data Store Transactional Store Data Mart Transactional Store Data Mart Data Warehouses Operational Data Store Department One Department Three Diagram by Arnon Rotem-Gal-Oz, April 2007 How do you manage the quality of business critical data in a dynamic and highly complex environment? Page 7

The Information Supply Chain Transparency into quality across supply chain Diagram by George Marinos - The Information Supply Chain: Achieving Business Objectives by Enhancing Critical Business Processes, April 2005 Each link of the information supply chain is dependant on the other strong controls are needed to manage business critical data. Page 8

Typical Current State Data Flow External Data Feeds Transactional and Operational Stores External Data Feeds Data Warehouse Data Marts Potential data quality problem The current siloed approach to data management is wasteful and doesn t provide transparency into systemic issues. Page 9

Future State Data Flow: Continuous Data Quality Monitoring External Data Feeds Transactional and Operational Stores External Data Feeds Data Warehouse Data Marts DQ Monitoring Enterprise Data Architecture should enable straight through processing and offer operational efficiencies. Page 10

Typical Business Scenario Analyze Data and Conduct Forensics (Data Quality Tool) Implement Real Time Data Quality using DQ Services (Data Quality Tool) Identify anomalies and remediate issues (Data Quality Tool and EDQ Dashboard) Internally or Externally Supplied data Enterprise Applications Reports & Executive Dashboards Enterprise Data Stores (Transactional, Operational, Marts and Warehouses) The Enterprise Data Quality Platform provides the tools, methodologies and best practices to identify and remediate data quality issues. Page 11

Holistic Data Quality Our focus should be on addressing systemic issues. This requires a switch from reactive to proactive approaches to data quality and quality that is not evaluated or managed in silos, but addressed using a holistic cross-silo approach. Holistic Data Quality (HDQ) is the term that I have coined to address this need. Jay Zaidi Implementing HDQ at the enterprise level is a strategic, multi-year effort for mid to large-sized firms. If done right - the return on investment is many fold. Page 12

Do Not Boil The Ocean Narrowing the scope of the effort will ensure success Identify data critical for the enterprise 10,000 to 20,000 General population of data elements* 2,000 to 3,000 400 to 500 Critical data for a line of business* ( LOB Critical ) Critical data for the enterprise* ( Enterprise Critical ) Initial Focus should be on Enterprise Critical data * Estimates Only Enterprise level governance and quality efforts should focus on Enterprise Critical data. Lines of business should govern and manage the quality of their business critical data. Page 13

Dimensions of Data Quality The concept of Dimensions of Data Quality has been established by many authors in the industry, such as David Loshin and Danette McGilvray: To be able to correlate data quality issues to business impacts, we must be able to both classify our data quality expectations as well as our business impact criteria. -David Loshin Dimensions are facets or specific measurements of data quality, pertaining to specific data elements The authors propose many variations but the main ones that most agree on are: Accuracy Conformity Completeness Consistency/Duplication Timeliness (sometimes called Currency) Integrity Data Quality Dimensions facilitate the consistent definition of data quality requirements and metrics across various organizations. Page 14

Dimensions of Data Quality - Explanation Accuracy: How much does the data conform to the real world? Completeness: How much required data is missing? Conformity: How much does the data conform to formats and domain values? Duplication: Does the same data exist in multiple systems? If If so, is it it represented the same? Integrity: Does the data conform to integrity rules appropriately? Are relationships between elements retained? Currency: How current is the data? When was it it last entered or refreshed? There are a dozen or more Data Quality Dimensions that can be defined, but organizations should pick the ones that best meet their needs. Page 15

Replace Paper Reports with Business Intelligence Operational Incidents Audit Findings Data Quality Issues Report Regulatory Compliance Issues Weekly Data Management Status Reports Replace mounds of paper with a business intelligence solution gain access to summary and detailed information on key quality indicators on-demand. Page 16

Business Intelligence for Enterprise Data Quality Business intelligence tool (COTS) Data quality Commercial-off-the-shelf (COTS) product Data profiling, standardization, cleansing, normalization etc. Data quality rules repository Data quality rules engine Data quality results repository Data quality data mart (custom) Data quality issue management system Extract Transform and Load (ETL) product Enterprise Service Bus (SOA and Data Quality Services) Data Quality Tool (Profiling/Rule Execution) Data Stores Files SOLUTION COMPONENTS Data Quality Rules Data Quality Results ETL Data Quality Mart Enterprise Dashboard Business Intelligence Tool Page 17

QUALITY BY LINE OF BUSINESS ENTERPRISE DATA QUALITY DASHBOARD (Enterprise View) DATA QUALITY MATURITY CRITICAL DATA BREAKDOWN RELEASE 1 WHOLESALE RETAIL COMMERCIAL WHOLESALE RETAIL COMMERCIAL RELEASE 2 TRENDING OF DATA QUALITY PRODUCT DATA CUSTOMER DATA REGIONAL TREND HEALTH INDICATORS OVERALL HEALTH QUALITY RATING FOR EACH DATA ELEMENT Page 18

OVERALL HEALTH ENTERPRISE DATA QUALITY DASHBOARD (Retail Business View) CRITICAL DATA BREAKDOWN HEALTH INDICATORS RELEASE 1 RELEASE 2 TRENDING OF DATA QUALITY BORROWER DATA LOAN DATA DATA STORE TREND QUALITY RATING FOR EACH LOB DATA ELEMENT DATA QUALITY SERVER UTILIZATION Page 19

Continuously Measure and Improve Quality Step 1 - Define Define the scope, goal, budget, duration and the data quality problem to be addressed. Step 2 - Measure All relevant data quality statistics and measures important to the enterprise are collected at this stage. Step 4 - Control Monitor the quality after remediation to ensure that data is defect free. If there are any further changes to be made, the team makes changes and again measures the quality. Step 3 - Analyze and Improve Analysis of the data collected in the previous phase is conducted and root cause(s) identified. Data remediation is implemented to improve the quality of data. The Enterprise Data Quality dashboard provides transparency into data quality hotspots that must be addressed proactively. Page 20

Summary Effective data management provides order out of chaos Implementing Holistic Data Quality provides transparency into data quality issues across the information supply chain and helps in identifying systemic issues Focus must be on Enterprise Critical data initially. Do not try to boil the ocean. The solution architecture s core components are the data quality COTS product, a data quality Data Mart and a Business Intelligence tool Proactive monitoring and measurement of data quality, coupled with an alerting mechanism, significantly reduces operational incidents Implementing HDQ is a strategic initiative and requires C-level sponsorship and support Page 21

Questions!! Page 22