ECCMA Presentation The Golden Record and Survivorship. Bud Walker Director of Data Quality Solutions



Similar documents
Six Steps to to Managing Data Data Quality with SQL Server Integration Services

Enterprise Data Quality

Master Your Data and Your Business Using Informatica MDM. Ravi Shankar Sr. Director, MDM Product Marketing

ELECTRICIANS & PLUMBERS OCCUPATIONAL GROWTH PROJECTIONS

Why is Master Data Management getting both Business and IT Attention in Today s Challenging Economic Environment?

Master Your Data. Master Your Business. Empower your business with access to consolidated and reliable business-critical data

Why You Still Need to Master Your Data Before You Master Your Business (Intelligence) Business Imperatives Addressed By Reliable, Integrated View

A Comprehensive Approach to Master Data Management Testing

Business Intelligence & Product Analytics

Orange County Office Market Report

The Informatica Solution for Improper Payments

CHAPTER 1 INTRODUCTION

Enable Business Agility and Speed Empower your business with proven multidomain master data management (MDM)

Phone Check. Reference Guide

ORACLE CUSTOMER HUB. Consolidate & govern a unique, complete and accurate set of Master Customer information from across the enterprise.

Business Intelligence: Effective Decision Making

How To Get A Facebook Subpoena

Measure Your Data and Achieve Information Governance Excellence

Getting started with a data quality program

Oracle Master Data Management MDM Summit San Francisco March 25th 2007

A Road Map to Successful Customer Centricity in Financial Services. White Paper

Data Warehouse and Business Intelligence Testing: Challenges, Best Practices & the Solution

Making SAP Information Steward a Key Part of Your Data Governance Strategy

What Your CEO Should Know About Master Data Management

Enterprise Data Governance

Data Quality Tools. Developer APIs to Validate, Correct and Enrich Contact Data.

White Paper. Thirsting for Insight? Quench It With 5 Data Management for Analytics Best Practices.

IBM Cognos Analysis for Microsoft Excel

Normalization. Reduces the liklihood of anomolies

Customer Centricity Master Data Management and Customer Golden Record. Sava Vladov, VP Business Development, Adastra BG

Building an Advancement Data Warehouse. created every year according to a study. Data Facilitates all Advancement Activities

Cúram Business Intelligence and Analytics Guide

Hadoop Data Hubs and BI. Supporting the migration from siloed reporting and BI to centralized services with Hadoop

Accurate identification and maintenance. unique customer profiles are critical to the success of Oracle CRM implementations.

Healthcare Data Management

DATA GOVERNANCE AND INSTITUTIONAL BUSINESS INTELLIGENCE WORKSHOP

Comprehensive Data Quality with Oracle Data Integrator. An Oracle White Paper Updated December 2007

Real life experiences with Continuous Controls Monitoring (CCM) on Master Data. Pat Culpan Jeet Kadam

Spend Enrichment: Making better decisions starts with accurate data

<Insert Picture Here> Oracle Master Data Management Strategy

10 FASTEST GROWING AND HIGHEST WAGE CAREERS IN ORANGE COUNTY

Enterprise Intelligence - Enabling High Quality in the Data Warehouse/DSS Environment. by Bill Inmon. INTEGRITY IN All Your INformation

James Serra Data Warehouse/BI/MDM Architect JamesSerra.com

API Endpoint Methods NAME DESCRIPTION GET /api/v4/analytics/dashboard/ POST /api/v4/analytics/dashboard/ GET /api/v4/analytics/dashboard/{id}/ PUT

Membership - Data Validation

<Insert Picture Here> Extending Hyperion BI with the Oracle BI Server

Oracle OLAP. Describing Data Validation Plug-in for Analytic Workspace Manager. Product Support

Data Mining, Dashboards and Data Quality John Rome, Arizona State University. Review of BI Buzzwords

Reporting MDM Data Attribute Inconsistencies for the Enterprise Using DataFlux

An Introduction to Master Data Management (MDM)

SAS Data Management Technologies Supporting a Data Governance Process. Dave Smith, SAS UK & I

GEHC IT Solutions. Centricity Practice Solution. Centricity Analytics 3.0

Mastering Big Data. Steve Hoskin, VP and Chief Architect INFORMATICA MDM. October 2015

Core Banking Business Intelligence: Transform Data into Information. Kevin Round, Product Manager Xamine BI Products

Frequently Asked Questions Navigating IBHIS for LA County DMH Contract Providers

Intelligent Mail Tray Label Vendors List

Seeking Data Quality. Using Agile Methods to Test a Data Warehouse

HOW INTERSYSTEMS TECHNOLOGY ENABLES BUSINESS INTELLIGENCE SOLUTIONS

HANDLING MASTER DATA OBJECTS DURING AN ERP DATA MIGRATION

<Insert Picture Here> Master Data Management

Logical Modeling for an Enterprise MDM Initiative

INTRODUCTION TO BUSINESS INTELLIGENCE What to consider implementing a Data Warehouse and Business Intelligence

IBM Software A Journey to Adaptive MDM

Data Quality Assessment. Approach

A Simplified Framework for Data Cleaning and Information Retrieval in Multiple Data Source Problems

HANDLING MASTER DATA OBJECTS DURING AN ERP DATA MIGRATION

BUSINESS INTELLIGENCE. Keywords: business intelligence, architecture, concepts, dashboards, ETL, data mining

How Microsoft IT India s Test Organization Enabled Efficient Business Intelligence

META DATA QUALITY CONTROL ARCHITECTURE IN DATA WAREHOUSING

Discover, Cleanse, and Integrate Enterprise Data with SAP Data Services Software

A WHITE PAPER By Silwood Technology Limited

MDM and Data Warehousing Complement Each Other

Data Integration Alternatives Managing Value and Quality

Making Business Intelligence Easy. Whitepaper Measuring data quality for successful Master Data Management

Insurance Requirements

Top Five Reasons Not to Master Your Data in SAP ERP. White Paper

Business Intelligence for Everyone

ActivePrime's CRM Data Quality Solutions

Transcription:

ECCMA Presentation The Golden Record and Survivorship Bud Walker Director of Data Quality Solutions Joseph Vertido MD-MVP Channel Manager / Data Quality Analyst 1

Data Quality Problems REPORTING ANALYTICS DASHBOARD DATA WAREHOUSE 2

Data Quality Problems Introduction of Inconsistencies Without any form of Data Quality control, allowing data to come in as-is introduces problems. 3

Data Quality Problems Standardization Format and Ordering Misspelling Misplacement Casing Validity Accuracy Stale Data 4

The Data Quality Cycle GOLDEN RECORD AND SURVIVORSHIP 5

Data Quality Problems REPORTING ANALYTICS DASHBOARD DATA WAREHOUSE 6

Data Quality Problems Issues with Consolidation Duplicates Fuzzy Duplicates Domain Specific Fuzzy Duplicates DATA WAREHOUSE 7

Data Quality Problems Issues with Consolidation Golden Record and Survivorship DATA WAREHOUSE 8

The Golden Record / Survivorship What is the Golden Record / Survivorship? The Golden Record or as many refer to as the Survivor Record, is the single accurate representation of the truth Why is it important? The implementation of Survivorship techniques can be long, complicated and tedious. It is important that a business come to an agreement on what Survivorship Criteria to implement. Failure to successfully implement Survivorship Schemas become detrimental to the efficiency and quality of the deduplication process. Ultimately results in loss of information and inaccurate Business Intelligence 9

The Golden Record Billing Joseph Vertido 123 Main St, Los Angeles CA 92688 Customer Service Joseph Vertido (800) 800 6245 Web Account Joseph Vertido 123 Main St, 92688 10

The Golden Record Billing Joseph Vertido 123 Main St, Los Angeles CA 92688 Customer Service Joseph Vertido (800) 800 6245 Web Account Joseph Vertido 123 Main St, 92688 11

The Golden Record Common/Generic Practices Most Reliable Source Web Account Joseph Vertido 123 Main St, 92688 Billing Joseph Vertido 123 Main St, Los Angeles CA 92688 Customer Service Joseph Vertido (800) 800 6245 12

The Golden Record Common/Generic Practices Most Recently Created or Updated Date Stamp Name Address City State Zip Phone 10/25/2012 Joseph Vertido 22382 Empresa Rancho Santa Margarita CA (800) 800-6245 3/7/2008 JOSEPH Vertido 22382 Avenida empresa Rancho Santa Margarita CA 92688 8008006245 Vertido, Joseph 22382 Avenida empressa 92688 **8008006245 13

The Golden Record Common/Generic Practices Most Complete Date Stamp Name Address City State Zip Phone 10/25/2012 Joseph Vertido 22382 Empresa Rancho Santa Margarita CA (800) 800-6245 3/7/2008 JOSEPH Vertido 22382 Avenida empresa Rancho Santa Margarita CA 92688 8008006245 Vertido, Joseph 22382 Avenida empressa 92688 **8008006245 14

The Golden Record Common/Generic Practices Longest String Date Stamp Name Address City State Zip Phone 10/25/2012 Joseph Vertido 22382 Empresa Rancho Santa Margarita CA (800) 800-6245 3/7/2008 JOSEPH Vertido 22382 Avenida empresa Rancho Santa Margarita CA 92688 8008006245 Vertido, Joseph 22382 Avenida empressa 92688 **8008006245 15

The Golden Record Common/Generic Practices Longest String 10/25/2012 Joseph Vertido 22382 Empresa Rancho Santa Margarita CA (800) 800-6245 3/7/2008 JOSEPH Vertido 22382 Avenida empresa Rancho Santa Margarita CA 92688 8008006245 Vertido, Joseph 22382 Avenida empressa 92688 **8008006245 16

The Golden Record Common/Generic Practices Most Frequent Occurrence Date Stamp Name Address City State Zip Phone 10/25/2012 Joseph Vertido 22382 Empresa Rancho Santa Margarita CA (800) 800-6245 3/7/2008 JOSEPH Vertido 22382 Avenida empresa Rancho Santa Margarita CA 92688 8008006245 Vertido, Joseph 22382 Avenida empressa 92688 8008006245 Vertido, Joseph 22382 Avenida empressa 92688 8008006245 17

Survivorship 1) Understand Your Business Rules 2) Determine and Implement Column Level Rules Based Survivorship 18

Survivorship Column Level Survivorship Most Frequent Occurrence ID Name Address City State Zip Phone C-2119 Joseph Vertido 22382 Empresa Rancho Santa Margarita CA (800) 800-6245 JOSEPH Vertido 22382 Avenida empresa Rancho Santa Margarita CA 92688 8008006245 Vertido, Joseph 22382 Avenida empressa 92688 8008006245 Vertido, Joseph 22382 Avenida empressa 92688 8008006245 Determine Surviving Records Not Null or Blank Most Frequent Occurence 19

Survivorship Column Level Survivorship ID Name Address City State Zip Phone C-2119 Joseph Vertido 22382 Empresa Rancho Santa Margarita CA (800) 800-6245 JOSEPH Vertido 22382 Avenida empresa Rancho Santa Margarita CA 92688 8008006245 Vertido, Joseph 22382 Avenida empressa 92688 8008006245 C-2119 Vertido, Joseph 22382 Avenida empressa Rancho Santa Margarita CA 92688 8008006245 Determine Surviving Records 20

Survivorship The Problem ID Name Address City State Zip Phone Date C-2119 Joseph Vertido 22382 Empresa Rancho Santa Margarita CA 9495895200 C-2119 Joe Vertido AAAAAAAAAAAAAAAAA AAAAAAAAAAAAAAA AA AAAA AAAAAAAAA AAAAAAA C-2119 Vertido, J 123 Fake St. 11111 9495895200 6/15/2012 C-2119 Vertido 22382 Avenida empressa 92688 8008006245 1/24/2009 Generic Most Complete Recent Rule Based Record Survivorship Only Goes So Far The Answer: Reference Data 21

Knowledge Centric Survivorship Validate critical information and other sensitive information in our data such as: Social Security, Address, Phone, Email and Name Having the knowledge of which data are good and bad changes the perspective for generating Survivorship Rules. Reference Data is not limited to third party data. One can also create their own knowledgebase (eg. Product Codes). 22

Knowledge Centric Survivorship Most Reliable Frequent Source Date Stamp Name Address City State Zip Phone 10/25/2008 Joseph Vertido 123 Fake St Fake City CA 11111 Most Recent Frequent Date Stamp Name Address City State Zip Phone 3/7/2012 Joseph Vertido 123 Fake St Fake City CA 11111 (800) 800-6245 Valid Address Valid Phone Date Stamp Name Address City State Zip Phone Joseph Vertido 22382 Avenida Empressa Rancho Santa Margarita CA 92688 123123123a 23

Knowledge Centric Survivorship Date Stamp Name Address City State Zip Phone 10/29/2012 Joseph Vertido 22382 Avenida Empressa Rancho Santa Margarita CA 92688 (800) 800-6245 24

Establishing Relationships ID Name Address City State Zip Phone C-2119 Joseph Vertido 22382 AvenidaEmpresa Rancho Santa Margarita CA 92688 8008006245 C-2119 Joseph Vertido 2205 La Palma Ave Anaheim CA 92806 8008006245 25

Summary Survivorship is just as important as any process in the spectrum of data quality. Generic Survivorship < Column Based Survivorship < Knowledge Centric Survivorship Reference Data is your ally! 26

Thank You Bud Walker Director of Data Quality Solutions bud@melissadata.com Joseph Vertido Data Quality Analyst joseph@melissadata.com