DAMA Tracking the Success of Data Quality Peter R. Benson Project leader for ISO 22745 and ISO 8000 Copyright 2014 ECCMA All rights reserved Slide 1
Companies with a common interest ISO 8000 quality master data!
Information is Power Timely Relevant Accurate Data is Truth Useful Attributable Unambiguous
Data is truth - information is power 23mpg 26mpg Quality data but what about the information? 3 miles, I guess bigger is better.. 4.3gp100m Same data different information 3.8gp100m Half a gallon of gas every 100 miles... that s almost $2.00 a free lunch with every tank of gas! I can save $400 per year, that's one car payment a year! Copyright 2012-2014 by ECCMA Slide 5
Data is truth - information is power Slide 6
Defining Data and Information data: fixed form into which information is transformed so that it can be stored or moved. Peter R. Benson ISO definitions (ISO/IEC 2382-1:1993) Information knowledge concerning objects, such as facts, events, things, processes or ideas, including concepts, that within a certain context has a particular meaning Data Information Data Information Copyright only covers fixed form (you can only copyright data not information) Re-interpretable representation of information in a formalized manner suitable for communication, interpretation, or processing Copyright 2012 by Peter R. Benson
What does Quality Mean? When you order seafood from Quality Fresh Seafood, you can be confident that you are receiving the very best quality of seafood and delivery. Copyright 2012-2014 by ECCMA Slide 8
ISO 9000 Definition of Quality 3.1.1 quality degree to which a set of inherent characteristics fulfils requirements ISO 9000:2005(E) Slide 9
ISO 9001 Quality Management System Requirements ISO 9001:2000(E) Slide 10
Requirements Define Quality 3.1.2 requirement need or expectation that is stated, generally implied or obligatory ISO 9000:2005(E) Quality is about meeting requirements Quality data is data that meets stated requirements nothing more and nothing less! (data that exceeds stated requirements does not increase the quality of the data) Slide 11
These are data requirements Slide 12
The quality of the data depends on how you ask for it Slide 13
Taxonomy of Data Data Dictionary Metadata and codes Transaction data Master data Identification data Descriptive data Classifications Performance characteristics Physical characteristics
ISO 8000 Family of Standards ISO 8000 General Principles Master Data Transaction data Part 1 Introduction Part 2 Terminology Part 100 introduction Part 110 Part 120 Provenance Part 130 Accuracy Part 140 Completeness Syntax Semantic encoding Meets requirements Slide 15
The Need for Semantic Encoding In South Africa a traffic light is called a robot! Robot Copyright 2011 by ECCMA Slide 16
ECCMA Open Technical Dictionary (eotd) Just as with music notation and engineering symbols, the eotd concept identifiers are simply used to communicate more accurately in a language independent environment. Music Engineering eotd A unique public domain identifier is assigned to a concept. 0161-1#01-089388#1 table 0161-1#01-086445#1 chair 0161-1#02-018635#1 weight 0161-1#02-005808#1 length 0161-1#07-277660#1 Monday 0161-1#05-001122#1 kilogram Slide 17
Publicly Visible Terminology in a Standard Model The eotd (ECCMA Open Technical Dictionary) is an ISO 22745-20 compliant central registry of terminology. Each concept and terminological component in the eotd is assigned a unique and permanent public domain identifier. Users create corporate preferred subsets of the eotd and use the eotd concept identifiers to manage concept equivalence mapping with the concepts used by their trading partners. ISO 22745 - ECCMA Open Technical Dictionary (eotd) Terminology Terms Abbreviations Definitions Images Public Domain Concept Identifier 0161-1#xx-xxxxxx#1
Publicly Visible Terminology in a Standard Model Terminology Industry terminology Terminology Government terminology Terminology Terminology SDO terminology SDO terminology Public domain concept identifiers Free identifier resolution to underlying ( services terminology (web Hyperlink to source standards Multilingual Multiple terms, definitions and images linked to single concept identifier Slide 19
Applying Concept Equivalence to Enrich a Corporate Dictionary SABIC concept equivalence table 01-1073082 = 01-086142 01-1073082 = 01-068756 Sabic Bolt = Rockwell Bolt Sabic Bolt = ASTM bolt eotd Concept ID Term Ex ref Originating Organization Definition Status 0161-1#01-1073082#1 BOLT IR237 SABIC A fastener that is externally threaded on one end and generally with some style of head on the other end and is normally intended to be tightened or released by torquing a nut and designed to fasten objects together. Active Equivalent concepts 0161-1#01-086142#1 BOLT - Rockwell Automation, Inc A fastener consisting of a threaded pin or rod with a head at one end and designed to be inserted through holes in assembled parts and secured by a mated nut that is tightened by applying torque. Крепежная принадлежность, представляющая собой стержень с нарезанной резьбой и головкой на одной стороне, предназначенный для помещения в отверстия собираемых вместе деталей с последующей их фиксацией с помощью гайки, затягиваемой до определенного крутящего момента. Active 0161-1#01-068756#1 bolt F 1789 - F16 ASTM headed and externally threaded fastener designed to be assembled with a nut Active Copyright 2013 by ECCMA Slide 20
Building the Corporate Dictionary and Data Requirements ISO 22745-30 Corporate Data Requirements Registry edrr ECCMA Data Requirements Registry Spreadsheets Reports ISO/IEC 11179 Corporate Metadata Registry ISO 22745-10 Corporate Dictionary eotd ECCMA Open Technical Dictionary Corporate Classifications Forms New eotd concept or terminology registration Data models Slide 21
Managing a dictionary Slide 22
ISO 8000 Quality Data Data that meets stated requirements Data that is portable (defined syntax and defined semantic encoding) ISO 8000 quality data is portable data that meets stated requirements ECCMA recommends that companies own and control their own corporate dictionary, data requirements and description rules. Data coded using a licensed dictionary or validated using licensed data requirements or created using licensed description rules is subject to the terms of the license. Copyright 2012-2014 by ECCMA Slide 23
Material Number Date Updated 2405328 2013-04-24 2013-04-24:DLIS Characteristic data Property name Value Source date Source organization CLASS VALVE,BALL 2013-04-24 DLIS THREAD CLASS 2A ALL ENDS 2013-04-24 DLIS BODY MATERIAL STEEL COMP 316 2013-04-24 DLIS PIPE SIZE ¼ INCH 2013-04-24 GRAINGER CONNECTION STYLE G46A PLAIN (TUBE) ALL ENDS 2013-04-24 DLIS MAX PRESSURE 2500PSI 2013-04-24 GRAINGER Identification data Property name Value Source date Source organization NATO STOCK NUMBER DLIS:4820-01-0130242 2013-04-24 DLIS SUPPLIER REFERENCE GRAINGER:1WBL7 2013-04-24 GRAINGER MANUFACTURER REFERENCE (ALTERNATE) MANUFACTURER REFERENCE (PREFERRED) STANDARD REFERENCE NUMBER SWAGELOK:SS-AFSS12 2013-04-24 GRAINGER PARKER:4Z-MB4LPFA-SSP 2013-04-24 PARKER ASTM:A 276 Type 316 2013-04-24 ASTM
Material Number Date Updated 2405328 2013-04-24 2013-04-24:DLIS Classification data Property Value Source date Source NSC 2013 4820: Valves, Nonpowered 2013-04-24 DLIS UNSPSC 15.1101 40141607: Ball valves 2013-04-24 GS1 eclass 8.0 (basic) 37010490 : Ball valve (unspecified) 2013-04-24 eclass Harmonized Tariff Schedule of the United States 2013 8481.80.3070:hand operated steel ball valve 2013-04-24 United States International Trade Commission Material Group 25:Control System 2013-04-24 Corporate Purchasing Group 15:Pipes and Valves 2013-04-24 Purchasing Descriptions Property Description Source date Source ITEM NAME VALVE,BALL: SS, ¼, 2500PSI 2013-04-24 RG1 PURCHASE ORDER VALVE,BALL: SIZE=1/4 INCH, MAX PRESSURE=2500PSI, BODY MATERIAL=STEEL COMP 316; MANUFACTURER REFERENCE (PREFERRED)=PARKER:4Z-MB4LPFA-SSP 2013-04-24 RG2
Corporate Identifier Date Updated 2405328 2013-04-24 2013-04-24:Microsoft Characteristic data Property name Value Source date Source organization LEGAL NAME William Henry Gates III 2013-04-24 US Department of State BIRTH DATE October 28, 1955 2013-04-24 US Department of State PLACE OF BIRTH Washington, USA 2013-04-24 US Department of State Identification data Property name Value Source date Source organization Driver license GATESWH450P8 2010-01-01 Washington State Department of Licensing Passport number 13458734 2013-04-24 US Department of State Employer Issued Identifier 1 1976-11-26 Microsoft
ISO 8000-120 Data Warehouse used to Manage Content Multilingual Product Descriptions Copyright 2012-2014 by ECCMA Slide 27
Motivation for ISO 22745 and ISO 8000 From a logistics information perspective an F-15 is just 171,000 parts flying in very close formation. Commissions on Organization of the Executive Branch of the Government [Hoover Commissions 1947-1955] Studied and investigated organization and methods of operation of the Executive branch of the Federal Government, and recommended organization changes to promote economy, efficiency, and improved service Cataloging and Standardization Act, Public Law 82-436 as codified by United States Code, Title 10, Chapter 145 Cataloging and Standardization Sec. 2451. (a) The Secretary of Defense shall develop a single catalog system and related program of standardizing supplies for the Department of Defense. (b) In cataloging, the Secretary shall name, describe, classify, and number each item recurrently used, bought, stocked, or distributed by the Department of Defense, so that only one distinctive combination of letters or numerals, or both, identifies the same item throughout the Department of Defense. Only one identification may be used for each item for all supply functions from purchase to final disposal in the field or other area. Slide 28
Motivation for ISO 22745 and ISO 8000 Controlling costs requires better asset, product, component and process visibility. This is achieved through faster, better and lower cost access to authoritative characteristic data.
Why Standards LAWS (Mandatory) STANDARDS (Contractual requirements that can be independently verified) BEST PRACTICE (Voluntary) Copyright 2012-2014 by ECCMA
ISO 8000 referenced in a NATO contract ASD Specification 2000M (S2000M) is a standard that specifies the information exchange requirements for most materiel management functions commonly performed in supporting international projects. S2000M is based on a business model agreed between military customers and industry suppliers. As of Mar 2011 ASD 2000M Chapter 1B section 3.1 includes the following statement, : The Contractor shall supply identification and characteristic data in accordance with ISO 8000-110:2009 on any of the selected items covered in his contract. Following an initial codification request as specified in section 3.2, the NATO Codification Bureau (NCB) shall present a list of the required properties in accordance with the US Federal Item Identification Guides. Quality data saves money Copyright 2014 by ECCMA Slide 31
Automating the data supply chain using ISO 22745 A data provider may not have all the data requested so they in turn send a request through their data supply chain using the same ISO 22745 standard exchanges Request for data eotd-q-xml ISO 22745-35 Request for data eotd-q-xml ISO 22745-35 Data provider Sub Data exchange eotd-r-xml ISO 22745-40 Data requester Data requirement eotd-i-xml ISO 22745-30 Data exchange eotd-r-xml ISO 22745-40
Motivation for ISO 22745 and ISO 8000 Controlling costs requires better asset, product, component and process visibility. This is achieved through faster, better and lower cost access to authoritative characteristic data. quality data save money Slide 33
ISO 8000 Referenced in a Commercial Contract The supplied data shall be ISO 8000-110:2009 compliant. The data shall comply with registered ISO 22745-30 compliant data requirements The data shall be encoded using concept identifiers from an ISO 22745 compliant open technical dictionary that supports free resolution to concept definitions. The data shall be provided in ISO 22745-40 compliant Extensible Markup Language (xml). Creating ISO 8000-110:2009 compliant data does not require the payment of any license fees or the use of specialized software, it is within the technical ability of all suppliers regardless of their size. quality data save money Copyright 2014 by ECCMA Slide 34
Motivation for ISO 22745 and ISO 8000 Quality ERP Descriptions 52368965412 Tire Bridgestone 435/95 R25 56329845 Tyre BS 435/R25 Standard Purpose E3 2 Star Radial 125435 Bridge Stone 25inch 435/95 Standardised Long Description: Tire: Pneumatic, Vehicular: Service Type for Which Designed: Loader Tire Rim Nominal Diameter: 25' Tire Width: 445mm Aspect Ratio: 0.95 Tire Ply Arrangement: Radial Ply Rating: 2* Tire & Rim Association Number: E3 Tread Material: Standard Tire Air Retention Method: Tubeless Tire Load Index and Speed Symbol: NA Tread Pattern: VHB TKPH Rating: 80 965123465 Tyre Bridgestone Part Number 12345 Standardised Short Description: * 2 0.95 445mm 25 Loader: PneumaticTire
Automating the data supply chain using ISO 22745 A data provider may not have all the data requested so they in turn send a request through their data supply chain using the same ISO 22745 standard exchanges Request for data eotd-q-xml ISO 22745-35 Request for data eotd-q-xml ISO 22745-35 Data provider Sub Data exchange eotd-r-xml ISO 22745-40 Data requester Data requirement eotd-i-xml ISO 22745-30 Data exchange eotd-r-xml ISO 22745-40
Tracking the Success of Data Quality are we there yet? SUCCESS If we define success as winning or as the monetary gain that often accompanies success, we will inevitably find ourselves disappointed and disheartened. Success only matters in terms of your progress. How hard you work for what you want and how focused you remain is the only definition of success that matters. All else is just extra weight. Paul Hudson, 2013 "data quality" about 3,080,000 results (Google) chief data officer about 1,390,000 results (Google) quality data" about 1,290,000 results (Google) ISO 8000 data quality about 16,800 results (Google) The data quality challenge is clearly visible and widely acknowledged, this is a major success. We are fundamentally changing how we manage data and we are seeing a rapid development of technologies that are capable of delivering consistent and measurable quality data. Peter Benson, 2014 Copyright 2014 by ECCMA Slide 37
Tracking the Success of Data Quality are we there yet? What's coming next? Automated Quality Identifier Resolution (you already use it every time you send an email or browse a website). An identifier is an alias for authoritative data; it is issued by an organization and represents a collection of data held by the organization. A quality identifier is an identifier that can be resolved to all or part of the data it represents using standard data exchanges. The standardized resolution of quality identifiers allows the automation of the acquisition of authoritative data as well as data validation; it is simple, highly reliable and very low cost, Copyright 2014 by ECCMA Slide 38
Identifiers Dictionary of classes, properties and codes (has concept and terminology identifiers) Provides the meaning of Data Used to create Master data (has internal master data record identifiers) Identifiers used in Updates Transaction data (has transaction identifiers) Contains Characteristic data Used to validate External third party identifiers Used to create Classifications Names and descriptions
Questions? Peter R. Benson Executive Director Peter.Benson@eccma.org Information is Power Data is Truth