International Association of Data Quality Managers ISO 8000 Data Quality All ERP applications run better on ISO 8000 quality data! Dr Salomon de Jager Slide 1
Agenda Slide 2
Data is the new reality! Slide 3
Business Constraints Slide 4
International Organization for Standardization (ISO) A two year (minimum) process designed to ensure international consensus Slide 5
ISO TC 184/SC 4 voting members Australia, SAI; Standards Australia International, Ltd Austria, ON; Österreichisches Normungsinstitut Brazil, ABNT; Associação Brasileira de Normas Técnicas Bulgaria, BDS; State Agency for Standardization and Metrology China, SAC; Standardization Administration of China Czech Republic, CNI; Czech Standards Institute France, AFNOR; Association française de normalisation Germany, DIN; Deutsches Institut für Normung Italy, UNI; Italian National Standards Body Japan, JISC; Japanese Industrial Standards Committee Korea, KATS; Korean Agency for Technology and Standards Netherlands, NEN; Nederlands Normalisatieinstituut Norway, SN; Standards Norway Portugal, IPQ; Instituto Português da Qualidade Russia, GOST; Federal Agency on Technical Regulating and Metrology South Africa, SABS; South African Bureau of Standards Spain, AENOR; Asociación Española de Normalización y Certificación Sweden, SIS; Swedish Standards Institute Switzerland, SNV; Swiss Association for Standardization United Kingdom, BSI; British Standards Institution United States, ANSI; American National Standards Institute Slide 6
Standards WHY: Standards make interoperability possible, they allow us all to benefit from economies of scale, they set minimum expectations (safety for example). HOW: Standards are specifications by claiming compliance with a standard you are publicly announcing that you meet the requirements of the standard. ISO Standards contain: Definition of terms If a term is not defined in a standard, the definition in Oxford English dictionary (OED) applies, a term may be defined in another ISO standard, in which case it will be referenced. New definitions are called normative definitions, they are an integral part of the standard Normative text If you claim compliance you must follow these clauses Informative text This is helpful information, you do not have to agree with it or apply it Slide 7
Data Understanding the difference between data and information is one of the keys to solving data quality Slide 8
The philosophical view Where is the wisdom we have lost in knowledge? Where is the knowledge we have lost in information? The Rock (1934) T. S. Eliot Wisdom Knowledge meaningful information? Information meaningful data Data a datum is a disruption in a continuum Slide 9
Defining data data: elements into which information is transformed so that it can be stored or moved. Peter R. Benson Information Data Information Copyright only covers fixed form Slide 10
From data to information Data quality is an essential component of quality information but not the only component Quality data may not be timelyor relevant these are characteristics of information quality Slide 11
The need for portable data If we believe that: The software we will be using tomorrow will be different from what we are using today and we will need access to our data. forever and data that cannot be separated from licensed software is also licensed data. Then: Your data should be portable ; independent of any licensed software application. ISO 8000 quality data is portable data! Slide 12
Operating systems as a solution to hardware lock-in An operating system is the infrastructure software component of a computer system responsible for sharing the limited resources of the computer. The operating system acts as a host for applications that are run on the machine. 1969 -UNIX (we also landed on the moon!) 1974 -CP/M 1981 -MSDOS 1985 -Windows 1.0 1995 -BOB 2000 -ME 2001 -XP 2007 -VISTA 2009 -Windows 7 2009- Android Slide 13
Vision of the future The service provider s view Application Application Application Application Customer data Customer Application Customer data Storing customer data within the application creates customer lock-in Slide 14
Vision of the future The customer view Data Data Data ISO 8000 Data Application Customer Application Data standards are the antidote to application lock-in Slide 15
ISO data standards are the antidote to application lock-in Data quality analysis app ISO 22745 ISO 8000 Portable data Data de-dup app Data validation app Quality data is portable data; it is independent of the software application and accessible by any application Slide 16
Quality What does quality really mean and can we measure it? Slide 17
What does quality mean? When you order seafood from Quality Fresh Seafood, you can be confident that you are receiving the very best quality of seafood and delivery. Slide 18
ISO 9000 definition of quality ISO 9000:2005(E) 3.1.1 quality degree to which a set of inherent characteristics fulfils requirements NOTE 1 The term quality can be used with adjectives such as poor, good or excellent. NOTE 2 Inherent, as opposed to assigned, means existing in something, especially as a permanent characteristic. Slide 19
ISO 9001 quality management system Requirements ISO 9001:2000(E) Slide 20
Requirements define quality It is all about the requirements Quality data is data that meets requirements nothing more nothing less! Peter R. Benson Slide 21
What does a data requirement look like? The buyer, item manager or user as the data requestordefines their requirements for data: What is the item name and what characteristic data do I need? A data requirement statement is created as an eotd-i-xml identification guide, an XML file that conforms to ISO 22745 part 30 in which the item name is the class and the characteristic data is represented by property-value pairs What data do you need to successfully accomplish a task? Data requirements are function specific, they change over time, from company to company and from function to function What is the common name in the eotd?: Bearing,Ball,Annular What do I need to know about this bearing in order to buy or manage it?this is a data requirement statement also known as an identification guide or a cataloguing template. Data requestor eotd-i-xml (data requirements statement) ISO 22745-30 BEARING,BALL,ANNULAR Inner Diameter Outer Diameter Width # Rows Sealing Method Load Rating Speed Rating Properties selected from the eotd Slide 22
A form is a specification of a data requirement Slide 23
The quality of the data requirement is very important Slide 24
Motivation for ISO 8000 Data Quality Manager s view To obtain the data necessary for safe and efficient operation: To be able to create clear and unambiguous requests for: 1. Reference data 2. Characteristic data 3. The validation of reference and characteristic data, To automate the generation and distribution of requests for datain a simple format that can be automated by the recipient to createan integrated data exchange system. Faster Access to Better* Quality Data *recent independent tests show a 30% increase in the quality of data Slide 25
Motivation for ISO 8000 Quality Data Provider s view - Data integration is one of the keys to a long term relationship - The ability to provide customers with quality data is a significant differentiating factor. - Publishing the specifications of their products, capabilities and services on their web sites. - Looking to increase their visibility and understand that the best way to do this is to improve the quality of their data. Companies are looking for a Standard that they can use to differentiate themselves based on the quality of their data. Slide 26
ISO 8000 Family of Standards Slide 27
ISO 8000 Quality Data ISO 22745 is a standard for the exchange of ISO 8000 quality data Slide 28
The need for semantic encoding Plane Slide 29
The need for semantic encoding Safe Slide 30
The need for semantic encoding Robot Slide 31
Semantic encoding eotd (ECCMA Open Technical Dictionary): a registry of terminology from many sources where each concept is assigned a unique and permanent public domain identifier and where concept identifiers may be mapped toeach other and ranked according to their use (ISO 22745). Slide 32
ECCMA Open Technical Dictionary (eotd) Just as with music notation and engineering symbols, the eotd concept identifiers are simply used to communicate more accurately in a language independent environment. Music Engineering eotd A unique public domain identifier is assigned to a concept. 0161-1#01-089388#1 table 0161-1#01-086445#1 chair 0161-1#02-018635#1 weight 0161-1#02-005808#1 length 0161-1#07-277660#1 Monday 0161-1#05-001122#1 kilogram
Semantic encoding using ISO 22745 Property ID Value Measure ID 0161-1#02-046898#1 0161-1#07-014684#1 0161-1#02-027375#1 3225020037 0161-1#02-023822#1 1.0 0161-1#05-000798#1 0161-1#02-010200#1 1.450 0161-1#05-000798#1 0161-1#02-010196#1 1.653 0161-1#05-000798#1 0161-1#02-004968#1 0.591 0161-1#05-000798#1 0161-1#02-027376#1 10 0161-1#02-027378#1 0.80 0161-1#08-000168#1 eotd Identifier Coded Property term Value Measure term eotd CLASS NAME BOLT:MECHANICAL PRODUCT NUMBER 3225020037 NOMINAL THREAD DIAMETER 1.0 INCHES WIDTH ACROSS FLATS 1.450 INCHES WIDTH ACROSS CORNERS 1.653 INCHES HEAD HEIGHT 0.591 INCHES COUNT PER PACK 10 PACK PRICE 0.80 US DOLLAR eotd Identifiers Resolved Machine Bolt; Product Number: 3225020037; Nominal thread diameter: 1.0 inches; Width across flats: 1.450 inches; Width across corners: 1.653 inches; Head height: 0.591 inches; Count per pack: 10; Pack price: $0.80 (M-Bolt;NTD1.0 ;WAF1.45 ;CPP10) Rendered Slide 34
eotd (DR management - Multilangual) Slide 35
Linking Template (class) to classification (UNSPSC) Slide 36
IFMS Release 1 Solution System Specifications Runtime, middleware & platform Portal Web Portal (Liferay Portal 5.0) Enterprise Service Bus Application (Future) Communication / Integration backplane COTS Human Capital Management COTS Advanced Procurement COTS II ISO 8000 IFMS Java bespoke Finance IFMS Java bespoke Supply Chain IFMS Java bespoke BI/DW Application Server COTS Application Server ORACLE Application Server COTS Application Server Sun Microsystems Glassfish 2 Java EE 5 platform Database Oracle 10g Hosting Infrastructure Slide 37
Automating the Data Supply Chain A data provider may not have all the data requested so they in turn send a request through their supply chain using the same ISO 22745 standard exchanges eotd-q-xml (query) ISO 22745-35 Sub-Tier eotd-q-xml Data provider Sub Sub-Tier eotd-r-xml Data requester eotd-i-xml (data requirements statement) ISO 22745-30 eotd-r-xml (reply) ISO 22745-40 Slide 38
Vision What is impossible to do right now, but, if you could do it, would fundamentally change your business? 1990 Joel Arthur Barker Immediate access to authoritative data in a usable form Automated data mapping, traceable data. Peter R. Benson Slide 39
Bringing your master data into compliance with ISO 8000 1. Document your business language 2. Map your business language to the eotd* 3. Document your requirements for data* 4. Create your Master Data Ontology* Terminology Data requirements Class hierarchies 5. Measure the quality of your master data* (by comparing your data to your requirements for data) *there are low cost commercial software tools that make this process very straight forward Become an ECCMA certified ISO 8000 Master Data Quality Manager Slide 40
Thank You Questions Slide 41