HIGH PRECISION MATCHING AT THE HEART OF MASTER DATA MANAGEMENT



Similar documents
ACHIEVING YOUR SINGLE CUSTOMER VIEW

SINGLE DONOR VIEW FOR CRM SYSTEMS ESTABLISHING A COMPREHENSIVE VIEW OF DONOR INTERACTIONS ACROSS YOUR ORGANISATION

The Advantages of a Golden Record in Customer Master Data Management

What to Look for When Selecting a Master Data Management Solution

Harness the value of information throughout the enterprise. IBM InfoSphere Master Data Management Server. Overview

The Advantages of a Golden Record in Customer Master Data Management. January 2015

Enable Business Agility and Speed Empower your business with proven multidomain master data management (MDM)

Delivering information-driven excellence

Master Your Data and Your Business Using Informatica MDM. Ravi Shankar Sr. Director, MDM Product Marketing

IBM Software A Journey to Adaptive MDM

Copyright , Pricedex Software Inc. All Rights Reserved

Master Data Management

Informatica Master Data Management

Riversand Technologies, Inc. Powering Accurate Product Information PIM VS MDM VS PLM. A Riversand Technologies Whitepaper

Sage CRM. Communicate, Collaborate, Compete with Sage CRM

How master data management serves the business

Customer Centricity Master Data Management and Customer Golden Record. Sava Vladov, VP Business Development, Adastra BG

Why is Master Data Management getting both Business and IT Attention in Today s Challenging Economic Environment?

Automated INVOICE Processing A User s Guide

So Long, Silos: Why Multi-Domain MDM Is Better For Your Business

10426: Large Scale Project Accounting Data Migration in E-Business Suite

Four Clues Your Organization Suffers from Inefficient Integration, ERP Integration Part 1

CRM. Best Practice Webinar. Next generation CRM for enhanced customer journeys: from leads to loyalty

6 Steps to Creating a Successful Marketing Database

COCOS HOSTING (Hosting and outsourcing on the COCOS infrastructure)

Understanding Data De-duplication. Data Quality Automation - Providing a cornerstone to your Master Data Management (MDM) strategy

Lean Processes in Customer Master Data Management

How to Become a Data Driven Business

Product Information Management and Enterprise Applications

A WHITE PAPER By Silwood Technology Limited

Life insurance policy administration: Operate efficiently and capitalize on emerging opportunities.

Financial Close Optimization: Five Steps for Identifying and Resolving Systems and Process Inefficiencies

EMC PERSPECTIVE Enterprise Data Management

Software Industry KPIs that Matter

The 2-Tier Business Intelligence Imperative

Sage CRM for Travel solution by Providian

Accenture Parts Optimization: Warranty Services Gaining Control Over Warranty Costs

IBM Information Management

Harness the power of data to drive marketing ROI

Accounts Payable Automation: Top 9 Reasons to Automate: The Essential Guide to Why Your Business Needs to Automate its Invoice Processing.

MULTI-CHANNEL BUSINESS SOFTWARE. Contact Management. e-commerce solutions. Full ERP/ CRM CALL CENTRE MANAGER

Spend Enrichment: Making better decisions starts with accurate data

Material Master Data Management

Next-generation e-commerce for retail: How to optimize cross-channel marketing, sales and service.

Moreketing. With great ease you can end up wasting a lot of time and money with online marketing. Causing

Management Accountants and IT Professionals providing Better Information = BI = Business Intelligence. Peter Simons peter.simons@cimaglobal.

MANAGING USER DATA IN A DIGITAL WORLD

Turning data integration into a financial driver for mid-sized companies

Data Products and Services. The one-stop-shop for all your business-to-consumer data requirements

Data Integrity and Integration: How it can compliment your WebFOCUS project. Vincent Deeney Solutions Architect

Business Process Management in the Finance Sector

Choosing the Right Master Data Management Solution for Your Organization

Making Business Intelligence Easy. Whitepaper Measuring data quality for successful Master Data Management

NCOE whitepaper Master Data Deployment and Management in a Global ERP Implementation

THE 7 STEPS TO A SUCCESSFUL CRM IMPLEMENTATION DEPLOYING CRM IN THE NEW ERA OF CONNECTED CUSTOMERS

Three proven methods to achieve a higher ROI from data mining

Industry models for insurance. The IBM Insurance Application Architecture: A blueprint for success

FI-IMS Fertilizer Industry Information Management System

Four distribution strategies for extending ERP to boost business performance

Customer Data Management. Breaking down data silos for improved business outcomes

London Stock Exchange

Continuing the MDM journey

Agile Master Data Management A Better Approach than Trial and Error

US ONSHORING OFFERS SUPERIOR EFFECTIVENESS OVER OFFSHORE FOR CRM IMPLEMENTATIONS

A Customer Centric Digital Platform For Utilities. A Joint Capgemini and Pegasystems Solution

Introducing webmethods OneData for Master Data Management (MDM) Software AG

CUSTOMER-CENTRIC ERP: INTEGRATED SYSTEMS FOR CUSTOMER SATISFACTION

How To Use An Ibm Infosphere Mdm For Salesforce.Com

CRM On Demand now hosted locally in Europe. An Oracle White Paper 2011

Master Data Management Framework: Begin With an End in Mind

Case Study: Business benefits from implementing a global Content Management solution. White paper

AA Automated Attendant is a device connected to voice mail systems that answers and may route incoming calls or inquiries.

Exceptional Customer Experience: The New Supply Chain Management Focus

Knowledgent White Paper Series. Developing an MDM Strategy WHITE PAPER. Key Components for Success

Effecting Data Quality Improvement through Data Virtualization

Getting started with a data quality program

Master Data Governance Find Out How SAP Business Suite powered by SAP HANA Delivers Business Value in Real Time

Data quality and the customer experience. An Experian Data Quality white paper

Integrating MDM and Business Intelligence

D&B Optimizer Powered by Acxiom

Text Analytics. A business guide

Greater visibility and better business decisions with Business Intelligence

FUJITSU Software Interstage Business Operations Platform: A Foundation for Smart Process Applications

Trends In Data Quality And Business Process Alignment

CRM for Real Estate Part 2: Realizing the Vision

An Introduction to Continuous Controls Monitoring

Appendix B Data Quality Dimensions

DATA GOVERNANCE AND DATA QUALITY

Transcription:

HIGH PRECISION MATCHING AT THE HEART OF MASTER DATA MANAGEMENT Author: Holger Wandt

Management Summary This whitepaper explains why the need for High Precision Matching should be at the heart of every Single Customer View and Master Data Management solution. In the following paragraphs we explain the components and requirements of High Precision Matching.

What is High Precision Matching? Regardless of the purpose of the Single Customer View, at the heart of every MDM solution is the matching engine. The core of a comprehensive single customer view solution is wholly reliant on the power of the matching technology. If, for instance, the matching engine is not capable of recognising that BMW and Bayerische Motorenwerke is actually the same customer you will still accrue unnecessary costs or miss value creating opportunities in the company. Because the quality of the matching engine is so important we distinguish standard matching from High Precision Matching. The Neopost Customer Information Management matching technology has this capability built into its core solution, which sets it apart from other Single View solutions. The High Cost of Low Quality Data Spreads Across Multiple Systems Unfortunately, these are perceived as a fact of a complex IT environment all too often, by both employees and customers. Data audits reveal that invalid or dirty data in customer databases averages around 25 to 30 percent. The direct cost resulting from these invalid data values is usually tangible. These costs are normally related to the actions that involve doing the job right in the first place. However, most significantly, if a customer is frustrated enough that he or she decides to go to a competitor, then not only is the company image affected, but also the bottom line through loss of earning potential. There are many examples of poor, fragmented or defective customer data costing time and money, resulting in severe frustration for customers. Most data integrity and integration issues are hidden in plain sight, because they ve become part of dayto-day work and processes.

Data Mismanagement throughout your entire company Data resources, such as your customer data, are completely reusable. In fact, these resources are one of the only commodities within a business where high redundancy is accepted as a legitimate cost of doing business. Customer data quality and integration problems impact virtually every area of the data value chain of your business. From primary activities like inbound and outbound logistics, marketing, sales and operations, to supporting activities like procurement and human resources. Time and money is wasted on: 1. Adding the same customer information manually in multiple databases 2. Correcting inaccurate data 3. Identifying and building workarounds for customer data problems 4. Identifying and searching for missing data 5. Manually enriching customer data in one or more systems 6. Assembling customer data across one or more systems 7. Resolving customer data related complaints The question then arises as to why organisations have come to accept this wasteful and costly approach when it can now be easily resolved? Bottlenecks causing low quality and incomplete views Of course people make mistakes when entering customer data, but this allows the data to be inaccurate and incomplete. In addition, many companies rely on an ITinfrastructure based on multiple databases and applications. This silo structure is very common, simply because companies need to facilitate the specific business processes within the primary and supporting activities in the total value chain of the organisation. Companies typically depend on several ERP, CRM, Customer Service, Invoicing and other front and back office systems. Unfortunately this collection of separate databases and applications is often not or only partially integrated, resulting in customer data being scattered across the enterprise environment. Data quality software can solve many issues within single silos, but it does not provide accurate and complete views of every individual throughout the multiple silos. Generally this is caused through the business growing and the requirement of multiple data sets for specific business needs. Whilst the philosophy of CRM was to aim at a single system containing customer information and processes, it has turned out that this approach has failed for many organisations due to independent data silos. In addition to this is the added complication of merger and acquisitions where companies are then required to merge multiple disparate datasets and applications.

What is at the heart of any Master Data Management Solution? MDM provides companies with the sustainable answer to high costs caused by their inaccurate and disjointed database environments. According to Gartner s definition, MDM is much more than just technology: A combination of technology, processes and services to deliver an accurate, timely and complete view of the customer across multiple channels, lines of business departments and divisions drawing customer data from multiple sources and systems The matching engine is at the core of any MDM solution and takes care of the mapping and grouping of customers within multiple databases. When two or more records are spread across this pool of data belonging to the same customer, the matching engine should be capable of recognising this and assigning those records to the same group. Without matching technology it is not possible to deliver an automated single customer view to your company. With this matching technology it is possible to create a central repository for trusted customer data containing the Golden Record and references to the original data source. This means that any organisation that employs an MDM solution should then be able to achieve a single customer view and to simultaneously improve the agility of existing applications. Combining deterministic and probabilistic matching a hybrid approach Real high quality, high precision matching engines that deliver the required results must make use of deterministic and probabilistic matching. The reason for this is actually quite simple: the better the matching engine is able to determine what is what in a particular context, the better the probability calculation of a certain match or a certain nonmatch. This is, in essence, the same as humans do. We determine what we know and consequently use contextual probability and pattern recognition to assign significations to the words we come across: John Edward Smith is the combination of a very frequent first name, a common middle name and a highly frequent surname. This knowledge is a clear advantage when interpreting the string John Edward Agandonga, where the surname has a very low absolute frequency. Using contextual probability however, we are able to assign the signification surname to this particular word.

The truth about high precision matching methods There are many theories on matching, but in general there are two methods that prevail in master data management systems: the deterministic and the probabilistic approach. Both approaches to matching have advantages and disadvantages, but they also have one thing in common. The higher the level of domain-specific and statistical knowledge, the better the assessment of the degree of similarity between database records. Deterministic matching uses, among others, country- and subject-specific knowledge, linguistic rules, such as phonetic conversion and comparison, business rules and algorithms, such as letter transposition or contextual acronym resolving to determine the degree of similarity between database records. Example: The match between EVO AG and Energieversorgungsgesellschaft Offenbach is in part determined through contextual acronym resolving and subject-specific knowledge on legal forms and compound words in the German language: AG in German is the standard abbreviation for the legal form Aktiengesellschaft, of which the word gesellschaft is frequently used in compound words a very common process in the German language. NB: This example is based on extensive domain-specific knowledge, which enhances the quality of the particular deterministic matching method. Probabilistic matching uses statistical and mathematical algorithms, fuzzy logic and contextual frequency rules to assign the degree of similarity between database records. In this, patterns with regard to fault-tolerance play an important role (the matching method is able to take into account that humans make specific errors). Probabilistic matching methods usually assign the probability of a match in a percentage. Example: The word London can have different significations. It can, for example, be a surname or it can have a geographical meaning, being the capital of England. However, the probability of London being a surname is much lower than it being a geographical indication. Compare Jack London Ltd., Thompson London Ltd. and The London Consulting Group Ltd. Combining deterministic and probabilistic matching a hybrid approach Within the MDM market space, there are a lot of opinions on the difference in accuracy and performance of the aforementioned matching methods. This is, however, an insignificant discussion. Real high quality, high precision matching engines that deliver the required results must make use of deterministic and probabilistic matching. The reason for this is actually quite simple: the better the matching engine

is able to determine what is what in a particular context, the better the probability calculation of a certain match or a certain non-match. This is, in essence, the same as humans do. We determine what we know and consequently use contextual probability and pattern recognition to assign significations to the words we come across: John Edward Smith is the combination of a very frequent first name, a common middle name and a highly frequent surname. This knowledge is a clear advantage when interpreting the string John Edward Agandonga, where the surname has a very low absolute frequency. Using contextual probability however, we are able to assign the signification surname to this particular word. When comparing the numerical strings 050512 and 12052005, it is beneficial to know that there are different types of date notation, since this definitely increases the similarity probability of the two strings. A straightforward mathematical comparison would generate a lower probability rate. Interpretation based on natural language processing In order to correctly interpret business data, the interpretation engine must analyse the data in a human fashion. This is called natural language processing. The analysis consists of: Tokenising: the data is tokenised into separate words Characterising: each word is assigned attributes, such as pronounceability and prosodic structure Classification: each word is assigned a categorical classification (i.e. syntactic class = proper name) Grouping: all possible groupings of categories are generated (i.e. a + b yields c, where a, b and c are categories. Path selection: the optimal path through the (many) groupings is selected and the signification is assigned. Automated interpretation of business data is not a simple task. It must, for instance, deal with the fact that many words have more than one meaning and that data in different data sources is often non- standardised, incomplete, incorrect or otherwise deficient. In MDM, matching over a variety of data sources is common practice. Combining deterministic and probabilistic matching will yield more precise matching, with less false positives (mismatches) and less false negatives (missed matches). Probabilistic matching often uses weighting schemes that consider the frequency of information to calculate a score and/or ranking. The more common a particular data element is, the lighter the weight that should be used in a comparison. That is a sound and robust approach. However, assigning weighting factors on data that have been interpreted and enhanced with statistical information will increase the matching results to a level of precision.

It is not a question of choosing either a deterministic or a probabilistic matching method. It is a question of choosing a hybrid method that will satisfy your MDM requirements in the best possible way. Of course, such a method must include natural language processing capabilities and domain specific knowledge repositories. Convert high cost into high value with High Precision Matching at the heart of your MDM solution High Precision Matching Technology at the heart of any architectural MDM solution, as delivered by Human Inference, makes it possible to convert high costs into high value. activities can be used to add value to your company. After delivering the single customer view via a central repository of customer data it is possible to create additional value adding activities like: Reliable reports and analyses can be made to support reliable decision making The right number of mailings or catalogues can be sent to the right addresses Portals like, for instance, a call center portal with a complete view on your customers can be created for improved customer experience, cross and up selling and improved risk management Requests for changes like address information or account numbers will be processed right the first time Credit risks are available to employees preventing high risk accounts receivable. First, all costs related to the processes and actions that involve not doing jobs right the first time as the result of the lack of a (high quality) single customer view can be reduced to an absolute minimum; i.e. assembling customer data across separate databases manually and resolving customer data related complaints. Second and even more important, the possible threat of lost customer lifetime value as the result of dissatisfied and frustrated customers switching to competitors can be abolished and result in possible savings of millions of euros over time. Next to these cost saving aspects we have to consider the value of opportunities created. Time and money spent on unnecessary processes and

It is also possible to adapt your existing source systems to interact real-time with the central repository to deliver up-todate and complete customer information to the channel, lines of business, departments and divisions supported by these original source systems. The creation and monitoring of high customer experience, with the help of a single customer view, resulting in satisfied and loyal customers is the foundation for even more value creating activities: It helps in the process of market segmentation and targeting. The right knowledge and a complete view of your customers serve as an important source for product and services development. The combination of targeting the right market and right customers with the right products and services is the basis for a sustainable competitive advantage - highly trusted customer analytics. Whatever architectural MDM style you choose, the High Precision Matching engine is at the heart of the solution. Faster time to value with a step-by-step approach: Our pragmatic approach to creating your single customer view can begin with the unification of departmental data stores such as within Marketing and sales. This relies on the same High precision matching engine also used in organisation-wide implementations! So, if you opt to start with the consolidation style for simple customer data related reports providing early value to your company, you re still able to transform that MDM solution to a higher level architecture like the coexistence style or transactional style or even to a multi-domain Master Data Management solution, using the same high precision matching engine.

Conclusion The quality of your MDM solution is only going to be as powerful as the quality of your matching engine. If your matching engine has a quality level of only 80% then your MDM Solution can only deliver a maximum quality level of 80%. So an investment in an automated high precision matching engine at the heart of your master data management solution can be seen as a very important and more critically, a sustainable resource for a high quality single customer view. Ultimately this high quality customer view will help you to reduce costs and to generate value. How may we help you? The market has shown that projects of this scope, handled purely internally, typically fail to meet organisational needs. We find that this is usually down to a combination of challenges such as time constraints, lack of project expertise, as well as lack of sponsorship for the undertaking. Neopost Customer Information Management are uniquely positioned, through 25 years of experience, to enable Public Sector organisations to design and implement effective Single Customer View - Master Data Management projects.

About Human Inference, a Neopost company Human Inference, acquired by Neopost in 2012, has been helping governments and businesses to improve their customer relationships for over 25 years, by resolving all their customer data and information quality concerns. This means that: Center Parcs can send you a personalised offer, resulting in a 20% higher return on their marketing efforts ING was able to merge seamlessly with Postbank Nutricia realised the basis for even more successful marketing campaigns in just three months And Aegon, ING Lease, SNS Property Finance and many others prevent millions of euros in fraud every quarter More information? If you would like to find out more about the value of a single customer view for your organisation, or you are looking for tips and advice, please feel free to contact Neopost Customer Information Management: T: +44 (0)20 7831 3900 E: UKCIM.sales@neopost.com W: cim.neopost.co.uk Copyright Human Inference, 2015 All rights reserved. This whitepaper has been written with the greatest care. However, Human Inference cannot be held responsible for any consequences in case (a part of the) content is incomplete or incorrect.