The Value of Advanced Data Integration in a Big Data Services Company. Presenter: Flavio Villanustre, VP Technology September 2014

Size: px
Start display at page:

Download "The Value of Advanced Data Integration in a Big Data Services Company. Presenter: Flavio Villanustre, VP Technology September 2014"

Transcription

1 The Value of Advanced Data Integration in a Big Data Services Company Presenter: Flavio Villanustre, VP Technology September 2014

2 About LexisNexis We are among the largest providers of risk solutions in the market today LexisNexis Reed Elsevier is a world leading provider of information solutions. LexisNexis Risk Solutions has seen sustained revenue and profit growth LexisNexis Risk Solutions is a leading provider in the U.S. across Business Services, Insurance and Government segments. LexisNexis Risk Solutions Revenue: $M LexisNexis Risk Solutions Revenue by Segment Government 8% Business Services: 39% Insurance: 53% The Value of Advanced Data Integration in a Big Data Services Company 2

3 About LexisNexis We are among the largest providers of risk solutions in the market today LexisNexis LexisNexis Risk Solutions is a leading provider in the U.S. across Business Services, Insurance and Government segments. LexisNexis Risk Solutions Revenue by Segment Our customers include: 99 of the top 100 US banks 90% of the Fortune % of US P&C insurance carriers All 50 US states, 70% of local governments and 80% of US federal agencies 97 of Am Law 100 firms Government 8% Business Services: 39% Insurance: 53% The Value of Advanced Data Integration in a Big Data Services Company 3

4 About LexisNexis We have a unique set of capabilities: Big data, linking, analytics, and product development Big Data Technology Vast Data Resources Linking & Analytics Industry-Specific Expertise & Delivery = Customer-Focused Solutions Speed Capacity Cost savings Process Sources Coverage Advanced linking & analytics Accuracy & efficiency Protect private information Aligned with our customers industries Deep industry expertise Predict, manage and assess risk across many industries. The Value of Advanced Data Integration in a Big Data Services Company 4

5 Entity Resolution in practice Examples of some questions we answer for our customers: 1. Are you who you say you are? 2. Who else might you be, or claim to have been in the past? 3. What other people and/or assets are associated with you? 4. What kind of a risk do you represent in a given context? 1. in calculating your insurance premium, or 2. in processing your claim, or 3. in granting you access to credit, or in doing business with you as a vendor or customer 5. Can I quantify the risk that you represent in the form of a score? 6. Which of these millions of transactions should I look at in case there s something suspicious? 7. What small subset of these 1000s (or millions) of events have something in common - which will cause me to look more closely without wasting time on false positives? The Value of Advanced Data Integration in a Big Data Services Company 5

6 It all starts with data LexisNexis collects many different types of data, from sources such as: Bureaus, utility connections, and student lists. Directory assistance and cell phones. Motor vehicle, watercraft, and aircraft registrations. Property deeds, assessments, and foreclosures. Driver s licenses, professional licenses, and voter registrations. Corporation and UCC filings, and other business registrations. Bankruptcy filings, liens, judgments, and accident reports. Departments of Corrections, criminal courts, and watch lists. The Value of Advanced Data Integration in a Big Data Services Company 6

7 And significant volumes of it Access to over approximately 25 billion public record filings Break-down of record counts for the more popular data sets: Data Source # of records Data Source # of records Associates/Relatives 1.8 Billion People at Work 1.5 Billion Bankruptcy 23 Million Private Phones 172 Million Business BDID's 283 Million Professional Licenses 94 Million Business People Links 959 Million Property 2.5 Billion Canadian Phones 62 Million Sex Offenders 550,000 Consumer Header 10.8 Billion SSN's 7.2 Billion Criminal 216 Million Student Records 38 Million Date of Birth 5.2 Billion TIN 2.9 Million Death 98 Million Unique ADLs - active 257 Million Drivers Licenses 397 Million Utility 645 Million EDA Phones 124 Million Vehicle Titles 635 Million FEINs 10.4 Million Vehicle Registrations 2.5 Billion Historical Phones 800 Million White Pages 116 Million Hunting and Fishing Licenses 67 Million Wireless Phones 101 Million Liens and Judgments 244 Million Yellow Pages 14 Million The Value of Advanced Data Integration in a Big Data Services Company 7

8 How do we harness all this content? We build a Header File A credit header contains the following fields derived from financial transactions: Name Name variations Address Previous addresses Telephone number DOB Social Security number The LN Master Person Header contains the same fields, but derived from many more sources: Credit Header Property records Bankruptcy filings Vehicle registrations Voter registrations Driver s licenses Utility information Phone records The Value of Advanced Data Integration in a Big Data Services Company 8

9 We can therefore build unique identifiers such as LexisNexis LexID : Non-SSN, non-fein-dependent identifier that brings together information from disparate databases offering an up-to-date unique profile for an individual or a business. Securely and intelligently analyzes billions of partial and complete records. Filters and links that information based on relevance to connect seemingly unrelated data. Faster access and better results/match-rates due to pre-linking of data. Addresses Government and societal concerns regarding collection and usage of SSN. The Value of Advanced Data Integration in a Big Data Services Company 9

10 The data linking/integration flow The Value of Advanced Data Integration in a Big Data Services Company 10

11 The LexisNexis Open Source HPCC Systems platform WHT/ The Value of Advanced Data Integration in a Big Data Services Company 11

12 Detailed HPCC Systems Platform Architecture WHT/ The Value of Advanced Data Integration in a Big Data Services Company 12

13 Enterprise Control Language (ECL) Declarative programming language: Describe what needs to be done and not how to do it Powerful: High level data primitives as JOIN, TRANSFORM, PROJECT, SORT, DISTRIBUTE, MAP, etc. are available. Extensible: As new attributes are defined, they become primitives that other programmers can use Implicitly parallel: Parallelism is built into the underlying platform. The programmer needs not be concerned with it Maintainable: A high level programming language, without side effects and with efficient encapsulation, programs are more succinct, reliable and easier to troubleshoot Complete: ECL provides for a complete data programming paradigm Homogeneous: One language to express data algorithms across the entire HPCC platform, data integration, analytics and high speed delivery WHT/ The Value of Advanced Data Integration in a Big Data Services Company 13

14 Enterprise Control Language (ECL) The ECL optimizer generates an optimal execution plan Sub-graphs run activities in parallel, automatically persist the results and spill to disk as needed Lazy evaluation avoids re-computing results when data and code haven t changed Graphs are dynamic and display the number of records traversing the edges and data skew percentages WHT/ The Value of Advanced Data Integration in a Big Data Services Company 14

15 Scalable Automated Linking Technology (SALT) The acronym stands for Scalable Automated Linking Technology Declarative linking paradigm Compiles to ECL, which compiles to C++ Provides for automated data profiling, QA/QC, parsing, cleansing, normalization and standardization Sophisticated specificity based linking and clustering Links entities to attributes Data Sources and entities together Iterative process leveraging inference gained in prior steps Profiling Parsing Cleansing Normalization Standardization Data Preparation Processes (ETL) 42 Lines of SALT 3,980 Lines of ECL 482,410 Lines of C++ Matching Weights and Threshold Computation Blocking/ Searching Weight Assignment and Record Comparison Record Match Decision Linked Data File Additional Data Ingest Linking Iterations Record Linkage Processes The Value of Advanced Data Integration in a Big Data Services Company 15

16 Rules based vs. probabilistic record linkage Through statistical methodologies, SALT has the two significant advantages over rules: 1. Recall INPUT SALT Match, because the system has learnt that Villanustre is specific because the frequency of occurrence is small and there is only one present in Atlanta Flavio Villanustre, Atlanta Javio Villanustre, Atlanta Record 1 Record 2 Error RULES NO MATCH, because the rules determine that Flavio and Javio are not the same 2. Precision INPUT SALT NO Match, because the system has learnt that John Smith is not specific because the frequency of occurrence is large and there are many present in Atlanta John Smith, Atlanta John Smith, Atlanta Error RULES MATCH, because the rules determine that John Smith and the city for both the records match The Value of Advanced Data Integration in a Big Data Services Company 16

17 Entity linkage and associations SALT also links entities to form networks of relationships To clustering From disparate data To showing relationships The Value of Advanced Data Integration in a Big Data Services Company 17

18 LexID Linking A process that links together all records common to a single individual. Assigns a unique identifier (AKA LexID) to those records. Not rules based. Calculates a weight (the specificity) for each value of each field, relative to all of the other values for that field found within the data. Includes fuzzy matching and other capabilities to help find the best matches. Regional diversity or cultural variations can affect linking results. LexID John Doe Header data sources requires high standard of quality and reliability. The Value of Advanced Data Integration in a Big Data Services Company 18

19 Putting the big picture together Business Logic Step 1 Step 2 Request 1 or more Lex ID Pull Data PII [{ID, Score}, ] Linking Subsystem Source Data The Value of Advanced Data Integration in a Big Data Services Company 19

20 Why is Linking a Big Data problem? Candidate File 73 separate blocking criteria. 55 hours of joins across 400 machines. Every possible match is scored for specificity. 11 Billion match candidates identified. Match By Date of Birth and locale 10 Billion Records Match Candidates 16 Million Matches 400 Machines 2 hours Comparisons The Value of Advanced Data Integration in a Big Data Services Company 20

21 Linking is resilient to new data sources As records and data-sources are merged into the consumer database, the total number of identities continues to track with the population Millions 12,000 10,000 8,000 6,000 4,000 Total Records Total Identities Core Identities 2,000 0 Months During this 2-year period: Almost doubled the number of records The number of Lex IDs remained stable Core Lex ID count on track (3% net increase) The Value of Advanced Data Integration in a Big Data Services Company 21

22 All other data services naturally derive from the core linked data Source Data Matched Data Sets Scores & Attributes Primary attributes Tax Liens, Felonies, Bankruptcies Summarize a particular characteristic of a consumer Property Deeds Examples include: Professional Licenses Landline & Cell Phone Court Judgments Compiled records for more than 250M identities Number of addresses Tax assessed residence value Composite attributes Summarize aggregate consumer behavior (wealth, income, mobility) Voter Registration Created by combining primary attributes Bureau Header Data Scores Other (education, etc) Industry specific scores Custom scores The Value of Advanced Data Integration in a Big Data Services Company 22

23 Useful Links LexisNexis Open Source HPCC Systems Platform: Free Online Training: SALT: Machine Learning portal: The HPCC Systems blog: Community Forums: Our GitHub portal: JIRA: The Value of Advanced Data Integration in a Big Data Services Company 23

24 Questions? Thank you! The Value of Advanced Data Integration in a Big Data Services Company 24

Reinventing Business Intelligence through Big Data

Reinventing Business Intelligence through Big Data Reinventing Business Intelligence through Big Data Dr. Flavio Villanustre VP, Technology and lead of the Open Source HPCC Systems initiative LexisNexis Risk Solutions Reed Elsevier LEXISNEXIS From RISK

More information

A Unique Perspective into the World of Identity Fraud

A Unique Perspective into the World of Identity Fraud White Paper A Unique Perspective into the World of Identity Fraud Explore the value of LexisNexis FraudPoint solutions in fraud detection. Risk Solutions Identity Management Executive Summary There are

More information

Customer Education. LexisNexis Accurint Getting Started & Best Practices Guide. LexisNexis Risk Solutions

Customer Education. LexisNexis Accurint Getting Started & Best Practices Guide. LexisNexis Risk Solutions LexisNexis Risk Solutions Customer Education LexisNexis Accurint Getting Started & Best Practices Guide In order to meet all of your training needs, LexisNexis Customer Education offers additional training,

More information

clear.thomsonreuters.com DON T JUST FIND PEOPLE. FIND ANSWERS. CLEAR

clear.thomsonreuters.com DON T JUST FIND PEOPLE. FIND ANSWERS. CLEAR clear.thomsonreuters.com DON T JUST FIND PEOPLE. CLEAR FIND ANSWERS. CLEAR HELPS YOU FIND ANSWERS FASTER. CLEAR is a powerful research platform with a vast collection of public and proprietary records.

More information

90% of your Big Data problem isn t Big Data.

90% of your Big Data problem isn t Big Data. White Paper 90% of your Big Data problem isn t Big Data. It s the ability to handle Big Data for better insight. By Arjuna Chala Risk Solutions HPCC Systems Introduction LexisNexis is a leader in providing

More information

clear.thomsonreuters.com DON T JUST FIND PEOPLE. FIND ANSWERS. CLEAR

clear.thomsonreuters.com DON T JUST FIND PEOPLE. FIND ANSWERS. CLEAR clear.thomsonreuters.com DON T JUST FIND PEOPLE. CLEAR FIND ANSWERS. CLEAR HELPS YOU FIND ANSWERS FASTER. CLEAR is a powerful research platform with a vast collection of public and proprietary records.

More information

Leveraging Big Data for the Next Generation of Health Care Ken Cunningham, VP Analytics Pam Jodock, Director Business Development

Leveraging Big Data for the Next Generation of Health Care Ken Cunningham, VP Analytics Pam Jodock, Director Business Development Leveraging Big Data for the Next Generation of Health Care Ken Cunningham, VP Analytics Pam Jodock, Director Business Development December 6, 2012 Health care spending to Reach 20% of U.S. Economy by 2020

More information

Price Schedule for Accurint, Social Media Monitor, Batch, Instant Verification and Instant Authentication

Price Schedule for Accurint, Social Media Monitor, Batch, Instant Verification and Instant Authentication Price Schedule for Accurint, Social Media Monitor, Batch, Instant Verification and Instant Authentication The prices and terms set forth herein supersede any and all pricing and terms set forth anywhere

More information

LexisNexis InstantID. Technical White Paper. Analyzing Results. For the following: LexisNexis Bridger Insight XG. Contact LexisNexis Sales:

LexisNexis InstantID. Technical White Paper. Analyzing Results. For the following: LexisNexis Bridger Insight XG. Contact LexisNexis Sales: Technical White Paper LexisNexis InstantID Analyzing Results For the following: LexisNexis Bridger Insight XG Contact LexisNexis Sales: 877-922-5757 C ONTENTS Executive Summary..................................................

More information

Technology teach in. November 19, 2015 London

Technology teach in. November 19, 2015 London Technology teach in November 19, 2015 London 1 FORWARD LOOKING STATEMENTS This presentation contains forward looking statements within the meaning of Section 27A of the US Securities Act of 1933, as amended,

More information

Risk Solutions. Industrial Big Data Analytics Lessons from the trenches Flavio Villanustre LexisNexis Risk Solutions

Risk Solutions. Industrial Big Data Analytics Lessons from the trenches Flavio Villanustre LexisNexis Risk Solutions Industrial Big Data Analytics Lessons from the trenches Flavio Villanustre LexisNexis Big Data funnel 2 HPCC Systems Big Data platform Open source distributed data processing and storage architecture Splits

More information

Thomson Reuters CLEAR THE SMARTER WAY TO GET YOUR INVESTIGATIVE FACTS STRAIGHT.

Thomson Reuters CLEAR THE SMARTER WAY TO GET YOUR INVESTIGATIVE FACTS STRAIGHT. Thomson Reuters CLEAR THE SMARTER WAY TO GET YOUR INVESTIGATIVE FACTS STRAIGHT. CLEAR. EASIER SEARCH. BETTER RESULTS. SMARTER PREFERENCES. Bring the facts into focus. CLEAR features a simple, intuitive

More information

WHY IDENTITY MANAGEMENT MATTERS TO MEDICAID. Clint Fuhrman National Director Government Healthcare

WHY IDENTITY MANAGEMENT MATTERS TO MEDICAID. Clint Fuhrman National Director Government Healthcare WHY IDENTITY MANAGEMENT MATTERS TO MEDICAID Clint Fuhrman National Director Government Healthcare Opportunities Greater focus on the individuals and entities in the program Are beneficiaries enrolling

More information

SCHEDULE A Accurint for Government (Per User Subscription)

SCHEDULE A Accurint for Government (Per User Subscription) LexisNexis Risk Solutions SCHEDULE A Accurint for Government (Per User Subscription) Agency (Customer) Name: Jefferson County District Attorney Billgroup #: 1437370 LN Account Manager: Abbey Willis This

More information

LexisNexis SmartLinx Comprehensive Reports Getting Started & Search Tips Guide. LexisNexis Customer Support: 800.543.6862

LexisNexis SmartLinx Comprehensive Reports Getting Started & Search Tips Guide. LexisNexis Customer Support: 800.543.6862 LexisNexis SmartLinx Comprehensive Reports Getting Started & Search Tips Guide Visit our user site for Nexis tips, training, self-paced tutorials and more. www.lexisnexis.com/bis-user-information/ Signing

More information

Unlimited. Premier Investigative Database for Law Enforcement & Government Agencies. For A FREE TRIAL

Unlimited. Premier Investigative Database for Law Enforcement & Government Agencies. For A FREE TRIAL Unlimited America s #1 Link Analysis Tool Person & PHONE Find Relatives, Neighbors, Associates and Common Residency Instantly. Link & Connect Current & Historical Data on Your Subject in Real Time Comprehensive

More information

Key Factors for Payers in Fraud and Abuse Prevention. Protect against fraud and abuse with a multi-layered approach to claims management.

Key Factors for Payers in Fraud and Abuse Prevention. Protect against fraud and abuse with a multi-layered approach to claims management. White Paper Protect against fraud and abuse with a multi-layered approach to claims management. October 2012 Whether an act is technically labeled health insurance fraud or health insurance abuse, the

More information

Insurance Solutions. 17 October 2013. Risk Solutions

Insurance Solutions. 17 October 2013. Risk Solutions Insurance Solutions 17 October 2013 FORWARD LOOKING STATEMENTS This presentation contains forward looking statements within the meaning of Section 27A of the US Securities Act of 1933, as amended, and

More information

WHITEPAPER. Complying with the Red Flag Rules and FACT Act Address Discrepancy Rules

WHITEPAPER. Complying with the Red Flag Rules and FACT Act Address Discrepancy Rules WHITEPAPER Complying with the Red Flag Rules and FACT Act Address Discrepancy Rules May 2008 2 Table of Contents Introduction 3 ID Analytics for Compliance and the Red Flag Rules 4 Comparison with Alternative

More information

Sample Report: LexisNexis RiskView Report

Sample Report: LexisNexis RiskView Report Sample Report: LexisNexis RiskView Report LexisNexis RiskView Report delivers insights into key consumer data and behavior attributes to help strengthen lending decisions, expand your addressable market

More information

16.1 MAPREDUCE. For personal use only, not for distribution. 333

16.1 MAPREDUCE. For personal use only, not for distribution. 333 For personal use only, not for distribution. 333 16.1 MAPREDUCE Initially designed by the Google labs and used internally by Google, the MAPREDUCE distributed programming model is now promoted by several

More information

March 22, 2013. Tennessee State Employees Association 627 Woodland Street Nashville, TN 37206

March 22, 2013. Tennessee State Employees Association 627 Woodland Street Nashville, TN 37206 March 22, 2013 March 22, 2013 Tennessee State Employees Association 627 Woodland Street Nashville, TN 37206 InfoArmor is pleased to present the Tennessee State Employees Association (TSEA) with the following

More information

Fraud and Improve Provider and Member Management?

Fraud and Improve Provider and Member Management? Using IdentityManagement andpredictiveanalyticsto to Prevent Fraud and Improve Provider and Member Management? Kathy Mosbaugh, Director, Health Care February 6, 2013 LexisNexis leverages data about people,

More information

How To Make Sense Of Data With Altilia

How To Make Sense Of Data With Altilia HOW TO MAKE SENSE OF BIG DATA TO BETTER DRIVE BUSINESS PROCESSES, IMPROVE DECISION-MAKING, AND SUCCESSFULLY COMPETE IN TODAY S MARKETS. ALTILIA turns Big Data into Smart Data and enables businesses to

More information

White Paper. High Value Data and Analytics: Building a Platform for Growth

White Paper. High Value Data and Analytics: Building a Platform for Growth White Paper High Value Data and Analytics: Building a Platform for Growth What s hidden in your data? Analyze, realize and optimize the possibilities. Created by industry experts, this publication is the

More information

Real-time customer information data quality and location based service determination implementation best practices.

Real-time customer information data quality and location based service determination implementation best practices. White paper Location Intelligence Location and Business Data Real-time customer information data quality and location based service determination implementation best practices. Page 2 Real-time customer

More information

Unlocking The Value of the Deep Web. Harvesting Big Data that Google Doesn t Reach

Unlocking The Value of the Deep Web. Harvesting Big Data that Google Doesn t Reach Unlocking The Value of the Deep Web Harvesting Big Data that Google Doesn t Reach Introduction Every day, untold millions search the web with Google, Bing and other search engines. The volumes truly are

More information

Data Mining in the Swamp

Data Mining in the Swamp WHITE PAPER Page 1 of 8 Data Mining in the Swamp Taming Unruly Data with Cloud Computing By John Brothers Business Intelligence is all about making better decisions from the data you have. However, all

More information

Moving Large Data at a Blinding Speed for Critical Business Intelligence. A competitive advantage

Moving Large Data at a Blinding Speed for Critical Business Intelligence. A competitive advantage Moving Large Data at a Blinding Speed for Critical Business Intelligence A competitive advantage Intelligent Data In Real Time How do you detect and stop a Money Laundering transaction just about to take

More information

Managing Big Data with Hadoop & Vertica. A look at integration between the Cloudera distribution for Hadoop and the Vertica Analytic Database

Managing Big Data with Hadoop & Vertica. A look at integration between the Cloudera distribution for Hadoop and the Vertica Analytic Database Managing Big Data with Hadoop & Vertica A look at integration between the Cloudera distribution for Hadoop and the Vertica Analytic Database Copyright Vertica Systems, Inc. October 2009 Cloudera and Vertica

More information

Survey of External Data Possibilities for Commercial Insurance

Survey of External Data Possibilities for Commercial Insurance Survey of External Data Possibilities for Commercial Insurance 2015 CAS Ratemaking and Product Management Seminar Tom Kolde, FCAS, MAAA Kathryn Walker, FCAS, MAAA, CPCU March 11, 2015 Discussion Topics

More information

Don t be the last to know.

Don t be the last to know. White Paper Commercial Portfolio Risk Management Doesn t Stop with Underwriting Don t be the last to know. Carriers that proactively manage risk during the policy term can better anticipate and respond

More information

Graph Database Proof of Concept Report

Graph Database Proof of Concept Report Objectivity, Inc. Graph Database Proof of Concept Report Managing The Internet of Things Table of Contents Executive Summary 3 Background 3 Proof of Concept 4 Dataset 4 Process 4 Query Catalog 4 Environment

More information

Customer Data Management. Breaking down data silos for improved business outcomes

Customer Data Management. Breaking down data silos for improved business outcomes Customer Data Management Breaking down data silos for improved business outcomes November 2011 Table of Contents 1 Executive summary 1 Introduction 2 Selling in Today s Complex B2B Climate 3 The Solution:

More information

Get More Scalability and Flexibility for Big Data

Get More Scalability and Flexibility for Big Data Solution Overview LexisNexis High-Performance Computing Cluster Systems Platform Get More Scalability and Flexibility for What You Will Learn Modern enterprises are challenged with the need to store and

More information

HPCC Monitoring and Reporting (Technical Preview) Boca Raton Documentation Team

HPCC Monitoring and Reporting (Technical Preview) Boca Raton Documentation Team HPCC Monitoring and Reporting (Technical Preview) Boca Raton Documentation Team HPCC Monitoring and Reporting (Technical Preview) Boca Raton Documentation Team Copyright 2015 HPCC Systems. All rights reserved

More information

Real vs. Synthetic Web Performance Measurements, a Comparative Study

Real vs. Synthetic Web Performance Measurements, a Comparative Study Real vs. Synthetic Web Performance Measurements, a Comparative Study By John Bartlett and Peter Sevcik December 2004 Enterprises use today s Internet to find customers, provide them information, engage

More information

LEXISNEXIS OFFERS ENTERPRISE DATA FUSION TO GOVERNMENT AGENCIES TO MEET NATIONAL SECURITY CHALLENGES:

LEXISNEXIS OFFERS ENTERPRISE DATA FUSION TO GOVERNMENT AGENCIES TO MEET NATIONAL SECURITY CHALLENGES: WHITE PAPER LEXISNEXIS OFFERS ENTERPRISE DATA FUSION TO GOVERNMENT AGENCIES TO MEET NATIONAL SECURITY CHALLENGES: Data Integration Platform Offers Immediate Solution for Large Scale Disparate Data Challenges

More information

Business Intelligence Solutions for Gaming and Hospitality

Business Intelligence Solutions for Gaming and Hospitality Business Intelligence Solutions for Gaming and Hospitality Prepared by: Mario Perkins Qualex Consulting Services, Inc. Suzanne Fiero SAS Objective Summary 2 Objective Summary The rise in popularity and

More information

Pattern Insight Clone Detection

Pattern Insight Clone Detection Pattern Insight Clone Detection TM The fastest, most effective way to discover all similar code segments What is Clone Detection? Pattern Insight Clone Detection is a powerful pattern discovery technology

More information

The Predictive Fraud and Abuse Analytic and Risk Management System

The Predictive Fraud and Abuse Analytic and Risk Management System The Predictive Fraud and Abuse Analytic and Risk Management System Empowering healthcare payers and stakeholders in preventing and recovering fraudulent healthcare payments IkaIntegrity : Your real-time

More information

Manage large-scale disparate data challenges.

Manage large-scale disparate data challenges. White Paper Manage large-scale disparate data challenges. Unlock the intelligence in real-world data with enterprise data fusion. February 2010 Risk Solutions Government Introduction In a recent report

More information

Experian s UK Credit Bureau Scores. Version 1.6

Experian s UK Credit Bureau Scores. Version 1.6 Experian s UK Credit Bureau Scores Version 1.6 January 2014 About Experian Decision Analytics Experian Decision Analytics enterprise-wide solutions combine data intelligence, predictive analytics, decisionenabling

More information

Mastering Big Data. Steve Hoskin, VP and Chief Architect INFORMATICA MDM. October 2015

Mastering Big Data. Steve Hoskin, VP and Chief Architect INFORMATICA MDM. October 2015 Mastering Big Data Steve Hoskin, VP and Chief Architect INFORMATICA MDM October 2015 Agenda About Big Data MDM and Big Data The Importance of Relationships Big Data Use Cases About Big Data Big Data is

More information

ENHANCING INTELLIGENCE SUCCESS: DATA CHARACTERIZATION Francine Forney, Senior Management Consultant, Fuel Consulting, LLC May 2013

ENHANCING INTELLIGENCE SUCCESS: DATA CHARACTERIZATION Francine Forney, Senior Management Consultant, Fuel Consulting, LLC May 2013 ENHANCING INTELLIGENCE SUCCESS: DATA CHARACTERIZATION, Fuel Consulting, LLC May 2013 DATA AND ANALYSIS INTERACTION Understanding the content, accuracy, source, and completeness of data is critical to the

More information

Waste Fleet Safety: Tips and Tools to Ensure Safe Driving. White Paper. www.fleetmind.com

Waste Fleet Safety: Tips and Tools to Ensure Safe Driving. White Paper. www.fleetmind.com Waste Fleet Safety: Tips and Tools to Ensure Safe Driving White Paper www.fleetmind.com Table of Contents Introduction 1 CSA 2010 2 Fleet safety planning 3 Influencing driver behavior 5 The required tools

More information

How To Use Ibm Tivoli Monitoring Software

How To Use Ibm Tivoli Monitoring Software Monitor and manage critical resources and metrics across disparate platforms from a single console IBM Tivoli Monitoring Highlights Help improve uptime and shorten Help optimize IT service delivery by

More information

Challenge. Solutions. Early results. Personal Lines Case Study Celina Insurance Reduces Expenses & Improves Processes Across the Business.

Challenge. Solutions. Early results. Personal Lines Case Study Celina Insurance Reduces Expenses & Improves Processes Across the Business. Celina Insurance Reduces Expenses & Improves Processes Across the Business About Celina Insurance Group Founded in 1914, Celina Insurance Group is composed of four mutual property and casualty insurance

More information

Why is Internal Audit so Hard?

Why is Internal Audit so Hard? Why is Internal Audit so Hard? 2 2014 Why is Internal Audit so Hard? 3 2014 Why is Internal Audit so Hard? Waste Abuse Fraud 4 2014 Waves of Change 1 st Wave Personal Computers Electronic Spreadsheets

More information

Web Analytics Understand your web visitors without web logs or page tags and keep all your data inside your firewall.

Web Analytics Understand your web visitors without web logs or page tags and keep all your data inside your firewall. Web Analytics Understand your web visitors without web logs or page tags and keep all your data inside your firewall. 5401 Butler Street, Suite 200 Pittsburgh, PA 15201 +1 (412) 408 3167 www.metronomelabs.com

More information

CREDIT SCORE USER GUIDE

CREDIT SCORE USER GUIDE Page 1 of 11 ABOUT EQUIFAX Equifax empowers businesses and consumers with information they can trust. A global leader in information solutions, we leverage one of the largest sources of consumer and commercial

More information

The Role of D&B s DUNSRight Process in Customer Data Integration and Master Data Management. Dan Power, D&B Global Alliances March 25, 2007

The Role of D&B s DUNSRight Process in Customer Data Integration and Master Data Management. Dan Power, D&B Global Alliances March 25, 2007 The Role of D&B s DUNSRight Process in Customer Data Integration and Master Data Management Dan Power, D&B Global Alliances March 25, 2007 Agenda D&B Today and Speaker s Background Overcoming CDI and MDM

More information

Commercial Credit Report Pro Guide

Commercial Credit Report Pro Guide Page 1 Reading the Commercial Credit Report Pro Get up to double the business credit information with our Advantage series reports including data from Experian, Dun & Bradstreet and Equifax. Too many times

More information

Dan French Founder & CEO, Consider Solutions

Dan French Founder & CEO, Consider Solutions Dan French Founder & CEO, Consider Solutions CONSIDER SOLUTIONS Mission Solutions for World Class Finance Footprint Financial Control & Compliance Risk Assurance Process Optimization CLIENTS CONTEXT The

More information

APRIL 7-8, 2014 / RIO ALL-SUITE HOTEL & CASINO / LAS VEGAS. Using Data: Big Data in rental car Know more about your renter and your business

APRIL 7-8, 2014 / RIO ALL-SUITE HOTEL & CASINO / LAS VEGAS. Using Data: Big Data in rental car Know more about your renter and your business Using Data: Big Data in rental car Know more about your renter and your business The elevator pitch Analysis of internal data can help your day to day operations. Using external and shared industry data

More information

Putting IBM Watson to Work In Healthcare

Putting IBM Watson to Work In Healthcare Martin S. Kohn, MD, MS, FACEP, FACPE Chief Medical Scientist, Care Delivery Systems IBM Research marty.kohn@us.ibm.com Putting IBM Watson to Work In Healthcare 2 SB 1275 Medical data in an electronic or

More information

Luncheon Webinar Series May 13, 2013

Luncheon Webinar Series May 13, 2013 Luncheon Webinar Series May 13, 2013 InfoSphere DataStage is Big Data Integration Sponsored By: Presented by : Tony Curcio, InfoSphere Product Management 0 InfoSphere DataStage is Big Data Integration

More information

DNBi Risk Management. Unparalleled Data Insight to Drive Profitable Growth. Insights from Data. Relationships from Insights

DNBi Risk Management. Unparalleled Data Insight to Drive Profitable Growth. Insights from Data. Relationships from Insights DNBi Risk Management Unparalleled Data Insight to Drive Profitable Growth Insights from Data Relationships from Insights DNBi is a powerful, web-based credit risk management solution that offers Dun &

More information

Susan J Hyatt President and CEO HYATTDIO, Inc. Lorraine Fernandes, RHIA Global Healthcare Ambassador IBM Information Management

Susan J Hyatt President and CEO HYATTDIO, Inc. Lorraine Fernandes, RHIA Global Healthcare Ambassador IBM Information Management Accurate and Trusted Data- The Foundation for EHR Programs Susan J Hyatt President and CEO HYATTDIO, Inc. Lorraine Fernandes, RHIA Global Healthcare Ambassador IBM Information Management Healthcare priorities

More information

BIG DATA SURVEY 2014 SURVEY

BIG DATA SURVEY 2014 SURVEY BIG DATA SURVEY 2014 SURVEY There has been a tremendous amount of hype around Big Data projects and applications in recent years, but relatively little quantifiable evidence proving what, if any, business

More information

Informatica and the Vibe Virtual Data Machine

Informatica and the Vibe Virtual Data Machine White Paper Informatica and the Vibe Virtual Data Machine Preparing for the Integrated Information Age This document contains Confidential, Proprietary and Trade Secret Information ( Confidential Information

More information

CiteSeer x in the Cloud

CiteSeer x in the Cloud Published in the 2nd USENIX Workshop on Hot Topics in Cloud Computing 2010 CiteSeer x in the Cloud Pradeep B. Teregowda Pennsylvania State University C. Lee Giles Pennsylvania State University Bhuvan Urgaonkar

More information

So today we shall continue our discussion on the search engines and web crawlers. (Refer Slide Time: 01:02)

So today we shall continue our discussion on the search engines and web crawlers. (Refer Slide Time: 01:02) Internet Technology Prof. Indranil Sengupta Department of Computer Science and Engineering Indian Institute of Technology, Kharagpur Lecture No #39 Search Engines and Web Crawler :: Part 2 So today we

More information

Knowledgent White Paper Series. Developing an MDM Strategy WHITE PAPER. Key Components for Success

Knowledgent White Paper Series. Developing an MDM Strategy WHITE PAPER. Key Components for Success Developing an MDM Strategy Key Components for Success WHITE PAPER Table of Contents Introduction... 2 Process Considerations... 3 Architecture Considerations... 5 Conclusion... 9 About Knowledgent... 10

More information

InterAction combines an organization s relationship data and industry knowledge and transforms it into actionable relationship. intelligence.

InterAction combines an organization s relationship data and industry knowledge and transforms it into actionable relationship. intelligence. North Carolina Biotechnology Center (NC Biotech) Overview Location: Research Triangle Park, NC Industry: Economic development/ biotechnology Customer Profile: NC Biotech is a state-funded nonprofit corporation

More information

Alternative Data and Fair Lending

Alternative Data and Fair Lending White Paper 81% of historically underserved minority customers that are unscorable using traditional credit bureau scores are scorable using alternative data. August 2013 By Jeffrey Feinstein, PhD Table

More information

White Paper Voice of Customer: Using Customer Actions That Speak

White Paper Voice of Customer: Using Customer Actions That Speak White Paper Voice of Customer: Using Customer Actions That Speak Rajesh Babu Kuppili Devendra Malekar Ribha Mehrotra Tavant Technologies www.tavant.com P-1 Executive Summary Companies have been using Voice

More information

Predictive Coding Defensibility and the Transparent Predictive Coding Workflow

Predictive Coding Defensibility and the Transparent Predictive Coding Workflow Predictive Coding Defensibility and the Transparent Predictive Coding Workflow Who should read this paper Predictive coding is one of the most promising technologies to reduce the high cost of review by

More information

What to Expect when On-Boarding to ILHIE Master Patient Index (MPI) Presentation By : Alexander Danel

What to Expect when On-Boarding to ILHIE Master Patient Index (MPI) Presentation By : Alexander Danel What to Expect when On-Boarding to ILHIE Master Patient Index (MPI) Presentation By : Alexander Danel What Constitutes Demographics? MPI contains patient demographics. Patient information related to identification,

More information

The New Reality of Synthetic ID Fraud How to Battle the Leading Identity Fraud Tactic in The Digital Age

The New Reality of Synthetic ID Fraud How to Battle the Leading Identity Fraud Tactic in The Digital Age How to Battle the Leading Identity Fraud Tactic in The Digital Age In the 15 years since synthetic identity fraud emerged as a significant threat, it has become the predominant tactic for fraudsters. The

More information

Moreketing. With great ease you can end up wasting a lot of time and money with online marketing. Causing

Moreketing. With great ease you can end up wasting a lot of time and money with online marketing. Causing ! Moreketing Automated Cloud Marketing Service With great ease you can end up wasting a lot of time and money with online marketing. Causing frustrating delay and avoidable expense right at the moment

More information

White Paper. Redefine Your Analytics Journey With Self-Service Data Discovery and Interactive Predictive Analytics

White Paper. Redefine Your Analytics Journey With Self-Service Data Discovery and Interactive Predictive Analytics White Paper Redefine Your Analytics Journey With Self-Service Data Discovery and Interactive Predictive Analytics Contents Self-service data discovery and interactive predictive analytics... 1 What does

More information

The Use of Credit Reports in Employment Background Screening

The Use of Credit Reports in Employment Background Screening White Paper The Use of Credit Reports in Employment Background Screening an Overview for Job Applicants Research provided by: Lester Rosen, CEO, Employment Screening Resources Kerstin Bagus, Director Global

More information

Frequently Asked Questions

Frequently Asked Questions FAQs Frequently Asked Questions Topic Page Setting up your account and account activation 1 Payment options, credit card information, and invoicing. 2 Placing an order and order updates.. 3 The difference

More information

Advanced Big Data Analytics with R and Hadoop

Advanced Big Data Analytics with R and Hadoop REVOLUTION ANALYTICS WHITE PAPER Advanced Big Data Analytics with R and Hadoop 'Big Data' Analytics as a Competitive Advantage Big Analytics delivers competitive advantage in two ways compared to the traditional

More information

Replicates / Improves Investigative Process. Accesses Data From Disparate Sources In Real Time. Cuts Administrative Costs

Replicates / Improves Investigative Process. Accesses Data From Disparate Sources In Real Time. Cuts Administrative Costs Automated Fraud Investigation Replicates / Improves Investigative Process Accesses Data From Disparate Sources In Real Time Cuts Administrative Costs Improves Investigative Accuracy / Throughput Reduces

More information

Welcome to. Business Intelligence 101

Welcome to. Business Intelligence 101 Welcome to Business Intelligence 101 Hi There! Before choosing a (BI) partner, you ll want to understand the essentials about BI including the various categories of analytics, what sort of insight is possible,

More information

Big Data and Scripting Systems beyond Hadoop

Big Data and Scripting Systems beyond Hadoop Big Data and Scripting Systems beyond Hadoop 1, 2, ZooKeeper distributed coordination service many problems are shared among distributed systems ZooKeeper provides an implementation that solves these avoid

More information

White Paper. Thirsting for Insight? Quench It With 5 Data Management for Analytics Best Practices.

White Paper. Thirsting for Insight? Quench It With 5 Data Management for Analytics Best Practices. White Paper Thirsting for Insight? Quench It With 5 Data Management for Analytics Best Practices. Contents Data Management: Why It s So Essential... 1 The Basics of Data Preparation... 1 1: Simplify Access

More information

Best Practices for Background Screening in SOUTH CAROLINA. schools. Resource

Best Practices for Background Screening in SOUTH CAROLINA. schools. Resource Best Practices for Background Screening in SOUTH CAROLINA schools A Resource Introduction Background screening in schools certainly has its challenges; however, it s an important part of keeping your schools

More information

www.hcltech.com Get Ready for Tomorrow, Today. Redefine Your Security Intelligence

www.hcltech.com Get Ready for Tomorrow, Today. Redefine Your Security Intelligence www.hcltech.com Get Ready for Tomorrow, Today. Redefine Your Security Intelligence Balancing Accessibility and Risk The challenge before enterprises is to provide accessibility and protect their online

More information

Big Data With Hadoop

Big Data With Hadoop With Saurabh Singh singh.903@osu.edu The Ohio State University February 11, 2016 Overview 1 2 3 Requirements Ecosystem Resilient Distributed Datasets (RDDs) Example Code vs Mapreduce 4 5 Source: [Tutorials

More information

Understanding the Use of Credit and Scores for Insurance Underwriting

Understanding the Use of Credit and Scores for Insurance Underwriting Understanding the Use of Credit and Scores for Insurance Underwriting 1. Why do insurers use credit? Insurance companies use financial history along with other factors (such as years of driving experience

More information

Behavioral Segmentation

Behavioral Segmentation Behavioral Segmentation TM Contents 1. The Importance of Segmentation in Contemporary Marketing... 2 2. Traditional Methods of Segmentation and their Limitations... 2 2.1 Lack of Homogeneity... 3 2.2 Determining

More information

BIG Data Analytics Move to Competitive Advantage

BIG Data Analytics Move to Competitive Advantage BIG Data Analytics Move to Competitive Advantage where is technology heading today Standardization Open Source Automation Scalability Cloud Computing Mobility Smartphones/ tablets Internet of Things Wireless

More information

3 Step Approach to Improving Customer Experience and Driving Engagement

3 Step Approach to Improving Customer Experience and Driving Engagement 3 Step Approach to Improving Customer Experience and Driving Engagement 2011 Experian Information Solutions, Inc. All rights reserved. Experian and the marks used herein are service marks or registered

More information

Veritas ediscovery Platform

Veritas ediscovery Platform TM Veritas ediscovery Platform Overview The is the leading enterprise ediscovery solution that enables enterprises, governments, and law firms to manage legal, regulatory, and investigative matters using

More information

Assessing Your Business Analytics Initiatives

Assessing Your Business Analytics Initiatives Assessing Your Business Analytics Initiatives Eight Metrics That Matter WHITE PAPER SAS White Paper Table of Contents Introduction.... 1 The Metrics... 1 Business Analytics Benchmark Study.... 3 Overall

More information

Understanding Your Customer Journey by Extending Adobe Analytics with Big Data

Understanding Your Customer Journey by Extending Adobe Analytics with Big Data SOLUTION BRIEF Understanding Your Customer Journey by Extending Adobe Analytics with Big Data Business Challenge Today s digital marketing teams are overwhelmed by the volume and variety of customer interaction

More information

Foundations of Business Intelligence: Databases and Information Management

Foundations of Business Intelligence: Databases and Information Management Foundations of Business Intelligence: Databases and Information Management Problem: HP s numerous systems unable to deliver the information needed for a complete picture of business operations, lack of

More information

Cisco Unified Workforce Optimization for Cisco Unified Contact Center Express

Cisco Unified Workforce Optimization for Cisco Unified Contact Center Express Cisco Unified Workforce Optimization for Cisco Unified Contact Center Express Cisco Unified Communications is a comprehensive IP communications system of voice, video, data, and mobility products and applications.

More information

How To Get More Out Of Leads

How To Get More Out Of Leads Lead Scoring for Success A practical guide to achieving better results with lead scoring Lead Scoring The Growing Need for Lead Scoring The Growing Need for Lead Scoring A company s website is still one

More information

Big Data. Fast Forward. Putting data to productive use

Big Data. Fast Forward. Putting data to productive use Big Data Putting data to productive use Fast Forward What is big data, and why should you care? Get familiar with big data terminology, technologies, and techniques. Getting started with big data to realize

More information

Application of SAS! Enterprise Miner in Credit Risk Analytics. Presented by Minakshi Srivastava, VP, Bank of America

Application of SAS! Enterprise Miner in Credit Risk Analytics. Presented by Minakshi Srivastava, VP, Bank of America Application of SAS! Enterprise Miner in Credit Risk Analytics Presented by Minakshi Srivastava, VP, Bank of America 1 Table of Contents Credit Risk Analytics Overview Journey from DATA to DECISIONS Exploratory

More information

Conjugating data mood and tenses: Simple past, infinite present, fast continuous, simpler imperative, conditional future perfect

Conjugating data mood and tenses: Simple past, infinite present, fast continuous, simpler imperative, conditional future perfect Matteo Migliavacca (mm53@kent) School of Computing Conjugating data mood and tenses: Simple past, infinite present, fast continuous, simpler imperative, conditional future perfect Simple past - Traditional

More information

Ten Things You Need to Know About Data Virtualization

Ten Things You Need to Know About Data Virtualization White Paper Ten Things You Need to Know About Data Virtualization What is Data Virtualization? Data virtualization is an agile data integration method that simplifies information access. Data virtualization

More information

Interactive Data. LeadMD Case Study. Putting the Focus back on Revenue

Interactive Data. LeadMD Case Study. Putting the Focus back on Revenue LeadMD Case Study Interactive Data Putting the Focus back on Revenue Interactive Data LLC (ID) began as a directory assistance information provider in October 2001, one month after September 11th Since

More information

Combining Financial Management and Collections to Increase Revenue and Efficiency

Combining Financial Management and Collections to Increase Revenue and Efficiency Experience the commitment SOLUTION BRIEF FOR CGI ADVANTAGE ERP CLIENTS Combining Financial Management and Collections to Increase Revenue and Efficiency CGI Advantage ERP clients have a unique opportunity

More information

Fleet Optimization with IBM Maximo for Transportation

Fleet Optimization with IBM Maximo for Transportation Efficiencies, savings and new opportunities for fleet Fleet Optimization with IBM Maximo for Transportation Highlights Integrates IBM Maximo for Transportation with IBM Fleet Optimization solutions Offers

More information

The Criminal Justice Dashboard (The Dashboard) Category: Information Communications Technology (ICT) Innovations. State of Maryland.

The Criminal Justice Dashboard (The Dashboard) Category: Information Communications Technology (ICT) Innovations. State of Maryland. The Criminal Justice Dashboard (The Dashboard) Category: Information Communications Technology (ICT) Innovations State of Maryland June 1, 2011 1 Section B. The Criminal Justice Dashboard (Dashboard) is

More information