Mastering Big Data. Steve Hoskin, VP and Chief Architect INFORMATICA MDM. October 2015

Agenda
- About Big Data
- MDM and Big Data
- The Importance of Relationships
- Big Data Use Cases

About Big Data
Big Data is the term for a collection of data sets so large and complex that it becomes difficult to process using on-hand database management tools or traditional data processing applications. Big Data is also sometimes used as a generic label for the Hadoop frameworks that allow for the processing and management of these data sets.

Need for Hadoop

HDFS is not the only place for Big Data

Big Data Stack without Big Data?
You don't need really big data to gain the benefits of the Big Data stack:
- Scalable batch execution environment
- Reduced database costs
- Open source projects provide capabilities such as Machine Learning and Graph Analytics

Downsides?
- Numerous distributions with frequent releases
- Steep learning curves
- Solutions only work for the 80%: how do you build the rest?
- Currently better suited to analytics than to operational use cases
- It's just tooling; you still need to build or buy business solutions
- So many choices, with potential dead ends
- Requires a different hardware deployment

A Master Data Management Primer
- Data Acquisition
- Data Quality and Enrichment
- Authoring
- Matching & Deduplication
- Relationship Discovery and Management
- Survivorship (aka Golden Record and Best Version of Truth)
- Search
- Workflow
- Governance
- Real-time consumption
- Publishing and consumption
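As an illustration of two items from the primer, Matching & Deduplication and Survivorship, here is a minimal Python sketch. It is not Informatica's method: the record layout, the fuzzy name comparison via `difflib`, the 0.75 threshold, and the "newest non-empty value wins" rule are all assumptions chosen to keep the example small.

```python
from difflib import SequenceMatcher

# Hypothetical source records; field names and values are illustrative only.
records = [
    {"id": 1, "name": "John Q. Jones", "email": None, "updated": "2015-01-10"},
    {"id": 2, "name": "John Quincy Jones", "email": "jqj@example.com", "updated": "2015-06-02"},
    {"id": 3, "name": "Mary Smith", "email": "mary@example.com", "updated": "2015-03-15"},
]

def similar(a, b, threshold=0.75):
    """Fuzzy name match; the 0.75 threshold is an arbitrary assumption."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio() >= threshold

def cluster(recs):
    """Greedy matching: compare each record to the first member of each group."""
    groups = []
    for rec in recs:
        for group in groups:
            if similar(rec["name"], group[0]["name"]):
                group.append(rec)
                break
        else:
            groups.append([rec])
    return groups

def golden_record(group):
    """Survivorship rule (assumed): the newest non-empty value wins per field."""
    newest_first = sorted(group, key=lambda r: r["updated"], reverse=True)
    golden = {"source_ids": sorted(r["id"] for r in group)}
    for field in ("name", "email"):
        golden[field] = next((r[field] for r in newest_first if r[field]), None)
    return golden

golden = [golden_record(g) for g in cluster(records)]
for row in golden:
    print(row)
```

Real matching engines use far richer rules (phonetic keys, address standardization, weighted attributes); the point here is only the two-step shape: group candidate duplicates, then pick surviving values per field.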

Hadoop is a good fit for these MDM functions today
- Data Acquisition
- Data Quality and Enrichment
- Authoring
- Matching & Deduplication
- Relationship Discovery and Management
- Survivorship (aka Golden Record and Best Version of Truth)
- Search
- Workflow
- Governance
- Operational consumption
- Publishing

Intelligent Layers of Big Data
- Big Data Consumption and Analytics
- Big Data Relationship Management: De-dup, Enrich & Relate
- Trusted Reference Data: Organize, Fix & Enrich
- Big Data Catalog: Catalog, Relate & Score

What data do we have, and how useful is it?
- Content Inference, Sensitive Data Tracking, Stewardship, Smart Suggestions
- Crawl, Index, Cluster, Classify, Relate, Infer Semantics
- Catalog of Data Assets: Relationships, Quality Score, Statistics, Rules, Glossary, Ratings
- All IT Repositories; Applications, Business Semantics; 3rd Party BI, Modeling, Big Data; User Ratings, Feedback, Operational Stats

Big Data Quality: Make Sense of Big Data
- Ingest
- Explore: identify common patterns; find outliers; help ask the right questions
- Recommend: suggest actions based on the data; recommend the next best step; predict outcomes
- Learn: from system recommendations; from user actions; from the data itself
- Deliver
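The "find outliers" step under Explore can be sketched in a few lines of Python. This is only one possible technique, not something the slides specify: it uses the modified z-score based on the median absolute deviation (MAD), with the conventional 0.6745 scaling constant and a 3.5 cutoff. The function name and the sample claim amounts are made up for illustration.

```python
from statistics import median

def mad_outliers(values, cutoff=3.5):
    """Return the values whose modified z-score exceeds the cutoff.

    Modified z-score = 0.6745 * |x - median| / MAD. The 3.5 cutoff is a
    commonly used default, taken here as an assumption.
    """
    med = median(values)
    mad = median([abs(v - med) for v in values])
    if mad == 0:
        return []  # no spread around the median, nothing to flag
    return [v for v in values if 0.6745 * abs(v - med) / mad > cutoff]

# Hypothetical claim amounts with one suspicious entry.
claim_amounts = [120, 135, 128, 140, 131, 125, 2400]
print(mad_outliers(claim_amounts))  # prints [2400]
```

A median-based score is used rather than mean and standard deviation because a single extreme value would otherwise inflate the spread and hide itself.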

Relationships & Social MDM
1. Single Person View (ASSERTED): John Q. Jones, John Quincy Jones, and Jonathan Quincy Jones resolved to one person; Location, Product, Customer, Account
2. 360° View of Person Relationships (OBSERVED): Family & Business Relationship, Transactional Data, Social Data; Purchase History, Claims, Product Reviews, Complaints, Payment
3. Complete View of Person Interactions and Predictions (DERIVED): Customer Segmentation, Churn Prediction, Sentiment Analysis, Fraud Management; RFM Calculation, Fraud Detection, Product Sentiment, Customer Churn
Social MDM: Governance, Visualization, Prediction
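The slide names "RFM Calculation" as one of the computations built on transactional data. RFM conventionally stands for recency, frequency, and monetary value; the Python sketch below shows that calculation on made-up purchase rows. The tuple layout, customer keys, and the `as_of` date are assumptions for illustration, not anything taken from the deck.

```python
from datetime import date

# Hypothetical purchase history: (customer key, purchase date, amount).
purchases = [
    ("cust-1", date(2015, 9, 20), 120.0),
    ("cust-1", date(2015, 10, 1), 80.0),
    ("cust-2", date(2015, 3, 5), 300.0),
]

def rfm(rows, as_of):
    """Compute recency (days since last purchase), frequency, and monetary total."""
    summary = {}
    for customer, day, amount in rows:
        last, count, total = summary.get(customer, (day, 0, 0.0))
        summary[customer] = (max(last, day), count + 1, total + amount)
    return {
        customer: {
            "recency_days": (as_of - last).days,
            "frequency": count,
            "monetary": total,
        }
        for customer, (last, count, total) in summary.items()
    }

scores = rfm(purchases, as_of=date(2015, 10, 31))
for customer, score in scores.items():
    print(customer, score)
```

In practice the three raw values are usually bucketed into scores (for example quintiles) before being used for segmentation; that step is omitted here.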

MDM Relationships Add Value

Common Graphs in MDM
- Organizational Hierarchy
- Social Network
- Product Hierarchy

Relate Business Entities in MDM
- Vertex/Node: business entities (BEs) such as Party, Product, Claims, Complaints, etc.
- Edges: relationships (Accident, Bad Service, etc.)

MDM Graph Database
- Asserted Data: Customer BE, Party BE, Sales Person BE, Product BE
- Observed Data: Transaction Data, Relationship, Social Data
- Derived Data: Prediction
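The graph model described in these slides (business entities as vertices, relationships as edges, each tagged as asserted, observed, or derived) can be sketched with a small in-memory structure. This is a toy illustration in Python, not the MDM graph database itself; the class name `MdmGraph`, its methods, and the sample entities are all hypothetical.

```python
from collections import defaultdict, deque

class MdmGraph:
    """Toy undirected graph: vertices are business entities, edges are relationships."""

    def __init__(self):
        self.nodes = {}                 # entity key -> entity kind (Party, Product, ...)
        self.edges = defaultdict(list)  # entity key -> [(other key, label, origin)]

    def add_entity(self, key, kind):
        self.nodes[key] = kind

    def relate(self, src, dst, label, origin):
        # origin is one of "asserted", "observed", "derived" (naming from the slides).
        self.edges[src].append((dst, label, origin))
        self.edges[dst].append((src, label, origin))

    def neighbors(self, key, origin=None):
        """Directly related entities, optionally filtered by data origin."""
        return [dst for dst, _, o in self.edges[key] if origin is None or o == origin]

    def within(self, start, hops):
        """Breadth-first search: all entities reachable within the given number of hops."""
        seen = {start}
        queue = deque([(start, 0)])
        while queue:
            node, depth = queue.popleft()
            if depth == hops:
                continue
            for dst, _, _ in self.edges[node]:
                if dst not in seen:
                    seen.add(dst)
                    queue.append((dst, depth + 1))
        seen.discard(start)
        return seen

g = MdmGraph()
g.add_entity("john", "Party")
g.add_entity("mary", "Party")
g.add_entity("policy-1", "Product")
g.add_entity("claim-9", "Claim")
g.relate("john", "policy-1", "holds", "asserted")
g.relate("john", "claim-9", "filed", "observed")
g.relate("mary", "claim-9", "witness", "observed")

print(g.neighbors("john", origin="observed"))  # prints ['claim-9']
print(sorted(g.within("john", 2)))             # prints ['claim-9', 'mary', 'policy-1']
```

Keeping the origin on each edge lets a query separate what was declared (asserted) from what was seen in transactions or social data (observed) and from what a model inferred (derived).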

Big Data MDM Use Cases

Use Cases
- Financial Services: Fraud Detection, Risk & Portfolio Analysis, Investment Recommendations
- Retail & Telco: Proactive Customer Engagement, Location Based Services
- Media & Entertainment: Online & In-Game Behavior, Customer X/Up-Sell
- Manufacturing: Connected Vehicle, Predictive Maintenance
- Healthcare & Pharma: Predicting Patient Outcomes, Total Cost of Care, Drug Discovery
- Public Sector: Health Insurance Exchanges, Public Safety, Tax Optimization, Fraud Detection

Large Insurance Company Customer Intelligence Example
1200+ Input Files; 718 Million Records; 7 Use Cases; 10 Nodes Hadoop; Cloudera, Informatica, Datameer

Business need
- 360 degree view of consumers for marketing, planning, and analytics
- Discover and mine relationships
- Create highly targeted and individualized marketing programs

Challenge
- Rich data environment across organizational business units, comprised of many source systems across various platforms
- Providing a consistent enterprise view of data across business units

Solution and results
- Seven use cases with increasing complexity
- Provides a single platform to house customer and prospect data from disparate sources
- Provides for rapid intake of new data sources (structured and unstructured)
- Eliminates the data intake and append bottleneck
- Empowers analysts to explore all data elements
- Increases processing power for statistical analysis

Fraud & Intelligence System Use Case
Unrelated events? Or fraud?
MDM can be leveraged to build a linearly scalable fraud management system that provides link analysis and data clustering, and that offers best-in-class search and match against large data volumes.
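To make "link analysis" and "data clustering" concrete, the Python sketch below groups seemingly unrelated events that share an attribute value, using a union-find structure to compute connected components. It is a simplified stand-in, not the system described in the slide: the claim records, the choice of `phone` and `address` as linking keys, and the function name `link_clusters` are assumptions.

```python
from collections import defaultdict

# Hypothetical claim events; identifiers and attribute values are made up.
claims = [
    {"id": "C1", "phone": "555-0101", "address": "12 Oak St"},
    {"id": "C2", "phone": "555-0199", "address": "12 Oak St"},
    {"id": "C3", "phone": "555-0199", "address": "7 Elm Rd"},
    {"id": "C4", "phone": "555-0142", "address": "3 Pine Ave"},
]

def link_clusters(events, keys=("phone", "address")):
    """Group events that are connected, directly or transitively, by a shared value."""
    parent = {e["id"]: e["id"] for e in events}

    def find(x):
        # Follow parent pointers to the root, halving the path along the way.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    def union(a, b):
        parent[find(a)] = find(b)

    first_seen = {}  # (attribute, value) -> id of the first event carrying it
    for e in events:
        for k in keys:
            token = (k, e[k])
            if token in first_seen:
                union(e["id"], first_seen[token])
            else:
                first_seen[token] = e["id"]

    groups = defaultdict(list)
    for e in events:
        groups[find(e["id"])].append(e["id"])
    return sorted(groups.values(), key=len, reverse=True)

print(link_clusters(claims))  # prints [['C1', 'C2', 'C3'], ['C4']]
```

C1 and C3 share nothing directly, yet they land in the same cluster because each is linked to C2. That transitive linking is what turns "unrelated events" into a candidate fraud ring worth reviewing.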

Questions?