Data Governance for Regulated Industries

Data Governance for Regulated Industries. Amir Halfon, CTO, Worldwide Financial Service

Agenda: Components of Data Governance; Challenges; Solutions and Case Studies; Q&A SLIDE: 2

Data Governance Considerations: Security SLIDE: 3

Data Governance Considerations: Security, Privacy SLIDE: 4

Data Governance Considerations: Security, Privacy, Provenance SLIDE: 5

Data Governance Considerations: Security, Retention, Privacy, Provenance SLIDE: 6

Data Governance Considerations: Security, Retention, Privacy, Continuity, Provenance SLIDE: 7

Data Governance Considerations: Security, Retention, Privacy, Provenance, Continuity, Compliance SLIDE: 8

Why is this difficult? And risky? And expensive? And behind schedule? SLIDE: 9

It's Complicated [diagram labels: Reference Data; Unstructured; OLTP; Warehouse; Documents, Messages; Social; Metadata; Video; Audio; Signals, Logs, Streams; Archives; Data Marts; Search] SLIDE: 10

Can Anything be Done? SLIDE: 11

Case Studies: Records Retention and Investigations; Trade Operational Data Store; Regulatory Compliance; Customer On-Boarding SLIDE: 12

Case Study: Records Retention and Investigations. Accurately respond to litigation; Hold, review, produce data across current, legacy systems; Repatriate and reconcile distributed data; Demonstrate fidelity and audit trail; Reduce infrastructure and maintenance costs SLIDE: 13

Old Generation Records Retention and Investigations [diagram labels: Oracle; Mainframe; Sybase; 87 total systems] SLIDE: 14

New Generation Records Retention and Investigations [diagram labels: Oracle; Mainframe; Sybase; 87 total systems; Ingest; Query; MarkLogic; MarkLogic; 100TB; 40TB; Offline; Replication; Shared Storage; NAS; HDFS] SLIDE: 15

Data Retention and Tiered Storage: Provide multiple Service Level Agreements (SLAs) in a single system; Decrease time and costs of ETL to bring offline content back online; Empower your operations team without imposing burdens on your developers SLIDE: 16

Tiered Storage Architecture: Data tiers are defined based on indexes; balanced into forests by tier; Query one tier or the other tier or both at once! All with no downtime, and 100% consistency! SLIDE: 17
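The "query one tier, the other, or both" idea can be illustrated with a minimal sketch. This is not MarkLogic's API: the `Document` class, the date-based tiering rule, and the `build_tiers` and `search` functions are all hypothetical, and plain in-memory lists stand in for forests.

```python
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class Document:
    """Hypothetical record; real tiers would hold indexed documents."""
    doc_id: str
    trade_date: date
    body: str


def assign_tier(doc, cutoff):
    """Assumed tiering rule: recent documents are active, older ones historical."""
    return "active" if doc.trade_date >= cutoff else "historical"


def build_tiers(docs, cutoff):
    """Place every document into exactly one tier."""
    tiers = {"active": [], "historical": []}
    for doc in docs:
        tiers[assign_tier(doc, cutoff)].append(doc)
    return tiers


def search(tiers, term, which=("active", "historical")):
    """Run the same query against one tier or against several at once."""
    return [d.doc_id for name in which for d in tiers[name] if term in d.body]


docs = [
    Document("t1", date(2013, 9, 1), "swap USD"),
    Document("t2", date(2009, 3, 15), "swap EUR"),
    Document("t3", date(2013, 10, 2), "option USD"),
]
tiers = build_tiers(docs, cutoff=date(2012, 1, 1))

active_hits = search(tiers, "swap", which=("active",))
all_hits = search(tiers, "swap")
```

The caller picks the tiers per query, so the same search code serves both a fast, recent-data SLA and a slower full-history SLA.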

Case Study: Operational Trade Data Store. Comply with regulations requiring operational insights; Quickly operationalize business innovation; Support risk management requirements; Reduce costs per trade (trade processing exceptions, infrastructure and maintenance costs) SLIDE: 18

Old Generation Operations and Analytics: Limited, fragmented analytics and reporting capabilities; Long, costly development cycles; Expensive, error-prone post-trade processing; Multiple relational data stores for different instrument types [diagram labels: Derivatives; Rates; FX; ETL; ETL; ETL; Matching, Clearing, Settlement, etc.] SLIDE: 19

New Generation Operations, Compliance and Analytics: Single source of truth; Single ODS for all instrument types persisted as-is off a message bus; Simplified workflow architecture; Tiered Storage using Hadoop [diagram labels: Executed Trades; MarkLogic; Surveillance, Risk & Compliance; Matching, Clearing, Settlement, etc.; Post Trade Processing; Exceptions Management; HDFS; Historical Analysis] SLIDE: 20

Event Handling and Workflow Integration: Queries can be serialized and indexed, just like documents; Index these queries; For a given data document, quickly find all possible matching queries; Narrow down these candidates to exact matches using a unified expression tree; Fire off events to drive workflow, or alert users SLIDE: 21
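The steps on this slide (store queries, index them, find candidates for an incoming document, narrow to exact matches, fire events) can be sketched in a few lines. This is a simplified stand-in, not the product's implementation: the `ReverseQueryIndex` class, its method names, and the query names are hypothetical, each stored query is assumed to be a plain AND-list of terms, and a subset check replaces the "unified expression tree" mentioned above.

```python
from collections import defaultdict


class ReverseQueryIndex:
    """Hypothetical sketch: stored queries are AND-lists of terms."""

    def __init__(self):
        self.queries = {}                    # query_id -> frozenset of required terms
        self.term_index = defaultdict(set)   # term -> ids of queries that mention it

    def register(self, query_id, terms):
        """Store a query and index it by each of its terms."""
        required = frozenset(t.lower() for t in terms)
        self.queries[query_id] = required
        for term in required:
            self.term_index[term].add(query_id)

    def candidates(self, doc_terms):
        """Cheap first pass: any stored query sharing at least one term."""
        found = set()
        for term in doc_terms:
            found |= self.term_index.get(term, set())
        return found

    def matches(self, text):
        """Second pass: keep only candidates whose terms all occur in the text."""
        doc_terms = set(text.lower().split())
        return sorted(q for q in self.candidates(doc_terms)
                      if self.queries[q] <= doc_terms)


def process(index, text, on_match):
    """Fire one event per stored query that matches the incoming document."""
    for query_id in index.matches(text):
        on_match(query_id, text)


index = ReverseQueryIndex()
index.register("fx-exception", ["fx", "failed"])
index.register("large-swap", ["swap", "notional", "large"])
index.register("any-settlement", ["settlement"])

events = []
process(index, "FX settlement failed for trade 123456",
        lambda query_id, text: events.append(query_id))
```

The two-pass shape is the point: the term index keeps the candidate set small, so the exact check runs only on queries that could plausibly match.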

Tiered Storage with Hadoop: Minimize duplication and ETL, reduce risk [diagram labels: MarkLogic, Active, ~$25/GB; HDFS, Historical, ~$1/GB] SLIDE: 22
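The two per-GB figures on this slide make the cost argument easy to work through. The sketch below uses only those two approximate prices; the 100,000 GB volume and the 20% active share are hypothetical inputs chosen for illustration, not figures from the presentation.

```python
ACTIVE_COST_PER_GB = 25      # ~$25/GB for the active tier, from the slide
HISTORICAL_COST_PER_GB = 1   # ~$1/GB for the historical tier, from the slide


def storage_cost(total_gb, active_percent):
    """Cost of holding total_gb when active_percent of it sits on the active tier."""
    if not 0 <= active_percent <= 100:
        raise ValueError("active_percent must be between 0 and 100")
    active_gb = total_gb * active_percent / 100
    historical_gb = total_gb - active_gb
    return active_gb * ACTIVE_COST_PER_GB + historical_gb * HISTORICAL_COST_PER_GB


total_gb = 100_000  # hypothetical data volume

all_active = storage_cost(total_gb, 100)  # everything kept on the active tier
tiered = storage_cost(total_gb, 20)       # hypothetical 20% active, 80% historical
savings = all_active - tiered
```

Under these assumed inputs, keeping everything active costs $2,500,000, while the tiered layout costs $580,000.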

Case Study: On-Boarding Compliance. Thousands of rules, 1-2M accounts, 30-40M documents; Encoding, adjusting, and matching rules must scale; Impossible to pre-define dimensions, relationships; Vet new accounts and show your work; Real-time decision-making SLIDE: 23
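"Vet new accounts and show your work" suggests that each decision should carry a record of which rules were evaluated and how they came out. A minimal sketch of that idea follows; the `Rule` class, the `vet_account` function, the two sample rules, and the account fields are all hypothetical and are not taken from the case study.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass(frozen=True)
class Rule:
    """Hypothetical compliance rule; check returns True when the account passes."""
    rule_id: str
    description: str
    check: Callable[[dict], bool]


def vet_account(account, rules):
    """Evaluate every rule and keep the per-rule outcome as an audit trail."""
    findings = [(r.rule_id, r.description, r.check(account)) for r in rules]
    approved = all(passed for _, _, passed in findings)
    return approved, findings


RULES = [
    Rule("R1", "identity document on file",
         lambda a: "passport" in a.get("documents", [])),
    Rule("R2", "country is not restricted",
         lambda a: a.get("country") not in {"XX"}),
]

account = {"country": "US", "documents": ["utility_bill"]}
approved, findings = vet_account(account, RULES)
failed = [rule_id for rule_id, _, passed in findings if not passed]
```

Returning the full findings list, rather than only the verdict, is what lets a reviewer see why an account was rejected.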

Old Generation On-Boarding Compliance [diagram labels: Regulations; Policies; Documents] SLIDE: 24

New Generation On-Boarding Compliance [diagram labels: MarkLogic; Onboarding Workflow; Regulations; Policies; Documents] SLIDE: 25

Semantics: Facts about the data, expressed in a flexible triple format; Bring context to the content; Improve query precision with search and semantic queries SLIDE: 26

Data Provenance Using Semantics
<Trade>
  <Cashflows>
    <PartyIdentifier>
      <TradeID>123456</TradeID>
    </PartyIdentifier>
  </Cashflows>
  <provenance>
    <triple>
      <subject>Cashflows</subject>
      <predicate>wasderivedfrom</predicate>
      <object>CDS_xyz</object>
    </triple>
    <triple>
      <subject>TradeID</subject>
      <predicate>wasattributedto</predicate>
      <object>System_123</object>
    </triple>
  </provenance>
</Trade>
SLIDE: 27
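To show how provenance embedded this way might be read back out, here is a minimal sketch that parses the slide's XML with Python's standard `xml.etree.ElementTree` module and collects the subject, predicate, object triples. The helper names `extract_triples` and `provenance_of` are hypothetical, and this is generic XML parsing rather than a MarkLogic semantic query.

```python
import xml.etree.ElementTree as ET

# The document from the slide, with the provenance triples stored inline.
TRADE_XML = """
<Trade>
  <Cashflows>
    <PartyIdentifier>
      <TradeID>123456</TradeID>
    </PartyIdentifier>
  </Cashflows>
  <provenance>
    <triple>
      <subject>Cashflows</subject>
      <predicate>wasderivedfrom</predicate>
      <object>CDS_xyz</object>
    </triple>
    <triple>
      <subject>TradeID</subject>
      <predicate>wasattributedto</predicate>
      <object>System_123</object>
    </triple>
  </provenance>
</Trade>
"""


def extract_triples(xml_text):
    """Return every (subject, predicate, object) triple found in the document."""
    root = ET.fromstring(xml_text.strip())
    triples = []
    for triple in root.iter("triple"):
        parts = tuple(triple.findtext(tag, default="").strip()
                      for tag in ("subject", "predicate", "object"))
        triples.append(parts)
    return triples


def provenance_of(triples, subject):
    """List the (predicate, object) facts recorded about one subject."""
    return [(p, o) for s, p, o in triples if s == subject]


triples = extract_triples(TRADE_XML)
trade_id_facts = provenance_of(triples, "TradeID")
```

Because the triples travel inside the trade document, the lineage of a field can be answered from the same record that holds the data.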

Case Study: Dodd-Frank Compliance. Trace lineage of order lifecycle for OTC derivatives; Search, link supporting communications, documents; Strict reporting and retention rules, response times; Existing policies, point solutions don't scale SLIDE: 28

Old Generation Regulatory Compliance [diagram labels: Email; Operations; Reporting; Trade Records; Categorization; Linking; Reference Data] SLIDE: 29

New Generation Regulatory Compliance [diagram labels: Email; Metadata; Reporting; Surveillance; Ad hoc analysis; Operations; Trade Records; Reference Data; MarkLogic; Categorization; Enrichment; Linking] SLIDE: 30

Enrichment and Linking SLIDE: 31

Management Dashboard SLIDE: 32

Recap SLIDE: 33

New Generation Data Governance: Security, Retention, Privacy, Provenance, Continuity, Compliance SLIDE: 34

Parting Thoughts: One way to do more with less is to do less... ETL, schema design, database maintenance... to spend less on complex systems and human intervention, and to focus more on data governance AND business agility SLIDE: 35

Thank You SLIDE: 36