Addressing Risk Data Aggregation and Risk Reporting Ben Sharma, CEO. Big Data Everywhere Conference, NYC November 2015



Similar documents
Capitalize on Big Data for Competitive Advantage with Bedrock TM, an integrated Management Platform for Hadoop Data Lakes

Ganzheitliches Datenmanagement

Data Governance in the Hadoop Data Lake. Michael Lang May 2015

Data Governance in the Hadoop Data Lake. Kiran Kamreddy May 2015

BIG DATA AND THE ENTERPRISE DATA WAREHOUSE WORKSHOP

End to End Solution to Accelerate Data Warehouse Optimization. Franco Flore Alliance Sales Director - APJ

Datenverwaltung im Wandel - Building an Enterprise Data Hub with

Apache Hadoop in the Enterprise. Dr. Amr Awadallah,

Hadoop Data Hubs and BI. Supporting the migration from siloed reporting and BI to centralized services with Hadoop

Using Tableau Software with Hortonworks Data Platform

Apigee Insights Increase marketing effectiveness and customer satisfaction with API-driven adaptive apps

North Highland Data and Analytics. Data Governance Considerations for Big Data Analytics

Increase Agility and Reduce Costs with a Logical Data Warehouse. February 2014

Luncheon Webinar Series May 13, 2013

Decoding the Big Data Deluge a Virtual Approach. Dan Luongo, Global Lead, Field Solution Engineering Data Virtualization Business Unit, Cisco

BIG DATA: FIVE TACTICS TO MODERNIZE YOUR DATA WAREHOUSE

Big Data Architecture & Analytics A comprehensive approach to harness big data architecture and analytics for growth

More Data in Less Time

The Enterprise Data Hub and The Modern Information Architecture

BIG DATA ANALYTICS REFERENCE ARCHITECTURES AND CASE STUDIES

Protecting Big Data Data Protection Solutions for the Business Data Lake

Testing Big data is one of the biggest

Associate Professor, Department of CSE, Shri Vishnu Engineering College for Women, Andhra Pradesh, India 2

What's New in SAS Data Management

Descriptive to Predictive to Prescriptive Analytics: Move Up the Value Chain. Suren Nathan CTO

How To Use Hp Vertica Ondemand

By Makesh Kannaiyan 8/27/2011 1

Detect & Investigate Threats. OVERVIEW

Databricks. A Primer

This Symposium brought to you by

Data Governance and Big Data - A Necessary Convergence. Richard Goldberg Chief Data Governance Officer Citibank Global Consumer Bank

BIG DATA TECHNOLOGY. Hadoop Ecosystem

Data Integration Checklist

Hadoop & Spark Using Amazon EMR

How the oil and gas industry can gain value from Big Data?

Trustworthiness of Big Data

Dashboard Engine for Hadoop

Traditional BI vs. Business Data Lake A comparison

The Future of Data Management with Hadoop and the Enterprise Data Hub

QLIKVIEW DEPLOYMENT FOR BIG DATA ANALYTICS AT KING.COM

Are You Big Data Ready?

Getting Started Practical Input For Your Roadmap

How To Use Big Data For Business

Databricks. A Primer

Next-Generation Cloud Analytics with Amazon Redshift

NEWLY EMERGING BEST PRACTICES FOR BIG DATA

IBM Big Data Platform

Klarna Tech Talk: Mind the Data! Jeff Pollock InfoSphere Information Integration & Governance

Putting Apache Kafka to Use!

Data Integration Hub

Data Virtualization A Potential Antidote for Big Data Growing Pains

Managing Big Data with Hadoop & Vertica. A look at integration between the Cloudera distribution for Hadoop and the Vertica Analytic Database

Time series IoT data ingestion into Cassandra using Kaa

How to Run a Successful Big Data POC in 6 Weeks

Oracle Big Data Building A Big Data Management System

Qlik Sense Enabling the New Enterprise

European Archival Records and Knowledge Preservation Database Archiving in the E-ARK Project

Constructing a Data Lake: Hadoop and Oracle Database United!

The Future of Data Management

Bringing Strategy to Life Using an Intelligent Data Platform to Become Data Ready. Informatica Government Summit April 23, 2015

Turkish Journal of Engineering, Science and Technology

Your Path to. Big Data A Visual Guide

Roadmap Talend : découvrez les futures fonctionnalités de Talend

Information Architecture

Informatica Platform v10 for: Next Generation Analytics Cloud Modernization Data Archiving. Presented by Ilya Gershanov

Oracle s Big Data solutions. Roger Wullschleger. <Insert Picture Here>

IBM AND NEXT GENERATION ARCHITECTURE FOR BIG DATA & ANALYTICS!

WHITEPAPER. A Technical Perspective on the Talena Data Availability Management Solution

Optimized for the Industrial Internet: GE s Industrial Data Lake Platform

Beyond Lambda - how to get from logical to physical. Artur Borycki, Director International Technology & Innovations

90% of your Big Data problem isn t Big Data.

BIG DATA GOVERNANCE: BALANCING BIG DATA VELOCITY & INFORMATION GOVERNANCE

Oracle Big Data Spatial & Graph Social Network Analysis - Case Study

Oracle Data Integrator 12c: Integration and Administration

Trends and Research Opportunities in Spatial Big Data Analytics and Cloud Computing NCSU GeoSpatial Forum

Oracle Data Integrator 11g: Integration and Administration

Five Steps to Integrate SalesForce.com with 3 rd -Party Systems and Avoid Most Common Mistakes

Agile Business Intelligence Data Lake Architecture

Data Refinery with Big Data Aspects

3 Top Big Data Use Cases in Financial Services

BIG DATA CAN DRIVE THE BUSINESS AND IT TO EVOLVE AND ADAPT RALPH KIMBALL BUSSUM 2014

PLATFORA INTERACTIVE, IN-MEMORY BUSINESS INTELLIGENCE FOR HADOOP

Chapter 6 Basics of Data Integration. Fundamentals of Business Analytics RN Prasad and Seema Acharya

BIG DATA: FROM HYPE TO REALITY. Leandro Ruiz Presales Partner for C&LA Teradata

Hadoop in the Hybrid Cloud

Il mondo dei DB Cambia : Tecnologie e opportunita`

VIEWPOINT. High Performance Analytics. Industry Context and Trends

Salesforce.com and MicroStrategy. A functional overview and recommendation for analysis and application development

Cloud Ready Data: Speeding Your Journey to the Cloud

Using distributed technologies to analyze Big Data

10 Biggest Causes of Data Management Overlooked by an Overload

PLATFORA SOLUTION ARCHITECTURE

Transcription:

Addressing Risk Data Aggregation and Risk Reporting Ben Sharma, CEO Big Data Everywhere Conference, NYC November 2015

Agenda 1. Challenges with Risk Data Aggregation and Risk Reporting (RDARR) 2. How a managed Hadoop Data Lake can serve as an ideal data acquisition hub for RDARR analytics and reporting 3. Architectural considerations and required capabilities BCBS 239 addressing Risk Data Aggregation and Risk Reporting 4. Potential solution leveraging Zaloni s Bedrock, a Hadoop data management platform

Overview of BCBS 239 and RDARR Basel Committee of Banking Supervision (BCBS) 239 Requires that the information used to drive decision making, captures all risks with appropriate accuracy and timeliness Overarching principles of effective risk management reporting and governance Data Governance Adaptability Data and IT architecture Frequency Accuracy and Integrity Distribution Completeness Review

Challenges with BCBS 239 compliance 1. Performance requirements: Computationally intensive models Systems must scale and retain security and resiliency 2. Large volumes of data: Demands to manage and record every transaction in real time Long term data retention requirements 3. Fragmented systems: Risk data is often scattered across the organization in silos Current relational stores have different schemas, limiting cross enterprise visibility 4. Cost pressures: Increasing cost of compliance while profits and budgets decline 5. Data Governance: One of the key aspects of RDARR is proper Data Governance

Modernizing your data architecture as the path to success A Hadoop Data Lake is the optimal underlying architecture: Provides the most scalable solution Dramatically more cost effective than traditional data storage solutions Enables you to deal with the volume, variety and velocity of data that is coming in Breaks down the silos built up through the traditional database architecture Potential challenges with a Hadoop Data Lake solution: Data Management: Metadata, Lineage, Data Quality, Automation for Data acquisition Does it have enterprise grade data integrity and security? (e.g. Access control, Data masking) Will it integrate in my existing data environment? (e.g. can data flow with required frequency for SLAs, etc.)

Data Lake reference architecture for RDARR Hadoop Data Lake Source Systems Transient Loading Zone Raw Data Refined Data Integrate to common format Data Validation Data Cleansing Aggregations Consumption Zone OLTP or ODS Enterprise Data Warehouse File Data DB Data Original unaltered data attributes Trusted Data Reference Data Master Data Logs (or other unstrctured data) ETL Extracts Tokenized Data Discovery Sandbox Data Wrangling Data Discovery Exploratory Analytics Business Analysts Researchers Data Scientists Streaming Cloud Services { } APIs Metadata Data Quality Data Catalog Security

Data Acquisition Framework for RDARR Automated Data Acquisition Framework providing timeliness of data Capture Metadata in all phases: Ingestion, Transformation Integration with Enterprise Metadata Management Integrated Data Quality Analysis Metadata repositories Metadata Management solution Register/ update metadata Source Systems RDBMS Extract/ Read metadata Operational Metadata Generation Mainframes Data Acquisition Automation Data Ingestion Data Quality and Validation Layout Standardization Flat files Binary files Data at Rest

Metadata Registration Considerations: Business metadata: Business names, descriptions, tags, quality and masking rules Operational metadata: Source and target locations of data, size, number of records, lineage Technical metadata: Type of data (text, JSON, Avro), structure of the data (the fields and their types) Integration with Enterprise Metadata Management Solutions Edge-node to Cluster metadata file START API retrieve metadata origin info, timestamp, etc. Enterprise Metadata Repositories Hadoop Cluster Metadata check-in copy to repository add tags END operational metadata file

Data Transformation and Aggregation Considerations: Layout Standardization Create rationalized data models for RDARR Data Completeness - Capture and aggregate all material risk data across the banking group. Transformation Raw data from multiple sources is aggregated to create risk reports. Timeliness and Frequency - Generate aggregate and up-to-date risk data in a timely manner. Security and Controls Masking for PII Access controls for Risk Reports Risk Reports: Credit Risk Market Risk Liquidity Risk Capital Risk Stress Testing

Non functional considerations Ability to handle a variety of input sources and output destinations Handle fluctuations in input data High throughput and low latency handling Validation and tagging on the fly Preserve order Fault tolerance Non-stop modifications Simple to build and operate

Managed Data Lake enabled for RDARR Data Types Edge Node Data Lake Consumers Relational Change Data Data Analytical Applications Streaming File Stream Adapters File Collectors Hadoop Cluster Export Enterprise Data Warehouse Apps/ Analytics Tools Data Sources Portfolios Positions Market Data Social Enterprise Data Bedrock Application Manager Configure Ingestion Operations and Metadata Store Transformations Bedrock Applications Manager Administer Metadata Data Quality & Rules Engine Query Builder Manage, Monitor, Schedule Work flow Executor Risk reporting: Credit Risk Market Risk Liquidity Risk Capital Risk Stress Testing Scorecards Enterprise Reports

Benefits Addressing some of the key principles of BCBS 239: Data architecture and IT infrastructure that supports normal and high stress scenarios Accuracy and integrity of data for effective risk management and decision-making Completeness of data to ensure informed decision-making Additional benefits beyond demonstrating compliance: Banks can make timely, defensible, informed decisions related to Risk exposure. Reduced probability and severity of loss from fraud, etc. Improved strategic planning and ability to launch new products and services Reduced cost compared to traditional EDW solutions (60-70% OPEX reduction over 3 years)

Join us for Upcoming Webinar Governance of the Big Data Lake with a focus on RDARR for Financial Services Identifying Critical Data Elements for RDARR Establishing Data Standards for Critical Data Elements Supporting data lineage for Big Data to support regulatory compliance Managing Data Quality Hardening Information Security Register at www.zaloni.com Speakers: Sunil Soares, Principle, Information Asset Ben Sharma, CEO, Zaloni

Visit zaloni.com or Contact us at info@zaloni.com