How Predictive Analytics & Big Data are Disrupting Financial Services



Transcription:

#1 Agile Predictive Analytics Platform for Today's Modern Analysts
How Predictive Analytics & Big Data are Disrupting Financial Services
Vamsi Chemitiganti, General Manager, Financial Services

State of Global Banking

© 2016 RapidMiner, Inc. All rights reserved.

Financial Services and Big Data
Focused around business and technology vectors

Technology vectors
  Cloud computing (OpenStack)
  DevOps and PaaS
  Mobility
  Big Data and analytics
  BPM and microservices
  Software-defined datacenters

Digital Bank, Bank 3.0s

Business vectors
  Regulation and risk management
  Compliance and regulation
  Trading systems
  Omni-channel wealth management
  Payments systems
  Bank 3.0

Areas of Impact

Key areas within the financial services industry

Lifecycle of Big Data adoption
HDP helps FSIs drive efficiency gain..

Predictive Analytics on Hadoop
Which would you prefer?

Write data prep and predictive analytics code for Hadoop
  It's complex and requires programming and specialized knowledge of each Hadoop technology.

Push automatically generated computations into Hadoop
  It's code-free, speaks Hadoop for you, and is 10-40x faster to implement.

Impact of RapidMiner & Data Science
A representative sample only

Survey of ML algorithms used (stated briefly for confidentiality purposes)
  Classification & Class Probability Estimation
  Regression
  Similarity Matching
  Clustering
  Co-Occurrence Grouping
  Profiling
  Link Prediction
  Causal Modeling

Most use cases typically revolve around a single view of an entity.
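To make one of the algorithm families on this slide concrete, here is a minimal sketch of co-occurrence grouping in plain Python. It is not taken from the presentation: the function name `co_occurrence_counts` and the product-holdings data are hypothetical, chosen only to show the idea of counting which items tend to appear together.

```python
from collections import Counter
from itertools import combinations


def co_occurrence_counts(baskets):
    """Count how often each unordered pair of items appears in the same basket."""
    counts = Counter()
    for basket in baskets:
        # sorted(set(...)) drops duplicates and gives each pair one canonical order
        for pair in combinations(sorted(set(basket)), 2):
            counts[pair] += 1
    return counts


# Hypothetical product holdings per customer (illustrative data only)
holdings = [
    ["checking", "credit_card", "mortgage"],
    ["checking", "credit_card"],
    ["checking", "brokerage"],
    ["credit_card", "mortgage"],
]

pair_counts = co_occurrence_counts(holdings)
for pair, count in pair_counts.most_common():
    print(pair, count)
```

Pairs with high counts are candidates for cross-sell rules. At real data volumes the same counting would be expressed as a distributed aggregation rather than an in-memory loop.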

Digital Transformation

The digital journey in banking

Cyber Security

Cyber security

Customer Segmentation

Customer segmentation process

Regulatory Risk Management

Proposed Solution: Hortonworks Data Platform
[Architecture diagram; labels recovered from the slide]

Sources
  Golden Source & Feeds: Master Data, Transaction, Balances, Contracts, Positions, Market Data
  Ingestion: sqoop / hadoop fs / nfs; Kafka/Storm (Java/Scala)

LANDING DATA ZONE (L0)
  Raw Data (hdfs), Original Data (hdfs), Unstructured Data (hdfs)

STANDARDIZED DATA ZONE (L1)
  Standardized Data (Hive/ORC), produced with Hive/Spark/Scala

CANONICAL DATA ZONE (L2)
  Canonical Transaction Data (Hive/ORC), Canonical Position Data (Hive/ORC), produced with Hive/Spark/Scala

REPORTING/ANALYTICS ZONE (L3)
  Scenarios Results (Hive/ORC), Data Aggregations (Hive/ORC/HBase), Analytics/Reports (Hive/ORC/HBase), produced with Scala/Python/R, Scala/Java and Hive/Spark

Other data sets shown
  Materialized View (Hive/ORC), Revision History (Hive/ORC), Factors/Scenarios, TBD??

Outputs
  Regulatory Reports, Internal Reports, External Reports, Search

Common Repositories/Metadata Management: Apache Atlas/Falcon/Custom Solution
Security: Apache Ranger/Atlas and Custom/Partner Solution
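The L0 to L3 zone progression on this slide can be illustrated with a small, self-contained sketch. This is not the presenter's implementation: the slide names Hive, Spark and Scala for these steps, whereas the code below uses plain Python purely to show what each hop does. The record layout, the function names and the exchange rate are all assumptions made up for the example.

```python
from collections import defaultdict


def standardize(raw_line):
    """L0 -> L1: parse a raw delimited line into typed, consistently named fields."""
    account, currency, amount = raw_line.strip().split("|")
    return {
        "account": account.strip().upper(),
        "currency": currency.strip().upper(),
        "amount": float(amount),
    }


def to_canonical(record, fx_to_usd):
    """L1 -> L2: express every transaction in one canonical currency (USD here)."""
    return {
        "account": record["account"],
        "amount_usd": record["amount"] * fx_to_usd[record["currency"]],
    }


def aggregate_balances(canonical_records):
    """L2 -> L3: aggregate canonical transactions into per-account balances."""
    balances = defaultdict(float)
    for rec in canonical_records:
        balances[rec["account"]] += rec["amount_usd"]
    return dict(balances)


# Hypothetical raw feed as it might land in the L0 zone
raw_zone = ["acc-1|usd|100.0", "acc-2|eur|50.0", "ACC-1|USD|-25.0"]
fx_to_usd = {"USD": 1.0, "EUR": 2.0}  # made-up rates, chosen for simple arithmetic

standardized_zone = [standardize(line) for line in raw_zone]
canonical_zone = [to_canonical(rec, fx_to_usd) for rec in standardized_zone]
reporting_zone = aggregate_balances(canonical_zone)
print(reporting_zone)
```

Keeping each hop as a separate function mirrors the zone boundaries: every zone's output can be stored and audited before the next transformation runs, which is the property a regulatory reporting pipeline depends on.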

AML Compliance

Fraud/AML/Compliance Reference architecture

Fraud Monitoring & Detection

Fraud Detection Reference architecture

Modern Data Architecture with HWX and RM

RapidMiner Radoop: Big Data Predictive Analytics
Extends RapidMiner's visual predictive analytics to Hadoop and Spark

We speak Hadoop so you don't have to
  Translates predictive analytics into native Hive, MapReduce, Spark, Pig and Mahout, so you concentrate on competitive analytics, not Hadoop programming.

Complete insights into your Big Data
  Pushes analytic instructions into Hadoop for computation, so you can analyze the full breadth and variety of your Big Data, structured and non-structured.

Not just drag & drop: use your favorite Hadoop scripts, too!
  Incorporates your favorite SparkR, PySpark, Pig and HiveQL scripts within your predictive analytics workflow.

Safe and sound
  Integrates with Kerberos authentication and supports data access authorization for Apache Sentry and Apache Ranger: seamless for users, easy admin for IT.
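The idea of pushing computation into Hadoop rather than pulling data out can be sketched as query generation: a declarative description of an aggregation is turned into a HiveQL string that the cluster would then execute. The sketch below is an illustration only and does not represent Radoop's internals or API; `build_hive_aggregate`, the table name and the column names are hypothetical.

```python
def build_hive_aggregate(table, group_by, metrics):
    """Build a HiveQL GROUP BY query from a declarative spec.

    metrics maps an output alias to a (function, column) pair,
    for example {"avg_amount": ("avg", "amount")}.
    """
    select_parts = list(group_by)
    for alias, (func, column) in metrics.items():
        select_parts.append(f"{func.upper()}({column}) AS {alias}")
    return (
        f"SELECT {', '.join(select_parts)} "
        f"FROM {table} "
        f"GROUP BY {', '.join(group_by)}"
    )


# Hypothetical table and columns, used only to show the generated text
query = build_hive_aggregate(
    table="transactions",
    group_by=["customer_id"],
    metrics={"txn_count": ("count", "*"), "avg_amount": ("avg", "amount")},
)
print(query)
```

Only the query text travels to the cluster and only the aggregated result comes back, so the full data set never has to leave Hadoop. A production generator would also need to validate or quote identifiers instead of interpolating them directly.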

Vamsi Chemitiganti
General Manager, Financial Services, Hortonworks
vchemitiganti@hortonworks.com