Big Data in Banking: Hype or Future?

Similar documents
BIG DATA What it is and how to use?

Hadoop & SAS Data Loader for Hadoop

Of all the data in recorded human history, 90 percent has been created in the last two years. - Mark van Rijmenam, Think Bigger, 2014

Big Data, Applied. Overview of Big Data from our experiences at Telefonica s digital services. Jose Luis

Hadoop Beyond Hype: Complex Adaptive Systems Conference Nov 16, Viswa Sharma Solutions Architect Tata Consultancy Services

Danny Wang, Ph.D. Vice President of Business Strategy and Risk Management Republic Bank

The big data revolution

Distributed Systems. Lec 2: Example use cases: Cloud computing, Big data, Web services

AGENDA. What is BIG DATA? What is Hadoop? Why Microsoft? The Microsoft BIG DATA story. Our BIG DATA Roadmap. Hadoop PDW

Data are everywhere. IBM projects that every day we generate 2.5 quintillion bytes of data. In relative terms, this means 90

Big data and its transformational effects

Leveraging unstructured data for improved decision making: A retail banking perspective

Big Data. Fast Forward. Putting data to productive use

Leveraging Big Social Data

The? Data: Introduction and Future

Big Analytics: A Next Generation Roadmap

L1: Introduction to Hadoop

Harnessing Digital. November 2014

Driving Better Marketing Results with Big Data and Analytics David Corrigan, IBM, Director of Product Marketing

Social Networks in Data Mining: Challenges and Applications

Trends and Research Opportunities in Spatial Big Data Analytics and Cloud Computing NCSU GeoSpatial Forum

SOCIAL MEDIA FOR MSMEs A turning point. By DR. PRALAY DEY National Small Industries Corporation (NSIC)

Mike Maxey. Senior Director Product Marketing Greenplum A Division of EMC. Copyright 2011 EMC Corporation. All rights reserved.

A PERFORMANCE ANALYSIS of HADOOP CLUSTERS in OPENSTACK CLOUD and in REAL SYSTEM

Open source Google-style large scale data analysis with Hadoop

Social media has CHANGED THE WORLD as we know it by connecting people, ideas and products across the globe.

Harnessing the True Power of Data

W H I T E P A P E R. Deriving Intelligence from Large Data Using Hadoop and Applying Analytics. Abstract

Big Data Big Deal? Salford Systems

What is Big Data? Concepts, Ideas and Principles. Hitesh Dharamdasani

Analytics in the Cloud. Peter Sirota, GM Elastic MapReduce

Copyright 2014, Neudesic. All rights reserved.

Keywords Big Data; OODBMS; RDBMS; hadoop; EDM; learning analytics, data abundance.

Hadoop. MPDL-Frühstück 9. Dezember 2013 MPDL INTERN

Big Data Analytics. Prof. Dr. Lars Schmidt-Thieme

How to use Big Data in Industry 4.0 implementations. LAURI ILISON, PhD Head of Big Data and Machine Learning

Professional Diploma in Digital Marketing

Computing in clouds: Where we come from, Where we are, What we can, Where we go

Introduction to Big Data the four V's

Open source software framework designed for storage and processing of large scale data on clusters of commodity hardware

Big Data and Open Data

CSC590: Selected Topics BIG DATA & DATA MINING. Lecture 2 Feb 12, 2014 Dr. Esam A. Alwagait

Hadoop Usage At Yahoo! Milind Bhandarkar

Data Mining in the Swamp

Chapter 7. Using Hadoop Cluster and MapReduce

How To Learn To Use Big Data

INTRO TO BIG DATA. Djoerd Hiemstra. Big Data in Clinical Medicinel, 30 June 2014

Big Data Efficiencies That Will Transform Media Company Businesses

U.S. Mobile Benchmark Report

Ecommerce Tips - Part Two

Outstanding performance track record

Hadoop Distributed File System. T Seminar On Multimedia Eero Kurkela

Colleen s Interview With Ivan Kolev

Outline. What is Big data and where they come from? How we deal with Big data?

Big Data and Privacy in a Digital World

The Pioneer in Social Targeting. Marketing to Social Connections on the Web

WHITE PAPER: DATA DRIVEN MARKETING DECISIONS IN THE RETAIL INDUSTRY

Manifest for Big Data Pig, Hive & Jaql

Big Data & the Cloud: The Sum Is Greater Than the Parts

EXECUTIVE REPORT. Big Data and the 3 V s: Volume, Variety and Velocity

Trends in Business Intelligence

Introduction to Hadoop HDFS and Ecosystems. Slides credits: Cloudera Academic Partners Program & Prof. De Liu, MSBA 6330 Harvesting Big Data

Leveraging Big Data. A case study from Thomson Reuters

MapReduce, Hadoop and Amazon AWS

Opportunities with Predictive Analytics. Greg Leflar, Vice President

Executive Summary... 2 Introduction Defining Big Data The Importance of Big Data... 4 Building a Big Data Platform...

Hadoop for Enterprises:

Big Data and Analytics: Getting Started with ArcGIS. Mike Park Erik Hoel

Big Data Technology ดร.ช ชาต หฤไชยะศ กด. Choochart Haruechaiyasak, Ph.D.

CPS 216: Advanced Database Systems (Data-intensive Computing Systems) Shivnath Babu

The key to knowing the best price is to fully understand consumer behavior.

Hadoop IST 734 SS CHUNG

A Berkeley View of Big Data

Social media has changed the world as we know it by connecting people, ideas and products across the globe.

Big Data Scoring. April 2014

The Big Picture on Big Data. Princeton Section 307 Dinner Meeting December 11, 2013 Richard Herczeg

How To Scale Out Of A Nosql Database

Understanding Your Customer Journey by Extending Adobe Analytics with Big Data

Tap into Hadoop and Other No SQL Sources

# Not a part of 1Z0-061 or 1Z0-144 Certification test, but very important technology in BIG DATA Analysis

Online analytics survey

CIS 4930/6930 Spring 2014 Introduction to Data Science /Data Intensive Computing. University of Florida, CISE Department Prof.

Tutorial: Big Data Algorithms and Applications Under Hadoop KUNPENG ZHANG SIDDHARTHA BHATTACHARYYA

BIG DATA TRENDS AND TECHNOLOGIES

Introduction to Predictive Analytics. Dr. Ronen Meiri

The Next Wave of Data Management. Is Big Data The New Normal?

Age of Big data. Presented by: Mohammad Iqbal BCM -2014

AppSymphony White Paper

The Trends and Roadblocks in Retail e-commerce: A Recap of the 2012 etail West Conference by Mogreet

Social Media. Marketing Guide B2B

Big Data Big Data/Data Analytics & Software Development

Annex: Concept Note. Big Data for Policy, Development and Official Statistics New York, 22 February 2013

BITKOM& NIK - Big Data Wo liegen die Chancen für den Mittelstand?

Advanced Big Data Analytics with R and Hadoop

BIG DATA AND ANALYTICS

A Brief Outline on Bigdata Hadoop

INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY

Big Data & QlikView. Democratizing Big Data Analytics. David Freriks Principal Solution Architect

Leveraging Social Media

INTRODUCTION TO APACHE HADOOP MATTHIAS BRÄGER CERN GS-ASE

Transcription:

Big Data in Banking: Hype or Future? prof. David Martens david.martens@uantwerp.be www.applieddatamining.com

Agenda What s new about Big Data? Marketing applications in banking o Response modeling using payment data o Customer acquisition using browsing data Risk management applications in banking o Retail default prediction using Facebook data o SME Default prediction using board of director data Challenges for the future in banking

What is Big Data?

What is Big Data? Big Data: data that is so large that traditional data processing systems are unable to deal with it o Storage and analysis 4

What is Big Data? Hadoop o o Open-source framework for data-intensive distributed processing on commodity hardware Derived from MapReduce and Google File System (GFS) papers Big Data > Hadoop

What is Big Data? Google o 24,000 TB per day (2009) Pinterest o 20 TB per day Twitter o 12 GB per day or 800 tweets per second (2010) PrediCube, spinoff UA o 30 GB per day Bank payment data o 5-10 GB for all payment data of one year Is Banking data really Big?

What is Data Mining? Data mining: automatic extraction of knowledge from data Setting the scene with credit scoring example Data Data mining technique Pattern Predictions 7

What is Big Data? Transaction ID 0001 0002 0003 0004 0005....... 0052 0053 Items Bread, Milk, Apple Bread, Milk, Eggs, Pen Cold Drink, Chocolate, Milk Bread, Orange Fish, Vegetables....... Paper, Pencil Meat, Oil, Milk Data mining +

What is Big Data?

What is Big Data? Banking The story of Signet Bank o 1990 o Fairbanks and Morris o Model profitability, not just default o No interest from big banks o Signet Bank invested in data assets o Huge success, credit operations spinoff Now Capital One Defining and leveraging data assets

Agenda What s new about Big Data? Marketing applications in banking o Response modeling using payment data o Customer acquisition using browsing data Risk management applications in banking o Retail default prediction using Facebook data o SME Default prediction using board of director data Challenges for the future in banking

Case Study 1: Mining Payment Data From payment data to pseudo-social network o 21 million transactions o Response modeling o Significant improvement over traditional modeling Payment receivers customers Little Bookstore John John Alex An Pete DeliC Amazon Alex An Jeff SportCenterX Pete Jeff EnergyInc [David Martens, Foster Provost, Pseudo-social network targeting from consumer, transaction data, New York University - Stern School of Business - Working paper CeDER-11-05 Patent application PCT/US2011/028175]

Case Study 1: Mining Payment Data From payment data to pseudo-social network o 21 million transactions o Response modeling o Anonymized! customers X00123 Payment receivers M0011 X00123 X00560 X10353 X00056 M8463 M9963 X00560 X10353 X11333 M1365 X00056 X11333 M8005

Case Study 1: Mining Payment Data Application: response modeling Target variable on two products Pension fund and Long term deposit account `Socio-demographic data 289 variables Socio-demographic, product possession, product use, customer behavior When sending offer to top 1%: 3 times more conversions! Variety of (already available) data improves performance 14

Case Study 1: Mining Payment Data Is Bigger Data Better? SD: Using bank s structured data Bigger not better! Size of data set used for training (% of 1.2 million consumers in total) [Junqué de Fortuny, Martens and Provost 2013. Predictive Modeling with Big Data: Is Bigger Really Better? Big Data Journal 1(4): 215-226.]

Case Study 1: Mining Payment Data Is Bigger Data Better? Bigger is better! % of a data set of 1.2 million consumers used for training % of a data set of 1.2 million consumers used for training PSN: prediction based on data on fine-grained behavior SD: traditional predictive modeling based on socio-demographic data PSN + SD: ensemble model combining both Volume of (already available) data improves performance

Case Study 2: Customer Acquisition Predict product interest based on web browsing data o Show ad only to those that are predicted to be interested o Spinoff of UA Data: 1 billion records of persons visiting webpages Results: +20-300% conversions Work done with Dieter Devlaminck (PrediCube) www,predicube.com

Agenda What s new about Big Data? Marketing applications in banking o Response modeling using payment data o Customer acquisition using browsing data Risk management applications in banking o Retail default prediction using Facebook data o SME Default prediction using board of director data Challenges for the future in banking

Case Study 3: Default prediction with Facebook data Default prediction with Facebook data for micro-finance o o o In collaboration with NY-based Lenddo Philippines, 1000 5000 $ loans Facebook data (opt-in) Messages Likes Sociodemo Tags Friends Work done with Sofie De Cnudde, Ellen Tobback, Julie Moeyersoms, Marija Stankova (Universiteit Antwerpen) and Vinayak Javaly (Lenddo)

Cluster of befriended defaulters Case Study 3: Default prediction with Facebook data Friends network Cluster of befriended nondefaulters friends defaulter non-defaulter

Case Study 3: Default prediction with Facebook data Liking data Facebook page like defaulter non-defaulter

Case Study 3: Default prediction with Facebook data Facebook data is very predictive for default prediction Accuracy of predictions Behavioral data is more valuable than social network data

Case Study 4: SME default prediction SME default prediction Data (Belfirst) o 400.000+ SMEs o Financial + Board members and managers Network of companies Work done with Ellen Tobback, Julie Moeyersoms, Marija Stankova (Universiteit Antwerpen)

Case Study 4: SME default prediction Def Def Predictions based on profile of connected companies Def Def Def Def Def Def Def Example network of connected defaulting SMEs, due to two directors

Case Study 4: SME default prediction Big Data? Just 1% of the data

Case Study 4: SME default prediction Sample network of connected defaulting SMEs, with three clusters due to three persons

Case Study 4: SME default prediction Predictive power for default prediction o If we consider the 1% most riskiest SMEs, we can find 58% more defaulters Variety of (already available) data improves performance and insight

Agenda What s new about Big Data? Marketing applications in banking o Response modeling using payment data o Customer acquisition using browsing data Risk management applications in banking o Retail default prediction using Facebook data o SME Default prediction using board of director data Challenges for the future in banking

Privacy! Hey, you re having a baby! Target

Privacy vs data as an asset - a spectrum Using and selling all data Selling payment data Selling socio-demo data Selling anonymised data Using Facebook data Tracking users online Using anonymized payment data for marketing Using anonymized payment data for risk management Using socio-demo and credit data for risk management Nothing

Competition from Google & Co? Mainly in payment Data Fees and data asset Leveraging their existing data asset

Competition from FinTech? Niche products Loyal Belgians: ability of banks to retain their customers FinTech startups: collaborate or acquire

References Data Science for Business o By Foster Provost and Tom Fawcett o O Reilly Predictive Analytics: Techniques and Applications in Credit Risk Modelling o By Tony van Gestel, Bart Baesens and David Martens o Oxford University Press

Conclusion Big Data in Banking: Hype or Future? o Both! Big Data o Banks already quite advanced in data analyses o Don t be fooled by the hype What to do? o Define and leverage data assets! o Collaborate with emerging FinTech startups o Main challenge: attracting data scientists

Q&A Prof. dr. ir. David Martens Applied Data Mining Faculteit Toegepaste Economische Wetenschappen Universiteit Antwerpen E: david.martens@uantwerpen.be T: @ApplDataMining W: www.applieddatamining.com