Churn problem in retail banking Current methods in churn prediction models Fuzzy c-means clustering algorithm vs. classical k-means clustering

Similar documents

Machine Learning with MATLAB David Willingham Application Engineer

Segmentation of stock trading customers according to potential value

Using Data Mining for Mobile Communication Clustering and Characterization

There are a number of different methods that can be used to carry out a cluster analysis; these methods can be classified as follows:

CONTENTS PREFACE 1 INTRODUCTION 1 2 DATA VISUALIZATION 19

Data Mining Cluster Analysis: Basic Concepts and Algorithms. Lecture Notes for Chapter 8. Introduction to Data Mining

Data Mining Practical Machine Learning Tools and Techniques

Data Mining + Business Intelligence. Integration, Design and Implementation

Cluster Analysis. Alison Merikangas Data Analysis Seminar 18 November 2009

Fuzzy Signature Neural Network

Is a Data Scientist the New Quant? Stuart Kozola MathWorks

Data Mining. Nonlinear Classification

Data Mining Clustering (2) Sheets are based on the those provided by Tan, Steinbach, and Kumar. Introduction to Data Mining

Data Mining for Customer Service Support. Senioritis Seminar Presentation Megan Boice Jay Carter Nick Linke KC Tobin

Calculation of Minimum Distances. Minimum Distance to Means. Σi i = 1

Chapter 12 Discovering New Knowledge Data Mining

Hierarchical Cluster Analysis Some Basics and Algorithms

Azure Machine Learning, SQL Data Mining and R

Predictive Analytics Techniques: What to Use For Your Big Data. March 26, 2014 Fern Halper, PhD

STATISTICA. Clustering Techniques. Case Study: Defining Clusters of Shopping Center Patrons. and

TNS EX A MINE BehaviourForecast Predictive Analytics for CRM. TNS Infratest Applied Marketing Science

Enhanced Customer Relationship Management Using Fuzzy Clustering

Data Mining and Visualization

Clustering. Danilo Croce Web Mining & Retrieval a.a. 2015/201 16/03/2016

Data Mining Algorithms Part 1. Dejan Sarka

Enhanced Boosted Trees Technique for Customer Churn Prediction Model

Summary Data Mining & Process Mining (1BM46) Content. Made by S.P.T. Ariesen

A Neuro-Fuzzy Classifier for Customer Churn Prediction

Social Media Mining. Data Mining Essentials

Practical Data Science with Azure Machine Learning, SQL Data Mining, and R

Data Mining and Knowledge Discovery in Databases (KDD) State of the Art. Prof. Dr. T. Nouri Computer Science Department FHNW Switzerland

Information Retrieval and Web Search Engines

Clustering. Adrian Groza. Department of Computer Science Technical University of Cluj-Napoca

Knowledge Discovery and Data Mining

Data Mining 資料探勘. 分群分析 (Cluster Analysis)

COPYRIGHTED MATERIAL. Contents. List of Figures. Acknowledgments

MS1b Statistical Data Mining

Data Mining Cluster Analysis: Basic Concepts and Algorithms. Clustering Algorithms. Lecture Notes for Chapter 8. Introduction to Data Mining

Clustering UE 141 Spring 2013

Data Mining: Overview. What is Data Mining?

How To Make A Credit Risk Model For A Bank Account

Robust Outlier Detection Technique in Data Mining: A Univariate Approach

Random Forest Based Imbalanced Data Cleaning and Classification

Why is Internal Audit so Hard?

Meta-learning. Synonyms. Definition. Characteristics

WebFOCUS RStat. RStat. Predict the Future and Make Effective Decisions Today. WebFOCUS RStat

An Overview of Knowledge Discovery Database and Data mining Techniques

CRM Analytics for Telecommunications

Data Mining Cluster Analysis: Basic Concepts and Algorithms. Lecture Notes for Chapter 8. Introduction to Data Mining

Introduction to Data Mining

The Impact of Big Data on Classic Machine Learning Algorithms. Thomas Jensen, Senior Business Expedia

Using multiple models: Bagging, Boosting, Ensembles, Forests

Chapter 6. The stacking ensemble approach

Maschinelles Lernen mit MATLAB

Predictive Modeling and Big Data

Facebook Friend Suggestion Eytan Daniyalzade and Tim Lipus

Machine Learning for Data Science (CS4786) Lecture 1

Fortgeschrittene Computerintensive Methoden: Fuzzy Clustering Steffen Unkel

Steven M. Ho!and. Department of Geology, University of Georgia, Athens, GA

Decision Support System Methodology Using a Visual Approach for Cluster Analysis Problems

Data Mining - Evaluation of Classifiers

KATE GLEASON COLLEGE OF ENGINEERING. John D. Hromi Center for Quality and Applied Statistics

Neural Networks Lesson 5 - Cluster Analysis

An Introduction to Data Mining

1. Classification problems

MHI3000 Big Data Analytics for Health Care Final Project Report

Location matters. 3 techniques to incorporate geo-spatial effects in one's predictive model

W6.B.1. FAQs CS535 BIG DATA W6.B If the distance of the point is additionally less than the tight distance T 2, remove it from the original set

How to use Big Data in Industry 4.0 implementations. LAURI ILISON, PhD Head of Big Data and Machine Learning

KnowledgeSTUDIO HIGH-PERFORMANCE PREDICTIVE ANALYTICS USING ADVANCED MODELING TECHNIQUES

Predictive Modeling Techniques in Insurance

Introduction to Data Mining and Machine Learning Techniques. Iza Moise, Evangelos Pournaras, Dirk Helbing

Data Mining for Fun and Profit

Bank Customers (Credit) Rating System Based On Expert System and ANN

Clustering Artificial Intelligence Henry Lin. Organizing data into clusters such that there is

How To Solve The Cluster Algorithm

Cluster Analysis: Advanced Concepts

A Novel Fuzzy Clustering Method for Outlier Detection in Data Mining

A Study of Web Log Analysis Using Clustering Techniques

Introduction to Clustering

Analysis of kiva.com Microlending Service! Hoda Eydgahi Julia Ma Andy Bardagjy December 9, 2010 MAS.622j

SPSS Tutorial. AEB 37 / AE 802 Marketing Research Methods Week 7

Distances, Clustering, and Classification. Heatmaps

Decision Trees from large Databases: SLIQ

Predicting outcome of soccer matches using machine learning

A Comparative Study of clustering algorithms Using weka tools

Lecture/Recitation Topic SMA 5303 L1 Sampling and statistical distributions

Essential Components of an Integrated Data Mining Tool for the Oil & Gas Industry, With an Example Application in the DJ Basin.

A Case of Study on Hadoop Benchmark Behavior Modeling Using ALOJA-ML

DATA MINING TECHNIQUES AND APPLICATIONS

A successful market segmentation initiative answers the following critical business questions: * How can we a. Customer Status.

ON INTEGRATING UNSUPERVISED AND SUPERVISED CLASSIFICATION FOR CREDIT RISK EVALUATION

COC131 Data Mining - Clustering

A Study Of Bagging And Boosting Approaches To Develop Meta-Classifier

Transcription:

CHURN PREDICTION MODEL IN RETAIL BANKING USING FUZZY C- MEANS CLUSTERING Džulijana Popović Consumer Finance, Zagrebačka banka d.d. Bojana Dalbelo Bašić Faculty of Electrical Engineering and Computing University of Zagreb

Overview Theoretical basis Churn problem in retail banking Current methods in churn prediction models Fuzzy c-means clustering algorithm vs. classical k-means clustering algorithm

Study and results Canonical discriminant analysis in outliers detection and variables selection Poor results of hierarchical clustering and crisp k-means algorithm Very good results of the fuzzy c-means algorithm Introduction of fuzzy transitional conditions of the 1st and of the 2nd degree and the sums of membership functions from distance of k instances (abb. DOKI sums) Final models results Conclusions

Churn problem in retail banking No unique definition - generally, term churn refers to all types of customer attrition whether voluntary or involuntary Precise definitions of the churn event and the churner are crucial In this study: moment of churn is the moment when client cancels ( closes ) his last product or service in the bank churner is client having at least one product at time t n and having no product at time t n+1 If client still holds at least one product at time t n+1 - non-churner

Current methods in churn prediction models Logistic regression Survival analysis Decision trees Neural networks Random forests To the best of our knowledge no fuzzy logic based clustering for churn prediction in banking industry!

Fuzzy c-means clustering algorithm vs. classical k-means clustering algorithm Possible advantages of fuzzy c-means: More robust against outliers presence High true positives rate and acceptable accuracy after just a few iterations Additional information hidden in the values of the membership functions Fuzzy nature of the problem requires fuzzy methods

Canonical discriminant analysis (CDA) in outliers detection and variables selection Final data set: 5000 individual clients of the retail bank Classes: 2500 churners vs. 2500 non-churners CDA helped a lot in: variable selection process outlier detection and their further analysis graphical exploration of different data samples

Results of CDA applied on the data set with churners (black), non-churners (red) and returners (green)

Results of CDA applied on the data set with only churners (black) and non-churners (red) and variables in t 0 and t 2

Results of hierarchical clustering and crisp k-means algorithm were very poor, especially for crisp k-means k-means algorithm broke on even modest outliers only Ward s method and Flexible Beta method performed better NOTE: removing outliers from the database will not always be possible and desirable in the real banking situations churn prediction becomes extremely important in periods of financial crises models need to be robust, stable and fast

Results of the classical clustering in terms of true positives, false negatives, accuracy and specificity

Dendrogram of the Average Linkage method and standardization with range shows typical problem of hierarchical clustering: chaining

Dendrogram of Ward s Minimum Variance method and standardization with range

Results of the fuzzy c-means were significantly better than the results of classical clustering, regarding true positives, false positives and accuracy (z-test) 10 different values of the fuzzification parameter m were applied different number of iterations were tested fast reaction is very important in banking industry! in order to improve the prediction results three definitions were introduced: fuzzy transitional condition of the 1st and of the 2nd degree distance of k instances fuzzy sum (DOKI sum)

Results of the fuzzy c-means with different values of the fuzzification parameter m Value m=1,25 chosen for application on training data set, due to the highest true positives rate (significance in difference tested)

Final models results PE = Prediction Engine PE-1: apply fuzzy c-means algorithm to the training dataset; find the best parameter m; add new clients from the validation set and reapply fuzzy c-means PE-2: apply fuzzy c-means algorithm to the training dataset; extract only correctly classified clients; add new clients from the validation set and reapply fuzzy c-means; PE-3: apply fuzzy c-means algorithm on the training dataset; new client from the validation set belongs to the cluster of his 1st nearest neighbor

PE-4: apply fuzzy c-means to the training dataset; for every new client from the validation set find k nearest neighbors and calculate DOKI sums; client belongs to the cluster with highest value of DOKI sum PE-4 model applying DOKI sums performed best, no matter if tested on balanced or non-balanced test sets PE-2 had insignificantly lower tp rate, but is at least twice slower than PE-4 and every delay in the reaction increases the losses!

Conclusions classical clustering methods totally failed on the real banking data due to the modest outliers fuzzy c-means algorithm showed great robustness in outlier presence introduction of DOKI sums significantly improved churn prediction in comparison to other fuzzy models introduction of fuzzy transitional conditions revealed hidden information about product characteristics of these clients fuzzy methods can be successfuly applied on banking data

Questions? dzulijana.popovic@unicreditgroup.zaba.hr bojana.dalbelo@fer.hr