Business Analytics and Credit Scoring



Similar documents
Credit Risk Models. August 24 26, 2010

EXPLORING & MODELING USING INTERACTIVE DECISION TREES IN SAS ENTERPRISE MINER. Copyr i g ht 2013, SAS Ins titut e Inc. All rights res er ve d.

Customer and Business Analytic

Application of SAS! Enterprise Miner in Credit Risk Analytics. Presented by Minakshi Srivastava, VP, Bank of America

STATISTICA. Financial Institutions. Case Study: Credit Scoring. and

Revenue s Business Context

The Predictive Data Mining Revolution in Scorecards:

WHITEPAPER. How to Credit Score with Predictive Analytics

Insurance Analytics - analýza dat a prediktivní modelování v pojišťovnictví. Pavel Kříž. Seminář z aktuárských věd MFF 4.

Index Contents Page No. Introduction . Data Mining & Knowledge Discovery

not possible or was possible at a high cost for collecting the data.

Principles of Data Mining by Hand&Mannila&Smyth

Applied Data Mining Analysis: A Step-by-Step Introduction Using Real-World Data Sets

Data Mining Methods: Applications for Institutional Research

ON INTEGRATING UNSUPERVISED AND SUPERVISED CLASSIFICATION FOR CREDIT RISK EVALUATION

Data Mining Algorithms Part 1. Dejan Sarka

Machine Learning Logistic Regression

COPYRIGHTED MATERIAL. Contents. List of Figures. Acknowledgments

A Basic Guide to Modeling Techniques for All Direct Marketing Challenges

Data are everywhere. IBM projects that every day we generate 2.5 quintillion bytes of data. In relative terms, this means 90

Statistics in Retail Finance. Chapter 6: Behavioural models

Data Mining. Nonlinear Classification

KnowledgeSTUDIO HIGH-PERFORMANCE PREDICTIVE ANALYTICS USING ADVANCED MODELING TECHNIQUES

An Overview of Data Mining: Predictive Modeling for IR in the 21 st Century

Banking Analytics Training Program

TNS EX A MINE BehaviourForecast Predictive Analytics for CRM. TNS Infratest Applied Marketing Science

Data mining and statistical models in marketing campaigns of BT Retail

Digging for Gold: Business Usage for Data Mining Kim Foster, CoreTech Consulting Group, Inc., King of Prussia, PA

Data Mining with SAS. Mathias Lanner Copyright 2010 SAS Institute Inc. All rights reserved.

CoolaData Predictive Analytics

The future of credit card underwriting. Understanding the new normal

Potential Value of Data Mining for Customer Relationship Marketing in the Banking Industry

MERGING BUSINESS KPIs WITH PREDICTIVE MODEL KPIs FOR BINARY CLASSIFICATION MODEL SELECTION

Cross-Tab Weighting for Retail and Small-Business Scorecards in Developing Markets

Comparison of Data Mining Techniques used for Financial Data Analysis

Master of Science in Marketing Analytics (MSMA)

Prediction of Stock Performance Using Analytical Techniques

Behavior Model to Capture Bank Charge-off Risk for Next Periods Working Paper

Discovering, Not Finding. Practical Data Mining for Practitioners: Level II. Advanced Data Mining for Researchers : Level III

Data Mining. 1 Introduction 2 Data Mining methods. Alfred Holl Data Mining 1

The Data Mining Process

Predictive modelling around the world

Chapter 12 Discovering New Knowledge Data Mining

UNDERSTANDING THE EFFECTIVENESS OF BANK DIRECT MARKETING Tarun Gupta, Tong Xia and Diana Lee

Use of Data Mining in Banking

from Larson Text By Susan Miertschin

USING LOGIT MODEL TO PREDICT CREDIT SCORE

ANALYTICS CENTER LEARNING PROGRAM

Model-Based Recursive Partitioning for Detecting Interaction Effects in Subgroups

An Introduction to Advanced Analytics and Data Mining

Statistics in Retail Finance. Chapter 2: Statistical models of default

Easily Identify Your Best Customers

Learning Example. Machine learning and our focus. Another Example. An example: data (loan application) The data and the goal

Data Analytical Framework for Customer Centric Solutions

DATA MINING IN BANKING AND FINANCE: A NOTE FOR BANKERS. Rajanish Dass Indian Institute of Management Ahmedabad rajanish@iimahd.ernet.in.

Data Mining for Business Intelligence. Concepts, Techniques, and Applications in Microsoft Office Excel with XLMiner. 2nd Edition

ElegantJ BI. White Paper. The Competitive Advantage of Business Intelligence (BI) Forecasting and Predictive Analysis

Predictive Analytics Techniques: What to Use For Your Big Data. March 26, 2014 Fern Halper, PhD

Data Mining in CRM & Direct Marketing. Jun Du The University of Western Ontario jdu43@uwo.ca

Statistical Data Mining. Practical Assignment 3 Discriminant Analysis and Decision Trees

Predicting Customer Default Times using Survival Analysis Methods in SAS

D&B integrate the data into our database through our patented Entity Matching, which produces a single accurate picture of each business.

Data Mining Part 5. Prediction

Didacticiel Études de cas

Weight of Evidence Module

Data Mining: Overview. What is Data Mining?

International Journal of Computer Science Trends and Technology (IJCST) Volume 2 Issue 3, May-Jun 2014

Prediction of Car Prices of Federal Auctions

Predictive Modeling Using Transactional Data

How to Optimize Your Data Mining Environment

How To Make A Credit Risk Model For A Bank Account

Data mining techniques: decision trees

Data Mining Part 5. Prediction

Credit Scorecards for SME Finance The Process of Improving Risk Measurement and Management

Experian s UK Credit Bureau Scores. Version 1.6

Knowledge Discovery and Data Mining

Managing Consumer Credit Risk *

IBM SPSS Direct Marketing 23

Prerequisites. Course Outline

Car Insurance. Prvák, Tomi, Havri

Business Analytics and Data Mining for CRM Business Analytics and Data Mining for CRM: Jumpstart workshop

Data Mining - Evaluation of Classifiers

SAP Solution Brief SAP HANA. Transform Your Future with Better Business Insight Using Predictive Analytics

IBM SPSS Direct Marketing 22

White Paper. Data Mining for Business

Predictive Data modeling for health care: Comparative performance study of different prediction models

MACHINE LEARNING IN HIGH ENERGY PHYSICS

Azure Machine Learning, SQL Data Mining and R

Maximizing Return and Minimizing Cost with the Decision Management Systems

THE USE OF PREDICTIVE MODELLING TO BOOST DEBT COLLECTION EFFICIENCY

CONTENTS PREFACE 1 INTRODUCTION 1 2 DATA VISUALIZATION 19

AdTheorent s. The Intelligent Solution for Real-time Predictive Technology in Mobile Advertising. The Intelligent Impression TM

New D&B Failure Score and Recommended Credit Limit Models for UK and Ireland

Over the past several years, the issuance of small business loan securitizations

Transcription:

Study Unit 5 Business Analytics and Credit Scoring ANL 309 Business Analytics Applications

Introduction Process of credit scoring The role of business analytics in credit scoring Methods of logistic regression and decision trees

Constructing a Credit Scoring Model The construction of a credit or behavioural scoring model may be broadly broken down into the following phases: Defining Risky Customers Data Gathering and Analysis Scorecard Generation Implementation and Credit Risk Strategy

Defining Risky Customers First, define clearly the customers who the institution would classify as risky customers. Depends on the overall risk that the institution is willing to expose itself to, coupled with its profitability expectations. This is the basis on which the entire scoring framework is built upon.

Data Gathering and Analysis Next, identify all the dimensions which have an effect on the customer s propensity to default. Gather all the available information related to past credit behaviour of the customers. Perform the necessary data mining analysis to determine the significant relationships between demographic, behavioural dimensions, and the customer s propensity to default.

Scorecard Generation Scorecards can be generated using various techniques in data mining, such as logistic regression, which is a parametric statistical technique, or neutral network, which is a nonti technique. parametric A scorecard is a linear combination of the various attributes with appropriate weights assigned to each of them.

Implementation and Credit Risk Strategy Once every customer in the system is assigned a credit score, banks or lending institutions will re-formulate credit policies and operational strategies based on the portfolio. For example, customers with higher credit scores will enjoy better rates, tenures and faster approvals compared to the others. The institution may also decide to deny credit to customers who have very low credit scores.

Credit Scoring and Business Analytics There are a number of business analytical methods that can be applied in credit scoring. Two popular methods are: logistic regression decision trees

Model Variability Validity monitoring i is conducted d to ensure that t the model differentiates or slopes behaviour that is consistent with the business needs and expectations. When the performance or slope has degraded significantly, it indicates that the business needs are not being served and corrective measures must be taken. Validity monitoring should be viewed as the final defense mechanism because it identifies model failures after they have occurred.

Model Stability New models are assessed for stability that begins three months after their first use in production. For existing models, assessment occurs on a quarterly basis. Population stability will be assessed via the Population Stability Index (PSI) and a score distribution report. The statistic will be calculated by comparing a benchmark score distribution with the most recent score distribution.

Population Stability The Population Stability Index (PSI) calculations are performed monthly on the Small Business Card population used to the score the SL02 and NA01 models. The PSI value indicates if the population is stable or if there are significant shifts in the population. As such, model breakdown or data inconsistencies can be easily detected.

Logistic Regression Logistic regression is similar to linear regression, except that the dependent variable is not continuous. The dependent variable is discrete/ categorical, e.g. 1=respond to an offer, 0=did not respond to an offer; or 1=default on loan, 0=did not default on loan.

Logistic Regression

Logistic Regression: Assumptions The true conditional probabilities are a logistic function of the independent variables. No omission of important variables. No extraneous variables are included. No measurement error for the independent variables. Independence of observations. o s The independent variables are not linear combinations of each other.

Decision Trees Very popular in business analytics applications mainly because it produces visual model and generate rules that can be easily interpreted. Examine all possible questions which can distinguish the data into segments which are nearly homogeneous in characteristics.

Types of Decision Trees There are many types of decision tree approaches: C&RT ID3 C4.5/C5 CHAID. Their main difference is how they partition the data.

Decision Trees: Stopping Rule A decision tree algorithm will stop growing the tree when one of the following criteria is satisfied: Segment contains only one record. All records in the segment have identical characteristics. Improvement is not substantial to warrant growing the tree further.

Over-fitting and Cross-validation Once the tree has grown to a certain size, depending on the stopping rule, it is also important to check the tree for over-fitting of the data. Cross-validation and test set validation may be applied.