Issues in Credit Scoring



Similar documents
Credit Risk Models. August 24 26, 2010

Despite its emphasis on credit-scoring/rating model validation,

Measuring the Discrimination Quality of Suites of Scorecards:

Weight of Evidence Module

Credit Risk Analysis Using Logistic Regression Modeling

Model Validation Techniques

Cross-Tab Weighting for Retail and Small-Business Scorecards in Developing Markets

Appendix B D&B Rating & Score Explanations

Innovations and Value Creation in Predictive Modeling. David Cummings Vice President - Research

Overview Classes Logistic regression (5) 19-3 Building and applying logistic regression (6) 26-3 Generalizations of logistic regression (7)

Existing Account Management: Building Effective Portfolio Management Tools May 2011

Accurately and Efficiently Measuring Individual Account Credit Risk On Existing Portfolios

STATISTICA Formula Guide: Logistic Regression. Table of Contents

New Predictive Analytics for Measuring Consumer Capacity for Incremental Credit

Credit & Risk Management. Lawrence Marsiello Vice Chairman and Chief Lending Officer

OCC OCC BULLETIN

To Score or Not to Score?

Statistics in Retail Finance. Chapter 2: Statistical models of default

Basel Committee on Banking Supervision. Working Paper No. 17

UBI-5: Usage-Based Insurance from a Regulatory Perspective

Developing Credit Scorecards Using Credit Scoring for SAS Enterprise Miner TM 12.1


Credit Scorecards for SME Finance The Process of Improving Risk Measurement and Management

Validation of Internal Rating and Scoring Models

ING Bank, fsb (ING DIRECT) appreciates the opportunity to comment on the Base1 I1 notice of proposed rulemaking (NPR).

Counterparty Credit Risk for Insurance and Reinsurance Firms. Perry D. Mehta Enterprise Risk Management Symposium Chicago, March 2011

Risk pricing for Australian Motor Insurance

Summary. January 2013»» white paper

Credit Score Basics, Part 1: What s Behind Credit Scores? October 2011

Piecewise Logistic Regression: an Application in Credit Scoring

Cool Tools for PROC LOGISTIC

Benchmarking default prediction models: pitfalls and remedies in model validation

Banco Sabadell Stress test results. 15 th July 2011

GLM, insurance pricing & big data: paying attention to convergence issues.

SURVEY ON ASSET LIABILITY MANAGEMENT PRACTICES OF CANADIAN LIFE INSURANCE COMPANIES

ECLT5810 E-Commerce Data Mining Technique SAS Enterprise Miner -- Regression Model I. Regression Node

The Western Hemisphere Credit & Loan Reporting Initiative (WHCRI)

The art of probability-of-default curve calibration

How to set the main menu of STATA to default factory settings standards

Discussion Paper On the validation and review of Credit Rating Agencies methodologies

Stress Test Modeling in Cards Portfolios

SAMPLE SELECTION BIAS IN CREDIT SCORING MODELS

An Application of the Cox Proportional Hazards Model to the Construction of Objective Vintages for Credit in Financial Institutions, Using PROC PHREG

SOA 2013 Life & Annuity Symposium May 6-7, Session 30 PD, Predictive Modeling Applications for Life and Annuity Pricing and Underwriting

CASE STUDY CARDS: DRIVING PROFITABLE GROWTH

STATISTICA. Financial Institutions. Case Study: Credit Scoring. and

OCC Docket Number (RIN 1557-AC91) Federal Reserve Board Docket No. R-1261 FDIC RIN 3064-AC73 OTS Docket No (RIN 1550-AB56)

Gerry Hobbs, Department of Statistics, West Virginia University

( ) = ( ) = {,,, } β ( ), < 1 ( ) + ( ) = ( ) + ( )

How To Make A Credit Risk Model For A Bank Account

Product Management Case Study. What makes Effective Product Management Process?

Statistics 305: Introduction to Biostatistical Methods for Health Sciences

PRINCIPLES FOR THE MANAGEMENT OF CONCENTRATION RISK

Credit Risk Stress Testing

Credibility and Pooling Applications to Group Life and Group Disability Insurance

How To Price Insurance In Canada

Usage-based Auto Insurance (UBI)

Binary Logistic Regression

A Coefficient of Variation for Skewed and Heavy-Tailed Insurance Losses. Michael R. Powers[ 1 ] Temple University and Tsinghua University

Small Business Lending and Social Capital: Are Rural Relationships Different?

MORTGAGE LENDER PROTECTION UNDER INSURANCE ARRANGEMENTS Irina Genriha Latvian University, Tel ,

The Predictive Data Mining Revolution in Scorecards:

Managing Consumer Credit Risk *

Advanced Statistical Analysis of Mortality. Rhodes, Thomas E. and Freitas, Stephen A. MIB, Inc. 160 University Avenue. Westwood, MA 02090

AEL Financial, LLC. Presentation for: Equipment Leasing Association

IMPLEMENTATION NOTE. Validating Risk Rating Systems at IRB Institutions

Leveraging CRM spend in retail banking

Data Mining: An Overview of Methods and Technologies for Increasing Profits in Direct Marketing. C. Olivia Rud, VP, Fleet Bank

Data Mining Techniques Chapter 4: Data Mining Applications in Marketing and Customer Relationship Management

11. Analysis of Case-control Studies Logistic Regression

ASSIGNMENT 4 PREDICTIVE MODELING AND GAINS CHARTS

Automated valuation models: Changes in the housing market require additional risk management considerations

Credit Card Market Study Interim Report: Annex 5 Firm business model analysis

Driving Insurance World through Science Murli D. Buluswar Chief Science Officer

Modeling Lifetime Value in the Insurance Industry

Attachment. OCC Guidance on Due Diligence Requirements in Determining Whether Securities Are Eligible for Investment

BUSINESS CREDIT AND COLLECTION RISK ANALYSIS

Reducing Credit Union Member Attrition with Predictive Analytics

Achieving Inventory Accuracy

Service Contracts, Warranties and Insurance. Christopher M. Holt, ACAS, MAA, FCA Consulting Actuary

Umbrella & Excess Liability - Understanding & Quantifying Price Movement

IBM SPSS Direct Marketing 23

Calibration of Uncertainty (P10/P90) in Exploration Prospects*

Statistics in Retail Finance. Chapter 6: Behavioural models

Measuring per-mile risk for pay-as-youdrive automobile insurance. Eric Minikel CAS Ratemaking & Product Management Seminar March 20, 2012

Credit Risk Scorecard Design, Validation and User Acceptance

SOA Pandemic Model Spreadsheet Documentation

Transcription:

Issues in Credit Scoring Model Development and Validation Dennis Glennon Risk Analysis Division Economics Department The Office of the Comptroller of the Currency The opinions expressed are those of the author and do not necessarily reflect those of the Office of the Comptroller of the Currency. 1

Model Development and Validation Outline 1. Credit Risk vs. Model Risk 2. Model Risk Analysis 3. Model Purpose i. Classification ii. Prediction 2

Model Review Scope of a Review I. Credit Risk II. The risk to earnings or capital of an obligor's failure to meet the terms of any contract with the bank or otherwise fail to perform as agreed. Model Risk Although model risk contributes to the overall portfolio or credit risk, it represents a conceptually distinct exposure that emerges from an overly broad interpretation or application of a model beyond that for which it was developed. 3

Model Review Scope of a Review I. Credit Risk Analysis i. evaluate strategies ii. assess current portfolio performance II. Model Risk Analysis i. evaluate model validity, reliability, and accuracy 4

Model Review Model Risk Analysis I. Are the models developed using valid statistical or industry-accepted methods? i. Appropriate sample design a. truncated/censored samples b. over-sample 5

Model Review Model Risk Analysis (continued) ii. Valid model design a. satisfy minimum statistical requirements b. in-sample performance (including holdout sample) c. out-of-time performance 6

Model Review Model Risk Analysis (continued) II. Are the models used in ways that are consistent with the original purpose for which the model was developed? i. Model purpose a. classification b. prediction 7

Model Purpose Model Purpose The underlying objective of a classification-based model is different from that of a prediction model. As such, a model should be evaluated within the scope of its primary objective. 8

Model Purpose Models as Classification Tools Banks are developing or purchasing models that are designed as classification tools. That is, the models are developed for the purpose of partitioning populations or portfolios into groups by their expected relative performance. Modeling Objective: Maximize the divergence or separation between the distributions of good and bad accounts. 9

Classification Design: Example A Comparison of Model Performance Performance Distribution Perform ance D istribution 6000 5000 4000 3000 2000 1000 0 250000 200000 150000 100000 50000 0 6000 4000 2000 0 250000 200000 150000 100000 50000 0 150 170 190 210 230 250 270 290 150 170 190 210 230 250 270 290 score score bads goods bads goods K-S = 64.0 K-S = 26.5 10

Model Purpose Classification Objective Interpretation: If, for example, the good/bad odds ratio associated with the score interval between 200-210 is 30:1, then the odds ratio for the intervals above (below) 200-210 will be greater (less) than 30:1. Result: A model that maintains its ability to rank-order performance is considered to be reliable. 11

Model Purpose Classification-Based Models Valid Purpose: models developed under this criteria are valid as decision tools if the objective is to simply identify segments of the population that, as a group, perform poorly. Appropriate for identifying and excluding specific segments of the population -- a strategy that, in practice, often improves average portfolio performance relative to a random-selection method. 12

Classification Model 7 6 5 Log Odds Curve Development (K-S = 32.1) Validation (K-S = 34.3) ln(20/1) = 3 bad rate =.05 ln(good/bad) 4 3 2 ln(4/1) = 1.39 bad rate =.20 1 0 644 653 665 675 684 693 706 715 725 739 753 Score Bands 13

Model Purpose Alternative Purpose: Predicting Performance. Banks want models for risk-based pricing/re-pricing and profitability analysis -- models that are designed specifically to address the issue of trading risk for margin (i.e., return). For that purpose, banks need models that are accurate predictors of performance. 14

Model Selection: Which model is better? obs. good (G) - y=0 obs. bad (B) - y=1 K-S = 48 Model 1 K-S = 48 Model 2 y 1 3 5 7 9 y 1 5 4 4 11 1 [0.1] [bad rate] [0.3] [0.5] [0.7] [0.9] 1 [0.08] [0.45] [0.44] [0.67] [0.92] [bad rate] 9 7 5 3 1 11 6 5 2 1 0 0 10 20 30 40 50 60 70 80 90 100 Score (quintiles) 0 [#B / (#G + #B)] 0 10 30 50 70 90 20 40 60 80 100 Score (quintiles) 15

Model Purpose: Prediction Models as Prediction Tools Purpose: to predict the expected frequency at which accounts with similar attributes perform (e.g., respond, attrite, default). For example, predict the probability of default. Modeling Objective: Minimize the difference between the predicted and actual percentage of defaults within each score range (i.e., maximize the goodness-of-fit). 16

Model Purpose: Prediction Prediction-Based Models Interpretation: If within the interval 200-210 the risk model predicts a probability of default of.04, then for every 100 account that score within that range, four should default. A model that satisfies this condition is considered to be accurate. This is a much stronger condition than that associated with a classification objective (i.e., reliable). 17

Model Purpose: Prediction Prediction-Based Models Valid Purpose: models developed under this approach are valid as actuarial tools; as such, they are appropriate in situations in which the actual, not just the relative, measure of performance is required. 18

Model Purpose: Prediction Limitations of a Prediction-Based Model The model-development process is significantly more complex especially when data across all aspects of the behavior decision (i.e., individual, market, and industry) are limited. 19

Model Purpose Conclusion: Models are developed for different purposes -- e.g., classification or prediction. As such, the choices of: - sample design, - modeling technique, and - validation procedures are driven by the intended purpose for which the model will ultimately be used. 20

Model Purpose Observation: The choice of modeling objective is important not only because it defines how we assess its validity, but also because it defines a full set of technical estimation procedures that are used to select the best model under the chosen objective. 21

Issues in Credit Scoring Model Development and Validation The End 22