Financial Risk Forecasting Chapter 8 Backtesting and stresstesting

Similar documents

Contents. List of Figures. List of Tables. List of Examples. Preface to Volume IV

Effective Techniques for Stress Testing and Scenario Analysis

Reducing Active Return Variance by Increasing Betting Frequency

Simple Linear Regression Inference

Validating Market Risk Models: A Practical Approach

An Aggregate Reserve Methodology for Health Claims

The Best of Both Worlds:

Review of Basic Options Concepts and Terminology

Least Squares Estimation

Dr Christine Brown University of Melbourne

ALM Stress Testing: Gaining Insight on Modeled Outcomes

Likelihood Approaches for Trial Designs in Early Phase Oncology

Validation of Internal Rating and Scoring Models

Multinomial and Ordinal Logistic Regression

How To Check For Differences In The One Way Anova

Optimal Risk Management Before, During and After the Financial Crisis

OBJECTIVE ASSESSMENT OF FORECASTING ASSIGNMENTS USING SOME FUNCTION OF PREDICTION ERRORS

Statistics in Retail Finance. Chapter 6: Behavioural models

How to assess the risk of a large portfolio? How to estimate a large covariance matrix?

Tail-Dependence an Essential Factor for Correctly Measuring the Benefits of Diversification

STATISTICAL SIGNIFICANCE AND THE STANDARD OF PROOF IN ANTITRUST DAMAGE QUANTIFICATION

A Guide to the Insider Buying Investment Strategy

Financial Assets Behaving Badly The Case of High Yield Bonds. Chris Kantos Newport Seminar June 2013

Reject Inference in Credit Scoring. Jie-Men Mok

How To Know Market Risk

Overview. October Investment Portfolios & Products. Approved for public distribution. Investment Advisory Services

EVALUATING THE PERFORMANCE CHARACTERISTICS OF THE CBOE S&P 500 PUTWRITE INDEX

An introduction to Value-at-Risk Learning Curve September 2003

Effective Stress Testing in Enterprise Risk Management

Understanding Margins. Frequently asked questions on margins as applicable for transactions on Cash and Derivatives segments of NSE and BSE

Bayesian logistic betting strategy against probability forecasting. Akimichi Takemura, Univ. Tokyo. November 12, 2012

Industry Environment and Concepts for Forecasting 1

Statistical Machine Learning

APPROACHES TO COMPUTING VALUE- AT-RISK FOR EQUITY PORTFOLIOS

Risk Management for Fixed Income Portfolios

Algorithmic Trading Session 1 Introduction. Oliver Steinki, CFA, FRM

Stress Testing Trading Desks

LDA at Work: Deutsche Bank s Approach to Quantifying Operational Risk

A Review of Cross Sectional Regression for Financial Data You should already know this material from previous study

Risk Visualization: Presenting Data to Facilitate Better Risk Management

FINANCIAL ECONOMICS OPTION PRICING

Understanding Margins

Chicago Booth BUSINESS STATISTICS Final Exam Fall 2011

INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA)

MAKE VOLATILITY YOUR ASSET. LJM PRESERVATION & GROWTH FUND

Implementing Reveleus: A Case Study

Volatility modeling in financial markets

SUPERVISORY GUIDANCE ON MODEL RISK MANAGEMENT

I.e., the return per dollar from investing in the shares from time 0 to time 1,

Guidelines on interest rate risk in the banking book

Basics of Statistical Machine Learning

The Trip Scheduling Problem

The Investment Implications of Solvency II

Forecast Confidence Level and Portfolio Optimization

Chapter 4: Vector Autoregressive Models

Diversifying with Negatively Correlated Investments. Monterosso Investment Management Company, LLC Q1 2011

Active Versus Passive Low-Volatility Investing

Testing Simple Markov Structures for Credit Rating Transitions

Bivariate Statistics Session 2: Measuring Associations Chi-Square Test

Example: Credit card default, we may be more interested in predicting the probabilty of a default than classifying individuals as default or not.

STRESS TESTING GUIDELINE

Recall this chart that showed how most of our course would be organized:

Sales forecasting # 2

Application of Quantitative Credit Risk Models in Fixed Income Portfolio Management

CONTENTS. List of Figures List of Tables. List of Abbreviations

Building and Interpreting Custom Investment Benchmarks

Implementing Point and Figure RS Signals

Tutorial 5: Hypothesis Testing

EVALUATING THE APPLICATION OF NEURAL NETWORKS AND FUNDAMENTAL ANALYSIS IN THE AUSTRALIAN STOCKMARKET

Poisson Models for Count Data

CAB TRAVEL TIME PREDICTI - BASED ON HISTORICAL TRIP OBSERVATION

Yao Zheng University of New Orleans. Eric Osmer University of New Orleans

Auditing Asset-Liability Management (ALM) Functions

MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS

Sample Size and Power in Clinical Trials

by Maria Heiden, Berenberg Bank

Setting the scene. by Stephen McCabe, Commonwealth Bank of Australia

Fundamental Review of the Trading Book: A Revised Market Risk Framework Second Consultative Document by the Basel Committee on Banking Supervision

Java Modules for Time Series Analysis

Risk Management for Alternative Investments

Risk Based Capital Guidelines; Market Risk. The Bank of New York Mellon Corporation Market Risk Disclosures. As of December 31, 2013

Statistics for Retail Finance. Chapter 8: Regulation and Capital Requirements

CONTENTS OF DAY 2. II. Why Random Sampling is Important 9 A myth, an urban legend, and the real reason NOTES FOR SUMMER STATISTICS INSTITUTE COURSE

Key performance indicators

The Coming Volatility

A Simple Model for Intra-day Trading

Forecasting in supply chains

Third Edition. Philippe Jorion GARP. WILEY John Wiley & Sons, Inc.

The Emperor has no Clothes: Limits to Risk Modelling

Testing Market Efficiency in a Fixed Odds Betting Market

Life Cycle Asset Allocation A Suitable Approach for Defined Contribution Pension Plans

Transcription:

Financial Risk Forecasting Chapter 8 Backtesting and stresstesting Jon Danielsson London School of Economics 2015 To accompany Financial Risk Forecasting http://www.financialriskforecasting.com/ Published by Wiley 2011 Financial Risk Forecasting 2011,2015 Jon Danielsson, page 1 of 80

Financial Risk Forecasting 2011,2015 Jon Danielsson, page 2 of 80

Introduction Financial Risk Forecasting 2011,2015 Jon Danielsson, page 3 of 80

The focus of this chapter is on Backtesting Application of backtesting Significance of backtests Bernoulli coverage test Testing the independence of violations Joint test Loss-function-based backtests Expected shortfall backtesting Problems with backtesting Stress testing Financial Risk Forecasting 2011,2015 Jon Danielsson, page 4 of 80

Notation W T Testing window size T = W E +W T Number of observations in a sample η t = 0,1 Indicates whether a VaR violation occurs (i.e., η t = 1) υ i,i = 0,1 Number of violations (i = 1) and no violations (i = 0) observed in W T υ ij Number of instances where j follows i in W T Financial Risk Forecasting 2011,2015 Jon Danielsson, page 5 of 80

Backtesting Financial Risk Forecasting 2011,2015 Jon Danielsson, page 6 of 80

Forecasting VaR - Example Imagine you have 10 years of data, from 2004 to 2014 It is 2006 and you are trying to forecast risk for January 1st 2006 You will be using 2004 and 2005 to forecast daily VaR for the first day of 2006 Financial Risk Forecasting 2011,2015 Jon Danielsson, page 7 of 80

The 500 trading days in 2004 and 2005 constitute the first estimation window W E is then moved up by one day to obtain the risk forecast for the second day of 2006, etc. Start End VaR forecast 1/1/2004 31/12/2005 VaR(1/1/2006) 2/1/2004 1/1/2006 VaR(2/1/2006)... 31/12/2012 30/12/2014 VaR(31/12/2014) Financial Risk Forecasting 2011,2015 Jon Danielsson, page 8 of 80

What is backtesting? Backtesting is a procedure that can be used to compare various risk models It aims to take ex ante VaR forecasts from a particular model and compare them with ex post realized return (i.e., historical observations) Whenever losses exceed VaR, a VaR violation is said to have occurred Financial Risk Forecasting 2011,2015 Jon Danielsson, page 9 of 80

Usefulness of backtesting Useful in identifying the weaknesses of risk-forecasting models and providing ideas for improvement Not informative about the causes of weaknesses Models that perform poorly during backtesting should have their assumptions and parameter estimates questioned Backtesting can prevent underestimation of VaR and, hence, ensure that a bank carries sufficiently high capital However, it can also reduce the likelihood of overestimating VaR, which can lead to excessive conservatism Financial Risk Forecasting 2011,2015 Jon Danielsson, page 10 of 80

Assessment of VaR forecasts should ideally be done by tracking the performance of a model in the future using operational criteria However: as violations are only observed infrequently, a long period of time would be required Backtesting evaluates VaR forecasts by checking how a VaR forecast model performs over a period in the past Financial Risk Forecasting 2011,2015 Jon Danielsson, page 11 of 80

Estimation window (W E ): the number of observations used to forecast risk. If different procedures or assumptions are compared, the estimation window is set to whichever one needs the highest number of observations Testing window (W T ): the data sample over which risk is forecast (i.e., the days where we have made a VaR forecast) Financial Risk Forecasting 2011,2015 Jon Danielsson, page 12 of 80

The entire sample size T is equal to the sum of W E and W T : t = 1 Entire data sample t = T Financial Risk Forecasting 2011,2015 Jon Danielsson, page 13 of 80

The entire sample size T is equal to the sum of W E and W T : t = 1 Entire data sample t = T t = 1 First estimation window t = W E VaR(W E +1) Financial Risk Forecasting 2011,2015 Jon Danielsson, page 14 of 80

The entire sample size T is equal to the sum of W E and W T : t = 1 Entire data sample t = T t = 1 First estimation window t = W E VaR(W E +1) t = 2 Second estimation window t = W E +1 VaR(W E +2) Financial Risk Forecasting 2011,2015 Jon Danielsson, page 15 of 80

The entire sample size T is equal to the sum of W E and W T : t = 1 Entire data sample t = T t = 1 First estimation window t = W E VaR(W E +1) t = 2 Second estimation window t = W E +1 VaR(W E +2) t = 3 Third estimation window t = W E +2 VaR(W E +3) Financial Risk Forecasting 2011,2015 Jon Danielsson, page 16 of 80

The entire sample size T is equal to the sum of W E and W T : t = 1 Entire data sample t = T t = 1 First estimation window t = W E VaR(W E +1) t = 2 Second estimation window t = W E +1 VaR(W E +2) t = 3 Third estimation window t = W E +2 VaR(W E +3). Financial Risk Forecasting 2011,2015 Jon Danielsson, page 17 of 80

The entire sample size T is equal to the sum of W E and W T : t = 1 Entire data sample t = T t = 1 First estimation window t = W E VaR(W E +1) t = 2 Second estimation window t = W E +1 VaR(W E +2) t = 3 Third estimation window t = W E +2 VaR(W E +3). t = T W E Last estimation window t = T 1 VaR(T) Financial Risk Forecasting 2011,2015 Jon Danielsson, page 18 of 80

VaR forecasts can be compared with the actual outcome: the daily 2001 to 2009 returns are already known Instead of referring to calendar dates (e.g., 1/1/2004), refer to days by indexing the returns, assuming 250 trading days per year: y 1 is the return on 1/1/1999 y 2,500 is the return on the last day, 31/12/2009 Financial Risk Forecasting 2011,2015 Jon Danielsson, page 19 of 80

The estimation window W E is set at 500 days, and the testing window W T is therefore 2,000 days: t t +W E 1 VaR(t +W E ) 1 500 VaR(501) 2 501 VaR(502)... 1,999 2,499 VaR(2,500) Financial Risk Forecasting 2011,2015 Jon Danielsson, page 20 of 80

VaR violation If the actual return on a particular day exceeds the VaR forecast, then the VaR limit is said to have been violated VaR violation: an event such that: η t = { 1, if y t VaR t 0, if y t > VaR t. υ 1 is the count of η t = 1 and υ 0 is the count of η t = 0, which are simply obtained by: υ 1 = Ση t υ 0 = W T υ 1. Financial Risk Forecasting 2011,2015 Jon Danielsson, page 21 of 80

Violation ratios The main tools used in backtesting are violation ratios, where the actual number of VaR violations are compared with the expected value. Violation ratio: VR = Observed number of violations Expected number of violations = υ 1 p W T If the violation ratio is greater than one the VaR model underforecasts risk If smaller than one the model overforecasts risk Financial Risk Forecasting 2011,2015 Jon Danielsson, page 22 of 80

One can make a judgment on the quality of the VaR forecasts by recording the relative number of violations or use more sophisticated statistical techniques We record the violations as η t, which takes the value 1 when a violation occurs and 0 otherwise The number of violations are collected in the variable υ, where υ 1 is the number of violations and υ 0 is the number of days without violations These two add up to make the testing period Financial Risk Forecasting 2011,2015 Jon Danielsson, page 23 of 80

Market risk regulations Financial institutions regulated under the Basel II Accords are required to set aside a certain amount of capital due to market risk, credit risk and operational risk It is based on multiplying the maximum of previous day 1%VaR and 60 days average VaR by a constant, Ξ, which is determined by the number of violations that happened previously: ( ) Market risk capital t = Ξ t max VaR 1% t,var 1% t +constant Financial Risk Forecasting 2011,2015 Jon Danielsson, page 24 of 80

VaR 1% t is average reported 1% VaR over the previous 60 trading days The multiplication factor Ξ t varies with the number of violations, υ 1, that occurred in the previous 250 trading days - the required testing window length for backtesting in the Basel Accords. This is based on three ranges for the number of violations, named after the three colors of traffic lights: 3, if υ 1 4 (Green) Ξ t = 3+0.2(υ 1 4), if 5 υ 1 9 (Yellow) 4, if 10 υ 1 (Red). Financial Risk Forecasting 2011,2015 Jon Danielsson, page 25 of 80

Estimation window length The W E length is determined by the choice of VaR model and probability level Different methods have different data requirements (EWMA requires c. 30 days, HS needs at least 300 for VaR 1%, and GARCH even more) When making comparisons, the estimation window should be sufficiently large to accommodate the most stringent data criteria Even within the same method, it may be helpful to compare different window lengths Financial Risk Forecasting 2011,2015 Jon Danielsson, page 26 of 80

Testing window length VaR violations are infrequent events: with a 1% VaR, a violation is expected once every 100 days, so that 2.5 violations are expected per year Therefore, the actual sample size of violations is quite small, causing difficulties for statistical inference We might need perhaps 10 violations for reliable statistical analysis, or 4 years of data Financial Risk Forecasting 2011,2015 Jon Danielsson, page 27 of 80

Violation ratios VR=1 is expected, but how can we ascertain whether any other value is statistically significant? A useful rule of thumb: If VR [0.8,1.2] it is a good forecast If VR < 0.5 or VR > 1.5 the model is imprecise Both bounds narrow with increasing testing window lengths As a first attempt, one could plot the actual returns and VaR together, thereby facilitating a quick visual inspection of the quality of the VaR forecast Financial Risk Forecasting 2011,2015 Jon Danielsson, page 28 of 80

Application of backtesting Financial Risk Forecasting 2011,2015 Jon Danielsson, page 29 of 80

Volatility and VaR: extreme example 140% EWMA $110 Volatility 100% 60% 20% 20% 60% 100% $90 $70 $50 $30 $10 VaR 0 500 1000 1500 2000 Financial Risk Forecasting 2011,2015 Jon Danielsson, page 30 of 80

Volatility Volatility and VaR: extreme example 140% 100% 60% 20% 20% 60% 100% EWMA MA 0 500 1000 1500 2000 $110 $90 $70 $50 $30 $10 VaR Financial Risk Forecasting 2011,2015 Jon Danielsson, page 31 of 80

Volatility and VaR: extreme example Volatility 140% 100% 60% 20% 20% 60% 100% EWMA MA HS 0 500 1000 1500 2000 $110 $90 $70 $50 $30 $10 VaR Financial Risk Forecasting 2011,2015 Jon Danielsson, page 32 of 80

Volatility and VaR: extreme example Volatility 140% 100% 60% 20% 20% 60% 100% EWMA MA HS GARCH 0 500 1000 1500 2000 $110 $90 $70 $50 $30 $10 VaR Financial Risk Forecasting 2011,2015 Jon Danielsson, page 33 of 80

10% Volatility and VaR for S&P 500 EWMA $120 $100 Volatility 5% $80 0% $60 VaR 5% $40 $20 10% Jan 98 Jan 02 Jan 06 Jan 10 Financial Risk Forecasting 2011,2015 Jon Danielsson, page 34 of 80

10% Volatility and VaR for S&P 500 EWMA MA $120 $100 Volatility 5% $80 0% $60 VaR 5% $40 $20 10% Jan 98 Jan 02 Jan 06 Jan 10 Financial Risk Forecasting 2011,2015 Jon Danielsson, page 35 of 80

Volatility 10% 5% 0% Volatility and VaR for S&P 500 EWMA MA HS $120 $100 $80 $60 VaR 5% $40 $20 10% Jan 98 Jan 02 Jan 06 Jan 10 Financial Risk Forecasting 2011,2015 Jon Danielsson, page 36 of 80

Volatility and VaR for S&P 500 Volatility 10% 5% 0% EWMA MA HS GARCH $120 $100 $80 $60 VaR 5% $40 $20 10% Jan 98 Jan 02 Jan 06 Jan 10 Financial Risk Forecasting 2011,2015 Jon Danielsson, page 37 of 80

Volatility and VaR for S&P 500, focus on 2003-4 3% 2% EWMA $40 $35 Volatility 1% 0% 1% 2% 3% $30 $25 $20 $15 VaR Jan 03 Apr 03 Jul 03 Oct 03 Jan 04 Financial Risk Forecasting 2011,2015 Jon Danielsson, page 38 of 80

Volatility and VaR for S&P 500, focus on 2003-4 Volatility 3% 2% 1% 0% 1% 2% EWMA MA $40 $35 $30 $25 $20 VaR 3% $15 Jan 03 Apr 03 Jul 03 Oct 03 Jan 04 Financial Risk Forecasting 2011,2015 Jon Danielsson, page 39 of 80

Volatility and VaR for S&P 500, focus on 2003-4 3% 2% EWMA MA HS $40 $35 Volatility 1% 0% 1% 2% 3% $30 $25 $20 $15 VaR Jan 03 Apr 03 Jul 03 Oct 03 Jan 04 Financial Risk Forecasting 2011,2015 Jon Danielsson, page 40 of 80

Volatility and VaR for S&P 500, focus on 2003-4 Volatility 3% 2% 1% 0% 1% EWMA MA HS GARCH $40 $35 $30 $25 VaR 2% 3% $20 $15 Jan 03 Apr 03 Jul 03 Oct 03 Jan 04 Financial Risk Forecasting 2011,2015 Jon Danielsson, page 41 of 80

Volatility and VaR for S&P 500, focus on 2008 Volatility 10% 5% 0% EWMA $100 $80 $60 VaR 5% $40 10% Jul 08 Sep 08 Nov 08 Jan 09 Financial Risk Forecasting 2011,2015 Jon Danielsson, page 42 of 80

Volatility and VaR for S&P 500, focus on 2008 Volatility 10% 5% 0% EWMA MA $100 $80 $60 VaR 5% $40 10% Jul 08 Sep 08 Nov 08 Jan 09 $20 Financial Risk Forecasting 2011,2015 Jon Danielsson, page 43 of 80

Volatility and VaR for S&P 500, focus on 2008 Volatility 10% 5% 0% EWMA MA HS $100 $80 $60 VaR 5% $40 10% $20 Jul 08 Sep 08 Nov 08 Jan 09 Financial Risk Forecasting 2011,2015 Jon Danielsson, page 44 of 80

Volatility and VaR for S&P 500, focus on 2008 Volatility 10% 5% 0% 5% EWMA MA HS GARCH $120 $100 $80 $60 $40 VaR 10% $20 Jul 08 Sep 08 Nov 08 Jan 09 Financial Risk Forecasting 2011,2015 Jon Danielsson, page 45 of 80

Significance of Backtests Financial Risk Forecasting 2011,2015 Jon Danielsson, page 46 of 80

Focus on two issues: The number of violations (tested by the unconditional coverage) Clustering (tested by independence tests) Financial Risk Forecasting 2011,2015 Jon Danielsson, page 47 of 80

Unconditional coverage property Ensures that the theoretical confidence level p matches the empirical probability of violation: For a 1% VaR backtest, we would expect to observe a VaR violation 1% of the time. If, instead, violations are observed more often, say 5% of the time, the VaR model is systematically underestimating risk at the 1% level In other words, the 1% VaR produced from the model is in reality the 5% VaR Financial Risk Forecasting 2011,2015 Jon Danielsson, page 48 of 80

Independence property Requires any two observations in the hit sequence to be independent of each other Intuitively, the fact that a violation has been observed today should not convey any information about the likelihood of observing a violation tomorrow If VaR violations cluster, we can predict a violation today if there was one yesterday, and therefore the probability of a loss exceeding 1% VaR today is higher than 1% In this case, a good VaR model would have increased the 1% VaR forecast following a violation Financial Risk Forecasting 2011,2015 Jon Danielsson, page 49 of 80

Pros and cons Unconditional coverage and independence are distinct and it is entirely possible that a VaR model satisfying one of them would not satisfy the other The main downside of these tests is that they rely on asymptotic distributions Given that violations are rare events, the effective sample size is relatively small and, therefore, tests may not be as robust as we would like One may be better off obtaining confidence bounds by means of simulations Financial Risk Forecasting 2011,2015 Jon Danielsson, page 50 of 80

Bernoulli coverage test η t is a sequence of Bernoulli-distributed random variables We then use the Bernoulli coverage test to ascertain the proportion of violations The null hypothesis for VaR violations is: H 0 : η B(p), where B stands for the Bernoulli distribution. Financial Risk Forecasting 2011,2015 Jon Danielsson, page 51 of 80

The Bernoulli density is given by: (1 p) 1 ηt (p) ηt,η t = 0,1. The probability p can be estimated by: ˆp = υ 1 W T The likelihood function is given by: L U (ˆp) = T t=w E +1 (1 ˆp) 1 ηt (ˆp) ηt = (1 ˆp) υ 0 (ˆp) υ 1 Financial Risk Forecasting 2011,2015 Jon Danielsson, page 52 of 80

Denote this as the unrestricted likelihood function because it uses estimated probability ˆp in contrast to the likelihood function below where we restrict the probability value to p. Under the H 0, p = ˆp, so the restricted likelihood function is: L R (p) = T t=w E +1 = (1 p) υ 0 (p) υ 1 (1 p) 1 ηt (p) ηt Financial Risk Forecasting 2011,2015 Jon Danielsson, page 53 of 80

We can use an LR test to see whether L R = L U or, equivalently, whether p = ˆp: LR = 2(logL U (p) logl U (ˆp)) = 2log (1 ˆp)υu (ˆp) υ 1 (1 p) υ 0 (p) υ 1 asymptotic χ 2 (1) Choosing a 5% significance level for the test, the null hypothesis is rejected if LR > 3.84 The choice of significance level affects the power of the test: a low type I error implies a higher type II error and therefore a lower power for the test. Financial Risk Forecasting 2011,2015 Jon Danielsson, page 54 of 80

The Bernoulli coverage test is nonparametric in the sense that it does not assume a distribution for the returns and often provides good benchmarks for the assessment of accuracy of VaR models It does not have much power when sample sizes are small, like the one-year size specified in the Basel Accords Financial Risk Forecasting 2011,2015 Jon Danielsson, page 55 of 80

Testing the independence of violations It is also of interest to test whether violations cluster (i.e., whether all violations happen one after the other), indicating a sequence of losses since violations should theoretically spread out over time Calculate the probabilities of two consecutive violations (i.e., p 11 ) and the probability of a violation if there was no violation on the previous day (i.e., p 01 ) Financial Risk Forecasting 2011,2015 Jon Danielsson, page 56 of 80

More generally, the probability that: p i,j = Pr(η t = i η t 1 = j) where i and j are either 0 or 1 The first-order transition probability matrix is defined as: ( ) 1 p01 p Π 1 = 01 1 p 11 p 11 The restricted likelihood function, where the transition matrix from the null hypothesis is used since the hit sequence is Bernoulli distributed, is: L R (Π 1 ) = (1 p 01 ) υ 00 p υ 01 01 (1 p 11) υ 10 p υ 11 11 where υ ij is the number of observations where j follows i. Financial Risk Forecasting 2011,2015 Jon Danielsson, page 57 of 80

Maximum likelihood (ML) estimates are obtained by maximizing L R (Π 1 ): υ 01 υ 00 ˆΠ 1 = υ 00 +υ υ 01 10 υ 00 +υ υ 01 11 υ 10 +υ 11 υ 10 +υ 11 Under the null hypothesis of no clustering, the probability of a violation tomorrow does not depend on today seeing a violation Financial Risk Forecasting 2011,2015 Jon Danielsson, page 58 of 80

Then, p 01 = p 11 = p and the estimated transition matrix is simply: ( ) 1 ˆp ˆp ˆΠ 0 = 1 ˆp ˆp where υ 01 +υ 11 ˆp = υ 00 +υ 10 +υ 01 +υ 11 The unrestricted likelihood function according to null hypothesis uses the estimated transition matrix and is: L U (ˆΠ 0 ) = (1 ˆp) υ 00+υ 10ˆp υ 01+υ 11 Financial Risk Forecasting 2011,2015 Jon Danielsson, page 59 of 80

Likelihood ratio test The likelihood ratio test: ( LR = 2 logl U (ˆΠ 0 ) logl R (ˆΠ 1 ) ) asymptotic χ 2 (1) This test does not depend on true p and only tests for independence Financial Risk Forecasting 2011,2015 Jon Danielsson, page 60 of 80

The main problem with tests of this sort is that they must specify the particular way in which independence is breached However, there are many possible ways in which the independence property is not fulfilled: The test will have no power to detect departures from independence if the likelihood of VaR being violated today depends on whether VaR was violated 2 days ago not on yesterdays VaR being violated. Financial Risk Forecasting 2011,2015 Jon Danielsson, page 61 of 80

Testing S&P 500: comparison of the four VaR models January 30, 1998 to December 31, 2009 (W T = 3000) with crisis Model Coverage test Independence test Test statistic p-value Test statistic p-value EWMA 18.1 0.00 0.00 0.96 MA 81.2 0.00 7.19 0.01 HS 24.9 0.00 4.11 0.04 GARCH 16.9 0.00 0.00 0.99 Financial Risk Forecasting 2011,2015 Jon Danielsson, page 62 of 80

Testing S&P 500: comparison of the four VaR models January 30, 1998 to November 1, 2006 (W T = 2000) - without crisis Model Coverage test Independence test Test statistic p-value Test statistic p-value EWMA 2.88 0.09 0.68 0.41 MA 6.15 0.01 2.62 0.11 HS 0.05 0.82 1.52 0.22 GARCH 1.17 0.28 0.99 0.32 Financial Risk Forecasting 2011,2015 Jon Danielsson, page 63 of 80

Joint test We can jointly test whether violations are significantly different from those expected and whether there is violation clustering by constructing the following test statistic: LR(joint) = LR(coverage)+LR(independence) χ 2 (2) The joint test has less power to reject a VaR model which only satisfies one of the two properties Individual tests should be used if the user has some prior knowledge of weaknesses in the VaR model. Financial Risk Forecasting 2011,2015 Jon Danielsson, page 64 of 80

Loss-function-based backtests An alternative backtest could be constructed based on a general loss function: l(var t (p),y t ) It would take into consideration the magnitude of VaR violation. An example of such a loss function is: l(y,x) = 1+(y x) 2,y x Otherwise, it takes the value zero If y indicates returns and x negative VaR, then we would typically calculate sample average loss as: ˆL = 1 W T T t=w E +1 l(y t,var t (p)) Financial Risk Forecasting 2011,2015 Jon Danielsson, page 65 of 80

Pros and cons Advantage of loss-function-based backtests The functional form of the loss function is flexible and can be tailored to address specific concerns Disadvantages of loss-function-based backtests In order to determine whether ˆL is too large relative to what is expectedan explicit assumption about P/L distribution needs to be made. A null hypothesis for ˆL should be mandatory Financial Risk Forecasting 2011,2015 Jon Danielsson, page 66 of 80

Given the uncertainty over the true P/L distribution, a finding that ˆL is too large relative to what is expected could be due to: Inaccuracy in the VaR model An inaccurately assumed distribution In other words, the test is a joint test of the VaR model and the P/L distribution. This is problematic Loss-function-based backtests may consequently be better suited for discriminating among competing VaR models than judging the absolute accuracy of a single model Financial Risk Forecasting 2011,2015 Jon Danielsson, page 67 of 80

Expected Shortfall Backtesting Financial Risk Forecasting 2011,2015 Jon Danielsson, page 68 of 80

It is harder to backtest expected shortfall ES than VaR because we are testing an expectation rather than a single quantile Fortunately, there exists a simple methodology for backtesting ES that is analogous to the use of violation ratios for VaR For days when VaR is violated, normalized shortfall NS is calculated as: NS t = y t ES t where ES t is the observed ES on day t Financial Risk Forecasting 2011,2015 Jon Danielsson, page 69 of 80

From the definition of ES, the expected Y t, VaR is violated, is: E[Y t Y t < VaR t ] ES t = 1 Therefore, average NS, denoted by NS, should be one and this forms our null hypothesis: H 0 : NS = 1 In what follows, we opted to implement just the EWMA and HS versions, since the other two (MA and GARCH) are quite similar Financial Risk Forecasting 2011,2015 Jon Danielsson, page 70 of 80

The reliability of any ES backtest procedure is likely to be much lower than that of VaR backtest procedures: With ES, we are testing whether the mean of returns on days when VaR is violated is the same as average ES on these days. Much harder to create formal tests to ascertain whether normalized ES equals one or not than the coverage tests developed above for VaR violations Hence, backtesting ES requires many more observations than backtesting VaR In instances where ES is obtained directly from VaR, and gives the same signal as VaR (i.e., when VaR is subadditive), it is better to simply use VaR Financial Risk Forecasting 2011,2015 Jon Danielsson, page 71 of 80

Problems with Backtesting Financial Risk Forecasting 2011,2015 Jon Danielsson, page 72 of 80

Backtesting assumes that there have been no structural breaks in the data throughout the sample period: But financial markets are continually evolving, and new technologies, assets, markets and institutions affect the statistical properties of market prices Unlikely that the statistical properties of market data in the 1990s are the same as today, implying that a risk model that worked well then might not work well today Financial Risk Forecasting 2011,2015 Jon Danielsson, page 73 of 80

Data mining and intellectual integrity: Backtesting is only statistically valid if we have no ex ante knowledge of the data in the testing window If we iterate the process, continually refining the risk model with the same test data and thus learning about the events in the testing window, the model will be fitted to those particular outcomes, violating underlying statistical assumptions Financial Risk Forecasting 2011,2015 Jon Danielsson, page 74 of 80

Stresstesting Financial Risk Forecasting 2011,2015 Jon Danielsson, page 75 of 80

The purpose of stresstesting is to create artificial market outcomes in order to see how risk management systems and risk models cope with the artificial event, and to assess the ability of a bank to survive a large shock The gap that stress testing aims to fill is model failure to encounter rare situations that could cause a severe loss, since backtesting relies on recent historical data Financial Risk Forecasting 2011,2015 Jon Danielsson, page 76 of 80

Scenario analysis The main aim is to come up with scenarios that are not well represented in historical data but are nonetheless possible and detrimental to portfolio performance We then revalue the portfolio in this hypothetical environment and obtain an estimate of maximum loss This procedure then allows the bank to set aside enough capital for such an eventuality Financial Risk Forecasting 2011,2015 Jon Danielsson, page 77 of 80

Examples of historical scenarios Scenario Period Stock market crash October 1987 ERM crisis September 1992 Bond market crash April 1994 Asian currency crisis Summer 1997 LTCM and Russia crisis August 1998 Global crisis 2007-2009 Eurozone crisis Since 2010 Scenarios can be classified into two broad types: Simulating shocks that have never occurred or are more likely to occur than historical data suggest Simulating shocks that reflect permanent or temporary structural breakswhere historical relationships do not hold Financial Risk Forecasting 2011,2015 Jon Danielsson, page 78 of 80

Issues in scenario analysis The results indicate that banks should set aside a large amount of capital to absorb the worst case loss, which is usually not practical But there are other actions that a bank can take such as buying insurance for the events in question and changing the composition of the portfolio to reduce exposure to a particular market variable The lack of a mechanism to reliably judge the probability of a stress test scenario since they are both highly subjective and difficult to pinpoint The potential for feedback effects, or endogenous risk since the focus is on institution-level risk, where feedback effects are disregarded Financial Risk Forecasting 2011,2015 Jon Danielsson, page 79 of 80

Scenario analysis and risk models It is straightforward to integrate scenario analysis with risk models in use. Assume a risk manager has assigned a probability ξ to a particular scenario and the potential loss arising from it. The probability can be purely a subjective judgment from the managers experience or could be derived from a model Financial Risk Forecasting 2011,2015 Jon Danielsson, page 80 of 80