Analytic Models of the ROC Curve: Applications to Credit Rating Model Validation

Size: px
Start display at page:

Download "Analytic Models of the ROC Curve: Applications to Credit Rating Model Validation"

Transcription

1 QUANTITATIVE FINANCE RESEARCH CENTRE QUANTITATIVE FINANCE RESEARCH CENTRE Research Paper 8 August 2006 Analtic Models of the ROC Curve: Applications to Credit Rating Model Validation Stephen Satchell and Wei Xia ISSN

2 Analtic Models of the ROC Curve: Applications to Credit Rating Model Validation Steve Satchell Facult of Economics and Politics Universit of Cambridge Wei Xia Birkbeck College Universit of London August,2006. Abstract: In this paper, the authors use the concept of the population ROC curve to build analtic models of ROC curves. Information about the population properties can be used to gain greater accurac of estimation relative to the non-parametric methods currentl in vogue. If used properl this is particularl helpful in some situations where the number of sick loans is rather small; a situation frequentl met in periods of benign macro-economic background. Kewords: Validation, Credit Analsis, Rating Models, ROC, Basel II Acknowledgement: Wei Xia would like to thank Birkbeck College for the generous funding support and Ron Smith for helpful comments. Introduction

3 The Internal Rating-Based (IRB) credit risk modelling approach is now allowed for qualified banks in their risk modelling and economical capital calculation, after the Basel Committee on Banking Supervision (BCBS) published the International Convergence of Capital Measurement and Capital Standards: A Revised Framework, more commonl known as Basel II, in June One of the important components of most IRB risk models is the rating sstem used for transforming and assigning the Probabilit of Default (PD) for each obligor in the credit portfolio. Banks and public rating agencies have developed a variet of rating methodologies in the last three decades. Therefore, questions arise as to which of these methodologies deliver acceptable discriminator power between the defaulting and non-defaulting obligor e ante, and which methodologies are to be preferred for different obligor sub-groups. It has become increasingl important for both the regulator and the banks to quantif and judge the qualit of rating sstems. This concern is reflected and stressed in the recent BCBS working paper No.4 (2005). This summarizes a number of statistical methodologies for the assessment of discriminator power which have been suggested in the literature. For eample, Cumulative Accurac Profile (CAP), Receiver Operating Characteristic (ROC), Baesian error rate, Conditional Information Entrop Ratio (CIER), Kendall s τ and Somers D, Brier score inter alia. Among those methodologies, the most popular ones are CAP and its summar inde, the Accurac Ratio (AR) as well as ROC and its summar inde; this inde is called the Area Under the ROC (AUROC). It is worth noting that, unlike some other measures that do not take sample size into account and are therefore substantiall affected b statistical errors, the CAP and the ROC measures eplicitl account for the size of the default sample and thus can be used for direct rating model comparison. A detailed eplanation of the CAP is presented in Sobehart, Keenan and Stein (2000), and Sobehart and Keenan (2004). ROC has been long used in medicine, pscholog and signal detection theor. There is a large bod of literature that analses the properties of the ROC curve. Bamber (975) shows the AUROC is related to the 2

4 Mann-Whitne Statistic and several different methods for constructing the confidence intervals are also discussed. An overview of possible applications of the ROC curves is given b Swets (988). Sobehart and Keenan (200) introduced the ROC concept to internal rating model validation. The focus on the calculation and interpretation of the ROC measure. Engelmann, Haden and Tasche (2003) showed that AR is a linear transformation of AUROC and the complement Sobehart and Keenan (200) with more statistical analsis of the ROC. However, previous works that we are familiar with, have been done using a finite sample of empirical data or simulated data. None of the above have analzed the analtic properties of the ROC Curve and the ROC measure under parametric assumptions concerning the distribution of the rating scores. In this paper, we further eplore the statistical properties of the ROC Curve and its summar indices, especiall under a number of rating score distribution assumptions. We focus on the analtical properties of ROC Curve onl since the CAP measure is just a linear transformation of the ROC measure. In section 2, in order to keep this paper self-contained, we first briefl introduce the credit rating model validation background and eplain the concepts and definitions of ROC and CAP. A general equation for the ROC Curve is derived. B assuming that there eists probabilit densit functions of the two variables that construct the ROC Curve, an unrestrictive assumption, we show that there is a link between the first derivative of the curve and the likelihood ratio of the two variables, a result derived b different methods in Bamber(975). In section 3, b further assuming certain statistical distributions for the credit rating scores, we derive analtic solutions for the ROC Curve and its summar indices respectivel. In particular, when the underling distributions are both Negative Eponential distributions, we have a closed form solution. 3

5 In section 4, we appl the results derived in section 3 to simulated data. Performance evaluation reports are presented. Section 5 concludes: We find that estimation results b our analtic approach are as good as and, in man cases, better than the non-parametric AUROC ones. Although the accurac of the approach in this paper is limited b the continuous rating score assumption and also affected b the accurac of estimation of the distribution parameters on rating score samples in some cases, it offers direct insight into more comple situations and a better tool in credit rating model selection procedure since the analtic solution can be used as object function. 2 Theoretical Implication and Applications: In this section, we first briefl review the credit rating sstem methodolog, in particular, the CAP and the ROC measures. The content presented in this part is ver similar to Engelmann, Haden and Tasche (2003) and BCBS working paper No.4 (2005). Then we introduce the Ordinar Dominance Graph (ODG) where ROC is a special case of ODG and some interesting theoretical implications of the ROC curve will be given. 2. The validation of credit rating sstem. The statistical analsis of rating models is based on the assumption that for a predefined time horizon there are two groups of a bank s obligor: obligors that will be in default, called defaulters, and obligors that will not be in default, called non-defaulters. It is not observable in advance whether an obligor is a defaulter or a non-defaulter in the net time horizon. Banks have loan books or credit portfolio, the have to assess an obligor s future status based on a set of his or her present observable characteristics. Rating sstems ma be regarded as classification tools to provide signals and indications of the obligor s possible future status. A rating score 4

6 is returned for each obligor based on a rating model, usuall an Epert Judgment Model. The main principal of rating sstems is that the better a grade, the smaller the proportion of defaulters and the greater the proportion of non-defaulters that are assigned this grade. Some eamples are the famous Altman s Z score, or some score from a Logit model or from other approach. Therefore, the qualit of a rating sstem is determined b its discriminator power between non-defaulting obligors and defaulters e ante for a specific time horizon, usuall a ear. The CAP measure and ROC provide statistical measures to assess the discriminator power of various rating models based on historical data. 2.2 Cumulative Accurac Profile and Accurac Ratio Consider an arbitrar rating model that produces a rating score. A high rating score is usuall an indicator of a low default probabilit. To obtain the CAP curve, all debtors are first ordered b their respective scores, from riskiest to safest, i.e. from the debtor with the lowest score to the debtor with the highest score. For a given fraction of the total number of debtors the CAP curve is constructed b calculating the percentage d() of the defaulters whose rating scores are equal to or lower than the maimum score of fraction. This is done for ranging from 0% to 00%. Figure illustrates CAP curves. 5

7 Figure. Cumulative Accurac Profile Fraction of defaulters a P a R Perfect Model Rating Model Random Model 0 Fraction of all obligors Apparentl a perfect rating model will assign the lowest scores to the defaulters. In this case the CAP is increasing linearl and than staing at one. For a random model without an discriminative power, the fraction of all debtors with the lowest rating scores will contain percent of all defaulters. Real rating sstem lies somewhere in between these two etremes. The qualit of a rating sstem is measured b the Accurac Ratio (AR). It is defined as the ratio of the area ar between the CAP of the rating model being validated and the CAP of the random model, and the area a P between the CAP of the perfect rating model and the CAP of the random model. a AR = a R P It is eas to see that for real rating models the AR range from zero to one and the rating model is the better if AR is closer to one. 2.3 Receiver Operating Characteristic and the Area Under the ROC curve The construction of a ROC curve is illustrated in Figure 2 which shows possible 6

8 distributions of rating scores for defaulting and non-defaulting debtors. For a perfect rating model the left distribution and the right distribution in Figure 2 would be separate. For real rating sstems, perfect discrimination in general is not possible. Both distributions will overlap as illustrated in Figure 2. Figure 2: Distribution of rating scores for defaulting and non-defaulting debtors Assume someone has to find out from the rating scores which debtors will survive during the net period and which debtors will default. One possibilit for the decision-maker would be to introduce a cut-off value C as in Figure 2, and to classif each debtor with a rating score lower than C as a potential defaulter and each debtor with a rating score higher than C as a non-defaulter. Then four decision results would be possible. If the rating score is below the cut-off value C and the debtor defaults subsequentl, the decision was correct. Otherwise the decision-maker wrongl classified a non-defaulter as a defaulter. If the rating score is above the cut-off value and the debtor does not default, the classification was correct. Otherwise a defaulter was incorrectl assigned to the non-defaulters group. Then one can define a hit rate HR(C) as: ( ) HR C ( ) H C = N where H(C) is the number of defaulters predicted correctl with the cut-off value C, D 7

9 and N D is the total number of defaulters in the sample. This means that the hit rate is the fraction of defaulters that was classified correctl for a given cut-off value C. The false alarm rate then FAR(C) is defined as: ( ) FAR C ( ) F C = N where F(C) is the number of false alarms, i.e. the number of non-defaulters that were classified incorrectl as defaulters b using the cut-off value C. The total number of non-defaulters in the sample is denoted b N. In Figure 2, HR(C) is the area to the left of the cut-off value C under the score distribution of the defaulters (coloured plus hatched area), while FAR(C) is the area to the left of C under the score distribution of the non-defaulters (chequered plus hatched area). ND ND The ROC curve is constructed as follows. For all possible cut-off values C that are contained in the range of the rating scores the quantities HR(C) and FAR(C) are computed. The ROC curve is a plot of HR(C) versus FAR(C). This is illustrated in Figure 3. A rating model s performance is the better the steeper the ROC curve is at the left end and the closer the ROC curve s position is to the point (0,). Similarl, the model is the better the larger the area under the ROC curve is. This area is called AUROC and is denoted b A. B means of a change of variable, it can be calculated as A = 0 HR( FAR) d( FAR) The area A is 0.5 for a random model without discriminative power and it is.0 for a perfect model. It is between 0.5 and.0 for an reasonable rating model in practice. 8

10 Figure 3: Receiver Operating Characteristic Curves. (Taken from Tasche. (2003)) It has been shown in Engelmann, Haden and Tasche (2003) that: ( A 0.5) a N R ND AR= = = 2( A 0.5) = 2A a 0.5N P ND 2.4 Some further statistical properties of ROC measures The ROC stems from the Ordinal Dominance Graph (ODG). Assume we have two sets of continuous random variables, X and Y. Let C be an arbitrar constant. Define: ( ) ( ) and ( ) ( ) = prob Y C = F C = prob X C = F C Y, where and lie between [0, ] and C lies in (, + ). Then the ODG is simpl a plot of against. See Figure 4. There are some straight forward properties of the ODG: ) The ODG curve is never decreasing as increases cannot decrease. 2) If Prob(Y C) = Prob(X C), then and are identicall distributed, = and the ODG curve is a 45 line. X 9

11 3) If X first order stochastic dominance (FSD) Y, then the ODG curve lies above the 45 line and vice versa. Proof: ( ) ( ) ( ) If X FSD Y F C F C, C X, lies above the 45 line Y If we regard as score signals of defaulters in the net predefined period and as those of the non-defaulters, then we epect an sensible rating sstem produces Prob(Y C) Prob(X C) for all C. Thus for all C and the ODG curve is above the 45 line. It is referred to as the ROC curve in the risk literature. In terms of section 2.3, is the HR(C) and is the FAR(C). B assuming there eists probabilit densit functions (PDF) of F X and F Y, i.e. that the are both absolutel continuous, the following lemma can be derived: Lemma : If = F ( C), C F ( ) X C = X, then = f C X ( ), where f ( ) X C is the PDF of X.. Proof: ( ) ( ) FX C FX C C C C = =. = fx ( C), = C f C X ( ) Lemma 2: fy C = Y, then = f C If F ( C) X ( ) ( ). We see from Lemma 2 that the slope of the ODG curve is just the likelihood ratio (LR) of Y and X evaluated at C. Proof: ( ) ( ) ( ) ( ) FY C FY C C C fy C = =. = fy ( C). = C f C X 0

12 Theorem: If If f ( C) ( ) Y f X C is increasing in C, then f ( C) ( ) Y f X C is decreasing in C, then is increasing the ODG curve is conve. is decreasing the ODG curve is concave. The later case is the one that we are interested in, as it is the ROC curve. See Figure 4. Figure 4: Ordinal Dominance Graph Concave Conve 0 We have assumed that X and Y have PDF s, thus the are continuous random variables. The AUROC can then be epressed as: A = Prob( ) = Prob ( = ) Prob( = ) Y X X C X C dc Since X and Y are scores from different obligor groups, the are independent. We have: Prob ( Y X X = C) = Prob( Y C). Since = F ( C) = Prob( Y C) and ( ) Y = f C C A = FY ( C) fx ( C) dc= Y X ( ) 0 A modelling eercise ma choose a rating model that maimizes the AUROC with X ( ) F F d ()

13 respect to the obligor group under review. But how would one estimate Prob( )? Bamber(975) and Engelmann, Haden and Tasche (2003) show that given a rating scores sample of the obligors assigned b a rating model, the AUROC could be estimated in an non-parametric wa using the Mann-Whitne U statistic. On the other hand, if we had a parametric distribution model for X and Y, then we could eplicitl derive a formula for the ROC curve and for the AUROC. In the net section, we will review some plausible distributions for X and Y and derive the close form solutions wherever possible. 3 Choices of Distributions In this section, we derive analtical formulae for the ROC b assuming that the rating scores produced b rating models follow some plausible distributions. The distributions we presented here are Weibull Distribution (including Eponential Distribution), Logistic Distribution, Normal Distribution and Mied models for X and Y respectivel. In the cases that we have eplicit closed forms for the ROC curve, we derive the closed form AUROC as well. The case of mied distributions for X and Y can be easil etended from the discussion followed. We use the smbol M for the location parameters (sometimes the mean, sometimes the minimum), λ for the scale parameter and α for the shape parameter in the distribution functions where appropriate. 3. Weibull Distribution We first present solutions under a Weibull Distribution assumption of rating scores. The Weibull Distribution is fleible and rich. A three parameter Weibull distribution cumulative probabilit function (CDF) is: 2

14 ( ) ( ) α z M λ F z P Z z = e, where z>m, α >0 and λ > 0 The inverse CDF of a three parameter Weibull Distribution is: ( ) ( ) = + λ ln F p M p α, where p [ 0,]. Assuming is the HR(C) and is the FAR(C), the three-parameter α C M Weibull Distribution ROC is derived as: λ ( ) λ ln( ) = FX C = e C = M + Y ( ) ( ) = F C = P Y C = e α C M λ α λ λ ROC : = ep ln( α ) + ( M M ) α (2) The above ROC formula is ver difficult to use to deduce an eplicit closed form formula for the AUROC, although a numerical solution eists in this case once all the parameters are estimated. However, the situation becomes much better if we impose a slightl stronger assumption that the shape parameters α of the F X and F Y are equal to one. We then have analtical closed form solutions in this case and the Weibull distribution degenerates to a truncated Eponential Distribution Famil if M is positive and an etended Eponential if M is negative. Assume: α = α =, we then rewrite the above equation as following: ROC : = λ ( ) M M e λ λ Let M M λ =, θ K e λ =, then λ M M ( θ ) K λ λ AUROC = K d = = e, (3) 0 + θ λ + λ We net discuss some of the properties of equation 3. 3

15 Propert. M (,0) M for an plausible rating sstem. The smaller is M M (or the larger is M M ), the closer is the AUROC to one. Recall that M is the location parameter, in this case the minimum. Therefore, the rating sstem will receive a higher AUROC if it can better discriminate the defaulter from the non-defaulter b the difference in their minimum values. λ It is also interesting to see that if M M 0, AUROC. As we λ + λ illustrated in earl section that AUROC of a plausible rating sstem is above 0.5 if it is not a random selection sstem. This implies that the value of the scale parameters that the rating scores being assigned have to be such that 0 < λ λin this case. Note that this condition is implied b both groups being eponential but also b both groups being truncated or etended eponentials with the same minima. Propert 2. AUROC is monotonicall increasing with respect to λ, but monotonicall decreasing with respect to λ. Proof: d K AUROC = M 2 M dλ λ + λ λ λ + λ λ λ + 2λ ( )( ) ( ) ( ) ( M M)( λ λ) λ( λ λ) ( λ λ)( M M λ) < + < Since plausible rating sstems, we epect M M, K 0 and λ > 0 d AUROC 0 dλ 4

16 3.2 Logistic Distribution A two parameter Logistic distribution CDF is: z M λ e F( z) P( Z z) = + e z M λ, where z and λ > 0. Here M is a mean parameter. The inverse CDF of a two parameter Logistic Distribution is: p ( ) = + λ ln p, F p M where p [ 0,]. Again assuming is the HR(C) and is the FAR(C), we have: C M = X ( ) = λ e = + λln C M λ F C C M + e and M M C M λ λ e e Y ( ) Y ( X ( )) C M M M λ λ λ = F C = F F = = + e + e Similar to the Weibull Distribution case, the AUROC with the above ROC specification can alwas be evaluated numericall. Moreover, b assuming λ = λ =, and in what follows assume that K does not equal, K = e, the above ROC equation can be simplified to M M e K = = M M + e + ( K ) The AUROC can be now derived analticall. ( ) ( K ) ( ) K AUROC = K d = d 0+ K K 0+ K λ λ λ M M 5

17 Let = + ( ) u K K K K lnk AUROC = du = lim AUROC = u K K K ( ) 2 K 3.3 Normal Distribution A two parameter Normal distribution CDF is: 2 M 2 λ ( ) ( ) z z F z P Z z e d M = =Φ λ 2π λ, where z and the Φ (). is the standard Normal probabilit distribution function.. The inverse CDF of a two parameter Logistic Distribution is: F ( p) M λ ( p) where p [ 0,]. For is the HR(C) and is the FAR(C), we have: C M =Φ = + Φ ( ) C M λ λ C M ( M ) M + λφ ( ) =Φ =Φ λ λ Therefore, AUROC ( M M ) + λφ ( ) = Φ 0 λ, which gives the ROC curve. d This function can be easil evaluated numericall to obtain the AUROC. = + Φ, Propert AUROC increases with M M and in particular if M M = 0, and λ = Proof: λ.,auroc = 0.5 ( M M ) + λφ ( ) λ ( ) AUROC = Φ d = Φ Φ d = d = λ 0 0 λ The above propert is also quite intuitive. If the means and variances of two normal distributed 6

18 populations equal each other,the distributions are equal and then overall there is no discriminator power of the models based on this rating mechanism. So the AUROC is Mied Models It is obvious that as long as we have parametric distribution families for the defaulter and non-defaulters, one can alwas calculate an AUROC for the two score samples from equation () in section 2 even with two different parametric distributions for the two populations respectivel. 4. Performance Evaluation on the AUROC Estimation with Simulated Data In this section we carr out a performance evaluation on AUROC estimations b the non-parametric Mann-Whitne Statistic and the analtic approach suggested in this paper respectivel with simulated data. We first assume known parametric distributions for the credit scores of defaulters and non-defaulters. B doing this we would know the value of the theoretical AUROC. After generating simulated sample data from the assumed distributions for defaulter score and non-defaulter s, we estimate the AUROC and its confidence interval (CI) b the above two approaches on the simulated samples. We repeat the simulation and estimation procedure a number of times. We then compare the accurac of the AUROC estimation and the CI of the two approaches. Finall, we change the parameter values of the assumed distribution and repeat the simulation. We repeat the above procedures to evaluate the performance of the two approaches subject to different theoretical AUROC inde values with different sample sizes of defaulters. We choose two-parameter Normal Distributions, one-parameter Eponential Distributions and Weibull Distributions with various shape and scale parameters. 7

19 4. Performance evaluations under Normal Distribution Assumption Normal Distributions are assumed as our parametric distributions of the credit scores of the defaulters and the credit scores of the non-defaulters in this performance evaluation. The theoretical value of AUROC for the normal score samples is evaluated numericall. The non-parametric estimate of AUROC is carried out using the ROC module in SPSS and we use the bootstrap to re-sample 000 replications to obtain the estimates of the analtic approach which also generates a two-sided 95% CI. The parameters of the parametric distribution are estimated for each replication and substituted back to the analtic AUROC formula. We then define error as the difference between model estimates based on a sample and the theoretical AUROC value, and compare the mean error and mean absolute error for the two approaches. The width of the confidence interval is also compared. We generate 50 normal samples from si different settings respectivel. Settings, 2 and 3, consisting of Group, target the AUROC at a low value, while settings 4, 5 and 6, Group 2, target the AUROC at a high value. Within each group the var in defaulter s sample size, which ranges at 20, 00 and 500. Credit rating models can be applied to at least three different tpes of groups: credit risk with Corporate, counter part default risk in Trading Books, and credit risk in credit card and other loan tpe Banking Books. The default sample of Corporate is usuall small, such as 50 in ten ears, especiall under a good economic ccle. Meanwhile, the number of defaults in a loan book or a credit card book in a commercial bank s banking book can be fairl large, possibl in ecess of several hundreds. The reason for selecting different defaulter sample sizes in the test is to assess for which tpe of problem the analtic approach outperforms. We define a performance statistic as follows: (Ratio to N) = Difference / ( Non-Parametric Estimate). Normal Setting -3: Normal Distributions Setting Setting 2 Setting 3 Sample Mean Standard # of # of # of 8

20 deviation Observation Observation Observation X Y Theoretical AUROC= Report on estimation error with 50 simulated normal samples & 000 replication bootstrap Setting Approach Non-parametric Analtic Difference Ratio to N Mean Error Mean ABS Error % Mean CI Width % Setting 2 Approach Non-parametric Analtic Difference Ratio to N Mean Error Mean ABS Error % Mean CI Width % Setting 3 Approach Non-parametric Analtic Difference Ratio to N Mean Error Mean ABS Error % Mean CI Width % Normal Setting 4-6: Normal Distributions Setting 4 Setting 5 Setting 6 Sample Mean Standard # of # of # of deviation Observation Observation Observation X Y Theoretical AUROC= Report on estimation error with 50 simulated normal samples & 000 replication bootstrap Setting 4 Approach Non-parametric Analtic Difference Ratio to N Mean Error Mean ABS Error % Mean CI Width % 9

21 Setting 5 Approach Non-parametric Analtic Difference Ratio to N Mean Error Mean ABS Error % Mean CI Width % Setting 6 Approach Non-parametric Analtic Difference Ratio to N Mean Error Mean ABS Error % Mean CI Width % In tables -6, all mean confidence interval widths show that the estimates of the analtic approach are better then the non-parametric estimates. As for the mean error and the mean absolute error, analtic estimates outperform the non-parametric estimates in tables, 2 and 4-6. (Ration to N) shows the percentage of the difference out of the non-parametric approach estimate. The larger the (Ration to N), the more the analtic approach outperform the non-parametric approach. 20

22 4.2Performance evaluations under Eponential Distribution Assumption Eponential Distributions are assumed as our parametric distribution of the credit scores of the defaulters and the credit scores of the non-defaulters in this performance evaluation. The theoretical value of AUROC for the Eponential score samples is evaluated analticall b the closed form formula (3) in section 3.. The performance evaluation setting is ver similar to the Normal Distribution one. There are 6 settings across different AUROC value and defaulter sample size as well. Eponential Setting -3: Normal Distributions Setting Setting 2 Setting 3 Scale Parameter # of Sample (Lamda) Observation # of Observation # of Observation X Y Theoretical AUROC= Report on estimation error with 50 simulated normal samples & 000 replication bootstrap Setting Approach Non-parametric Analtic Difference Ratio to N Mean Error Mean ABS Error % Mean CI Width % Setting 2 Approach Non-parametric Analtic Difference Ratio to N Mean Error Mean ABS Error % Mean CI Width % Setting 3 Approach Non-parametric Analtic Difference Ratio to N Mean Error Mean ABS Error % Mean CI Width % 2

23 Eponential Setting 4-6: Normal Distributions Setting Setting 2 Setting 3 Scale Parameter # of Sample (Lamda) Observation # of Observation # of Observation X Y Theoretical AUROC= Report on estimation error with 50 simulated normal samples & 000 replication bootstrap Setting 4 Approach Non-parametric Analtic Difference Ratio to N Mean Error Mean ABS Error % Mean CI Width % Setting 5 Approach Non-parametric Analtic Difference Ratio to N Mean Error Mean ABS Error % Mean CI Width % Setting 6 Approach Non-parametric Analtic Difference Ratio to N Mean Error Mean ABS Error % Mean CI Width % In table -6, all the mean absolute error and the mean confidence interval widths show that the estimates of the analtic approach are better then the non-parametric estimates. (Ration to N) shows that the non-parametric approach estimates provide a significantl better confidence interval than the non-parametric estimates. 22

24 4.3Performance evaluations under Weibull Distribution Assumption Weibull Distributions with scale and shape parameters are assumed as our parametric distribution of the credit scores of the defaulters and the credit scores of the non-defaulters in this performance evaluation. The theoretical value of AUROC for the Weibull score samples is evaluated analticall b the closed form formula (2) in section 3. with the location parameters set to zero. The maimum likelihood estimations of sample distribution parameters are obtained b a numerical approimation. Since we have a shape parameter for Weibull Distribution, which ma shift the shape of the distribution significantl, we evaluate the performance of the two approaches under two cases: with the same shape parameter for defaulter and non-defaulter sample, and with different shape parameters. The theoretical value of AUROC for the normal score samples is evaluated numericall as well 2. The rest of the performance evaluation setting is ver similar to the Normal Distribution one. There are 6 settings across different AUROC values and defaulter sample sizes as well. 23

25 Weibull Setting -3: Normal Distributions Setting 4 Setting 5 Setting 6 Sample Shape # of # of # of Scale parameter Observation Observation Observation X Y Theoretical AUROC= Report on estimation error with 50 simulated normal samples & 000 replication bootstrap Setting Approach Non-parametric Analtic Difference Ratio to N Mean Error Mean ABS Error % Mean CI Width % Setting 2 Approach Non-parametric Analtic Difference Ratio to N Mean Error Mean ABS Error % Mean CI Width % Setting 3 Approach Non-parametric Analtic Difference Ratio to N Mean Error Mean ABS Error % Mean CI Width % 24

26 Weibull Setting 4-6: Normal Distributions Setting 4 Setting 5 Setting 6 Sample Shape # of # of # of Scale parameter Observation Observation Observation X Y Theoretical AUROC= 0.75 Report on estimation error with 50 simulated normal samples & 000 replication bootstrap Setting 4 Approach Non-parametric Analtic Difference Ratio to N Mean Error Mean ABS Error % Mean CI Width % Setting 5 Approach Non-parametric Analtic Difference Ratio to N Mean Error Mean ABS Error % Mean CI Width % Setting 6 Approach Non-parametric Analtic Difference Ratio to N Mean Error Mean ABS Error % Mean CI Width % In tables -6, all mean confidence interval widths show that the estimates of the analtic approach are marginall better then the non-parametric estimates. As for the mean error and the mean absolute error, analtic estimates marginall outperform the non-parametric estimates in tables 2, 3, 5 and 6. Because we use numerical approimation for sample maimum likelihood estimates and the estimation error could be fairl large when we have a small sample, we observe this estimation error is passed through our analtic estimation for the AUROC inde, which made the mean 25

27 absolute errors estimated from the analtic approach are large than the non-parametric approach in setting and 3. This also reduces the gain of the analtic approach over the non-parametric approach in the Weibull Distribution case when comparing with the previous tests. Summar: Although the analtic approach gives no better estimates than non-parametric one when we use approimated maimum likelihood estimates for small samples, the performance evaluation shows that the analtic approach works at least as well as the non-parametric approach in above tests and provides better estimate in terms of mean absolute error estimates and confidence interval estimates in most cases. The above discussion has the following implications. If one can identif some appropriate parametric distributions to the scores of defaulter and non-defaulter, then one could estimate the AUROC and its confidence interval more accuratel b the analtic approach. On the other hand, if we can design the rating model so that the score sample is generated b some specific parametric distribution families, then we are able to find a better rating model b using the analtic AUROC as the objective function to maimize in the model selection process. Another interesting eperiment which has not been conducted in previous research is the effect of defaulter s sample size on AUROC. The above eperiments clearl show the level of estimation error in both methods with different sample sizes and the resulting error can be large if we onl have a small defaulter sample. In addition, although not ver clear but from the results in section 4. and 4.2, the analtic approach provides more gain over non-parametric approach when AUROC inde is in its high value region than low value region. The reason for this is not clear, we hope to investigate this in future research. 26

28 5 Conclusions This paper reviewed some of the prevailing credit rating model validation approaches and, in particular, studied the analtic properties of the ROC curve and its summar inde AUROC. We use the concept of the population ROC curve to build analtic models of ROC curves. It has been shown through simulation studies that greater accurac of estimation relative to the non-parametric methods can be achieved. We also show that there are some situations that the accurac gain of analtic ROC model ma decrease, which should be taken into account when appling the analtic models to practical applications. Moreover, with some particular distributions, where the closed form solution of AUROC is available, analtic AUROC can be directl used as an objective function to maimize during the rating model selection procedure. This means if we can transform the rating scores into those distributions, analtic AUROC could offer a powerful model selection tool. Finall, we also studied the performance of both non-parametric and analtic ROC models under different assumptions for the sample size of defaulters.. The magnitude of error size can be significant when we have a small sample on defaulters, which is a frequentl met situation in corporate credit risk stud and in periods of benign macro-economic background. 27

29 Appendi A: Non-parametric ROC curve for Normall Distributed Samples Non-parametric ROC curve under setting : Non-parametric ROC curve under setting 2: Non-parametric ROC curve under setting 3: Non-parametric ROC curve under setting 4: Non-parametric ROC curve under setting 5: Non-parametric ROC curve under setting 6: For eample, ROC curve Vi is plotted using the data of simulated sample number i generated under a specified setting. 28

30 Appendi A2: Non-parametric ROC curve for Eponentiall Distributed Samples Non-parametric ROC curve under setting : Non-parametric ROC curve under setting 2: Non-parametric ROC curve under setting 3: Non-parametric ROC curve under setting 4: Non-parametric ROC curve under setting 5: Non-parametric ROC curve under setting 6: For eample, ROC curve Vi is plotted using the data of simulated sample number i generated under a specified setting. 29

31 Appendi A3: Non-parametric ROC curve for Weibull Distributed Samples Non-parametric ROC curve under setting : Non-parametric ROC curve under setting 2: Non-parametric ROC curve under setting 3: Non-parametric ROC curve under setting 4: Non-parametric ROC curve under setting 5: Non-parametric ROC curve under setting 6: For eample, ROC curve Vi is plotted using the data of simulated sample number i generated under a specified setting. 30

32 References: Bamber, Donald (975), The Area above the Ordinal Dominance Graph and the Area below the Receiver Operating Characteristic Graph. Journal of Mathematical Pscholog 2, p Basel Committee on Banking Supervision (BCBS) (2004), International Convergence of Capital Measurement and Capital Standards: A Revised Framework. Bank for International Settlements, June. Basel Committee on Banking Supervision (BCBS) (2005) Working paper No.4 Blochwitz, Stefan, Stefan Hohl and Carsten Wehn (2003), Reconsidering Ratings. Unpublished Working Paper. Engelmann, Bernd, Eveln Haden, and Dirk Tasche, (2003), Testing Rating Accurac. Risk, Januar 2003, p Sobehart, Jorge R and Sean C Keenan (200), Measuring Default Accuratel, Risk Magazine, March 200, p Sobehart, Jorge R and Sean C Keenan (2004), Performance Evaluation for Credit Spread and Default Risk Models. In: Credit Risk: Models and Management. David Shimko (ed.). Second Edition. Risk Books: London, p Sobehart JR, Sean C Keenan and R Stein (2000), Validation methodologies for default risk models, Credit (4), p Swets JA, (988), Measuring the accurac of diagnostic sstems, Science 240, p The theoretical AUROC is approimated b partitions, while the bootstrap estimation is approimated b 0000 partitions. 2 The theoretical AUROC is approimated b partitions, while the bootstrap estimation is approimated b 0000 partitions. 3

LESSON EIII.E EXPONENTS AND LOGARITHMS

LESSON EIII.E EXPONENTS AND LOGARITHMS LESSON EIII.E EXPONENTS AND LOGARITHMS LESSON EIII.E EXPONENTS AND LOGARITHMS OVERVIEW Here s what ou ll learn in this lesson: Eponential Functions a. Graphing eponential functions b. Applications of eponential

More information

Implicit Differentiation

Implicit Differentiation Revision Notes 2 Calculus 1270 Fall 2007 INSTRUCTOR: Peter Roper OFFICE: LCB 313 [EMAIL: [email protected]] Standard Disclaimer These notes are not a complete review of the course thus far, and some

More information

The Big Picture. Correlation. Scatter Plots. Data

The Big Picture. Correlation. Scatter Plots. Data The Big Picture Correlation Bret Hanlon and Bret Larget Department of Statistics Universit of Wisconsin Madison December 6, We have just completed a length series of lectures on ANOVA where we considered

More information

Solving Quadratic Equations by Graphing. Consider an equation of the form. y ax 2 bx c a 0. In an equation of the form

Solving Quadratic Equations by Graphing. Consider an equation of the form. y ax 2 bx c a 0. In an equation of the form SECTION 11.3 Solving Quadratic Equations b Graphing 11.3 OBJECTIVES 1. Find an ais of smmetr 2. Find a verte 3. Graph a parabola 4. Solve quadratic equations b graphing 5. Solve an application involving

More information

So, using the new notation, P X,Y (0,1) =.08 This is the value which the joint probability function for X and Y takes when X=0 and Y=1.

So, using the new notation, P X,Y (0,1) =.08 This is the value which the joint probability function for X and Y takes when X=0 and Y=1. Joint probabilit is the probabilit that the RVs & Y take values &. like the PDF of the two events, and. We will denote a joint probabilit function as P,Y (,) = P(= Y=) Marginal probabilit of is the probabilit

More information

Exponential and Logarithmic Functions

Exponential and Logarithmic Functions Chapter 6 Eponential and Logarithmic Functions Section summaries Section 6.1 Composite Functions Some functions are constructed in several steps, where each of the individual steps is a function. For eample,

More information

Functions and Graphs CHAPTER INTRODUCTION. The function concept is one of the most important ideas in mathematics. The study

Functions and Graphs CHAPTER INTRODUCTION. The function concept is one of the most important ideas in mathematics. The study Functions and Graphs CHAPTER 2 INTRODUCTION The function concept is one of the most important ideas in mathematics. The stud 2-1 Functions 2-2 Elementar Functions: Graphs and Transformations 2-3 Quadratic

More information

More Equations and Inequalities

More Equations and Inequalities Section. Sets of Numbers and Interval Notation 9 More Equations and Inequalities 9 9. Compound Inequalities 9. Polnomial and Rational Inequalities 9. Absolute Value Equations 9. Absolute Value Inequalities

More information

2.7 Applications of Derivatives to Business

2.7 Applications of Derivatives to Business 80 CHAPTER 2 Applications of the Derivative 2.7 Applications of Derivatives to Business and Economics Cost = C() In recent ears, economic decision making has become more and more mathematicall oriented.

More information

Section 7.2 Linear Programming: The Graphical Method

Section 7.2 Linear Programming: The Graphical Method Section 7.2 Linear Programming: The Graphical Method Man problems in business, science, and economics involve finding the optimal value of a function (for instance, the maimum value of the profit function

More information

1 Maximizing pro ts when marginal costs are increasing

1 Maximizing pro ts when marginal costs are increasing BEE12 Basic Mathematical Economics Week 1, Lecture Tuesda 12.1. Pro t maimization 1 Maimizing pro ts when marginal costs are increasing We consider in this section a rm in a perfectl competitive market

More information

SECTION 5-1 Exponential Functions

SECTION 5-1 Exponential Functions 354 5 Eponential and Logarithmic Functions Most of the functions we have considered so far have been polnomial and rational functions, with a few others involving roots or powers of polnomial or rational

More information

Chapter 13 Introduction to Linear Regression and Correlation Analysis

Chapter 13 Introduction to Linear Regression and Correlation Analysis Chapter 3 Student Lecture Notes 3- Chapter 3 Introduction to Linear Regression and Correlation Analsis Fall 2006 Fundamentals of Business Statistics Chapter Goals To understand the methods for displaing

More information

Downloaded from www.heinemann.co.uk/ib. equations. 2.4 The reciprocal function x 1 x

Downloaded from www.heinemann.co.uk/ib. equations. 2.4 The reciprocal function x 1 x Functions and equations Assessment statements. Concept of function f : f (); domain, range, image (value). Composite functions (f g); identit function. Inverse function f.. The graph of a function; its

More information

Mathematics Placement Packet Colorado College Department of Mathematics and Computer Science

Mathematics Placement Packet Colorado College Department of Mathematics and Computer Science Mathematics Placement Packet Colorado College Department of Mathematics and Computer Science Colorado College has two all college requirements (QR and SI) which can be satisfied in full, or part, b taking

More information

Zero and Negative Exponents and Scientific Notation. a a n a m n. Now, suppose that we allow m to equal n. We then have. a am m a 0 (1) a m

Zero and Negative Exponents and Scientific Notation. a a n a m n. Now, suppose that we allow m to equal n. We then have. a am m a 0 (1) a m 0. E a m p l e 666SECTION 0. OBJECTIVES. Define the zero eponent. Simplif epressions with negative eponents. Write a number in scientific notation. Solve an application of scientific notation We must have

More information

1. a. standard form of a parabola with. 2 b 1 2 horizontal axis of symmetry 2. x 2 y 2 r 2 o. standard form of an ellipse centered

1. a. standard form of a parabola with. 2 b 1 2 horizontal axis of symmetry 2. x 2 y 2 r 2 o. standard form of an ellipse centered Conic Sections. Distance Formula and Circles. More on the Parabola. The Ellipse and Hperbola. Nonlinear Sstems of Equations in Two Variables. Nonlinear Inequalities and Sstems of Inequalities In Chapter,

More information

1.6. Piecewise Functions. LEARN ABOUT the Math. Representing the problem using a graphical model

1.6. Piecewise Functions. LEARN ABOUT the Math. Representing the problem using a graphical model . Piecewise Functions YOU WILL NEED graph paper graphing calculator GOAL Understand, interpret, and graph situations that are described b piecewise functions. LEARN ABOUT the Math A cit parking lot uses

More information

INVESTIGATIONS AND FUNCTIONS 1.1.1 1.1.4. Example 1

INVESTIGATIONS AND FUNCTIONS 1.1.1 1.1.4. Example 1 Chapter 1 INVESTIGATIONS AND FUNCTIONS 1.1.1 1.1.4 This opening section introduces the students to man of the big ideas of Algebra 2, as well as different was of thinking and various problem solving strategies.

More information

Classifying Solutions to Systems of Equations

Classifying Solutions to Systems of Equations CONCEPT DEVELOPMENT Mathematics Assessment Project CLASSROOM CHALLENGES A Formative Assessment Lesson Classifing Solutions to Sstems of Equations Mathematics Assessment Resource Service Universit of Nottingham

More information

Estimating the Mean and Variance of. Activity Duration in PERT

Estimating the Mean and Variance of. Activity Duration in PERT International Mathematical Forum, 5, 010, no. 18, 861-868 Estimating the Mean and Variance of Activit Duration in PERT N. Ravi Shankar 1, K. Sura Naraana Rao, V. Sireesha 1 1 Department of Applied Mathematics,GIS

More information

Physics 53. Kinematics 2. Our nature consists in movement; absolute rest is death. Pascal

Physics 53. Kinematics 2. Our nature consists in movement; absolute rest is death. Pascal Phsics 53 Kinematics 2 Our nature consists in movement; absolute rest is death. Pascal Velocit and Acceleration in 3-D We have defined the velocit and acceleration of a particle as the first and second

More information

5.2 Inverse Functions

5.2 Inverse Functions 78 Further Topics in Functions. Inverse Functions Thinking of a function as a process like we did in Section., in this section we seek another function which might reverse that process. As in real life,

More information

SAMPLE. Polynomial functions

SAMPLE. Polynomial functions Objectives C H A P T E R 4 Polnomial functions To be able to use the technique of equating coefficients. To introduce the functions of the form f () = a( + h) n + k and to sketch graphs of this form through

More information

15.1. Exact Differential Equations. Exact First-Order Equations. Exact Differential Equations Integrating Factors

15.1. Exact Differential Equations. Exact First-Order Equations. Exact Differential Equations Integrating Factors SECTION 5. Eact First-Order Equations 09 SECTION 5. Eact First-Order Equations Eact Differential Equations Integrating Factors Eact Differential Equations In Section 5.6, ou studied applications of differential

More information

4.1 Ordinal versus cardinal utility

4.1 Ordinal versus cardinal utility Microeconomics I. Antonio Zabalza. Universit of Valencia 1 Micro I. Lesson 4. Utilit In the previous lesson we have developed a method to rank consistentl all bundles in the (,) space and we have introduced

More information

Pearson s Correlation Coefficient

Pearson s Correlation Coefficient Pearson s Correlation Coefficient In this lesson, we will find a quantitative measure to describe the strength of a linear relationship (instead of using the terms strong or weak). A quantitative measure

More information

C3: Functions. Learning objectives

C3: Functions. Learning objectives CHAPTER C3: Functions Learning objectives After studing this chapter ou should: be familiar with the terms one-one and man-one mappings understand the terms domain and range for a mapping understand the

More information

Core Maths C2. Revision Notes

Core Maths C2. Revision Notes Core Maths C Revision Notes November 0 Core Maths C Algebra... Polnomials: +,,,.... Factorising... Long division... Remainder theorem... Factor theorem... 4 Choosing a suitable factor... 5 Cubic equations...

More information

Linear Equations in Two Variables

Linear Equations in Two Variables Section. Sets of Numbers and Interval Notation 0 Linear Equations in Two Variables. The Rectangular Coordinate Sstem and Midpoint Formula. Linear Equations in Two Variables. Slope of a Line. Equations

More information

9.5 CALCULUS AND POLAR COORDINATES

9.5 CALCULUS AND POLAR COORDINATES smi9885_ch09b.qd 5/7/0 :5 PM Page 760 760 Chapter 9 Parametric Equations and Polar Coordinates 9.5 CALCULUS AND POLAR COORDINATES Now that we have introduced ou to polar coordinates and looked at a variet

More information

LINEAR FUNCTIONS OF 2 VARIABLES

LINEAR FUNCTIONS OF 2 VARIABLES CHAPTER 4: LINEAR FUNCTIONS OF 2 VARIABLES 4.1 RATES OF CHANGES IN DIFFERENT DIRECTIONS From Precalculus, we know that is a linear function if the rate of change of the function is constant. I.e., for

More information

Core Maths C3. Revision Notes

Core Maths C3. Revision Notes Core Maths C Revision Notes October 0 Core Maths C Algebraic fractions... Cancelling common factors... Multipling and dividing fractions... Adding and subtracting fractions... Equations... 4 Functions...

More information

THIS CHAPTER INTRODUCES the Cartesian coordinate

THIS CHAPTER INTRODUCES the Cartesian coordinate 87533_01_ch1_p001-066 1/30/08 9:36 AM Page 1 STRAIGHT LINES AND LINEAR FUNCTIONS 1 THIS CHAPTER INTRODUCES the Cartesian coordinate sstem, a sstem that allows us to represent points in the plane in terms

More information

Example: Document Clustering. Clustering: Definition. Notion of a Cluster can be Ambiguous. Types of Clusterings. Hierarchical Clustering

Example: Document Clustering. Clustering: Definition. Notion of a Cluster can be Ambiguous. Types of Clusterings. Hierarchical Clustering Overview Prognostic Models and Data Mining in Medicine, part I Cluster Analsis What is Cluster Analsis? K-Means Clustering Hierarchical Clustering Cluster Validit Eample: Microarra data analsis 6 Summar

More information

X X X a) perfect linear correlation b) no correlation c) positive correlation (r = 1) (r = 0) (0 < r < 1)

X X X a) perfect linear correlation b) no correlation c) positive correlation (r = 1) (r = 0) (0 < r < 1) CORRELATION AND REGRESSION / 47 CHAPTER EIGHT CORRELATION AND REGRESSION Correlation and regression are statistical methods that are commonly used in the medical literature to compare two or more variables.

More information

Partial Fractions. and Logistic Growth. Section 6.2. Partial Fractions

Partial Fractions. and Logistic Growth. Section 6.2. Partial Fractions SECTION 6. Partial Fractions and Logistic Growth 9 Section 6. Partial Fractions and Logistic Growth Use partial fractions to find indefinite integrals. Use logistic growth functions to model real-life

More information

Mathematical goals. Starting points. Materials required. Time needed

Mathematical goals. Starting points. Materials required. Time needed Level A7 of challenge: C A7 Interpreting functions, graphs and tables tables Mathematical goals Starting points Materials required Time needed To enable learners to understand: the relationship between

More information

QUALITATIVE ANALYSIS OF DIFFERENTIAL EQUATIONS. Alexander Panfilov

QUALITATIVE ANALYSIS OF DIFFERENTIAL EQUATIONS. Alexander Panfilov QUALITATIVE ANALYSIS OF DIFFERENTIAL EQUATIONS Aleander Panfilov QUALITATIVE ANALYSIS OF DIFFERENTIAL EQUATIONS Aleander Panfilov Theoretical Biolog, Utrecht Universit, Utrecht c 010 Februar 3, 010 Tentative

More information

D.2. The Cartesian Plane. The Cartesian Plane The Distance and Midpoint Formulas Equations of Circles. D10 APPENDIX D Precalculus Review

D.2. The Cartesian Plane. The Cartesian Plane The Distance and Midpoint Formulas Equations of Circles. D10 APPENDIX D Precalculus Review D0 APPENDIX D Precalculus Review SECTION D. The Cartesian Plane The Cartesian Plane The Distance and Midpoint Formulas Equations of Circles The Cartesian Plane An ordered pair, of real numbers has as its

More information

Exponential Functions: Differentiation and Integration. The Natural Exponential Function

Exponential Functions: Differentiation and Integration. The Natural Exponential Function 46_54.q //4 :59 PM Page 5 5 CHAPTER 5 Logarithmic, Eponential, an Other Transcenental Functions Section 5.4 f () = e f() = ln The inverse function of the natural logarithmic function is the natural eponential

More information

THE POWER RULES. Raising an Exponential Expression to a Power

THE POWER RULES. Raising an Exponential Expression to a Power 8 (5-) Chapter 5 Eponents and Polnomials 5. THE POWER RULES In this section Raising an Eponential Epression to a Power Raising a Product to a Power Raising a Quotient to a Power Variable Eponents Summar

More information

Polynomials. Jackie Nicholas Jacquie Hargreaves Janet Hunter

Polynomials. Jackie Nicholas Jacquie Hargreaves Janet Hunter Mathematics Learning Centre Polnomials Jackie Nicholas Jacquie Hargreaves Janet Hunter c 26 Universit of Sdne Mathematics Learning Centre, Universit of Sdne 1 1 Polnomials Man of the functions we will

More information

D.3. Angles and Degree Measure. Review of Trigonometric Functions

D.3. Angles and Degree Measure. Review of Trigonometric Functions APPENDIX D Precalculus Review D7 SECTION D. Review of Trigonometric Functions Angles and Degree Measure Radian Measure The Trigonometric Functions Evaluating Trigonometric Functions Solving Trigonometric

More information

Zeros of Polynomial Functions. The Fundamental Theorem of Algebra. The Fundamental Theorem of Algebra. zero in the complex number system.

Zeros of Polynomial Functions. The Fundamental Theorem of Algebra. The Fundamental Theorem of Algebra. zero in the complex number system. _.qd /7/ 9:6 AM Page 69 Section. Zeros of Polnomial Functions 69. Zeros of Polnomial Functions What ou should learn Use the Fundamental Theorem of Algebra to determine the number of zeros of polnomial

More information

2.1 Three Dimensional Curves and Surfaces

2.1 Three Dimensional Curves and Surfaces . Three Dimensional Curves and Surfaces.. Parametric Equation of a Line An line in two- or three-dimensional space can be uniquel specified b a point on the line and a vector parallel to the line. The

More information

SECTION 2.2. Distance and Midpoint Formulas; Circles

SECTION 2.2. Distance and Midpoint Formulas; Circles SECTION. Objectives. Find the distance between two points.. Find the midpoint of a line segment.. Write the standard form of a circle s equation.. Give the center and radius of a circle whose equation

More information

Solving Absolute Value Equations and Inequalities Graphically

Solving Absolute Value Equations and Inequalities Graphically 4.5 Solving Absolute Value Equations and Inequalities Graphicall 4.5 OBJECTIVES 1. Draw the graph of an absolute value function 2. Solve an absolute value equation graphicall 3. Solve an absolute value

More information

SENSITIVITY ANALYSIS AND INFERENCE. Lecture 12

SENSITIVITY ANALYSIS AND INFERENCE. Lecture 12 This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this

More information

Mathematics 31 Pre-calculus and Limits

Mathematics 31 Pre-calculus and Limits Mathematics 31 Pre-calculus and Limits Overview After completing this section, students will be epected to have acquired reliability and fluency in the algebraic skills of factoring, operations with radicals

More information

M1 in Economics and Economics and Statistics Applied multivariate Analysis - Big data analytics Worksheet 1 - Bootstrap

M1 in Economics and Economics and Statistics Applied multivariate Analysis - Big data analytics Worksheet 1 - Bootstrap Nathalie Villa-Vialanei Année 2015/2016 M1 in Economics and Economics and Statistics Applied multivariate Analsis - Big data analtics Worksheet 1 - Bootstrap This worksheet illustrates the use of nonparametric

More information

Florida Algebra I EOC Online Practice Test

Florida Algebra I EOC Online Practice Test Florida Algebra I EOC Online Practice Test Directions: This practice test contains 65 multiple-choice questions. Choose the best answer for each question. Detailed answer eplanations appear at the end

More information

7.3 Solving Systems by Elimination

7.3 Solving Systems by Elimination 7. Solving Sstems b Elimination In the last section we saw the Substitution Method. It turns out there is another method for solving a sstem of linear equations that is also ver good. First, we will need

More information

The Mathematics of Engineering Surveying (3)

The Mathematics of Engineering Surveying (3) The Mathematics of Engineering Surveing (3) Scenario s a new graduate ou have gained emploment as a graduate engineer working for a major contractor that emplos 2000 staff and has an annual turnover of

More information

2 Binomial, Poisson, Normal Distribution

2 Binomial, Poisson, Normal Distribution 2 Binomial, Poisson, Normal Distribution Binomial Distribution ): We are interested in the number of times an event A occurs in n independent trials. In each trial the event A has the same probability

More information

6. The given function is only drawn for x > 0. Complete the function for x < 0 with the following conditions:

6. The given function is only drawn for x > 0. Complete the function for x < 0 with the following conditions: Precalculus Worksheet 1. Da 1 1. The relation described b the set of points {(-, 5 ),( 0, 5 ),(,8 ),(, 9) } is NOT a function. Eplain wh. For questions - 4, use the graph at the right.. Eplain wh the graph

More information

MEMORANDUM. All students taking the CLC Math Placement Exam PLACEMENT INTO CALCULUS AND ANALYTIC GEOMETRY I, MTH 145:

MEMORANDUM. All students taking the CLC Math Placement Exam PLACEMENT INTO CALCULUS AND ANALYTIC GEOMETRY I, MTH 145: MEMORANDUM To: All students taking the CLC Math Placement Eam From: CLC Mathematics Department Subject: What to epect on the Placement Eam Date: April 0 Placement into MTH 45 Solutions This memo is an

More information

2.3 Quadratic Functions

2.3 Quadratic Functions 88 Linear and Quadratic Functions. Quadratic Functions You ma recall studing quadratic equations in Intermediate Algebra. In this section, we review those equations in the contet of our net famil of functions:

More information

COMPLEX STRESS TUTORIAL 3 COMPLEX STRESS AND STRAIN

COMPLEX STRESS TUTORIAL 3 COMPLEX STRESS AND STRAIN COMPLX STRSS TUTORIAL COMPLX STRSS AND STRAIN This tutorial is not part of the decel unit mechanical Principles but covers elements of the following sllabi. o Parts of the ngineering Council eam subject

More information

3 Optimizing Functions of Two Variables. Chapter 7 Section 3 Optimizing Functions of Two Variables 533

3 Optimizing Functions of Two Variables. Chapter 7 Section 3 Optimizing Functions of Two Variables 533 Chapter 7 Section 3 Optimizing Functions of Two Variables 533 (b) Read about the principle of diminishing returns in an economics tet. Then write a paragraph discussing the economic factors that might

More information

Lecture 8: More Continuous Random Variables

Lecture 8: More Continuous Random Variables Lecture 8: More Continuous Random Variables 26 September 2005 Last time: the eponential. Going from saying the density e λ, to f() λe λ, to the CDF F () e λ. Pictures of the pdf and CDF. Today: the Gaussian

More information

Interpretation of Somers D under four simple models

Interpretation of Somers D under four simple models Interpretation of Somers D under four simple models Roger B. Newson 03 September, 04 Introduction Somers D is an ordinal measure of association introduced by Somers (96)[9]. It can be defined in terms

More information

Nonparametric adaptive age replacement with a one-cycle criterion

Nonparametric adaptive age replacement with a one-cycle criterion Nonparametric adaptive age replacement with a one-cycle criterion P. Coolen-Schrijner, F.P.A. Coolen Department of Mathematical Sciences University of Durham, Durham, DH1 3LE, UK e-mail: [email protected]

More information

Finance 400 A. Penati - G. Pennacchi Market Micro-Structure: Notes on the Kyle Model

Finance 400 A. Penati - G. Pennacchi Market Micro-Structure: Notes on the Kyle Model Finance 400 A. Penati - G. Pennacchi Market Micro-Structure: Notes on the Kyle Model These notes consider the single-period model in Kyle (1985) Continuous Auctions and Insider Trading, Econometrica 15,

More information

MORTGAGE LENDER PROTECTION UNDER INSURANCE ARRANGEMENTS Irina Genriha Latvian University, Tel. +371 26387099, e-mail: irina.genriha@inbox.

MORTGAGE LENDER PROTECTION UNDER INSURANCE ARRANGEMENTS Irina Genriha Latvian University, Tel. +371 26387099, e-mail: irina.genriha@inbox. MORTGAGE LENDER PROTECTION UNDER INSURANCE ARRANGEMENTS Irina Genriha Latvian University, Tel. +371 2638799, e-mail: [email protected] Since September 28, when the crisis deepened in the first place

More information

Some Tools for Teaching Mathematical Literacy

Some Tools for Teaching Mathematical Literacy Some Tools for Teaching Mathematical Literac Julie Learned, Universit of Michigan Januar 200. Reading Mathematical Word Problems 2. Fraer Model of Concept Development 3. Building Mathematical Vocabular

More information

5.1. A Formula for Slope. Investigation: Points and Slope CONDENSED

5.1. A Formula for Slope. Investigation: Points and Slope CONDENSED CONDENSED L E S S O N 5.1 A Formula for Slope In this lesson ou will learn how to calculate the slope of a line given two points on the line determine whether a point lies on the same line as two given

More information

Slope-Intercept Form and Point-Slope Form

Slope-Intercept Form and Point-Slope Form Slope-Intercept Form and Point-Slope Form In this section we will be discussing Slope-Intercept Form and the Point-Slope Form of a line. We will also discuss how to graph using the Slope-Intercept Form.

More information

FINAL EXAM REVIEW MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

FINAL EXAM REVIEW MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. FINAL EXAM REVIEW MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Determine whether or not the relationship shown in the table is a function. 1) -

More information

Higher. Polynomials and Quadratics 64

Higher. Polynomials and Quadratics 64 hsn.uk.net Higher Mathematics UNIT OUTCOME 1 Polnomials and Quadratics Contents Polnomials and Quadratics 64 1 Quadratics 64 The Discriminant 66 3 Completing the Square 67 4 Sketching Parabolas 70 5 Determining

More information

Introduction to Plates

Introduction to Plates Chapter Introduction to Plates Plate is a flat surface having considerabl large dimensions as compared to its thickness. Common eamples of plates in civil engineering are. Slab in a building.. Base slab

More information

( ) is proportional to ( 10 + x)!2. Calculate the

( ) is proportional to ( 10 + x)!2. Calculate the PRACTICE EXAMINATION NUMBER 6. An insurance company eamines its pool of auto insurance customers and gathers the following information: i) All customers insure at least one car. ii) 64 of the customers

More information

Addition and Subtraction of Vectors

Addition and Subtraction of Vectors ddition and Subtraction of Vectors 1 ppendi ddition and Subtraction of Vectors In this appendi the basic elements of vector algebra are eplored. Vectors are treated as geometric entities represented b

More information

Polynomial Degree and Finite Differences

Polynomial Degree and Finite Differences CONDENSED LESSON 7.1 Polynomial Degree and Finite Differences In this lesson you will learn the terminology associated with polynomials use the finite differences method to determine the degree of a polynomial

More information

Introduction to polarization of light

Introduction to polarization of light Chapter 2 Introduction to polarization of light This Chapter treats the polarization of electromagnetic waves. In Section 2.1 the concept of light polarization is discussed and its Jones formalism is presented.

More information

Graphing Linear Equations

Graphing Linear Equations 6.3 Graphing Linear Equations 6.3 OBJECTIVES 1. Graph a linear equation b plotting points 2. Graph a linear equation b the intercept method 3. Graph a linear equation b solving the equation for We are

More information

6.3 PARTIAL FRACTIONS AND LOGISTIC GROWTH

6.3 PARTIAL FRACTIONS AND LOGISTIC GROWTH 6 CHAPTER 6 Techniques of Integration 6. PARTIAL FRACTIONS AND LOGISTIC GROWTH Use partial fractions to find indefinite integrals. Use logistic growth functions to model real-life situations. Partial Fractions

More information

Find the Relationship: An Exercise in Graphing Analysis

Find the Relationship: An Exercise in Graphing Analysis Find the Relationship: An Eercise in Graphing Analsis Computer 5 In several laborator investigations ou do this ear, a primar purpose will be to find the mathematical relationship between two variables.

More information

I think that starting

I think that starting . Graphs of Functions 69. GRAPHS OF FUNCTIONS One can envisage that mathematical theor will go on being elaborated and etended indefinitel. How strange that the results of just the first few centuries

More information

Quadratic Equations and Functions

Quadratic Equations and Functions Quadratic Equations and Functions. Square Root Propert and Completing the Square. Quadratic Formula. Equations in Quadratic Form. Graphs of Quadratic Functions. Verte of a Parabola and Applications In

More information

CHAPTER TEN. Key Concepts

CHAPTER TEN. Key Concepts CHAPTER TEN Ke Concepts linear regression: slope intercept residual error sum of squares or residual sum of squares sum of squares due to regression mean squares error mean squares regression (coefficient)

More information

Probability, Mean and Median

Probability, Mean and Median Proaility, Mean and Median In the last section, we considered (proaility) density functions. We went on to discuss their relationship with cumulative distriution functions. The goal of this section is

More information

Vasicek Single Factor Model

Vasicek Single Factor Model Alexandra Kochendörfer 7. Februar 2011 1 / 33 Problem Setting Consider portfolio with N different credits of equal size 1. Each obligor has an individual default probability. In case of default of the

More information

MBA 611 STATISTICS AND QUANTITATIVE METHODS

MBA 611 STATISTICS AND QUANTITATIVE METHODS MBA 611 STATISTICS AND QUANTITATIVE METHODS Part I. Review of Basic Statistics (Chapters 1-11) A. Introduction (Chapter 1) Uncertainty: Decisions are often based on incomplete information from uncertain

More information

Discussion Paper On the validation and review of Credit Rating Agencies methodologies

Discussion Paper On the validation and review of Credit Rating Agencies methodologies Discussion Paper On the validation and review of Credit Rating Agencies methodologies 17 November 2015 ESMA/2015/1735 Responding to this paper The European Securities and Markets Authority (ESMA) invites

More information

Linear Inequality in Two Variables

Linear Inequality in Two Variables 90 (7-) Chapter 7 Sstems of Linear Equations and Inequalities In this section 7.4 GRAPHING LINEAR INEQUALITIES IN TWO VARIABLES You studied linear equations and inequalities in one variable in Chapter.

More information

Math Review. The second part is a refresher of some basic topics for those who know how but lost their fluency over the years.

Math Review. The second part is a refresher of some basic topics for those who know how but lost their fluency over the years. Math Review The Math Review is divided into two parts: I. The first part is a general overview of the math classes, their sequence, basic content, and short quizzes to see if ou are prepared to take a

More information

About the Gamma Function

About the Gamma Function About the Gamma Function Notes for Honors Calculus II, Originally Prepared in Spring 995 Basic Facts about the Gamma Function The Gamma function is defined by the improper integral Γ) = The integral is

More information

Algebra 2 Unit 8 (Chapter 7) CALCULATORS ARE NOT ALLOWED

Algebra 2 Unit 8 (Chapter 7) CALCULATORS ARE NOT ALLOWED Algebra Unit 8 (Chapter 7) CALCULATORS ARE NOT ALLOWED. Graph eponential functions. (Sections 7., 7.) Worksheet 6. Solve eponential growth and eponential decay problems. (Sections 7., 7.) Worksheet 8.

More information

MATH REVIEW SHEETS BEGINNING ALGEBRA MATH 60

MATH REVIEW SHEETS BEGINNING ALGEBRA MATH 60 MATH REVIEW SHEETS BEGINNING ALGEBRA MATH 60 A Summar of Concepts Needed to be Successful in Mathematics The following sheets list the ke concepts which are taught in the specified math course. The sheets

More information

Autonomous Equations / Stability of Equilibrium Solutions. y = f (y).

Autonomous Equations / Stability of Equilibrium Solutions. y = f (y). Autonomous Equations / Stabilit of Equilibrium Solutions First order autonomous equations, Equilibrium solutions, Stabilit, Longterm behavior of solutions, direction fields, Population dnamics and logistic

More information

In this this review we turn our attention to the square root function, the function defined by the equation. f(x) = x. (5.1)

In this this review we turn our attention to the square root function, the function defined by the equation. f(x) = x. (5.1) Section 5.2 The Square Root 1 5.2 The Square Root In this this review we turn our attention to the square root function, the function defined b the equation f() =. (5.1) We can determine the domain and

More information

Example: Credit card default, we may be more interested in predicting the probabilty of a default than classifying individuals as default or not.

Example: Credit card default, we may be more interested in predicting the probabilty of a default than classifying individuals as default or not. Statistical Learning: Chapter 4 Classification 4.1 Introduction Supervised learning with a categorical (Qualitative) response Notation: - Feature vector X, - qualitative response Y, taking values in C

More information

Trading Networks with Price-Setting Agents

Trading Networks with Price-Setting Agents Trading Networks with Price-Setting Agents Larr Blume Dept. of Economics Cornell Universit, Ithaca NY [email protected] David Easle Dept. of Economics Cornell Universit, Ithaca NY [email protected]

More information

Online Appendix to Stochastic Imitative Game Dynamics with Committed Agents

Online Appendix to Stochastic Imitative Game Dynamics with Committed Agents Online Appendix to Stochastic Imitative Game Dynamics with Committed Agents William H. Sandholm January 6, 22 O.. Imitative protocols, mean dynamics, and equilibrium selection In this section, we consider

More information

SECTION 7-4 Algebraic Vectors

SECTION 7-4 Algebraic Vectors 7-4 lgebraic Vectors 531 SECTIN 7-4 lgebraic Vectors From Geometric Vectors to lgebraic Vectors Vector ddition and Scalar Multiplication Unit Vectors lgebraic Properties Static Equilibrium Geometric vectors

More information

TOPIC 4: DERIVATIVES

TOPIC 4: DERIVATIVES TOPIC 4: DERIVATIVES 1. The derivative of a function. Differentiation rules 1.1. The slope of a curve. The slope of a curve at a point P is a measure of the steepness of the curve. If Q is a point on the

More information

Price Theory Lecture 3: Theory of the Consumer

Price Theory Lecture 3: Theory of the Consumer Price Theor Lecture 3: Theor of the Consumer I. Introduction The purpose of this section is to delve deeper into the roots of the demand curve, to see eactl how it results from people s tastes, income,

More information