ESTIMATION OF STOP LOSS PREMIUM IN FIRE INSURANCE. C. P. WELTEN Amsterdam, Holland



Similar documents
CHAPTER 13 SIMPLE LINEAR REGRESSION. Opening Example. Simple Regression. Linear Regression

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression

BINOMIAL OPTIONS PRICING MODEL. Mark Ioffe. Abstract

Hypothesis testing - Steps

" Y. Notation and Equations for Regression Lecture 11/4. Notation:

Adverse Impact Ratio for Females (0/ 1) = 0 (5/ 17) = Adverse impact as defined by the 4/5ths rule was not found in the above data.

Study Guide for the Final Exam

Regression Analysis: A Complete Example

MATHEMATICAL METHODS OF STATISTICS

Chapter 3 RANDOM VARIATE GENERATION

Simple Linear Regression Inference

INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA)

Mixing internal and external data for managing operational risk

APS REINSURANCE. By BRUNO KOLLER, NICOLE DETTWYLER

SIMPLE LINEAR CORRELATION. r can range from -1 to 1, and is independent of units of measurement. Correlation can be done on two dependent variables.

Simple Regression Theory II 2010 Samuel L. Baker

NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( )

Two-Sample T-Tests Assuming Equal Variance (Enter Means)

REINSURANCE PROFIT SHARE

A Log-Robust Optimization Approach to Portfolio Management

MATHEMATICAL INDUCTION. Mathematical Induction. This is a powerful method to prove properties of positive integers.

Confidence Intervals for the Difference Between Two Means

The Study of Chinese P&C Insurance Risk for the Purpose of. Solvency Capital Requirement

11. Analysis of Case-control Studies Logistic Regression

Final Exam Practice Problem Answers

Part 2: Analysis of Relationship Between Two Variables

Least Squares Estimation

Online Appendices to the Corporate Propensity to Save

METHODS OF STUDYING THE RISK PROCESS IN DISABILITY INSURANCE. Stockhohn

Compensation Analysis

Please follow the directions once you locate the Stata software in your computer. Room 114 (Business Lab) has computers with Stata software

An Introduction to Statistics Course (ECOE 1302) Spring Semester 2011 Chapter 10- TWO-SAMPLE TESTS

A Comparison of Strategic Reinsurance Programs (SRP) with Banking Activities and other Insurance and Reinsurance Activities

MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS

The correlation coefficient

STAT 360 Probability and Statistics. Fall 2012

Quantitative Impact Study 1 (QIS1) Summary Report for Belgium. 21 March 2006

Testing Research and Statistical Hypotheses

Introduction. Appendix D Mathematical Induction D1

2. Linear regression with multiple regressors

POLYNOMIAL AND MULTIPLE REGRESSION. Polynomial regression used to fit nonlinear (e.g. curvilinear) data into a least squares linear regression model.

Introduction to. Hypothesis Testing CHAPTER LEARNING OBJECTIVES. 1 Identify the four steps of hypothesis testing.

MULTIPLE LINEAR REGRESSION ANALYSIS USING MICROSOFT EXCEL. by Michael L. Orlov Chemistry Department, Oregon State University (1996)

Marginal Person. Average Person. (Average Return of College Goers) Return, Cost. (Average Return in the Population) (Marginal Return)

International Statistical Institute, 56th Session, 2007: Phil Everson

12: Analysis of Variance. Introduction

From the help desk: Swamy s random-coefficients model

A POPULATION MEAN, CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

E3: PROBABILITY AND STATISTICS lecture notes

Simulation Exercises to Reinforce the Foundations of Statistical Thinking in Online Classes

Simple linear regression

Quantitative Inventory Uncertainty

Calculating the Probability of Returning a Loan with Binary Probability Models

ANOVA ANOVA. Two-Way ANOVA. One-Way ANOVA. When to use ANOVA ANOVA. Analysis of Variance. Chapter 16. A procedure for comparing more than two groups

Section 13, Part 1 ANOVA. Analysis Of Variance

X X X a) perfect linear correlation b) no correlation c) positive correlation (r = 1) (r = 0) (0 < r < 1)

Using Excel for inferential statistics

Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics

Teaching Multivariate Analysis to Business-Major Students

MULTIPLE REGRESSION WITH CATEGORICAL DATA

Recall this chart that showed how most of our course would be organized:

WORKERS COMPENSATION CLASSIFICATION CREDIBILITIES. By Howard C. Mahler, FCAS

Calculating, Interpreting, and Reporting Estimates of Effect Size (Magnitude of an Effect or the Strength of a Relationship)

Testing for Lack of Fit

1 Overview and background

A Model of Optimum Tariff in Vehicle Fleet Insurance

Chapter 7: Simple linear regression Learning Objectives

Section Format Day Begin End Building Rm# Instructor. 001 Lecture Tue 6:45 PM 8:40 PM Silver 401 Ballerini

General and statistical principles for certification of RM ISO Guide 35 and Guide 34

Module 2 Probability and Statistics

LOGNORMAL MODEL FOR STOCK PRICES

Tail-Dependence an Essential Factor for Correctly Measuring the Benefits of Diversification

Applications to Data Smoothing and Image Processing I

Introduction to Regression and Data Analysis

DEDUCTIBLES IN INDUSTRIAL FIRE INSURANCE

Chapter 9. Two-Sample Tests. Effect Sizes and Power Paired t Test Calculation

Understanding the Impact of Weights Constraints in Portfolio Theory

Two-Sample T-Tests Allowing Unequal Variance (Enter Difference)

Descriptive Statistics

Multiple Linear Regression

Descriptive statistics parameters: Measures of centrality

Outline. Definitions Descriptive vs. Inferential Statistics The t-test - One-sample t-test

RESERVES OF MOTOR-VEHICLE INSURANCE IN FINLAND

Elements of statistics (MATH0487-1)

The indoor Rn-concentration: schools, kindergartens and official buildings versus homes.

Two Correlated Proportions (McNemar Test)

August 2012 EXAMINATIONS Solution Part I

Example: Boats and Manatees

Quantitative Methods for Finance

HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION

Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means

One-year reserve risk including a tail factor : closed formula and bootstrap approaches

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

Introduction to Analysis of Variance (ANOVA) Limitations of the t-test

3.4 Statistical inference for 2 populations based on two samples

Curriculum Map Statistics and Probability Honors (348) Saugus High School Saugus Public Schools

Buy a number of shares,, and invest B in bonds. Outlay for portfolio today is S + B. Tree shows possible values one period later.

Article from: ARCH Proceedings

Transcription:

ESTIMATION OF STOP LOSS PREMIUM IN FIRE INSURANCE C. P. WELTEN Amsterdam, Holland The estimation of stop loss premiums can be based on some knowledge about the distribution function of the sum of all claims in a year (assuming that the stop loss insurance relates to a period of one calender year). Generally speaking there are two methods to obtain this knowledge about the distribution function. I. The first method is to construct a distribution function from data concerning: a. the distribution function of the number of claims per year, taking into account the variability of the parameter(s) of this distribution function. b. the distribution function of the insured sums. c. the distribution function of partial claims. d. the correlation between the insured sum and the probability of occurring of a claim. e. the probability of contagion. 2. The second method is to derive a distribution function from the year's totals of claims over a long series of years, expressed in e.g. units of the totals of insured sums in that years. In practice it is often difficult to find a useful basis to apply one of these methods. Data concerning the distribution function of the number of claims per year, of the insured sums, and of partial claims are mostly available, but often nothing is known about the correlation between the insured sum and the probability of occurring of a claim. The second method is mostly not applicable because, if the year's totals of claims over a long series of, by preference recent, years are available, these data often turn out to be heterogeneous

ESTIMATION OF STOP LOSS PREMIUM IN FIRE INSURANCE 357 or to be correlated with time. If, in that case, only the data of the most recent years are used, the number of these data is often a too small basis for the construction of a distribution function. A special difficulty concerning the calculation of stop loss premiums is that we have to know as much as possible about events that happen only seldom. Often it is not easy to obtain that knowledge. In the following we will describe a case of a stop loss cover in fire insurance. In this case, concerning a fire insurance company with a rather small number of insurances with relative high insured sums, the values rj = total of claims, occurring in the year j X lo 4 total of insured sums in the year j were available over a period of years (1916-196o). As well in the numerator as in the denominator the reinsured parts of claims resp. reinsured parts of insured sums are not included. The data, necessary for application of the first method, were only partly available, and it was not possible to gather them as yet. The coefficient of correlation between r i and j proved to be -- o,o98. According to Fisher's z'-method the deviation of this value from zero is not significant, so that the hypothesis, that the values r i have not a linear trend, can not be rejected. Further investigations have not given rise to the assumption that there is a nonlinear trend of some importance, and indications for lack of homogenity are not found. For this reasons we assumed that the observed values r i can be considered as a random sample from a homogeneous population. An obvious method to find a useful distribution function from the values r i is based on the assumption that this distribution function belongs to the wellknown class of Pearson-curves. To apply this method the central moments of the sample (the values r x... r ) are calculated first. These moments turn out to be : rn 1 = 2,256 m3 = 9,176 rn~ = 4,2o5 m 4 = 49,131

- 358 ESTIMATION OF STOP LOSS PREMIUM IN FIRE INSURANCE From these sample-moments unbiased estimates of the central moments of the population can be derived (see e.g. Cram6r, Mathematical methods of Statistics, p. 35I). These estimates are in the present case: S (Wl) = 2,256 S (~,) = 4,3Ol S (W3) = 9, 821 S (~4) = 51,554 The Pearson-curve that has these central moments, proves to be a B-function: (r - f(r) = (s-- p)ra+n-1. B(m.n) where p = 0,758 s = 6,638 m = o,i34 n = 0,392 ( '758<r<6'638) A first consideration showed already that IO out of values of the sample are situated below, and one above the interval of the B-function. Classification of the values in the five intervals o - I ; I - 2 ; 2-3 ; 3-5 and > 5, and testing with the "expected" numbers in these intervals on the basis of the above mentioned B-function gives a value X 2 = 8,I6I, with four degrees of freedom; this does not lead to rejection of the considered frequency-function. Nevertheless it is remarkable that according to the B-function II out of values have to fall in the interval > 5 ; in fact 7 values fall in this interval. It is therefore very doubtful, whether the above mentioned B-function gives a usefull representation of the "tail" of the actual frequency-function. Finally we mention about the B-function, that this function leads to a risk-premium for the stop-loss-cover (without safety-loading) ad I,I6 at a stop-loss-limit of 2,5, or a premium ad 0,79 at a limit of 3,5 (per IO.OOO units of the total of insured sums). The inacceptable results of application of the B-distribution have given rise to attempt another frequency-function namely the lognormal frequency-function.

ESTIMATION OF STOP LOSS PREMIUM IN FIRE INSURANCE 359 where f(r) dr, -IL)_~ I I -- (?. -- e ~ot r,/_---- Vz~r dts Z ln r i ~t- -- 0,394 #6 Z (ln ri -- V.)2 62 ~_f-t = 0,9 Classification of the observed values r i in the same intervals as mentioned in the description of the B-distribution leads to 7. * = 4,I48 with four degrees of freedom; this figure does not lead to rejection of the lognormal frequency-function. The "expected" number of values in the interval > 5 is 4,4; the observed number is 7. The deviation is indeed not significant, but nevertheless there is some suspicion against the lognormal frequency-function that it underestimates the "tail" of the actual frequency-function of r. Finally we mention about the lognormal frequency-function, that on the basis of this function the stop-loss-premium amounts to 0,72 at a stop-loss-limit of 2,5 and to 0,48 at a limit of 3,5 (per IO.OOO units of the total of insured sums). It must doubtlessly be possible to obtain a frequency-function, which fits the observed data of the sample reasonably well, also in the interval > 5. But then too it is not at all certain that such a frequency-function gives a good representation of the "tail" of the actual frequency-function; only if the sample should contain some hundreds of observed values it could be possible to make sure that the "tail" is well represented. For that reason it is perhaps useful to look for an estimate of the stop-loss-premium, which is not based on any assumption about the analytical shape of the underlying distribution function. This can be done as follows. If the random variable r (being lo 4 the ratio between the total of claims in a year and the total of insured sums in that year) 24

360 ESTIMATION OF STOP LOSS PREMIUM IN FIRE INSURANCE has the frequency-function f(r), a random variable h can be defined with the frequency-function F(h): F(h) = o if h < o 6 F(o) = f(r)dr/ o F(h) = f(h + G) ifh >o Then the mean P of h is the risk premium for the stop loss cover at the limit G ;,P, beeing the standard deviation of h, is the standarddeviation of the yearly stop-loss-risk. The observed values h i can be derived from the values rj according to: h i = o h i-~ ri--g if r i ~ G ifri>g The values h can be considered as observed values of the random variable h.=. F(h). From this follows" i~ hj I. that P -- ~-1 is an unbiased estimate of the stop loss premium P. (hi- p)~ 2 --1 2. that S~ = ~- is an unbiased estimate of 44 ap' 3. that ~', beeing the expected value of (P--P)*, is Therefore the estimate of the standard deviation of the estimate can be calculated as ~ --. S~ 44 In the present case the estimates P and ~ 2 turn out to be: at a limit G = 2,5; P = o,77; % = o,22 at a limit G = 3,5; P = o,51; ~ = o,15 In practice the stop loss premium can be calculated as i 6 + ~ + ~*~

ESTIMATION OF STOP LOSS PREMIUM IN FIRE INSURANCE 361 In this formula the value of P + ~ can be considered as the upper limit of a confidence interval, which contains the stop loss premium P with a probability, for which Gauss'inequality gives a low estimate. The value ~r~ is a security-loading. CONCLUSION The abovementioned case demonstrates that in calculating stop-loss-premiums a cautious choise of the underlying distribution function is necessary, and that it is perhaps better to make no assumption concerning the shape of that function. It proves--at least in some cases--to be possible to obtain a "quantified uncertainty" about the stop loss premium without such assumption. APPENDIX j,~ j r, I 0,66 24 0,58 2 6,47 25 0,26 3 0, 26 1,36 4 4,51 27 1,98 5 6,48 28 4,90 6 2,03 29 3,36 7 4,69 30 1,90 8 O, 26 31 0,97 9 0,67 3 z 1,40 10 3, 88 33 6,O9 II 2,50 34 o,81 12 0,30 35 o,77 13 0,95 36 o,49 14 I,I9 37 2,07 15 2,78 38 o,76 16 o,5i 39 5,37 17 1,28 4 0,79 18 7,26 41 1,7o 19 0,44 42 o,93 2o I,OI 43 5, II 21 1,59 44 I,II 22 o,88 1,61 23 6,42