Generating Random Samples from the Generalized Pareto Mixture Model

Size: px
Start display at page:

Download "Generating Random Samples from the Generalized Pareto Mixture Model"

Transcription

1 Generating Random Samples from the Generalized Pareto Mixture Model MUSTAFA ÇAVUŞ AHMET SEZER BERNA YAZICI Department of Statistics Anadolu University Eskişehir TURKEY Abstract: The Generalized Pareto Distribution is a very useful tool for modeling in many areas of economics, finance and insurance. The Generalized Pareto Distribution is commonly used for extreme value problems. Especially, the values which exceed the finite threshold, is the focus in extreme value problems like in insurance sector. The Generalized Pareto Distribution is well approach for modeling the samples which include these extreme values. Sometimes, intended samples might have a heterogeneous distribution. In such cases, the mixture models are better way for modeling the data.in this study, we generate random samples from the Generalized Pareto Mixture Distribution for modeling of heterogeneous data. For this purpose, we use two three-parameters Generalized Pareto Distribution as components of the Generalized Pareto Mixture Distribution. For generating random samples, The Inverse Transformation Method is used in simulation study. The parameters of the mixture models are shape, scale and location are fixed. After generating random samples, Chi-Square Goodness-of-Fit Test is used for checking whether the generated samples are distributed based on The Generalized Pareto Distribution. Then, appropriate samples are combined for the determined mixture model with mixture weights. In simulation study, R-Statistical Programming Language is used. Key words: The Generalized Pareto Mixture Distribution, Mixture Models, The Inverse Transformation Method, Chi-Square Goodness-of-Fit Test, Generating Random Samples, Pareto Distribution ISBN:

2 1.Introduction The Generalized Pareto Distribution (GPD) was introduced by Pikands (1975).[1]Then further studied was worked by Davison, Smith (1984), Castillo (1997, 2008).[2][3]Mierlus-Mazilu (2010) studied on generalized pareto distribution, especially on generating random samples from it by the inverse transformation method[4]. He did not use any goodness-of-fit test for the generated random samples. The generalized pareto distribution is very important tool for modeling of economical, financial and insurance data. The companies are especially work on these areas want to plan their financial parameters. For instance, the financial crisis or same cases which are unexpected can be damage to the company. Thus, to isolate the damaged case companies should model the extreme events with the statistical distributions. By this way, companies can predict the extreme and damaged events for their financial balance. In this study, we focused on how to generate random samples from the generalized pareto mixture model and especially we test whether they fit well the generalized pareto mixture model by chi-squared goodness-of-fit test. The generalized pareto mixture model is important tool when the data has heterogeneous distribution. In real life, the risks in the financial sector can be heterogeneous dispersion so the generalized pareto mixture model is used on these areas. By now, Chi-Square goodness-of-fit test was not used in studies which are related generating random samples. For example, Beirlant(2006) used the Jackson statistics as a goodness-of-fit test for the generated random samples from Pareto-type behavior.[5] In the study, it is claimed that the log-transformed Pareto random variables are exponentially distributed and Jackson statistics, originally proposed as a goodness-of-fit statistics for testing exponentially. 2.Generalized Pareto Mixture Model 2.1.Pareto Distribution Pareto Distribution is generally used for modeling of the data which is consist of income. It was proposed by an Italian economist and sociologist Vilfredo Frederico Damaso Pareto (1897). This distribution is constructed on Pareto Principle. Based on Pareto Principle, a large portion of wealth of many societies is owned by a smaller percentage of the people in that society. This principle is explained more simply as rule which says that 20% of the population owns 80% of the wealth.[7] Pareto Distribution is also useful for modeling of finance, actuary and economics. It can be used many situations in which an equilibrium is found in the distribution of the small to the large. The probability density function of Pareto Distribution; The Pareto Distribution is characterized by is shape parameter which is positive and is scale parameter which measures the heaviness in the upper tail.mostly used notation of Pareto Distribution is with shape parameter and scale parameter. In Table 1, you can see the cumulative probability function, moment generation function, expected value and variance of Pareto Distribution. Cumulative Distribution Moment Generation Expected Value Variance Table 1: Characteristics of Pareto Distribution 2.2.Generalized Pareto Distribution Generalized Pareto Distribution is one of the most preferred method for modeling of Extreme Value problems. Extreme Value Theory interests in the values of the function which exceed the finite threshold. Especially in actuarial sector, the ISBN:

3 payments of companies consist regular policies which are in expected limits. If the unexpected events are occurred, the payments of companies can be rise up extremely. Thus the insurance companies need to model of rare events which are very important on payments. Let X is a random variable of F distribution. The interested values are exceed the finite threshold of this distribution. Generalized Pareto Distribution applies in modeling of this values which are named rare events. These values constructs of the right tail of the distribution line. The Generalized Pareto Distribution was introduced by Pikands(1975) and has been further studied by Davison, Smith(1984), Castillo(1997,2008). It has a wide application area in economics, insurance and finance.[1][2][3] The probability density function of Generalized Pareto Distribution; The Generalized Pareto Distribution is characterized by is shape parameter which domain is negative infinity to positive infinity. is scale parameter which measures the heaviness in the upper tail. is location parameter can be explains as the thresold. In Table 1, you can see the cumulative probability function, moment generation function, expected value and variance of The Generalized Pareto Distribution. Cumulative Distribution Variance Table 2: Characteristics of The Generalized Pareto Distribution 2.3.Mixture Model In statistics, there are many available method for modeling the data. These methods are named as statistical distributions. Statistical distributions are classifying to discrete probability distribution and continuous probability distributions. Data analysis used these statistical distributions which is appropriate to data s behavior. All analysis want to describe the data is concerned optimally. Occasionally, some data scan be distributed in different clusters that as seen on the distribution graph of the data. This difference also can be named as heterogeneity on the distribution. In such cases there is a new approach of modeling these data is known as the mixture models in literature. The Mixture Models provide a natural representation of heterogeneity in a number of clusters. Mixture Models provide a method of describing more complex probability distributions which is mentioned earlier as clusters, by combining several probability distributions. The combining probability distributions can be same distributions with different parameters and also different distributions. These distributions are combined with mixture weights. The notation of the probability distribution of the mixture models; Moment Generation Expected Value The Mixture Models are weighted with parameters are defined the weights of each component distributions. is defined as the parameter of the component distribution. In the formula, there is only a parameter, it can be differ according to the component distributions. The model can be constructed by combining of two or more probability density function. Defining of the component distributions is related to data analyst and his own experience. ISBN:

4 Density Plot of the data is very useful tool for defining of the component distribution. The advantages of mixture models can be explained with four main titles. First, component distributions can be multimodal. Second, the mixture model cover the data well to standard models. Third, it includes well-studied statistical inference techniques available. Last one is flexibility in choosing the component distributions. 2.4.Generalized Pareto Mixture Model Generalized Pareto Mixture Model (GPMM) is a parametric probability density function represented as a weighted sum of Generalized Pareto component densities. It is a weighted sum of k component Generalized Pareto densities as given by the equation; where x is a data vector,, are the mixture weights and, i=1, 2,,k are the component Generalized Pareto densities. Each component density is a Generalized Pareto probability density function of the form; where are location parameters, are shape parameter and are scale parameter of the Generalized Pareto probability density function. The mixture weights satisfy the constraint that 3.Simulation Study 3.1.Inverse Transformation Method Generating random samples is one of the most important subject in simulation studies. Researchers often refer generating random samples for their studies. Simulation study is not a proof for the problems but it can be preferable way for demonstrating some facts. There are several methods are used for generating random samples. Purpose of applying these methods is to generate random samples from an arbitrary distribution. Most of them are quite complex and requiring computer support. Some of them can be easier for researchers. One of the simple methods is the inverse transformation. The Inverse Transformation Method applies with the uniform distribution. After the calculation of cumulative probability function of intended distribution, random samples can be generatedwith the inverse transformation of random samples which are generated from uniform distribution. Let be a random variable with cumulative distribution function is. is a nondecreasing function, the inverse function may be defined as Let. The cumulative distribution function of the inverse transform is given by Thus, to generate a random variable with cumulative distribution, draw and set. This leads to the general method for generating random samples from an arbitrary cumulative probability distribution. [6] Algorithm: 1.Generate 2.Set. Requirements for applying the inverse transformation method: 1.The cumulative probability function of intended distribution must be nondecreasing 2.The inverse of the cumulative probability function of intended distribution must be found analitically. 3.2.Chi-Square Goodness-of-Fit Test In the simulation study, generated random samples must be check by a goodness-of-fit test be certain of belong the indented distribution. For this purpose, chi-square goodness-of-fit test is used and R is used. ISBN:

5 Chi-square goodness-of-fit test calculate a test statistic from the differences between observed frequencies and expected frequencies of theoretical distribution. The cumulative probability function of the theoretical distribution is used for calculating the expected frequencies. The test statistics is calculated below; The result is calculated from the formula is in (3.2.1) named as chi-square test statistics. This is compared with chis-square table value and then the conclusion is defined about the hypotheses. 3.3.Numerical Results In the application part of this study, the random samples are generated from GPMM by the inverse transformation method. For this, the R code block is programmed. In this code block, the threeparameters of the distribution are fixed when the random samples are generated. The mixture weights are used as 0.3 and 0.7. Algorithm; 1. Generate 2. If then set Else (or ) then set By these fixed parameters and values, the programmed R codes are worked 1000 times with different sample sizes and the appropriate samples which have appropriate distribution are detected according to the level of significance The success ratio of the results are showed as (successful trials/all trials) in the figure. Random samples are generated from six generalized pareto distribution with different parameters. Then this step is repeated 1000 times with different sample sizes which are 10, 20, 50, 100. Each cell shows the success of the generated random samples which are passed the goodness-of-fit test successfully Table3: The Results of Chi-Square Test As you seen on the table of the results, if the sample size is 100, the generated random samples distribute generalized pareto with threeparameters appropriately more than 90% success with significance level is If the sample size is increased, the success of the generator is be higher. After the seeing accordance of the generator we can generate the random samples from the GPMM.According to the algorithm, the generalized pareto distributions which are shown in Table 1 as indexes 7 and 8 are mixtured with mixture weights are 0.3 and 0.7 is used for generating random samples. Then generated random samples are shown in Figure 1. In Figure 2, the generalized pareto distributions as indexes 1 and 6 are mixtured with mixture weights are 0.3 and 0.7 is used for generating random samples and the result is shown. Shape Scale Location n=10 n=20 n=50 n=100 Index 5 = ISBN:

6 samples from the generalized pareto distribution. After researchers obtain the success of the generator, the mixture model is used for generating random samples with mixture weights. We can say that the random samples from GPMM can be safely generated by the inverse transformation method. Figure 1 Figure 2 4.Conclusions Researchers are faced with homogeneousdata in their studies. They can model these data with a known distribution. In the growing research work area, the modeling might be easy like this. There are many data which are heterogeneous in many areas. In these cases, the mixture models can be more appropriate for modeling the data. In this study, the GPMM is constructed with finite mixture weights with two components. After this, the random samples are generated from this mixture model. The results are tested with chisquared goodness-of-fit test, and the success rate of the inverse transformation method is shown in Table 1 with significance level is In this test, the components of mixture model is tested separately. After the seeing the success of the method, the generated random samples graphsare drow in Figure 1 and Figure 2. As a result, the inverse transformation method is useful way for generating random References [1] Pickands J. (1975), "Statistical Inference Using Extreme Order Statistics," The Annals of Statistics, 3: [2] Davidson A.C. (1984), Modeling excesses over high threshold with an application, In: J. Tiago de Oliveria (ed.), Statistical Extremes and Applications, Reidel, Dordrech, pp [3] Castillo, E. and A.S. Hadi, (1997), Fitting the Generalized Pareto Distribution to Data, JASA, 92( 440): Castillo J., Daoudib J. (2008), Estimation of the generalized Pareto distribution, Statistics & Probability Letters, In Press. [4] Mierlus-Mazilu (2010), On Generalized Pareto Distributions, Romanian Journal of Economic Forecasting, 8: [5] Beirlant J., Tertius de Wet, Goegebeur Y.(2006), A goodness-of-fit statistic for Pareto-type behaviour, Journal of Computational and Applied Mathematics 186 (2006) [6] Kroese D.P., Taimre T., Botev Z.I., Handbook of Monte Carlo Methods, 2011, John Wiley & Sons, Inc. [7] Raja T.A., Mir A.H.(2013), On fitting of Generalized Pareto Distribution, Global Journal of Human Social Science Economics, Volume 13 Issue 2. [8] Li Jia, Mixture Models, Lecture Notes, The Pennsylvania State University, Department of Statistics. ISBN:

Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics

Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics For 2015 Examinations Aim The aim of the Probability and Mathematical Statistics subject is to provide a grounding in

More information

Practical Calculation of Expected and Unexpected Losses in Operational Risk by Simulation Methods

Practical Calculation of Expected and Unexpected Losses in Operational Risk by Simulation Methods Practical Calculation of Expected and Unexpected Losses in Operational Risk by Simulation Methods Enrique Navarrete 1 Abstract: This paper surveys the main difficulties involved with the quantitative measurement

More information

Statistics Graduate Courses

Statistics Graduate Courses Statistics Graduate Courses STAT 7002--Topics in Statistics-Biological/Physical/Mathematics (cr.arr.).organized study of selected topics. Subjects and earnable credit may vary from semester to semester.

More information

Gamma Distribution Fitting

Gamma Distribution Fitting Chapter 552 Gamma Distribution Fitting Introduction This module fits the gamma probability distributions to a complete or censored set of individual or grouped data values. It outputs various statistics

More information

Chapter 3 RANDOM VARIATE GENERATION

Chapter 3 RANDOM VARIATE GENERATION Chapter 3 RANDOM VARIATE GENERATION In order to do a Monte Carlo simulation either by hand or by computer, techniques must be developed for generating values of random variables having known distributions.

More information

Survival Analysis of Left Truncated Income Protection Insurance Data. [March 29, 2012]

Survival Analysis of Left Truncated Income Protection Insurance Data. [March 29, 2012] Survival Analysis of Left Truncated Income Protection Insurance Data [March 29, 2012] 1 Qing Liu 2 David Pitt 3 Yan Wang 4 Xueyuan Wu Abstract One of the main characteristics of Income Protection Insurance

More information

Int. Statistical Inst.: Proc. 58th World Statistical Congress, 2011, Dublin (Session STS040) p.2985

Int. Statistical Inst.: Proc. 58th World Statistical Congress, 2011, Dublin (Session STS040) p.2985 Int. Statistical Inst.: Proc. 58th World Statistical Congress, 2011, Dublin (Session STS040) p.2985 Small sample estimation and testing for heavy tails Fabián, Zdeněk (1st author) Academy of Sciences of

More information

Curriculum Map Statistics and Probability Honors (348) Saugus High School Saugus Public Schools 2009-2010

Curriculum Map Statistics and Probability Honors (348) Saugus High School Saugus Public Schools 2009-2010 Curriculum Map Statistics and Probability Honors (348) Saugus High School Saugus Public Schools 2009-2010 Week 1 Week 2 14.0 Students organize and describe distributions of data by using a number of different

More information

2WB05 Simulation Lecture 8: Generating random variables

2WB05 Simulation Lecture 8: Generating random variables 2WB05 Simulation Lecture 8: Generating random variables Marko Boon http://www.win.tue.nl/courses/2wb05 January 7, 2013 Outline 2/36 1. How do we generate random variables? 2. Fitting distributions Generating

More information

Java Modules for Time Series Analysis

Java Modules for Time Series Analysis Java Modules for Time Series Analysis Agenda Clustering Non-normal distributions Multifactor modeling Implied ratings Time series prediction 1. Clustering + Cluster 1 Synthetic Clustering + Time series

More information

Supplement to Call Centers with Delay Information: Models and Insights

Supplement to Call Centers with Delay Information: Models and Insights Supplement to Call Centers with Delay Information: Models and Insights Oualid Jouini 1 Zeynep Akşin 2 Yves Dallery 1 1 Laboratoire Genie Industriel, Ecole Centrale Paris, Grande Voie des Vignes, 92290

More information

Lloyd Spencer Lincoln Re

Lloyd Spencer Lincoln Re AN OVERVIEW OF THE PANJER METHOD FOR DERIVING THE AGGREGATE CLAIMS DISTRIBUTION Lloyd Spencer Lincoln Re Harry H. Panjer derives a recursive method for deriving the aggregate distribution of claims in

More information

Descriptive statistics Statistical inference statistical inference, statistical induction and inferential statistics

Descriptive statistics Statistical inference statistical inference, statistical induction and inferential statistics Descriptive statistics is the discipline of quantitatively describing the main features of a collection of data. Descriptive statistics are distinguished from inferential statistics (or inductive statistics),

More information

delta r R 1 5 10 50 100 x (on log scale)

delta r R 1 5 10 50 100 x (on log scale) Estimating the Tails of Loss Severity Distributions using Extreme Value Theory Alexander J. McNeil Departement Mathematik ETH Zentrum CH-8092 Zíurich December 9, 1996 Abstract Good estimates for the tails

More information

BNG 202 Biomechanics Lab. Descriptive statistics and probability distributions I

BNG 202 Biomechanics Lab. Descriptive statistics and probability distributions I BNG 202 Biomechanics Lab Descriptive statistics and probability distributions I Overview The overall goal of this short course in statistics is to provide an introduction to descriptive and inferential

More information

Article from: ARCH 2014.1 Proceedings

Article from: ARCH 2014.1 Proceedings Article from: ARCH 20141 Proceedings July 31-August 3, 2013 1 Comparison of the Standard Rating Methods and the New General Rating Formula Muhamed Borogovac Boston Mutual Life Insurance Company*; muhamed_borogovac@bostonmutualcom

More information

Alessandro Birolini. ineerin. Theory and Practice. Fifth edition. With 140 Figures, 60 Tables, 120 Examples, and 50 Problems.

Alessandro Birolini. ineerin. Theory and Practice. Fifth edition. With 140 Figures, 60 Tables, 120 Examples, and 50 Problems. Alessandro Birolini Re ia i it En ineerin Theory and Practice Fifth edition With 140 Figures, 60 Tables, 120 Examples, and 50 Problems ~ Springer Contents 1 Basic Concepts, Quality and Reliability Assurance

More information

SENSITIVITY ANALYSIS AND INFERENCE. Lecture 12

SENSITIVITY ANALYSIS AND INFERENCE. Lecture 12 This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this

More information

Lecture 2: Descriptive Statistics and Exploratory Data Analysis

Lecture 2: Descriptive Statistics and Exploratory Data Analysis Lecture 2: Descriptive Statistics and Exploratory Data Analysis Further Thoughts on Experimental Design 16 Individuals (8 each from two populations) with replicates Pop 1 Pop 2 Randomly sample 4 individuals

More information

Algebra 1 Course Information

Algebra 1 Course Information Course Information Course Description: Students will study patterns, relations, and functions, and focus on the use of mathematical models to understand and analyze quantitative relationships. Through

More information

Evaluating Trading Systems By John Ehlers and Ric Way

Evaluating Trading Systems By John Ehlers and Ric Way Evaluating Trading Systems By John Ehlers and Ric Way INTRODUCTION What is the best way to evaluate the performance of a trading system? Conventional wisdom holds that the best way is to examine the system

More information

THE USE OF STATISTICAL DISTRIBUTIONS TO MODEL CLAIMS IN MOTOR INSURANCE

THE USE OF STATISTICAL DISTRIBUTIONS TO MODEL CLAIMS IN MOTOR INSURANCE THE USE OF STATISTICAL DISTRIBUTIONS TO MODEL CLAIMS IN MOTOR INSURANCE Batsirai Winmore Mazviona 1 Tafadzwa Chiduza 2 ABSTRACT In general insurance, companies need to use data on claims gathered from

More information

A Primer on Mathematical Statistics and Univariate Distributions; The Normal Distribution; The GLM with the Normal Distribution

A Primer on Mathematical Statistics and Univariate Distributions; The Normal Distribution; The GLM with the Normal Distribution A Primer on Mathematical Statistics and Univariate Distributions; The Normal Distribution; The GLM with the Normal Distribution PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 4: September

More information

E3: PROBABILITY AND STATISTICS lecture notes

E3: PROBABILITY AND STATISTICS lecture notes E3: PROBABILITY AND STATISTICS lecture notes 2 Contents 1 PROBABILITY THEORY 7 1.1 Experiments and random events............................ 7 1.2 Certain event. Impossible event............................

More information

Stochastic Processes and Queueing Theory used in Cloud Computer Performance Simulations

Stochastic Processes and Queueing Theory used in Cloud Computer Performance Simulations 56 Stochastic Processes and Queueing Theory used in Cloud Computer Performance Simulations Stochastic Processes and Queueing Theory used in Cloud Computer Performance Simulations Florin-Cătălin ENACHE

More information

A teaching experience through the development of hypertexts and object oriented software.

A teaching experience through the development of hypertexts and object oriented software. A teaching experience through the development of hypertexts and object oriented software. Marcello Chiodi Istituto di Statistica. Facoltà di Economia-Palermo-Italy 1. Introduction (1) This paper is concerned

More information

Chapter 9 Monté Carlo Simulation

Chapter 9 Monté Carlo Simulation MGS 3100 Business Analysis Chapter 9 Monté Carlo What Is? A model/process used to duplicate or mimic the real system Types of Models Physical simulation Computer simulation When to Use (Computer) Models?

More information

International College of Economics and Finance Syllabus Probability Theory and Introductory Statistics

International College of Economics and Finance Syllabus Probability Theory and Introductory Statistics International College of Economics and Finance Syllabus Probability Theory and Introductory Statistics Lecturer: Mikhail Zhitlukhin. 1. Course description Probability Theory and Introductory Statistics

More information

NEW YORK STATE TEACHER CERTIFICATION EXAMINATIONS

NEW YORK STATE TEACHER CERTIFICATION EXAMINATIONS NEW YORK STATE TEACHER CERTIFICATION EXAMINATIONS TEST DESIGN AND FRAMEWORK September 2014 Authorized for Distribution by the New York State Education Department This test design and framework document

More information

Master of Mathematical Finance: Course Descriptions

Master of Mathematical Finance: Course Descriptions Master of Mathematical Finance: Course Descriptions CS 522 Data Mining Computer Science This course provides continued exploration of data mining algorithms. More sophisticated algorithms such as support

More information

Non Linear Dependence Structures: a Copula Opinion Approach in Portfolio Optimization

Non Linear Dependence Structures: a Copula Opinion Approach in Portfolio Optimization Non Linear Dependence Structures: a Copula Opinion Approach in Portfolio Optimization Jean- Damien Villiers ESSEC Business School Master of Sciences in Management Grande Ecole September 2013 1 Non Linear

More information

Approximation of Aggregate Losses Using Simulation

Approximation of Aggregate Losses Using Simulation Journal of Mathematics and Statistics 6 (3): 233-239, 2010 ISSN 1549-3644 2010 Science Publications Approimation of Aggregate Losses Using Simulation Mohamed Amraja Mohamed, Ahmad Mahir Razali and Noriszura

More information

COMMON CORE STATE STANDARDS FOR

COMMON CORE STATE STANDARDS FOR COMMON CORE STATE STANDARDS FOR Mathematics (CCSSM) High School Statistics and Probability Mathematics High School Statistics and Probability Decisions or predictions are often based on data numbers in

More information

Simulation Exercises to Reinforce the Foundations of Statistical Thinking in Online Classes

Simulation Exercises to Reinforce the Foundations of Statistical Thinking in Online Classes Simulation Exercises to Reinforce the Foundations of Statistical Thinking in Online Classes Simcha Pollack, Ph.D. St. John s University Tobin College of Business Queens, NY, 11439 pollacks@stjohns.edu

More information

Information Theory and Coding Prof. S. N. Merchant Department of Electrical Engineering Indian Institute of Technology, Bombay

Information Theory and Coding Prof. S. N. Merchant Department of Electrical Engineering Indian Institute of Technology, Bombay Information Theory and Coding Prof. S. N. Merchant Department of Electrical Engineering Indian Institute of Technology, Bombay Lecture - 17 Shannon-Fano-Elias Coding and Introduction to Arithmetic Coding

More information

A Coefficient of Variation for Skewed and Heavy-Tailed Insurance Losses. Michael R. Powers[ 1 ] Temple University and Tsinghua University

A Coefficient of Variation for Skewed and Heavy-Tailed Insurance Losses. Michael R. Powers[ 1 ] Temple University and Tsinghua University A Coefficient of Variation for Skewed and Heavy-Tailed Insurance Losses Michael R. Powers[ ] Temple University and Tsinghua University Thomas Y. Powers Yale University [June 2009] Abstract We propose a

More information

Chapter 4 Lecture Notes

Chapter 4 Lecture Notes Chapter 4 Lecture Notes Random Variables October 27, 2015 1 Section 4.1 Random Variables A random variable is typically a real-valued function defined on the sample space of some experiment. For instance,

More information

A SURVEY ON CONTINUOUS ELLIPTICAL VECTOR DISTRIBUTIONS

A SURVEY ON CONTINUOUS ELLIPTICAL VECTOR DISTRIBUTIONS A SURVEY ON CONTINUOUS ELLIPTICAL VECTOR DISTRIBUTIONS Eusebio GÓMEZ, Miguel A. GÓMEZ-VILLEGAS and J. Miguel MARÍN Abstract In this paper it is taken up a revision and characterization of the class of

More information

STAT2400 STAT2400 STAT2400 STAT2400 STAT2400 STAT2400 STAT2400 STAT2400&3400 STAT2400&3400 STAT2400&3400 STAT2400&3400 STAT3400 STAT3400

STAT2400 STAT2400 STAT2400 STAT2400 STAT2400 STAT2400 STAT2400 STAT2400&3400 STAT2400&3400 STAT2400&3400 STAT2400&3400 STAT3400 STAT3400 Exam P Learning Objectives All 23 learning objectives are covered. General Probability STAT2400 STAT2400 STAT2400 STAT2400 STAT2400 STAT2400 STAT2400 1. Set functions including set notation and basic elements

More information

Study Guide for the Final Exam

Study Guide for the Final Exam Study Guide for the Final Exam When studying, remember that the computational portion of the exam will only involve new material (covered after the second midterm), that material from Exam 1 will make

More information

An Introduction to Extreme Value Theory

An Introduction to Extreme Value Theory An Introduction to Extreme Value Theory Petra Friederichs Meteorological Institute University of Bonn COPS Summer School, July/August, 2007 Applications of EVT Finance distribution of income has so called

More information

What is Data Analysis. Kerala School of MathematicsCourse in Statistics for Scientis. Introduction to Data Analysis. Steps in a Statistical Study

What is Data Analysis. Kerala School of MathematicsCourse in Statistics for Scientis. Introduction to Data Analysis. Steps in a Statistical Study Kerala School of Mathematics Course in Statistics for Scientists Introduction to Data Analysis T.Krishnan Strand Life Sciences, Bangalore What is Data Analysis Statistics is a body of methods how to use

More information

Applying Generalized Pareto Distribution to the Risk Management of Commerce Fire Insurance

Applying Generalized Pareto Distribution to the Risk Management of Commerce Fire Insurance Applying Generalized Pareto Distribution to the Risk Management of Commerce Fire Insurance Wo-Chiang Lee Associate Professor, Department of Banking and Finance Tamkang University 5,Yin-Chuan Road, Tamsui

More information

COMPARISON BETWEEN ANNUAL MAXIMUM AND PEAKS OVER THRESHOLD MODELS FOR FLOOD FREQUENCY PREDICTION

COMPARISON BETWEEN ANNUAL MAXIMUM AND PEAKS OVER THRESHOLD MODELS FOR FLOOD FREQUENCY PREDICTION COMPARISON BETWEEN ANNUAL MAXIMUM AND PEAKS OVER THRESHOLD MODELS FOR FLOOD FREQUENCY PREDICTION Mkhandi S. 1, Opere A.O. 2, Willems P. 3 1 University of Dar es Salaam, Dar es Salaam, 25522, Tanzania,

More information

MEU. INSTITUTE OF HEALTH SCIENCES COURSE SYLLABUS. Biostatistics

MEU. INSTITUTE OF HEALTH SCIENCES COURSE SYLLABUS. Biostatistics MEU. INSTITUTE OF HEALTH SCIENCES COURSE SYLLABUS title- course code: Program name: Contingency Tables and Log Linear Models Level Biostatistics Hours/week Ther. Recite. Lab. Others Total Master of Sci.

More information

Normality Testing in Excel

Normality Testing in Excel Normality Testing in Excel By Mark Harmon Copyright 2011 Mark Harmon No part of this publication may be reproduced or distributed without the express permission of the author. mark@excelmasterseries.com

More information

Exam P - Total 23/23 - 1 -

Exam P - Total 23/23 - 1 - Exam P Learning Objectives Schools will meet 80% of the learning objectives on this examination if they can show they meet 18.4 of 23 learning objectives outlined in this table. Schools may NOT count a

More information

What is Statistics? Lecture 1. Introduction and probability review. Idea of parametric inference

What is Statistics? Lecture 1. Introduction and probability review. Idea of parametric inference 0. 1. Introduction and probability review 1.1. What is Statistics? What is Statistics? Lecture 1. Introduction and probability review There are many definitions: I will use A set of principle and procedures

More information

Modelling operational risk in the insurance industry

Modelling operational risk in the insurance industry April 2011 N 14 Modelling operational risk in the insurance industry Par Julie Gamonet Centre d études actuarielles Winner of the French «Young Actuaries Prize» 2010 Texts appearing in SCOR Papers are

More information

STATISTICA Formula Guide: Logistic Regression. Table of Contents

STATISTICA Formula Guide: Logistic Regression. Table of Contents : Table of Contents... 1 Overview of Model... 1 Dispersion... 2 Parameterization... 3 Sigma-Restricted Model... 3 Overparameterized Model... 4 Reference Coding... 4 Model Summary (Summary Tab)... 5 Summary

More information

Pr(X = x) = f(x) = λe λx

Pr(X = x) = f(x) = λe λx Old Business - variance/std. dev. of binomial distribution - mid-term (day, policies) - class strategies (problems, etc.) - exponential distributions New Business - Central Limit Theorem, standard error

More information

Description. Textbook. Grading. Objective

Description. Textbook. Grading. Objective EC151.02 Statistics for Business and Economics (MWF 8:00-8:50) Instructor: Chiu Yu Ko Office: 462D, 21 Campenalla Way Phone: 2-6093 Email: kocb@bc.edu Office Hours: by appointment Description This course

More information

How To Calculate The Power Of A Cluster In Erlang (Orchestra)

How To Calculate The Power Of A Cluster In Erlang (Orchestra) Network Traffic Distribution Derek McAvoy Wireless Technology Strategy Architect March 5, 21 Data Growth is Exponential 2.5 x 18 98% 2 95% Traffic 1.5 1 9% 75% 5%.5 Data Traffic Feb 29 25% 1% 5% 2% 5 1

More information

Probability Distribution

Probability Distribution Lecture 4 Probability Distribution Continuous Case Definition: A random variable that can take on any value in an interval is called continuous. Definition: Let Y be any r.v. The distribution function

More information

Spatial Data Analysis

Spatial Data Analysis 14 Spatial Data Analysis OVERVIEW This chapter is the first in a set of three dealing with geographic analysis and modeling methods. The chapter begins with a review of the relevant terms, and an outlines

More information

6 3 The Standard Normal Distribution

6 3 The Standard Normal Distribution 290 Chapter 6 The Normal Distribution Figure 6 5 Areas Under a Normal Distribution Curve 34.13% 34.13% 2.28% 13.59% 13.59% 2.28% 3 2 1 + 1 + 2 + 3 About 68% About 95% About 99.7% 6 3 The Distribution Since

More information

Credibility and Pooling Applications to Group Life and Group Disability Insurance

Credibility and Pooling Applications to Group Life and Group Disability Insurance Credibility and Pooling Applications to Group Life and Group Disability Insurance Presented by Paul L. Correia Consulting Actuary paul.correia@milliman.com (207) 771-1204 May 20, 2014 What I plan to cover

More information

Logistic Regression (1/24/13)

Logistic Regression (1/24/13) STA63/CBB540: Statistical methods in computational biology Logistic Regression (/24/3) Lecturer: Barbara Engelhardt Scribe: Dinesh Manandhar Introduction Logistic regression is model for regression used

More information

CATASTROPHIC RISK MANAGEMENT IN NON-LIFE INSURANCE

CATASTROPHIC RISK MANAGEMENT IN NON-LIFE INSURANCE CATASTROPHIC RISK MANAGEMENT IN NON-LIFE INSURANCE EKONOMIKA A MANAGEMENT Valéria Skřivánková, Alena Tartaľová 1. Introduction At the end of the second millennium, a view of catastrophic events during

More information

Algebra 1 2008. Academic Content Standards Grade Eight and Grade Nine Ohio. Grade Eight. Number, Number Sense and Operations Standard

Algebra 1 2008. Academic Content Standards Grade Eight and Grade Nine Ohio. Grade Eight. Number, Number Sense and Operations Standard Academic Content Standards Grade Eight and Grade Nine Ohio Algebra 1 2008 Grade Eight STANDARDS Number, Number Sense and Operations Standard Number and Number Systems 1. Use scientific notation to express

More information

Sums of Independent Random Variables

Sums of Independent Random Variables Chapter 7 Sums of Independent Random Variables 7.1 Sums of Discrete Random Variables In this chapter we turn to the important question of determining the distribution of a sum of independent random variables

More information

The sample space for a pair of die rolls is the set. The sample space for a random number between 0 and 1 is the interval [0, 1].

The sample space for a pair of die rolls is the set. The sample space for a random number between 0 and 1 is the interval [0, 1]. Probability Theory Probability Spaces and Events Consider a random experiment with several possible outcomes. For example, we might roll a pair of dice, flip a coin three times, or choose a random real

More information

Non-life insurance mathematics. Nils F. Haavardsson, University of Oslo and DNB Skadeforsikring

Non-life insurance mathematics. Nils F. Haavardsson, University of Oslo and DNB Skadeforsikring Non-life insurance mathematics Nils F. Haavardsson, University of Oslo and DNB Skadeforsikring About the lecturer PhD in risk analysis, Institute for Mathematics, University of Oslo 11 years experience

More information

Load Balancing and Switch Scheduling

Load Balancing and Switch Scheduling EE384Y Project Final Report Load Balancing and Switch Scheduling Xiangheng Liu Department of Electrical Engineering Stanford University, Stanford CA 94305 Email: liuxh@systems.stanford.edu Abstract Load

More information

Nonparametric adaptive age replacement with a one-cycle criterion

Nonparametric adaptive age replacement with a one-cycle criterion Nonparametric adaptive age replacement with a one-cycle criterion P. Coolen-Schrijner, F.P.A. Coolen Department of Mathematical Sciences University of Durham, Durham, DH1 3LE, UK e-mail: Pauline.Schrijner@durham.ac.uk

More information

Introduction to Markov Chain Monte Carlo

Introduction to Markov Chain Monte Carlo Introduction to Markov Chain Monte Carlo Monte Carlo: sample from a distribution to estimate the distribution to compute max, mean Markov Chain Monte Carlo: sampling using local information Generic problem

More information

Simple Data Analysis Techniques

Simple Data Analysis Techniques Simple Data Analysis Techniques Three of the most common charts used for data analysis are pie, Pareto and trend charts. These are often linked together in a data trail. Pie Charts Pie charts provide a

More information

Monte Carlo testing with Big Data

Monte Carlo testing with Big Data Monte Carlo testing with Big Data Patrick Rubin-Delanchy University of Bristol & Heilbronn Institute for Mathematical Research Joint work with: Axel Gandy (Imperial College London) with contributions from:

More information

Simulation and Risk Analysis

Simulation and Risk Analysis Simulation and Risk Analysis Using Analytic Solver Platform REVIEW BASED ON MANAGEMENT SCIENCE What We ll Cover Today Introduction Frontline Systems Session Ι Beta Training Program Goals Overview of Analytic

More information

PROJECT RISK MANAGEMENT - ADVANTAGES AND PITFALLS

PROJECT RISK MANAGEMENT - ADVANTAGES AND PITFALLS 1 PROJECT RISK MANAGEMENT - ADVANTAGES AND PITFALLS Kenneth K. Humphreys 1, PE CCE DIF 1 Past Secretary-Treasurer, ICEC, Granite Falls, NC, United States Abstract Proper project decision-making requires

More information

A Correlation of. to the. South Carolina Data Analysis and Probability Standards

A Correlation of. to the. South Carolina Data Analysis and Probability Standards A Correlation of to the South Carolina Data Analysis and Probability Standards INTRODUCTION This document demonstrates how Stats in Your World 2012 meets the indicators of the South Carolina Academic Standards

More information

Estimation and attribution of changes in extreme weather and climate events

Estimation and attribution of changes in extreme weather and climate events IPCC workshop on extreme weather and climate events, 11-13 June 2002, Beijing. Estimation and attribution of changes in extreme weather and climate events Dr. David B. Stephenson Department of Meteorology

More information

This PDF is a selection from a published volume from the National Bureau of Economic Research. Volume Title: The Risks of Financial Institutions

This PDF is a selection from a published volume from the National Bureau of Economic Research. Volume Title: The Risks of Financial Institutions This PDF is a selection from a published volume from the National Bureau of Economic Research Volume Title: The Risks of Financial Institutions Volume Author/Editor: Mark Carey and René M. Stulz, editors

More information

NEW YORK CITY COLLEGE OF TECHNOLOGY The City University of New York

NEW YORK CITY COLLEGE OF TECHNOLOGY The City University of New York NEW YORK CITY COLLEGE OF TECHNOLOGY The City University of New York DEPARTMENT: Mathematics COURSE: MAT 1272/ MA 272 TITLE: DESCRIPTION: TEXT: Statistics An introduction to statistical methods and statistical

More information

STA 4273H: Statistical Machine Learning

STA 4273H: Statistical Machine Learning STA 4273H: Statistical Machine Learning Russ Salakhutdinov Department of Statistics! rsalakhu@utstat.toronto.edu! http://www.cs.toronto.edu/~rsalakhu/ Lecture 6 Three Approaches to Classification Construct

More information

THE CENTRAL LIMIT THEOREM TORONTO

THE CENTRAL LIMIT THEOREM TORONTO THE CENTRAL LIMIT THEOREM DANIEL RÜDT UNIVERSITY OF TORONTO MARCH, 2010 Contents 1 Introduction 1 2 Mathematical Background 3 3 The Central Limit Theorem 4 4 Examples 4 4.1 Roulette......................................

More information

Modeling Individual Claims for Motor Third Party Liability of Insurance Companies in Albania

Modeling Individual Claims for Motor Third Party Liability of Insurance Companies in Albania Modeling Individual Claims for Motor Third Party Liability of Insurance Companies in Albania Oriana Zacaj Department of Mathematics, Polytechnic University, Faculty of Mathematics and Physics Engineering

More information

On the Interaction and Competition among Internet Service Providers

On the Interaction and Competition among Internet Service Providers On the Interaction and Competition among Internet Service Providers Sam C.M. Lee John C.S. Lui + Abstract The current Internet architecture comprises of different privately owned Internet service providers

More information

Characterizing Task Usage Shapes in Google s Compute Clusters

Characterizing Task Usage Shapes in Google s Compute Clusters Characterizing Task Usage Shapes in Google s Compute Clusters Qi Zhang 1, Joseph L. Hellerstein 2, Raouf Boutaba 1 1 University of Waterloo, 2 Google Inc. Introduction Cloud computing is becoming a key

More information

Short-selling in the Chinese stock market: What would happen with returns?

Short-selling in the Chinese stock market: What would happen with returns? Department of Economics and Society, Dalarna University C-level essay in Statistics, Fall 2006 Short-selling in the Chinese stock market: What would happen with returns? Supervisor: Kenneth Carling Authors:

More information

Why Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization. Learning Goals. GENOME 560, Spring 2012

Why Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization. Learning Goals. GENOME 560, Spring 2012 Why Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization GENOME 560, Spring 2012 Data are interesting because they help us understand the world Genomics: Massive Amounts

More information

Cover. Optimal Retentions with Ruin Probability Target in The case of Fire. Insurance in Iran

Cover. Optimal Retentions with Ruin Probability Target in The case of Fire. Insurance in Iran Cover Optimal Retentions with Ruin Probability Target in The case of Fire Insurance in Iran Ghadir Mahdavi Ass. Prof., ECO College of Insurance, Allameh Tabataba i University, Iran Omid Ghavibazoo MS in

More information

Data Preparation and Statistical Displays

Data Preparation and Statistical Displays Reservoir Modeling with GSLIB Data Preparation and Statistical Displays Data Cleaning / Quality Control Statistics as Parameters for Random Function Models Univariate Statistics Histograms and Probability

More information

A Model of Optimum Tariff in Vehicle Fleet Insurance

A Model of Optimum Tariff in Vehicle Fleet Insurance A Model of Optimum Tariff in Vehicle Fleet Insurance. Bouhetala and F.Belhia and R.Salmi Statistics and Probability Department Bp, 3, El-Alia, USTHB, Bab-Ezzouar, Alger Algeria. Summary: An approach about

More information

Service courses for graduate students in degree programs other than the MS or PhD programs in Biostatistics.

Service courses for graduate students in degree programs other than the MS or PhD programs in Biostatistics. Course Catalog In order to be assured that all prerequisites are met, students must acquire a permission number from the education coordinator prior to enrolling in any Biostatistics course. Courses are

More information

Lecture 8. Confidence intervals and the central limit theorem

Lecture 8. Confidence intervals and the central limit theorem Lecture 8. Confidence intervals and the central limit theorem Mathematical Statistics and Discrete Mathematics November 25th, 2015 1 / 15 Central limit theorem Let X 1, X 2,... X n be a random sample of

More information

MARKETS, INFORMATION AND THEIR FRACTAL ANALYSIS. Mária Bohdalová and Michal Greguš Comenius University, Faculty of Management Slovak republic

MARKETS, INFORMATION AND THEIR FRACTAL ANALYSIS. Mária Bohdalová and Michal Greguš Comenius University, Faculty of Management Slovak republic MARKETS, INFORMATION AND THEIR FRACTAL ANALYSIS Mária Bohdalová and Michal Greguš Comenius University, Faculty of Management Slovak republic Abstract: We will summarize the impact of the conflict between

More information

STATISTICAL ANALYSIS TOWARDS THE IDENTIFICATION OF ACCURATE PROBABILITY DISTRIBUTION MODELS FOR THE COMPRESSIVE STRENGTH OF CONCRETE

STATISTICAL ANALYSIS TOWARDS THE IDENTIFICATION OF ACCURATE PROBABILITY DISTRIBUTION MODELS FOR THE COMPRESSIVE STRENGTH OF CONCRETE October 1-17, 008, Beijing, China STATISTICAL ANALYSIS TOWADS THE IDENTIFICATION OF ACCUATE POBABILITY DISTIBUTION MODELS FO THE COMPESSIVE STENGTH OF CONCETE S. Silvestri 1, G. Gasparini 1, T. Trombetti

More information

Statistical methods to expect extreme values: Application of POT approach to CAC40 return index

Statistical methods to expect extreme values: Application of POT approach to CAC40 return index Statistical methods to expect extreme values: Application of POT approach to CAC40 return index A. Zoglat 1, S. El Adlouni 2, E. Ezzahid 3, A. Amar 1, C. G. Okou 1 and F. Badaoui 1 1 Laboratoire de Mathématiques

More information

Bias in the Estimation of Mean Reversion in Continuous-Time Lévy Processes

Bias in the Estimation of Mean Reversion in Continuous-Time Lévy Processes Bias in the Estimation of Mean Reversion in Continuous-Time Lévy Processes Yong Bao a, Aman Ullah b, Yun Wang c, and Jun Yu d a Purdue University, IN, USA b University of California, Riverside, CA, USA

More information

LOGNORMAL MODEL FOR STOCK PRICES

LOGNORMAL MODEL FOR STOCK PRICES LOGNORMAL MODEL FOR STOCK PRICES MICHAEL J. SHARPE MATHEMATICS DEPARTMENT, UCSD 1. INTRODUCTION What follows is a simple but important model that will be the basis for a later study of stock prices as

More information

Lecture 7: Continuous Random Variables

Lecture 7: Continuous Random Variables Lecture 7: Continuous Random Variables 21 September 2005 1 Our First Continuous Random Variable The back of the lecture hall is roughly 10 meters across. Suppose it were exactly 10 meters, and consider

More information

MBA 611 STATISTICS AND QUANTITATIVE METHODS

MBA 611 STATISTICS AND QUANTITATIVE METHODS MBA 611 STATISTICS AND QUANTITATIVE METHODS Part I. Review of Basic Statistics (Chapters 1-11) A. Introduction (Chapter 1) Uncertainty: Decisions are often based on incomplete information from uncertain

More information

Descriptive Statistics

Descriptive Statistics Y520 Robert S Michael Goal: Learn to calculate indicators and construct graphs that summarize and describe a large quantity of values. Using the textbook readings and other resources listed on the web

More information

Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics.

Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics. Business Course Text Bowerman, Bruce L., Richard T. O'Connell, J. B. Orris, and Dawn C. Porter. Essentials of Business, 2nd edition, McGraw-Hill/Irwin, 2008, ISBN: 978-0-07-331988-9. Required Computing

More information

Benchmark Rates for XL Reinsurance Revisited: Model Comparison for the Swiss MTPL Market

Benchmark Rates for XL Reinsurance Revisited: Model Comparison for the Swiss MTPL Market Benchmark Rates for XL Reinsurance Revisited: Model Comparison for the Swiss MTPL Market W. Hürlimann 1 Abstract. We consider the dynamic stable benchmark rate model introduced in Verlaak et al. (005),

More information

Monte Carlo Simulation

Monte Carlo Simulation 1 Monte Carlo Simulation Stefan Weber Leibniz Universität Hannover email: sweber@stochastik.uni-hannover.de web: www.stochastik.uni-hannover.de/ sweber Monte Carlo Simulation 2 Quantifying and Hedging

More information

3. Data Analysis, Statistics, and Probability

3. Data Analysis, Statistics, and Probability 3. Data Analysis, Statistics, and Probability Data and probability sense provides students with tools to understand information and uncertainty. Students ask questions and gather and use data to answer

More information