Selecting the Right Distribution in @RISK



Similar documents
Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics

Choosing Probability Distributions in Simulation

Quantitative Methods for Finance

ModelRisk for Insurance and Finance. Quick Start Guide

Modelling Uncertainty in Claims Experience to Price General Insurance Contracts

Practical Calculation of Expected and Unexpected Losses in Operational Risk by Simulation Methods

Pricing Alternative forms of Commercial insurance cover. Andrew Harford

Simulation and Risk Analysis

Important Probability Distributions OPRE 6301

Operational Risk. Operational Risk in Life Insurers. Operational Risk similarities with General Insurance? unique features

LECTURE 16. Readings: Section 5.1. Lecture outline. Random processes Definition of the Bernoulli process Basic properties of the Bernoulli process

Stochastic Analysis of Long-Term Multiple-Decrement Contracts

STT315 Chapter 4 Random Variables & Probability Distributions KM. Chapter 4.5, 6, 8 Probability Distributions for Continuous Random Variables

Supplement to Call Centers with Delay Information: Models and Insights

Probability Calculator

Asymmetry and the Cost of Capital

4. Continuous Random Variables, the Pareto and Normal Distributions

Chapter 3 RANDOM VARIATE GENERATION

Basel II: Operational Risk Implementation based on Risk Framework

Question: What is the probability that a five-card poker hand contains a flush, that is, five cards of the same suit?

seven Statistical Analysis with Excel chapter OVERVIEW CHAPTER

Chapter 9 Monté Carlo Simulation

Quantitative Risk Analysis with Microsoft Project

Curriculum Map Statistics and Probability Honors (348) Saugus High School Saugus Public Schools

( ) = 1 x. ! 2x = 2. The region where that joint density is positive is indicated with dotted lines in the graph below. y = x

Financial Simulation Models in General Insurance

CHAPTER 6: Continuous Uniform Distribution: 6.1. Definition: The density function of the continuous random variable X on the interval [A, B] is.

Review of Random Variables

Forecast Confidence Level and Portfolio Optimization

STAT 315: HOW TO CHOOSE A DISTRIBUTION FOR A RANDOM VARIABLE

Ch.3 Demand Forecasting.

INT 3 Schedule Risk Analysis

Random Variate Generation (Part 3)

COMMON CORE STATE STANDARDS FOR

Skewness and Kurtosis in Function of Selection of Network Traffic Distribution

Permutation Tests for Comparing Two Populations

Probability, statistics and football Franka Miriam Bru ckler Paris, 2015.

Pricing Alternative forms of Commercial Insurance cover

Modelling operational risk in the insurance industry

Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics.

Stop Loss Reinsurance

STAT2400 STAT2400 STAT2400 STAT2400 STAT2400 STAT2400 STAT2400 STAT2400&3400 STAT2400&3400 STAT2400&3400 STAT2400&3400 STAT3400 STAT3400

THE USE OF STATISTICAL DISTRIBUTIONS TO MODEL CLAIMS IN MOTOR INSURANCE

Load Balancing and Switch Scheduling

IEOR 6711: Stochastic Models I Fall 2012, Professor Whitt, Tuesday, September 11 Normal Approximations and the Central Limit Theorem

Probability density function : An arbitrary continuous random variable X is similarly described by its probability density function f x = f X

Regulatory and Economic Capital

Simplified Option Selection Method

Foundation of Quantitative Data Analysis

Approximation of Aggregate Losses Using Simulation

Characteristics of Binomial Distributions

A Correlation of. to the. South Carolina Data Analysis and Probability Standards

Course Text. Required Computing Software. Course Description. Course Objectives. StraighterLine. Business Statistics

Statistics I for QBIC. Contents and Objectives. Chapters 1 7. Revised: August 2013

Probability Concepts Probability Distributions. Margaret Priest Nokuthaba Sibanda

Statistical Functions in Excel

Tail-Dependence an Essential Factor for Correctly Measuring the Benefits of Diversification

How To Understand And Solve A Linear Programming Problem

RISKY LOSS DISTRIBUTIONS AND MODELING THE LOSS RESERVE PAY-OUT TAIL

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Exam P - Total 23/

Probability and Statistics Prof. Dr. Somesh Kumar Department of Mathematics Indian Institute of Technology, Kharagpur

240ST014 - Data Analysis of Transport and Logistics

Sums of Independent Random Variables

Example 1: Dear Abby. Stat Camp for the Full-time MBA Program

Featured article: Evaluating the Cost of Longevity in Variable Annuity Living Benefits

Statistical Rules of Thumb

Encyclopedia of Operations Management

Lecture 2: Discrete Distributions, Normal Distributions. Chapter 1

Quantitative Impact Study 1 (QIS1) Summary Report for Belgium. 21 March 2006

Fairfield Public Schools

VISUALIZATION OF DENSITY FUNCTIONS WITH GEOGEBRA

NOTES ON THE BANK OF ENGLAND OPTION-IMPLIED PROBABILITY DENSITY FUNCTIONS

FEGYVERNEKI SÁNDOR, PROBABILITY THEORY AND MATHEmATICAL

6 Scalar, Stochastic, Discrete Dynamic Systems

Chapter 5. Discrete Probability Distributions

Notes on Continuous Random Variables

Lecture 19: Chapter 8, Section 1 Sampling Distributions: Proportions

Monte Carlo Estimation of Project Volatility for Real Options Analysis

Using simulation to calculate the NPV of a project

CORPORATE FINANCE RISK ANALYSIS WITH CRYSTAL BALL ARE WE ADDING VALUE?

Chicago Booth BUSINESS STATISTICS Final Exam Fall 2011

Bachelor's Degree in Business Administration and Master's Degree course description

Non-life insurance mathematics. Nils F. Haavardsson, University of Oslo and DNB Skadeforsikring

PROBABILITY SECOND EDITION

Risk Analysis and Quantification

Modeling Individual Claims for Motor Third Party Liability of Insurance Companies in Albania

International College of Economics and Finance Syllabus Probability Theory and Introductory Statistics

USING A CLAIM SIMULATION MODEL FOR RESERVING & LOSS FORECASTING FOR MEDICAL PROFESSIONAL LIABILITY. By Rajesh V. Sahasrabuddhe, FCAS, MAAA

Queuing Theory. Long Term Averages. Assumptions. Interesting Values. Queuing Model

Overview of Monte Carlo Simulation, Probability Review and Introduction to Matlab

Java Modules for Time Series Analysis

business statistics using Excel OXFORD UNIVERSITY PRESS Glyn Davis & Branko Pecar

STATISTICAL METHODS IN GENERAL INSURANCE. Philip J. Boland National University of Ireland, Dublin

6 PROBABILITY GENERATING FUNCTIONS

Transcription:

Selecting the Right Distribution in @RISK October 2009 By Michael Rees mrees@palisade.com

Michael Rees Professional Experience» 20 years Strategy consultant Equity analyst Independent consultant» Palisade Director of Training and Consulting Qualifications» B.A. in Mathematics and D.Phil. in Mathematical Modeling from Oxford» MBA from INSEAD» Certificate in Quantitative Finance» Financial Modelling in Practice (John Wiley & Sons, October) [see Wiley.com for free.pdf of Ch 1; without the models] 2

Topics» A framework for selecting distributions» Distributions and properties that are new in v5.0/5.5» Examples

Is there a right distribution?» Theoretically correct? E.g. Based on some implicit model of the input s behavior» Conforms with industry standards?» Data driven? E.g. Data fitting or re-sampling Pragmatic? E.g. Intuitive to communicate and implement Mixtures of the above

Industry standards: Examples Finance and Insurance» Normal distribution for change in price from one period to the next (for short periods; lognormal for longer periods)» Poisson distribution for number of events (crashes, earthquakes )» Insurance: Pareto (or Lognormal) distribution for severity/magnitude of events Oil and Gas» Lognormal distribution: Volumetric uncertainty on reserves, permeability» Normal distribution: Core porosity (%), % of abundant minerals, chemical elements or oxides in rocks

Data-driven approaches Fitting» Data points are from the same random process (e.g. changes to, not the level of, house prices)» Distribution is one of those available in the fitting software» Fitting may not be appropriate where the data arises from compound processes, such as those involving several possible event risks Resampling» (v4.5-5.0) RiskDUniform distribution (sampling with replacement) with historic or assumed data (sometimes: Discrete if data has already been grouped into frequency classes)» (v5.5) RiskResample to allow also for sampling without replacement, as well as re-sampling in order

Pragmatic Approaches Continuous» Uniform Discrete» Binomial» Triangular» Discrete» PERT» Distribution Artist» Alternate parameter form Incl. NormalAlt, LogNormAlt i.e. mixed approaches

Distributions: Summary Uses Conceptual Pragmatic Data driven General Binomial, Poisson (RiskCompound) Normal, Lognormal Ext Value, Hypergeometric (Uniform) Triangular PERT Binomial Discrete Alternate Parameter RiskGeneral, RiskCumul (Distribution Artist) Any fitted distribution Resampling Waiting Time Geometric, Exponential, Weibull Negative Binomial Gamma, Erlang Any: via intensity function for probs Any fitted distribution Resampling Parameter Estimation & Statistical Beta, BetaGeneral, Gamma Chi-squared, Student, Pearson (F dist.) PERT Any fitted distribution Resampling Other Logistic, LogLogistic, InvGauss Pareto, Compound and MakeInput Any fitted distribution Resampling 8

Lognormal distribution» Multiplication of random processes creates a positive asymmetry (skew) Small multiplied by small is small Large multiplied by large is very large» Conceptual models: Lognormal arises from the multiplication of many random factors» Examples of applications Future value of an asset with random independent periodic growth rates Uncertainty of oil reserves» Log(A.B.C..) = LogA + LogB + LogC + =Normal distribution Product of many random variables is Lognormally distributed i.e. the logarithm of the product is normally distributed

Normal distribution» Addition of random processes creates symmetry e.g. Binomial process with n=100 and p=0.1; average no. of heads/tails = 10/90. Around this, there are: 90 ways for a tail to become a head, each with probability 0.1 10 ways for a head to become a tail, each with probability 0.9 So around the average, the distribution is approximately symmetric» Examples of applications NPV of cash flows over several years Goals in a soccer season Earthquakes in the world during next 30 years RTAs in the world per day/in a city per year People arriving in a queue per day (or per minute over many queues) Electricity used by all houses in a city Travel time for a long journey Total company expenditure, built from departmental expenditure Total sales, sum of individual products Total amount of oil in the world, based on amount in each field

Poisson distribution» Answers How many?, rather than yes/no?» Goals in football/soccer Number of goals in a short period of a game?: RiskBinomial(1,p) Number of goals during the entire game? RiskBinomial(n,p) where n is large and p small =>RiskPoisson(np) Number of goals over the whole season? RiskBinomial(nm,p) where m is large, and np constant; RiskPoisson(mnp)=>RiskNormal(mnp, mnp)» Other examples Breakdown/maintenance modelling Queuing/service modelling (e.g. arrival rate) Earthquakes in a country per year Road traffic accidents in a city per day

Time to occurrence (constant probability or intensity) Discrete Time» Questions Which period will first goal occur in soccer (or: periods between goals)? How many coin tosses of coin before getting a head? How often will a gambler think she is ahead of the game (before losing!)» Simulation model (e.g. using MATCH function) to verify Geometric distribution Continuous Time» Questions Time until first goal (or between goals)? Etc.» =>Exponential distribution

Time to occurrence other examples Multiple occurrences with constant intensity» Negative Binomial: No. of failures before s successes = RiskNegBin(s,p)+s can be derived from mathematics (strictly: this is the definition of the negative binomial distribution) RiskNegBin(1,p) = RiskGeomet(p)» Gamma: inter-arrival times for multiple events from a Poisson process Gamma(1, β)=expondist(β) Variable intensity processes» Typical examples assumed: Weibull distribution Weibull1(1,β)=Exponent ial(β) has constant probability Weibull(3.5, β) is approximately Normal Gamma distribution» Often easy mathematics based on intensity functions for any assumed distribution

Parameter Estimation» The Beta distribution represents the uncertainty distribution for probability of a binomial process given observed data You toss a coin 10 times and get 4 heads. True probability of a head is uncertain, and given by Beta(4+1,10-4+1)) If you score 40 heads with 100 coin tosses, Beta(40+1,100-40+1) has much narrower range» Similarly, the Gamma distribution represents the uncertainty for the intensity of a Poisson process

Summary: Key first line Distributions Continuous» Symmetric uncertainty Triangular: Pragmatic/use for (approx.) symmetric ranges/double bounded Normal: Conceptually the sum of many random processes/ Symmetric/double unbounded» Asymmetric uncertainty PERT: Pragmatic/use for nonsymmetric ranges/double bounded LogNormal: Conceptually the product of many random processes/nonsymmetric/unbounded on one side Distribution Artist Discrete» Event risks Binomial for occurrence or not (and as approximation to low intensity Poisson) Discrete for several outcomes Poisson for multiple occurrences (e.g. use within RiskCompound)» Data Distribution fitting Resampling from data (RiskResample or RiskDuniform)» Industry standards» Alternate parameter formulation can allow convergence of methods e.g. LognormAlt 15

Using RiskTheo functions» General exploration of properties of distributions Properties of alternate parameter distributions e.g. Implied volatility of a process that is calibrated with percentiles Effect of alternate parameters on implied non-symmetry of distributions Standard deviation of inputs e.g. when looking at tornado graphs Implied biases of base cases in the context of non-symmetric uncertainty e.g. Proportion of distribution above/below the modal values Assessing the effect of sample sizes on model calibration» Approximation of one distribution by another Normal by PERTAlt Continuous by discrete 16

V5.0 and 5.5: Some new distributions and features» RiskTheo statistics functions Cross-calibration when using alternate parameters Distribution matching (e.g. Swanson s rule)» RiskResample Allows for sampling with/without replacement, or stepping through historic data» RiskCompound (RiskMakeInput) Frequency-severity modelling (creation of single sources of risk from multiple sources, e.g. for tornado charts)» RiskMakeInput Event-Impact model Aggregation of small risks into categories (e.g. annual operating expenses, where opex is made up of several line items)» RiskSplice Creates single continuous distribution formed by joining two distributions together at the splice point» Johnson family 17

RiskCompound» Frequency-Severity Modelling e.g. Damages from accidents, fires (including forest fires) Operational losses e.g. resulting from accounting problems, fraud, rogue trading etc. Medical injury modelling General insurance and reinsurance pricing: Accidents, fires, property, catastrophes, earthquakes, weather damages, directors liability insurance etc. Capital requirements (e.g. using value at risk methods to determine e.g. the P99.5 point of a loss distribution) Credit loss modelling Other finance e.g. Modelling extreme exchange rate movements Other: customer service costs» RiskCompound has optional arguments to layer the severity distribution (e.g. for excess of loss treaty) 18

RiskMakeInput» Treats argument to the function as a distribution for the purposes of sensitivity analysis, and excludes the distributions to this from all sensitivity analysis (even for that of another output)» Examples of uses Event-Impact model with binomial distribution with n=1 for each event (could also use RiskCompound) Aggregation of small risks into categories (e.g. annual operating expenses, where opex is made up of several line items) 19

Selection of Distributions: Recap» Conceptual?» Industry standards? e.g. Oil and Gas: Reserves estimation using Lognormal distribution Insurance: Number of events using Poisson distribution etc» Availability of data and its use? Fitting a distribution to data (Fitting icon) Re-sampling from the data (RiskResample or RiskDuniform distribution) Use data to calibrate a pre-selected distribution» Pragmatic Discrete versus continuous? Symmetric or skewed? Accuracy/defensibility versus communication objectives? Requirement to capture un-boundedness?» Parameterisation using percentiles? 20

Last words» Don t panic!!! There is generally no absolutely correct distribution: Correct distributions exist in only idealized or conceptual situations» Be optimistic!!! As long as you use some common sense, your @RISK model will be a large improvement over your static model!!» Use a combination of techniques: Think about the conceptual side Think about the pragmatic side Check whether historic data is valid for fitting etc