A Unifying Pricing Theory for Insurance and Financial Risks: Applications for a Unified Risk Management Alejandro Balbás 1 and José Garrido 1,2 1 Department of Business Administration University Carlos III of Madrid, Spain 2 Department of Mathematics and Statistics Concordia University, Canada Instituto MEFF April 24, 2002 1
Overview: 1. Introduction: 1.1 Distortion Operators, Insurance Pricing, 1.2 Choquet Pricing of Assets and Losses. 2. Wang s Distortion Operator, 3. Applications: 3.1 Asset Pricing, 3.2 Capital Asset Pricing Model, 3.3 Black-Scholes Formula, 3.4 Deviance and Tail Index. 4. Dynamic Coherent Measures. 2
1. Introduction Insurance and financial risks are becoming more integrated. The actuarial literature argues for a unified pricing theory linking both fields [Smith (1986), Cummins (1990, 1991) or Embrechts (1996), Wang (2000), Cheuk (2001)]. The price of an insurance risk (excluding other expenses) is called a risk-adjusted premium. From classical expected utility theory [Borch (1961), Bühlmann (1980), Goovaerts et al. (1984)] a dual theory of risk has emerged in the economic literature [Yaari (1987)]. Similarly, the price of a financial instrument (e.g. options) also carries risk loads. But the markets and the strategies by which insurance and financial risks are sold differ substantially. 3
We will discuss Wang s proposal of a risk-adjustment method that distorts the survival function of an insurance risk. Wang (2000) shows that his (one-parameter) distortion operator reproduces Black-Scholes theory for option pricing and extends the Capital Asset Pricing Model (CAPM) to insurance pricing. Major and Venter (1999), Gao and Qiu (2001) and Cheuk (2001) consider two-parameter Wang transforms. With it Cheuk defines a generalized Value at Risk (VaR) measure. We characterize Wang s transform as a coherent measure and propose a dynamic (stochastic process) version. Also a tail index for (financial or insurance) risk distributions is derived from it. 4
1.1 Distortion Operators in Insurance Pricing. X is a non-negative loss variable, distributed as F X S X = 1 F X as its survival function. and with The net insurance premium (excluding other expenses) is E[X] = 0 ydf X(y) = 0 S X(y)dy. An insurance layer X (a,a+m] of X is defined by the payoff function X (a,a+m] = 0 0 X < a X a a X < a + m m a + m X where a is a deductible and m a payment limit., 5
The survival function of this insurance layer is given by S X as S X(a,a+m] (y) = { SX (a + y) 0 y < m 0 m y. It yields a net premium of E[X (a,a+m] ] = 0 S X (a,a+m] (y)dy = a+m a S X (x)dx. Wang (1996) suggested to introduce the risk loading by first distorting the survival function before taking expectations. 6
Distorting the survival function of any risk, Wang obtains a riskadjusted premium: H g [X] = g[s X(x)]dx, 0 where g : [0, 1] [0, 1] is increasing with g(0) = 0 and g(1) = 1. g is a distortion operator. It transforms S X into a new survival function g S X, of the ground-up distribution. For example, the risk adjusted premium of a risk layer is then H g [X (a,a+m] ] = 0 g[s X (a,a+m] ]dy = a+m a g[s X (x)]dx. 7
For general insurance pricing, g should satisfy: 0 < g(u) < 1, g(0) = 0 and g(1) = 1, g increasing and concave, g (0) = +, where it is defined. Wang (1996) shows that in a class of one-parameter functions, only satisfies all of the above. g(u) = u r, u [0, 1], 0 < r 1, This corresponds to the proportional hazards (PH) transform of Wang (1995). 8
Despite some desirable properties, Wang s PH transform suffers some major drawbacks. The PH transform: of a lognormal is not lognormal (Black-Scholes formula for option pricing), lacks flexibility yielding fast increasing risk loadings for high layers, cannot be applied simultaneously to assets and liabilities. It yields to serious inconsistencies; see the following section. 9
1.2 Choquet Pricing of Assets and Losses Consider an asset A as a negative loss X = A. The Choquet integral with respect to the distortion g is then given by: 0 H g [X] = {g[s X(x)] 1}dx + g[s X(x)]dx. 0 It has been proposed as a general pricing method for financial markets with frictions [Chateauneuf et al. (1996)]. Definition: For any risk X and a real valued h, the payoff Y = h(x) is a derivative of X. If h is non-decreasing, then Y is called a comonotone derivative of the underlying X. 10
Theorem: Under the Choquet integral the price of a comonotone derivative Y = h(x), H g [Y ] = 0 {g[s Y (y)] 1}dy + 0 g[s Y (y)]dy, is equivalent to the expectation of h(x α ), where X α has the ground-up distribution with survival function S Xα = g S X. The Choquet integral H g provides a risk-neutral valuation of comonotone derivatives, and hence of insurance risk layers. But the above equivalence does not hold for derivatives which are not comonotone with the underlying risk; a distortion with a concave g always produces non-negative loadings, while the ground-up distribution can yield negative loadings. 11
Denneberg (1994) shows that for an asset A > 0 H g [ A] = H g [A], where g (u) = 1 g(1 u) is the dual distortion operator of g. A loss X distorted by a concave g always yields H g [X] E[X]. When g is concave, g is convex and H g [A] E[A]. For most choices g, the dual g belongs to a different parametric family. In particular, if the PH transform g(u) = u r is applied to losses, the dual g (u) = 1 (1 u) r would apply to assets. Symmetric treatment of assets and liabilities is possible for a new class of distortion operators. 12
2. Wang s Distortion Operator Wang (2000) suggests a new distortion operator defined as: g α (u) = Φ[Φ 1 (u) + α], u [0, 1], where α R and Φ is the standard normal distribution with density function: φ(x) = 1 2π e x2 2, x R. For α > 0, g α satisfies all the desirable properties of a distortion operator for insurance pricing: 1. 0 < g α (u) < 1 for u [0, 1], with g α (0) = lim g α(u) = 0 and u 0 + g α (1) = lim g α(u) = 1. u 1 13
2. g α is increasing and for x = Φ 1 (u), g α(u) = g α(u) u = φ(x + α) φ(x) = e αx x2 2 > 0, u (0, 1). 3. For α > 0, g α is concave and g α(u) = 2 g α (u) αφ(x + α) u 2 = φ(x) 2 < 0, u (0, 1). 4. For α > 0, g α (u) becomes unbounded as u approaches 0. 5. The dual operator g α(u) = 1 g α (1 u) = g α (u). 6. g α is convex when g α is concave, as in 3. 14
For g α the Choquet integral gives 0 H[X; α] = {g α[s X (x)] 1}dx + which enjoys some very nice properties: 0 g α[s X (x)]dx, 1. H[c; α] = c and H[X + c; α] = H[X; α] + c, for any constant c. 2. H[bX; α] = bh[x; α] while H[ bx; α] = bh[x; α], for any b > 0. In particular H[ X; α] = H[X; α]. 3. If X 1, X 2 are comonotone H[X 1 +X 2 ; α] = H[X 1 ; α]+h[x 2 ; α]. Since two layers of the same risk are comonotone, then for any a < b < c. H[X (a,b] ; α] + H[X (b,c] ; α] = H[X (a,c] ; α], 15
4. For any two variables X 1 and X 2 H[X 1 + X 2 ; α] H[X 1 ; α] + H[X 2 ; α], for α > 0, H[X 1 ; α] + H[X 2 ; α], for α < 0, showing the benefit of diversification. 5. H[X; α] is an increasing function of α, with min[x] H[X; α] max[x]. 6. With α > 0 for losses and α < 0 for assets, H[X; α] preserves 1st and 2nd order stochastic dominance [see Rothschild and Stiglitz (1971)]. 7. For α > 0, g α(u) goes unbounded as u approaches 0. For example, if X Bernoulli(θ) then lim θ 0 H[X; α] E[X] = lim θ 0 Φ[Φ 1 (θ) + α] θ = g α(0) = +. 16
If X Normal(µ, σ 2 ) then the ground-up survival function S Xα = g α S X defines a new X α Normal(µ + ασ, σ 2 ), that is H[X; α] = E[X α ] = E[X] + ασ[x], which is the usual standard deviation principle. If Y Lognormal(µ, σ 2 ) then the ground-up survival function S Yα = g α S Y defines a new Y α Lognormal(µ + ασ, σ 2 ). g α can be applied to any distribution, but closed forms other than Bernoulli, normal and lognormal do not exist. Numerical evaluations are simple (e.g. in Excel NORMDIST, NORMINV). The generalization to multivariate variables is possible, simply applying the distortion to the joint distribution function. 17
3. Applications 3.1 Asset Pricing Consider asset i with current price A i (0) and future price A i (1) after one period. R i = A i(1) 1 is the annual return, assumed normal with mean A i (0) E[R i ] and standard deviation σ[r i ]. Now, if assets can be priced by applying H[ ; α i ] to the present value of the future asset price at some time, then A A i (0) = H[ i (1) (1 + r f ) ; α i] = H[ A i(0)(1 + R i ) ; α i ], (1 + r f ) where r f is the risk-free return compounded annually. 18
It follows that the risk-adjusted rate of return must coincide with the risk-free rate and hence which implies [Wang (2000)]: H[R i ; α i ] = E[R i ] α i σ[r i ] = r f, α i = E[R i] r f σ[r i ]. A similar implied α applies to an asset portfolio or, by extension, to the market portfolio M. Under similar assumptions α M = E[R M] r f σ[r M ] which is called the market price of risk (Cummins, 1990) in the expression for the capital market line (CML) in the CAPM., 19
The idea can be generalized to T periods. For asset i let R it be the annual return in period t = 1,..., T and A i (t) the price at time t. Then R it = ln A i(t) A i (t 1). Now assume that the R it are iid Normal. return for asset i is with and R i (T ) = ln A i(t ) A i (0) = E[R i (T )] = V [R i (T )] = T t=1 T t=1 T t=1 R it, E[R it ] = T E[R i ] {σ[r it ]} 2 = T {σ[r i ]} 2. The total T -period 20
Again this implies that H[R i (T ); α i ] = T E[R i ] α i T σ[ri ] = T r f, and hence [Wang (2000)]: α i = T { E[Ri ] r f σ[r i ] }. The idea is easily extended to the continuous time model, with geometric Brownian motion (GBM) prices, A t satisfying da i (t) A i (t) = µ idt + σ i dw (t). 21
Let A i (0) be the current asset price. At any future date T, the price A i (T ) is solution to the above equation and has a lognormal distribution [Hull (1997)]: ln A i(t ) A i (0) Normal[(µ σ2 2 )T, σ2 T ]. No-arbitrage with a continuous risk-free return r c implies: A i (0) = H[e r ct Ai (T ); α i ] = e r ct H[Ai (T ); α i ], which implies again [Wang (2000)]: α i = (µ i r c ) T σ i. Notice that α i shares many properties with the volatility and is proportional to the square root of the horizon T. 22
3.2 Capital Asset Pricing Model (CAPM) As seen, the market price of risk α M in the CML of CAPM links the systematic risk of X to the portfolio risk. The CAPM measures a portfolio s risk under the constraint of an expected return level in one compounding period and the assumption of an efficient market. Mixing a risk-free return r f with an efficient market portfolio, gives a minimal risk portfolio for a given level of expected return. Let the expected return and risk of this optimal strategy be µ p and σ p, respectively. Then the following capital market line (CML) relation holds: µ p = r f + α M σ p. 23
For asset i with return R i Normal(µ i, σi 2 ) and a market portfolio return R M Normal(µ M, σ M ), consider the security market line (SML): µ i = r f + β i (µ M r f ), where β i = Cov[R i, R M ] σ 2 M = ρ i,m σ i σ M. Since α M = µ M r f σ, the above SML relation can be rewritten as M µ i = r f + ρ i,m α M σ i, which in turn yields: ρ i,m α M = µ i r f σ A = α i. Wang (2000) generalizes CAPM to include insurance risks that are not normally distributed, valuing them with H[ ; α]. 24
3.3 Black-Scholes Formula Black-Scholes option pricing formula is often seen as a stop-loss reinsurance contract under a lognormal risk neutral price. Wang (2000) shows how the distortion operator reproduces Black- Scholes formula under the specific α of the CAPM. Let A(t) be the current price on day t of an asset and K a European call option strike price, with a right to exercise after T periods. The payoff of this option can be expressed as A(t + T ; K, ) = { 0 if A(t + T ) K A(t + T ) K if A(t + T ) > K. 25
The expected payoff is then E[A(t + T ; K, )] = S A(t+T ;K, ) (x)dx = S A(t+T ) (y)dy, 0 K while the price under the distortion operator g α is e r ft H[A(t + T ; K, ); α] = e r f T K [S A α (t+t ) (y)]dy. It easily seen that with a GBM return of rate with mean µ and volatility σ, the ground-up variables A α(t) implies a lognormal distribution ln A t+t A t Normal[(r f 1 2 σ2 )T, σ 2 T ], for α = (µ r f) T σ. Hence, the distortion and its resulting pricing formula reproduce exactly Black-Scholes model. 26
3.4 Deviance and Tail Index Both, insurance and financial analysts, use indexes to measure and compare riskyness, in the form of a volatility measure, down-side risk, variance, deviance or more generally speaking, a risk measure. The distortion operator g α induces a risk loading H[X; α]. It is natural to think that it can also be used to define a risk measure. Since min[x] H[X; α] max[x], for any α R, and in particular H[X; 0] = E(X) for α = 0, Wang (2000) defines right and left deviance measures. 27
Definition: For α > 0 fixed and any risk X, the right and left deviance indexes are given, respectively, by RD α [X] = 1 {H[X; α] E(X)}, α LD α[x] = 1 {E(X) H[X; α]}. α Note that these deviances measures are function of α, but converge to a common limit at α = 0 : lim RD α[x] = α 0 lim LD α[x] = α 0 H[X; α] α, α=0 which measures the infinitesimal force of distortion at α = 0. This motivates our definition of a deviance index. Definition: For α > 0 fixed and any risk X, the deviance index is given by DI[X] = H[X; α] α. α=0 Note that by definition of H[X; α], this derivative always exists. 28
In finance the down-side risk is usually measured in relation to an underlying normal distribution of asset returns. Wang (2000) uses his right and left deviance indexes to define right and left tail indexes, with the normal distribution serving as a benchmark. Definition: For α > 0 fixed and any risk X, the right and left tail indexes are given, respectively, by RT I α [X] = RD 2α[X] RD α [X] and LT I α [X] = LD 2α[X] LD α [X]. These are ment to be scaleless and differ from his deviance indexes which are dependent on the parameter α. 29
Example: If X normal(µ, σ 2 ) then H[X; α] = µ + ασ and E(X) = µ, for α, µ R and σ > 0. It implies that RD α [X] = µ + ασ µ = µ µ + ασ = LD α [X] = σ > 0, α α reproducing the standard deviation. Note that the same result is obtained with our deviance index DI[X] = σ, a measure always independent of the parameter α. Similarly RT I α [X] = 1 = LT I α [X], showing how normal tails are used here as a benchmark; index values larger than 1 indicate fatter than normal tails. 30
Example: If X lognormal(µ, σ 2 ) then H[X; α] = e (µ+ασ)+σ2 2 and E(X) = e µ+σ2 2, for α, µ R and σ > 0. Hence RD α [X] = e(µ+ασ)+σ2 2 e µ+σ2 2 = e µ+σ2 (eασ 1) 2, α α while the left deviance index has no physical interpretation here: LD α [X] = e µ+σ2 (1 e ασ ) 2. α Note that these deviances depend on the parameter α and differ from the standard deviation: σ[x] = e µ+σ2 2 (e σ2 1). By contrast, our deviance index is closer in value to σ[x] : DI[X] = σe µ+σ2 2. 31
Finally, Wang s tail indexes are given here by: RT I α [X] = eασ + 1 2 and LT I α [X] = 1 + e ασ. 2 Oddly enough, here σ values need to be such that e ασ > 1 for lognormal tails to be fatter than normal. This illustrates the undesirable dependence of Wang s tail index on the parameter α. Alternative tail indexes, based on higher derivatives of H[X; α] at α = 0, are being studied. 32
Example: For X Bernoulli(θ 0 ), H[X; α] = θ α = Φ[Φ 1 (θ 0 )+α] and E[X] = θ 0, where α R and 0 θ 0 1. Hence RD α [X] = Φ[Φ 1 (θ 0 ) + α] θ 0 = θ α θ 0, α α while again, the left deviance index has no physical interpretation: LD α [X] = θ 0 Φ[Φ 1 (θ 0 ) α] α = θ 0 θ α. α These deviance measures also differ from the standard deviation σ[x] = θ 0 (1 θ 0 ) and depend on the parameter α. By contrast, our deviance index is given by: DI[X] = φ[φ 1 (θ 0 )] = 1 2π e [Φ 1 (θ 0 )] 2 2. 33
Finally, Wang s tail indexes and RT I α [X] = Φ[Φ 1 (θ 0 ) + 2α] θ 0 2{Φ[Φ 1 (θ 0 ) + α] θ 0 } = θ 2α θ 0 2(θ α θ 0 ) LT I α [X] = θ 0 Φ[Φ 1 (θ 0 ) 2α] 2{θ 0 Φ[Φ 1 (θ 0 ) α]} = θ 0 θ 2α 2(θ 0 θ α ), depend on α in an intricate way. 34
4. Dynamic Coherent Measures Wang s use of distortion operators and Choquet integrals has been embedded in the framework of coherent risk measures [Artzner, Delbaen, Eber and Heath (1999)]. In Balbás, Garrido and Mayoral (2002) we assume a dynamic framework which reflects the stochastic behavior of (insurance) risks and (financial) pay-offs. As a special application, we study the extensions of VaR (Value at Risk) and TCE (Tail Conditional Expectation) to this dynamic setting. 35
Bibliography: Borch, K.H. (1961) The utility concept applied to the theory of insurance, ASTIN Bulletin, 11, 52-60. Bülmann, H. (1980) An economic premium principle, ASTIN Bulletin, 16, 2, 52-60. Chateauneuf, A., Kast, R. and Lapied, A. (1996) Choquet pricing for financial markets with frictions, Mathematical Finance, 6, 145-166. Cheuk, W.L. (2001) Value at Risk and the Distortion Operator, M.Sc. Thesis, Concordia University, Montreal. Cummins, J.D. (1990) Asset pricing models and insurance ratemaking, ASTIN Bulletin, 20, 2, 125-166. Cummins, J.D. (1991) Statistical and financial models of insurance pricing, J. of Risk and Insurance, 58, 261-302. Denneberg, D. (1994) Non-Additive Measure and Integral, Kluwer Academic Publishers, Boston. 36
Embrechts, P. (1996) Actuarial versus financial pricing of insurance, Conference on Risk Management of Insurance Firms, May 15-17, 1996, The Wharton School of the University of Pennsylvania. Gao, J. and Qiu, W. (2001) An improved distortion operator for pricing insurance risks, submitted to North American Actuarial J.. Goovaerts, M.J., De Vylder, F. and Haezendonck, J. (1984) Insurance Premiums: Theory and Applications, North-Holland, Amsterdam. Hull, J. (1997) Options, Futures and Other Derivatives, 3rd edition, Prentice Hall, New Jersey. Rothschild, M. and Stiglitz, J.E. (1971) Increasing risk I: a definition, J. of Economic Theory, 2, 225-243. Smith, C.W., Jr. (1986) On the convergence of insurance and finance research, J. of Risk and Insurance, 53, 693-717. Wang, S.S. (1995) Insurance pricing and increased limits ratemaking by proportional hazards transforms, Insurance: Mathematics and Economics, 17, 43-54. Wang, S.S. (1996) Premium calculation by transforming the layer premium density, ASTIN Bulletin, 26, 1, 71-92. Yaari, M.E. (1987) The dual theory of choice under risk, Econometrica, 55, 95-115. 37