Corrected Diffusion Approximations for the Maximum of Heavy-Tailed Random Walk



Similar documents
1. (First passage/hitting times/gambler s ruin problem:) Suppose that X has a discrete state space and let i be a fixed state. Let

A Uniform Asymptotic Estimate for Discounted Aggregate Claims with Subexponential Tails

Non-Life Insurance Mathematics

Asymptotics for ruin probabilities in a discrete-time risk model with dependent financial and insurance risks

Supplement to Call Centers with Delay Information: Models and Insights

Internet Traffic Variability (Long Range Dependency Effects) Dheeraj Reddy CS8803 Fall 2003

UNIFORM ASYMPTOTICS FOR DISCOUNTED AGGREGATE CLAIMS IN DEPENDENT RISK MODELS

Lectures on Stochastic Processes. William G. Faris

Master s Theory Exam Spring 2006

The Exponential Distribution

FULL LIST OF REFEREED JOURNAL PUBLICATIONS Qihe Tang

A Coefficient of Variation for Skewed and Heavy-Tailed Insurance Losses. Michael R. Powers[ 1 ] Temple University and Tsinghua University

Nonparametric adaptive age replacement with a one-cycle criterion

Performance Analysis of a Telephone System with both Patient and Impatient Customers

M/M/1 and M/M/m Queueing Systems

Random access protocols for channel access. Markov chains and their stability. Laurent Massoulié.

Load Balancing and Switch Scheduling

Asian Option Pricing Formula for Uncertain Financial Market

Stationary random graphs on Z with prescribed iid degrees and finite mean connections

Exam Introduction Mathematical Finance and Insurance

IEOR 6711: Stochastic Models I Fall 2012, Professor Whitt, Tuesday, September 11 Normal Approximations and the Central Limit Theorem

Is a Brownian motion skew?

Estimating the Degree of Activity of jumps in High Frequency Financial Data. joint with Yacine Aït-Sahalia

COMPLETE MARKETS DO NOT ALLOW FREE CASH FLOW STREAMS

The Analysis of Dynamical Queueing Systems (Background)

Observingtheeffectof TCP congestion controlon networktraffic

LECTURE 15: AMERICAN OPTIONS

Marshall-Olkin distributions and portfolio credit risk

A Model of Optimum Tariff in Vehicle Fleet Insurance

Hydrodynamic Limits of Randomized Load Balancing Networks

Mathematical Finance

1 Short Introduction to Time Series

Probability and Statistics

Discrete Mathematics and Probability Theory Fall 2009 Satish Rao, David Tse Note 18. A Brief Introduction to Continuous Probability

Wald s Identity. by Jeffery Hein. Dartmouth College, Math 100

STRATEGIC CAPACITY PLANNING USING STOCK CONTROL MODEL

Quickest Detection in Multiple ON OFF Processes Qing Zhao, Senior Member, IEEE, and Jia Ye

CATASTROPHIC RISK MANAGEMENT IN NON-LIFE INSURANCE

Large Insurance Losses Distributions

INDIRECT INFERENCE (prepared for: The New Palgrave Dictionary of Economics, Second Edition)

INSURANCE RISK THEORY (Problems)

Maximum Likelihood Estimation

Monte Carlo Methods and Models in Finance and Insurance

Modern Optimization Methods for Big Data Problems MATH11146 The University of Edinburgh

Message-passing sequential detection of multiple change points in networks

LOGNORMAL MODEL FOR STOCK PRICES

The CUSUM algorithm a small review. Pierre Granjon

ALGORITHMIC TRADING WITH MARKOV CHAINS

Examining Self-Similarity Network Traffic intervals

LECTURE 16. Readings: Section 5.1. Lecture outline. Random processes Definition of the Bernoulli process Basic properties of the Bernoulli process

IN THIS PAPER, we study the delay and capacity trade-offs

Non-Stationary Time Series andunitroottests

Sign changes of Hecke eigenvalues of Siegel cusp forms of degree 2

Lecture 3: Linear methods for classification

On the mathematical theory of splitting and Russian roulette

F. ABTAHI and M. ZARRIN. (Communicated by J. Goldstein)

Monte Carlo and Empirical Methods for Stochastic Inference (MASM11/FMS091)

Network Performance Measurement and Analysis

QUEUEING MODELS FOR LARGE SCALE CALL CENTERS

Continued Fractions and the Euclidean Algorithm

Metric Spaces. Chapter Metrics

Credit Risk Models: An Overview

t := maxγ ν subject to ν {0,1,2,...} and f(x c +γ ν d) f(x c )+cγ ν f (x c ;d).

Inner Product Spaces

LECTURE 4. Last time: Lecture outline

On the Traffic Capacity of Cellular Data Networks. 1 Introduction. T. Bonald 1,2, A. Proutière 1,2

IEOR 6711: Stochastic Models, I Fall 2012, Professor Whitt, Final Exam SOLUTIONS

Monte Carlo Methods in Finance

Stochastic Models for Inventory Management at Service Facilities

An Introduction to Extreme Value Theory

Probability Generating Functions

Stochastic Processes and Queueing Theory used in Cloud Computer Performance Simulations

THE NUMBER OF GRAPHS AND A RANDOM GRAPH WITH A GIVEN DEGREE SEQUENCE. Alexander Barvinok

Principle of Data Reduction

5. Continuous Random Variables

1.5 / 1 -- Communication Networks II (Görg) Transforms

Scheduling Algorithms for Downlink Services in Wireless Networks: A Markov Decision Process Approach

Lecture 13: Martingales

Optimal reinsurance with ruin probability target

Stochastic Inventory Control

MATH 132: CALCULUS II SYLLABUS

Binomial lattice model for stock prices

Transcription:

Corrected Diffusion Approximations for the Maximum of Heavy-Tailed Random Walk Jose Blanchet and Peter Glynn December, 2003. Let (X n : n 1) be a sequence of independent and identically distributed random variables with E (X 1 )=0,andletS =(S n : n 0) be its associated random walk (so that S 0 =0and S n = X 1 +... + X n for n 1). Let us introduce a small location parameter δ > 0 representing the drift of the random walk. That is, let us consider a parametric family of random walks, S δ = S δ n : n 0, defined by S δ n = S n nδ. Our focus here is on developing high accuracy approximations for the distribution of the r.v. M δ =max S δ n 1 n when the increment rv s posses heavy-tails. For our purposes here, we say that X has heavy-tails if for all θ 6= 0, E exp (θ X ) =. (1) In other words, X does not have exponential moments. We say that X has light tails, if E exp (θ X ) < for some θ > 0. The distribution of M δ is of importance in a number of different disciplines. For x>0, {M δ >x} = {τ δ (x) < }, whereτ δ (x) =inf{n 1:Sn δ >x}, sothat computing the tail of M δ is equivalent to computing a level crossing probability for the random walk S δ. Because of this level crossing interpretation, the tail of M δ is of great interest to both the sequential analysis and risk theory communities. In particular, in the setting of insurance risk, P (τ δ (x) < ) is the probability that an insurer will face ruin in finite time (when the insurer starts with initial reserve x and is subjected to iid claims over time); see, for example, Asmussen (2001). The distribution of M δ also arises in the analysis of the single most important model in queueing theory, namely the single-server queue. If the inter-arrival and service times for successive customers are iid with a mean arrival rate less than the mean service rate, then W = (W n : n 0) is a positive recurrent Markov chain 1

on [0, ), wherew n is the waiting time (exclusive of service) for customer n. If W is a random variable having the stationary distribution of W, then Kiefer and Wolfowitz (1956) showed that W has the distribution of M δ for an appropriately defined random walk. As a consequence, computing the distribution of M δ is of great interest to queueing theorists. Since W is a positive recurrent Markov chain, the distribution of M δ can be computed as the solution to the equation describing the stationary distribution of W. This linear integral equation is known as Lindley s equation (see Lindley (1952)) and is of Wiener-Hopf type; it is challenging to solve, both analytically and numerically. As a result, approximations are frequently employed instead. One important such approximation holds as δ & 0.. This asymptotic regime corresponds in risk theory to the setting in which the safety loading is small (i.e. the premium charged is close to the typical pay-out for claims) and in queueing theory to the heavy traffic setting in which the server is utilized close to 100% of the time. Thus, this asymptotic regime is of great interest from an application standpoint. Kingman (1963) showed that the approximation P (M δ >x) exp 2δx/σ 2 (2) is valid as δ & 0,whereσ 2 = Var(X 1 ). Becausetherighthandsideof(2)istheexact value of the level crossing probability for the natural Brownian approximation to the random walk S, (2) is often called the diffusion approximation to the distribution of M. However, as with any such approximation, there are applications for which (2) delivers poor results. For the light-tailed case, Siegmund (1979) proposed a so-called corrected diffusion approximation (CDA) that reflects information in the increment distribution beyond the mean and variance. The full asymptotic description in the development initiated by Siegmund was given in the important case of Gaussian random walks by Chang and Peres (1997), and more recently, this complete description in the light tailed case was extended to cover general increment distributions by the authors in Blanchet and Glynn (2004). The strategy followed by Siegmund (1979), Chang and Peres (1997) and Blanchet and Glynn (2004) in the light-tail case makes use of an exponential change-of-measure (see, for example, Siegmund (1985), p. 13.) which yields the convenient representation P (M δ >x)=p (τ δ (x) < ) =exp( θ δ x) E θδ exp θδ S δ τ δ (x) x, (3) where θ δ is assumed to be a positive solution to E exp (θ δ X 1 δθ δ )=1and P θδ generated by the product extension of is df θδ =exp(θ δ x) df. F being the distribution function of X 1. It is clear that, in the heavy-tail context, a direct attack using such change-of-measure ideas is infeasible (since the basic elements 2

used to write equation (3) are not well defined in this case.) Thus, in order to develop CDA s for the distribution of M δ one has to approach the problem in a different way. Many problems in applied probability motivate study of models with heavy tails. In risk insurance, for example, statistical evidence suggest that most claims sizes should be modeled as heavy tailed random variables (see, for example, Embrechts, Kluppelberg and Mikosch (1997)). In this context, a popular choice is a Weibull distribution (recall that X Weibull(a, b), fora>0 and b (0, 1] if its support is [0, ) and P (X >x)=exp ax b ), which possesses moments of all orders but is heavy tailed in our sense (for b<1). Queueing theory also gives rise to heavy-tails. Forexample, whenmodelingdatatraffic in communication networks, evidence has been found suggesting that exponential tail features (present in traditional models of data traffic) are not compatible with empirical observations (see Adler, Feldman and Taqqu (1998)). Therefore, developing asymptotic analysis for systems with heavy tail characteristics is an important applied problem. However, as we discussed earlier, developing CDA s in the presence of heavy-tailed increments presents a major mathematical complication as the standard techniques that work well in the presence of light tails (such as exponential change-of-measure) cannot be applied in the heavy tailed context. Also, it often occurs that the nature of the conclusions obtained is completely different for very similar models under these two different tail environments. In the context of risk insurance, for example, it is well known (see Asmussen (2001)) that if the claim distribution is heavy tailed (more precisely, subexponential) the insurance company s bankruptcy is (roughly) driven by a single large claim; in contrast, if the claim size is light tailed then the ruin occurs due to a long sequence of claims that behave according to an (exponentially) twisted version of the underlying claim distribution. For heavy-tailed increments, Hogan (1986) proposed a first order CDA to (2). In particular, assuming that E X 1 5 < and under integrability conditions on the characteristic function of X 1, Hogan showed that as δ & 0 P (M δ >x) exp 2δx/σ 2 (1 2δ/σ) β 1 δ/σ + o (δ). The constant β 1 was computed by Siegmund (1979) as β 1 = 1 6 EX3 1 1 Z 1 2π θ 2 Re log{2(1 g (θ)) /θ2 }dθ. (4) Hogan s strategy consisted, essentially, in applying direct Fourier inversion to the characteristic function of M (δ). His method of proof does not extend directly to higher order correction terms. Our proposed approximation improves upon Hogan s by further reducing the error of the CDA as δ & 0 (in the presence of higher order moments), and by significantly relaxing the integrability assumptions imposed in Hogan s development. In particular, we only require the distribution of X 1 being strongly non-lattice, which means 3

that for each ε > 0 inf 1 g (θ) > 0, (5) θ >ε where g (θ) =E exp (iθx 1 ) is the characteristic function of X 1. The form of our approximation is given in the following theorem. Theorem 1 Suppose that E X 1 p < for p>4 and that the distribution of X 1 is strongly non-lattice, then there exist constants (δ) and r (δ) such that P (M δ >x/δ) =exp( (δ) x/δ + r (δ)) + o δ p 4. (6) Moreover, these constants can be computed explicitly in terms of various moments and the characteristic function of X 1 The expressions for the constants (δ) and r (δ) closely resemble their light tail analogues obtained in Blanchet and Glynn (2004). Computing these constants require essentially evaluating a single integral of the form of (4), which makes possible an efficient implementation of approximation (6) in practical situations. In some models with special structure (such as the M/G/1 model in queueing theory or the classical risk model in risk insurance) greatly simplify (avoiding the need for numerical integration). In our view, an important contribution of the paper concerns the method of proof of Theorem 1. The strategy applied to prove Theorem 1 follows a two step procedure. First we use a geometric sum representation for M δ in terms of the ascending ladder heights of S δ (c.f. Kalashnikov (1997), Section 1.3). This representation permits then to formulate the problem so as to take advantage of rates of convergence in the renewal theorem. Second we use a convenient truncated random walk for which light tail techniques (such as (3)) apply. The truncated random walk is shown to be suitably close to S δ via the geometric sum representation and renewal theory arguments, thereby providing the conclusion of the Theorem. The proof of Theorem 1 reveals many useful insights that can be applied to related problems under heavy tail environments. For example, we suggest a plausible corrected diffusion approximation for the probability of ruin within a finite time for insurance companies with heavy tail claims. This problem was declared open by Soren Asmussen (see Asmussen (2001), Ch. 4 remarks). A heuristic approximation was also given by Siegmund (1979) for the light-tailed case (c.f. Siegmund (1985)). It was later adapted to the risk theory context by Asmussen (1982) and has been used successfully (even without a rigorous proof) in many application settings (c.f. Asmussen (1982) and Asmussen and Hojgaard (1999)). To put this problem in perspective note that even in the light-tailed case, the rigorous justification of the finite horizon approximation is known to be hard, see Siegmund (1979). A more compelling argument (although still incomplete) was given later in Theorem 10.41 of Siegmund (1985). 4

An interesting question that remains open arises when the increment distribution has infinite variance (this case is motivated by many models in data traffic andcommunication networks (see Willinger et al (1995)). If Var(X 1 )= but E ( X 1 ) < it is known that there is also a heavy-traffic regime (based on maxima of stable processes) that approximates the distribution of M (see Whitt (2001)). In that case it would be of interest to develop analogous corrected approximations. References [1]Adler,J.,Feldman,R.,andTaqqu,M.(Editors.)A Practical Guide to Heavy Tails: Statistical Techniques and Applications. Birkhauser. [2] Asmussen, S. (1984) Approximations for the Probability of Ruin within Finite Time.Scand.Act.J.31-57. [3] Asmussen, S. (2001) Ruin Probabilities. World Scientific. [4] Asmussen, S., and Hojgaard, B. (1999) Approximations for Finite Horizon Ruin ProbabilitiesintheRenewalModel.Scand.Act.J.2,106-119. [5] Blanchet, J., and Glynn, P. (2004) Corrected Diffusion Approximations for Light- Tailed Random Walk. In preparation. [6] Chang, J. (1992) On Moments of the First Ladder Height of Random Walks with Small Drift. Ann. of App Prob. 2, 714-738. [7] Chang, J., and Peres, Y. (1997) Ladder Heights, Gaussian Random Walks and the Riemann Zeta Function. Annals of Probability 25, 787-802. [8] Embrechts, P., Klüppelberg, C., and Mikosch, T. (1999) Modelling extreme events with applications to insurance and finance. Springer-Verlag. [9] Hogan, M. (1986) Comment on Corrected Diffusion Approximations in Certain Random Walk Problems. J. Appl. Probab. 23, 89-96. [10] Kalashnikov, V. (1997) Geometric Sums: Bounds for Rare Events with Applications. Kluwer. [11] Kiefer, J. and Wolfowitz, J. (1956) On the Characteristics of the General Queueing Process with Applications to Random Walks. Trans. Amer. Math. Soc. 78, 1-18. [12] Kingman, J. (1963) Ergodic Properties of Continuous Time Markov Processes and Their Discrete Skeletons. Proc. London. Math. soc. 13, 593-604. 5

[13] Lai, T. (1976) Asymptotic Moments of Random Walks with Applications to Ladder Variables and Renewal Theory. Annals of Probability. 4, 51-66. [14] Lindey, D. The Theory of a Queue with a Single-Server. Proc. Cambr. Philos. Soc. 48, 277-289. [15] Siegmund D. (1979) Corrected Diffusion Approximations in Certain Random Walk Problems. Adv. Appl. Prob. 11, 701-719. [16] Siegmund D. (1985) Sequential Analysis. Springer-Verlag. [17] Whitt, W. (2001) Stochastic-Process Limits. Springer-Verlag. [18] Willinger, W., Taqqu, M., Leland, W., and Wilson, D. (1995) Self-Similarity in High-Speed Packet Traffic: Analysis and Modeling of Ethernet Traffic Measurements. Statistical Science. Vol. 10, 67 85. [19] Woodroofe, M. (1979) Repeated Likelihood Ratio Tests. Biometrika. 66, 453-463. [20] Woodroofe, M. (1982) Non-Linear Renewal Theory in Sequential Analysis. Society for Industrial and Applied Mathematics. 6