Tests for exponentiality against the M and LM-classes of life distributions



Similar documents
Nonparametric adaptive age replacement with a one-cycle criterion

Probability and Statistics Prof. Dr. Somesh Kumar Department of Mathematics Indian Institute of Technology, Kharagpur

THE CENTRAL LIMIT THEOREM TORONTO

LOGNORMAL MODEL FOR STOCK PRICES

The Variability of P-Values. Summary

E3: PROBABILITY AND STATISTICS lecture notes

Gamma Distribution Fitting

Marshall-Olkin distributions and portfolio credit risk

Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics

Review of Random Variables

ON THE EXISTENCE AND LIMIT BEHAVIOR OF THE OPTIMAL BANDWIDTH FOR KERNEL DENSITY ESTIMATION

CHAPTER II THE LIMIT OF A SEQUENCE OF NUMBERS DEFINITION OF THE NUMBER e.

ON LIMIT LAWS FOR CENTRAL ORDER STATISTICS UNDER POWER NORMALIZATION. E. I. Pancheva, A. Gacovska-Barandovska

. (3.3) n Note that supremum (3.2) must occur at one of the observed values x i or to the left of x i.

MATHEMATICAL METHODS OF STATISTICS

Differentiating under an integral sign

Principle of Data Reduction

Chapter 3 RANDOM VARIATE GENERATION

A Uniform Asymptotic Estimate for Discounted Aggregate Claims with Subexponential Tails

THE DYING FIBONACCI TREE. 1. Introduction. Consider a tree with two types of nodes, say A and B, and the following properties:

**BEGINNING OF EXAMINATION** The annual number of claims for an insured has probability function: , 0 < q < 1.

Inference on the parameters of the Weibull distribution using records

2WB05 Simulation Lecture 8: Generating random variables

Continued Fractions and the Euclidean Algorithm

ARBITRAGE-FREE OPTION PRICING MODELS. Denis Bell. University of North Florida

Lectures 5-6: Taylor Series

1 if 1 x 0 1 if 0 x 1

Managing uncertainty in call centers using Poisson mixtures

Distribution (Weibull) Fitting

Undergraduate Notes in Mathematics. Arkansas Tech University Department of Mathematics

A note on exact moments of order statistics from exponentiated log-logistic distribution

SUMAN DUVVURU STAT 567 PROJECT REPORT

INSURANCE RISK THEORY (Problems)

Goodness of Fit Tests for the Gumbel Distribution with Type II right Censored data

Adaptive Online Gradient Descent

2.3 Convex Constrained Optimization Problems

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.436J/15.085J Fall 2008 Lecture 5 9/17/2008 RANDOM VARIABLES

2DI36 Statistics. 2DI36 Part II (Chapter 7 of MR)

A Coefficient of Variation for Skewed and Heavy-Tailed Insurance Losses. Michael R. Powers[ 1 ] Temple University and Tsinghua University

Lecture 15 Introduction to Survival Analysis

Further Study on Strong Lagrangian Duality Property for Invex Programs via Penalty Functions 1

VERTICES OF GIVEN DEGREE IN SERIES-PARALLEL GRAPHS

4. Expanding dynamical systems

Statistics in Retail Finance. Chapter 6: Behavioural models

Important Probability Distributions OPRE 6301

Duality of linear conic problems

Statistics Graduate Courses

Stocks paying discrete dividends: modelling and option pricing

Tests for Two Survival Curves Using Cox s Proportional Hazards Model

CITY UNIVERSITY LONDON. BEng Degree in Computer Systems Engineering Part II BSc Degree in Computer Systems Engineering Part III PART 2 EXAMINATION

Mathematics Course 111: Algebra I Part IV: Vector Spaces

Asymptotics for ruin probabilities in a discrete-time risk model with dependent financial and insurance risks

Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model

Exam C, Fall 2006 PRELIMINARY ANSWER KEY

Numerical Methods for Option Pricing

Lecture 13: Martingales

MATH4427 Notebook 2 Spring MATH4427 Notebook Definitions and Examples Performance Measures for Estimators...

A characterization of trace zero symmetric nonnegative 5x5 matrices

Order Statistics: Theory & Methods. N. Balakrishnan Department of Mathematics and Statistics McMaster University Hamilton, Ontario, Canada. C. R.

U.C. Berkeley CS276: Cryptography Handout 0.1 Luca Trevisan January, Notes on Algebra

Understanding Basic Calculus

Section 5.1 Continuous Random Variables: Introduction

About the Gamma Function

INDISTINGUISHABILITY OF ABSOLUTELY CONTINUOUS AND SINGULAR DISTRIBUTIONS

Integer Factorization using the Quadratic Sieve

Supplement to Call Centers with Delay Information: Models and Insights

OPTIMAl PREMIUM CONTROl IN A NON-liFE INSURANCE BUSINESS

Mathematical Methods of Engineering Analysis

Mathematics Review for MS Finance Students

Life Table Analysis using Weighted Survey Data

Corrected Diffusion Approximations for the Maximum of Heavy-Tailed Random Walk

Sums of Independent Random Variables

FIRST YEAR CALCULUS. Chapter 7 CONTINUITY. It is a parabola, and we can draw this parabola without lifting our pencil from the paper.

A Model of Optimum Tariff in Vehicle Fleet Insurance

Date: April 12, Contents

A Simple Model of Price Dispersion *

Integer roots of quadratic and cubic polynomials with integer coefficients

Department of Economics

Non-Inferiority Tests for One Mean

Vilnius University. Faculty of Mathematics and Informatics. Gintautas Bareikis

Week 1: Introduction to Online Learning

Metric Spaces. Chapter Metrics

Properties of Future Lifetime Distributions and Estimation

The Heat Equation. Lectures INF2320 p. 1/88

Stationary random graphs on Z with prescribed iid degrees and finite mean connections

A study on the bi-aspect procedure with location and scale parameters

The Exponential Distribution

Chapter 4: Statistical Hypothesis Testing

Review of Fundamental Mathematics

Taylor and Maclaurin Series

CHAPTER 5 Round-off errors

Transcription:

Tests for exponentiality against the M and LM-classes of life distributions B. Klar Universität Karlsruhe Abstract This paper studies tests for exponentiality against the nonparametric classes M and LM of life distributions introduced by Klar and Müller 23). The test statistics are integrals of the difference between the empirical moment generating function of given data and the moment generating function of a fitted exponential distribution. We derive the limit distributions of the test statistics in case of a general underlying distribution and the local approximate Bahadur efficiency of the procedures against several parametric families of alternatives to exponentiality. The finite sample behavior of the tests is examined by means of a simulation study. Finally, the tests under discussion are applied to two data sets, and we discuss the applicability of the tests under random censorship. Key words: M-class, LM-class, moment generating function order, Exponential distribution, goodness-of-fit test, local approximate Bahadur efficiency. AMS 1991 Subject Classifications: Primary 62G1; Secondary: 62N5 1 Introduction Notions of positive aging play an important role in reliability theory, survival analysis and other fields. Therefore a multitude of classes of distributions describing aging have been introduced in the literature. Likewise, a large number of tests for exponentiality against special classes of life distributions have been proposed. The rationale for using Institut für Mathematische Stochastik, Universität Karlsruhe, Englerstr. 2, 76128 Karlsruhe, Germany. Email: Bernhard.Klar@math.uni-karlsruhe.de 1

such tests instead of omnibus tests for exponentiality is well-known: omnibus goodnessof-fit tests distribute their power over a rather small set of alternatives. Therefore, if one has some knowledge about the class of distributions which may occur, it is reasonable to use tests that are well adapted to detect the possible alternatives. On the other hand, it may be even more dangerous to overly restrict the set of possible alternatives. Hence, using a fairly large class of aging distributions seems to be a reasonable compromise. One of the more commonly used classes is the so called L-class. It was introduced by Klefsjö 1983) as a large class of distributions that contains most of previously known classes like IFR, DMRL, NBU, NBUE or HNBUE. Tests for exponentiality against the L-class have been proposed by Chaudhuri 1997), Henze and Klar 21), Basu and Mitra 22) and Klar 23). However, Klar 22) gives an example of a distribution belonging to class L having an infinite third moment and a hazard rate that tends to zero as time approaches infinity. This example leads to serious doubts whether the L-class should be considered as a reasonable notion of positive aging. Therefore, in Klar and Müller 23), a new so-called M-class of life distributions is presented. It is defined similarly as the L-class; only the ordering of the Laplace transforms is replaced by the ordering of the moment generating functions. The M- class also contains the above listed classes of life distributions. However, it does not have the property of the L-class mentioned above. In fact, if F is an absolutely continuous distribution with hazard rate r and mean µ belonging to the M-class, then Klar and Müller 23), Corollary 3.1). lim sup rt) 1 t µ Hence, if the data at hand come from an aging distribution, it seems worthwhile to consider tests for exponentiality against the M-class. The formal definition of this class is as follows. 1.1 Definition a) A non-negative random variable with mean µ = E > and distribution function F is said to be in the M-class if Ee t = Here, Mt, 1/µ) = 1 µt) 1 distribution with mean µ. b) is said to be in the L-class, if e tx F dx) 1 1 µt for all t < 1/µ. 1) is the moment generating function of an exponential Ee t = e tx F dx) 1 1 + µt for all t. 2

c) is said to be in the LM-class, if it is in the L-class and in the M-class. For the LM-class the inequality in 1) must hold for all < t < 1/µ. This paper proposes tests for exponentiality against the classes M and LM. procedures are based on the empirical moment generating function The M n t) = e t df n x) = 1 n e t i, i=1 of a random sample 1,..., n of size n from F, where F n x) = n 1 n 1{ j x} is the empirical distribution function. Let M F t) = E[e t ] denote the moment generating function of, and write F t, λ) = 1 e λt, t, for the distribution function of an exponential distribution with mean µ = 1/λ. In view of 1), it seems natural to base a test of against the alternative or H : F E = {F, λ), λ > } H 1 : F M and F / E H 2 : F LM and F / E on the empirical counterpart M n t) Mt, 1/ n ) of M F t) Mt, 1/µ). Here, n = 1/n n i=1 i denotes the sample mean. As a class of test statistics for a test against H 1 we propose T n,a ), where a/n T n,a = n Mn t) Mt, 1/ n ) ) dt, 2) and a, 1/2) is a positive constant. Since, for t [, 1/µ), M F t) Mt, 1/µ) is nonpositive for alternatives from the M-class, H is rejected for large negative values of T n,a. The statistic T n,a takes the form T n,a = 1 n expay j ) 1 Y j + log1 a), 3) where Y j = j / n, 1 j n. 3

A class of test statistics for a test against H 2 is T n,a ) <a<1/2, where T n,a = n a/n = 1 n Mn t) Mt, 1/ n ) ) dt 4) a/ n ) expay j ) exp ay j ) 1 a + log. Y j 1 + a 1.2 Remark If Y j is zero, then expay j ) 1)/Y j in 3) has to be replaced by its limiting value a. Similarly, expay j ) exp ay j ))/Y j has to be replaced by 2a. Note that it is possible that an M-class distribution has a point mass at zero. In contrast, one has > almost surely for distributions in class L or LM. A test statistic based on sup t M n t) Mt, 1/ n ) has been studied in Csörgő and Welsh 1989) as omnibus tests for exponentiality. However, there don t exist simple computational formulas for this and corresponding one sided statistics, contrary to the usual supremum statistics based on the empirical distribution function. For this reason, we prefer integral test statistics as defined above. The paper is organized as follows. In Section 2 we state the asymptotic behavior of the statistics T n,a and T n,a as a and derive their limit distributions in case of a general underlying distribution. A test for exponentiality rejecting H for large negative values of T n,a and T n,a is seen to be consistent against each fixed alternative from the class M and LM, respectively. Section 3 is devoted to the calculation of local approximate Bahadur efficiencies of the proposed tests of exponentiality with respect to some standard families of alternative distributions from the class LM. In Sections 4, we present the results of a simulation study. The tests are applied to two data sets in Section 5. In the last section, we discuss the applicability of the tests under random censorship. 2 The asymptotic distribution of the test statistics Our first result shows that both T n,a and T n,a, when suitably scaled, approach the same limit if the parameter a tends to. 2.1 Proposition For fixed n, we have T n 2Tn,a lim + 2 ) a a 3 3 ) Tn,a = lim + 2 a a 3 3 = 1 3n Y 2 j. 4

Proof: Tn,a lim a = lim a = 1 n Series expansions yield + 1 a 3 3 1 n ) Y 2 j 6. 1 = lim a n expay j ) 1 a 3 Y j + 1 a + Y j 2 2a + Y j 2 6 + Oa) The proof of the second assertion is similar. ) log1 a) + 1 a 3 3 ) 2 + a3 3 + Oa4 ) ) 1a 3 a + a2 + 1 3 ) Up to one-to-one transformations, T n coincides with Greenwood s statistic G n = 1/n 2 n Y j 2 Greenwood 1946)), with the sample coefficient of variation CV n = S n / n, where Sn 2 = n 1 n j n ) 2 denotes the sample variance, and with the first nonzero component of Neyman s smooth test of fit for exponentiality see, e.g., Koziol 1987)); it is asymptotically most powerful for testing H against the linear failure rate distribution Doksum and Yandell 1984)). However, since the exponential distribution is not characterized within the LM-class by the property that V ) = E) 2 Klar and Müller 23)), a test based on T n is not consistent against the M- or LM-class. It is well-known that n3t n /2 1) has a limiting unit normal distribution under H. The next theorem gives the asymptotic distribution of T n,a for < a < 1/2. Since the representation of T n,a in 3) shows that T n,a is scale-invariant, we assume µ = 1 in the following. For simplicity, we further assume P j > ) = 1. Otherwise, expressions like expax) 1)/x have to be replaced by expax) 1)/x I{x > }+a I{x = }. 2.2 Theorem Assume 1,..., n is a random sample of a nonnegative nondegenerate random variable with E expc) < for c < 1. Further, let < a < 1/2. Then, as n, n 1 n expay j ) 1 Y j expa) 1 E ) ) D N, σ 2 ), where σ 2 = E κ 1 1) + expa) 1 ) 2 µ 1 5) 5

and ) 1 a) expa) 1 κ 1 = E, µ 1 = E expa) 1 ). 6) Under H, we have n T n,a σ 2 = 1 2a) log1 2a) 2 Proof: where U n,1 = 1 n Notice that 1 n n D N, σ), 2 where 1 a + a 1 a expay j ) 1 Y j ) expaj ) 1 µ 1, U n,2 = 1 j n ) log1 a) 2log1 a)) 2 ) ) expa) 1 E = U n,1 + U n,2, a 2 1 a) 2. 7) expayj ) 1 expa ) j) 1. Y j j A Taylor expansion of the function gt) = expa j /t) 1)/ j /t) around t = 1 yields U n,2 = n n 1 ) 1 1 a j ) expa j ) 1 + o P 1) n j where = 1 n j 1) κ 1 + R n + o P 1), R n = n n 1 ) 1 n ) 1 a j ) expa j ) 1 κ 1. j Since R n P, we obtain U n,1 + U n,2 = 1 n ) expaj ) 1 µ 1 + j 1) κ 1 j + o P 1). By the Central limit theorem and Slutsky s lemma, U n,1 + U D n,2 N, σ 2 ), where σ 2 is given in 5). The formula 7) for σ 2 in case of H follows from straightforward calculations. As a consequence, the asymptotic distribution of Tn,a = n T n,a /σ is standard normal under the hypothesis of exponentiality. The next result states the asymptotic distribution of T n,a. The proof follows the reasoning given above and will thus be omitted. 6

2.3 Theorem Assume 1,..., n is a random sample of a nonnegative nondegenerate random variable with E expc) < for c < 1. Further, let < a < 1/2. Then, as n, where and n 1 n ) expay j ) exp ay j ) D µ 2 N, σ 2 ), Y j ) expa) exp a) µ 2 = E, ) 1 a) expa) 1 + a) exp a) κ 2 = E σ 2 = E ) 2 expa) exp a) κ 2 1) + µ 2. D Under H, we have n T n,a N, σ ), 2 where σ 2 4a = 1 a log 1 + a 2 1 a 2 log 1 + a ) 2 4a2 1 a 1 a 2 ) 2 +1 2a) log1 2a) + 1 + 2a) log1 + 2a). 8) We now consider the behavior of T n,a and T n,a, < a < 1/2, under fixed alternatives to H, with the aim of showing the consistency of the corresponding tests. To this end, suppose that the distribution of is from the M-class; in particular, has finite positive expectation µ. We then have Mt, 1/ n ) Mt, 1/µ) almost surely, and by Fatou s Lemma, it follows that lim inf n T n,a n µ a/µ Mt, 1/µ) E[expt)]) dt 9) almost surely. Since the behaviour of the moment generating function on any interval [, a], < a < 1/µ, completely specifies H, the right-hand side of 9) is positive if the distribution of comes from class M but is not exponential. Hence, a test that rejects H for small values of T n,a is consistent against any such alternative. The same reasoning yields the consistency of the test based on T n,a against alternatives from the LM-class. Note that the property of consistency of both tests continues to hold under the condition E < and Ee t = where t < a/µ, since, in this case, we have M n t) almost surely for t t. An example is the Lognormal distribution which is used in the simulation study in Section 4 as alternative distribution. 7

3 Local approximate Bahadur efficiency In this section, we investigate the efficiency of the tests for exponentiality based on T n,a and T n,a against several one-parametric families of distributions from the class LM. In each case, the distributions have a density function fx, ϑ), and the parameter space, denoted by Θ, is some subinterval of, ). Depending on the specific alternative family, the unit exponential distribution corresponds either to the parameter value ϑ = 1 or to the value ϑ =. As in Henze and Klar 21), our measure of efficiency is the local approximate Bahadur slope see, e.g., Nikitin 1995), Section 1.2). First, note that P ϑ T n,a t) = P ϑ nt n,a /σ t) Φt) as n, where Φ denotes the standard normal distribution function. Further, log 1 Φt)) t 2 /2, t. Since T n,a n P 1 { ) expa/µϑ)) 1 E ϑ σ /µϑ) )} expa) 1 E ϑ under P ϑ, where µϑ) = E ϑ [], the approximate Bahadur slope c, a) of the sequence T n,a) of test statistics is Nikitin 1995), p. 1) [ 1 c ϑ, a) = σ { ) expa/µϑ)) 1 E ϑ /µϑ) )}] 2 expa) 1 E ϑ. We now consider the local behavior of c ϑ, a) as ϑ ϑ, assuming the one-parametric family of alternative distributions to be sufficiently regular to allow a Taylor expansion of order two of c ϑ, a) with respect to ϑ. Moreover, differentiation of E ϑ [expa/µϑ)) 1)//µϑ))] may be done under the integral sign. These assumptions hold for each of the four families of distributions considered later in this section. After straightforward calculations, one obtains c ϑ, a) l F ϑ T n,a )) 2 ϑ ϑ ) 2 as ϑ ϑ, σ 2 where l Fϑ T n,a ) = expa) 1 ϑ fx, ϑ) dx µ ϑ ) ϑ=ϑ ) a + log1 a), 1 a and σ 2 is given in 7). Here, µ ϑ ) = µϑ) ϑ ϑ=ϑ. Bahadur efficiency of T n,a is given by Then, the local approximate e Fϑ T n,a ) = l F ϑ T n,a )) 2. σ 2 8

Efficiency.2.4.6.8 1. Efficiency.1.2.3.4.5.6..1.2.3.4.5 a..1.2.3.4.5 a Figure 1: Local approximate Bahadur efficiency of T n,a solid line) and T n,a dashed) against LFR left) and Makeham alternatives right) Similarly, we have e Fϑ T 2 n,a ) = lfϑ n,a)) T / σ 2, where expa) exp a) lfϑ T n,a ) = fx, ϑ) ϑ 2a µ ϑ ) 1 a log 1 + a ), 2 1 a ϑ=ϑ dx and σ 2 is given in 8). We have calculated e F T n,a ) and e Fϑ T n,a ) for linear failure rate LFR), Makeham, Weibull and gamma alternatives. All of them belong to the much narrower class of IFR distributions. The pertaining distribution functions are F 1) ϑ x) = 1 exp x + ϑx 2 /2 )) for x, ϑ, F 2) ϑ x) = 1 exp x + ϑ x + e x 1 ))) for x, ϑ, F 3) ϑ x) = 1 exp x ϑ) for x, ϑ >, x x) = Γϑ) 1 F 4) ϑ t ϑ 1 e t dt for x, ϑ >, respectively. For F 1) ϑ and F 2) ϑ, H corresponds to ϑ = ϑ =, and for F 3) ϑ and F 4) ϑ, we have ϑ = 1. Calculations give l F 1)T n,a ) = a 1 a a 2 + log1 a) 21 a) 2 9

and l F 1) T n,a ) = 2a 1 a 2 2a3 1 a 2 ) 2 log 1 + a 1 a for < a < 1/2. The efficiencies e F 1)T n,a ) and e F 1) T n,a ) are plotted in Figure 1 left). They have a maximum value at a = with e F 1) = 1. As noted above, the test based on T n is asymptotically most powerful for testing H against the linear failure rate distribution. Next, we obtain l F 2)T n,a ) = 2 log 2 a 2 3 a log1 a) 2 log 2 2 21 a) and l F 2) T n,a ) = 2 log 2 a 2 + a + 3 2 log 1 + a 1 a a 1 a 2 for < a < 1/2. e F 2) has a maximum value at a = with e F 2)T n,a ) = e F 2) T n,a ) = 1/16. Note that 1/12 is the efficiency of the asymptotically most powerful test of exponentiality against the Makeham distribution Doksum and Yandell 1984)). Figure 1 right) shows the local approximate Bahadur efficiencies of T n,a and T n,a against Makeham alternatives. For F 3) ϑ, closed form expressions for l F 3)T n,a) and l F 3) T n,a ) do not exist; we have and l F 3)T n,a ) = l F 3) T n,a ) = e ax 1 x e ax e ax x ) a 1 + 1 x) logx)) e x dx + 1 γ) + log1 a) 1 a 2a 1 + 1 x) logx)) e x dx + 1 γ) 1 a log 1 + a ) 2 1 a for < a < 1/2, where γ.577 is Euler s constant. The maximum value of e F 3) is 1 at a =. For the family of gamma alternatives, the efficiencies are and l F 4)T n,a ) = l F 4) T n,a ) = e ax 1 x log x + γ) e x dx a log1 a) 1 a e ax e ax log x + γ) e x dx 2a x 1 a + log 1 + a 2 1 a 1

Efficiency..2.4.6.8 1. Efficiency..5.1.15.2.25..1.2.3.4.5 a..1.2.3.4.5 a Figure 2: Local approximate Bahadur efficiency of T n,a solid line) and T n,a dashed) against Weibull left) and gamma alternatives right) for < a < 1/2. Again, e F 4) has a maximum value at ; here, e F 4)T n ) = e F 4) T n, ) = 1/4. Figure 2 displays the efficiencies of T n,a and T n,a against Weibull and Gamma alternatives for a, 1/2). Obviously, local approximate Bahadur efficiency heavily depends on the value of a. The shape of the curves in Figures 1 and 2 is always similar: for each of the four distributions, efficiency decreases if a tends to 1/2. However, for fixed positive value of a, the efficiency of T n,a is always above the efficiency of T n,a. The decrease of Bahadur efficiency with increasing a can be explained as follows. Since the tail behavior of a distribution is reflected on the behavior of its moment generating function around zero, choosing a small value of a renders the test statistics powerful against distributions with markedly different tail behaviour than those in E. This is the case for the IFR distribution used in this section. Hence, tests looking only at the mean/variance ratio as T n are well suited for these particular alternatives. 4 Simulations This section presents the results of two Monte Carlo studies. The first simulation study was conducted in order to obtain critical points of the statistics under discussion which is necessary due to the slow convergence of the finite sample distributions of the test statistics to their limit distribution. Tables 1 and 2 show the p-quantiles of Tn,a = nt n,a /σ and T n,a = n T n,a / σ 11

Tn,a T n,a a.5.1.2.3.4.45.5.1.2.3.4.45 n = 1-1.5 -.99 -.89 -.76 -.6 -.49-1.8-1.7-1. -.88 -.7 -.58 n = 2-1.18-1.13-1.2 -.88 -.7 -.59-1.21-1.19-1.13-1. -.81 -.68 n = 3-1.24-1.2-1.9 -.95 -.77 -.65-1.28-1.26-1.19-1.7 -.88 -.73 n = 5-1.32-1.28-1.17-1.4 -.85 -.72-1.35-1.34-1.27-1.15 -.95 -.81 n = 1-1.4-1.36-1.27-1.14 -.96 -.82-1.43-1.41-1.36-1.24-1.5 -.9 n = 2-1.46-1.44-1.36-1.24-1.5 -.91-1.49-1.48-1.42-1.32-1.14 -.99 n = 5-1.53-1.5-1.44-1.35-1.17-1.2-1.55-1.54-1.51-1.41-1.25-1.9 n = 1-1.57-1.55-1.5-1.41-1.25-1.1-1.58-1.57-1.53-1.47-1.31-1.17 Table 1: Empirical 5%-quantiles of Tn,a and T n,a based on 1 replications Tn,a T n,a a.5.1.2.3.4.45.5.1.2.3.4.45 n = 1 -.95 -.9 -.89 -.7 -.55 -.46 -.97 -.96 -.9 -.8 -.64 -.53 n = 2-1.4-1. -.91 -.79 -.64 -.54-1.6-1.5 -.99 -.89 -.73 -.61 n = 3-1.8-1.4 -.96 -.85 -.69 -.58-1.11-1.9-1.4 -.94 -.78 -.66 n = 5-1.13-1.1-1.2 -.91 -.75 -.64-1.15-1.14-1.9 -.99 -.83 -.71 n = 1-1.18-1.16-1.8 -.98 -.83 -.71-1.2-1.19-1.14-1.6 -.91 -.78 n = 2-1.21-1.2-1.14-1.5 -.9 -.78-1.23-1.22-1.18-1.11 -.97 -.84 n = 5-1.25-1.23-1.18-1.12 -.99 -.87-1.25-1.25-1.23-1.16-1.4 -.92 n = 1-1.26-1.25-1.22-1.16-1.4 -.93-1.26-1.26-1.24-1.2-1.9 -.97 Table 2: Empirical 1%-quantiles of Tn,a and T n,a based on 1 replications under exponentiality for several sample sizes and p =.5 and.1, respectively. The parameter a was chosen to be a =.5,.1,.2,.3,.4 and.45. The entries in Tables 1 and 2 are based on 1 replications; here, we always used µ = 1. The speed of convergence to the asymptotic values is generally quite low, and it differs for different values of a. The asymptotic quantiles should not be used for a test based on Tn,a or T n,a even for the sample size n = 1. The finite sample quantiles are not symmetric around. A second simulation study has been conducted to examine the dependence of the power of the tests on the exponent. As alternative distributions from the LM-class, we used the Weibull, Gamma and Linear failure rate distribution with scale parameter 1 and shape 12

parameter ϑ, denoted by W ϑ), Γϑ) and LF Rϑ), respectively. A further alternative is the inverse Gaussian distribution IGµ, λ) with mean µ = 1, denoted by IGλ) which belongs to the L-class for λ 1; it belongs to the M-class and, hence, to the LM-class) if λ 2. In addition, we included the Birnbaum-Saunders distribution BSγ, δ) with δ = γ 2 /2 + 1) 1 and, hence, a mean value of one), denoted by BSγ). A simple method of generating BSγ, δ) distributed random variables is to generate a standard normal variate, Z, and then use the formula = δ 1 + γ 2 Z 2 /2 + γz γ 2 Z 2 /4 + 1 ) ) 1/2 Rieck 23)). The Birnbaum-Saunders distribution BSγ) belongs to the L-class for γ 1; it is a member of the class M if γ 2/3 Klar and Müller 23)). We finally considered the uniform distribution U, 1) which is symmetric and, hence, in the M-class Corollary 3.2 in Klar and Müller 23)) and the Lognormal distribution LN, τ 2 ). The latter is not in the M-class, but it belongs to the L-class if τ 2 < log 2 Klar 22)). By the remark at the end of Section 2, the tests are consistent against this alternative. Table 3 shows power estimates of the tests based on Tn,a and T n,a for n = 3. All entries are the percentages of 1 Monte Carlo samples that resulted in rejection of H, rounded to the nearest integer. The nominal level of the test is α =.5. The first three lines of Table 4 show three examples of power estimates of contiguous alternatives for n = 5. The results for the binomial distribution Bin1, 1/2) are given in the last line of Table 4. This distribution is symmetric and, hence, in the M-class. However, since V ) = E) 2 = 1/4, a test based on Tn or on Tn,a for small values of a can not detect this alternative. The values in Table 4 are computed as in Table 3. The main conclusions that can be drawn from the simulation results are the following: 1. The tests based on Tn,a and T n,a behave fairly similar, whereby the power of the tests depends only to a small extent on a. An exception is the binomial distribution. 2. For all distributions used in the simulation apart from the binomial distribution, power decreases for increasing values of a. 3. Despite the strong decrease of local approximate Bahadur efficiency for W ϑ), Γϑ) and LF Rϑ) as a tends to 1/2, the results in Table 4 indicate that the actual loss of power is small, even for n = 5. 13

Tn,a T n,a a.5.1.2.3.4.45.5.1.2.3.4.45 Exp1) 5 5 5 5 5 5 5 5 5 5 5 5 W 1.3) 44 45 44 43 43 42 45 44 45 44 43 45 W 1.5) 78 77 77 77 76 75 78 79 79 78 77 78 W 1.7) 95 95 95 95 94 93 96 95 95 95 94 95 Γ1.5) 34 35 33 33 33 31 34 35 34 34 33 34 Γ2.) 7 69 69 68 67 64 7 71 7 69 69 69 Γ3.) 97 97 97 96 96 95 97 97 97 97 96 97 LF R.5) 26 25 25 26 25 25 25 26 25 25 25 26 LF R2.) 61 6 6 61 59 58 6 6 6 6 59 6 LF R5.) 81 81 81 81 8 79 81 82 81 81 8 82 IG1.2) 26 26 25 24 23 22 27 26 26 25 25 25 IG1.5) 46 44 43 42 41 39 46 45 45 45 43 44 IG2.) 73 71 7 69 67 66 72 73 73 72 7 7 IG3.) 95 95 94 93 92 92 96 96 95 95 94 95 BS1.2) 1 1 1 1 1 1 1 2 2 2 1 2 BS1) 1 8 9 8 9 8 9 9 9 9 9 9 BS 2/3) 39 38 37 37 34 32 38 38 37 37 36 37 BS.7) 7 69 67 66 65 63 7 71 7 69 67 69 U, 1) 98 98 98 98 98 98 97 97 98 98 98 98 LN,.6) 35 34 33 32 31 3 36 36 35 34 33 34 LN,.4) 76 74 74 72 7 69 76 76 75 74 73 74 Table 3: Empirical power of the tests based on Tn,a and T n,a, α =.5, n = 3, 1 replications Tn,a T n,a a.5.1.2.3.4.45.5.1.2.3.4.45 W 1.1) 7.9 69.7 67.9 65.1 64.1 61.3 71.9 71.1 7.7 68.4 66.2 65.1 Γ1.2) 67.4 66.3 65.4 6.7 58.7 56.8 69. 68.5 66.7 65.4 61.2 59.8 LF R.2) 87.4 87.6 87.3 85.8 86.4 85.7 87.2 86.6 86.8 86.8 86.2 85.7 Bin, 1) 6.8 11. 25.4 4.9 62.2 72.2 5.3 6. 7.1 13.1 21.8 31. Table 4: Empirical power of the tests based on Tn,a and T n,a, α =.5, n = 5, 1 replications 14

M nt)..5 1. 1.5 2. 2.5 3. M nt) 1 2 3 4.2.15.1.5..5.1.15 t.2.15.1.5..5.1 t Figure 3: Left: Empirical moment generating function of data set 1 solid line) and moment generating function under exponentiality dashed). Right: The same plots for data set 2. 4. In the majority of cases, the power of T n,a in Table 4 is higher than the power of T n,a for a fixed value of a which coincides with the results of Section 3. 5 Examples We applied the tests under discussion to two data sets. The first example was also considered by Pavur et al. 1992). The results recorded in the following table are the number of revolutions in millions) to failure of n = 23 ball bearings in a life test study. 17.88 28.92 33. 41.52 42.12 45.6 48.48 51.84 51.96 54.12 55.56 67.8 68.64 68.64 69.88 84.12 93.12 98.64 15.12 15.84 127.92 128.4 173.4 Mean value and standard deviation of the data are 71.8 and 38.2, which results in a coefficient of variation of.53. The empirical moment generating function lies below the moment generating function under exponentiality, given by 1 71.8 t) 1 see Figure 3, left). Therefore, it is plausible to assume that the distribution underlying the data belongs to the LM-class. The second data set consists of n = 16 intervals in operating hours between successive failures of airconditioning equipment in a Boeing 72 aircraft see Edgeman et al. 1988)). 15

12 29 14 57 54 32 67 59 134 152 27 14 23 66 61 34 The mean value of 82. and the standard deviation of 66.3 yield a coefficient of variation of.81. The plot on the right-hand side of Figure 3 looks similar as the plot on the left; again, it is plausible to assume that the underlying distribution belongs to the LM-class. To assess wether the data could come from exponential distributions, we applied the tests based on Tn,a and T n,a for several values of a to the two data sets. The values of the test statistics are given in Table 5. Comparing the values for the first data set with the 5%-quantiles for n = 3 in Table 1, we can see that all tests reject the hypotheses of exponentiality at the level.5. However, the values for the second data set are always larger than the 1%-quantiles for n = 1 in Table 2. Hence, the hypotheses of exponentiality is not rejected at the 1%-level. Therefore, we can retain the exponential model for the second data set, whereas it is necessary to look for other parametric distributions from class LM for a better description of the data in the first example. 6 The test statistics under random censorship In this section, we discuss how the test statistics for testing for exponentiality against H 1 have to be modified in the case of randomly censored data. Tests for exponentiality against HNBUE alternatives under random censorship have been proposed by Aly 1992) and Hendi, Al-Nachawati, Montasser, and Alwasel 1998). Let 1, 2,... be a sequence of nonnegative independent lifetimes with d.f. F. Along with the -sequence, let Y 1, Y 2,... be a sequence of independent censoring random variables with d.f. G also being independent of the s. We assume that F and G are continuous. We observe the censored lifetimes Z i = min i, Y i ) together with δ i = Tn,a T n,a a.1.2.3.4.1.2.3.4 Data set 1 n = 23) -1.59-1.41-1.19 -.93-1.71-1.6-1.4-1.11 Data set 2 n = 16) -.73 -.67 -.59 -.48 -.76 -.72 -.66 -.54 Table 5: Values of T n,a and T n,a for the two data sets 16

1 {i Y i } indicating the cause of death. The nonparametric maximum likelihood estimate of F is given by the Kaplan-Meier product-limit estimator ˆF n defined by 1 ˆF n t) = n i=1 1 δ ) 1{Zi:n t} [i:n]. n i + 1 Here Z 1:n... Z n:n are the ordered Z-values, and δ [i:n] is the concomitant of the i-th order statistic, that is, δ [i:n] = δ j if Z i:n = Z j. If ϕ is integrable, one has the representation where for 1 i n, ϕx)d ˆF n x) = W in ϕz i:n ), i=1 W in = δ [i:n] i 1 ) δ[j:n] n j. n i + 1 n j + 1 In particular, ˆM n t) = e tx d ˆF n x) = W in e Z i:n i=1 is a plausible estimate of the moment generating function Mt) under random censorship. Using the results of Stute and Wang 1993), it follows that ˆMn t) {x<τ H } etx d ˆF n x) a.s. where τ H = inf{x : Hx) = 1} and H is the d.f. of the Z s, i.e. 1 H) = 1 F )1 G). Consider first the special case of Type 1 censoring, i.e. Y i = t, 1 i n. Then, ˆM n t) is in general not a consistent estimator of Mt) since it is impossible to observe the right tail of F. Hence, one could only test if the observable part of F corresponds to an exponential distribution; the moment generating function and the M-class are not suitable for this purpose. We assume therefore τ H = in the following. If F t) = F t, λ) is the exponential distribution, the maximum likelihood estimator of λ under random censorship is given by see, e.g., Kalbfleisch and Prentice 198), Section 3) ˆλ n = n i=1 δ i n i=1 Z. i 17

λ 1 p n = 25 n = 1 n = 4 n = 16 n = 64 n = 256.5-2.62-2.56-2.41-2.2-1.99-1.87.1-2.5-1.99-1.86-1.67-1.56-1.49.25 -.34 -.5 -.69 -.86 -.9 -.9 5.5 -.17 -.23 -.28 -.3 -.29 -.25.75 -.3.4.12.21.31.4.9.15.34.54.73.9 1..95.3.55.83 1.9 1.26 1.37.5 -.77-1.12-1.47-1.59-1.62-1.65.1 -.56 -.84-1.11-1.26-1.3-1.31.25 -.39 -.56 -.68 -.73 -.76 -.74 25.5 -.2 -.22 -.2 -.15 -.12 -.7.75.3.19.35.5.55.61.9.32.67 1. 1.14 1.2 1.24.95.58 1.3 1.38 1.58 1.63 1.63.5 -.88-1.18-1.42-1.49-1.57-1.59.1 -.76 -.99-1.16-1.21-1.25-1.23.25 -.56 -.67 -.73 -.71 -.74 -.67 1.5 -.3 -.25 -.2 -.11 -.8 -.4.75.8.28.43.56.59.65.9.57.94 1.11 1.27 1.25 1.32.95.98 1.39 1.57 1.69 1.71 1.71 Table 6: Empirical p-quantiles of n ˆTn,.25 /ˆσ n based on 1 replications, where ˆσ 2 n is the empirical variance of ˆT n,.25 Hence, a suitable modification of the test statistics T n,a for testing for exponentiality against H 1 is aˆλn ˆT n,a = 1/ˆλ n ˆMn t) Mt, ˆλ n )) dt = W in e au i:n 1 U i:n + log1 a), where a, 1/2) and U i:n = ˆλ n Z i:n, 1 j n. Table 6 shows the p-quantiles of n ˆTn,.25 /ˆσ n, where ˆσ n 2 is the empirical variance of ˆT n,.25, for increasing sample sizes. The distributions of 1 and Y 1 are exponential with parameter λ 1 and 1, respectively; hence, the censored portion of the data is 1/λ 1 + 1). The entries in Tables 6 are based on 1 replications. 18

Looking at the critical values, they seem to converge to the quantiles of a standard normal distribution. This is not unexpected in view of the central limit theorem under random censorship Stute 1995)). However, the limiting variance does not only depend on a as in the case without censoring, but also on the d.f. s F and G. Furthermore, convergence is very slow, depending on the degree of censoring. To perform the test in practice, estimating the unknown variance of ˆT n,a, presumably by some resampling procedure, would be necessary. Acknowledgements The author would like to thank the referees for their constructive comments. References Aly, E. 1992). On testing exponentiality against HNBUE alternatives. Statistics and Decisions, 1, 239 25. Basu, S., & Mitra, M. 22). Testing exponentiality against Laplace order dominance. Statistics, 36, 223 229. Chaudhuri, G. 1997). Testing exponentiality against L-distributions. J. Statist. Plann. Inf., 64, 249 255. Csörgő, S., & Welsh, A. 1989). Testing for exponential and Marshall-Olkin distributions. J. Statist. Plann. Inf., 23, 287 3. Doksum, K., & Yandell, B. 1984). Tests for exponentiality. Elsevier Science Publishers. Edgeman, R., Scott, R., & Pavur, R. 1988). A modified Kolmogorov-Smirnov test for the inverse Gaussian density with unknown parameters. Commun. Statist. Simula., 17, 123 1212. Greenwood, M. 1946). The statistical study of infectious diseases. J. Roy. Statist. Soc. Ser. A, 19, 85 11. Hendi, M., Al-Nachawati, H., Montasser, M., & Alwasel, I. 1998). An exact test for HNBUE class of life distributions. J. Statist. Comput. Simul., 6, 261 275. Henze, N., & Klar, B. 21). Testing exponentiality against the L-class of life distributions. Mathematical Methods of Statistics, 1, 232 246. 19

Kalbfleisch, J., & Prentice, R. 198). The statistical analysis of failure time data. Wiley. Klar, B. 22). A note on the L-class of life distributions. J. Appl. Prob., 39, 11 19. Klar, B. 23). On a test for exponentiality against Laplace order dominance. Statistics. to appear) Klar, B., & Müller, A. 23). Characterizations of classes of lifetime distributions generalizing the NBUE class. J. Appl. Prob., 4, 2-32. Klefsjö, B. 1983). A useful ageing property based on the Laplace transform. J. Appl. Prob., 2, 615-626. Koziol, J. 1987). An alternative formulation of Neyman s smooth goodness of fit test under composite alternatives. Metrika, 34, 17 24. Nikitin, Y. 1995). Asymptotic efficiency of nonparametric tests. Cambridge University Press. Pavur, R., Edgeman, R., & Scott, R. 1992). Quadratic statistics for the goodness-of-fit test of the inverse Gaussian distribution. IEEE Trans. Reliab., 41, 118 123. Rieck, J. 23). A comparison of two random number generators for the Birnbaum- Saunders distribution. Commun. Statist. Th. Meth., 32, 929 934. Stute. 1995). The central limit theorem under random censorship. Ann. Stat., 23, 422 439. Stute, W., & Wang, J.-L. 1993). The strong law under random censorship. Ann. Stat., 21, 1591 167. 2