Interval estimation for bivariate t-copulas via. Kendall s tau

Similar documents
Statistics and Probability Letters. Goodness-of-fit test for tail copulas modeled by elliptical copulas

Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model

Least-Squares Intersection of Lines

Multivariate Normal Distribution

Credit Risk Models: An Overview

Introduction to General and Generalized Linear Models

Modelling the dependence structure of financial assets: A survey of four copulas

Actuarial and Financial Mathematics Conference Interplay between finance and insurance

MATHEMATICAL METHODS OF STATISTICS

Non Linear Dependence Structures: a Copula Opinion Approach in Portfolio Optimization

Modelling the Dependence Structure of MUR/USD and MUR/INR Exchange Rates using Copula

Uncertainty quantification for the family-wise error rate in multivariate copula models

An Internal Model for Operational Risk Computation

Copula model estimation and test of inventory portfolio pledge rate

Systems of Linear Equations

APPLYING COPULA FUNCTION TO RISK MANAGEMENT. Claudio Romano *

Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics

Pricing of a worst of option using a Copula method M AXIME MALGRAT

Lecture 8: Gamma regression

171:290 Model Selection Lecture II: The Akaike Information Criterion

Basics of Statistical Machine Learning

Least Squares Estimation

Sections 2.11 and 5.8

Spatial Statistics Chapter 3 Basics of areal data and areal data modeling

Simple Linear Regression Inference

Statistics Graduate Courses

GENERATING SIMULATION INPUT WITH APPROXIMATE COPULAS

Similarity and Diagonalization. Similar Matrices

Life Table Analysis using Weighted Survey Data

Safety margins for unsystematic biometric risk in life and health insurance

Continued Fractions and the Euclidean Algorithm

Survival Analysis of Left Truncated Income Protection Insurance Data. [March 29, 2012]

Statistical Machine Learning

Convex Hull Probability Depth: first results

MATRIX ALGEBRA AND SYSTEMS OF EQUATIONS. + + x 2. x n. a 11 a 12 a 1n b 1 a 21 a 22 a 2n b 2 a 31 a 32 a 3n b 3. a m1 a m2 a mn b m

Statistics in Retail Finance. Chapter 6: Behavioural models

An introduction to Value-at-Risk Learning Curve September 2003

Modeling Operational Risk: Estimation and Effects of Dependencies

Monte Carlo Simulation

Introduction to Matrix Algebra

1 Homework 1. [p 0 q i+j p i 1 q j+1 ] + [p i q j ] + [p i+1 q j p i+j q 0 ]

A Composite Likelihood Approach to Analysis of Survey Data with Sampling Weights Incorporated under Two-Level Models

NOTES ON LINEAR TRANSFORMATIONS

ESTIMATING IBNR CLAIM RESERVES FOR GENERAL INSURANCE USING ARCHIMEDEAN COPULAS

We are often interested in the relationship between two variables. Do people with more years of full-time education earn higher salaries?

Chapter 6: Multivariate Cointegration Analysis

ON MODELING INSURANCE CLAIMS

Copula Simulation in Portfolio Allocation Decisions

arxiv: v1 [q-fin.rm] 3 Dec 2014

A linear algebraic method for pricing temporary life annuities

MATRIX ALGEBRA AND SYSTEMS OF EQUATIONS

CONDITIONAL, PARTIAL AND RANK CORRELATION FOR THE ELLIPTICAL COPULA; DEPENDENCE MODELLING IN UNCERTAINTY ANALYSIS

Basic Statistics and Data Analysis for Health Researchers from Foreign Countries

Lecture 3: Linear methods for classification

Regression III: Advanced Methods

Tail-Dependence an Essential Factor for Correctly Measuring the Benefits of Diversification

A Primer on Mathematical Statistics and Univariate Distributions; The Normal Distribution; The GLM with the Normal Distribution

A SURVEY ON CONTINUOUS ELLIPTICAL VECTOR DISTRIBUTIONS

An extension of the factoring likelihood approach for non-monotone missing data

Using Excel for inferential statistics

Some probability and statistics

Name: Section Registered In:

LOGISTIC REGRESSION. Nitin R Patel. where the dependent variable, y, is binary (for convenience we often code these values as

COMPLETE MARKETS DO NOT ALLOW FREE CASH FLOW STREAMS

The Dangers of Using Correlation to Measure Dependence

Assistant Professor of Actuarial Science, University of Waterloo. Ph.D. Mathematics, Georgia Institute of Technology. Advisor: Liang Peng 2012.

MATH10212 Linear Algebra. Systems of Linear Equations. Definition. An n-dimensional vector is a row or a column of n numbers (or letters): a 1.

Managing uncertainty in call centers using Poisson mixtures

Principle of Data Reduction

A Model of Optimum Tariff in Vehicle Fleet Insurance

The VAR models discussed so fare are appropriate for modeling I(0) data, like asset returns or growth rates of macroeconomic time series.

Numerical Methods for Option Pricing

THE FUNDAMENTAL THEOREM OF ALGEBRA VIA PROPER MAPS

Metric Spaces. Chapter Metrics

4. Continuous Random Variables, the Pareto and Normal Distributions

Hierarchical Insurance Claims Modeling

FEGYVERNEKI SÁNDOR, PROBABILITY THEORY AND MATHEmATICAL

Autocovariance and Autocorrelation

Factorization Theorems

1 Determinants and the Solvability of Linear Systems

INDIRECT INFERENCE (prepared for: The New Palgrave Dictionary of Economics, Second Edition)

T ( a i x i ) = a i T (x i ).

Logistic Regression. Jia Li. Department of Statistics The Pennsylvania State University. Logistic Regression

Quantitative Operational Risk Management

Marshall-Olkin distributions and portfolio credit risk

Non Parametric Inference

Asymptotics for ruin probabilities in a discrete-time risk model with dependent financial and insurance risks

Sensitivity Analysis 3.1 AN EXAMPLE FOR ANALYSIS

Chapter 1 Introduction. 1.1 Introduction

From the help desk: Bootstrapped standard errors

CONTINUED FRACTIONS AND PELL S EQUATION. Contents 1. Continued Fractions 1 2. Solution to Pell s Equation 9 References 12

Monte Carlo-based statistical methods (MASM11/FMS091)

A Coefficient of Variation for Skewed and Heavy-Tailed Insurance Losses. Michael R. Powers[ 1 ] Temple University and Tsinghua University

A comparison of Value at Risk methods for measurement of the financial risk 1

VERTICES OF GIVEN DEGREE IN SERIES-PARALLEL GRAPHS

Java Modules for Time Series Analysis

Goodness of fit assessment of item response theory models

Numerical Analysis Lecture Notes

Understanding the Impact of Weights Constraints in Portfolio Theory

Chapter 4: Vector Autoregressive Models

Transcription:

Interval estimation for bivariate t-copulas via Kendall s tau Liang Peng and Ruodu Wang February 2, 24 Abstract Copula models have been popular in risk management. Due to the properties of asymptotic dependence and easy simulation, the t-copula has often been employed in practice. A computationally simple estimation procedure for the t-copula is to first estimate the linear correlation via Kendall s tau estimator and then to estimate the parameter of the number of degrees of freedom by maximizing the pseudo likelihood function. In this paper, we derive the asymptotic limit of this two-step estimator which results in a complicated asymptotic covariance matrix. Further, we propose jackknife empirical likelihood methods to construct confidence intervals/regions for the parameters and the tail dependence coefficient without estimating any additional quantities. A simulation study shows that the proposed methods perform well in finite sample. Key-words: Jackknife empirical likelihood, Kendall s tau, t-copula School of Mathematics, Georgia Institute of Technology, Atlanta, GA 3332-6, USA. Email address: peng@math.gatech.edu Department of Statistics and Actuarial Science, University of Waterloo, Waterloo, ON N2L 3G, Canada. Email address: ruodu.wang@uwaterloo.ca

Introduction For a random vector (X, Y ) with continuous marginal distributions F and F 2, its copula is defined as C(x, y) = P (F (X) x, F 2 (Y ) y) for x, y. Due to its invariant property with respect to marginals, requirements in Basel III for banks and Solvency 2 for insurance companies enhance the popularity of copula models in risk management. In practice the family of elliptical copulas is arguably the most commonly employed class because it is quite easy to simulate and is able to specify different levels of correlations. Peng (28) used elliptical copulas to predict a rare event, and Landsman (2) used elliptical copulas for capital allocation. Two important classes of elliptical copulas are Gaussian copula and t-copula. It is known that financial time series usually exhibits tail dependence, but Gaussian copula has an asymptotically independent tail while t-copula has an asymptotically dependent tail. Breymann, Dias and Embrechts (23) and Mashal, Naldi and Zeevi (23) showed that empirical fit of the t-copula is better than the Gaussian copula. Some recent applications and generalization of t-copula include: Schloegl and O Kane (25) provided formulas for the portfolio loss distribution when t-copula is employed; de Melo and Mendes (29) priced the options related with retirement funds by using the Gaussian and t copulas; Chan and Kroese (2) used t-copula to model and estimate the probability of a large portfolio loss; Manner and Segers (2) studied the tails of correlation mixtures of the Gaussian and t copulas; grouped t-copula were given in Chapter 5 of McNeil, Frey and Embrechts (25); Luo and Shevchenko (2) and Venter et al. (27) extended the grouped t-copula; tail dependence for multivariate t-copula and its monotonicity were studied by Chan and Li (28). 2

The t-copula is an elliptical copula defined as C(u, v; ρ, ν) = t ν (u) t ν (v) 2π( ρ 2 ) /2 { + x2 2ρxy + y 2 ν( ρ 2 } (ν+2)/2 dydx, () ) where ν > is the number of degrees of freedom, ρ [, ] is the linear correlation coefficient, t ν is the distribution function of a t-distribution with ν degrees of freedom and t ν denotes the generalized inverse function of t ν. When ν =, the t-copula is also called the Cauchy copula. In order to fit the t-copula to a random sample (X, Y ),, (X n, Y n ), one has to estimate the unknown parameters ρ and ν first. A popular estimation procedure for fitting a parametric copula is the pseudo maximum likelihood estimate proposed by Genest, Ghoudi and Rivest (995). Although, generally speaking, the pseudo MLE is efficient, its computation becomes a serious issue when applying to t-copulas especially with a large dimension. A more practical method to estimate ρ is through the Kendall s tau, defined as τ = E(sign((X X 2 )(Y Y 2 ))) = 4 It is known that τ and ρ have a simple relationship C(u, u 2 ) dc(u, u 2 ). ρ = sin(πτ/2). By noting this relationship, Lindskog, McNeil and Schmock (23) proposed to first estimate ρ by ˆρ = sin(πˆτ/2), where ˆτ = 2 n(n ) i<j n sign((x i X j )(Y i Y j )), and then to estimate ν by maximizing the pseudo likelihood function where c(u, v; ρ, ν) = n c(f n (X i ), F n2 (Y i ); ˆρ, ν), 2 u v C(u, v; ρ, ν) is the density of the t-copula defined in (), F n (x) = (n + ) I(X i x) and F n2 (y) = (n + ) I(Y i y) are marginal 3

empirical distributions. In other words, the estimator ˆν is defined as a solution to the score equation where l(ρ, ν; u, v) = l(ˆρ, ν; F n (X i ), F n2 (Y i )) =, (2) ν log c(u, v; ρ, ν). ˆτ is called the Kendall s tau estimator. The asymptotic results of the pseudo MLE for the t-copula are known in Genest, Ghoudi and Rivest (995). A recent attempt to derive the asymptotic distribution for the twostep estimator (ˆρ, ˆν) is given by Fantazzini (2), who employed the techniques for estimating equations. Unfortunately the derived asymptotic distribution in Fantazzini (2) is not correct since the Kendall s tau estimator is a U-statistic rather than an average of independent observations. Numeric comparisons for the two estimation procedures are given in Dakovic and Czado (2). Explicit formulas for the partial derivatives of log c(u, v; ρ, ν) can be found in Dakovic and Czado (2) and Wang, Peng and Yang (23). In this paper, we first derive the asymptotic distribution of the two-step estimator (ˆρ, ˆν) by using techniques for U-statistics. We remark that one may also derive the same asymptotic results by combining results in Genest, Ghoudi and Rivest (995) and Barbe and Genest (996). But it ends up with checking the (rather-complicated) regularity conditions in Barbe and Genest (996). It is known that interval estimation is an important way of quantifying the estimation uncertainty and is directly related to hypothesis tests. Efficient interval estimation remains a necessary part of estimation procedure in fitting a parametric family to data. As showed in Section 2, the asymptotic covariance matrix for the proposed two-step estimators is very complicated, and hence some ad hoc procedures such as bootstrap method are needed for constructing confidence intervals/regions for parameters and some related quantities. However, it is known that naive bootstrap method performs badly in general; see the simulation study in Section 3. In order to avoid estimating the compli- 4

cated asymptotic covariance matrix, we further investigate the possibility of applying an empirical likelihood method to construct confidence intervals/regions as empirical likelihood method has been demonstrated to be an efficient way in interval estimation and hypothesis test. See Owen (2) for an overview on the empirical likelihood method. Since Kendall s tau estimator is a nonlinear functional, a direct application of the empirical likelihood method fails to have a chi-square limit in general, i.e., Wilks Theorem does not hold. In this paper we propose to employ the jackknife empirical likelihood method in Jing, Yuan and Zhou (29) to construct a confidence interval for ν without estimating any additional quantities. We also propose a jackknife empirical likelihood method to construct a confidence region for (ρ, ν), and a profile jackknife empirical likelihood method to construct a confidence interval for the tail dependence coefficient of the t-copula. We organize the paper as follows. Methodologies are given in Section 2. Section 3 presents a simulation study to show the advantage of the proposed jackknife empirical likelihood method for constructing a confidence interval for ν. Data analysis is given in Section 4. All proofs are delayed till Section 5. 2 Methodologies and Main Results 2. The asymptotic distribution of ˆρ, ˆν and ˆλ As mentioned in the introduction, the asymptotic distribution for the two-step estimator (ˆρ, ˆν) only appears in Fantazzini (2), who unfortunately neglected the fact that Kendall s tau estimator is a U-statistic rather than an average of independent observations. Here we first derive the joint asymptotic limit of (ˆρ, ˆν) as follows. 5

Theorem. As n, we have and n{ˆρ ρ} = cos( πτ 2 ) π n 4{C(F (X i ), F 2 (Y i )) EC(F (X ), F 2 (Y ))} cos( πτ 2 ) π n 2{F (X i ) + F 2 (Y i ) } + o p () (3) n{ˆν ν} where = K ν { n l(ρ, ν; F (X i ), F 2 (Y i )) + K ρ n(ˆρ ρ) + n n l u(ρ, ν; u, v){i(f (X i ) u) u}c(u, v) dudv n n l v(ρ, ν; u, v){i(f 2 (Y i ) v) v}c(u, v) dudv} + o p (), + l u (ρ, ν; u, v) = l(ρ, ν; u, v), u l v(ρ, ν; u, v) = l(ρ, ν; u, v), v (4) and ( ) K a = E a l(ρ, ν; F (X ), F 2 (Y )) = l(ρ, ν; u, v) dc(u, v), a = ν, ρ. a Using the above theorem, we can easily obtain that n(ˆρ ρ, ˆν ν) T d N (, ) T σ 2 σ 2,, (5) σ 2 σ2 2 where σ 2, σ 2 and σ 2 2 are constants whose values are given in the proof of Theorem in Section 5. Another important quantity related with t-copula is the tail dependence coefficient (ν+)( ρ) λ = 2t ν+ ( +ρ ), which plays an important role in studying the extreme comovement among financial data sets. A natural estimator for λ based on the above two-step estimator is ˆλ = 2tˆν+ ( (ˆν + )( ˆρ)/ + ˆρ), and the asymptotic distribution of ˆλ immediately follows from (5) as given in the following theorem. 6

Theorem 2. As n, we have n{ˆλ λ} d N(, σ 2 ), where σ 2 = ( ) λ 2 σ 2 + ρ ( ) λ 2 σ 2 ν 2 + 2 λ ρ λ ν σ 2, and λ = λ(ρ, ν) = 2t ν+ ( (ν + )( ρ)/ + ρ). Using the above theorems, one can construct confidence intervals/regions for ν, ρ, λ by estimating the complicated asymptotic variance/covariance (see the values of σ 2, σ 2 and σ 2 2 in Section 5 for instance). While estimators for σ2, σ 2, σ 2 2 can be obtained by replacing ρ and ν in those involved integrals by ˆρ and ˆν, respectively, evaluating those integrals remains computationally non-trivial. Hence we seek ways of constructing confidence intervals/regions without estimating the asymptotic variances. A commonly used way is to employ bootstrap method. However, it is known that naive bootstrap method performs badly in general; see the simulation results given in Section 3 below. An alternative way is to employ empirical likelihood method, which does not need to estimate any additional quantities either. Due to the fact that Kendall s tau estimator is non-linear, a direct application of empirical likelihood method can not ensure that Wilks Theorem holds. Here we investigate the possibility of employing the jackknife empirical likelihood method. By noting that ˆρ is a U-statistic, one can directly employ the jackknife empirical likelihood method in Jing, Yuan and Zhou (29) to construct a confidence interval for ρ without estimating the asymptotic variance σ 2 of ˆρ. Therefore in the next we focus on constructing confidence intervals/regions for ν, (ρ, ν) and the tail dependence coefficient λ = 2t ν+ ( (ν + )( ρ)/ + ρ). 7

2.2 Interval estimation for ν In order to construct a jackknife sample as in Jing, Yuan and Zhou (29), we first define for i =,, n ˆρ i = sin(πˆτ i /2), ˆτ i = 2 (n )(n 2) F n,i (x) = I(X j x), n j i and then define the jackknife sample as Z i (ν) = j<l n,j i,l i sign((x j X l )(Y j Y l )), F n2,i (y) = I(Y j y), n l(ˆρ, ν; F n (X j ), F n2 (Y j )) l(ˆρ i, ν; F n,i (X j ), F n2,i (Y j )) j i j= j i for i =,, n. Based on this jackknife sample, the jackknife empirical likelihood function for ν is defined as n L (ν) = sup{ (np i ) : p,, p n, By the Lagrange multiplier technique, we have where λ = λ (ν) satisfies l (ν) := 2 log L (ν) = 2 p i =, p i Z i (ν) = }. log{ + 2λ Z i (ν)}, Z i (ν) + λ Z i (ν) =. The following theorem shows that Wilks Theorem holds for the above jackknife empirical likelihood method. Theorem 3. As n, l (ν ) converges in distribution to a chi-square limit with one degree of freedom, where ν denotes the true value of ν. Based on the above theorem, one can construct a confidence interval with level α for ν without estimating the asymptotic variance as I (α) = {ν : l (ν) χ 2,α}, 8

where χ 2,α denotes the α-th quantile of a chi-square limit with one degree of freedom. The above theorem can also be employed to test H : ν = ν against H a : ν ν with level α by rejecting H whenever l (ν ) > χ 2,α. For computing l (ν), one can simply employ the R package emplik as we do in Section 3. For obtaining I (α), one has to compute l (ν) for all ν, which is usually done by step searching as we do in Section 4. 2.3 Interval estimation for (ρ, ν) As mentioned previously, the Kendall s tau estimator is not a linear functional, hence one can not apply the empirical likelihood method directly to construct a confidence region for (ρ, ν). Here we employ the jackknife empirical likelihood method by defining the jackknife empirical likelihood function as L 2 (ρ, ν) Theorem 4. = sup{ n (np i) : p,, p n, p i =, p iz i (ν) =, p i(nˆρ (n )ˆρ i ) = ρ}. As n, 2 log L 2 (ρ, ν ) converges in distribution to a chi-square limit with two degrees of freedom, where (ρ, ν ) T denotes the true value of (ρ, ν) T. Based on the above theorem, one can construct a confidence region with level α for (ρ, ν ) T without estimating the asymptotic covariance matrix as I 2 (α) = {(ρ, ν) : 2 log L 2 (ρ, ν) χ 2 2,α}, where χ 2 2,α denotes the α-th quantile of a chi-square limit with two degrees of freedom. 2.4 Interval estimation for the tail dependence coefficient λ In order to construct a jackknife empirical likelihood confidence interval for the tail dependence coefficient λ for the t-copula, one may construct a jackknife sample based on the estimator ˆλ given in Section 2.. This requires calculating the estimator for ν without the i-th observation. Since ˆν has no explicit formula, it is very computationally intensive to formulate the jackknife sample. Although the approximate jackknife 9

empirical likelihood method in Peng (22) may be employed to reduce computation, it requires computing the complicated partial derivatives of the log density of the t- copula. Here we propose the following profile empirical likelihood method by treating ν as a nuisance parameter. Define Z i (ν) = 2nt ν+ ( (ν + )( ˆρ)/ + ˆρ) 2(n )t ν+ ( (ν + )( ˆρ i )/ + ˆρ i ) for i =,, n. Based on the jackknife sample Zi (ν, λ) = (Z i(ν), Z i (ν) λ) for i =,, n, and the fact that λ = 2t ν+ ( (ν + )( ρ)/ + ρ), we define the jackknife empirical likelihood function for (ν, λ) as n L 3 (ν, λ) = sup{ (np i ) : p,, p n, p i =, p i Zi (ν, λ) = }. By the Lagrange multiplier technique, we have l 3 (ν, λ) = 2 log L 3 (ν, λ) = 2 where λ 3 = λ 3 (ν, λ) satisfies log{ + 2λ T 3 Zi (ν, λ)}, Zi (ν, λ) + λ T =. 3 Z i (ν, λ) Since we are only interested in the tail dependence coefficient λ, we consider the profile jackknife empirical likelihood function l p 3 (λ) = min l 3(ν, λ). ν> As in Qin and Lawless (994), we first show that there is a consistent solution for ν, say ν = ν(λ), and then show that Wilks theorem holds for l 3 ( ν(λ ), λ ), where λ denotes the true value of λ. Lemma. With probability tending to one, l 3 (ν, λ ) attains its minimum value at some point ν such that ν ν n /3. Moreover ν and λ 3 = λ 3 ( ν, λ ) satisfy Q n ( ν, λ 3 ) = and Q 2n ( ν, λ 3 ) =,

where and Q 2n (ν, λ 3 ) = n Q n (ν, λ 3 ) = n Z i (ν, λ ) + λ T 3 Z i (ν, λ ) + λ T 3 Z i (ν, λ ) { Z i (ν, λ ) } T λ 3. ν The next theorem establishes the Wilks theorem for the proposed jackknife empirical likelihood method. Theorem 5. As n, l 3 ( ν(λ ), λ ) converges in distribution to a chi-square limit with one degree of freedom. Based on the above theorem, one can construct a confidence interval with level α for λ without estimating the asymptotic variance as I 3 (α) = {λ : l 3 ( ν(λ), λ) χ 2,α}. 3 Simulation Study We investigate the finite sample behavior of the proposed jackknife empirical likelihood method for constructing confidence intervals for ν and compare it with the parametric bootstrap method in terms of coverage probability. We employ the R packge copula to draw, random samples with size n = 2 and 5 from the t-copula with ρ =.,.5,.9 and ν = 3, 8. For computing the confidence interval based on the normal approximation, we use the parametric bootstrap method. More specifically, we draw, random samples with size n from the t- copula with parameters ˆρ and ˆν. Denote the samples by {(X (j) i, Y (j) i } n, where j =,,. For each j =,,, we recalculate the two-step estimator based on the sample {(X (j) i, Y (j) } n, which results in (ˆρ (j), ˆν (j) ). Let a and b denote the largest i [n( α)/2]-th and [n( + α)/2]-th values of ˆν () ˆν,, ˆν () ˆν. Therefore a bootstrap confidence interval for ν with level α is [ˆν b, ˆν a]. The R package emplik

Table : Coverage probabilities are reported for the proposed jackknife empirical likelihood method (JELM) and the Normal approximation method based on ˆν (NAM). (n, ρ, ν) JELM NAM JELM NAM Level 9% Level 9% Level 95% Level 95% (2,., 3).886.83.935.844 (2,.5, 3).849.77.98.82 (2,.9, 3).878.826.928.849 (2,., 8).83.6.99.65 (2,.5, 8).85.594.886.6 (2,.9, 8).837.664.92.68 (5,., 3).87.825.923.853 (5,.5, 3).874.838.933.87 (5,.9, 3).876.844.932.869 (5,., 8).87.728.939.76 (5,.5, 8).862.747.92.769 (5,.9, 8).892.774.942.797 is employed to compute the coverage probability of the proposed jackknife empirical likelihood method. These coverage probabilities are reported in Table, showing that the proposed jackknife empirical likelihood method is much more accurate than the normal approximation method, and both intervals become more accurate when the sample size gets large. 2

4 Empirical Study First we fit the bivariate t-copula to the log-returns of the exchange rates between Euro and US dollar and those between British pound and US dollar from January 3, 2 till December 9, 27, which gives ˆρ =.726 and ˆν = 7.543. In Figure, we plot the empirical likelihood ratio function l (ν) against ν from 4.5 to 4 with step.5. The proposed jackknife empirical likelihood intervals for ν are (6.25,.23) for level.9, and (5.4,.9) for level.95. We also calculate the intervals for ν based on the normal approximation method as in Section 3, which result in (4.656, 9.68) for level.9 and (3.847, 9.864) for level.95. As we see, the intervals based on the jackknife empirical likelihood method are slightly shorter and more skewed to the right than those based on the normal approximation method. Second we fit the bivariate t-copula to the data set on 3283 daily log-returns of equity for two major Dutch banks, ING and ABN AMRO Bank, over the period 99 23, giving ˆρ =.682 and ˆν = 2.67. The empirical likelihood ratio function l (ν) is plotted against ν in Figure 2 from.5 to 3.5 with step., which shows that the proposed jackknife empirical likelihood intervals for ν are (2.28, 3.42) for level.9 and (2.246, 3.29) for level.95. The normal approximation based intervals for ν are (2.257, 2.9) for level.9 and (2.95, 2.962) for level.95. As we see, the intervals based on the jackknife empirical likelihood method are slightly wider and more skewed to the right than those based on the normal approximation method. Note that this data set has been analyzed by Einmahl, de Haan and Li (26) and Chen, Peng and Zhao (29) by fitting nonparametric tail copulas and copulas. Finally we fit the t copula to the nonzero losses to building and content in the Danish fire insurance claims. This data set is available at www.ma.hw.ac.uk/ mcneil/, which comprises 267 fire losses over the period 98 to 99. We find that ˆρ =.34 3

and ˆν = 9.474. The empirical likelihood ratio function l (ν) is plotted in Figure 3 against ν from 5.5 to 2 with step.5. The proposed jackknife empirical likelihood intervals for ν are (6.83, 6.285) and (6.45, 7.785) for levels.9 and.95 respectively, and the normal approximation based intervals for ν are (.978, 2.79) and ( 2.242, 3.7) for levels.9 and.95 respectively. The above negative value is due to some large values of the bootstrapped estimators of ν, which is a disadvantage of using the bootstrap method. It is clear that the proposed jackknife empirical likelihood intervals are shorter and more skewed to the right than the normal approximation based intervals. 5 Proofs Proof of Theorem. Define g(x, y) = Esign((x X )(y Y )) τ = 4{C(F (x), F 2 (y)) EC(F (X ), F 2 (Y ))} 2{F (x) 2 } 2{F 2 (y) 2 }, ψ(x, y, x 2, y 2 ) = sign((x x 2 )(y y 2 )) g(x, y ) g(x 2, y 2 ). It follows from the Hoeffding decomposition and results in Hoeffding (948) that n{ˆτ τ} = n 2 g(x i, Y i ) + 2 n n(n ) i<j n ψ(x i, Y i, X j, Y j ) = 2 n g(x i, Y i ) + o p (), (6) 4

which implies (3). By the Taylor expansion, we have = n l(ˆρ, ˆν; F n(x i ), F n2 (Y i )) = n l(ρ, ν; F n(x i ), F n2 (Y i )) + n { ρ l(ρ, ν; F n(x i ), F n2 (Y i ))}(ˆρ ρ) + n { ν l(ρ, ν; F n(x i ), F n2 (Y i ))}(ˆν ν) + o p () = n l(ρ, ν; F (X i ), F 2 (Y i )) (7) + n l u(ρ, ν; F (X i ), F 2 (Y i )){F n (X i ) F (X i )} + n l v(ρ, ν; F (X i ), F 2 (Y i )){F n2 (Y i ) F 2 (Y i )} + n { ρ l(ρ, ν; F (X i ), F 2 (Y i ))} n(ˆρ ρ) + n { ν l(ρ, ν; F (X i ), F 2 (Y i ))} n(ˆν ν) + o p (), which implies (4). More details can be found in Wang, Peng and Yang (23). The values of σ 2, σ 2 and σ 2 2 can be calculated straightforward by using the Law of Large Numbers, which are σ 2 = cos 2 ( πτ 2 )π2 {8 {2C2 (u, v) 2(u + v)c(u, v) + uv} dc(u, v) + 5 3 τ 2 + 2τ}, where R = R 2 = R 3 = R 4 = σ 2 2 = K 2 ν (K 2 + R + R 2 + 2R 3 + 2R 4 + 2R 5 + K 2 ρσ 2 + 2K ρ (L + L 2 + L 3 )), σ 2 2 = K ν (K ρ σ 2 + L + L 2 + L 3 ), K 2 = l(ρ, ν; u, v) 2 dc(u, v), l u (ρ, ν; u, v )l u (ρ, ν; u 2, v 2 )(u u 2 u u 2 ) dc(u, v )dc(u 2, v 2 ), l v (ρ, ν; u, v )l v (ρ, ν; u 2, v 2 )(v v 2 v v 2 ) dc(u, v )dc(u 2, v 2 ), l u (ρ, ν; u, v )l v (ρ, ν; u 2, v 2 )(C(u, v 2 ) u v 2 ) dc(u, v )dc(u 2, v 2 ), l u (ρ, ν; u, v )l(ρ, ν; u 2, v 2 )(I(u 2 u ) u ) dc(u, v )dc(u 2, v 2 ), 5

R 5 = L = cos( πτ 2 )π l v (ρ, ν; u, v )l(ρ, ν; u 2, v 2 )(I(v 2 v ) v ) dc(u, v )dc(u 2, v 2 ), l(ρ, ν; u, v){4c(u, v) 2u 2v} dc(u, v), L 2 = cos( πτ 2 )π l u(ρ, ν; u, v ){4C(u 2, v 2 ) 2u 2 2v 2 } and L 3 {I(u 2 u ) u } dc(u, v )dc(u 2, v 2 ), = cos( πτ 2 )π l v(ρ, ν; u, v ){4C(u 2, v 2 ) 2u 2 2v 2 } {I(v 2 v ) v } dc(u, v )dc(u 2, v 2 ). Proof of Theorem 2. It follows from (5) and the fact that n(ˆλ λ) = n(λ(ˆρ, ˆν) λ(ρ, ν)) = n λ ρ (ˆρ ρ) + n λ ν (ˆν ν) + o p(). Proof of Theorem 3. Here we use similar arguments in Wang, Peng and Yang (23). Write Z i = Z i (ν ). Then it suffices to prove the following results: and n d Z i N(, σ 2 3 ) as n, (8) n Z 2 i p σ 2 3 as n, (9) max Z i = o p ( n), () i n where σ 2 3 = K2 νσ 2 2. n Write Z i = n + n l(ˆρ, ν ; F n (X j ), F n2 (Y j )) {l(ˆρ, ν ; F n (X j ), F n2 (Y j )) l(ˆρ i, ν ; F n,i (X j ), F n2,i (Y j ))} j i and use the arguments in the proof of Lemma in Wang, Peng and Yang (23), we have n {l(ˆρ, ν ; F n (X j ), F n2 (Y j )) l(ˆρ i, ν ; F n,i (X j ), F n2,i (Y j ))} = o p (). j i 6

Hence, it follows from (7) that n Z i = n l(ˆρ, ν ; F n (X j ), F n2 (Y j )) + o p () = n l(ρ, ν ; F (X i ), F 2 (Y i )) + n l u(ρ, ν ; F (X i ), F 2 (Y i )){F n (X i ) F (X i )} + n l v(ρ, ν ; F (X i ), F 2 (Y i )){F n2 (Y i ) F 2 (Y i )} () + n { ρ l(ρ, ν ; F (X i ), F 2 (Y i ))} n(ˆρ ρ) + o p () = n { ν l(ρ, ν ; F (X i ), F 2 (Y i ))} n(ˆν ν ) + o p () d N(, σ 3 ), i.e., (8) holds. Similarly we can show (9) and (). Proof of Theorem 4. Write Y i = nˆρ (n )ˆρ i ρ. Similar to the proof of Theorem 3, it suffices to show that and where n n (Z i, Y i ) T d N(, Σ) as n, (2) (Z i, Y i ) T (Z i, Y i ) p Σ as n, (3) max (Z i, Y i ) T = o p ( n), (4) i n Σ = σ3 2 K ν σ 2. K ν σ 2 σ 2 Since (ˆτ ˆτ i) = and ˆτ ˆτ i = O(/n) almost surely, we have Y i = n(ˆρ ρ) + n (ˆρ ˆρ i ) n n = n(ˆρ ρ) + n n = n(ˆρ ρ) + O( n) = n(ˆρ ρ) + O( n) = n(ˆρ ρ) + o p (). ( π cos(πˆτ ) 2 2 )(ˆτ ˆτ i) + O()(ˆτ ˆτ i ) 2 (ˆτ ˆτ i ) 2 (O(/n)) 2 7

Thus result (2) follows from (5) and (). Results (3) and (4) can be shown by some similar expansions as in Wang, Peng and Yang (23). Proof of Lemma. Similar to the proof of Theorem 3, we have and n n Z i (ν ) = n { Z i (ν ) λ } = { λ ρ (ρ, ν )} n(ˆρ ρ ) + o p () ν l(ρ, ν ; F (X i ), F 2 (Y i )) n(ˆν ν ) + o p (), where λ(ρ, ν) is defined in Theorem 2. That is, we can show that n Zi (ν, λ ) d N(, Σ ), (5) where the covariance matrix Σ can be calculated by (5). Further we can show that and n Z i (ν, λ )Z T i (ν, λ ) p Σ (6) max i n Z i (ν, λ ) = o p (n /2 ). (7) Hence, the lemma can be shown in the same way as Lemma of Qin and Lawless (994) by using (5) (7). Proof of Theorem 5. Using the same arguments in the proof of Theorem of Qin and Lawless (994), it follows from (5) (7) that λ 3 = S n Q n (ν, ) + o p (n /2 ), ν ν o p (n /2 ) where p S n = Q n (ν,) Q n (ν,) λ 3 ν Q 2n (ν,) λ 3 S S 2 = S 2 E{Z (ν, λ )Z T (ν, λ )} E{ Z (ν,λ ) E{ Z (ν,λ ) ν } T 8 ν }.

By the standard arguments of empirical likelihood method (see proof of Theorem in Owen (99)), it follows from Lemma that l 3 ( ν(λ ), λ ) = 2 log{ + λ T 3 Z i ( ν, λ )} = 2n( λ T 3, νt ν T )(QT n (ν, ), ) T +n( λ T 3, νt ν T )S n( λ T 3, νt ν T )T + o p () = n(q T n (ν, ), )Sn (Q T n (ν, ), ) T + o p () = (W T S S 2, ) (W T, ) T + o p (), S 2 (8) as n, where W is multivariate normal random variable with mean zero and covariance matrix S. Since S S 2 S 2 = S S S 2 S 2 S S S 2 S 2 S, where = S 2 S S 2, we have (W T, ) S S 2 S 2 (W T, ) T = W T {S S S 2 S 2 S }W = {( S ) /2 W } T S /2 {S S S 2 S 2 S }S/2 {( S ) /2 W } = {( S ) /2 W } T {I S /2 S 2 S 2 S /2 }{( S ) /2 W }. (9) Since tr(s /2 S 2 S 2 S /2 ) = tr( S 2 S S 2) =, we have tr(i S /2 S 2 S 2 S /2 ) =. Hence it follows from (8) and (9) that l 3 ( ν(λ ), λ ) d χ 2 () as n. 9

Acknowledgment We thank a reviewer for his/her helpful comments. Peng s research was supported by the Actuarial Foundation. References [] W. Breymann, A. Dias and P. Embrechts (23). Dependence structures for multivariate high-frequency data in finance. Quant. Finance 3, 4. [2] J.C.C. Chan and D.P. Kroese (2). Efficient estimation of large portfolio loss probabilities in t-copula models. European Journal of Operational Research 25, 36 367. [3] Y. Chan and H. Li (28). Tail dependence for multivariate t-copulas and its monotonicity. Insurance: Mathematics and Economics 42, 763 77. [4] J. Chen, L. Peng and Y. Zhao (29). Empirical likelihood based confidence intervals for copulas. Journal of Multivariate Analysis, 37 5. [5] R. Dakovic and C. Czado (2). Comparing point and interval estimates in the bivariate t-copula model with application to financial data. Statistical Papers 52, 79 73. [6] E.F.L. de Melo and B.V.M. Mendes (29). Pricing participating inflation retirement funds through option modeling and copulas. North American Actuarial Journal 3(2), 7 85. [7] J.H.J. Einmahl, L. de Hann, D. Li (26). Weighted approximations of tail copula processes with application to testing the bivariate extreme value condition. Ann. Statist. 34, 987 24. 2

[8] D. Fantazzini (2). Three-stage semi-parametric estimation of T-copulas: asymptotics, finite-sample properties and computational aspects. Computational Statistics and Data Analysis 54, 2562 2579. [9] C. Genest, K. Ghoudi and L.-P. Rivest (995). A semiparametric estimation procedure of dependence parameters in multivariate families of distributions. Biometrika 82, 543 552. [] W. Hoeffding (948). A class of statistics with asymptotically normal distribution. Ann. Math. Statist. 9, 293 325. [] B.Y. Jing, J.Q. Yuan and W. Zhou (29). Jackknife empirical likelihood. J. Amer. Statist. Assoc. 4, 224 232. [2] Z. Landsman (29). Elliptical families and copulas: tilting and premium, capital allocation. Scand. Actuar. J. 2, 85 3. [3] F. Lindskog, A.J. McNeil and U. Schmock (23). Kendall s tau for elliptical distributions. In Credit Risk: Measurement, Evaluation and Management (ed. G. Bol, G. Nakhaeizadeh, S.T. Rachev, T. Ridder and K.-H. Vollmer), pp. 49 56, Heidelberg: Physica. [4] H. Manner and J. Segers (2). Tails of correlation mixtures of elliptical copulas. Insurance: Mathematics and Economics 48, 53 6. [5] A. Owen (2). Empirical Likelihood. Chapman & Hall/CRC. [6] L. Peng (28). Estimating the probability of a rare event via elliptical copulas. North American Actuarial Journal 2(2), 6 28. [7] L. Peng (22). Approximate jackknife empirical likelihood method for estimating equations. Canadian Journal of Statistics 4, 23. 2

[8] J. Qin and J.F. Lawless (994). Empirical likelihood and general estimating equations. Ann. Statist. 22, 3 325. [9] L. Schloegl and D. O Kane (25). A note on the large homogeneous portfolio approximation with the Student-t copula. Finance Stochast. 9, 577 584. [2] G. Venter, J. Barnett, R. Kreps and J. Major (27). Multivariate copulas for financial modeling. Variance, 4 9. [2] R. Wang, L. Peng and J. Yang (23). Jackknife empirical likelihood for parametric copulas. Scandinavian Actuarial Journal 5, 325 339. 22

ratio 5 5 4 6 8 2 4 nu Figure : Exchange rates. The empirical likelihood ratio l (ν) is plotted against ν from 4.5 to 4 with step.5 for the log-returns of the exchange rates between Euro and US dollar and those between British pound and US dollar from January 3, 2 till December 9, 27. 23

ratio 2 3 4 5.5 2. 2.5 3. 3.5 nu Figure 2: Equity. The empirical likelihood ratio l (ν) is plotted against ν from.5 to 3.5 with step. for the daily log-returns of equity for two major Dutch banks (ING and ABN AMRO Bank) over the period 99 23. 24

ratio 2 4 6 8 5 5 2 nu Figure 3: Danish fire losses. The empirical likelihood ratio l (ν) is plotted against ν from 5.5 to 2 with step.5 for the nonzero losses to building and content in the Danish fire insurance claims. 25