PLUG-IN BANDWIDTH SELECTOR FOR THE KERNEL RELATIVE DENSITY ESTIMATOR

Size: px
Start display at page:

Download "PLUG-IN BANDWIDTH SELECTOR FOR THE KERNEL RELATIVE DENSITY ESTIMATOR"

Transcription

1 PLUG-IN BANDWIDTH SELECTOR FOR THE KERNEL RELATIVE DENSITY ESTIMATOR ELISA MARÍA MOLANES-LÓPEZ AND RICARDO CAO Departamento de Matemáticas, Facultade de Informática, Universidade da Coruña, Campus de Elviña s/n, 57 A Coruña, Spain BANDWIDTH SELECTION FOR THE RELATIVE DENSITY Abstract Tis paper is focused on two kernel relative density estimators in a two-sample problem An asymptotic expression for te mean integrated squared error of tese estimators is found and, based on it, two solve-te-equation plug-in bandwidt selectors are proposed In order to examine teir practical performance a simulation study and a practical application to a medical dataset are carried out Key words and prases: kernel-type estimates, smooting parameter, solve-teequation rules, survival analysis, two-sample problem Introduction Te study of differences among groups or canges over time is a goal in fields suc as medical researc and social science researc Te traditional metod for tis purpose is te usual parametric location and scale analysis However, tis is a very restrictive tool, since a lot of te information available in te data is unaccessible In order to make a better use of tis information it is convenient to focus on distributional analysis, ie,

2 on te general two-sample problem of comparing te cumulative distribution functions cdf, F and F, of two random variables, X and X Useful tools for tis purpose are te relative distribution function, R t, and te relative density function, r t, of X wit respect to wrt X : R t = P F X t = F F t, < t < were F t = inf {x F x t} denotes te quantile function of F and r t = R t = f F t f F t, < t < were f and f are te densities pertaining to F and F, respectively Tese two curves, as well as estimators for tem, ave been studied by Gastwirt 968, Ćwik and Mielniczuk 993, Hsie 995, Hsie and Turnbull 996, Cao et al 2, 2 and Handcock and Janssen 22 Tese functions, R and r, are closely related to oter statistical metods Te ROC curve, used in te evaluation of te performance of medical tests for separating two groups, is related to R troug te relationsip ROC t = R t see, for instance, Holmgren 996 and Li et al 996 for details and te density ratio x = fx f x,x R, used by Silverman 978 is linked to r troug x = r F x Trougout te paper, we will focus on two kernel-type estimators of r, similar to te one already proposed by Ćwik and Mielniczuk 993 In te following section we will give some notation and obtain an asymptotic representation for te MISE of te relative density estimators Tis is a difference wit respect to Ćwik and Mielniczuk 993, were an asymptotic expression for te MISE of only te dominant part of te estimator was found Section 3 is concerned wit automatic global bandwidt selection Two solve-te-equation plug-in bandwidt selectors based on te ideas by Seater and Jones 2

3 99 are proposed A simulation study is sown in Section 4 were te performance of te data-driven selectors proposed in tis paper is compared wit te selector proposed in Ćwik and Mielniczuk 993 A medical application is presented in Section 5 Finally, te proofs of te results presented in Sections 2 and 3 are included in Section 6 2 Kernel relative density estimators Consider te two-sample problem wit completely observed data: {X,,X n }, {X,,X m } were te X i s are independent and identically distributed as X ; and te X j s are independent and identically distributed as X Tese two sequences are independent eac oter Trougout tis paper all te asymptotic results are obtained if bot sample sizes m and n tend to infinity in suc a way tat, for some constant < λ <, m lim m n = λ We assume te following conditions on te underlying distributions, te kernels K and M and te bandwidts and to be used in te estimators see, 2 and 3 below: F F and F ave continuous density functions, f and f, respectively F2 f is a tree times differentiable density function wit f 3 bounded R r is a differentiable density wit compact support contained in [, ] wit r bounded 3

4 K K is a symmetric four times differentiable density function wit compact support [, ] and K 4 bounded K2 M is a symmetric density and continuous function except at a finite set of points B and m 3 B2 and n 4 Since K t z dr z is close to r t and for smoot distributions it is satisfied tat: t z K dr z = K t F z df z, a natural way to define a kernel-type estimator of r t is to replace te unknown functions F and F by some appropriate estimators We consider two proposals: ˆr t = K t F n zdf m z = m m K t F n X j j= and 2 ˆr, t = K t F n z df m z = m m j= K t F n X j, were K t = K t, K is a kernel function, is te bandwidt used to estimate r, F n and F m are te empirical distribution functions based on X i s and X j s, respectively, and F n is a kernel-type estimate of F given by: n x 3 Fn = n Xi M i= were M denotes te cdf of te kernel M and is te bandwidt used to estimate F 4

5 Using a Taylor expansion, ˆr t can be written as follows: ˆr t = K t F z df m z + K t F z F z F n zdf m z + F z F n z 2 sk 2 t F z s F n z F zdsdf m z Let us define Ũn = F n F and R m = F m F Ten, ˆr t can be rewritten in a useful way for te study of its mean integrated squared error MISE: 4 ˆr t = r t + A + A 2 + B, were B = r t = A 2 = n A = K t F zdf m z = m n i= F z F n z 2 v Ũnv m K t F X j j= K t vd Rm R v F w {X i w}k t F wdfw sk 2 t F z s F n z F z dsdf m z Proceeding in a similar way, we can rewrite ˆr, as follows: 5 ˆr, t = r t + A + A 2 + Â + ˆB, were ˆB = Â = F n w F n w K t F wdf m w F z F n z 2 sk 2 t F z s Fn z F z dsdf m z Our main result is an asymptotic representation for te MISE of ˆr t and ˆr, t Wit te purpose of simplifying te exposition of te results obtained, from ere on we will denote C g = g2 xdx, for any square integrable function g 5

6 Teorem AMISE Assume conditions F, R, K and B Ten were d K = x2 K xdx MISEˆr = AMISE + o m o n AMISE = m CK d 2 KCr 2 + n CrCK If te conditions F2, K2, and B2 are assumed as well, ten te same result is satisfied for te MISEˆr, Remark From Teorem it follows tat te optimal bandwidt, minimizing te asymptotic mean integrated squared error of any of te estimators considered for r, is given by CKλCr + 6 AMISE = d 2 K Cr2 m 5 Remark 2 Note tat AMISE derived from Teorem does not depend on te bandwidt A more iger-order analysis sould be considered to address simultaneously te bandwidt selection problem of and 3 Bandwidt selectors 3 Estimation of density functionals It is very simple to sow tat, under sufficiently smoot conditions on r r C 2l R, te functionals 7 C r l = r l x 2 dx appearing in 6, are related to oter general functionals of r, denoted by Ψ 2l : 8 C r l = l r 2l xr xdx = l Ψ 2l 6

7 were Ψ l = r l xr xdx = E [ r l F X ] Te equation above suggests a natural kernel-type estimator for Ψ l as follows 9 ˆΨl g = m [ m m ] m Ll g F n X j F n X k j= k= were L is a kernel function and g is a smooting parameter called pilot bandwidt Likewise in te previous section, tis is not te unique possibility and we could consider anoter estimator of Ψ l, Ψl g = m [ m m m Ll g Fn X j F ] n X k, j= k= were F n in 9 is replaced by F n Since te difference between bot estimators decreases as tends to zero, it is expected to obtain te same teoretical results for bot estimators Terefore, we will only sow teoretical results for ˆΨ l g We will obtain te asymptotic mean squared error of ˆΨ l g under te following assumptions R3 Te relative density r C l+6 R K3 Te kernel L is a symmetric kernel of order 2, L C l+7 R and satisfies tat l 2 +2 L l d L >, L l = L l+ =, wit d L = x2 Lxdx B3 g = g m is a positive-valued sequence of bandwidts satisfying lim g = and m lim m mgmax{α,β} = were α = 2 l + 7,β = l

8 Condition R3 implies a smoot beaviour of r in te boundary of its support, contained in [, ] If tis smootness fails, te quantity C r l could be still estimated troug its definition, using a kernel estimation for r l see Hall and Marron 987 for te one-sample problem setting Condition K3 can only old for even l Observe tat in condition B3 for even l, max {α,β} = α for l =, 2 and max {α,β} = β for l = 4, 6, Teorem 2 Assume conditions F, R3, K3 and B3 Ten it follows tat [ MSE ˆΨl g = + λψ mg l+ll + 2 d LΨ l+2 g 2 + O g 4 ng l+ +o ] m 2 g 2l+Ψ C L l + o m 2 g 2l+ + O n Remark 3 Te first term in te rigt-and side of corresponds to te squared bias term of MSE Note tat, using K3 and 8, te main bias term can be made to vanis by coosing g as g l 2L l λψ + g l = d L Ψ l+2 m l+3 = 2L l d 2 K Ψ l l+3 AMISE d L Ψ l+2 CK 32 STE rules based on Seater & Jones ideas As in te context of ordinary density estimation, te practical implementation of te kernel-type estimators proposed ere see and 2, requires te coice of te smooting parameter Our two proposals, SJ and SJ2, as well as te selector b 3c recommended by Ćwik and Mielniczuk 993, are modifications of Seater & Jones 99 Since te Seater & Jones selector is te solution of an equation in te bandwidt, it is also known as a solve-te-equation STE rule Motivated by te formula 6 8

9 for te AMISE-optimal bandwidt and te relation 8, solve-te-equation rules require tat is cosen to satisfy te relationsip = C K λ Ψ γ + d 2 K Ψ 4 γ 2 m were te pilot bandwidts for te estimation of Ψ and Ψ 4 are functions of γ and γ 2, respectively and Motivated by Remark 3, we suggest taking 2 L d 2 K γ = Ψ 4 g 4 d L Ψ2 g 2 CK 5, L 4 d 2 K γ 2 = Ψ 7 4 g 4 5 7, d L Ψ6 g 6 CK were Ψ j, j =, 2, 4, 6 are kernel estimates Note tat tis way of proceeding leads us to a never ending process in wic a bandwidt selection problem must be solved at every stage To make tis iterative process feasible in practice one possibility is to propose a stopping stage in wic te unknown quantities are estimated using a parametric scale for r Tis strategy is known in te literature as te stage selection problem see Wand and Jones 995 Wile te selector b 3c in Ćwik and Mielniczuk 993 used a Gaussian scale, now for te implementation of SJ2, we will use a mixture of betas based on te Weierstrass approximation teorem and Bernstein polynomials associated to any continuous function on [, ] see Kakizawa 24 and references terein for te motivation of tis metod Later on we will sow te formula for computing te reference scale above-mentioned, togeter wit te selector b 3c we used in Section 4 In te following we denote te Epanecnikov kernel by K, te uniform density in [, ] by M and we define L as follows Lx = Γ8 8 x + x + 8 { x } 2Γ9Γ

10 Next, we detail te steps required in te implementation of SJ2 Step Obtain PR ˆΨ j j =, 4, 6, 8, parametric estimates for Ψ j j =, 4, 6, 8, wit te replacement of r in C r j/2 see 7, by a mixture of betas, bx, as it will be explained later on see 2 Step 2 Compute kernel estimates for Ψ j j = 2, 4, 6, by using Ψ j gj PR j = 2, 4, 6, wit g PR j = 2 Lj λˆψ PR + d L ˆΨ PR j+2 m j+3, j = 2, 4, 6 Step 3 Estimate Ψ and Ψ 4, using, by means of Ψ ˆγ and Ψ 4 ˆγ 2, were ˆγ = 2 L d 2 K Ψ 4 g PR 4 d L Ψ2 g2 PR C K and ˆγ 2 = 2 L 4 d 2 K Ψ 4 g PR 4 d L Ψ6 g6 PR C K Step 4 Select te bandwidt SJ2 as te one tat solves te following equation in : = C K λ Ψ ˆγ + d 2 K Ψ 4 ˆγ 2 m In order to solve te equation above, it will be necessary to use a numerical algoritm In te simulation study we will use te false-position metod Te main reason is tat te false-position algoritm does not require te computation of te derivatives, wat simplifies considerably te implementation of te proposed bandwidt selectors At te same time, tis algoritm presents some advantages over oters because it tries to combine te speed of metods suc as te secant metod wit te security afforded by te bisection metod 5

11 Unlike te Gaussian parametric reference, used to obtain b 3c, te selector SJ2 uses in Step a mixture of betas as follows: 2 bx = N j R n,m N j= R j n,m βx,j,n j + N were 3 4 m R n,m x = m x M F n X j, g j= 2 xm x M xdx g = nd 2 M C r 3, βx,a,b stands for te beta density βx,a,b = and N is te number of betas in te mixture Γa + b ΓaΓb xa x b,x [, ], Since we are trying to estimate a density wit support in [, ] it seems more suitable to consider a parametric reference wit tis support A mixture of betas is an appropriate option because it is flexible enoug to model a large variety of relative densities, wen derivatives of order, 3 and 4 are also required Note tat, for te sake of simplicity, we are using above te AMISE-optimal bandwidt g for estimating a distribution function in te setting of a one-sample problem see Polansky and Baker 2 for more details in te kernel-type estimate of a distribution function Te use of tis bandwidt requires te previous estimation of te unknown functional, C r We will consider a quick and dirty metod, te rule of tumb, tat uses a parametric reference for r to estimate te above-mentioned unknown quantity More specifically, our reference scale will be a beta wit parameters p, q estimated from te smooted relative sample { Fn X j } m j=, using te metod of moments Following te same ideas as for 3 and 4, te bandwidt selector used for te

12 kernel-type estimator F n introduced in 3 is based on te AMISE-optimal bandwidt in te one-sample problem: = 2 xm x M xdx nd 2 M C f As it was already mentioned above, in most of te cases tis metodology will be applied to survival analysis, so it is natural to assume tat our samples come from distributions wit support on te positive real line Terefore, a gamma reference distribution, Gammaα,β, as been considered, were te parameters α,β are estimated from te smooted relative sample { Fn X j } m j=, using te metod of moments For te implementation of SJ, we proceed analogously to tat of SJ2 above Te only difference now is tat trougout te previous discussion, Ψj and F n are replaced by, respectively, ˆΨ j and F n As a variant of te selector tat Ćwik and Mielniczuk 993 proposed, b 3c is obtained as te soluction to te following equation: b 3c = CK + λˆψ a d 2 ˆΨ K 4 α 2 b 3 c m { } were a = 78ˆσm 3, ˆσ = min s m,îqr/349, s m and ÎQR denote, respectively, te empirical standard deviation and te sample interquartile range of te relative data, {F n X j } m j=, α 2 b 3c = 7694 were GS stands for standard Gaussian scale, 5 3 ˆΨ4 g GS 7 4 b 5 7 ˆΨ 6 g6 GS 3c,, g 4 GS =247ˆσm 7,g6 GS =234ˆσm 9 and te estimates ˆΨ j wit j =, 4, 6 were obtained using 9, wit L replaced by te standard Gaussian kernel and wit data driven bandwidt selectors derived from 2

13 reducing te two-sample problem to a one-sample problem It is interesting to note tat all te kernel-type estimators presented previously ˆr t, ˆr, t, Rn,m x and F n x were not corrected to take into account, respectively, te fact tat r and R ave support on [, ] instead of on te wole real line, and te fact tat f is supported only on te positive real line Terefore, in order to correct te boundary effect in practical applications we will use te well known reflecting metod to modify ˆr t, ˆr, t, R n,m x and F n x, were needed 4 Simulations We compare, troug a simulation study, te performance of te bandwidt selectors SJ and SJ2, proposed in Section 3, wit te standard competitor b 3c recommended by Ćwik and Mielniczuk 993 Altoug we are aware tat te smooting parameter N introduced in 2 sould be selected by some optimal way based on te data, tis issue goes beyond te scope of tis article Consequently, from ere on, we will consider N = 4 components in te beta mixture reference scale model given by 2 We will consider te first sample coming from te random variate X = W U and te second sample coming from te random variate X = W S, were U denotes a uniform distribution in te compact interval [, ], W is te distribution function of a Weibull distribution wit parameters 2, 3 and S is a random variate from one of te following five different populations see Figure : a A beta distribution wit parameters 4 and 7 β 4, 7 b A mixture consisting of V wit probability 4 5 and V 2 wit probability 5, were V = β 4, 37 and V 2 = β 4, 2 3

14 c A mixture consisting of V wit probability 3 and V 2 wit probability 2 3, were V = β 34, 5 and V 2 = β 5, 3 Put Figure about ere Coosing different values for te pair of sample sizes m and n and under eac of te models presented above, we start drawing 5 pair of random samples and, according to every metod, we select te bandwidts ĥ Ten, in order to ceck teir performance we approximate by Monte Carlo te mean integrated squared error, EM, between te true relative density and te kernel-type estimate for r, given by wen ĥ = b 3c, SJ or by 2 wen ĥ = SJ 2 Te computation of te kernel-type estimations can be very time consuming by using a direct algoritm Terefore, we will use linear binned approximations tat, tanks to teir discrete convolution structures, can be fast computed by using te fast Fourier transform FFT see Wand and Jones 995 for more details For all te models, te values of tis criterion for te tree bandwidt selectors, SJ, SJ2 and b 3c, can be found in Table Put Table about ere A careful look at te table points out tat te new selector SJ2 presents a muc better beaviour tan te selector b 3c, especially wen te sample sizes are equal or wen m is larger tan n Te improvement is even larger for unimodal relative densities model a and b On te oter and, it is observed tat te oter proposal, SJ, presents only a moderate improvement over b 3c for unimodal relative densities model a and b and performs only sligtly better or even worse tan b 3c for bimodal relative densities model c Te ratio m n produces an important effect on te beaviour of any of te 4

15 tree selectors considered For instance, it is clearly seen an asymmetric beaviour of te selectors in terms of te sample sizes Oter proposals for selecting ave been investigated For instance, versions of SJ2 were considered in wic eiter te unknown functionals Ψ are estimated from te viewpoint of a one-sample problem or te STE rule is modified in suc a way tat only te function ˆγ 2 is considered in te equation to be solved see Step 4 in Section 3 After a simulation study similar to te one detailed ere, but now carried out for tese versions of SJ2, it was observed a similar practical performance to tat observed for SJ2 However, a worse beaviour was observed wen, in te implementation of tese versions of SJ2, te smoot estimate of F is replaced by te empirical distribution function F n Terefore, altoug SJ2 requires te selection of two bandwidt parameters, a clear better practical beaviour is observed wen considering te smooted relative data instead of te non-smooted ones 5 A medical application In tis section we apply te plug-in STE selector SJ2 detailed above, to estimate te relative density for a real data set concerned wit prostate cancer PC Te data consist of 599 patients suffering from PC + and 835 patients PC-free - For eac patient te illness status as been determined troug a prostate biopsy carried out for first time in Hospital Juan Canalejo Galicia, Spain between January of 22 and September of 25 In te literature, tere exists an increasingly interest in finding a good diagnostic test tat elps in te early detection of PC and avoids te need of undergoing a prostate biopsy Tere are several studies in wic, troug ROC curves, it was investigated te 5

16 performance of different diagnostic tests based on some analytic measurements suc as te total prostate specific antigen tpsa, te free PSA fpsa or te complexed PSA cpsa As it was mentioned in Section, tere exists a close relation between te concepts of ROC curve and relative density Relative density estimates can provide more detailed information about te performance of a diagnostic test wic can be useful not only in comparing different tests but also in designing an improved one Tis issue goes beyond te scope of tis article and terefore it will not be investigated ere In tis section we compare from a distributional point of view te above mentioned measurements tpsa, fpsa and cpsa among te two groups in te data set PC+ and PC- To tis end we start computing te appropriate bandwidts using te datadriven bandwidt selector SJ2 and ten te corresponding relative density estimates are computed using 2 Tese estimates are sown in Figure 2 Put Figure 2 about ere It is clear from Figure 2 tat te relative density estimate is above one in a central interval accounting for a probability of about 4% of te PC- distribution for te variables tpsa and cpsa Tis is also te case in te % rigt tail of te PC- group for tpsa and cpsa, but not for fpsa In te case of fpsa te central interval wit relative density above one is sligtly sifted to te left of te PC- distribution: between percentiles 2% and 6% 6 Proofs Te proof of Teorem will be a direct consequence of some previous lemmas were eac one of te terms tat result from expanding te expression for te MISE are studied 6

17 Some of tem will produce dominant parts in te final expression for te MISE wile oters will yield negligible terms Lemma Assume te ypotesis above Ten i E[ r t rt 2 ]dt = CK r tdt + m 4 4 d 2 K Cr2 + o + 4 m ii E[A2 2]dt = n CrCK + o n iii E[A2 ]dt = o n iv E[B2 ]dt = o n v E[2A r rt]dt = vi E[2A 2 r rt]dt = vii E[2B r rt]dt = o m + 4 viii E[2A A 2 ]dt = o m + 4 ix E[2A B]dt = o m + 4 x E[2A 2B]dt = o n + 4 = O n Lemma 2 Assume te ypotesis above Ten i E[Â2 ]dt = o n ii E[ ˆB 2 ]dt = o n Proof of Lemma Te proof of i is not included ere because it is a classical result in te setting of ordinary density estimation in a one-sample problem see Wand and Jones 995 for details We next prove ii Standard algebra gives E[A 2 2] = n 2 4 n i= n j= K t F w E [ F w {Xi w }F w 2 {Xj w 2 } ] K t F w 2 dfw dfw 2 7

18 Due to te independence between X i and X j for i j, and using te fact tat Cov {F X i u }, {F X i u 2 } = u u 2 g u u 2, were g t = t, te previous expression can be rewritten as follows t E[A 2 2] = 2 n 4 = n 4 t u u 2 g u 2 K u u 2 [ t g u 2 d u K u u 2 Now, using integration by parts, it follows tat t K u2 ru du ] 2 ru ru 2 du du 2 5 E[A 2 2] = n 4 + n 4 lim g u 2 Gu u 2 n 4 Gu 2 2 g u 2 du 2 lim g u 2 Gu 2 2 u 2 + were Gu 2 = t u K u ru du u 2 Since G is a bounded function and g =, te second term in te rigt and side of 5 vanises to zero On te oter and, due to te boundedness of K and r, it follows tat Gu 2 K r u is zero as well Terefore, 2, wic let us conclude tat te first term in 6 E[A 2 2] = n 4 Gu 2 2 g u 2 du 2 Now, using integration by parts, it follows tat t u Gu 2 = u ru K + u 2 t u K [ ru + u r u ]du u 2 t u2 = u 2 ru 2 K + t u K [ u r u ru ]du, u 2 8

19 and plugging tis last expression in 6, it is concluded tat E[A 2 2] = n 2I 2 + 2I 22 + I 23, were I 22 = u 2 K I 2 = t r 2 u 2 K 2 u2 du 2 t u 2 ru u2 2K t u [ u r u ru ]du du 2 I 23 = K u 2 2 t u u 2 K t u [ u r u ru ] u 2 [ u r u ru ]du du du 2 Terefore, 7 E[A 2 2]dt = n 2I 2dt + 2 n 2I 22dt + n 2I 23dt Next, we will study eac summand in 7 separately Te first term can be andled by using canges of variable and a Taylor expansion: n 2I 2 dt = n r 2 u 2 u 2 u 2 K 2 sds du 2 Let us define K 2 x = x K2 sds and rewrite te previous term as follows n 2I 2 dt = n r 2 u2 u 2 K 2 K 2 u 2 du 2 Now, by splitting te integration interval into tree subintervals: [,], [, ] and [, ], using canges of variable and te fact tat CK x K 2 x = x, 9

20 it is easy to sow tat n 2I 2 dt = CK Cr + O n n Below, we will study te second term in te rigt and side of 7 By using canges of variable, Caucy-Scwarz inequality and conditions r <, r < and CK <, straigtforward calculations lead to n 2I 22 dt = O n + = O n 2 n 2 Similar arguments give n 2I 23 dt = O n 2 Terefore, it as been sown tat E[A 2 2]dt = n CrCK + O Finally te proof of ii concludes using condition B We now prove iii Direct calculations lead to + O n n 2 8 E[A 2 ] = 4E [I ] were 9 [ I =E t v Ũnv v 2 Ũnv 2 K v t K v2 d R m Rv d R m Rv 2 /X,,X n ] To tackle wit 8 we first study te conditional expectation 9 It is easy to see tat I = V ar[v/x,,x n ] were V = m m j= t X j ŨnX j K Xj 2

21 Tus { I = [ ] 2 t v v m ŨnvK drv [ and E[A 2 ] = m 4 E Taking into account tat { [ ] }[ 2 t v E v Ũnv K [ v Ũnv v 2 Ũnv 2 [ ] E sup Ũnv v 2 = v ] K t v t v v ŨnvK ] 2 drv m 4 t K v2 P sup Ũnv v 2 > c dc, v we can use te Dvoretzky-Kiefer-Wolfowitz inequality, to conclude tat [ ] 2 E sup Ũnv v 2 v 2e 2nc dc = 2 n ye y2 dy = O ] 2 } drv drv drv 2 n Consequently, using 2 and te conditions r < and K < we obtain tat E[A 2 ] = O mn 4 Te proof of iii is concluded using condition B Te results appearing in items iv-x can be proved by first conditioning to some appropriate random variables and ten andling te conditional moments using standard arguments For tis reason teir proofs are not included ere ten Proof of Lemma 2 We start proving i Let us define D n w = F n w F n w, E[Â2 ] = E[E[Â2 /X,,X m ]] [ ] = E E[D n w D n w 2 ]K t F w K t F w 2 df m v df m v 2 Based on te results set for D n w in Hjort and Walker 2, te conditions F2 and K2 and since E[D n w D n w 2 ] = CovD n w,d n w 2 + E[D n w ]E[D n w 2 ], 4 it follows tat E[D n w D n w 2 ] = O + O 4 n 2

22 Terefore, for any t [, ], we can bound E[Â2 ], using suitable constants C 2 and C 3 as follows 4 2 E[Â2 t ] = C 2 K F z fzdz 4 m 4 m t +C 3 K F z t K F z 2 fz 4 fz 2 dz dz 2 m Besides, te condition R allows us to conclude tat 2 K t F z fzdz = O and K t F z K t F z 2 fz fz 2 dz dz 2 = O 2 for all t [, ] Terefore, E [Â2 and B2, implies i ] dt = O 4 n 3 +O 4 2, wic, taking into account conditions B We next prove ii Te proof is parallel to tat of item iv in Lemma Te only difference now is tat instead of requiring E[sup F n x F x p ] = On p 2, were p is an integer larger tan, it is required tat [ 2 E sup F n x F x p ] = On p 2 To conclude te proof, below we sow tat 2 is satisfied Define H n =sup F n x F x, ten, as it is stated in Amad 22, it follows tat H n E n +W n were E n = sup F n x F x and W n = sup E F n x F x = O 2 Using te binomial formula it is easy to obtain tat, for any integer p, H p n p j= Cp j W n p j E j n, were te constants C p j s wit j {,,,p,p} are te binomial coefficients Terefore, since E[E j n] = On j 2 and W p j n = O 2p j, condition B2 leads to W p j n E [E j n] = On p 2 As a straigtforward consequence, 2 olds and te proof of ii is concluded Proof of teorem 2 Below, we will briefly detail te steps followed to study te asymptotic beaviour of te mean squared error of ˆΨ l g defined in 9 First of all, let 22

23 us observe tat wic implies: ˆΨ l g = m Ll g + m 2 m m j= k=,j k L l g F n X j F n X k, ] E [ˆΨl g = + E [ L mg l+ll g l F n X F n X 2 ] m Starting from te equation E [ L l g F n X F n X 2 ] = E [ E [ ]] L g l F n X F n X 2 /X,,X n [ ] = E L g l F n x F n x 2 f x f x 2 dx dx 2 = and using a Taylor expansion, we ave E [ L l g F n x F n x 2 ] f x f x 2 dx dx 2 22 E [ L l g F n X F n X 2 ] = 7 i= I i were I i = I = E g l+ll i!g l+i+ll+i F x F x 2 g F x F x 2 f x f x 2 dx dx 2 g [ F n x F x F n x 2 + F x 2 i] f x f x 2 dx dx 2 i =,,6 I 7 = 7!g l+7+e [ L l+7 ξ n F n x F x F n x 2 + F x 2 7] f x f x 2 dx dx 2 and ξ n is a value between F x F x 2 g and F nx F n x 2 g 23

24 Now, consider te first term, I, in 22 It is easy to see tat I = = = z2 /g L l g z z 2 rz rz 2 dz dz 2 L g xrz z 2 r l z 2 dz dz 2 Lxrz 2 + gxr l z 2 dxdz 2, ence using a Taylor expansion, we ave I = Ψ l + /2d L Ψ l+2 g 2 + O g 4 Assume x > x 2 and define Z = n i= {x 2 <X i x } Ten, te random variable Z as a Bin,p distribution wit p = F x F x 2 and mean µ = np It is easy to sow tat, for i =,,6, I i = 2 x 2 i!g l+i+ll+i F x F x 2 g f x f x 2 n iµ i Z dx dx 2 were µ r Z = E [Z E [Z] r ] = m k = E [ Z k] = S m,n = n j= k j= r r j m r j µ j, j j= S k,j n!p j, n j! n j j n j m n! Noting µ Z = and µ 2 Z = nf x F x 2 F x + F x 2, we ave I = and 23 I 2 = L l+2 F x F x 2 fx ng l++2 fx 2 g F x F x 2 F x + F x 2 dx dx 2 = u v L l+2 u v u vrurvdudv ng l++2 g = ng l+ v v/g L l+2 xx gxrv + gxrvdxdv 24

25 Using a Taylor expansion of gxrv + gx and noting xll+2 xdx = L l see condition K3, we ave from 23 I 2 = ng l+ψ L l + O ng l Similar arguments can be used to andle I i = On 2 g l+2 for i = 3, 4 and I i = O n 3 g l+3 for i = 5, 6 Coming back to te last term in 22 and using Dvoretzky- Kiefer-Wolfowitz inequality and condition K3, it is easy to sow tat I 7 =O n2g 7 l+8 Terefore, ] E[ˆΨl g =Ψ l + 2 d LΨ l+2 g Ψ mg l+ll ng l+ll +O g 4 +o ng l+ In order to study te variance of ˆΨ l g, note tat ] 24 V ar [ˆΨl g = were 3 c n,i V l,i i= c n, = c n,2 = c n,3 = 2 m m 3 4 m m 2 m 3 m m 2 m 3 m V l, =V ar [ L l g F n X F n X 2 ] V l,2 =Cov [ L g l F n X F n X 2,L l g F n X 2 F n X 3 ] V l,3 =Cov [ L g l F n X F n X 2,L l g F n X 3 F n X 4 ] Terefore, in order to get an asymptotic expression for te variance of ˆΨ l g, we will start getting asymptotic expressions for te terms 25, 26 and 27 in 24 To deal wit te term 25, we will use 28 V l, = E [ L l g F n X F n X 2 ] 2 E [ 2 L g l F n X F n X 2 ] 25

26 and study separately eac term in te rigt and side of 28 Note tat te expectation of L g l F n X F n X 2 as been already studied wen dealing wit te expectation of ˆΨ l g Next we study te first term in te rigt and side of 28 Using a Taylor expansion, te term: [ L l E g F n X F n X 2 ] 2 [ [ L l = E E g F n X F n X 2 ]] 2 /X,,X n [ ] = E F n x F n yf x f ydxdy = E L l2 g [ L l2 g F n x F n y ] f xf ydxdy can be decomposed in a sum of six terms tat can be bounded easily Te first term in tat decomposition can be rewritten as Ψ g 2l+ C L l + o after applying some g 2l+ canges of variable and a Taylor expansion Te oter terms can be easily bounded using Dvoretzky-Kiefer-Wolfowitz inequality and standard canges of variable Tese bounds and condition B3 prove tat te order of tese terms is o Consequently, g 2l+ V l, = g 2l+Ψ C L l + o Te term 26 can be andled using g 2l+ = g 2l+Ψ C L l Ψ 2 l + o Ψ l + o 2 + o g 2l+ 29 V l,2 =E [ L l g F n X F n X 2 L g l F n X 2 F n X 3 ] E 2 [ L l g F n X F n X 2 ] As for 28, it is only needed to study te first term in te rigt and side of 29 26

27 Note tat E [ L g l F n X F n X 2 L g l F n X 2 F n X 3 ] = E [ E [ L l g = F n X F n X 2 L l g F n X 2 F n X 3 /X,, X n ]] f yf zf tdydzdt E [ L g l F n y F n zl g l F n z F n t ] Taylor expansions, canges of variable, Caucy-Scwarz inequality and Dvoretzky- Kiefer-Wolfowitz inequality, give: = E [ L l g F n X F n X 2 L g l F n X 2 F n X 3 ] r l2 z r z dz + O + O + O + O n n 2 n 3 Consequently, using B3 and 29, V l,2 = O To study te term V l,3 in 27, let us define A l = [ L l g It is easy to sow tat: Now a Taylor expansion gives 3 V ar A l = were n 4 g 2l++4 F n y F n z L l g F y F z ] f yf zdydz V l,3 = V ar A l N V ar T k + k= A l = N k= N T k, k= N Cov T k,t l, F y F z T k = f yf z k!g l+ll+k g k Fn y F n z F y F z dydz, for k =,,N, g l= k l 27

28 T N = ξ N!g l+ll+n n f yf z Fn y F n z F y F z g N dydz, for some positive integer N We will only study eac one of te first N summands in 3 Te rest of tem will be easily bounded using Caucy-Scwarz inequality and te bounds obtained for te first N terms Now te variance of T k is studied First of all, note tat V art k E [ T k 2] = L l+k F y F z g k y,z,y 2,z 2 dy dz dy 2 dz 2 k!g l+k+ L l+k F y 2 F z 2 g 2 fy fz fy 2 fz 2 were k y,z,y 2,z 2 = E { [F n y F n z F y F z ] k [F n y 2 F n z 2 F y 2 F z 2 ] k} Using canges of variable we can rewrite E[Tk 2 ] as follows: E [ T k 2] = = 2 rs rt rs 2 rt 2 L l+k g s t L g l+k s 2 t 2 k! k F s,f t,f s 2,F t 2 ds dt ds 2 dt 2 s2 s s 2 s k! 2 rs rs u rs 2 rs 2 u 2 L g l+k u L l+k g u 2 k F s,f s u,f s 2,F s 2 u 2 du ds du 2 ds 2 Note tat closed expressions for k can be obtained using te expressions for te moments of order r = r,r 2,r 3,r 4,r 5 of Z, a random variable wit multinomial distribution wit parameters n;p,p 2,p 3,p 4,p 5 Based on tese expressions, te condition 28

29 R3 and te use of integration by parts we can rewrite E [T k 2 ] as follows: were E [ T k 2] = s2 s s 2 2l+k u l+k u2 l+k s 2 L g u L g u 2 k! k u,s,u 2,s 2 du ds du 2 ds 2, k u,s,u 2,s 2 = rs rs u rs 2 rs 2 u 2 k F s,f s u,f s 2,F s 2 u 2 Besides, based on te multinomial moments we can sow tat sup z R 4 k z = O n k Tis result and condition R3 allow us to conclude tat V ar Tk E T 2 k = O n, for k < N, wic implies tat V ar k Tk = o n, for 2 k < N A Taylor expansion of order N = 6, gives V ar T 6 = O condition B3, proves V ar T 6 = o n Consequently, ] V ar [ˆΨl g =, wic using n N g 2N+l+ 2 m 2 g 2l+Ψ C L l m + o 2 g 2l+ + O n were Remark 4 If equation 22 is replaced by a tree-term Taylor expansion 2 i= I i+i 3 I3 = E [ L l+3 ζ 3!g l+4 n F n x F x F n x 2 + F x 2 3] f x f x 2 dx dx 2, and ζ n is a value between F x F x 2 and F nx F n x 2, ten I g g 3 = O and we n 3 2 g l+4 would ave to ask for te condition ng 6 to conclude tat I3 = o However, ng l+ tis condition is very restrictive because it is not satisfied by te optimal bandwidt g l wit l =, 2, wic is g l n l+3 We could consider 3 i= I i+i 4 and ten we would need to ask for te condition ng 4 However, tis condition is not satisfied by g l wit 29

30 l = In fact, it follows tat ngl 4 if l = and ng4 l if l = 2, 4, Someting similar appens wen we consider 4 i= I i+i 5 or 5 i= I i+i 6, ie, te condition required in g, it is not satisfied by te optimal bandwidt wen l = Only wen we stop in I 7, te required condition, ng 4 5, it is satisfied for all even l If equation 3 is reconsidered by te mean-value teorem, and ten we consider tat A l = T wit T = ζ g l+2ll+ n [F n y F n z F y F z]f x f x 2 dydz, it follows tat V ar A l = O However, assuming tat g, it is impossible ng 2l+2 to conclude from ere tat V ar A l = o n Acknowledgements Researc supported by Grants BES-23-7 EU ESF support included for te first autor, XUGA PGIDIT3PXIC55-PN for te second autor and BFM and MTM EU ERDF support included for bot autors Te autors would like to tank two anonymous referees and an associate editor wose comments ave elped to improve te paper substantially Tanks are also due to Sonia Pértega Díaz and Francisco Gómez Veiga from te Hospital Juan Canalejo in A Coruña for providing te prostate cancer data set References Amad, IA 22 On moment inequalities of te supremum of empirical processes wit applications to kernel estimation, Statistics & Probability Letters, 57, Cao, R, Janssen, P and Veraverbeke, N 2 Relative density estimation wit censored data, Te Canadian Journal of Statistics, 28, 97 3

31 Cao, R, Janssen, P and Veraverbeke, N 2 Relative density estimation and local bandwidt selection for censored data, Computational Statistics & Data Analysis, 36, Ćwik, J & Mielniczuk, J 993 Data-dependent bandwidt coice for a grade density kernel estimate, Statistics & Probability Letters, 6, Gastwirt, JL 968 Te first-median test a two-sided version of te control median test, Journal of te American Statistical Association, 63, Hall, P and Marron, JS 987 Estimation of integrated squared density derivatives, Statistics & Probability Letters, 6, 9 5 Handcock, M and Janssen, P 22 Statistical inference for te relative density, Sociological Metods & Researc, 3, Hjort, NL and Walker, SG 2 A note on kernel density estimators wit optimal bandwidts, Statistics & Probability Letters, 54, Holmgren, EB 996 Te P-P plot as a metod for comparing treatment effects, Journal of te American Statistical Association, 9, Hsie, F 995 Te empirical process approac for semiparametric two-sample models wit eterogenous treatment effect, Journal of te Royal Statistical Society Series B, 57, Hsie, F and Turnbull, BW 996 Nonparametric and semiparametric estimation of te receiver operating caracteristic curve, Te Annals of Statistics, 24, 25 4 Kakizawa, Y 24 Bernstein polynomial probability density estimation, Journal of Nonparametric Statistics, 6, Li, G, Tiwari, RC and Wells, MT 996 Quantile comparison functions in twosample problems wit applications to comparisons of diagnostic markers, Journal of 3

32 te American Statistical Association, 9, Polansky, AM and Baker, ER 2 Multistage plug-in bandwidt selection for kernel distribution function estimates, Journal of Statistical Computation & Simulation, 65, 63 8 Seater, S J and Jones, M C 99 A reliable data-based bandwidt selection metod for kernel density estimation, Journal of te Royal Statistical Society Series B, 53, Silverman, B W 978 Density ratios, empirical likeliood and cot deat, Applied Statistics, 27, Wand, MP and Jones, MC 995 Kernel Smooting, Capman and Hall, London 32

33 Table Values of EM for SJ, SJ2 and b 3c for models a-c EM Model a Model b Model c n,m SJ SJ2 b 3c SJ SJ2 b 3c SJ SJ2 b 3c 5, , , , , , , , ,

34 6 b a c Fig Plots of te relative densities a-c Fig 2 Relative density estimate of te PC+ group wrt te PC- group for te variables tpsa solid line, SJ2 = 84, cpsa dotted line, SJ2 = 8 and fpsa dased line, SJ2 = 74 34

Verifying Numerical Convergence Rates

Verifying Numerical Convergence Rates 1 Order of accuracy Verifying Numerical Convergence Rates We consider a numerical approximation of an exact value u. Te approximation depends on a small parameter, suc as te grid size or time step, and

More information

FINITE DIFFERENCE METHODS

FINITE DIFFERENCE METHODS FINITE DIFFERENCE METHODS LONG CHEN Te best known metods, finite difference, consists of replacing eac derivative by a difference quotient in te classic formulation. It is simple to code and economic to

More information

Tangent Lines and Rates of Change

Tangent Lines and Rates of Change Tangent Lines and Rates of Cange 9-2-2005 Given a function y = f(x), ow do you find te slope of te tangent line to te grap at te point P(a, f(a))? (I m tinking of te tangent line as a line tat just skims

More information

Derivatives Math 120 Calculus I D Joyce, Fall 2013

Derivatives Math 120 Calculus I D Joyce, Fall 2013 Derivatives Mat 20 Calculus I D Joyce, Fall 203 Since we ave a good understanding of its, we can develop derivatives very quickly. Recall tat we defined te derivative f x of a function f at x to be te

More information

In other words the graph of the polynomial should pass through the points

In other words the graph of the polynomial should pass through the points Capter 3 Interpolation Interpolation is te problem of fitting a smoot curve troug a given set of points, generally as te grap of a function. It is useful at least in data analysis (interpolation is a form

More information

Instantaneous Rate of Change:

Instantaneous Rate of Change: Instantaneous Rate of Cange: Last section we discovered tat te average rate of cange in F(x) can also be interpreted as te slope of a scant line. Te average rate of cange involves te cange in F(x) over

More information

Geometric Stratification of Accounting Data

Geometric Stratification of Accounting Data Stratification of Accounting Data Patricia Gunning * Jane Mary Horgan ** William Yancey *** Abstract: We suggest a new procedure for defining te boundaries of te strata in igly skewed populations, usual

More information

Computer Science and Engineering, UCSD October 7, 1999 Goldreic-Levin Teorem Autor: Bellare Te Goldreic-Levin Teorem 1 Te problem We æx a an integer n for te lengt of te strings involved. If a is an n-bit

More information

The EOQ Inventory Formula

The EOQ Inventory Formula Te EOQ Inventory Formula James M. Cargal Matematics Department Troy University Montgomery Campus A basic problem for businesses and manufacturers is, wen ordering supplies, to determine wat quantity of

More information

Chapter 7 Numerical Differentiation and Integration

Chapter 7 Numerical Differentiation and Integration 45 We ave a abit in writing articles publised in scientiþc journals to make te work as Þnised as possible, to cover up all te tracks, to not worry about te blind alleys or describe ow you ad te wrong idea

More information

CHAPTER 7. Di erentiation

CHAPTER 7. Di erentiation CHAPTER 7 Di erentiation 1. Te Derivative at a Point Definition 7.1. Let f be a function defined on a neigborood of x 0. f is di erentiable at x 0, if te following it exists: f 0 fx 0 + ) fx 0 ) x 0 )=.

More information

Lecture 10: What is a Function, definition, piecewise defined functions, difference quotient, domain of a function

Lecture 10: What is a Function, definition, piecewise defined functions, difference quotient, domain of a function Lecture 10: Wat is a Function, definition, piecewise defined functions, difference quotient, domain of a function A function arises wen one quantity depends on anoter. Many everyday relationsips between

More information

Nonparametric adaptive age replacement with a one-cycle criterion

Nonparametric adaptive age replacement with a one-cycle criterion Nonparametric adaptive age replacement with a one-cycle criterion P. Coolen-Schrijner, F.P.A. Coolen Department of Mathematical Sciences University of Durham, Durham, DH1 3LE, UK e-mail: [email protected]

More information

Distances in random graphs with infinite mean degrees

Distances in random graphs with infinite mean degrees Distances in random graps wit infinite mean degrees Henri van den Esker, Remco van der Hofstad, Gerard Hoogiemstra and Dmitri Znamenski April 26, 2005 Abstract We study random graps wit an i.i.d. degree

More information

Bandwidth Selection for Nonparametric Distribution Estimation

Bandwidth Selection for Nonparametric Distribution Estimation Bandwidth Selection for Nonparametric Distribution Estimation Bruce E. Hansen University of Wisconsin www.ssc.wisc.edu/~bhansen May 2004 Abstract The mean-square efficiency of cumulative distribution function

More information

Math 113 HW #5 Solutions

Math 113 HW #5 Solutions Mat 3 HW #5 Solutions. Exercise.5.6. Suppose f is continuous on [, 5] and te only solutions of te equation f(x) = 6 are x = and x =. If f() = 8, explain wy f(3) > 6. Answer: Suppose we ad tat f(3) 6. Ten

More information

Schedulability Analysis under Graph Routing in WirelessHART Networks

Schedulability Analysis under Graph Routing in WirelessHART Networks Scedulability Analysis under Grap Routing in WirelessHART Networks Abusayeed Saifulla, Dolvara Gunatilaka, Paras Tiwari, Mo Sa, Cenyang Lu, Bo Li Cengjie Wu, and Yixin Cen Department of Computer Science,

More information

An inquiry into the multiplier process in IS-LM model

An inquiry into the multiplier process in IS-LM model An inquiry into te multiplier process in IS-LM model Autor: Li ziran Address: Li ziran, Room 409, Building 38#, Peing University, Beijing 00.87,PRC. Pone: (86) 00-62763074 Internet Address: [email protected]

More information

OPTIMAL DISCONTINUOUS GALERKIN METHODS FOR THE ACOUSTIC WAVE EQUATION IN HIGHER DIMENSIONS

OPTIMAL DISCONTINUOUS GALERKIN METHODS FOR THE ACOUSTIC WAVE EQUATION IN HIGHER DIMENSIONS OPTIMAL DISCONTINUOUS GALERKIN METHODS FOR THE ACOUSTIC WAVE EQUATION IN HIGHER DIMENSIONS ERIC T. CHUNG AND BJÖRN ENGQUIST Abstract. In tis paper, we developed and analyzed a new class of discontinuous

More information

SAT Subject Math Level 1 Facts & Formulas

SAT Subject Math Level 1 Facts & Formulas Numbers, Sequences, Factors Integers:..., -3, -2, -1, 0, 1, 2, 3,... Reals: integers plus fractions, decimals, and irrationals ( 2, 3, π, etc.) Order Of Operations: Aritmetic Sequences: PEMDAS (Parenteses

More information

ACT Math Facts & Formulas

ACT Math Facts & Formulas Numbers, Sequences, Factors Integers:..., -3, -2, -1, 0, 1, 2, 3,... Rationals: fractions, tat is, anyting expressable as a ratio of integers Reals: integers plus rationals plus special numbers suc as

More information

Solutions by: KARATUĞ OZAN BiRCAN. PROBLEM 1 (20 points): Let D be a region, i.e., an open connected set in

Solutions by: KARATUĞ OZAN BiRCAN. PROBLEM 1 (20 points): Let D be a region, i.e., an open connected set in KOÇ UNIVERSITY, SPRING 2014 MATH 401, MIDTERM-1, MARCH 3 Instructor: BURAK OZBAGCI TIME: 75 Minutes Solutions by: KARATUĞ OZAN BiRCAN PROBLEM 1 (20 points): Let D be a region, i.e., an open connected set

More information

2 Limits and Derivatives

2 Limits and Derivatives 2 Limits and Derivatives 2.7 Tangent Lines, Velocity, and Derivatives A tangent line to a circle is a line tat intersects te circle at exactly one point. We would like to take tis idea of tangent line

More information

Optimized Data Indexing Algorithms for OLAP Systems

Optimized Data Indexing Algorithms for OLAP Systems Database Systems Journal vol. I, no. 2/200 7 Optimized Data Indexing Algoritms for OLAP Systems Lucian BORNAZ Faculty of Cybernetics, Statistics and Economic Informatics Academy of Economic Studies, Bucarest

More information

Can a Lump-Sum Transfer Make Everyone Enjoy the Gains. from Free Trade?

Can a Lump-Sum Transfer Make Everyone Enjoy the Gains. from Free Trade? Can a Lump-Sum Transfer Make Everyone Enjoy te Gains from Free Trade? Yasukazu Icino Department of Economics, Konan University June 30, 2010 Abstract I examine lump-sum transfer rules to redistribute te

More information

Pre-trial Settlement with Imperfect Private Monitoring

Pre-trial Settlement with Imperfect Private Monitoring Pre-trial Settlement wit Imperfect Private Monitoring Mostafa Beskar University of New Hampsire Jee-Hyeong Park y Seoul National University July 2011 Incomplete, Do Not Circulate Abstract We model pretrial

More information

Lecture 13: Martingales

Lecture 13: Martingales Lecture 13: Martingales 1. Definition of a Martingale 1.1 Filtrations 1.2 Definition of a martingale and its basic properties 1.3 Sums of independent random variables and related models 1.4 Products of

More information

Section 3.3. Differentiation of Polynomials and Rational Functions. Difference Equations to Differential Equations

Section 3.3. Differentiation of Polynomials and Rational Functions. Difference Equations to Differential Equations Difference Equations to Differential Equations Section 3.3 Differentiation of Polynomials an Rational Functions In tis section we begin te task of iscovering rules for ifferentiating various classes of

More information

TRADING AWAY WIDE BRANDS FOR CHEAP BRANDS. Swati Dhingra London School of Economics and CEP. Online Appendix

TRADING AWAY WIDE BRANDS FOR CHEAP BRANDS. Swati Dhingra London School of Economics and CEP. Online Appendix TRADING AWAY WIDE BRANDS FOR CHEAP BRANDS Swati Dingra London Scool of Economics and CEP Online Appendix APPENDIX A. THEORETICAL & EMPIRICAL RESULTS A.1. CES and Logit Preferences: Invariance of Innovation

More information

Sections 3.1/3.2: Introducing the Derivative/Rules of Differentiation

Sections 3.1/3.2: Introducing the Derivative/Rules of Differentiation Sections 3.1/3.2: Introucing te Derivative/Rules of Differentiation 1 Tangent Line Before looking at te erivative, refer back to Section 2.1, looking at average velocity an instantaneous velocity. Here

More information

The Derivative as a Function

The Derivative as a Function Section 2.2 Te Derivative as a Function 200 Kiryl Tsiscanka Te Derivative as a Function DEFINITION: Te derivative of a function f at a number a, denoted by f (a), is if tis limit exists. f (a) f(a+) f(a)

More information

Research on the Anti-perspective Correction Algorithm of QR Barcode

Research on the Anti-perspective Correction Algorithm of QR Barcode Researc on te Anti-perspective Correction Algoritm of QR Barcode Jianua Li, Yi-Wen Wang, YiJun Wang,Yi Cen, Guoceng Wang Key Laboratory of Electronic Tin Films and Integrated Devices University of Electronic

More information

Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics

Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics For 2015 Examinations Aim The aim of the Probability and Mathematical Statistics subject is to provide a grounding in

More information

College Planning Using Cash Value Life Insurance

College Planning Using Cash Value Life Insurance College Planning Using Cas Value Life Insurance CAUTION: Te advisor is urged to be extremely cautious of anoter college funding veicle wic provides a guaranteed return of premium immediately if funded

More information

Comparison between two approaches to overload control in a Real Server: local or hybrid solutions?

Comparison between two approaches to overload control in a Real Server: local or hybrid solutions? Comparison between two approaces to overload control in a Real Server: local or ybrid solutions? S. Montagna and M. Pignolo Researc and Development Italtel S.p.A. Settimo Milanese, ITALY Abstract Tis wor

More information

Catalogue no. 12-001-XIE. Survey Methodology. December 2004

Catalogue no. 12-001-XIE. Survey Methodology. December 2004 Catalogue no. 1-001-XIE Survey Metodology December 004 How to obtain more information Specific inquiries about tis product and related statistics or services sould be directed to: Business Survey Metods

More information

SAMPLE DESIGN FOR THE TERRORISM RISK INSURANCE PROGRAM SURVEY

SAMPLE DESIGN FOR THE TERRORISM RISK INSURANCE PROGRAM SURVEY ASA Section on Survey Researc Metods SAMPLE DESIG FOR TE TERRORISM RISK ISURACE PROGRAM SURVEY G. ussain Coudry, Westat; Mats yfjäll, Statisticon; and Marianne Winglee, Westat G. ussain Coudry, Westat,

More information

Note nine: Linear programming CSE 101. 1 Linear constraints and objective functions. 1.1 Introductory example. Copyright c Sanjoy Dasgupta 1

Note nine: Linear programming CSE 101. 1 Linear constraints and objective functions. 1.1 Introductory example. Copyright c Sanjoy Dasgupta 1 Copyrigt c Sanjoy Dasgupta Figure. (a) Te feasible region for a linear program wit two variables (see tet for details). (b) Contour lines of te objective function: for different values of (profit). Te

More information

SAT Math Must-Know Facts & Formulas

SAT Math Must-Know Facts & Formulas SAT Mat Must-Know Facts & Formuas Numbers, Sequences, Factors Integers:..., -3, -2, -1, 0, 1, 2, 3,... Rationas: fractions, tat is, anyting expressabe as a ratio of integers Reas: integers pus rationas

More information

ON THE EXISTENCE AND LIMIT BEHAVIOR OF THE OPTIMAL BANDWIDTH FOR KERNEL DENSITY ESTIMATION

ON THE EXISTENCE AND LIMIT BEHAVIOR OF THE OPTIMAL BANDWIDTH FOR KERNEL DENSITY ESTIMATION Statistica Sinica 17(27), 289-3 ON THE EXISTENCE AND LIMIT BEHAVIOR OF THE OPTIMAL BANDWIDTH FOR KERNEL DENSITY ESTIMATION J. E. Chacón, J. Montanero, A. G. Nogales and P. Pérez Universidad de Extremadura

More information

To motivate the notion of a variogram for a covariance stationary process, { Ys ( ): s R}

To motivate the notion of a variogram for a covariance stationary process, { Ys ( ): s R} 4. Variograms Te covariogram and its normalized form, te correlogram, are by far te most intuitive metods for summarizing te structure of spatial dependencies in a covariance stationary process. However,

More information

Notes: Most of the material in this chapter is taken from Young and Freedman, Chap. 12.

Notes: Most of the material in this chapter is taken from Young and Freedman, Chap. 12. Capter 6. Fluid Mecanics Notes: Most of te material in tis capter is taken from Young and Freedman, Cap. 12. 6.1 Fluid Statics Fluids, i.e., substances tat can flow, are te subjects of tis capter. But

More information

MATHEMATICS FOR ENGINEERING DIFFERENTIATION TUTORIAL 1 - BASIC DIFFERENTIATION

MATHEMATICS FOR ENGINEERING DIFFERENTIATION TUTORIAL 1 - BASIC DIFFERENTIATION MATHEMATICS FOR ENGINEERING DIFFERENTIATION TUTORIAL 1 - BASIC DIFFERENTIATION Tis tutorial is essential pre-requisite material for anyone stuing mecanical engineering. Tis tutorial uses te principle of

More information

2.1: The Derivative and the Tangent Line Problem

2.1: The Derivative and the Tangent Line Problem .1.1.1: Te Derivative and te Tangent Line Problem Wat is te deinition o a tangent line to a curve? To answer te diiculty in writing a clear deinition o a tangent line, we can deine it as te iting position

More information

Compute the derivative by definition: The four step procedure

Compute the derivative by definition: The four step procedure Compute te derivative by definition: Te four step procedure Given a function f(x), te definition of f (x), te derivative of f(x), is lim 0 f(x + ) f(x), provided te limit exists Te derivative function

More information

f(x) f(a) x a Our intuition tells us that the slope of the tangent line to the curve at the point P is m P Q =

f(x) f(a) x a Our intuition tells us that the slope of the tangent line to the curve at the point P is m P Q = Lecture 6 : Derivatives and Rates of Cange In tis section we return to te problem of finding te equation of a tangent line to a curve, y f(x) If P (a, f(a)) is a point on te curve y f(x) and Q(x, f(x))

More information

The modelling of business rules for dashboard reporting using mutual information

The modelling of business rules for dashboard reporting using mutual information 8 t World IMACS / MODSIM Congress, Cairns, Australia 3-7 July 2009 ttp://mssanz.org.au/modsim09 Te modelling of business rules for dasboard reporting using mutual information Gregory Calbert Command, Control,

More information

MATH4427 Notebook 2 Spring 2016. 2 MATH4427 Notebook 2 3. 2.1 Definitions and Examples... 3. 2.2 Performance Measures for Estimators...

MATH4427 Notebook 2 Spring 2016. 2 MATH4427 Notebook 2 3. 2.1 Definitions and Examples... 3. 2.2 Performance Measures for Estimators... MATH4427 Notebook 2 Spring 2016 prepared by Professor Jenny Baglivo c Copyright 2009-2016 by Jenny A. Baglivo. All Rights Reserved. Contents 2 MATH4427 Notebook 2 3 2.1 Definitions and Examples...................................

More information

Improved dynamic programs for some batcing problems involving te maximum lateness criterion A P M Wagelmans Econometric Institute Erasmus University Rotterdam PO Box 1738, 3000 DR Rotterdam Te Neterlands

More information

Theoretical calculation of the heat capacity

Theoretical calculation of the heat capacity eoretical calculation of te eat capacity Principle of equipartition of energy Heat capacity of ideal and real gases Heat capacity of solids: Dulong-Petit, Einstein, Debye models Heat capacity of metals

More information

Chapter 10: Refrigeration Cycles

Chapter 10: Refrigeration Cycles Capter 10: efrigeration Cycles Te vapor compression refrigeration cycle is a common metod for transferring eat from a low temperature to a ig temperature. Te above figure sows te objectives of refrigerators

More information

How To Ensure That An Eac Edge Program Is Successful

How To Ensure That An Eac Edge Program Is Successful Introduction Te Economic Diversification and Growt Enterprises Act became effective on 1 January 1995. Te creation of tis Act was to encourage new businesses to start or expand in Newfoundland and Labrador.

More information

Welfare, financial innovation and self insurance in dynamic incomplete markets models

Welfare, financial innovation and self insurance in dynamic incomplete markets models Welfare, financial innovation and self insurance in dynamic incomplete markets models Paul Willen Department of Economics Princeton University First version: April 998 Tis version: July 999 Abstract We

More information

E3: PROBABILITY AND STATISTICS lecture notes

E3: PROBABILITY AND STATISTICS lecture notes E3: PROBABILITY AND STATISTICS lecture notes 2 Contents 1 PROBABILITY THEORY 7 1.1 Experiments and random events............................ 7 1.2 Certain event. Impossible event............................

More information

Writing Mathematics Papers

Writing Mathematics Papers Writing Matematics Papers Tis essay is intended to elp your senior conference paper. It is a somewat astily produced amalgam of advice I ave given to students in my PDCs (Mat 4 and Mat 9), so it s not

More information

Math Test Sections. The College Board: Expanding College Opportunity

Math Test Sections. The College Board: Expanding College Opportunity Taking te SAT I: Reasoning Test Mat Test Sections Te materials in tese files are intended for individual use by students getting ready to take an SAT Program test; permission for any oter use must be sougt

More information

- 1 - Handout #22 May 23, 2012 Huffman Encoding and Data Compression. CS106B Spring 2012. Handout by Julie Zelenski with minor edits by Keith Schwarz

- 1 - Handout #22 May 23, 2012 Huffman Encoding and Data Compression. CS106B Spring 2012. Handout by Julie Zelenski with minor edits by Keith Schwarz CS106B Spring 01 Handout # May 3, 01 Huffman Encoding and Data Compression Handout by Julie Zelenski wit minor edits by Keit Scwarz In te early 1980s, personal computers ad ard disks tat were no larger

More information

2.23 Gambling Rehabilitation Services. Introduction

2.23 Gambling Rehabilitation Services. Introduction 2.23 Gambling Reabilitation Services Introduction Figure 1 Since 1995 provincial revenues from gambling activities ave increased over 56% from $69.2 million in 1995 to $108 million in 2004. Te majority

More information

Keskustelualoitteita #65 Joensuun yliopisto, Taloustieteet. Market effiency in Finnish harness horse racing. Niko Suhonen

Keskustelualoitteita #65 Joensuun yliopisto, Taloustieteet. Market effiency in Finnish harness horse racing. Niko Suhonen Keskustelualoitteita #65 Joensuun yliopisto, Taloustieteet Market effiency in Finnis arness orse racing Niko Suonen ISBN 978-952-219-283-7 ISSN 1795-7885 no 65 Market Efficiency in Finnis Harness Horse

More information

New Vocabulary volume

New Vocabulary volume -. Plan Objectives To find te volume of a prism To find te volume of a cylinder Examples Finding Volume of a Rectangular Prism Finding Volume of a Triangular Prism 3 Finding Volume of a Cylinder Finding

More information

A Multigrid Tutorial part two

A Multigrid Tutorial part two A Multigrid Tutorial part two William L. Briggs Department of Matematics University of Colorado at Denver Van Emden Henson Center for Applied Scientific Computing Lawrence Livermore National Laboratory

More information

Høgskolen i Narvik Sivilingeniørutdanningen STE6237 ELEMENTMETODER. Oppgaver

Høgskolen i Narvik Sivilingeniørutdanningen STE6237 ELEMENTMETODER. Oppgaver Høgskolen i Narvik Sivilingeniørutdanningen STE637 ELEMENTMETODER Oppgaver Klasse: 4.ID, 4.IT Ekstern Professor: Gregory A. Chechkin e-mail: [email protected] Narvik 6 PART I Task. Consider two-point

More information

A SURVEY ON CONTINUOUS ELLIPTICAL VECTOR DISTRIBUTIONS

A SURVEY ON CONTINUOUS ELLIPTICAL VECTOR DISTRIBUTIONS A SURVEY ON CONTINUOUS ELLIPTICAL VECTOR DISTRIBUTIONS Eusebio GÓMEZ, Miguel A. GÓMEZ-VILLEGAS and J. Miguel MARÍN Abstract In this paper it is taken up a revision and characterization of the class of

More information

by Siegfried Heiler 4 Problems of simple kernel estimation and restricted approaches

by Siegfried Heiler 4 Problems of simple kernel estimation and restricted approaches 1 A Survey on Nonparametric Time Series Analysis 1Introduction by Siegfried Heiler 2 Nonparametric regression 3 Kernel estimation in time series 4 Problems of simple kernel estimation and restricted approaces

More information

Probability and Statistics Prof. Dr. Somesh Kumar Department of Mathematics Indian Institute of Technology, Kharagpur

Probability and Statistics Prof. Dr. Somesh Kumar Department of Mathematics Indian Institute of Technology, Kharagpur Probability and Statistics Prof. Dr. Somesh Kumar Department of Mathematics Indian Institute of Technology, Kharagpur Module No. #01 Lecture No. #15 Special Distributions-VI Today, I am going to introduce

More information

M(0) = 1 M(1) = 2 M(h) = M(h 1) + M(h 2) + 1 (h > 1)

M(0) = 1 M(1) = 2 M(h) = M(h 1) + M(h 2) + 1 (h > 1) Insertion and Deletion in VL Trees Submitted in Partial Fulfillment of te Requirements for Dr. Eric Kaltofen s 66621: nalysis of lgoritms by Robert McCloskey December 14, 1984 1 ackground ccording to Knut

More information

Cyber Epidemic Models with Dependences

Cyber Epidemic Models with Dependences Cyber Epidemic Models wit Dependences Maocao Xu 1, Gaofeng Da 2 and Souuai Xu 3 1 Department of Matematics, Illinois State University [email protected] 2 Institute for Cyber Security, University of Texas

More information

A Uniform Asymptotic Estimate for Discounted Aggregate Claims with Subexponential Tails

A Uniform Asymptotic Estimate for Discounted Aggregate Claims with Subexponential Tails 12th International Congress on Insurance: Mathematics and Economics July 16-18, 2008 A Uniform Asymptotic Estimate for Discounted Aggregate Claims with Subexponential Tails XUEMIAO HAO (Based on a joint

More information

Pretrial Settlement with Imperfect Private Monitoring

Pretrial Settlement with Imperfect Private Monitoring Pretrial Settlement wit Imperfect Private Monitoring Mostafa Beskar Indiana University Jee-Hyeong Park y Seoul National University April, 2016 Extremely Preliminary; Please Do Not Circulate. Abstract We

More information

A New Cement to Glue Nonconforming Grids with Robin Interface Conditions: The Finite Element Case

A New Cement to Glue Nonconforming Grids with Robin Interface Conditions: The Finite Element Case A New Cement to Glue Nonconforming Grids wit Robin Interface Conditions: Te Finite Element Case Martin J. Gander, Caroline Japet 2, Yvon Maday 3, and Frédéric Nataf 4 McGill University, Dept. of Matematics

More information

Bonferroni-Based Size-Correction for Nonstandard Testing Problems

Bonferroni-Based Size-Correction for Nonstandard Testing Problems Bonferroni-Based Size-Correction for Nonstandard Testing Problems Adam McCloskey Brown University October 2011; Tis Version: October 2012 Abstract We develop powerful new size-correction procedures for

More information

Staffing and routing in a two-tier call centre. Sameer Hasija*, Edieal J. Pinker and Robert A. Shumsky

Staffing and routing in a two-tier call centre. Sameer Hasija*, Edieal J. Pinker and Robert A. Shumsky 8 Int. J. Operational Researc, Vol. 1, Nos. 1/, 005 Staffing and routing in a two-tier call centre Sameer Hasija*, Edieal J. Pinker and Robert A. Sumsky Simon Scool, University of Rocester, Rocester 1467,

More information

Lecture 8: More Continuous Random Variables

Lecture 8: More Continuous Random Variables Lecture 8: More Continuous Random Variables 26 September 2005 Last time: the eponential. Going from saying the density e λ, to f() λe λ, to the CDF F () e λ. Pictures of the pdf and CDF. Today: the Gaussian

More information

OPTIMAL FLEET SELECTION FOR EARTHMOVING OPERATIONS

OPTIMAL FLEET SELECTION FOR EARTHMOVING OPERATIONS New Developments in Structural Engineering and Construction Yazdani, S. and Sing, A. (eds.) ISEC-7, Honolulu, June 18-23, 2013 OPTIMAL FLEET SELECTION FOR EARTHMOVING OPERATIONS JIALI FU 1, ERIK JENELIUS

More information

WORKING PAPER SERIES THE INFORMATIONAL CONTENT OF OVER-THE-COUNTER CURRENCY OPTIONS NO. 366 / JUNE 2004. by Peter Christoffersen and Stefano Mazzotta

WORKING PAPER SERIES THE INFORMATIONAL CONTENT OF OVER-THE-COUNTER CURRENCY OPTIONS NO. 366 / JUNE 2004. by Peter Christoffersen and Stefano Mazzotta WORKING PAPER SERIES NO. 366 / JUNE 24 THE INFORMATIONAL CONTENT OF OVER-THE-COUNTER CURRENCY OPTIONS by Peter Cristoffersen and Stefano Mazzotta WORKING PAPER SERIES NO. 366 / JUNE 24 THE INFORMATIONAL

More information

What is Advanced Corporate Finance? What is finance? What is Corporate Finance? Deciding how to optimally manage a firm s assets and liabilities.

What is Advanced Corporate Finance? What is finance? What is Corporate Finance? Deciding how to optimally manage a firm s assets and liabilities. Wat is? Spring 2008 Note: Slides are on te web Wat is finance? Deciding ow to optimally manage a firm s assets and liabilities. Managing te costs and benefits associated wit te timing of cas in- and outflows

More information

f(x + h) f(x) h as representing the slope of a secant line. As h goes to 0, the slope of the secant line approaches the slope of the tangent line.

f(x + h) f(x) h as representing the slope of a secant line. As h goes to 0, the slope of the secant line approaches the slope of the tangent line. Derivative of f(z) Dr. E. Jacobs Te erivative of a function is efine as a limit: f (x) 0 f(x + ) f(x) We can visualize te expression f(x+) f(x) as representing te slope of a secant line. As goes to 0,

More information

Equilibria in sequential bargaining games as solutions to systems of equations

Equilibria in sequential bargaining games as solutions to systems of equations Economics Letters 84 (2004) 407 411 www.elsevier.com/locate/econbase Equilibria in sequential bargaining games as solutions to systems of equations Tasos Kalandrakis* Department of Political Science, Yale

More information

Training Robust Support Vector Regression via D. C. Program

Training Robust Support Vector Regression via D. C. Program Journal of Information & Computational Science 7: 12 (2010) 2385 2394 Available at ttp://www.joics.com Training Robust Support Vector Regression via D. C. Program Kuaini Wang, Ping Zong, Yaoong Zao College

More information

6. Differentiating the exponential and logarithm functions

6. Differentiating the exponential and logarithm functions 1 6. Differentiating te exponential and logaritm functions We wis to find and use derivatives for functions of te form f(x) = a x, were a is a constant. By far te most convenient suc function for tis purpose

More information

Metric Spaces. Chapter 7. 7.1. Metrics

Metric Spaces. Chapter 7. 7.1. Metrics Chapter 7 Metric Spaces A metric space is a set X that has a notion of the distance d(x, y) between every pair of points x, y X. The purpose of this chapter is to introduce metric spaces and give some

More information

The Method of Least Squares

The Method of Least Squares The Method of Least Squares Steven J. Miller Mathematics Department Brown University Providence, RI 0292 Abstract The Method of Least Squares is a procedure to determine the best fit line to data; the

More information

BANACH AND HILBERT SPACE REVIEW

BANACH AND HILBERT SPACE REVIEW BANACH AND HILBET SPACE EVIEW CHISTOPHE HEIL These notes will briefly review some basic concepts related to the theory of Banach and Hilbert spaces. We are not trying to give a complete development, but

More information

Average and Instantaneous Rates of Change: The Derivative

Average and Instantaneous Rates of Change: The Derivative 9.3 verage and Instantaneous Rates of Cange: Te Derivative 609 OBJECTIVES 9.3 To define and find average rates of cange To define te derivative as a rate of cange To use te definition of derivative to

More information

Tis Problem and Retail Inventory Management

Tis Problem and Retail Inventory Management Optimizing Inventory Replenisment of Retail Fasion Products Marsall Fiser Kumar Rajaram Anant Raman Te Warton Scool, University of Pennsylvania, 3620 Locust Walk, 3207 SH-DH, Piladelpia, Pennsylvania 19104-6366

More information

1.6. Analyse Optimum Volume and Surface Area. Maximum Volume for a Given Surface Area. Example 1. Solution

1.6. Analyse Optimum Volume and Surface Area. Maximum Volume for a Given Surface Area. Example 1. Solution 1.6 Analyse Optimum Volume and Surface Area Estimation and oter informal metods of optimizing measures suc as surface area and volume often lead to reasonable solutions suc as te design of te tent in tis

More information

5. Linear Regression

5. Linear Regression 5. Linear Regression Outline.................................................................... 2 Simple linear regression 3 Linear model............................................................. 4

More information

Sums of Independent Random Variables

Sums of Independent Random Variables Chapter 7 Sums of Independent Random Variables 7.1 Sums of Discrete Random Variables In this chapter we turn to the important question of determining the distribution of a sum of independent random variables

More information

Taylor and Maclaurin Series

Taylor and Maclaurin Series Taylor and Maclaurin Series In the preceding section we were able to find power series representations for a certain restricted class of functions. Here we investigate more general problems: Which functions

More information

The Heat Equation. Lectures INF2320 p. 1/88

The Heat Equation. Lectures INF2320 p. 1/88 The Heat Equation Lectures INF232 p. 1/88 Lectures INF232 p. 2/88 The Heat Equation We study the heat equation: u t = u xx for x (,1), t >, (1) u(,t) = u(1,t) = for t >, (2) u(x,) = f(x) for x (,1), (3)

More information

A strong credit score can help you score a lower rate on a mortgage

A strong credit score can help you score a lower rate on a mortgage NET GAIN Scoring points for your financial future AS SEEN IN USA TODAY S MONEY SECTION, JULY 3, 2007 A strong credit score can elp you score a lower rate on a mortgage By Sandra Block Sales of existing

More information

MULTIVARIATE PROBABILITY DISTRIBUTIONS

MULTIVARIATE PROBABILITY DISTRIBUTIONS MULTIVARIATE PROBABILITY DISTRIBUTIONS. PRELIMINARIES.. Example. Consider an experiment that consists of tossing a die and a coin at the same time. We can consider a number of random variables defined

More information