arXiv:0908.3095v1 [math.ST] 21 Aug 2009

The Annals of Statistics 2009, Vol. 37, No. 5A, DOI: /08-AOS640 © Institute of Mathematical Statistics, 2009

ESTIMATING THE DEGREE OF ACTIVITY OF JUMPS IN HIGH FREQUENCY DATA

By Yacine Aït-Sahalia¹ and Jean Jacod

Princeton University and UPMC (Université Paris-6)

We define a generalized index of jump activity, propose estimators of that index for a discretely sampled process and derive the estimators' properties. These estimators are applicable despite the presence of Brownian volatility in the process, which makes it more challenging to infer the characteristics of the small, infinite activity jumps. When the method is applied to high frequency stock returns, we find evidence of infinitely active jumps in the data and estimate their index of activity.

1. Introduction. Using high frequency financial data, which are now widely available, we can hope to answer a number of questions regarding the characteristics of the process that drives asset returns. Let us model the log-price $X$ of some asset as a 1-dimensional process, which we will observe over a fixed time interval $[0,T]$ at discrete times $0, \Delta_n, 2\Delta_n, \ldots$ with a time interval $\Delta_n$ between successive observations that is small. This is the essence of high frequency data. Let us further assume that this process is an Itô semimartingale, meaning that its characteristics are absolutely continuous with respect to Lebesgue measure. So, it has a drift, a continuous martingale part that is the integral of a possibly stochastic process with respect to a Brownian motion, and we will also let it have jumps with a possibly stochastic Lévy measure.

For modeling purposes, one would like to infer the characteristics of $X$ from the observations; that is, its drift, volatility and Lévy measure. When the time interval $\Delta_n$ goes to 0, it is well known that one can consistently infer the volatility under very weak assumptions. However, such consistent inference is impossible for the drift or the Lévy measure, if the overall time interval $[0,T]$ is kept fixed.
Received March 2008; revised July.

¹Supported in part by NSF Grant DMS.

AMS 2000 subject classifications. Primary 62F12, 62M05; secondary 60H10, 60J60.

Key words and phrases. Jumps, index of activity, infinite activity, discrete sampling, high frequency.

This is an electronic reprint of the original article published by the Institute of Mathematical Statistics in The Annals of Statistics, 2009, Vol. 37, No. 5A. This reprint differs from the original in pagination and typographic detail.

In fact, even in the unrealistic case where the whole path of $X$ is observed over $[0,T]$, one can infer neither the drift nor the Lévy measure. One can, however, hope to characterize the behavior of the Lévy measure near 0: first, whether it does not explode near 0, meaning that the number of jumps is finite; and second, when this number is infinite, we would like to be able to say something about the concentration of small jumps. Our objective in doing so is to provide specification tools for financial models, where the presence or at least possibility of large jumps is generally accepted. There is much less consensus in the literature regarding the nature or even the need for small jumps.

For this purpose, let us define, for a generic semimartingale $X$,

$$B(r)_t = \sum_{s \le t} |\Delta X_s|^r, \qquad I_t = \{r \ge 0 : B(r)_t < \infty\}, \qquad \beta_t = \inf(I_t), \tag{1}$$

where $\Delta X_s = X_s - X_{s-}$ is the size of the jump at time $s$, and $r \ge 0$, with the convention $0^0 = 0$. Necessarily, the (random) set $I_t$ contains the interval $(\beta_t, \infty)$, whereas it may or may not contain $\beta_t$ itself. Moreover, $2 \in I_t$ always, and, of course, $t \mapsto \beta_t$ is nondecreasing. Hence, if we observe the whole path of $X$ over $[0,T]$, we know the sets $I_t(\omega)$ and the numbers $\beta_t(\omega)$ for all $t \le T$. We call $\beta_T(\omega)$ the jump activity index for the path $t \mapsto X_t(\omega)$ at time $T$ (or, more precisely, up to time $T$).

We define this index in analogy with the special case where $X$ is a Lévy process. In this case, $I_t$ and $\beta_t$ are no longer random. Further, they do not depend on the time $t$, and $I_t$ is also the set of all $r \ge 0$ such that $\int_{\{|x| \le 1\}} |x|^r F(dx) < \infty$, where $F$ is the Lévy measure. This property shows that, for a Lévy process, the jump activity index coincides with the Blumenthal–Getoor index of the process [see Blumenthal and Getoor (1961)]. In the further special case where $X$ is a stable process, $\beta$ is also the stable index of the process.
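As a quick numerical illustration of the Lévy-process characterization above (a sketch added here, not part of the original paper), take a hypothetical stable-like Lévy density $c/|x|^{1+\beta}$ on $[-1,1]\setminus\{0\}$. The truncated integral of $|x|^r$ over $\{\varepsilon < |x| \le 1\}$ then has the closed form $2c(1-\varepsilon^{r-\beta})/(r-\beta)$ for $r \ne \beta$, which stays bounded as $\varepsilon \to 0$ exactly when $r > \beta$. The function name and the constant `c` are illustrative assumptions.

```python
import math

def truncated_moment(r, beta, eps, c=1.0):
    """Integral of |x|**r * c/|x|**(1+beta) over eps < |x| <= 1 (closed form)."""
    if r == beta:
        # Borderline case: logarithmic divergence as eps -> 0.
        return 2.0 * c * math.log(1.0 / eps)
    return 2.0 * c * (1.0 - eps ** (r - beta)) / (r - beta)

beta = 1.2
for r in (1.0, 1.5):
    # r < beta: values blow up as eps shrinks; r > beta: they converge.
    values = [truncated_moment(r, beta, 10.0 ** (-k)) for k in (2, 4, 8)]
    print(r, values)
```

For $r = 1.0 < \beta$ the printed values grow without bound as $\varepsilon$ decreases, while for $r = 1.5 > \beta$ they settle near $2c/(r-\beta)$, in line with $\beta = \inf(I)$.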
When $X$ is a Lévy process, the interval $I$ and the index $\beta$ are, of course, only tiny elements of the whole Lévy measure $F$, and they convey approximately the same information ($I$ gives slightly more information than $\beta$). However, the value of $\beta$ is probably the most informative knowledge one can draw about $F$ from the observation of the path $t \mapsto X_t$ for all $t \le T$ when $T$ is finite. Things are very different when $T \to \infty$, though, since observing $X$ over $[0,\infty)$ completely specifies $F$. But when the time horizon $T$ is kept fixed, and with the whole path observed over $[0,T]$, we can infer only the behavior of the Lévy measure $F$ near 0 (because we need a potentially infinite number of observations for consistent estimation). Then $\beta$ captures an essential qualitative feature of $F$, namely its level of activity: as $\beta$ increases, the (small) jumps tend to become more and more frequent.

$\beta$ is related to the degree of activity of jumps. All Lévy measures put finite mass on the set $(-\infty,-\varepsilon] \cup [\varepsilon,+\infty)$ for any arbitrary $\varepsilon > 0$; therefore, if the process has infinite jump activity, then it must be because of the small jumps, which are defined as those smaller than $\varepsilon$. If $F([-\varepsilon,\varepsilon]) < \infty$, then the process has finite activity and $0 \in I$, so that $\beta = 0$. But if $F([-\varepsilon,\varepsilon]) = \infty$, then the process has infinite activity, and, in addition, $\beta > 0$ as long as the Lévy measure diverges near 0 at a rate larger than a power $\varepsilon^{-a}$ for some $a > 0$. The higher $\beta$ gets (up to 2), the more active the small jumps become. The same remarks also apply for general semimartingales. These properties are what motivate our calling $\beta$ a jump activity index and our interest in estimating it.

In the more realistic situation where the semimartingale $X$ is only observed at times $i\Delta_n$ over $[0,T]$, the estimation problem is made more challenging by the presence in $X$ of a continuous martingale part. By its very nature, $\beta_T$ characterizes the behavior of $F$ near 0. Hence, it is natural to expect that the small increments of the process are going to be the ones that are most informative about $\beta_T$. But those small increments are precisely the ones where the contribution from the continuous martingale part of the process is inexorably mixed with the contribution from the small jumps. Being able to see through the continuous part of the semimartingale in order to say something about the number and concentration of small jumps is going to be the challenge we face as we attempt to estimate $\beta_T$.

Related to this paper are Woerner (2006), who proposes an estimator of the jump activity index and of the Hurst exponent, but in the absence of a continuous Brownian part to the semimartingale, and Cont and Mancini (2007), who propose a test for the finiteness of the variation of the jump part. Also, Belomestny (2008) estimates the same index when there is no Brownian part and when, together with the prices, some option prices also are recorded. Another related problem is the estimation of the index $\beta$ of a stable process [see, e.g., DuMouchel (1983)].
However, our situation here is fundamentally different from those, in that we also have a continuous part in the semimartingale. The situation is also different from that in Aït-Sahalia and Jacod (2008), where we studied Fisher's information for the parameters of a Brownian plus stable pure jump process, but the jump process was the dominant component in that paper. Here, the continuous part of the semimartingale dominates the small increments, and we estimate the activity index of a pure jump process where the dominant component is a Brownian motion.

The aim of this paper is to construct estimators $\hat\beta_n(T)$ for $\beta_T$, which are consistent when $\Delta_n \to 0$, and to provide rates of convergence and asymptotic distributions. Ideally, we would also like to have estimators that are, as much as possible, model-free, in the sense that they behave well without too strong assumptions on the form of the drift, the volatility or the Lévy measure. As it turns out, a fully model-free behavior of the estimators may be too much to ask. The assumptions we make below on the drift and the volatility process are quite unrestrictive, but obtaining rates of convergence will require more specific assumptions on the Lévy measure. In particular, we will assume that the main part of the Lévy measure near 0 behaves locally like the Lévy measure of a stable process, and we will provide estimators and their properties when the index is $\beta > 0$. This assumption seems to be unavoidable, since, as we shall also see, even when $X$ is a Lévy process, strong assumptions on the Lévy measure are necessary. At this juncture, it may be worth noting that considering semimartingales rather than simply Lévy processes, or exponentials of such, does not change or weaken the results.

The paper is organized as follows. In Section 2, we formally define the index of jump activity, construct estimators for it and present the main properties of the estimators in the general case where the process is a semimartingale. Section 3 is devoted to the special and simpler case of a symmetric stable process, and Section 4 is about more general Lévy processes. We propose a small sample bias correction in Section 5. We present the results of Monte Carlo simulations in Section 6, and we compute our estimators over all 2006 transactions of the Dow Jones stocks in Section 7, focusing in particular on Intel and Microsoft. Section 8 is devoted to technical results and to the proof of the main theorems, which apply to Itô semimartingales, under suitable assumptions on the Lévy measure.

2. The model and main results.

2.1. Defining an index of jump activity. Our structural assumption is that $X$ is a 1-dimensional Itô semimartingale on some filtered space $(\Omega, \mathcal{F}, (\mathcal{F}_t)_{t \ge 0}, P)$, which means that its characteristics $(B,C,\nu)$ are absolutely continuous with respect to Lebesgue measure [see Jacod and Shiryaev (2003) for all notions not explained here]. In other words, the characteristics of $X$ have the form

$$B_t = \int_0^t b_s\,ds, \qquad C_t = \int_0^t \sigma_s^2\,ds, \qquad \nu(dt,dx) = dt\,F_t(dx). \tag{2}$$
Here, $b = (b_t)$ and $\sigma = (\sigma_t)$ are real-valued optional processes, and $F_t = F_t(\omega,dx)$ is a predictable random measure, meaning that for all Borel sets $A$ in $\mathbb{R}$ the process $(F_t(A))$ is predictable (possibly taking the value $+\infty$). This model is quite general. For instance, the drift, volatility and jump measures can be stochastic and jump themselves. There are other ways of expressing this assumption, for example through a Wiener process $W$ and a Poisson random measure $\mu$ with compensator $\nu(dt,dx) = dt \otimes dx$ (up to a possible enlargement of the space), as

$$X_t = X_0 + \int_0^t b_s\,ds + \int_0^t \sigma_s\,dW_s + \int_0^t\!\!\int_{\mathbb{R}} \delta(s,x) 1_{\{|\delta(s,x)| \le 1\}} (\mu - \nu)(ds,dx) + \int_0^t\!\!\int_{\mathbb{R}} \delta(s,x) 1_{\{|\delta(s,x)| > 1\}} \mu(ds,dx). \tag{3}$$

In this formulation, $b$ and $\sigma$ are the same as in (2), $\delta = \delta(\omega,t,x)$ is a predictable function, and the connection with $F_t$ is that $F_t(\omega,dx)$ is the restriction to $\mathbb{R} \setminus \{0\}$ of the image of the Lebesgue measure by the map $x \mapsto \delta(\omega,t,x)$. However, it is easier for the problem at hand to express the assumptions on $F_t$ rather than on $\delta$, which, moreover, is not unique (whereas $F_t$ is uniquely defined, up to null sets).

Below, for any measure $H$ on $\mathbb{R}$ we denote by $\bar H$ its (symmetrical) tail function

$$x > 0 \mapsto \bar H(x) = H([-x,x]^c). \tag{4}$$

Observe that $B(r)_t < \infty$ if and only if $B'(r)_t < \infty$, where $B'(r)_t = \sum_{s \le t} (|\Delta X_s|^r \wedge 1)$, and the process $B'(r)$ is finite-valued if and only if it is locally integrable. In other words, divergence, when it occurs, is caused by the small jumps. Moreover, for any stopping time $T$, we have

$$E(B'(r)_T) = E\left(\int_0^T ds \int_{\mathbb{R}} (|x|^r \wedge 1) F_s(dx)\right).$$

We call instantaneous jump activity index at time $t$ the (random) number

$$\beta^i_t = \inf\left\{r > 0 : \int_{\mathbb{R}} (|x|^r \wedge 1) F_t(dx) < \infty\right\}. \tag{5}$$

In light of (5), this is a natural generalization of the notion of Blumenthal–Getoor index for Lévy processes [see Blumenthal and Getoor (1961)]. $\beta^i$ is a predictable process taking its values in $[0,2]$. This process is also characterized by the property that, for any $\varepsilon > 0$, we have

$$\lim_{x \to 0} x^{\beta^i_t + \varepsilon} \bar F_t(x) = 0, \qquad \limsup_{x \to 0} x^{\beta^i_t - \varepsilon} \bar F_t(x) = \infty. \tag{6}$$

The lim sup above is usually not a limit. Note, finally, that $\beta^i_t = 0$ does not necessarily imply that the process has finite jump activity, since it is possible for the Lévy measure to diverge slowly near 0, at a subgeometric speed. An example of this would be the Gamma process, which has $F(dx) = (\eta \exp(-\kappa x) 1_{\{x>0\}}/x)\,dx$, so that the tail $\bar F_t(\varepsilon)$ of the Lévy measure diverges at a logarithmic rate in $\varepsilon$. Finite activity processes (compound Poisson) will have $\beta^i_t = 0$ a.s., however.
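To make the Gamma process example explicit, here is a short worked computation (added for illustration; $E_1$ denotes the exponential integral and $\gamma_E$ the Euler–Mascheroni constant, notation that is not used in the original). Since the Lévy measure charges only $(0,\infty)$, its tail is

$$\bar F(\varepsilon) = \eta \int_\varepsilon^\infty \frac{e^{-\kappa x}}{x}\,dx = \eta\,E_1(\kappa\varepsilon) = \eta\bigl(\log(1/\varepsilon) - \log\kappa - \gamma_E + O(\varepsilon)\bigr), \qquad \varepsilon \to 0.$$

Hence $\bar F(\varepsilon) \to \infty$, so the process has infinite activity, while $\int_0^1 x^{r-1} e^{-\kappa x}\,dx < \infty$ for every $r > 0$, so the infimum in (5) equals 0. Equivalently, $\varepsilon^{r}\,\bar F(\varepsilon) \to 0$ for every $r > 0$, which matches the first limit in (6) with index 0.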

2.2. Assumptions. We make two assumptions. The first one, on the drift $b$ and volatility $\sigma$, is quite mild.

Assumption 1. The processes $b$ and $\sigma$ are locally bounded.

The second assumption, on the Lévy measures $F_t$, is more specific. Essentially, we split $F_t$ as $F_t = F'_t + F''_t$, where $F'_t$ is very close to the Lévy measure of a $\beta$-stable process, restricted to a random interval $(-z^{(-)}_t, z^{(+)}_t)$ around 0 with some $\beta$ that is not random (the random interval may be empty for some $(\omega,t)$, but not for all), and $F''_t$ is another Lévy measure with jump activity index less than some $\beta' < \beta$. The precise statement of the assumption is as follows.

Assumption 2. There are three (nonrandom) numbers $\beta \in (0,2)$, $\beta' \in [0,\beta)$ and $\gamma > 0$, and a locally bounded process $L_t \ge 1$, such that we have, for all $(\omega,t)$,

$$F_t = F'_t + F''_t, \tag{7}$$

where:

(a) $F'_t$ has the form

$$F'_t(dx) = \frac{1 + |x|^\gamma f(t,x)}{|x|^{1+\beta}} \left(a^{(+)}_t 1_{\{0 < x \le z^{(+)}_t\}} + a^{(-)}_t 1_{\{-z^{(-)}_t \le x < 0\}}\right) dx \tag{8}$$

for some predictable nonnegative processes $a^{(+)}_t$, $a^{(-)}_t$, $z^{(+)}_t$ and $z^{(-)}_t$ and some predictable function $f(\omega,t,x)$ satisfying

$$\frac{1}{L_t} \le z^{(+)}_t \le 1, \qquad \frac{1}{L_t} \le z^{(-)}_t \le 1, \qquad a^{(+)}_t + a^{(-)}_t \le L_t, \qquad 1 + |x|^\gamma f(t,x) \ge 0, \qquad |f(t,x)| \le L_t; \tag{9}$$

(b) $F''_t$ is a measure that is singular with respect to $F'_t$ and satisfies

$$\int_{\mathbb{R}} (|x|^{\beta'} \wedge 1) F''_t(dx) \le L_t. \tag{10}$$

We will also need the increasing and locally bounded process

$$\bar A_t = \int_0^t A_s\,ds, \qquad \text{where } A_t = \frac{a^{(+)}_t + a^{(-)}_t}{\beta}. \tag{11}$$
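The division by $\beta$ in (11) can be motivated by a short computation, added here as an illustration for the special case $f \equiv 0$ in (8). For $0 < x \le z^{(+)}_t \wedge z^{(-)}_t$, the tail function (4) of $F'_t$ is

$$\bar F'_t(x) = a^{(+)}_t \int_x^{z^{(+)}_t} \frac{dy}{y^{1+\beta}} + a^{(-)}_t \int_x^{z^{(-)}_t} \frac{dy}{y^{1+\beta}} = \frac{a^{(+)}_t + a^{(-)}_t}{\beta}\,x^{-\beta} - \frac{a^{(+)}_t (z^{(+)}_t)^{-\beta} + a^{(-)}_t (z^{(-)}_t)^{-\beta}}{\beta},$$

so that $x^\beta\,\bar F'_t(x) \to A_t$ as $x \to 0$. In other words, $A_t$ is the leading constant of the stable-like tail at time $t$, and $\bar A_t$ accumulates it over $[0,t]$.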

Remark 1. In view of (6), the instantaneous index at time $t$ due to the part $F'_t$ of the Lévy measure is $\beta$ on the set $\{A_t > 0\}$, and 0 otherwise; whereas the one due to $F''_t$ is everywhere smaller than $\beta'$. Hence, outside a null set, we have $\beta_t = \beta$ on the set $\{\bar A_t > 0\}$ and $\beta_t \le \beta'$ otherwise.

Remark 2. One could formulate Assumption 2 slightly differently by writing $F_t = F^1_t + F^2_t$, where $F^1_t$ is given by (8) with $f(t,x) = 0$ and $F^2_t$ satisfies (10) with some index $\beta''$, and, further, the restriction of $F^2_t$ to $[-z^{(-)}, z^{(+)}]$ has an absolutely continuous part with a density of the form $f(t,x)\,|x|^{\gamma-1-\beta}$ with (9). The two formulations are equivalent, provided that we take $\beta'' = \beta' \vee (\beta - \gamma)$.

Remark 3. Take any process of the form

$$dX_t = b_t\,dt + \sigma_t\,dW_t + \delta_{t-}\,dY_t + \delta'_{t-}\,dY'_t, \tag{12}$$

where $\delta$ and $\delta'$ are càdlàg adapted processes, $Y$ is $\beta$-stable or tempered $\beta$-stable and $Y'$ is any other Lévy process whose Lévy measure integrates $|x|^{\beta'}$ near the origin and has an absolutely continuous part whose density is smaller than $K|x|^{\gamma-1-\beta}$ on $[-1,1]$ for some $\gamma > 0$ (e.g., a stable process with index strictly smaller than $\beta'$). Then $X$ will satisfy Assumption 2. For instance, when further $Y$ is symmetrical with Lévy density $D/|x|^{1+\beta}$, it is satisfied with the two numbers $\beta$ and $\beta'$, $\gamma$ (as above), $f(t,x) = 0$, $z^{(-)}_t = z^{(+)}_t = 1$ and $a^{(-)}_t = a^{(+)}_t = D|\delta_t|^\beta$.

Remark 4. When $X$ is a Lévy process, so that $F_t(\omega,dx) = F(dx)$ does not depend on $t$ and $\omega$, Assumption 2 is related to the property that $X$ is a regular Lévy process of exponential type, as introduced in Boyarchenko and Levendorskiĭ (2002), with $\beta = \nu$ and $\beta' \vee (\beta - \gamma) = \nu'$ in the notation of that paper. These two assumptions are not exactly comparable. The one in Boyarchenko and Levendorskiĭ (2002) is more stringent about the behavior of big jumps, whereas ours is slightly more demanding for small jumps.

2.3. The estimators. Recall that we observe $X_{i\Delta_n}$ for $i = 0,1,\ldots,[T/\Delta_n]$. While the processes $B(r)$ are defined from the jumps $\Delta X_s = X_s - X_{s-}$ of $X$, we do not observe these jumps directly.
Rather, all that we observe are the discrete increments

$$\Delta^n_i X = X_{i\Delta_n} - X_{(i-1)\Delta_n}. \tag{13}$$

From these increments, we could try to evaluate $B(r)_T$ and then infer $\beta$. Finding consistent estimators for $B(r)_T$ is easy, but deducing from them an estimator for $\beta$ is almost impossible, because we need to decide whether $B(r)_T$ is infinite or not based on a finite sample.

So, we propose the following idea. For fixed $\varpi > 0$ and $\alpha > 0$, we write

$$U(\varpi,\alpha,\Delta_n)_t = \sum_{i=1}^{[t/\Delta_n]} 1_{\{|\Delta^n_i X| > \alpha\Delta_n^\varpi\}} \tag{14}$$

for the number of increments whose magnitude is greater than $\alpha\Delta_n^\varpi$. In all cases below, we will set $\varpi < 1/2$.

To better understand our rationale for doing this, consider the special case $X = \sigma W + Y$, where $Y$ is a $\beta$-stable process, so $\beta_t(\omega) = \beta$. Any increment $\Delta^n_i X = X_{i\Delta_n} - X_{(i-1)\Delta_n}$ satisfies $\Delta^n_i X = \sigma\Delta_n^{1/2} W_1 + \Delta_n^{1/\beta} Y_1$ (equality in law). Then, recalling that $\beta < 2$ and $\Delta_n \to 0$, with a large probability $\Delta^n_i X$ is close to $\sigma\Delta_n^{1/2} W_1$ in law. Those increments give essentially no information on $Y$ and are of order of magnitude $\Delta_n^{1/2}$. However, if $Y$ has a big jump at time $s$, the corresponding increment is close to $\Delta Y_s$. Hence, one has to throw away all the small increments. However, $\beta$ is related to the behavior of $F$ near 0 and, hence, to the very small jumps of $Y$. This is why we will use only increments bigger than a cutoff level $\alpha\Delta_n^\varpi$ for some $\varpi \in (0,1/2)$. Asymptotically, those increments are big, because, since $\varpi < 1/2$, the main contribution is due to $Y$. Those increments mostly contain a single big jump of size of order at least $\Delta_n^\varpi$, and we still get some information on small jumps, because $\Delta_n^\varpi \to 0$.

So, by using the statistic $U$, which simply counts the number of large increments, defined as those greater than $\alpha\Delta_n^\varpi$, we are retaining only those increments of $X$ that are not predominantly made of contributions from its continuous semimartingale part, which are $O_p(\Delta_n^{1/2})$, and instead are predominantly made of contributions due to a jump. The same heuristics work for more general Itô semimartingales.

As we will see later, the key property of the functionals $U(\varpi,\alpha,\Delta_n)$ is their convergence in probability

$$\Delta_n^{\varpi\beta}\,U(\varpi,\alpha,\Delta_n)_t \xrightarrow{P} \frac{\bar A_t}{\alpha^\beta}, \tag{15}$$

which we will show holds under Assumption 2. This property leads us to propose an estimator of $\beta$ at each stage $n$. Fix $0 < \alpha < \alpha'$ and define

$$\hat\beta_n(t,\varpi,\alpha,\alpha') = \frac{\log(U(\varpi,\alpha,\Delta_n)_t / U(\varpi,\alpha',\Delta_n)_t)}{\log(\alpha'/\alpha)}, \tag{16}$$

which is at least consistent for estimating $\beta$ on the set $\{\bar A_t > 0\}$. If either value of $U$ in (16) is 0, then, by convention, we set the estimator to be 0.
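The count (14) and the ratio estimator (16) translate almost directly into code. The following is a minimal sketch, assuming the increments are already available as a list of floats; the function and argument names (`count_large_increments`, `beta_hat`, `varpi`, `delta`) are illustrative choices and do not come from the paper.

```python
import math

def count_large_increments(increments, alpha, varpi, delta):
    """U(varpi, alpha, Delta_n)_t: number of |increments| above alpha * delta**varpi."""
    cutoff = alpha * delta ** varpi
    return sum(1 for dx in increments if abs(dx) > cutoff)

def beta_hat(increments, alpha, alpha_prime, varpi, delta):
    """Estimator (16); returns 0.0 by convention when either count is zero."""
    if not 0 < alpha < alpha_prime:
        raise ValueError("need 0 < alpha < alpha_prime")
    u = count_large_increments(increments, alpha, varpi, delta)
    u_prime = count_large_increments(increments, alpha_prime, varpi, delta)
    if u == 0 or u_prime == 0:
        return 0.0
    return math.log(u / u_prime) / math.log(alpha_prime / alpha)

# Toy increments (not real data). With delta = 1e-5 and varpi = 0.2 the
# cutoffs are roughly 0.1 (alpha = 1) and 0.2 (alpha_prime = 2).
toy = [0.05, -0.3, 0.15, 0.7, -0.12, 0.02, 0.25, -0.9]
print(beta_hat(toy, alpha=1.0, alpha_prime=2.0, varpi=0.2, delta=1e-5))
```

On the toy data six increments exceed the lower cutoff and four exceed the higher one, so the estimate is $\log(6/4)/\log 2$. With so few retained increments the value is of course not informative; the example only shows the mechanics.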
$\hat\beta_n$ is constructed from a suitably scaled ratio of two $U$'s evaluated on the same time scale $\Delta_n$ at two different fixed levels of truncation of the increments, $\alpha$ and $\alpha'$. In a way, this construction is in the same spirit as the classical estimator of Hill (1975), who conducts inference about the tails of a distribution based on ratios of various extremes.

We can also propose a second estimator defined as

$$\hat\beta'_n(t,\varpi,\alpha) = \frac{\log(U(\varpi,\alpha,\Delta_n)_t / U(\varpi,\alpha,2\Delta_n)_t)}{\varpi \log 2}, \tag{17}$$

where $U(\varpi,\alpha,2\Delta_n)_t$ is defined analogously to $U(\varpi,\alpha,\Delta_n)_t$ in (14), except that sampling at $\Delta_n$ is replaced by sampling at $2\Delta_n$. That is, $\hat\beta'_n$ is constructed from a suitably scaled ratio of two $U$'s evaluated at the same level of truncation $\alpha$ on two separate time scales $\Delta_n$ and $2\Delta_n$. One could also look at a third estimator $\hat\beta''_n$ obtained from two $U$'s evaluated at two different rates of truncation $\varpi$ and $\varpi'$. One could further consider estimators based not just on counting the increments that exceed a certain cutoff but also on the magnitude of these increments, as in the case of power variations truncated to use only the large increments.

In the rest of the paper, we will focus mainly on the properties of the estimator $\hat\beta_n$, noting that a similar type of analysis yields the consistency and asymptotic distribution of the other two estimators. In general, the asymptotic variance of $\hat\beta_n$ is smaller than that of $\hat\beta'_n$ and $\hat\beta''_n$. Before studying the properties of the estimator $\hat\beta_n$, let us make a few remarks.

Remark 5. Asymptotically, as $n \to \infty$, the above estimators behave well. However, for any given $n$, it may happen that they are not informative, because too few increments are retained, up to the extreme case where $U(\varpi,\alpha,\Delta_n)_t = U(\varpi,\alpha',\Delta_n)_t = 0$. If this is the case, one should take smaller values of $\alpha$ and $\alpha'$.

Remark 6. Even when the estimators are well defined, they may take a value bigger than or equal to 2. In this case, the estimation is not reliable, and it may be an indication that Assumption 2 is simply not satisfied, which would be the case for example if there is no jump at all in the observed path. So it would make sense to convince oneself that jumps are present [see, e.g., Aït-Sahalia and Jacod (2009)] before attempting to estimate $\beta$.

Remark 7.
As we will see below, asymptotic considerations lead to the selection of $\varpi = 1/5$ as a universal choice valid for all possible values of $\beta$. The cutoff for large increments is $\alpha\Delta_n^\varpi$. When implementing the estimator in practice, in any given sample the value of $\Delta_n$ is fixed, so $\varpi$ and $\alpha$ are not independent parameters. The level of truncation $\alpha\Delta_n^\varpi$ may be set in relation to the volatility of the continuous part of the semimartingale [i.e., $(t^{-1}\int_0^t \sigma_s^2\,ds)^{1/2}$], since the objective is to eliminate the increments that are mainly due to the continuous part. The truncation level can be selected in a data-driven manner. Despite the presence of jumps, that volatility can be estimated using the small increments of the process, since

$$\sum_{i=1}^{[t/\Delta_n]} |\Delta^n_i X|^2\,1_{\{|\Delta^n_i X| \le \alpha\Delta_n^\varpi\}} \xrightarrow{P} \int_0^t \sigma_s^2\,ds \tag{18}$$

for any $\alpha > 0$ and $\varpi \in (0,1/2)$. We can then set the cutoff level $\alpha\Delta_n^\varpi$ to yield a number of (estimated) standard deviations of the continuous part of the semimartingale. For the estimator $\hat\beta_n$, $\alpha'$ can then be set as a multiple of $\alpha$. These data-driven choices determine a range of reasonable values for $(\alpha,\alpha')$. One possibility is then to simply average the estimators $\hat\beta_n$ obtained for the values of $(\alpha,\alpha')$ over that range. The parameters $(\alpha,\alpha')$ effectively play a role similar to that of bandwidth parameters in a nonparametric analysis.

Remark 8. The construction of the estimators relies on the property (15), which holds under (slightly) weaker assumptions than Assumption 2, provided that the definition of $\bar A_t$ and the rate of convergence are suitably amended. As a result, the estimators given in (16) remain consistent under weaker assumptions. For example, when $X$ has only finitely many jumps, the index is $\beta = 0$, and $U(\varpi,\alpha,\Delta_n)_t$ converges to the number of jumps between 0 and $t$, irrespective of the value of $\alpha$, so $\hat\beta_n$ is equal to 0 for all $n$ large enough (obviously, this rules out the possibility of a central limit theorem).

Remark 9. Our estimator is based on the count of big increments, although we are interested in the properties of the small jumps of $X$, which are those governing the index $\beta$. This is because sums of squares of the small increments behave as described in (18), and sums of other powers smaller than 2 are also driven by the Wiener part of $X$ and do not provide insight on the small jumps. Perhaps considering sums of powers bigger than 2 for small increments would provide an alternative means of constructing estimators of $\beta$, but we did not consider this possibility here.

2.4. Properties of the estimators.
Our first result states that our estimators estimate $\beta$ on the (random) set $\{\bar A_t > 0\}$, where the jump activity index is $\beta$, and we can state the following rate of convergence.

Theorem 1. Let $0 < \alpha < \alpha'$, $0 < \varpi < 1/2$ and $t > 0$. Under Assumptions 1 and 2, we have $\hat\beta_n(t,\varpi,\alpha,\alpha') \xrightarrow{P} \beta$ on the set $\{\bar A_t > 0\}$. Moreover, if

$$\chi = \chi(\beta,\gamma,\beta',\varpi) = (\varpi\gamma) \wedge \frac{1-\varpi\beta}{3} \wedge \frac{\varpi(\beta-\beta')}{1+\beta'} \wedge \frac{1-2\varpi}{2} \wedge \frac{\varpi\beta}{2}, \tag{19}$$

then the estimators $\hat\beta_n(t,\varpi,\alpha,\alpha')$ are $\Delta_n^{\chi-\varepsilon}$-rate consistent for any $\varepsilon > 0$ on the set $\{\bar A_t > 0\}$, in the sense that the sequence of variables $\bigl(\Delta_n^{-(\chi-\varepsilon)}(\hat\beta_n(t,\varpi,\alpha,\alpha') - \beta)\bigr)_{n \ge 1}$ is bounded in probability (or, "tight") in restriction to this set.

The number $\chi$ is positive, but it may also be very small. If we want an associated distributional result, we need stronger assumptions, which essentially imply that $\chi = \varpi\beta/2$ above, and this requires that the activity indices $\beta'$ and $\beta - \gamma$ of the nonstable-like part of the Lévy measure be sufficiently apart from the leading activity index $\beta$, as follows.

Theorem 2. Let $0 < \alpha < \alpha'$ and $t > 0$. Assume Assumptions 1 and 2 with $\beta' \in [0,\beta/(2+\beta))$ and $\gamma > \beta/2$. Then, if $\varpi < \frac{1}{2+\beta} \wedge \frac{2}{5\beta}$, and in restriction to the set $\{\bar A_t > 0\}$, we have the following stable convergence in law to a centered normal variable independent of $X$:

$$\frac{1}{\Delta_n^{\varpi\beta/2}}\,(\hat\beta_n(t,\varpi,\alpha,\alpha') - \beta) \xrightarrow{\mathcal{L}\text{-}(s)} N\left(0, \frac{\alpha'^\beta - \alpha^\beta}{\bar A_t\,(\log(\alpha'/\alpha))^2}\right). \tag{20}$$

The qualifier "in restriction to the set $\{\bar A_t > 0\}$" is essential in this statement. Recall that, unlike the usual convergence in law, stable convergence in law makes it possible to restrict the convergence to a subset of $\Omega$ exactly as convergence in probability does. On the complement set $\{\bar A_t = 0\}$, anything can happen. On that set, the number $\beta$ has no meaning as a jump activity index for $X$ on $[0,t]$. Moreover, the stable convergence in law allows for the convergence of standardized statistics.

Theorem 3. Under the assumptions of Theorem 2, the variables

$$\frac{\log(\alpha'/\alpha)}{\sqrt{1/U(\varpi,\alpha',\Delta_n)_t - 1/U(\varpi,\alpha,\Delta_n)_t}}\,(\hat\beta_n(t,\varpi,\alpha,\alpha') - \beta) \tag{21}$$

converge stably in law, in restriction to the set $\{\bar A_t > 0\}$, to a standard normal variable $N(0,1)$ independent of $X$.

These results are model-free in a sense, because the drift and the volatility processes are totally unspecified apart from Assumption 1, and the Lévy measures $F_t$ are unspecified, other than the requirements specified in Assumption 2. These three theorems will be proved in Section 8 below.
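Theorem 3 suggests a feasible standard error for $\hat\beta_n$ that depends only on the two counts, and Theorem 2 gives an explicit upper bound on $\varpi$. The sketch below illustrates both under stated assumptions: the helper names and the example counts are hypothetical, the interval is the usual normal-approximation interval implied by (21) (it is not written out in the original), and $U \ge U'$ holds automatically because $\alpha < \alpha'$.

```python
import math

def beta_hat_from_counts(u, u_prime, alpha, alpha_prime):
    """Estimator (16) written in terms of the counts U (at alpha) and U' (at alpha')."""
    return math.log(u / u_prime) / math.log(alpha_prime / alpha)

def standard_error(u, u_prime, alpha, alpha_prime):
    """Feasible standard error read off the studentized statistic (21)."""
    return math.sqrt(1.0 / u_prime - 1.0 / u) / math.log(alpha_prime / alpha)

def confidence_interval(u, u_prime, alpha, alpha_prime, z=1.96):
    """Asymptotic normal interval; z = 1.96 corresponds to about 95% coverage."""
    b = beta_hat_from_counts(u, u_prime, alpha, alpha_prime)
    se = standard_error(u, u_prime, alpha, alpha_prime)
    return b - z * se, b + z * se

def max_varpi(beta):
    """Upper bound on varpi in Theorem 2: min(1/(2+beta), 2/(5*beta))."""
    return min(1.0 / (2.0 + beta), 2.0 / (5.0 * beta))

# Hypothetical counts, for illustration only.
print(confidence_interval(u=400, u_prime=150, alpha=1.0, alpha_prime=2.0))
# Smallest admissible bound over a grid of beta values in (0, 2).
print(min(max_varpi(k / 100.0) for k in range(1, 200)))
```

The second printed value stays above $1/5$ and approaches it as $\beta$ approaches 2, which is one way to see why $\varpi = 1/5$ is admissible for every $\beta \in (0,2)$.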

The condition on $\varpi$ given in the statement of Theorem 2 restricts the admissible values of $\varpi$ in a manner that depends on $\beta$. Since $\beta$ is unknown at this point, we must select a universal value of $\varpi$ that is admissible for all values of $\beta$. Not surprisingly, the most stringent value of $\varpi$ is obtained in the limit where $\beta \to 2$, yielding $\varpi = 1/5$, and this is the value we suggest for empirical applications.

We note (without proof) that a similar set of properties holds for the second estimator $\hat\beta'_n$ based on the ratio of $U$'s estimated at two different frequencies $\Delta_n$ and $2\Delta_n$, with (21) replaced by the standardized statistic

$$\frac{\varpi \log 2}{\sqrt{1/U(\varpi,\alpha,2\Delta_n)_t - 1/U(\varpi,\alpha,\Delta_n)_t}}\,(\hat\beta'_n(t,\varpi,\alpha) - \beta). \tag{22}$$

3. Stable processes. Here, we specialize the general results in an important special case, discussing in particular the efficiency of the estimators of $\beta$ we propose. In the special case of stable processes, the model is fully specified parametrically, and we can compare the properties of efficient parametric estimators of $\beta$ to those of the general estimators $\hat\beta_n$.

Denote by $Y$ a symmetric stable process with index $\beta \in (0,2)$. We study the two situations where $X_t = Y_t$ (the simplest of all since there is no continuous part) and $X_t = bt + \sigma W_t + Y_t$, where $\sigma > 0$, $b \in \mathbb{R}$ and $W$ is a Brownian motion. The Lévy measure depends on a scale parameter $A > 0$ and the index $\beta$. It has the form

$$F(dx) = \frac{A\beta}{2|x|^{1+\beta}}\,dx, \qquad \text{hence} \qquad \bar F(x) := F([-x,x]^c) = \frac{A}{x^\beta} \quad \text{for } x > 0. \tag{23}$$

The law of $Y_1$ has an even density $g$ and a tail function $\bar G(x) = P(|Y_1| > x)$ satisfying, as $x \to \infty$ [see Zolotarev (1986), Theorems and Corollary 2 of Theorem 2.5.1],

$$g(x) = \frac{A\beta}{2|x|^{1+\beta}} + O\left(\frac{1}{x^{1+2\beta}}\right), \qquad \bar G(x) = \frac{A}{x^\beta} + O\left(\frac{1}{x^{2\beta}}\right). \tag{24}$$

In both cases $X = Y$ and $X_t = bt + \sigma W_t + Y_t$, we obviously have Assumptions 1 and 2, with $F_t = F'_t = F$ not depending on $(\omega,t)$, and with $F''_t = 0$, $\Xi = \Omega \times (0,\infty)$, $\beta' = 0$, $f_t(x) = 0$ and, finally, $A_t(\omega) = A$, which is the constant in (23). Then, we can apply the previous results, which further hold on the whole set $\Omega$ [because here $\bar A_t = tA > 0$ for all $(t,\omega)$].
The results are much easier to prove in this special case, and also the requirements on $\varpi$ are significantly weaker, thus allowing for faster rates of convergence (the larger $\varpi$, the faster the convergence in Theorem 1). But these improved results are no longer model-free, since the structure of jumps is completely specified in this stable model up to the unknown parameters $A$ and $\beta$.
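To see the estimator at work in this stable setting, one can simulate increments of $X = \sigma W + Y$ and apply (16). The sketch below is an illustration under several assumptions that are not in the original: it uses the Chambers-Mallows-Stuck sampler for a standard symmetric stable law (unit scale, so the constant $A$ is whatever that normalization implies), the scaling $\Delta_n^{1/\beta} Y_1$ from the equality in law noted in Section 2.3, and hypothetical parameter values ($\beta = 1$, $\sigma = 0.1$, $n = 100{,}000$, $\varpi = 0.2$) chosen so that the cutoffs sit well above the Brownian scale $\sigma\Delta_n^{1/2}$.

```python
import math
import random

def symmetric_stable(beta, rng):
    """One draw from a standard symmetric beta-stable law (Chambers-Mallows-Stuck)."""
    v = rng.uniform(-math.pi / 2.0, math.pi / 2.0)
    w = rng.expovariate(1.0)
    return (math.sin(beta * v) / math.cos(v) ** (1.0 / beta)
            * (math.cos((1.0 - beta) * v) / w) ** ((1.0 - beta) / beta))

def simulate_increments(n, beta, sigma, rng):
    """Increments of X = sigma*W + Y on [0, 1], sampled at delta = 1/n."""
    delta = 1.0 / n
    return [sigma * math.sqrt(delta) * rng.gauss(0.0, 1.0)
            + delta ** (1.0 / beta) * symmetric_stable(beta, rng)
            for _ in range(n)]

def beta_hat(increments, alpha, alpha_prime, varpi, delta):
    """Ratio estimator (16), with the zero-count convention."""
    def count(a):
        cutoff = a * delta ** varpi
        return sum(1 for dx in increments if abs(dx) > cutoff)
    u, u_prime = count(alpha), count(alpha_prime)
    if u == 0 or u_prime == 0:
        return 0.0
    return math.log(u / u_prime) / math.log(alpha_prime / alpha)

rng = random.Random(12345)  # fixed seed so the run is reproducible
n = 100_000
dx = simulate_increments(n, beta=1.0, sigma=0.1, rng=rng)
estimate = beta_hat(dx, alpha=0.02, alpha_prime=0.2, varpi=0.2, delta=1.0 / n)
print(estimate)
```

With these settings only a few hundred of the 100,000 increments exceed the lower cutoff and a few dozen exceed the higher one, so the estimate should land in the vicinity of the true index 1 but with visible sampling noise, which reflects how little of the sample the method actually uses.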

3.1. The case $X = Y$. Consider first the case where $X$ has no continuous part. Then the general results on $\hat\beta_n$ can be improved to yield the following.

Theorem 4. Assume that $X = Y$. Let $0 < \alpha < \alpha'$, $\varpi > 0$ and $t > 0$. Then:

(a) If $\varpi < 1/\beta$, the estimators $\hat\beta_n(t,\varpi,\alpha,\alpha')$ converge in probability to $\beta$;

(b) If further $\varpi < 2/(3\beta)$, we have stable convergence in law, over the whole set $\Omega$, as described in Theorems 2 (with $\bar A_t = tA$) and 3.

Note that in part (a) of the theorem, the closer $\beta$ is to 2, the stronger the constraint on the truncation rate $\varpi$.

These estimators are not, however, rate-efficient. To see this, one can recall from Aït-Sahalia and Jacod (2008) that the parametric model in which one observes the values $X_{i\Delta_n}$ for $i\Delta_n \le t$ is regular, and its Fisher information for estimating $\beta$ is asymptotically of the form

$$I_n \sim \frac{(\log(1/\Delta_n))^2}{\Delta_n}\,C_\beta\,t \tag{25}$$

for some constant $C_\beta$. We can thus hope for estimators that, after centering by $\beta$ and normalization by $\log(1/\Delta_n)/\sqrt{\Delta_n}$, are $N(0,1/C_\beta)$, and, in fact, the MLE does this.

Where is the loss of efficiency coming from? In order to compute our general estimators $\hat\beta_n$, we are forced by the presence of a continuous part in $X$ to discard a very sizeable portion of the data, which is the effect of truncating away the small increments of $X$. However, in this case, if somehow we knew from the start that there is no continuous part in $X$, then there would no longer be a need to do that. It is clear that better estimators of $\beta$ could then be constructed. And if, further, the law of $Y$ has a fully specified parametric form, as is the case here, then it would be possible to improve the estimators even more. In this example, we would simultaneously estimate $\beta$ and $A$, but the rates would be unchanged and it would be even more model-dependent. So this is the kind of estimator that we do not want to use, since we have no hope of extending such an estimator to the general semimartingale situation (or, in fact, even to more general Lévy processes than the stable ones).

3.2. The case $X_t = bt + \sigma W_t + Y_t$.
We now study the situation where $Y$ is a stable process, but $X$ now also contains a continuous part. The distributional properties of the estimators follow directly in this special case. Indeed, for this model, $U(\varpi,\alpha,\Delta_n)_t$ is essentially the same as, or close to, the number $V(\varpi,\alpha,\Delta_n)_t$ of jumps of $Y$ that are bigger than $\alpha\Delta_n^\varpi$ in the interval $[0,t]$. But $V(\varpi,\alpha,\Delta_n)_t$ is a Poisson random variable with parameter $Ct/(\alpha^\beta \Delta_n^{\varpi\beta})$, where $C$ is a constant. Hence,

$$\Delta_n^{\varpi\beta}\,V(\varpi,\alpha,\Delta_n)_t \xrightarrow{P} \frac{Ct}{\alpha^\beta}, \tag{26}$$

$$\frac{1}{\Delta_n^{\varpi\beta/2}}\left(\Delta_n^{\varpi\beta}\,V(\varpi,\alpha,\Delta_n)_t - \frac{Ct}{\alpha^\beta}\right) \xrightarrow{\mathcal{L}} N\left(0, \frac{Ct}{\alpha^\beta}\right). \tag{27}$$

These properties carry over to $U(\varpi,\alpha,\Delta_n)_t$, and this leads to the following improvement to Theorem 1.

Theorem 5. Assume that $X_t = bt + \sigma W_t + Y_t$. Let $0 < \alpha < \alpha'$, $\varpi > 0$ and $t > 0$. Then:

(a) If $\varpi < 1/2$, the estimators $\hat\beta_n(t,\varpi,\alpha,\alpha')$ converge in probability to $\beta$;

(b) If $\varpi < 1/(2+\beta)$, we have the stable convergences in law, over the whole set $\Omega$, as described in Theorems 2 (with $\bar A_t = tA$) and 3.

The estimators $\hat\beta_n$ are again not rate-efficient, although they do come close. In fact, using the methods of Aït-Sahalia and Jacod (2008), we can show that Fisher's information for estimating $\beta$ at stage $n$ satisfies

$$I_n \sim \frac{A\,(\log(1/\Delta_n))^{2-\beta/2}}{\sigma^\beta\,\Delta_n^{\beta/2}}\,C'_\beta\,t \tag{28}$$

for another constant $C'_\beta$. Furthermore, in the (partial) statistical model where we observe the increments provided they are bigger than $\alpha\Delta_n^\varpi$ and discard all others (here $\alpha > 0$ and $0 < \varpi < 1/2$), Fisher's information now satisfies

$$I_n \sim \frac{A\,(1\,\cdots)^2\,(\log(1/\Delta_n))^2}{\alpha^\beta\,\Delta_n^{\varpi\beta}}\,C''_\beta\,t. \tag{29}$$

So, our general estimators are almost [up to a $\log(1/\Delta_n)$ factor] rate-efficient for the partial parametric statistical model. As for the complete model, the rate approaches the true rate by taking $\varpi$ close to 1/2, but we cannot take $\varpi$ bigger than $1/(2+\beta)$, and since, in practice, $\beta$ is unknown other than being less than 2, a universal choice may be $\varpi = 1/4$, which is less stringent than the choice $\varpi = 1/5$ required in the general case.

4. General Lévy processes. Let us now consider the case where $X$ is a general Lévy process. Its characteristics are of the form (2) with $b_t = b$, $\sigma_t = \sigma$ and $F_t = F$ deterministic and not depending on $t$. Then Assumption 1 holds. As to Assumption 2, it may or may not hold, but if it does it takes a slightly simpler form because then everything is independent of $(\omega,t)$. In particular, $\bar A_t = At$ for some constant $A > 0$. The two Theorems 1 and 3 hold without modification, except that either $\{\bar A_t > 0\} = \Omega$ for all $t > 0$, or $\{\bar A_t > 0\} = \emptyset$ for all $t$, in which case those theorems are void of content.

What is important here, though, is that those results fail when the assumptions we made are not satisfied, even with such a simple probabilistic structure for $X$. In order to see why Assumption 2 is needed, let us consider a simpler but closely related statistical model. More precisely, suppose that we observe all big jumps of $X$ up to time $t$; that is, $\Delta X_s$ with $|\Delta X_s| > \alpha\Delta_n^\varpi$ for all $s \le t$. A priori, this should give us more information on the Lévy measure than the original observation scheme where only increments (as opposed to jumps) are observed and only those bigger than $\alpha\Delta_n^\varpi$ are taken into consideration. In this statistical setting, the estimators (16) have no meaning, but we can replace $\hat\beta_n$ with

$$\bar\beta_n(t,\varpi,\alpha,\alpha') = \frac{\log(\bar U(\varpi,\alpha,\Delta_n)_t / \bar U(\varpi,\alpha',\Delta_n)_t)}{\log(\alpha'/\alpha)}, \tag{30}$$

where we have set

$$\bar U(\varpi,\alpha,\Delta_n)_t = \sum_{s \le t} 1_{\{|\Delta X_s| > \alpha\Delta_n^\varpi\}}. \tag{31}$$

The estimators $\bar\beta_n$ are of course only virtual, since there is no hope of actually observing the exact jumps of the process. But, in the rest of this section, we study the behavior of the estimators $\bar\beta_n$ in order to gain some insight on the necessity of making a restrictive assumption on the Lévy measure $F_t$ if one is to estimate $\beta$. We will see that such an assumption is needed even under these idealized circumstances. Set

$$\gamma_n(\varpi,\alpha) = \bar F(\alpha\Delta_n^\varpi). \tag{32}$$

Lemma 1. Let

$$M(\varpi,\alpha)^n_t = \frac{1}{\sqrt{\gamma_n(\varpi,\alpha)}}\,\bigl(\bar U(\varpi,\alpha,\Delta_n)_t - \gamma_n(\varpi,\alpha)\,t\bigr). \tag{33}$$

Then:

(a) Each sequence of processes $M(\varpi,\alpha)^n$ converges stably in law to a standard Wiener process, independent of $X$;

(b) If $\alpha < \alpha'$, all limit points of the sequence $\gamma_n(\varpi,\alpha')/\gamma_n(\varpi,\alpha)$ are in $[0,1]$. Further, if this sequence converges to $\gamma$, then the pair $(M(\varpi,\alpha)^n, M(\varpi,\alpha')^n)$ of processes converges stably in law to a process $(\bar W, \bar W')$, which is independent of $X$ and a 2-dimensional Wiener process with unit variances 1 and unit covariance $\sqrt{\gamma}$.

Proof. The processes $M^n = M(\varpi,\alpha)^n$ and $M'^n = M(\varpi,\alpha')^n$ are Lévy processes and martingales, with jumps going uniformly to 0, and with predictable brackets

$$\langle M^n, M^n\rangle_t = \langle M'^n, M'^n\rangle_t = t, \qquad \langle M^n, M'^n\rangle_t = \sqrt{\frac{\gamma_n(\varpi,\alpha')}{\gamma_n(\varpi,\alpha)}}\;t.$$

Observe, also, that $\alpha'\Delta_n^\varpi \ge \alpha\Delta_n^\varpi$; hence, $\gamma_n(\varpi,\alpha') \le \gamma_n(\varpi,\alpha)$. The remaining results then follow [see Jacod and Shiryaev (2003), Chapter VII].

Theorem 6. If $\alpha' > \alpha$ and $\gamma_n(\varpi,\alpha')/\gamma_n(\varpi,\alpha) \to \gamma \in [0,1]$, then the sequence

$$\sqrt{\gamma_n(\varpi,\alpha')}\,\Big(\bar\beta_n(t,\varpi,\alpha,\alpha') - \frac{\log\big(\gamma_n(\varpi,\alpha)/\gamma_n(\varpi,\alpha')\big)}{\log(\alpha'/\alpha)}\Big) \tag{34}$$

converges stably in law to a $N\big(0, \frac{1-\gamma}{t\,(\log(\alpha'/\alpha))^2}\big)$ variable, independent of $X$.

This result is a simple consequence of the previous lemma, and its proof is the same as the proof of Theorem 1 once the CLT for the processes $\bar U(\varpi,\alpha)^n$ is established, which we will do later.

So, the situation seems generally hopeless. These estimators are not even consistent for estimating the activity index $\beta$ of $F$ because of bias. To remove the bias we have to know the ratio $\gamma_n(\varpi,\alpha')/\gamma_n(\varpi,\alpha)$ (or at least its asymptotic behavior in a precise way) and, further, there is no CLT if this ratio does not converge (a fact which we do not know a priori, of course).

The major difficulty comes from the possible erratic behavior of $F$ near 0. Indeed, we have (6) with $\beta$ instead of $\beta^i_t$, but there are Lévy measures $F$ satisfying this, and such that for any $r \in (0,\beta)$ we have $x_n^r\,\bar F(x_n) \to 0$ for a sequence $x_n \to 0$ (depending on $r$, of course). If $F$ is such, the sequence $\gamma_n(\varpi,\alpha')/\gamma_n(\varpi,\alpha)$ may have the whole of [0,1] as limit points, depending on the parameter values $\varpi, \alpha, \alpha'$, and in a completely uncontrolled way for the observer.

So, we need some additional assumption on $F$. Let us consider two assumptions (the second one is stronger than the first one).

Assumption 3. $\bar F$ is regularly varying at 0, with index $\beta \in (0,2)$.

Assumption 4. We have

$$\bar F(x) = \frac{A}{x^\beta} + o\Big(\frac{1}{x^{\beta/2}}\Big) \tag{35}$$

as $x \to 0$, for some $A > 0$.

Theorem 7. (a) Under Assumption 3, we have $\bar\beta_n(t,\varpi,\alpha,\alpha') \xrightarrow{P} \beta$.

(b) Under Assumption 4, the variables $\Delta_n^{-\varpi\beta/2}\big(\bar\beta_n(t,\varpi,\alpha,\alpha') - \beta\big)$ converge stably in law to a $N\big(0, \frac{\alpha'^\beta - \alpha^\beta}{tA\,(\log(\alpha'/\alpha))^2}\big)$ variable, independent of $X$.

Proof. Assumption 3 implies that $\gamma_n(\varpi,\alpha) \to \infty$ and $\gamma_n(\varpi,\alpha)/\gamma_n(\varpi,\alpha') \to (\alpha'/\alpha)^\beta$, so the previous theorem yields (a). Assumption 4 clearly implies

$$\sqrt{\gamma_n(\varpi,\alpha')}\,\Big(\frac{\log\big(\gamma_n(\varpi,\alpha)/\gamma_n(\varpi,\alpha')\big)}{\log(\alpha'/\alpha)} - \beta\Big) \to 0,$$

and also $\gamma_n(\varpi,\alpha) \sim A/(\alpha^\beta\Delta_n^{\varpi\beta})$, so (b) follows again from the previous theorem.

It may of course happen that Assumption 3 or 4 fails and nevertheless the conclusions of the previous theorem hold for a particular choice of the parameters $(\varpi,\alpha,\alpha')$ or for a particular choice of the sequence $\Delta_n$. But, in view of Theorem 6 and of the previous proof, these assumptions are necessary if we want those conclusions to hold for all choices of $(\varpi,\alpha,\alpha')$.

We now come back to the original, realistic problem, in which only increments of $X$ are observed. Assumption 2, when $F_t(\omega,dx) = F(dx)$ for all $(\omega,t)$, is obviously stronger than Assumption 4, but not much more. The need for stronger assumptions in the original problem comes from the following fact: although a large observed increment $\Delta^n_i X$ is, with high probability, almost equal to a large jump, the observation of this jump is blurred by the Brownian component and also by a sum of very small jumps. This fact is also the reason why we need some restriction on $\varpi$ for the original problem, whereas, here, $\varpi$ can be arbitrarily large.

5. Small sample bias correction. By construction, we are forced by the presence of a continuous semimartingale to rely on a small fraction of the sample (i.e., those increments larger than $\alpha\Delta_n^\varpi$) for the purpose of estimating $\beta$. As a result, the effective sample size utilized by the estimator $\hat\beta_n$ is small, even if we sample at a relatively high frequency. This situation calls for an analysis of the small sample behavior of the estimator. Such a small sample analysis is out of reach in general, but it can be carried out explicitly for the model $X_t = \sigma W_t + \theta Y_t$ studied in Section 3, where $Y$ is a symmetric $\beta$-stable process and $W$ is a Wiener process.
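To give a sense of the magnitudes involved in Theorem 7(b) of the previous section, the sketch below converts the limiting variance into an approximate standard deviation for the virtual estimator. It assumes the variance reads $(\alpha'^\beta - \alpha^\beta)/(tA(\log(\alpha'/\alpha))^2)$ and that $A$ is known; the function name and all numerical inputs are hypothetical.

```python
import math


def asymptotic_sd(beta, A, t, delta_n, varpi, alpha, alpha_prime):
    """Approximate standard deviation implied by Theorem 7(b).

    Assumes delta_n**(-varpi*beta/2) * (estimate - beta) has limiting variance
    (alpha_prime**beta - alpha**beta) / (t * A * log(alpha_prime/alpha)**2).
    """
    log_ratio = math.log(alpha_prime / alpha)
    limit_var = (alpha_prime ** beta - alpha ** beta) / (t * A * log_ratio ** 2)
    return delta_n ** (varpi * beta / 2.0) * math.sqrt(limit_var)


# Hypothetical inputs: one trading day of 23,400 seconds, sampled every 1 or 5 seconds.
sd_1s = asymptotic_sd(beta=1.0, A=1.0, t=1.0, delta_n=1.0 / 23400,
                      varpi=0.2, alpha=5.0, alpha_prime=10.0)
sd_5s = asymptotic_sd(beta=1.0, A=1.0, t=1.0, delta_n=5.0 / 23400,
                      varpi=0.2, alpha=5.0, alpha_prime=10.0)
```

Under these assumptions the standard deviation scales as $\Delta_n^{\varpi\beta/2}$, so moving from 1-second to 5-second sampling inflates it by the factor $5^{\varpi\beta/2}$.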
Let $g$ denote the density of $Y_1$. Here, the process $Y$ is standardized by $E(e^{iuY_t}) = e^{-t|u|^\beta/2}$, so the limit $\beta \to 2$ corresponds to the standard normal density $\phi$. One additional step in the expansion (24) yields, as $x \to +\infty$,

$$g(x) = \frac{c_\beta}{x^{\beta+1}} + \frac{d_\beta}{x^{2\beta+1}} + O\Big(\frac{1}{x^{3\beta+1}}\Big) \tag{36}$$

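and, for the tail of the distribution,

$$G(x) = P(|Y_1| > x) = 2\int_x^{+\infty} g(v)\,dv = \frac{2c_\beta}{\beta x^\beta} + \frac{d_\beta}{\beta x^{2\beta}} + O\Big(\frac{1}{x^{3\beta}}\Big), \tag{37}$$

where the coefficients of the expansion are

$$c_\beta = \frac{\Gamma(\beta+1)}{2\pi}\,\sin\Big(\frac{\pi\beta}{2}\Big) \quad\text{and}\quad d_\beta = -\frac{\Gamma(2\beta+1)}{8\pi}\,\sin(\pi\beta). \tag{38}$$

This parametrization corresponds, in terms of the general notation of the paper, to

$$A_t = A = \frac{2\theta^\beta c_\beta}{\beta}. \tag{39}$$

Now, consider the tail probability $P_n$ at the cutoff level $\alpha\Delta_n^\varpi$. The probability $P_n$ determines the limiting behavior of the $U$s, since $U(\varpi,\alpha)^n_t \approx (t/\Delta_n)P_n$. We have

$$
\begin{aligned}
P_n &= 2\int_{-\infty}^{+\infty}\int_{\alpha\Delta_n^{\varpi}}^{+\infty} \frac{1}{\theta\Delta_n^{1/\beta}}\,g\Big(\frac{x-y}{\theta\Delta_n^{1/\beta}}\Big)\,dx\;\frac{1}{\sigma\Delta_n^{1/2}}\,\phi\Big(\frac{y}{\sigma\Delta_n^{1/2}}\Big)\,dy \\
&= 2\int_{-\infty}^{+\infty}\int_{(\alpha/\theta)\Delta_n^{\varpi-1/\beta}\,(1-(\sigma/\alpha)\Delta_n^{1/2-\varpi}u)}^{+\infty} g(v)\,dv\;\phi(u)\,du \\
&= \int_{-\infty}^{+\infty} G\Big(\frac{\alpha}{\theta}\,\Delta_n^{\varpi-1/\beta}\Big(1-\frac{\sigma}{\alpha}\,\Delta_n^{1/2-\varpi}u\Big)\Big)\,\phi(u)\,du.
\end{aligned} \tag{40}
$$

So, with

$$\Big(\frac{\alpha}{\theta}\,\Delta_n^{\varpi-1/\beta}\Big(1-\frac{\sigma}{\alpha}\,\Delta_n^{1/2-\varpi}u\Big)\Big)^{-\beta} = \Big(\frac{\alpha}{\theta}\,\Delta_n^{\varpi-1/\beta}\Big)^{-\beta}\Big(1 + \frac{u\beta\sigma}{\alpha}\,\Delta_n^{1/2-\varpi} + \frac{u^2\beta(\beta+1)\sigma^2}{2\alpha^2}\,\Delta_n^{1-2\varpi} + O\big(\Delta_n^{3/2-3\varpi}\big)\Big)$$

and $\int_{-\infty}^{+\infty} u\,\phi(u)\,du = 0$ and $\int_{-\infty}^{+\infty} u^2\phi(u)\,du = 1$, we see, from (37), that

$$P_n = \frac{2c_\beta\theta^\beta}{\beta\alpha^\beta}\,\Delta_n^{1-\varpi\beta}\Big(1 + \frac{\beta(\beta+1)\sigma^2}{2\alpha^2}\,\Delta_n^{1-2\varpi} + \frac{d_\beta\theta^\beta}{2c_\beta\alpha^\beta}\,\Delta_n^{1-\varpi\beta} + \text{smaller terms}\Big). \tag{41}$$

The behavior of $P_n$ suggested by the leading term in the expression, $(2c_\beta\theta^\beta/(\beta\alpha^\beta))\,\Delta_n^{1-\varpi\beta}$, is the one we have used to define the estimator $\hat\beta_n$ in (16) by exploiting the dependence of that leading term on $\alpha$. The first correction term $(\beta(\beta+1)\sigma^2/(2\alpha^2))\,\Delta_n^{1-2\varpi}$ in (41) is due to the interaction between the Wiener and the stable processes, while the second, $(d_\beta\theta^\beta/(2c_\beta\alpha^\beta))\,\Delta_n^{1-\varpi\beta}$, is due to the more accurate approximation of the tail of the stable process in (36) compared to the leading order term in (24).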
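The coefficients in (38) and the two-term tail (37) are easy to evaluate numerically. The sketch below is an illustration only: the function names are hypothetical, and the minus sign in `d_beta` is an assumption chosen so that (36) agrees with the standard large-$x$ series for a symmetric stable density with characteristic function $e^{-|u|^\beta/2}$. As a sanity check it uses $\beta = 1$, where $Y_1$ is Cauchy with scale 1/2 and the tail is known in closed form.

```python
import math


def c_beta(beta):
    """Leading coefficient c_beta of the density expansion, as in (38)."""
    return math.gamma(beta + 1.0) * math.sin(math.pi * beta / 2.0) / (2.0 * math.pi)


def d_beta(beta):
    """Second-order coefficient; the minus sign is an assumption (see the text above)."""
    return -math.gamma(2.0 * beta + 1.0) * math.sin(math.pi * beta) / (8.0 * math.pi)


def tail_two_terms(x, beta):
    """Two-term approximation of G(x) = P(|Y_1| > x), following (37)."""
    return (2.0 * c_beta(beta) / (beta * x ** beta)
            + d_beta(beta) / (beta * x ** (2.0 * beta)))


# beta = 1: Cauchy with scale 1/2, so P(|Y_1| > x) = 1 - (2/pi) * atan(2x).
x = 50.0
exact_cauchy_tail = 1.0 - (2.0 / math.pi) * math.atan(2.0 * x)
approx_tail = tail_two_terms(x, beta=1.0)
```

For $\beta = 1$ the second coefficient vanishes up to rounding, so the comparison tests only the leading term; other values of $\beta$ would need a numerical stable-law routine, which is not attempted here.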

To understand intuitively the need for the first term, suppose that the cutoff level corresponds to seven standard deviations of the continuous part of the semimartingale. There is very little probability that the Wiener process alone will generate an increment that large. On the other hand, when we count the increments due to the jump process alone, we are missing increments of the sum of the continuous and discontinuous parts where, say, the Wiener process is responsible for a one standard deviation move, and the jump process for a six standard deviation move, of the same sign. We are also missing increments where the jump process gives an eight standard deviation move and the Wiener process a one standard deviation move, of the opposite sign. The two effects partly compensate each other, and indeed the term in $u$ in (40) leads to an integral whose value is zero. But the next effect, in $u^2$, leads to a net increase in the total number of increments that are larger than the cutoff when the interaction between the Wiener and jump processes is accounted for.

Asymptotically, the first of the two correcting terms in (41) is the largest, but in small samples a large value of the scaling parameter $\theta$ relative to $\sigma$ can make their magnitudes comparable. Using (41) at two different values $\alpha$ and $\alpha'$, we obtain, since $\log(1+x) \approx x$ for small $x$,

$$\hat\beta_n \approx \beta + \frac{1}{\log(\alpha'/\alpha)}\Big\{\frac{\beta(\beta+1)\sigma^2}{2}\Big(\frac{1}{\alpha^2}-\frac{1}{\alpha'^2}\Big)\Delta_n^{1-2\varpi} + \frac{d_\beta\theta^\beta}{2c_\beta}\Big(\frac{1}{\alpha^\beta}-\frac{1}{\alpha'^\beta}\Big)\Delta_n^{1-\varpi\beta}\Big\}. \tag{42}$$

This suggests a small sample bias correction for the estimator $\hat\beta_n$, obtained by subtracting an estimator of the two correction terms on the right-hand side of (42) from $\hat\beta_n$. As we will see in simulations below, the two correction terms are quite effective in practice. Further, we note that the two correction terms in (42), of respective orders $\Delta_n^{1-2\varpi}$ and $\Delta_n^{1-\varpi\beta}$, are asymptotically negligible at the rate $\Delta_n^{\varpi\beta/2}$ at which the central limit occurs. This is due to the restrictions on the choice of $\varpi$ imposed by Theorem 2. Consequently, the bias-corrected estimator has the same asymptotic distribution as the original estimator.
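The two correction terms of (42) can be evaluated directly once values of $\sigma$ and $\theta$ are available. The sketch below is a hedged illustration, not the authors' implementation: function names are hypothetical, the sign of `d_beta` follows the standard stable-density series (an assumption), and plugging the first-stage estimate into the bias terms is one possible choice among several.

```python
import math


def c_beta(beta):
    return math.gamma(beta + 1.0) * math.sin(math.pi * beta / 2.0) / (2.0 * math.pi)


def d_beta(beta):
    # Sign convention assumed from the standard large-x series of a stable density.
    return -math.gamma(2.0 * beta + 1.0) * math.sin(math.pi * beta) / (8.0 * math.pi)


def bias_terms(beta, sigma, theta, delta_n, varpi, alpha, alpha_prime):
    """Wiener-interaction and stable-tail terms of (42), each divided by log(alpha'/alpha)."""
    log_ratio = math.log(alpha_prime / alpha)
    wiener = (beta * (beta + 1.0) * sigma ** 2 / 2.0
              * (alpha ** -2 - alpha_prime ** -2)
              * delta_n ** (1.0 - 2.0 * varpi))
    tail = (d_beta(beta) * theta ** beta / (2.0 * c_beta(beta))
            * (alpha ** -beta - alpha_prime ** -beta)
            * delta_n ** (1.0 - varpi * beta))
    return wiener / log_ratio, tail / log_ratio


def corrected_beta(beta_hat, sigma, theta, delta_n, varpi, alpha, alpha_prime):
    """Subtract both bias terms, evaluated at the first-stage estimate beta_hat."""
    wiener, tail = bias_terms(beta_hat, sigma, theta, delta_n, varpi, alpha, alpha_prime)
    return beta_hat - wiener - tail


# Hypothetical inputs chosen so the Wiener term can be checked by hand.
w, s = bias_terms(beta=1.0, sigma=1.0, theta=1.0, delta_n=1.0,
                  varpi=0.25, alpha=1.0, alpha_prime=2.0)
```

With these inputs the Wiener term equals $0.75/\log 2$, and the stable-tail term is zero up to rounding because $\sin(\pi\beta)$ vanishes at $\beta = 1$.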
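More generally, we have

$$\hat\beta_n \approx \beta + \frac{1}{\log(\alpha'/\alpha)}\Big\{\frac{\int_0^t A_s\sigma_s^2\,ds}{\bar A_t}\,\frac{\beta(\beta+1)}{2}\Big(\frac{1}{\alpha^2}-\frac{1}{\alpha'^2}\Big)\Delta_n^{1-2\varpi} + \frac{\int_0^t A_s^2\,ds}{\bar A_t}\,\frac{\beta d_\beta}{4c_\beta^2}\Big(\frac{1}{\alpha^\beta}-\frac{1}{\alpha'^\beta}\Big)\Delta_n^{1-\varpi\beta}\Big\}. \tag{43}$$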

To implement the bias correction in practice, we need to estimate the terms $(1/\bar A_t)\int_0^t A_s\sigma_s^2\,ds$ and $(1/\bar A_t)\int_0^t A_s^2\,ds$. In the case of a stable symmetric process, $A_s = A$ and so $(1/\bar A_t)\int_0^t A_s^2\,ds = A = 2\theta^\beta c_\beta/\beta$. We can then replace $(1/\bar A_t)\int_0^t A_s\sigma_s^2\,ds$ by $(1/t)\int_0^t \sigma_s^2\,ds$ and use any standard estimator of the integrated volatility. In general, we have

$$\Delta_n^{\varpi\beta}\,U(\varpi,\alpha)^n_t \approx \frac{1}{\alpha^\beta}\Big(\bar A + \frac{1}{\alpha^2}\,\overline{A\sigma^2}\,G_1(\beta)\,\Delta_n^{1-2\varpi} + \frac{1}{\alpha^\beta}\,\overline{A^2}\,G_2(\beta)\,\Delta_n^{1-\varpi\beta}\Big), \tag{44}$$

$$\Delta_n^{\varpi\beta}\,U(\varpi,\alpha)^n_t \approx a_0\,\frac{1}{\alpha^\beta} + a_1\,\frac{1}{\alpha^{2+\beta}} + a_2\,\frac{1}{\alpha^{2\beta}}, \tag{45}$$

where $\bar A = \int_0^t A_s\,ds$, $\overline{A\sigma^2} = \int_0^t A_s\sigma_s^2\,ds$ and $\overline{A^2} = \int_0^t A_s^2\,ds$. We can estimate the unknown coefficients $a_0$, $a_1$ and $a_2$ in expression (45) by a straightforward linear regression of $\Delta_n^{\varpi\beta}\,U(\varpi,\alpha)^n_t$ on $1/\alpha^\beta$, $1/\alpha^{2+\beta}$ and $1/\alpha^{2\beta}$. For the purpose of running that regression, we use different cutoff levels $\alpha$ and compute the corresponding number of increments exceeding that level, $U(\varpi,\alpha)^n_t$, together with the first-stage estimate of $\beta$. Given estimates of the regression coefficients, we have a generalized bias correction procedure based on subtracting, from $\hat\beta_n$, the terms on the right-hand side of

$$\hat\beta_n - \beta \approx \frac{1}{\log(\alpha'/\alpha)}\Big\{\frac{a_1}{a_0}\Big(\frac{1}{\alpha^2}-\frac{1}{\alpha'^2}\Big) + \frac{a_2}{a_0}\Big(\frac{1}{\alpha^\beta}-\frac{1}{\alpha'^\beta}\Big)\Big\} \tag{46}$$

evaluated at the regression estimates of $a_0$, $a_1$ and $a_2$.

6. Monte Carlo simulations. We now report simulation results documenting the finite sample performance of the estimator $\hat\beta_n$. We calibrate the values to be realistic for a very liquid stock. We use an observation length of $T = 1$ day, consisting of 6.5 hours of trading (i.e., $n = 23{,}400$ seconds). The averages and standard deviations of the estimator $\hat\beta_n$, which is based on two different levels of truncation $\alpha$ and $\alpha'$, are reported in Table 1 for various values of $\beta$ up to 1.5; every design includes a continuous (Brownian) part. The table reports the results of 5000 simulations. The data generating process is the stochastic volatility model $dX_t = \sigma_t\,dW_t + \theta\,dY_t$, with $\sigma_t = v_t^{1/2}$, $dv_t = \kappa(\eta - v_t)\,dt + \gamma v_t^{1/2}\,dB_t + dJ_t$, $E[dW_t\,dB_t] = \rho\,dt$, $\eta^{1/2} = 0.25$, $\gamma = 0.5$, $\kappa = 5$, $\rho = 0.5$.
$J$ is a compound Poisson jump process with jumps that are uniformly distributed on [−30%, 30%] and $X_0 = 1$. The jump process $Y$ is either a $\beta$-stable process with $\beta$ = 1.5, 1.25, 1.0, 0.75, 0.5 and 0.25, or a compound Poisson process (which has finite activity and is marked $\beta = 0$ in the table) with fixed jump size … The estimator is implemented with

$\alpha = 5\eta$, $\alpha' = 10\eta$ and $\varpi$ = … Given $\eta$ and $\alpha$, the scale parameter $\theta$ (or equivalently $A$) of the stable process in simulations is calibrated to deliver the various values of the tail probability $P(|\Delta Y_t| \ge \alpha\Delta_n^\varpi)$ reported in the columns of the table; for the Poisson process, it is the value of the arrival rate parameter $\lambda$ that is set to generate the desired level of jump tail probability. In each row, the top number is the average value of the estimator $\hat\beta_n$ across the simulations, after inclusion of the bias correction discussed in Section 5, while the number below, in parentheses, is the standard deviation of the estimator across the same simulations. The third number, in parentheses, is the estimated asymptotic standard error based on the limiting distribution given in the sections above. A higher tail probability in the columns has the effect of generating more increments from the jump process that exceed the cutoff level, which makes more observations available and correspondingly reduces the standard deviation of the estimates.

Table 1. Monte Carlo simulations of the estimator $\hat\beta_n$ based on two levels of truncation for β-stable processes and a compound Poisson process (β = 0)

Sampling interval           1 sec     1 sec     1 sec     1 sec     5 sec
Tail probability            0.25%     0.5%      1.0%      2.5%      1.0%

β = 1.5    Sample mean      …         …         …         …         …
           Sample stdev     (0.26)    (0.18)    (0.13)    (0.08)    (0.25)
           Asymp stdev      (0.26)    (0.18)    (0.13)    (0.08)    (0.24)
β = 1.25   Sample mean      …         …         …         …         …
           Sample stdev     (0.23)    (0.16)    (0.11)    (0.07)    (0.19)
           Asymp stdev      (0.23)    (0.16)    (0.11)    (0.07)    (0.19)
β = 1.0    Sample mean      …         …         …         …         …
           Sample stdev     (0.19)    (0.14)    (0.10)    (0.06)    (0.14)
           Asymp stdev      (0.19)    (0.14)    (0.10)    (0.06)    (0.14)
β = 0.75   Sample mean      …         …         …         …         …
           Sample stdev     (0.16)    (0.11)    (0.08)    (0.05)    (0.11)
           Asymp stdev      (0.16)    (0.11)    (0.08)    (0.05)    (0.11)
β = 0.5    Sample mean      …         …         …         …         …
           Sample stdev     (0.13)    (0.09)    (0.06)    (0.04)    (0.08)
           Asymp stdev      (0.13)    (0.09)    (0.06)    (0.04)    (0.08)
β = 0.25   Sample mean      …         …         …         …         …
           Sample stdev     (0.09)    (0.06)    (0.04)    (0.04)    (0.05)
           Asymp stdev      (0.09)    (0.06)    (0.04)    (0.04)    (0.05)
β = 0      Sample mean      …         …         …         …         …
           Sample stdev     (0.02)    (0.01)    (0.007)   (0.005)   (0.01)
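The text does not spell out how $\theta$ is calibrated to a target tail probability. One simple possibility, shown here purely as an assumption, is to match the leading term of (41), $P_n \approx (2c_\beta\theta^\beta/(\beta\alpha^\beta))\,\Delta_n^{1-\varpi\beta}$, and solve for $\theta$. The function names and the numerical inputs (including the values of $\alpha$ and $\varpi$) are hypothetical.

```python
import math


def c_beta(beta):
    return math.gamma(beta + 1.0) * math.sin(math.pi * beta / 2.0) / (2.0 * math.pi)


def leading_tail_probability(theta, beta, delta_n, varpi, alpha):
    """Leading term of (41): probability that an increment of theta*Y exceeds the cutoff."""
    return (2.0 * c_beta(beta) * theta ** beta / (beta * alpha ** beta)
            * delta_n ** (1.0 - varpi * beta))


def calibrate_theta(target_prob, beta, delta_n, varpi, alpha):
    """Solve leading_tail_probability(theta, ...) = target_prob for theta (assumed rule)."""
    per_unit_scale = leading_tail_probability(1.0, beta, delta_n, varpi, alpha)
    return (target_prob / per_unit_scale) ** (1.0 / beta)


# Hypothetical design: 1-second sampling over 23,400 seconds, 1% target tail probability.
theta = calibrate_theta(target_prob=0.01, beta=1.5, delta_n=1.0 / 23400,
                        varpi=0.2, alpha=1.25)
check = leading_tail_probability(theta, beta=1.5, delta_n=1.0 / 23400,
                                 varpi=0.2, alpha=1.25)
```

Because only the leading term is matched, the realized exceedance frequency in a simulation with a Brownian component would differ by the correction terms in (41).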
As the results show, $\hat\beta_n$ recovers the true value of $\beta$ fairly accurately on average. As $\beta$ gets too close to 2, the $\beta$-stable jump process starts to approximate

too closely the behavior of the Brownian motion and the performance of the estimator deteriorates. Further simulations (not reported to save space) suggest that the estimator is not overly sensitive to the selection of the truncation levels $(\alpha,\alpha')$ within a reasonable range. Histograms of the distribution of the estimator $\hat\beta_n$ are shown in Figure 1 for the same values of $\beta$; the figures are based on the 1% level of jump tail probability. The histograms report the raw, unstandardized values of $\hat\beta_n$, which are not expected to be asymptotically normal, unlike the standardized versions.

Fig. 1. Monte Carlo distributions of the estimator $\hat\beta_n$ based on two levels of truncation for β-stable processes (0 < β < 2).

Fig. 2. Monte Carlo distribution of the estimator $\hat\beta_n$ based on two levels of truncation for a compound Poisson process (β = 0).

A special mention should be made about the compound Poisson case, which corresponds to $\beta = 0$ and does not satisfy Assumption 2. As discussed in Remark 8, there is no central limit theorem in this case, and $\hat\beta_n$ should be equal to 0 for $n$ large enough. Intuitively, when there is a small number of large jumps, the same large increments remain in the sample at the two truncation levels $\alpha$ and $\alpha'$, and the ratio of $U$ evaluated at $\alpha$ to $U$ evaluated at $\alpha'$ in (16) is equal to 1. This is what happens in simulations in the vast majority of cases, as shown by the histogram for $\beta = 0$ reported in Figure 2.

The standardized distributions are shown in Figure 3. The estimator in the histograms is standardized according to the asymptotic distribution given in Theorem 3, and the solid curve in the figures is the limiting N(0,1) density. As the figures show, the asymptotic distribution is a fairly accurate guide for the small samples. This is in spite of the relatively small number of (large) increments that are effectively used by the estimator, combined with the facts that some large increments are kept even though they may not have contained a large jump, that, conversely, smaller increments may have contained two or more large, cancelling jumps, and that the Wiener process may have combined with the pure jump process to produce a larger increment. Asymptotically, these effects do not show up at the leading order in $\Delta_n$, but they are present in small samples and appear to be effectively captured by the bias correcting term.

Finally, we compare in simulations the performance of the two estimators $\hat\beta_n$ (based on two truncation levels) and $\hat\beta'_n$ (based on two sampling frequencies) with the same experiment design as above. The sampling frequency is $\Delta_n = 1$ second, the length of observation $T = 1$ day or 23,400 seconds. The

tail probability is 1.0%, the middle value in Table 1. The results are shown in Table 2. The estimator $\hat\beta'_n$ tends to have a larger standard deviation than $\hat\beta_n$ and is slightly biased. For these reasons, we have focused on the estimator $\hat\beta_n$ and emphasize its use in the empirical application that follows.

Fig. 3. Standardized Monte Carlo and asymptotic distributions of the estimator $\hat\beta_n$ based on two levels of truncation for β-stable processes (0 < β < 2).

7. Empirical application. We now implement the estimator $\hat\beta_n$ for the two most actively traded stocks in the Dow Jones Industrial Average index, Intel (INTC) and Microsoft (MSFT), and each trading day in 2006. The data source is the TAQ database. Each day, we collect all transactions on the NYSE or NASDAQ, from 9:30 am until 4:00 pm, for each one of these

stocks. We sample in calendar time every 5 and 15 seconds. We use filters to eliminate clear data errors (price set to zero, etc.) and all transactions in the original record that are later corrected, cancelled or otherwise invalidated, as is standard in the empirical high frequency literature.

The two time series are plotted in Figure 4. Figure 5 contains a histogram of the tails of the unconditional densities of the log-returns from the two stocks. Comparing the two figures, we see that it is quite possible to have a standard time series plot display little evidence of large moves (Figure 4), while the tails of the distribution look substantially fatter than normal (Figure 5), as confirmed by the descriptive statistics in Table 3 for the two log-returns series. All together, this evidence points in the direction of many small, active jumps of the type that we seek to uncover using our estimator of $\beta$.

Table 2. Comparison of the two estimators of β in Monte Carlo simulations for β-stable processes and a compound Poisson process (β = 0)

                            Two truncation levels    Two sampling frequencies
                            $\hat\beta_n$            $\hat\beta'_n$
β = 1.0    Sample mean      …                        …
           Sample stdev     (0.10)                   (0.13)
β = 0.5    Sample mean      …                        …
           Sample stdev     (0.06)                   (0.09)
β = 0      Sample mean      …                        …
           Sample stdev     (0.007)                  (0.02)

Fig. 4. Time series of INTC and MSFT stock prices, all trading days in 2006.

More formally, we compute the statistic $\hat S_n$ of Aït-Sahalia and Jacod (2009) to test for the presence of jumps in the data. Over the different sampling frequencies considered (ranging from 2 seconds to 1 minute), the largest value of the statistic $\hat S_n$ we obtain for the different quarters and the two stocks is … Since the asymptotic value of $\hat S_n$ is 1 (resp. 2) when jumps

The Annals of Statistics 2009, Vol. 37, No. 5A, 2202–2244. DOI: 10.1214/08-AOS640. Institute of Mathematical Statistics, 2009.

More information

Properties of MLE: consistency, asymptotic normality. Fisher information.

Properties of MLE: consistency, asymptotic normality. Fisher information. Lecture 3 Properties of MLE: cosistecy, asymptotic ormality. Fisher iformatio. I this sectio we will try to uderstad why MLEs are good. Let us recall two facts from probability that we be used ofte throughout

More information

I. Chi-squared Distributions

I. Chi-squared Distributions 1 M 358K Supplemet to Chapter 23: CHI-SQUARED DISTRIBUTIONS, T-DISTRIBUTIONS, AND DEGREES OF FREEDOM To uderstad t-distributios, we first eed to look at aother family of distributios, the chi-squared distributios.

More information

In nite Sequences. Dr. Philippe B. Laval Kennesaw State University. October 9, 2008

In nite Sequences. Dr. Philippe B. Laval Kennesaw State University. October 9, 2008 I ite Sequeces Dr. Philippe B. Laval Keesaw State Uiversity October 9, 2008 Abstract This had out is a itroductio to i ite sequeces. mai de itios ad presets some elemetary results. It gives the I ite Sequeces

More information

University of California, Los Angeles Department of Statistics. Distributions related to the normal distribution

University of California, Los Angeles Department of Statistics. Distributions related to the normal distribution Uiversity of Califoria, Los Ageles Departmet of Statistics Statistics 100B Istructor: Nicolas Christou Three importat distributios: Distributios related to the ormal distributio Chi-square (χ ) distributio.

More information

Chapter 7 Methods of Finding Estimators

Chapter 7 Methods of Finding Estimators Chapter 7 for BST 695: Special Topics i Statistical Theory. Kui Zhag, 011 Chapter 7 Methods of Fidig Estimators Sectio 7.1 Itroductio Defiitio 7.1.1 A poit estimator is ay fuctio W( X) W( X1, X,, X ) of

More information

Hypothesis testing. Null and alternative hypotheses

Hypothesis testing. Null and alternative hypotheses Hypothesis testig Aother importat use of samplig distributios is to test hypotheses about populatio parameters, e.g. mea, proportio, regressio coefficiets, etc. For example, it is possible to stipulate

More information

Asymptotic Growth of Functions

Asymptotic Growth of Functions CMPS Itroductio to Aalysis of Algorithms Fall 3 Asymptotic Growth of Fuctios We itroduce several types of asymptotic otatio which are used to compare the performace ad efficiecy of algorithms As we ll

More information

1. C. The formula for the confidence interval for a population mean is: x t, which was

1. C. The formula for the confidence interval for a population mean is: x t, which was s 1. C. The formula for the cofidece iterval for a populatio mea is: x t, which was based o the sample Mea. So, x is guarateed to be i the iterval you form.. D. Use the rule : p-value

More information

Chapter 6: Variance, the law of large numbers and the Monte-Carlo method

Chapter 6: Variance, the law of large numbers and the Monte-Carlo method Chapter 6: Variace, the law of large umbers ad the Mote-Carlo method Expected value, variace, ad Chebyshev iequality. If X is a radom variable recall that the expected value of X, E[X] is the average value

More information

0.7 0.6 0.2 0 0 96 96.5 97 97.5 98 98.5 99 99.5 100 100.5 96.5 97 97.5 98 98.5 99 99.5 100 100.5

0.7 0.6 0.2 0 0 96 96.5 97 97.5 98 98.5 99 99.5 100 100.5 96.5 97 97.5 98 98.5 99 99.5 100 100.5 Sectio 13 Kolmogorov-Smirov test. Suppose that we have a i.i.d. sample X 1,..., X with some ukow distributio P ad we would like to test the hypothesis that P is equal to a particular distributio P 0, i.e.

More information

Lecture 4: Cauchy sequences, Bolzano-Weierstrass, and the Squeeze theorem

Lecture 4: Cauchy sequences, Bolzano-Weierstrass, and the Squeeze theorem Lecture 4: Cauchy sequeces, Bolzao-Weierstrass, ad the Squeeze theorem The purpose of this lecture is more modest tha the previous oes. It is to state certai coditios uder which we are guarateed that limits

More information

Sequences and Series

Sequences and Series CHAPTER 9 Sequeces ad Series 9.. Covergece: Defiitio ad Examples Sequeces The purpose of this chapter is to itroduce a particular way of geeratig algorithms for fidig the values of fuctios defied by their

More information

Department of Computer Science, University of Otago

Department of Computer Science, University of Otago Departmet of Computer Sciece, Uiversity of Otago Techical Report OUCS-2006-09 Permutatios Cotaiig May Patters Authors: M.H. Albert Departmet of Computer Sciece, Uiversity of Otago Micah Colema, Rya Fly

More information

Discrete Mathematics and Probability Theory Spring 2014 Anant Sahai Note 13

Discrete Mathematics and Probability Theory Spring 2014 Anant Sahai Note 13 EECS 70 Discrete Mathematics ad Probability Theory Sprig 2014 Aat Sahai Note 13 Itroductio At this poit, we have see eough examples that it is worth just takig stock of our model of probability ad may

More information

Case Study. Normal and t Distributions. Density Plot. Normal Distributions

Case Study. Normal and t Distributions. Density Plot. Normal Distributions Case Study Normal ad t Distributios Bret Halo ad Bret Larget Departmet of Statistics Uiversity of Wiscosi Madiso October 11 13, 2011 Case Study Body temperature varies withi idividuals over time (it ca

More information

5: Introduction to Estimation

5: Introduction to Estimation 5: Itroductio to Estimatio Cotets Acroyms ad symbols... 1 Statistical iferece... Estimatig µ with cofidece... 3 Samplig distributio of the mea... 3 Cofidece Iterval for μ whe σ is kow before had... 4 Sample

More information

SAMPLE QUESTIONS FOR FINAL EXAM. (1) (2) (3) (4) Find the following using the definition of the Riemann integral: (2x + 1)dx

SAMPLE QUESTIONS FOR FINAL EXAM. (1) (2) (3) (4) Find the following using the definition of the Riemann integral: (2x + 1)dx SAMPLE QUESTIONS FOR FINAL EXAM REAL ANALYSIS I FALL 006 3 4 Fid the followig usig the defiitio of the Riema itegral: a 0 x + dx 3 Cosider the partitio P x 0 3, x 3 +, x 3 +,......, x 3 3 + 3 of the iterval

More information

Section 11.3: The Integral Test

Section 11.3: The Integral Test Sectio.3: The Itegral Test Most of the series we have looked at have either diverged or have coverged ad we have bee able to fid what they coverge to. I geeral however, the problem is much more difficult

More information

Output Analysis (2, Chapters 10 &11 Law)

Output Analysis (2, Chapters 10 &11 Law) B. Maddah ENMG 6 Simulatio 05/0/07 Output Aalysis (, Chapters 10 &11 Law) Comparig alterative system cofiguratio Sice the output of a simulatio is radom, the comparig differet systems via simulatio should

More information

CHAPTER 7: Central Limit Theorem: CLT for Averages (Means)

CHAPTER 7: Central Limit Theorem: CLT for Averages (Means) CHAPTER 7: Cetral Limit Theorem: CLT for Averages (Meas) X = the umber obtaied whe rollig oe six sided die oce. If we roll a six sided die oce, the mea of the probability distributio is X P(X = x) Simulatio:

More information

1 Computing the Standard Deviation of Sample Means

1 Computing the Standard Deviation of Sample Means Computig the Stadard Deviatio of Sample Meas Quality cotrol charts are based o sample meas ot o idividual values withi a sample. A sample is a group of items, which are cosidered all together for our aalysis.

More information

PROCEEDINGS OF THE YEREVAN STATE UNIVERSITY AN ALTERNATIVE MODEL FOR BONUS-MALUS SYSTEM

PROCEEDINGS OF THE YEREVAN STATE UNIVERSITY AN ALTERNATIVE MODEL FOR BONUS-MALUS SYSTEM PROCEEDINGS OF THE YEREVAN STATE UNIVERSITY Physical ad Mathematical Scieces 2015, 1, p. 15 19 M a t h e m a t i c s AN ALTERNATIVE MODEL FOR BONUS-MALUS SYSTEM A. G. GULYAN Chair of Actuarial Mathematics

More information

Incremental calculation of weighted mean and variance

Incremental calculation of weighted mean and variance Icremetal calculatio of weighted mea ad variace Toy Fich faf@cam.ac.uk dot@dotat.at Uiversity of Cambridge Computig Service February 009 Abstract I these otes I eplai how to derive formulae for umerically

More information

THE ABRACADABRA PROBLEM

THE ABRACADABRA PROBLEM THE ABRACADABRA PROBLEM FRANCESCO CARAVENNA Abstract. We preset a detailed solutio of Exercise E0.6 i [Wil9]: i a radom sequece of letters, draw idepedetly ad uiformly from the Eglish alphabet, the expected

More information

Center, Spread, and Shape in Inference: Claims, Caveats, and Insights

Center, Spread, and Shape in Inference: Claims, Caveats, and Insights Ceter, Spread, ad Shape i Iferece: Claims, Caveats, ad Isights Dr. Nacy Pfeig (Uiversity of Pittsburgh) AMATYC November 2008 Prelimiary Activities 1. I would like to produce a iterval estimate for the

More information

INFINITE SERIES KEITH CONRAD

INFINITE SERIES KEITH CONRAD INFINITE SERIES KEITH CONRAD. Itroductio The two basic cocepts of calculus, differetiatio ad itegratio, are defied i terms of limits (Newto quotiets ad Riema sums). I additio to these is a third fudametal

More information

Chapter 7 - Sampling Distributions. 1 Introduction. What is statistics? It consist of three major areas:

Chapter 7 - Sampling Distributions. 1 Introduction. What is statistics? It consist of three major areas: Chapter 7 - Samplig Distributios 1 Itroductio What is statistics? It cosist of three major areas: Data Collectio: samplig plas ad experimetal desigs Descriptive Statistics: umerical ad graphical summaries

More information

Tradigms of Astundithi and Toyota

Tradigms of Astundithi and Toyota Tradig the radomess - Desigig a optimal tradig strategy uder a drifted radom walk price model Yuao Wu Math 20 Project Paper Professor Zachary Hamaker Abstract: I this paper the author iteds to explore

More information

Non-life insurance mathematics. Nils F. Haavardsson, University of Oslo and DNB Skadeforsikring

Non-life insurance mathematics. Nils F. Haavardsson, University of Oslo and DNB Skadeforsikring No-life isurace mathematics Nils F. Haavardsso, Uiversity of Oslo ad DNB Skadeforsikrig Mai issues so far Why does isurace work? How is risk premium defied ad why is it importat? How ca claim frequecy

More information

Estimating the Degree of Activity of jumps in High Frequency Financial Data. joint with Yacine Aït-Sahalia

Estimating the Degree of Activity of jumps in High Frequency Financial Data. joint with Yacine Aït-Sahalia Estimating the Degree of Activity of jumps in High Frequency Financial Data joint with Yacine Aït-Sahalia Aim and setting An underlying process X = (X t ) t 0, observed at equally spaced discrete times

More information

Overview of some probability distributions.

Overview of some probability distributions. Lecture Overview of some probability distributios. I this lecture we will review several commo distributios that will be used ofte throughtout the class. Each distributio is usually described by its probability

More information

Convexity, Inequalities, and Norms

Convexity, Inequalities, and Norms Covexity, Iequalities, ad Norms Covex Fuctios You are probably familiar with the otio of cocavity of fuctios. Give a twicedifferetiable fuctio ϕ: R R, We say that ϕ is covex (or cocave up) if ϕ (x) 0 for

More information

COMPARISON OF THE EFFICIENCY OF S-CONTROL CHART AND EWMA-S 2 CONTROL CHART FOR THE CHANGES IN A PROCESS

COMPARISON OF THE EFFICIENCY OF S-CONTROL CHART AND EWMA-S 2 CONTROL CHART FOR THE CHANGES IN A PROCESS COMPARISON OF THE EFFICIENCY OF S-CONTROL CHART AND EWMA-S CONTROL CHART FOR THE CHANGES IN A PROCESS Supraee Lisawadi Departmet of Mathematics ad Statistics, Faculty of Sciece ad Techoology, Thammasat

More information

THE HEIGHT OF q-binary SEARCH TREES

THE HEIGHT OF q-binary SEARCH TREES THE HEIGHT OF q-binary SEARCH TREES MICHAEL DRMOTA AND HELMUT PRODINGER Abstract. q biary search trees are obtaied from words, equipped with the geometric distributio istead of permutatios. The average

More information

4.3. The Integral and Comparison Tests

4.3. The Integral and Comparison Tests 4.3. THE INTEGRAL AND COMPARISON TESTS 9 4.3. The Itegral ad Compariso Tests 4.3.. The Itegral Test. Suppose f is a cotiuous, positive, decreasig fuctio o [, ), ad let a = f(). The the covergece or divergece

More information

Normal Distribution.

Normal Distribution. Normal Distributio www.icrf.l Normal distributio I probability theory, the ormal or Gaussia distributio, is a cotiuous probability distributio that is ofte used as a first approimatio to describe realvalued

More information

MARTINGALES AND A BASIC APPLICATION

MARTINGALES AND A BASIC APPLICATION MARTINGALES AND A BASIC APPLICATION TURNER SMITH Abstract. This paper will develop the measure-theoretic approach to probability i order to preset the defiitio of martigales. From there we will apply this

More information
