19 Another Look at Differentiability in Quadratic Mean


 Maud Clarke
 2 years ago
 Views:
Transcription
1 19 Aother Look at Differetiability i Quadratic Mea David Pollard 1 ABSTRACT This ote revisits the delightfully subtle itercoectios betwee three ideas: differetiability, i a L 2 sese, of the squareroot of a probability desity; local asymptotic ormality; ad cotiguity A mystery The traditioal regularity coditios for maximum likelihood theory ivolve existece of two or three derivatives of the desity fuctios, together with domiatio assumptios to justify differetiatio uder itegral sigs. Le Cam (1970) oted that such coditios are uecessarily striget. He commeted: Eve if oe is ot iterested i the maximum ecoomy of assumptios oe caot escape practical statistical problems i which apparetly slight violatios of the assumptios occur. For istace the derivatives fail to exist at oe poit x which may deped o θ, or the distributios may ot be mutually absolutely cotiuous or a variety of other difficulties may occur. The existig literature is rather uclear about what may happe i these circumstaces. Note also that sice the coditios are imposed upo probability desities they may be satisfied for oe choice of such desities but ot for certai other choices. Probably Le Cam had i mid examples such as the double expoetial desity, 1 / 2 exp( x θ ), for which differetiability fails at the poit θ = x. He showed that the traditioal coditios ca be replaced by a simpler assumptio of differetiability i quadratic mea (DQM): differetiability i orm of the square root of the desity as a elemet of a L 2 space. Much asymptotic theory ca be made to work uder DQM. I particular, as Le Cam showed, it implies a quadratic approximatio property for the loglikelihoods kow as local asymptotic ormality (LAN). Le Cam s idea is simple but subtle. Whe I first ecoutered the LAN property I wrogly dismissed it as othig more tha a Taylor expasio to quadratic terms of the loglikelihood. Le Cam s DQM result showed otherwise: 1 Yale Uiversity
2 306 David Pollard oe appears to get the beefit of the quadratic expasio without payig the twicedifferetiability price usually demaded by such a Taylor expasio. How ca that happe? My iitial puzzlemet was ot completely allayed by a study of several careful accouts of LAN, such as those of Le Cam (1970; 1986, Sectio 17.3), Ibragimov & Has miskii (1981, page 114), Millar (1983, page 105), Le Cam & Yag (1990, page 101), or Strasser (1985, Chapter 12). Noe of the proofs left me with the feelig that I really uderstood why secod derivatives are ot eeded. (No criticism of those authors iteded, of course.) Evetually it dawed o me that I had overlooked a vital igrediet i the proofs: the square root of a desity is ot just a elemet of a L 2 space: it is a elemet with orm 1. By rearragig some of the stadard argumets I hope to covice the getle reader of this ote that the fixed orm is the real reaso for why a assumptio of oetimes differetiability (i quadratic mea) ca covey the beefits usually associated with twotimes differetiability. I claim that the Lemma i the ext Sectio is the key to uderstadig the role of DQM A lemma The cocept of differetiability makes sese for maps ito a arbitrary ormed space (L, ). For the purposes of my expositio, it suffices to cosider the case where the orm is geerated by a ier product,,. I fact, L will be L 2 (λ), the space of fuctios squareitegrable with respect to some measure λ, but that simplificatio will play o role for the momet. Amapξ from R k ito L is said to be differetiable at a poit θ 0 with derivative, ifξ(θ) = ξ(θ 0 ) + (θ θ 0 ) + r(θ) ear θ 0, where r(θ) = o( θ θ 0 ) as θ teds to θ 0. The derivative is liear; it may be idetified with a kvector of elemets from L. For a differetiable map, the CauchySchwarz iequality implies that ξ(θ 0 ), r(θ) =o( θ θ 0 ). It would usually be a bluder to assume aively that the boud must therefore be of order O( θ θ 0 2 ); typically, higherorder differetiability assumptios are eeded to derive approximatios with smaller errors. However, if ξ(θ) is costat that is, if the fuctio is costraied to take values lyig o the surface of a sphere the the aive assumptio turs out to be o bluder. Ideed, i that case, ξ(θ 0 ), r(θ) ca be writte as a quadratic i θ θ 0 plus a error of order o( θ θ 0 2 ). The sequetial form of the assertio is more coveiet for my purposes. (1) Lemma Let {δ } be a sequece of costats tedig to zero. Let ξ 0, ξ 1,...be elemets of orm oe for which ξ = ξ 0 +δ W +r, with W aæxed elemet of L ad r =o(δ ). The ξ 0, W =0 ad ξ 0, r = 1 2 δ2 W 2 + o(δ 2).
3 19. Differetiability i Quadratic Mea 307 Proof. Because both ξ ad ξ 0 have uit legth, 0 = ξ 2 ξ 0 2 = 2δ ξ 0, W order O(δ ) + 2 ξ 0, r order o(δ ) + δ 2 W 2 order O(δ 2) + 2δ W, r + r 2 order o(δ 2). O the righthad side I have idicated the order at which the various cotributios ted to zero. (The CauchySchwarz iequality delivers the o(δ ) ad o(δ 2 ) terms.) The exact zero o the lefthad side leaves the leadig 2δ ξ 0, W uhappily exposed as the oly O(δ ) term. It must be of smaller order, which ca happe oly if ξ 0, W =0, leavig 0 = 2 ξ 0, r +δ 2 W 2 + o(δ 2 ), as asserted. Without the fixed legth property, the ier product ξ 0, r, which iherits o(δ ) behaviour from r, might ot decrease at the O(δ 2) rate A theorem Let {P θ : θ } be a family of probability measures o a space (X, A), idexed by a subset of R k. Suppose P θ has desity f (x,θ) with respect to a sigmafiite measure λ. Uder the classical regularity coditios twice cotiuous differetiability of log f (x,θ) with respect to θ, with a domiated secod derivative the likelihood ratio f (x i,θ) f (x i,θ 0 ) ejoys the LAN property. Write L (t) for the likelihood ratio evaluated at θ equal to θ 0 + t/. The property asserts that, if the {x i } are sampled idepedetly from P θ0, the (2) L (t) = exp ( t S 1 2 t Ɣt + o p (1) ) for each t, where Ɣ is a fixed matrix (depedig o θ 0 )ads has a cetered asymptotic ormal distributio with variace matrix Ɣ. Formally, the LAN approximatio results from the usual poitwise Taylor expasio of the log desity g(x,θ) = log f (x,θ), followig a style of argumet familiar to most graduate studets. For example, i oe dimesio, log L (θ 0 + t/ ) = ( g(xi,θ 0 + t/ ) g(x i,θ 0 ) ) = t g (x i,θ 0 ) + t 2 g (x i,θ 0 ) +..., 2
4 308 David Pollard which suggests that S be the stadardized score fuctio, 1 g (x i,θ 0 ) N ( 0, var θ0 g (x,θ 0 ) ), ad Ɣ should be the iformatio fuctio, P θ0 g (x,θ 0 ) = var θ0 g (x,θ 0 ). The dual represetatio for Ɣ allows oe to elimiate all metio of secod derivatives from the statemet of the LAN approximatio, which hits that two derivatives might ot really be eeded, as Le Cam (1970) showed. I geeral, the family of desities is said to be differetiable i quadratic mea at θ 0 if the square root ξ(x,θ)= f (x,θ) is differetiable i the L 2 (λ) sese: for some kvector (x) of fuctios i L 2 (λ), (3) ξ(x,θ)= ξ(x,θ 0 ) + (θ θ 0 ) (x) + r(x,θ), where λ r(x,θ) 2 = o( θ θ 0 2 ) as θ θ 0. Let us abbreviate ξ(x,θ 0 ) to ξ 0 (x) ad (x)/ξ 0 (x) to D(x). From (3) oe almost gets the LAN property. (4) Theorem Assume the DQM property (3). For each Æxed t the likelihood ratio has the approximatio, uder {P,θ0 }, where L (t) = exp ( t S 1 2 t Ɣt + o p (1) ), S = 2 D(x i ) N(0, I 0 ) ad Ɣ = 1 2 I I, with I 0 = 4λ( {ξ 0 > 0}) ad I = 4λ( ). Notice the slight differece betwee Ɣ ad the limitig variace matrix for S. At least formally, 2D(x) equals the derivative of log f (x,θ): igorig problems related to divisio by zero ad distictios betwee poitwise ad L 2 (λ) differetiability, we have 2 2D(x) = f (x,θ0 ) = f (x,θ0 ) θ θ log f (x,θ 0). Also, Ɣ agai correspods to the iformatio matrix, expressed i its variace form, except for the itrusio of the idicator fuctio {ξ 0 > 0}. The extra idicator is ecessary if we wish to be careful about 0/0. Its presece is related to the property called cotiguity aother of Le Cam s great ideas as is explaied i Sectio 5.
5 19. Differetiability i Quadratic Mea 309 At first sight the derivatio of Theorem 4 from assumptio (3) agai appears to be a simple matter of a Taylor expasio to quadratic terms of the log likelihood ratio. Writig R (x) = r(x,θ 0 + t/ )/ξ 0 (x), wehave log L (t) = 2log ξ(x i,θ 0 + t/ ) ξ(x i,θ 0 ) = 2log (1 + t ) D(x i ) + R (x i ). From the Taylor expasio of log( ) about 1, the sum of logarithms ca be writte as a formal series, 2 ( ) t D(x i ) + R (x i ) ( t 2 D(x i ) + R (x i )) +... (5) = 2t D(x i ) + 2 R (x i ) 1 ( t D(x i ) ) The first sum o the righthad side gives the t S i Theorem 4. The law of large umbers gives covergece of the third term to t P θ0 DD t. Mere oetimes differetiability might ot seem eough to dispose of the secod sum. Each summad has stadard deviatio of order o(1/ ), by DQM. A sum of such terms could crudely be bouded via a triagle iequality, leavig a quatity of order o( ), which clearly would ot suffice. I fact the sum of the R (x i ) does ot go away i the limit; as a cosequece of Lemma 1, it cotributes a fixed quadratic i t. That cotributio is the surprise behid DQM A proof Let me write P to deote calculatios uder the assumptio that the observatios x 1,...,x are sampled idepedetly from P θ0. The ratio f (x i,θ 0 + t/ )/f (x i,θ 0 ) is ot well defied whe f (x i,θ 0 ) = 0, but uder P the problem ca be eglected because P { f (x i,θ 0 ) = 0 for at least oe i} =0. For other probability measures that are ot absolutely cotiuous with respect to P, oe should be more careful. It pays to be quite explicit about behaviour whe f (x i,θ 0 ) = 0 for some i, by icludig a explicit idicator fuctio {ξ 0 > 0} as a factor i ay expressios with a ξ 0 i the deomiator. Defie D i to be the radom vector (x i ){ξ 0 (x i )>0}/ξ 0 (x i ), ad, for a fixed t, defie R i, = r(ξ i,θ 0 + t/ ){ξ 0 (x i )>0}/ξ 0 (x i ). The ξ(x i,θ 0 + t/ ) {ξ 0 ( i )>0} =1 + t D i + R i,. ξ 0 (x i )
6 310 David Pollard (6) (8) The radom vector D i has expected value λ(ξ 0 ), which, by Lemma 1, is zero, eve without the traditioal regularity assumptios that justify differetiatio uder a itegral sig. It has variace 1 4 I 0. It follows by a cetral limit theorem that S = 2 D i N(0, I 0 ). Also, by a (weak) law of large umbers, 1 D i D i P (D 1 D 1 ) = 1 4 I 0 i probability. To establish rigorously the earlan assertio of Theorem 4, it is merely a matter of boudig the error terms i (5) ad the justifyig the treatmet of the sum of the R (x i ). Three facts are eeded. (7) Lemma Uder {P }, assumig DQM, (a) max D i =o p ( ), (b) max R i, =o p (1), (c) 2R i, 1 4 t It i probability. Let me first explai how Theorem 4 follows from Lemma 7. Together the two facts (a) ad (b) esure that with high probability log L (t) does ot ivolve ifiite values. For (t D i / ) + R i, > 1 we may the a appeal to the Taylor expasio log(1 + y) = y 1 2 y β(y), where β(y) = o(y 2 ) as y teds to zero, to deduce that log L (t) equals 2 t D i + 2 R i, ( t ) D 2 i + R i, + ( t ) D i β + R i,, which expads to t S + 2 R i, 1 (t D i ) 2 2 t D i R i, R i, 2 + o p(1) ( Di 2 ) + Ri, 2. Each of the last three sums is of order o p (1) because D i 2 / = O p (1) ad P R2 i, = λ( ξ0 2 r(x 1,θ 0 + t/ ){ξ 0 > 0}/ξ0 2 ) λ r(,θ 0 + t/ ) 2 = o(1). By virtue of (6) ad (c), the expasio simplifies to t S 1 4 t It 1 4 t I 0 t + o p (1), as asserted by Theorem 4.
7 19. Differetiability i Quadratic Mea 311 Proof of Lemma 7. Assertio (a) follows from the idetical distributios: P {max D i >ɛ } P { D i >ɛ } = P { 1 >ɛ } ɛ 2 λ 2 1 { 1 >ξ 0 ɛ } 0 by Domiated Covergece. Assertio (b) follows from (8): P {max R i, >ɛ} ɛ 2 P Ri, 2 0. Oly Assertio (c) ivolves ay subtlety. The variace of the sum is bouded by 4 P R (x i ) 2, which teds to zero. The sum of the remaiders must lie withi o p (1) of its expected value, which equals 2P θ0 R 1, = 2λ ( ξ 0 r(,θ 0 + t/ ) ), a ier product betwee two fuctios i L 2 (λ). Notice that the ξ 0 factor makes the idicator {ξ 0 > 0} redudat. It is here that the uit legth property becomes importat. Specializig Lemma 1 to the case δ = 1/, with ξ (x) = ξ(x,θ 0 + t/ ) ad W = t, we get the approximatio to the sum of expected values of the R i,, from which Assertio (c) follows. A slight geeralizatio of the LAN assertio is possible. It is ot ecessary that we cosider oly parameters of the form θ 0 + t/ for a fixed t. By arguig almost as above alog coverget subsequeces of {t } we could prove a aalog of Theorem 4 if t were replaced by a bouded sequece {t } such that θ 0 + t /. The extesio is sigificat because (Le Cam 1986, page 584) the slightly stroger result forces a form of differetiability i quadratic mea Cotiguity ad disappearace of mass For otatioal simplicity, cosider oly the oedimesioal case with the typical value t = 1. Let ξ 2 be the margial desity, ad Q be the joit distributio, for x 1,...,x sampled with parameter value θ 0 + 1/.As before, ξ0 2 ad P correspod to θ 0. The measure Q is absolutely cotiuous with respect to P if ad oly if it puts zero mass i the set A ={ξ 0 (x i ) = 0 for at least oe i }. Writig α for λξ 2{ξ 0 = 0}, wehave Q A = 1 ( 1 Q {ξ 0 (x i ) = 0} ) = 1 (1 α ).
8 312 David Pollard By direct calculatio, α = λ ( r + / ) 2 {ξ0 = 0} =λ 2 {ξ 0 = 0}/ + o(1/). The quatity τ = λ 2 {ξ 0 = 0} has the followig sigificace. Uder Q, the umber of observatios ladig i A has approximately a Poisso(τ) distributio; ad Q A 1 e τ. I some asymptotic sese, the measure Q becomes more early absolutely cotiuous with respect to P if ad oly if τ = 0. The precise sese is called cotiguity: the sequece of measures {Q } is said to be cotiguous with respect to {P } if Q B 0 for each sequece of sets {B } such that P B 0. Because P A = 0 for every, the coditio τ = 0 is clearly ecessary for cotiguity. It is also sufficiet. Cotiguity follows from the assertio that L, the limit i distributio uder {P } of the likelihood ratios {L (1)}, have expected value oe. ( Le Cam s first lemma see the theorem o page 20 of Le Cam ad Yag, 1990.) The argumet is simple: If PL = 1 the, to each ɛ>0 there exists a fiite costat C such that PL{L < C} > 1 ɛ. From the covergece i distributio, P L {L < C} > 1 ɛ evetually. If P B 0the Q B P B L {L < C}+Q {L C} CP B + 1 P L {L < C} < 2ɛ evetually. For the special case of the limitig exp(n(µ, σ 2 )) distributio, where µ = 1 4 I I ad σ 2 = I 0, the requiremet becomes 1 = P exp ( N(µ, σ 2 ) ) = exp ( µ σ 2). That is, cotiguity obtais whe I 0 = I (or equivaletly, λ( 2 {ξ 0 = 0}) = 0), i which case, the limitig variace of S equals Ɣ. This coclusio plays the same role as the traditioal dual represetatio for the iformatio fuctio. As Le Cam & Yag (1990, page 23) commeted, The equality... is the classical oe. Oe fids it for istace i the stadard treatmet of maximum likelihood estimatio uder Cramér s coditios. There it is derived from coditios of differetiability uder the itegral sig. The fortuitous equality is othig more tha cotiguity i disguise. From the literature oe sometimes gets the impressio that λ 2 {ξ 0 = 0} is always zero. It is ot. (9) Example Let λ be Lebesgue measure o the real lie. Defie f 0 (x) = x{0 x 1}+(2 x){1 < x 2}. For 0 θ 1 defie desities f (x,θ)= (1 θ 2 ) f 0 (x) + θ 2 f 0 (x 2). Notice that (10) λ f (x,θ) f (x, 0) θ f (x, 1) 2 = ( 1 θ 2 1) 2 = O(θ 4 ).
9 19. Differetiability i Quadratic Mea 313 The family of desities is differetiable i quadratic mea at θ = 0 with derivative (x) = f (x, 1). For this family, λ 2 {ξ 0 = 0} =1. The earlan assertio of Theorem 4 degeerates: I 0 = 0adI = 4, givig L (t) exp ( t 2) i probability, uder {P,θ0 }. Ideed, as Aad va der Vaart has poited out to me, the limitig experimet (i Le Cam s sese) for the models {P,t/ : 0 t } is ot the Gaussia traslatio model correspodig to the LAN coditio. Istead, the limit experimet is {Q t : t 0}, with Q t equal to the Poisso(t 2 ) distributio. That is, for each fiite set T ad each h, uder {P,h/ } the radom vectors ( dp,t/ ) : t T dp,h/ coverge i distributio to ( ) dqt : t T, dq h as a radom vector uder the Q h distributio. The couterexample would ot work if θ were allowed to take o egative values; oe would eed (x) = f (x, 1) to get the aalog of (10) for egative θ. The failure of cotiguity is directly related to the fact that θ = 0 lies o boudary of the parameter iterval. I geeral, λ {ξ 0 = 0} must be zero at all iterior poits of the parameter space where DQM holds. O the set {ξ 0 = 0} we have 0 ξ(x,θ 0 +t/ ) = t + r, where r 0. Alog a subsequece, r 0, leavig the coclusio that t 0 almost everywhere o the set {ξ 0 = 0}. At a iterior poit, t ca rage over all directios, which forces = 0 almost everywhere o {ξ = 0}; at a iterior poit, {ξ = 0} =0 almost everywhere. More geerally, oe eeds oly to be able to approach θ 0 from eough differet directios to force = 0o{ξ 0 = 0} as i the cocept of a cotiget i Le Cam & Yag (1990, Sectio 6.2). The assumptio that θ 0 lies i the iterior of the parameter space is ot always easy to spot i the literature. Some authors, such as Le Cam & Yag (1990, page 101), prefer to dispese with the domiatig measure λ, by recastig differetiability i quadratic mea as a property of the desities dp θ /dp θ0, whose square roots correspod to the ratios ξ(x,θ){ξ 0 > 0}/ξ 0 (x). With that approach, the behaviour of o the set {ξ 0 = 0} must be specified explicitly. The cotiguity requiremet that P θ puts, at worst, mass of order o( θ θ 0 2 ) i the set {ξ 0 = 0} is the made part of the defiitio of differetiability i quadratic mea Refereces Ibragimov, I. A. & Has miskii, R. Z. (1981), Statistical Estimatio: Asymptotic Theory, SprigerVerlag, New York.
10 314 David Pollard Le Cam, L. (1970), O the assumptios used to prove asymptotic ormality of maximum likelihood estimators, Aals of Mathematical Statistics 41, Le Cam, L. (1986), Asymptotic Methods i Statistical Decisio Theory, SprigerVerlag, New York. Le Cam, L. & Yag, G. L. (1990), Asymptotics i Statistics: Some Basic Cocepts, SprigerVerlag. Millar, P. W. (1983), The miimax priciple i asymptotic statistical theory, Spriger Lecture Notes i Mathematics pp Strasser, H. (1985), Mathematical Theory of Statistics: Statistical Experimets ad Asymptotic Decisio Theory, De Gruyter, Berli.
Properties of MLE: consistency, asymptotic normality. Fisher information.
Lecture 3 Properties of MLE: cosistecy, asymptotic ormality. Fisher iformatio. I this sectio we will try to uderstad why MLEs are good. Let us recall two facts from probability that we be used ofte throughout
More informationORDERS OF GROWTH KEITH CONRAD
ORDERS OF GROWTH KEITH CONRAD Itroductio Gaiig a ituitive feel for the relative growth of fuctios is importat if you really wat to uderstad their behavior It also helps you better grasp topics i calculus
More information3. Covariance and Correlation
Virtual Laboratories > 3. Expected Value > 1 2 3 4 5 6 3. Covariace ad Correlatio Recall that by takig the expected value of various trasformatios of a radom variable, we ca measure may iterestig characteristics
More informationSequences and Series
CHAPTER 9 Sequeces ad Series 9.. Covergece: Defiitio ad Examples Sequeces The purpose of this chapter is to itroduce a particular way of geeratig algorithms for fidig the values of fuctios defied by their
More information7. Sample Covariance and Correlation
1 of 8 7/16/2009 6:06 AM Virtual Laboratories > 6. Radom Samples > 1 2 3 4 5 6 7 7. Sample Covariace ad Correlatio The Bivariate Model Suppose agai that we have a basic radom experimet, ad that X ad Y
More informationConvexity, Inequalities, and Norms
Covexity, Iequalities, ad Norms Covex Fuctios You are probably familiar with the otio of cocavity of fuctios. Give a twicedifferetiable fuctio ϕ: R R, We say that ϕ is covex (or cocave up) if ϕ (x) 0 for
More informationChapter 7 Methods of Finding Estimators
Chapter 7 for BST 695: Special Topics i Statistical Theory. Kui Zhag, 011 Chapter 7 Methods of Fidig Estimators Sectio 7.1 Itroductio Defiitio 7.1.1 A poit estimator is ay fuctio W( X) W( X1, X,, X ) of
More informationINFINITE SERIES KEITH CONRAD
INFINITE SERIES KEITH CONRAD. Itroductio The two basic cocepts of calculus, differetiatio ad itegratio, are defied i terms of limits (Newto quotiets ad Riema sums). I additio to these is a third fudametal
More informationSAMPLE QUESTIONS FOR FINAL EXAM. (1) (2) (3) (4) Find the following using the definition of the Riemann integral: (2x + 1)dx
SAMPLE QUESTIONS FOR FINAL EXAM REAL ANALYSIS I FALL 006 3 4 Fid the followig usig the defiitio of the Riema itegral: a 0 x + dx 3 Cosider the partitio P x 0 3, x 3 +, x 3 +,......, x 3 3 + 3 of the iterval
More informationAsymptotic Growth of Functions
CMPS Itroductio to Aalysis of Algorithms Fall 3 Asymptotic Growth of Fuctios We itroduce several types of asymptotic otatio which are used to compare the performace ad efficiecy of algorithms As we ll
More informationI. Chisquared Distributions
1 M 358K Supplemet to Chapter 23: CHISQUARED DISTRIBUTIONS, TDISTRIBUTIONS, AND DEGREES OF FREEDOM To uderstad tdistributios, we first eed to look at aother family of distributios, the chisquared distributios.
More informationOverview of some probability distributions.
Lecture Overview of some probability distributios. I this lecture we will review several commo distributios that will be used ofte throughtout the class. Each distributio is usually described by its probability
More informationChapter 7  Sampling Distributions. 1 Introduction. What is statistics? It consist of three major areas:
Chapter 7  Samplig Distributios 1 Itroductio What is statistics? It cosist of three major areas: Data Collectio: samplig plas ad experimetal desigs Descriptive Statistics: umerical ad graphical summaries
More informationModule 4: Mathematical Induction
Module 4: Mathematical Iductio Theme 1: Priciple of Mathematical Iductio Mathematical iductio is used to prove statemets about atural umbers. As studets may remember, we ca write such a statemet as a predicate
More informationSequences II. Chapter 3. 3.1 Convergent Sequences
Chapter 3 Sequeces II 3. Coverget Sequeces Plot a graph of the sequece a ) = 2, 3 2, 4 3, 5 + 4,...,,... To what limit do you thik this sequece teds? What ca you say about the sequece a )? For ǫ = 0.,
More informationHypothesis testing. Null and alternative hypotheses
Hypothesis testig Aother importat use of samplig distributios is to test hypotheses about populatio parameters, e.g. mea, proportio, regressio coefficiets, etc. For example, it is possible to stipulate
More informationSection IV.5: Recurrence Relations from Algorithms
Sectio IV.5: Recurrece Relatios from Algorithms Give a recursive algorithm with iput size, we wish to fid a Θ (best big O) estimate for its ru time T() either by obtaiig a explicit formula for T() or by
More information4.3. The Integral and Comparison Tests
4.3. THE INTEGRAL AND COMPARISON TESTS 9 4.3. The Itegral ad Compariso Tests 4.3.. The Itegral Test. Suppose f is a cotiuous, positive, decreasig fuctio o [, ), ad let a = f(). The the covergece or divergece
More informationThe Limit of a Sequence
3 The Limit of a Sequece 3. Defiitio of limit. I Chapter we discussed the limit of sequeces that were mootoe; this restrictio allowed some shortcuts ad gave a quick itroductio to the cocept. But may importat
More informationLecture 7: Borel Sets and Lebesgue Measure
EE50: Probability Foudatios for Electrical Egieers JulyNovember 205 Lecture 7: Borel Sets ad Lebesgue Measure Lecturer: Dr. Krisha Jagaatha Scribes: Ravi Kolla, Aseem Sharma, Vishakh Hegde I this lecture,
More informationIncremental calculation of weighted mean and variance
Icremetal calculatio of weighted mea ad variace Toy Fich faf@cam.ac.uk dot@dotat.at Uiversity of Cambridge Computig Service February 009 Abstract I these otes I eplai how to derive formulae for umerically
More informationIn nite Sequences. Dr. Philippe B. Laval Kennesaw State University. October 9, 2008
I ite Sequeces Dr. Philippe B. Laval Keesaw State Uiversity October 9, 2008 Abstract This had out is a itroductio to i ite sequeces. mai de itios ad presets some elemetary results. It gives the I ite Sequeces
More informationDepartment of Computer Science, University of Otago
Departmet of Computer Sciece, Uiversity of Otago Techical Report OUCS200609 Permutatios Cotaiig May Patters Authors: M.H. Albert Departmet of Computer Sciece, Uiversity of Otago Micah Colema, Rya Fly
More information0.7 0.6 0.2 0 0 96 96.5 97 97.5 98 98.5 99 99.5 100 100.5 96.5 97 97.5 98 98.5 99 99.5 100 100.5
Sectio 13 KolmogorovSmirov test. Suppose that we have a i.i.d. sample X 1,..., X with some ukow distributio P ad we would like to test the hypothesis that P is equal to a particular distributio P 0, i.e.
More informationDefinition. Definition. 72 Estimating a Population Proportion. Definition. Definition
7 stimatig a Populatio Proportio I this sectio we preset methods for usig a sample proportio to estimate the value of a populatio proportio. The sample proportio is the best poit estimate of the populatio
More informationClass Meeting # 16: The Fourier Transform on R n
MATH 18.152 COUSE NOTES  CLASS MEETING # 16 18.152 Itroductio to PDEs, Fall 2011 Professor: Jared Speck Class Meetig # 16: The Fourier Trasform o 1. Itroductio to the Fourier Trasform Earlier i the course,
More informationAn example of nonquenched convergence in the conditional central limit theorem for partial sums of a linear process
A example of oqueched covergece i the coditioal cetral limit theorem for partial sums of a liear process Dalibor Volý ad Michael Woodroofe Abstract A causal liear processes X,X 0,X is costructed for which
More informationMaximum Likelihood Estimators.
Lecture 2 Maximum Likelihood Estimators. Matlab example. As a motivatio, let us look at oe Matlab example. Let us geerate a radom sample of size 00 from beta distributio Beta(5, 2). We will lear the defiitio
More informationEkkehart Schlicht: Economic Surplus and Derived Demand
Ekkehart Schlicht: Ecoomic Surplus ad Derived Demad Muich Discussio Paper No. 200617 Departmet of Ecoomics Uiversity of Muich Volkswirtschaftliche Fakultät LudwigMaximiliasUiversität Müche Olie at http://epub.ub.uimueche.de/940/
More information3.2 Introduction to Infinite Series
3.2 Itroductio to Ifiite Series May of our ifiite sequeces, for the remaider of the course, will be defied by sums. For example, the sequece S m := 2. () is defied by a sum. Its terms (partial sums) are
More informationThe second difference is the sequence of differences of the first difference sequence, 2
Differece Equatios I differetial equatios, you look for a fuctio that satisfies ad equatio ivolvig derivatives. I differece equatios, istead of a fuctio of a cotiuous variable (such as time), we look for
More informationHypothesis Tests Applied to Means
The Samplig Distributio of the Mea Hypothesis Tests Applied to Meas Recall that the samplig distributio of the mea is the distributio of sample meas that would be obtaied from a particular populatio (with
More informationSoving Recurrence Relations
Sovig Recurrece Relatios Part 1. Homogeeous liear 2d degree relatios with costat coefficiets. Cosider the recurrece relatio ( ) T () + at ( 1) + bt ( 2) = 0 This is called a homogeeous liear 2d degree
More informationA probabilistic proof of a binomial identity
A probabilistic proof of a biomial idetity Joatho Peterso Abstract We give a elemetary probabilistic proof of a biomial idetity. The proof is obtaied by computig the probability of a certai evet i two
More information4.1 Sigma Notation and Riemann Sums
0 the itegral. Sigma Notatio ad Riema Sums Oe strategy for calculatig the area of a regio is to cut the regio ito simple shapes, calculate the area of each simple shape, ad the add these smaller areas
More informationChapter 6: Variance, the law of large numbers and the MonteCarlo method
Chapter 6: Variace, the law of large umbers ad the MoteCarlo method Expected value, variace, ad Chebyshev iequality. If X is a radom variable recall that the expected value of X, E[X] is the average value
More informationSection 9.2 Series and Convergence
Sectio 9. Series ad Covergece Goals of Chapter 9 Approximate Pi Prove ifiite series are aother importat applicatio of limits, derivatives, approximatio, slope, ad cocavity of fuctios. Fid challegig atiderivatives
More informationTHE REGRESSION MODEL IN MATRIX FORM. For simple linear regression, meaning one predictor, the model is. for i = 1, 2, 3,, n
We will cosider the liear regressio model i matrix form. For simple liear regressio, meaig oe predictor, the model is i = + x i + ε i for i =,,,, This model icludes the assumptio that the ε i s are a sample
More informationTAYLOR SERIES, POWER SERIES
TAYLOR SERIES, POWER SERIES The followig represets a (icomplete) collectio of thigs that we covered o the subject of Taylor series ad power series. Warig. Be prepared to prove ay of these thigs durig the
More informationNormal Distribution.
Normal Distributio www.icrf.l Normal distributio I probability theory, the ormal or Gaussia distributio, is a cotiuous probability distributio that is ofte used as a first approimatio to describe realvalued
More informationKey Ideas Section 81: Overview hypothesis testing Hypothesis Hypothesis Test Section 82: Basics of Hypothesis Testing Null Hypothesis
Chapter 8 Key Ideas Hypothesis (Null ad Alterative), Hypothesis Test, Test Statistic, Pvalue Type I Error, Type II Error, Sigificace Level, Power Sectio 81: Overview Cofidece Itervals (Chapter 7) are
More informationDiscrete Mathematics and Probability Theory Spring 2014 Anant Sahai Note 13
EECS 70 Discrete Mathematics ad Probability Theory Sprig 2014 Aat Sahai Note 13 Itroductio At this poit, we have see eough examples that it is worth just takig stock of our model of probability ad may
More informationN04/5/MATHL/HP2/ENG/TZ0/XX MATHEMATICS HIGHER LEVEL PAPER 2. Thursday 4 November 2004 (morning) 3 hours INSTRUCTIONS TO CANDIDATES
c IB MATHEMATICS HIGHER LEVEL PAPER DIPLOMA PROGRAMME PROGRAMME DU DIPLÔME DU BI PROGRAMA DEL DIPLOMA DEL BI N/5/MATHL/HP/ENG/TZ/XX 887 Thursday November (morig) hours INSTRUCTIONS TO CANDIDATES! Do ot
More informationThe geometric series and the ratio test
The geometric series ad the ratio test Today we are goig to develop aother test for covergece based o the iterplay betwee the it compariso test we developed last time ad the geometric series. A ote about
More informationStandard Errors and Confidence Intervals
Stadard Errors ad Cofidece Itervals Itroductio I the documet Data Descriptio, Populatios ad the Normal Distributio a sample had bee obtaied from the populatio of heights of 5yearold boys. If we assume
More informationTHE HEIGHT OF qbinary SEARCH TREES
THE HEIGHT OF qbinary SEARCH TREES MICHAEL DRMOTA AND HELMUT PRODINGER Abstract. q biary search trees are obtaied from words, equipped with the geometric distributio istead of permutatios. The average
More informationLecture 4: Cauchy sequences, BolzanoWeierstrass, and the Squeeze theorem
Lecture 4: Cauchy sequeces, BolzaoWeierstrass, ad the Squeeze theorem The purpose of this lecture is more modest tha the previous oes. It is to state certai coditios uder which we are guarateed that limits
More information2.7 Sequences, Sequences of Sets
2.7. SEQUENCES, SEQUENCES OF SETS 67 2.7 Sequeces, Sequeces of Sets 2.7.1 Sequeces Defiitio 190 (sequece Let S be some set. 1. A sequece i S is a fuctio f : K S where K = { N : 0 for some 0 N}. 2. For
More informationB1. Fourier Analysis of Discrete Time Signals
B. Fourier Aalysis of Discrete Time Sigals Objectives Itroduce discrete time periodic sigals Defie the Discrete Fourier Series (DFS) expasio of periodic sigals Defie the Discrete Fourier Trasform (DFT)
More information3. Continuous Random Variables
Statistics ad probability: 31 3. Cotiuous Radom Variables A cotiuous radom variable is a radom variable which ca take values measured o a cotiuous scale e.g. weights, stregths, times or legths. For ay
More informationTheorems About Power Series
Physics 6A Witer 20 Theorems About Power Series Cosider a power series, f(x) = a x, () where the a are real coefficiets ad x is a real variable. There exists a real oegative umber R, called the radius
More informationUniversity of California, Los Angeles Department of Statistics. Distributions related to the normal distribution
Uiversity of Califoria, Los Ageles Departmet of Statistics Statistics 100B Istructor: Nicolas Christou Three importat distributios: Distributios related to the ormal distributio Chisquare (χ ) distributio.
More information4 n. n 1. You shold think of the Ratio Test as a generalization of the Geometric Series Test. For example, if a n ar n is a geometric sequence then
SECTION 2.6 THE RATIO TEST 79 2.6. THE RATIO TEST We ow kow how to hadle series which we ca itegrate (the Itegral Test), ad series which are similar to geometric or pseries (the Compariso Test), but of
More informationThe following example will help us understand The Sampling Distribution of the Mean. C1 C2 C3 C4 C5 50 miles 84 miles 38 miles 120 miles 48 miles
The followig eample will help us uderstad The Samplig Distributio of the Mea Review: The populatio is the etire collectio of all idividuals or objects of iterest The sample is the portio of the populatio
More informationIrreducible polynomials with consecutive zero coefficients
Irreducible polyomials with cosecutive zero coefficiets Theodoulos Garefalakis Departmet of Mathematics, Uiversity of Crete, 71409 Heraklio, Greece Abstract Let q be a prime power. We cosider the problem
More informationNPTEL STRUCTURAL RELIABILITY
NPTEL Course O STRUCTURAL RELIABILITY Module # 0 Lecture 1 Course Format: Web Istructor: Dr. Aruasis Chakraborty Departmet of Civil Egieerig Idia Istitute of Techology Guwahati 1. Lecture 01: Basic Statistics
More informationif A S, then X \ A S, and if (A n ) n is a sequence of sets in S, then n A n S,
Lecture 5: Borel Sets Topologically, the Borel sets i a topological space are the σalgebra geerated by the ope sets. Oe ca build up the Borel sets from the ope sets by iteratig the operatios of complemetatio
More informationMetric, Normed, and Topological Spaces
Chapter 13 Metric, Normed, ad Topological Spaces A metric space is a set X that has a otio of the distace d(x, y) betwee every pair of poits x, y X. A fudametal example is R with the absolutevalue metric
More informationApproximating the Sum of a Convergent Series
Approximatig the Sum of a Coverget Series Larry Riddle Ages Scott College Decatur, GA 30030 lriddle@agesscott.edu The BC Calculus Course Descriptio metios how techology ca be used to explore covergece
More informationChapter 5: Inner Product Spaces
Chapter 5: Ier Product Spaces Chapter 5: Ier Product Spaces SECION A Itroductio to Ier Product Spaces By the ed of this sectio you will be able to uderstad what is meat by a ier product space give examples
More informationLecture 13. Lecturer: Jonathan Kelner Scribe: Jonathan Pines (2009)
18.409 A Algorithmist s Toolkit October 27, 2009 Lecture 13 Lecturer: Joatha Keler Scribe: Joatha Pies (2009) 1 Outlie Last time, we proved the BruMikowski iequality for boxes. Today we ll go over the
More information1 The Binomial Theorem: Another Approach
The Biomial Theorem: Aother Approach Pascal s Triagle I class (ad i our text we saw that, for iteger, the biomial theorem ca be stated (a + b = c a + c a b + c a b + + c ab + c b, where the coefficiets
More informationConfidence Intervals for One Mean with Tolerance Probability
Chapter 421 Cofidece Itervals for Oe Mea with Tolerace Probability Itroductio This procedure calculates the sample size ecessary to achieve a specified distace from the mea to the cofidece limit(s) with
More informationLesson 12. Sequences and Series
Retur to List of Lessos Lesso. Sequeces ad Series A ifiite sequece { a, a, a,... a,...} ca be thought of as a list of umbers writte i defiite order ad certai patter. It is usually deoted by { a } =, or
More informationSection 11.3: The Integral Test
Sectio.3: The Itegral Test Most of the series we have looked at have either diverged or have coverged ad we have bee able to fid what they coverge to. I geeral however, the problem is much more difficult
More informationBASIC STATISTICS. Discrete. Mass Probability Function: P(X=x i ) Only one finite set of values is considered {x 1, x 2,...} Prob. t = 1.
BASIC STATISTICS 1.) Basic Cocepts: Statistics: is a sciece that aalyzes iformatio variables (for istace, populatio age, height of a basketball team, the temperatures of summer moths, etc.) ad attempts
More information3 Basic Definitions of Probability Theory
3 Basic Defiitios of Probability Theory 3defprob.tex: Feb 10, 2003 Classical probability Frequecy probability axiomatic probability Historical developemet: Classical Frequecy Axiomatic The Axiomatic defiitio
More information5: Introduction to Estimation
5: Itroductio to Estimatio Cotets Acroyms ad symbols... 1 Statistical iferece... Estimatig µ with cofidece... 3 Samplig distributio of the mea... 3 Cofidece Iterval for μ whe σ is kow before had... 4 Sample
More information1. C. The formula for the confidence interval for a population mean is: x t, which was
s 1. C. The formula for the cofidece iterval for a populatio mea is: x t, which was based o the sample Mea. So, x is guarateed to be i the iterval you form.. D. Use the rule : pvalue
More informationMARTINGALES AND A BASIC APPLICATION
MARTINGALES AND A BASIC APPLICATION TURNER SMITH Abstract. This paper will develop the measuretheoretic approach to probability i order to preset the defiitio of martigales. From there we will apply this
More informationwhen n = 1, 2, 3, 4, 5, 6, This list represents the amount of dollars you have after n days. Note: The use of is read as and so on.
Geometric eries Before we defie what is meat by a series, we eed to itroduce a related topic, that of sequeces. Formally, a sequece is a fuctio that computes a ordered list. uppose that o day 1, you have
More informationSection 7: Free electron model
Physics 97 Sectio 7: ree electro model A free electro model is the simplest way to represet the electroic structure of metals. Although the free electro model is a great oversimplificatio of the reality,
More informationTIEE Teaching Issues and Experiments in Ecology  Volume 1, January 2004
TIEE Teachig Issues ad Experimets i Ecology  Volume 1, Jauary 2004 EXPERIMENTS Evirometal Correlates of Leaf Stomata Desity Bruce W. Grat ad Itzick Vatick Biology, Wideer Uiversity, Chester PA, 19013
More informationRiemann Sums y = f (x)
Riema Sums Recall that we have previously discussed the area problem I its simplest form we ca state it this way: The Area Problem Let f be a cotiuous, oegative fuctio o the closed iterval [a, b] Fid
More informationAdvanced Probability Theory
Advaced Probability Theory Math5411 HKUST Kai Che (Istructor) Chapter 1. Law of Large Numbers 1.1. σalgebra, measure, probability space ad radom variables. This sectio lays the ecessary rigorous foudatio
More informationCenter, Spread, and Shape in Inference: Claims, Caveats, and Insights
Ceter, Spread, ad Shape i Iferece: Claims, Caveats, ad Isights Dr. Nacy Pfeig (Uiversity of Pittsburgh) AMATYC November 2008 Prelimiary Activities 1. I would like to produce a iterval estimate for the
More informationME 101 Measurement Demonstration (MD 1) DEFINITIONS Precision  A measure of agreement between repeated measurements (repeatability).
INTRODUCTION This laboratory ivestigatio ivolves makig both legth ad mass measuremets of a populatio, ad the assessig statistical parameters to describe that populatio. For example, oe may wat to determie
More informationAQA STATISTICS 1 REVISION NOTES
AQA STATISTICS 1 REVISION NOTES AVERAGES AND MEASURES OF SPREAD www.mathsbox.org.uk Mode : the most commo or most popular data value the oly average that ca be used for qualitative data ot suitable if
More information1 Computing the Standard Deviation of Sample Means
Computig the Stadard Deviatio of Sample Meas Quality cotrol charts are based o sample meas ot o idividual values withi a sample. A sample is a group of items, which are cosidered all together for our aalysis.
More information8.5 Alternating infinite series
65 8.5 Alteratig ifiite series I the previous two sectios we cosidered oly series with positive terms. I this sectio we cosider series with both positive ad egative terms which alterate: positive, egative,
More informationInfinite Sequences and Series
CHAPTER 4 Ifiite Sequeces ad Series 4.1. Sequeces A sequece is a ifiite ordered list of umbers, for example the sequece of odd positive itegers: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29...
More informationCase Study. Normal and t Distributions. Density Plot. Normal Distributions
Case Study Normal ad t Distributios Bret Halo ad Bret Larget Departmet of Statistics Uiversity of Wiscosi Madiso October 11 13, 2011 Case Study Body temperature varies withi idividuals over time (it ca
More informationMATH 361 Homework 9. Royden Royden Royden
MATH 61 Homework 9 Royde..9 First, we show that for ay subset E of the real umbers, E c + y = E + y) c traslatig the complemet is equivalet to the complemet of the traslated set). Without loss of geerality,
More informationNonlife insurance mathematics. Nils F. Haavardsson, University of Oslo and DNB Skadeforsikring
Nolife isurace mathematics Nils F. Haavardsso, Uiversity of Oslo ad DNB Skadeforsikrig Mai issues so far Why does isurace work? How is risk premium defied ad why is it importat? How ca claim frequecy
More informationInstitute for the Advancement of University Learning & Department of Statistics
Istitute for the Advacemet of Uiversity Learig & Departmet of Statistics Descriptive Statistics for Research (Hilary Term, 00) Lecture 5: Cofidece Itervals (I.) Itroductio Cofidece itervals (or regios)
More informationThe Euler Totient, the Möbius and the Divisor Functions
The Euler Totiet, the Möbius ad the Divisor Fuctios Rosica Dieva July 29, 2005 Mout Holyoke College South Hadley, MA 01075 1 Ackowledgemets This work was supported by the Mout Holyoke College fellowship
More informationarxiv:1506.03481v1 [stat.me] 10 Jun 2015
BEHAVIOUR OF ABC FOR BIG DATA By Wetao Li ad Paul Fearhead Lacaster Uiversity arxiv:1506.03481v1 [stat.me] 10 Ju 2015 May statistical applicatios ivolve models that it is difficult to evaluate the likelihood,
More informationSubject CT5 Contingencies Core Technical Syllabus
Subject CT5 Cotigecies Core Techical Syllabus for the 2015 exams 1 Jue 2014 Aim The aim of the Cotigecies subject is to provide a groudig i the mathematical techiques which ca be used to model ad value
More information1 n. n > dt. t < n 1 + n=1
Math 05 otes C. Pomerace The harmoic sum The harmoic sum is the sum of recirocals of the ositive itegers. We kow from calculus that it diverges, this is usually doe by the itegral test. There s a more
More informationMeasures of Spread and Boxplots Discrete Math, Section 9.4
Measures of Spread ad Boxplots Discrete Math, Sectio 9.4 We start with a example: Example 1: Comparig Mea ad Media Compute the mea ad media of each data set: S 1 = {4, 6, 8, 10, 1, 14, 16} S = {4, 7, 9,
More informationSimulation and Monte Carlo integration
Chapter 3 Simulatio ad Mote Carlo itegratio I this chapter we itroduce the cocept of geeratig observatios from a specified distributio or sample, which is ofte called Mote Carlo geeratio. The ame of Mote
More informationSection 73 Estimating a Population. Requirements
Sectio 73 Estimatig a Populatio Mea: σ Kow Key Cocept This sectio presets methods for usig sample data to fid a poit estimate ad cofidece iterval estimate of a populatio mea. A key requiremet i this sectio
More informationProblem Set 1 Oligopoly, market shares and concentration indexes
Advaced Idustrial Ecoomics Sprig 2016 Joha Steek 29 April 2016 Problem Set 1 Oligopoly, market shares ad cocetratio idexes 1 1 Price Competitio... 3 1.1 Courot Oligopoly with Homogeous Goods ad Differet
More informationApproximating Area under a curve with rectangles. To find the area under a curve we approximate the area using rectangles and then use limits to find
1.8 Approximatig Area uder a curve with rectagles 1.6 To fid the area uder a curve we approximate the area usig rectagles ad the use limits to fid 1.4 the area. Example 1 Suppose we wat to estimate 1.
More informationPSYCHOLOGICAL STATISTICS
UNIVERSITY OF CALICUT SCHOOL OF DISTANCE EDUCATION B Sc. Cousellig Psychology (0 Adm.) IV SEMESTER COMPLEMENTARY COURSE PSYCHOLOGICAL STATISTICS QUESTION BANK. Iferetial statistics is the brach of statistics
More informationPlugin martingales for testing exchangeability online
Plugi martigales for testig exchageability olie Valetia Fedorova, Alex Gammerma, Ilia Nouretdiov, ad Vladimir Vovk Computer Learig Research Cetre Royal Holloway, Uiversity of Lodo, UK {valetia,ilia,alex,vovk}@cs.rhul.ac.uk
More informationSolutions to Selected Problems In: Pattern Classification by Duda, Hart, Stork
Solutios to Selected Problems I: Patter Classificatio by Duda, Hart, Stork Joh L. Weatherwax February 4, 008 Problem Solutios Chapter Bayesia Decisio Theory Problem radomized rules Part a: Let Rx be the
More informationJoint Probability Distributions and Random Samples
STAT5 Sprig 204 Lecture Notes Chapter 5 February, 204 Joit Probability Distributios ad Radom Samples 5. Joitly Distributed Radom Variables Chapter Overview Joitly distributed rv Joit mass fuctio, margial
More informationGibbs Distribution in Quantum Statistics
Gibbs Distributio i Quatum Statistics Quatum Mechaics is much more complicated tha the Classical oe. To fully characterize a state of oe particle i Classical Mechaics we just eed to specify its radius
More informationModified Line Search Method for Global Optimization
Modified Lie Search Method for Global Optimizatio Cria Grosa ad Ajith Abraham Ceter of Excellece for Quatifiable Quality of Service Norwegia Uiversity of Sciece ad Techology Trodheim, Norway {cria, ajith}@q2s.tu.o
More information