THE PROBABLE ERROR OF A MEAN. Introduction

Size: px
Start display at page:

Download "THE PROBABLE ERROR OF A MEAN. Introduction"

Transcription

1 THE PROBABLE ERROR OF A MEAN By STUDENT Itroductio Ay experimet may he regarded as formig a idividual of a populatio of experimets which might he performed uder the same coditios. A series of experimets is a sample draw from this populatio. Now ay series of experimets is oly of value i so far as it eables us to form a judgmet as to the statistical costats of the populatio to which the experimets belog. I a greater umber of cases the questio fially turs o the value of a mea, either directly, or as the mea differece betwee the two quatities. If the umber of experimets be very large, we may have precise iformatio as to the value of the mea, but if our sample be small, we have two sources of ucertaity: (1) owig to the error of radom samplig the mea of our series of experimets deviates more or less widely from the mea of the populatio, ad () the sample is ot sufficietly large to determie what is the law of distributio of idividuals. It is usual, however, to assume a ormal distributio, because, i a very large umber of cases, this gives a approximatio so close that a small sample will give o real iformatio as to the maer i which the populatio deviates from ormality: sice some law of distributio must he assumed it is better to work with a curve whose area ad ordiates are tabled, ad whose properties are well kow. This assumptio is accordigly made i the preset paper, so that its coclusios are ot strictly applicable to populatios kow ot to be ormally distributed; yet it appears probable that the deviatio from ormality must be very extreme to load to serious error. We are cocered here solely with the first of these two sources of ucertaity. The usual method of determiig the probability that the mea of the populatio lies withi a give distace of the mea of the sample is to assume a ormal distributio about the mea of the sample with a stadard deviatio equal to s/, where s is the stadard deviatio of the sample, ad to use the tables of the probability itegral. But, as we decrease the umber of experimets, the value of the stadard deviatio foud from the sample of experimets becomes itself subject to a icreasig error, util judgmets reached i this way may become altogether misleadig. I routie work there are two ways of dealig with this difficulty: (1) a experimet may he repeated may times, util such a log series is obtaied that the stadard deviatio is determied oce ad for all with sufficiet accuracy. This value ca the he used for subsequet shorter series of similar experimets. () Where experimets are doe i duplicate i the atural course of the work, the mea square of the differece betwee correspodig pairs is equal to the stadard deviatio of the populatio multiplied by. We call thus combie 1

2 together several series of experimets for the purpose of determiig the stadard deviatio. Owig however to secular chage, the value obtaied is early always too low, successive experimets beig positively correlated. There are other experimets, however, which caot easily be repeated very ofte; i such cases it is sometimes ecessary to judge of the certaity of the results from a very small sample, which itself affords the oly idicatio of the variability. Some chemical, may biological, ad most agricultural ad largescale experimets belog to this class, which has hitherto bee almost outside the rage of statistical iquiry. Agai, although it is well kow that the method of usig the ormal curve is oly trustworthy whe the sample is large, o oe has yet told us very clearly where the limit betwee large ad small samples is to be draw. The aim of the preset paper is to determie the poit at which we may use the tables of the probability itegral i judgig of the sigificace of the mea of a series of experimets, ad to furish alterative tables for use whe the umber of experimets is too few. The paper is divided ito the followig ie sectios: I. The equatio is determied of the curve which represets the frequecy distributio of stadard deviatios of samples draw from a ormal populatio. II. There is show to be o kid of correlatio betwee the mea ad the stadard deviatio of such a sample. III. The equatio is determied of the curve represetig the frequecy distributio of a quatity z, which is obtaied by dividig the distace betwee the mea of a sample ad the mea of the populatio by the stadard deviatio of the sample. IV. The curve foud i I is discussed. V. The curve foud i III is discussed. VI. The two curves are compared with some actual distributios. VII. Tables of the curves foud i III are give for samples of differet size. VIII ad IX. The tables are explaied ad some istaces are give of their use. X. Coclusios. Sectio 1 Samples of idividuals are draw out of a populatio distributed ormally, to fid a equatio which shall represet the frequecy of the stadard deviatios of these samples. If s be the stadard deviatio foud from a sample x 1 x...x (all these beig measured from the mea of the populatio), the s = S(x 1) ( ) S(x1 ) = S(x 1) S(x 1) S(x 1x ).

3 Summig for all samples ad dividig by the umber of samples we get the moa value of s, which we will write s : s = µ µ = µ ( 1), where µ is the secod momet coefficiet i the origial ormal distributio of x: sice x 1, x, etc. are ot correlated ad the distributio is ormal, products ivolvig odd powers of x 1 vaish o summig, so that S(x1x) is equal to 0. If M R represet the Rth momet coefficiet of the distributio of s about the ed of the rage where s = 0, Agai M 1 ( 1) = µ. { ( )} S(x s 4 = 1 ) S(x1 ) ( ) S(x ( ) ( = 1 ) S(x 1) S(x1 ) S(x1 ) + = S(x4 1) + S(x 1x ) S(X4 1) 3 4S(x 1x ) 3 + S(x4 1) 4 + 6S(x 1x ) 4 +other terms ivolvig odd powers of x 1, etc. which will vaish o summatio. Now S(x 4 1) has terms, buts(x 1x ) has 1 ( 1), hece summig for all samples ad dividig by the umber of samples, we get M = µ 4 ( 1) +µ µ 4 ( 1) µ + µ 4 ( 1) 3 +3µ 3 ) 4 = µ 4 3 { +1}+ µ 3( 1){ +3}. Now sice the distributio of x is ormal, µ 4 = 3µ, hece M = µ ( 1) 3 { } = µ ( 1)(+1). I a similar tedious way I fid ad M 3 = µ 3 ( 1)(+1)(+3) 3 M 4 = µ 4 ( 1)(+1)(+3)(+5) 4. 3

4 The law of formatio of these momet coefficiets appears to be a simple oe, but I have ot see my way to a geeral proof. If ow M R be the Rth momet coefficiet of s about its mea, we have M = µ ( 1) { ( 1)(+1)(+3) M 3 = µ 3 3 {(+1) ( 1)} = µ 3 ( 1). 3( 1). (( 1) ( 1)3 3 = µ 3 ( 1) 3 { } = 8µ 3 ( 1) 3, M 4 = µ4 4 { ( 1)(+1)(+3)(+5) 3( 1) 1( 1) 3 ( 1) 4} = µ4 ( 1) 4 { } = 1µ4 ( 1)(+3) 4. Hece β 1 = M 3 M 3 = 8 1, β = M 4 M = 3(+3) 1), β 3β 1 6 = 1 {6(+3) 4 6( 1)} = 0. 1 Cosequetly a curve of Prof. Pearso s Type III may he expected to fit the distributio of s. The equatio referred to a origi at the zero ed of the curve will be where ad y = Cx p e γx, γ = M M 3 = 4µ ( 1) 3 8 µ ( 1) = µ p = 4 β 1 1 = 1 Cosequetly the equatio becomes y = Cx 3 e x µ, 1 = 3. which will give the distributio of s. The area of this curve is C x 3 e x µ dx = I (say). The first momet 0 coefficiet about the ed of the rage will therefore be C [ ] x 1 e x µ dx C µ x 1 e x x= µ 0 x=0 = + C 1 0 µ x 3 e x µ dx. I I I 4 }

5 The first part vaishes at each limit ad the secod is equal to 1 µ I I = 1 µ. ad we see that the higher momet coefficiets will he formed by multiplyig successively by +1 µ, +3 µ etc., just as appeared to he the law of formatio of M, M 3, M 4, etc. Hece it is probable that the curve foud represets the theoretical distributio of s ; so that although we have o actual proof we shall assume it to do so i what follows. The distributio of s may he foud from this, sice the frequecy of s is equal to that of s ad all that we must do is to compress the base lie suitably. Now if y 1 = φ(s ) be the frequecy curve of s ad y = ψ(s) be the frequecy curve of s, the y 1 d(s ) = y ds, y ds = y 1 sds, y = sy 1. Hece is the distributio of s. This reduces to y = Cs(s ) 3 e s µ. y = Cs e s σ. Hece y = Ax e s µ will give the frequecy distributio of stadard deviatios of samples of, take out of a populatio distributed ormally with stadard deviatio σ. The costat A may he foud by equatig the area of the curve as follows: The Area = A 0 I p = σ = σ x e x σ dx. ( Let I p represet ( e x σ )dx x p 1 d 0 dx ] x= [ x p 1 e x σ = σ (p 1)I p, x=0 sice the first part vaishes at both limits. + σ (p 1) 0 0 ) x p e x σ dx. x p e x σ dx 5

6 or By cotiuig this process we fid ( σ I = ( σ I = accordig is eve or odd. But I 0 is ad I 1 is Hece if be eve, while is be odd or 0 0 ) ) e x σ dx = xe x sigma dx = [ σ ( 3)( 5)...3.1I 0 ( 3)( 5)...4.I 1 ( π ) σ, ] e x x= σ x=0 = σ. Area A = (π )( ( 3)( 5) σ ), 1 A = Area ( 3)( 5)...4. ( σ ) 1 Hece the equatio may be writte ( ) N ( ) 1 y = ( 3)( 5) π σ x e x σ ( eve) y = N ( ) 1 ( 3)( 5)...4. σ x e x σ ( odd) where N as usual represets the total frequecy. Sectio II To show that there is o correlatio betwee (a) the distace of the mea of a sample from the mea of the populatio ad (b) the stadard deviatio of a sample with ormal distributio. (1) Clearly positive ad egative positios of the mea of the sample are equally likely, ad hece there caot be correlatio betwee the absolute value of the distace of the mea from the mea of the populatio ad the stadard. 6

7 deviatio, but () there might be correlatio betwee the square of the distace ad the square of the stadard deviatio. Let ( ) u S(x1 ) = ad s = S(x 1) ( ) S(x1 ). The if m 1, M 1 be the mea values of u ad s z, we have by the precedig part M 1 = µ ( 1) ad m 1 = µ. Now ( ) ( ) 4 u s = S(x 1) S(x1 ) S(x1 ) ( ) S(x = 1 ) + S(x 1x ).S(x 1) 3 S(x4 1) 4 6S(x 1x ) 4 other terms of odd order which will vaish o summatio. Summig for all values ad dividig by the umber of cases we get R u s σ u σ s +m 1 M 1 = µ 4 +µ ( 1) µ 4 3µ ( 1) 3 3, where R u s is the correlatio betwee u ad s. R u s σ u σ s +µ ( 1) = µ ( 1) 3 {3+ 3} = µ ( 1). Hece R u s σ u σ s = 0, or there is o correlatio betwee u ad s. Sectio III To fid the equatio represetig the frequecy distributio of the meas of samples of draw from a ormal populatio, the mea beig expressed i terms of the stadard deviatio of the sample. We have y = C σ s e x 1 σ as the equatio represetig the distributio of s, the stadard deviatio of a sample of, whe the samples are draw from a ormal populatio with stadard deviatio s. Now the meas of these samples of are distributed accordig to the equatio 1 ()N y = e x σ, (π)σ ad we have show that there is o correlatio betwee x, the distace of the mea of the sample, ad s, the stadard deviatio of the sample. 1 Airy, Theory of Errors of Observatios, Part II, 6. 7

8 Now let us suppose x measured i terms of s, i.e. let us fid the distributio of z = x/s. If we have y 1 = φ(x) ad y = ψ(z) as the equatios represetig the frequecy of x ad of z respectively, the y 1 dx = y dz = y 3 dx s, y = sy 1. Hece y = N ()s e s z σ (π)σ is the equatio represetig the distributio of z for samples of with stadard deviatio s. Now the chace that s lies betwee s ad s+ds is s+ds s 0 C σ 1 s e s σ ds C σ 1 s e s σ ds which represets the N i the above equatio. Hece the distributio of z due to values of s which lie betwee s ad s+ds is y = s+ds s C σ ( π 0 ) s 1 e s (1+z ) σ ds C σ 1 s e s σ ds = ( π ) s+ds s 0 C σ s 1 (1+z ) e s σ ds C σ s e s σ ds ad summig for all values of s we have as a equatio givig the distributio of z ( ) s+ds C π s σ s 1 (1+z ) e s y = σ ds. σ C σ s e s σ ds By what we have already proved this reduces to ad to 0 y = (1+z ) 1, if be odd y = (1+z ) 1, if be eve Sice this equatio is idepedet of σ it will give the distributio of the distace of the mea of a sample from the mea of the populatio expressed i terms of the stadard deviatio of the sample for ay ormal populatio. 8

9 Sectio IV. Some Properties of the Stadard Deviatio Frequecy Curve By a similar method to that adopted for fidig the costat we may fid the mea ad momets: thus the mea is at I 1 /I, which is equal to ( π ) σ, if be eve, or (π ) σ, if be odd. The secod momet about the ed of the rage is I = ( 1)σ. I The third momet about the ed of the rage is equal to I +1 I = I +1 I 1. I 1 I = σ the mea. The fourth momet about the ed of the rage is equal to I + = ( 1)(+1) I σ 4. If we write the distace of the mea from the ed of the rage Dσ/ ad the momets about the ed of the rage ν 1, ν, etc., the ν 1 = Dσ, ν = 1 σ, ν 3 = Dσ3, ν 4 = N 1 σ 4. From this we get the momets about the mea: µ = σ ( 1 D ), µ 3 = σ3 {D 3( 1)D +D } = σ3 D {D +3}, µ 4 = σ { 1 4D +6( 1)D 3D 4 } = σ4 { 1 D (3D +6)}. It is of iterest to fid out what these become whe is large. 9

10 is I order to do this we must fid out what is the value of D. Now Wallis s expressio for π derived from the ifiite product value of six π (+1) = () ( 1). If we assume a quatity θ ( = a 0 + a1 +etc.) which we may add to the +1 i order to make the expressio approximate more rapidly to the truth, it is easy to show that θ = ( π etc., ad we get ) = () ( 1). From this we fid that whether be eve or odd D approximates to whe is large. Substitutig this value of D we get ( µ = σ 1 1 ) (1, µ = σ3 3 + ) , µ 4 = 3σ ( ) 16. Cosequetly the ( value of the stadard ) deviatio of a stadard deviatio σ which we have foud becomes the same as that foud for () {1 (1/4)} the ormal curve by Prof. Pearso {σ/()} whe is large eough to eglect the 1/4 i compariso with 1. Neglectig terms of lower order tha 1/, we fid β 1 = 3 (4 3), β) = 3 ( 1 1 )( 1+ 1 Cosequetly, as icreases, β very soo approaches the value 3 of the ormal curve, but β 1 vaishes more slowly, so that the curve remais slightly skew. Diagram I shows the theoretical distributio of the stadard deviatios foud from samples of 10. Sectio V. Some Properties of the Curve y = ( 4 3. π if be eve if be odd ) ). (1+z ) 1 Writig z = taθ the equatio becomes y = etc. cos θ, which affords a easy way of drawig the curve. Also dz = dθ/cos θ. This expressio will be foud to give a much closer approximatio to π tha Wallis s 10

11 Hece to fid the area of the curve betwee ay limits we must fid etc. cos θdθ = { [ 3 cos etc. cos 4 3 ]} θsiθ θdθ + = etc. cos 4 θdθ etc.[cos 3 θsiθ], ad by cotiuig the process the itegral may he evaluated. For example, if we wish to fid the area betwee 0 ad θ for = 8 we have Area = π = 4 3. π θ 0 θ 0 cos 6 θdθ cos 4 θdθ π cos5 θsiθ = θ π + 1 π cosθsiθ π cos3 θsiθ π cos5 θsiθ ad it will be oticed that for = 10 we shall merely have to add to this same expressio the term π cos7 θsiθ. 11

12 The tables at the ed of the paper give the area betwee ad z ( or θ = π ) ad θ = ta 1 z. This is the same as 0.5+the area betwee θ = 0, ad θ = ta 1 z, ad as the whole area of the curve is equal to 1, the tables give the probability that the mea of the sample does ot differ by more tha z times the stadard deviatio of the sample from the mea of the populatio. The whole area of the curve is equal to etc. π cos θdθ 1 π ad sice all the parts betwee the limits vaish at both limits this reduces to 1. Similarly, the secod momet coefficiet is equal to etc. π cos θta θdθ 1 π = etc. = 3 1 = π 1 π (cos 4 θ cos θ)dθ Hece the stadard deviatio of the curve is 1/ ( 3). The fourth momet coefficiet is equal to etc. π cos θta 4 θdθ 1 π = etc. + 1 π 1 π (cos 6 θ cos 4 θ +cos θ)dθ = ( ) 3 +1 = 3 ( 3)( 5). The odd momets are of course zero, a the curve is symmetrical, so β 1 = 0, β = 3( 3) 5 = Hece as it icreases the curve approaches the ormal curve whose stadard deviatio is 1/ ( 3). β, however, is always greater tha 3, idicatig that large deviatios are mere commo tha i the ormal curve. I have tabled the area for the ormal curve with stadard deviatio 1/ 7 so as to compare, with my curve for = It will be see that odds laid 3 See p. 9 1

13 accordig to either table would ot seriously differ till we reach z = 0.8, where the odds are about 50 to 1 that the mea is withi that limit: beyod that the ormal curve gives a false feelig of security, for example, accordig to the ormal curve it is 99,986 to 14 (say 7000 to 1) that the mea of the populatio lies betwee ad +1.3s, whereas the real odds are oly 99,819 to 181 (about 550 to 1). Now 50 to 1 correspods to three times the probable error i the ormal curve ad for most purposes it would be cosidered sigificat; for this reaso I have oly tabled my curves for values of ot greater tha 10, but have give the = 9 ad = 10 tables to oe further place of decimals. They ca he used as foudatios for fidig values for larger samples. 4 The table for = ca be readily costructed by lookig out θ = ta 1 z i Chambers s tables ad the θ/π gives the correspodig value. Similarly 1 siθ +0.5 gives the values whe = 3. There are two poits of iterest i the = curve. Here s is equal to half the distace betwee the two observatios, ta 1 s s = π 4, so that betwee +s ad z lies π 4 1 π or half the probability, i.e. if two observatios have bee made ad we have o other iformatio, it is a eve chace that the mea of the (ormal) populatio will lie betwee them. O the other had the secod 4 E.g. if = 11, to the correspodig value for = 9, we add cos8 θsiθ: 9 if = 13 we add as well cos10 θsiθ, ad so o. 13

14 momet coefficiet is 1 π + 1 π = 1 π ta θdθ = 1 π [taθ π= θ]+1 =, = 1 π or the stadard deviatio is ifiite while the probable error is fiite. Sectio VI. Practical Test of the foregoig Equatios Before I bad succeeded i solvig my problem aalytically, I had edeavoured to do so empirically. The material used was a correlatio table cotaiig the height ad left middle figer measuremets of 3000 crimials, from a paper by W. R. Macdoell (Biometrika, i, p. 19). The measuremets were writte out o 3000 pieces of cardboard, which were the very thoroughly shuffled ad draw at radom. As each card was draw its umbers were writte dow i a book, which thus cotais the measuremets of 3000 crimials i a radom order. Fially, each cosecutive set of 4 was take as a sample 750 i all ad the mea, stadard deviatio, ad correlatio 5 of each sample determied. The differece betwee the mea of each sample ad the mea of the populatio was the divided by the stadard deviatio of the sample, givig us the z of Sectio III. This provides us with two sets of 750 stadard deviatios ad two sets of 750 z s o which to test the theoretical results arrived at. The height ad left middle figer correlatio table was chose because the distributio of both was approximately ormal ad the correlatio was fairly high. Both frequecy curves, however, deviate slightly from ormality, the costats beig for height β 1 = 0.006, β = 3.176, ad for left middle figer legths β 1 = , β = 3.140, ad i cosequece there is a tedecy for a certai umber of larger stadard deviatios to occur tha if the distributios wore ormal. This, however, appears to make very little differece to the distributio of z. Aother thig which iterferes with the compariso is the comparatively large groups i which the observatios occur. The heights are arraged i 1 ich groups, the stadard deviatio beig oly.54 iches. while, the figer legths wore origially grouped i millimetres, but ufortuately I did ot at the time see the importace of havig a smaller uit ad codesed them ito millimetre groups, i terms of which the stadard deviatio is.74. Several curious results follow from takig samples of 4 from material disposed i such wide groups. The followig poits may be oticed: (1) The meas oly occur as multiples of 0.5. () The stadard deviatios occur as the square roots of the followig types of umbers:, +0.10, +0.5, +0.50, +0.69, (3) A stadard deviatio belogig to oe of these groups ca oly be associated with a mea of a particular kid; thus a stadard deviatio of ca 5 I hope to publish the results of the correlatio work shortly. 14

15 oly occur if the mea differs by a whole umber from the group we take as origi, while 1.69 will oly occur whe the mea is at ±0.5. (4) All the four idividuals of the sample will occasioally come from the same group, givig a zero value for the stadard deviatio. Now this leads to a ifiite value of z ad is clearly due to too wide a groupig, for although two me may have the same height whe measured by iches, yet the fier the measuremets the more seldom will they he idetical, till fially the chace that four me will have exactly the same height is ifiitely small. If we had smaller groupig the zero values of the stadard deviatio might be expected to icrease, ad a similar cosideratio will show that the smaller values of the stadard deviatio would also be likely to icrease, such as 0.436, whe 3 fall i oe group ad 1 i a adjacet group, or 0.50 whe fall i two adjacet groups. O the other had, whe the idividuals of the sample lie far apart, the argumet of Sheppard s correctio will apply, the real value of the stadard deviatio beig more likely to he smaller tha that foud owig to the frequecy i ay group beig greater o the side earer the mode. These two effects of groupig will ted to eutralize the effect o the mea value of the stadard deviatio, but both will icrease the variability. Accordigly, we fid that the mea value of the stadard deviatio is quite close to that calculated, while i each case the variability is sesibly greater. The fit of the curve is ot good, both for this reaso ad because the frequecy is ot evely distributed owig to effects () ad (3) of groupig. O the other had, the fit of the curve givig the frequecy of z is very good, ad as that is the oly practical poit the compariso may he cosidered satisfactory. The followig are the figures for height: Mea value of stadard deviatios: Calculated.07 ± 0.0 Observed.06 Differece = Stadard deviatio of stadard deviatios: Calculated ± Observed Differece Compariso of Fit. Theoretical Equatio: y = (π)σ x e x σ Scale i terms of stadard deviatios of populatio Calculated frequecy Observed frequecy Differece Whece χ = 48.06, P = (about). I tablig the observed frequecy, values betwee ad were icluded i oe group, while betwee ad they were divided over thetwogroups. AsaistaceoftheirregularityduetogroupigImaymetio 15

16 that there were 31 cases of stadard deviatios 1.30 (i terms of the groupig) which is i terms of the stadard deviatio of the populatio, ad they wore therefore divided over the groups 0.4 to 0.5 ad 0.5 to 0.6. Had they all bee couted i groups 0.5 to 0.6 χ would have falle to 0.85 ad P would have rise to The χ test presupposes radom samplig from a frequecy followig the give law, but this we have ot got owig to the iterferece of the groupig. Whe, however, we test the z s where the groupig has ot had so much effect, we fid a close correspodece betwee the theory ad the actual result. There were three cases of ifiite values of z which, for the reasos give above, were give the ext largest values which occurred, amely +6 or 6. The rest were divided ito groups of 0.1; 0.04, 0.05 ad 0.06, beig divided betwee the two groups o either side. The calculated value for the stadard deviatio of the frequecy curve was 1 (±0.0171), while the observed was The value of the stadard deviatio is really ifiite, as the fourth momet coefficiet is ifiite, but as we have arbitrarily limited the ifiite cases we may take as a approximatio 1/ 1500 from which the value of the probable error give above is obtaied. The fit of the curve is as follows: Compariso of Fit. Theoretical Equatio: y = N π cos4 θ, z = taθ Scale of z Calculated frequecy Observed frequecy Differece Whece χ = 1.44, P = This is very satisfactory, especially whe we cosider that as a rule observatios are tested agaist curves fitted from the mea ad oe or more other momets of the observatios, so that cosiderable correspodece is oly to ])c expected; while this curve is exposed to the full errors of radom samplig, its costats havig bee calculated quite apart from the observatios. The left middle figer samples show much the same features as those of the height, but as the groupig is ot so large compared to the variability the curves fit the observatios more closely. Diagrams III 6 ad IV give the stadard deviatios of the z s for the set of samples. The results are as follows: 6 There are three small mistakes i plottig the observed values i Diagram III, which make the fit appear worse tha it really is 16

17 Mea value of stadard deviatios: Calculated.186 ± 0.03 Observed.179 Differece = Stadard deviatio of stadard deviatios: Calculated 0.94 ± Observed Differece = Compariso of Fit. Theoretical Equatio: y = (π)σ x e x σ Scale i terms of stadard deviatios of populatio Calculated frequecy Observed frequecy Whece χ = 1.80, P = Value of stadard deviatio: Calculated 1(±0.017) Observed 0.98 Differece = Compariso of Fit. Theoretical Equatio: y = N π cos4 θ, z = taθ Scale of z Calculated frequecy Observed frequecy Differece Whece χ = 7.39, P = 0.9. A very close fit. We see the that if the distributio is approximately ormal our theory gives us a satisfactory measure of the certaity to be derived from a small sample i both the cases we have tested; but we have a idicatio that a fie groupig is of advatage. If the distributio is ot ormal, the mea ad the stadard deviatio of a sample will be positively correlated, so although both will have greater variability, yet they will ted to couteract oe aother, a mea derivig largely from the geeral mea tedig to be divided by a larger stadard deviatio. Cosequetly, I believe that the table give i Sectio VII below may be used i estimatig the degree of certaity arrived at by the mea of a few experimets, i the case of most laboratory or biological work where the distributios are as a rule of a cocked hat type ad so sufficietly early ormal 17

18 18

19 3. 4 Sectio VII. Tables of ( odd ) ta 1 z 1.1 π eve 1 π cos θdθ for values of from 4 to 10 iclusive Together with (π) 7 x e 7x dx for compariso whe = 10 z ( = x s) = 4 = 5 = 6 = 7 = 8 = 9 = 10 ( For compariso 7 ) x (π) e 7x dx Explaatio of Tables The tables give the probability that the value of the mea, measured from the mea of the populatio, i terms of the stadard deviatio of the sample, will lie betwee ad z. Thus, to take the table for samples of 6, the probability of the mea of the populatio lyig betwee ad oce the stadard deviatio of the sample is 0.96, the odds are about 4 to 1 that the mea of the populatio lies betwee these limits. 19

20 The probability is therefore that it is greater tha oce the stadard deviatio ad that it lies outside ±1.0 times the stadard deviatio. Illustratio of Method Illustratio I. As a istace of the kid of use which may be made of the tables, I take the followig figures from a table by A. R. Cushy ad A. R. Peebles i the Joural of Physiology for 1904, showig the differet effects of the optical isomers of hyoscyamie hydrobromide i producig sleep. The average umber of hours sleep gaied by the use of the drug is tabulated below. The coclusio arrived at was that i the usual doses was, but 1 was ot, of value as a soporific. Additioal hours sleep gaied by the use of hyoscyamie hydrobromide Patiet 1 (Dextro-) (Laevo-) Differece ( 1) Mea Mea +.33 Mea s.d s.d s.d First let us see what is the probability that 1 will o the average give icrease of sleep; i.e. what is the chace that the mea of the populatio of which these experimets are a sample is positive /1.70 = 0.44, ad lookig out z = 0.44 i the table for te experimets we fid by iterpolatig betwee ad that 0.44 correspods to , or the odds are to that the mea is positive. That is about 8 to 1, ad would correspod to the ormal curve to about 1.8 times the probable error. It is the very likely that 1 gives a icrease of sleep, but would occasio o surprise if the results were reversed by further experimets. If ow we cosider the chace that is actually a soporific we have the mea iclrease of sleep =.33/1.90 or 1.3 times the s.d. From the table the probability correspodig to this is , i.e. the odds are early 400 to 1 that such is the case. This correspods to about 4.15 times the probable error i the ormal curve. But I take it that the real poit of the authors was that is better tha 1. This we must t4est by makig a ew series, subtractig 1 from. The mea values of this series is +1.38, while the s.d. is 1.17, the mea value beig times the s.d. From the table, the probability is , or the odds are about 666 to oe that is the better soporific. The low value of 0

I. Chi-squared Distributions

I. Chi-squared Distributions 1 M 358K Supplemet to Chapter 23: CHI-SQUARED DISTRIBUTIONS, T-DISTRIBUTIONS, AND DEGREES OF FREEDOM To uderstad t-distributios, we first eed to look at aother family of distributios, the chi-squared distributios.

More information

Hypothesis testing. Null and alternative hypotheses

Hypothesis testing. Null and alternative hypotheses Hypothesis testig Aother importat use of samplig distributios is to test hypotheses about populatio parameters, e.g. mea, proportio, regressio coefficiets, etc. For example, it is possible to stipulate

More information

.04. This means $1000 is multiplied by 1.02 five times, once for each of the remaining sixmonth

.04. This means $1000 is multiplied by 1.02 five times, once for each of the remaining sixmonth Questio 1: What is a ordiary auity? Let s look at a ordiary auity that is certai ad simple. By this, we mea a auity over a fixed term whose paymet period matches the iterest coversio period. Additioally,

More information

Confidence Intervals for One Mean

Confidence Intervals for One Mean Chapter 420 Cofidece Itervals for Oe Mea Itroductio This routie calculates the sample size ecessary to achieve a specified distace from the mea to the cofidece limit(s) at a stated cofidece level for a

More information

1. C. The formula for the confidence interval for a population mean is: x t, which was

1. C. The formula for the confidence interval for a population mean is: x t, which was s 1. C. The formula for the cofidece iterval for a populatio mea is: x t, which was based o the sample Mea. So, x is guarateed to be i the iterval you form.. D. Use the rule : p-value

More information

5: Introduction to Estimation

5: Introduction to Estimation 5: Itroductio to Estimatio Cotets Acroyms ad symbols... 1 Statistical iferece... Estimatig µ with cofidece... 3 Samplig distributio of the mea... 3 Cofidece Iterval for μ whe σ is kow before had... 4 Sample

More information

The following example will help us understand The Sampling Distribution of the Mean. C1 C2 C3 C4 C5 50 miles 84 miles 38 miles 120 miles 48 miles

The following example will help us understand The Sampling Distribution of the Mean. C1 C2 C3 C4 C5 50 miles 84 miles 38 miles 120 miles 48 miles The followig eample will help us uderstad The Samplig Distributio of the Mea Review: The populatio is the etire collectio of all idividuals or objects of iterest The sample is the portio of the populatio

More information

PSYCHOLOGICAL STATISTICS

PSYCHOLOGICAL STATISTICS UNIVERSITY OF CALICUT SCHOOL OF DISTANCE EDUCATION B Sc. Cousellig Psychology (0 Adm.) IV SEMESTER COMPLEMENTARY COURSE PSYCHOLOGICAL STATISTICS QUESTION BANK. Iferetial statistics is the brach of statistics

More information

GCSE STATISTICS. 4) How to calculate the range: The difference between the biggest number and the smallest number.

GCSE STATISTICS. 4) How to calculate the range: The difference between the biggest number and the smallest number. GCSE STATISTICS You should kow: 1) How to draw a frequecy diagram: e.g. NUMBER TALLY FREQUENCY 1 3 5 ) How to draw a bar chart, a pictogram, ad a pie chart. 3) How to use averages: a) Mea - add up all

More information

Department of Computer Science, University of Otago

Department of Computer Science, University of Otago Departmet of Computer Sciece, Uiversity of Otago Techical Report OUCS-2006-09 Permutatios Cotaiig May Patters Authors: M.H. Albert Departmet of Computer Sciece, Uiversity of Otago Micah Colema, Rya Fly

More information

CHAPTER 7: Central Limit Theorem: CLT for Averages (Means)

CHAPTER 7: Central Limit Theorem: CLT for Averages (Means) CHAPTER 7: Cetral Limit Theorem: CLT for Averages (Meas) X = the umber obtaied whe rollig oe six sided die oce. If we roll a six sided die oce, the mea of the probability distributio is X P(X = x) Simulatio:

More information

Chapter 7 Methods of Finding Estimators

Chapter 7 Methods of Finding Estimators Chapter 7 for BST 695: Special Topics i Statistical Theory. Kui Zhag, 011 Chapter 7 Methods of Fidig Estimators Sectio 7.1 Itroductio Defiitio 7.1.1 A poit estimator is ay fuctio W( X) W( X1, X,, X ) of

More information

1 Correlation and Regression Analysis

1 Correlation and Regression Analysis 1 Correlatio ad Regressio Aalysis I this sectio we will be ivestigatig the relatioship betwee two cotiuous variable, such as height ad weight, the cocetratio of a ijected drug ad heart rate, or the cosumptio

More information

Case Study. Normal and t Distributions. Density Plot. Normal Distributions

Case Study. Normal and t Distributions. Density Plot. Normal Distributions Case Study Normal ad t Distributios Bret Halo ad Bret Larget Departmet of Statistics Uiversity of Wiscosi Madiso October 11 13, 2011 Case Study Body temperature varies withi idividuals over time (it ca

More information

Z-TEST / Z-STATISTIC: used to test hypotheses about. µ when the population standard deviation is unknown

Z-TEST / Z-STATISTIC: used to test hypotheses about. µ when the population standard deviation is unknown Z-TEST / Z-STATISTIC: used to test hypotheses about µ whe the populatio stadard deviatio is kow ad populatio distributio is ormal or sample size is large T-TEST / T-STATISTIC: used to test hypotheses about

More information

1 Computing the Standard Deviation of Sample Means

1 Computing the Standard Deviation of Sample Means Computig the Stadard Deviatio of Sample Meas Quality cotrol charts are based o sample meas ot o idividual values withi a sample. A sample is a group of items, which are cosidered all together for our aalysis.

More information

Center, Spread, and Shape in Inference: Claims, Caveats, and Insights

Center, Spread, and Shape in Inference: Claims, Caveats, and Insights Ceter, Spread, ad Shape i Iferece: Claims, Caveats, ad Isights Dr. Nacy Pfeig (Uiversity of Pittsburgh) AMATYC November 2008 Prelimiary Activities 1. I would like to produce a iterval estimate for the

More information

Lesson 17 Pearson s Correlation Coefficient

Lesson 17 Pearson s Correlation Coefficient Outlie Measures of Relatioships Pearso s Correlatio Coefficiet (r) -types of data -scatter plots -measure of directio -measure of stregth Computatio -covariatio of X ad Y -uique variatio i X ad Y -measurig

More information

Analyzing Longitudinal Data from Complex Surveys Using SUDAAN

Analyzing Longitudinal Data from Complex Surveys Using SUDAAN Aalyzig Logitudial Data from Complex Surveys Usig SUDAAN Darryl Creel Statistics ad Epidemiology, RTI Iteratioal, 312 Trotter Farm Drive, Rockville, MD, 20850 Abstract SUDAAN: Software for the Statistical

More information

Confidence Intervals. CI for a population mean (σ is known and n > 30 or the variable is normally distributed in the.

Confidence Intervals. CI for a population mean (σ is known and n > 30 or the variable is normally distributed in the. Cofidece Itervals A cofidece iterval is a iterval whose purpose is to estimate a parameter (a umber that could, i theory, be calculated from the populatio, if measuremets were available for the whole populatio).

More information

University of California, Los Angeles Department of Statistics. Distributions related to the normal distribution

University of California, Los Angeles Department of Statistics. Distributions related to the normal distribution Uiversity of Califoria, Los Ageles Departmet of Statistics Statistics 100B Istructor: Nicolas Christou Three importat distributios: Distributios related to the ormal distributio Chi-square (χ ) distributio.

More information

A Mathematical Perspective on Gambling

A Mathematical Perspective on Gambling A Mathematical Perspective o Gamblig Molly Maxwell Abstract. This paper presets some basic topics i probability ad statistics, icludig sample spaces, probabilistic evets, expectatios, the biomial ad ormal

More information

Basic Elements of Arithmetic Sequences and Series

Basic Elements of Arithmetic Sequences and Series MA40S PRE-CALCULUS UNIT G GEOMETRIC SEQUENCES CLASS NOTES (COMPLETED NO NEED TO COPY NOTES FROM OVERHEAD) Basic Elemets of Arithmetic Sequeces ad Series Objective: To establish basic elemets of arithmetic

More information

Determining the sample size

Determining the sample size Determiig the sample size Oe of the most commo questios ay statisticia gets asked is How large a sample size do I eed? Researchers are ofte surprised to fid out that the aswer depeds o a umber of factors

More information

In nite Sequences. Dr. Philippe B. Laval Kennesaw State University. October 9, 2008

In nite Sequences. Dr. Philippe B. Laval Kennesaw State University. October 9, 2008 I ite Sequeces Dr. Philippe B. Laval Keesaw State Uiversity October 9, 2008 Abstract This had out is a itroductio to i ite sequeces. mai de itios ad presets some elemetary results. It gives the I ite Sequeces

More information

Sequences and Series

Sequences and Series CHAPTER 9 Sequeces ad Series 9.. Covergece: Defiitio ad Examples Sequeces The purpose of this chapter is to itroduce a particular way of geeratig algorithms for fidig the values of fuctios defied by their

More information

Week 3 Conditional probabilities, Bayes formula, WEEK 3 page 1 Expected value of a random variable

Week 3 Conditional probabilities, Bayes formula, WEEK 3 page 1 Expected value of a random variable Week 3 Coditioal probabilities, Bayes formula, WEEK 3 page 1 Expected value of a radom variable We recall our discussio of 5 card poker hads. Example 13 : a) What is the probability of evet A that a 5

More information

Measures of Spread and Boxplots Discrete Math, Section 9.4

Measures of Spread and Boxplots Discrete Math, Section 9.4 Measures of Spread ad Boxplots Discrete Math, Sectio 9.4 We start with a example: Example 1: Comparig Mea ad Media Compute the mea ad media of each data set: S 1 = {4, 6, 8, 10, 1, 14, 16} S = {4, 7, 9,

More information

CHAPTER 3 DIGITAL CODING OF SIGNALS

CHAPTER 3 DIGITAL CODING OF SIGNALS CHAPTER 3 DIGITAL CODING OF SIGNALS Computers are ofte used to automate the recordig of measuremets. The trasducers ad sigal coditioig circuits produce a voltage sigal that is proportioal to a quatity

More information

One-sample test of proportions

One-sample test of proportions Oe-sample test of proportios The Settig: Idividuals i some populatio ca be classified ito oe of two categories. You wat to make iferece about the proportio i each category, so you draw a sample. Examples:

More information

Hypergeometric Distributions

Hypergeometric Distributions 7.4 Hypergeometric Distributios Whe choosig the startig lie-up for a game, a coach obviously has to choose a differet player for each positio. Similarly, whe a uio elects delegates for a covetio or you

More information

BINOMIAL EXPANSIONS 12.5. In this section. Some Examples. Obtaining the Coefficients

BINOMIAL EXPANSIONS 12.5. In this section. Some Examples. Obtaining the Coefficients 652 (12-26) Chapter 12 Sequeces ad Series 12.5 BINOMIAL EXPANSIONS I this sectio Some Examples Otaiig the Coefficiets The Biomial Theorem I Chapter 5 you leared how to square a iomial. I this sectio you

More information

MEI Structured Mathematics. Module Summary Sheets. Statistics 2 (Version B: reference to new book)

MEI Structured Mathematics. Module Summary Sheets. Statistics 2 (Version B: reference to new book) MEI Mathematics i Educatio ad Idustry MEI Structured Mathematics Module Summary Sheets Statistics (Versio B: referece to ew book) Topic : The Poisso Distributio Topic : The Normal Distributio Topic 3:

More information

Inference on Proportion. Chapter 8 Tests of Statistical Hypotheses. Sampling Distribution of Sample Proportion. Confidence Interval

Inference on Proportion. Chapter 8 Tests of Statistical Hypotheses. Sampling Distribution of Sample Proportion. Confidence Interval Chapter 8 Tests of Statistical Hypotheses 8. Tests about Proportios HT - Iferece o Proportio Parameter: Populatio Proportio p (or π) (Percetage of people has o health isurace) x Statistic: Sample Proportio

More information

Trigonometric Form of a Complex Number. The Complex Plane. axis. ( 2, 1) or 2 i FIGURE 6.44. The absolute value of the complex number z a bi is

Trigonometric Form of a Complex Number. The Complex Plane. axis. ( 2, 1) or 2 i FIGURE 6.44. The absolute value of the complex number z a bi is 0_0605.qxd /5/05 0:45 AM Page 470 470 Chapter 6 Additioal Topics i Trigoometry 6.5 Trigoometric Form of a Complex Number What you should lear Plot complex umbers i the complex plae ad fid absolute values

More information

Statistical inference: example 1. Inferential Statistics

Statistical inference: example 1. Inferential Statistics Statistical iferece: example 1 Iferetial Statistics POPULATION SAMPLE A clothig store chai regularly buys from a supplier large quatities of a certai piece of clothig. Each item ca be classified either

More information

A probabilistic proof of a binomial identity

A probabilistic proof of a binomial identity A probabilistic proof of a biomial idetity Joatho Peterso Abstract We give a elemetary probabilistic proof of a biomial idetity. The proof is obtaied by computig the probability of a certai evet i two

More information

SECTION 1.5 : SUMMATION NOTATION + WORK WITH SEQUENCES

SECTION 1.5 : SUMMATION NOTATION + WORK WITH SEQUENCES SECTION 1.5 : SUMMATION NOTATION + WORK WITH SEQUENCES Read Sectio 1.5 (pages 5 9) Overview I Sectio 1.5 we lear to work with summatio otatio ad formulas. We will also itroduce a brief overview of sequeces,

More information

Lecture 4: Cauchy sequences, Bolzano-Weierstrass, and the Squeeze theorem

Lecture 4: Cauchy sequences, Bolzano-Weierstrass, and the Squeeze theorem Lecture 4: Cauchy sequeces, Bolzao-Weierstrass, ad the Squeeze theorem The purpose of this lecture is more modest tha the previous oes. It is to state certai coditios uder which we are guarateed that limits

More information

Chapter 6: Variance, the law of large numbers and the Monte-Carlo method

Chapter 6: Variance, the law of large numbers and the Monte-Carlo method Chapter 6: Variace, the law of large umbers ad the Mote-Carlo method Expected value, variace, ad Chebyshev iequality. If X is a radom variable recall that the expected value of X, E[X] is the average value

More information

Here are a couple of warnings to my students who may be here to get a copy of what happened on a day that you missed.

Here are a couple of warnings to my students who may be here to get a copy of what happened on a day that you missed. This documet was writte ad copyrighted by Paul Dawkis. Use of this documet ad its olie versio is govered by the Terms ad Coditios of Use located at http://tutorial.math.lamar.edu/terms.asp. The olie versio

More information

Output Analysis (2, Chapters 10 &11 Law)

Output Analysis (2, Chapters 10 &11 Law) B. Maddah ENMG 6 Simulatio 05/0/07 Output Aalysis (, Chapters 10 &11 Law) Comparig alterative system cofiguratio Sice the output of a simulatio is radom, the comparig differet systems via simulatio should

More information

Incremental calculation of weighted mean and variance

Incremental calculation of weighted mean and variance Icremetal calculatio of weighted mea ad variace Toy Fich faf@cam.ac.uk dot@dotat.at Uiversity of Cambridge Computig Service February 009 Abstract I these otes I eplai how to derive formulae for umerically

More information

Overview. Learning Objectives. Point Estimate. Estimation. Estimating the Value of a Parameter Using Confidence Intervals

Overview. Learning Objectives. Point Estimate. Estimation. Estimating the Value of a Parameter Using Confidence Intervals Overview Estimatig the Value of a Parameter Usig Cofidece Itervals We apply the results about the sample mea the problem of estimatio Estimatio is the process of usig sample data estimate the value of

More information

0.7 0.6 0.2 0 0 96 96.5 97 97.5 98 98.5 99 99.5 100 100.5 96.5 97 97.5 98 98.5 99 99.5 100 100.5

0.7 0.6 0.2 0 0 96 96.5 97 97.5 98 98.5 99 99.5 100 100.5 96.5 97 97.5 98 98.5 99 99.5 100 100.5 Sectio 13 Kolmogorov-Smirov test. Suppose that we have a i.i.d. sample X 1,..., X with some ukow distributio P ad we would like to test the hypothesis that P is equal to a particular distributio P 0, i.e.

More information

Ekkehart Schlicht: Economic Surplus and Derived Demand

Ekkehart Schlicht: Economic Surplus and Derived Demand Ekkehart Schlicht: Ecoomic Surplus ad Derived Demad Muich Discussio Paper No. 2006-17 Departmet of Ecoomics Uiversity of Muich Volkswirtschaftliche Fakultät Ludwig-Maximilias-Uiversität Müche Olie at http://epub.ub.ui-mueche.de/940/

More information

5.4 Amortization. Question 1: How do you find the present value of an annuity? Question 2: How is a loan amortized?

5.4 Amortization. Question 1: How do you find the present value of an annuity? Question 2: How is a loan amortized? 5.4 Amortizatio Questio 1: How do you fid the preset value of a auity? Questio 2: How is a loa amortized? Questio 3: How do you make a amortizatio table? Oe of the most commo fiacial istrumets a perso

More information

Soving Recurrence Relations

Soving Recurrence Relations Sovig Recurrece Relatios Part 1. Homogeeous liear 2d degree relatios with costat coefficiets. Cosider the recurrece relatio ( ) T () + at ( 1) + bt ( 2) = 0 This is called a homogeeous liear 2d degree

More information

Normal Distribution.

Normal Distribution. Normal Distributio www.icrf.l Normal distributio I probability theory, the ormal or Gaussia distributio, is a cotiuous probability distributio that is ofte used as a first approimatio to describe realvalued

More information

NATIONAL SENIOR CERTIFICATE GRADE 12

NATIONAL SENIOR CERTIFICATE GRADE 12 NATIONAL SENIOR CERTIFICATE GRADE MATHEMATICS P EXEMPLAR 04 MARKS: 50 TIME: 3 hours This questio paper cosists of 8 pages ad iformatio sheet. Please tur over Mathematics/P DBE/04 NSC Grade Eemplar INSTRUCTIONS

More information

The analysis of the Cournot oligopoly model considering the subjective motive in the strategy selection

The analysis of the Cournot oligopoly model considering the subjective motive in the strategy selection The aalysis of the Courot oligopoly model cosiderig the subjective motive i the strategy selectio Shigehito Furuyama Teruhisa Nakai Departmet of Systems Maagemet Egieerig Faculty of Egieerig Kasai Uiversity

More information

THE REGRESSION MODEL IN MATRIX FORM. For simple linear regression, meaning one predictor, the model is. for i = 1, 2, 3,, n

THE REGRESSION MODEL IN MATRIX FORM. For simple linear regression, meaning one predictor, the model is. for i = 1, 2, 3,, n We will cosider the liear regressio model i matrix form. For simple liear regressio, meaig oe predictor, the model is i = + x i + ε i for i =,,,, This model icludes the assumptio that the ε i s are a sample

More information

CS103A Handout 23 Winter 2002 February 22, 2002 Solving Recurrence Relations

CS103A Handout 23 Winter 2002 February 22, 2002 Solving Recurrence Relations CS3A Hadout 3 Witer 00 February, 00 Solvig Recurrece Relatios Itroductio A wide variety of recurrece problems occur i models. Some of these recurrece relatios ca be solved usig iteratio or some other ad

More information

SEQUENCES AND SERIES

SEQUENCES AND SERIES Chapter 9 SEQUENCES AND SERIES Natural umbers are the product of huma spirit. DEDEKIND 9.1 Itroductio I mathematics, the word, sequece is used i much the same way as it is i ordiary Eglish. Whe we say

More information

Section 11.3: The Integral Test

Section 11.3: The Integral Test Sectio.3: The Itegral Test Most of the series we have looked at have either diverged or have coverged ad we have bee able to fid what they coverge to. I geeral however, the problem is much more difficult

More information

3. Greatest Common Divisor - Least Common Multiple

3. Greatest Common Divisor - Least Common Multiple 3 Greatest Commo Divisor - Least Commo Multiple Defiitio 31: The greatest commo divisor of two atural umbers a ad b is the largest atural umber c which divides both a ad b We deote the greatest commo gcd

More information

Definition. A variable X that takes on values X 1, X 2, X 3,...X k with respective frequencies f 1, f 2, f 3,...f k has mean

Definition. A variable X that takes on values X 1, X 2, X 3,...X k with respective frequencies f 1, f 2, f 3,...f k has mean 1 Social Studies 201 October 13, 2004 Note: The examples i these otes may be differet tha used i class. However, the examples are similar ad the methods used are idetical to what was preseted i class.

More information

Chapter 7: Confidence Interval and Sample Size

Chapter 7: Confidence Interval and Sample Size Chapter 7: Cofidece Iterval ad Sample Size Learig Objectives Upo successful completio of Chapter 7, you will be able to: Fid the cofidece iterval for the mea, proportio, ad variace. Determie the miimum

More information

Biology 171L Environment and Ecology Lab Lab 2: Descriptive Statistics, Presenting Data and Graphing Relationships

Biology 171L Environment and Ecology Lab Lab 2: Descriptive Statistics, Presenting Data and Graphing Relationships Biology 171L Eviromet ad Ecology Lab Lab : Descriptive Statistics, Presetig Data ad Graphig Relatioships Itroductio Log lists of data are ofte ot very useful for idetifyig geeral treds i the data or the

More information

Chapter 7 - Sampling Distributions. 1 Introduction. What is statistics? It consist of three major areas:

Chapter 7 - Sampling Distributions. 1 Introduction. What is statistics? It consist of three major areas: Chapter 7 - Samplig Distributios 1 Itroductio What is statistics? It cosist of three major areas: Data Collectio: samplig plas ad experimetal desigs Descriptive Statistics: umerical ad graphical summaries

More information

MEP Pupil Text 9. The mean, median and mode are three different ways of describing the average.

MEP Pupil Text 9. The mean, median and mode are three different ways of describing the average. 9 Data Aalysis 9. Mea, Media, Mode ad Rage I Uit 8, you were lookig at ways of collectig ad represetig data. I this uit, you will go oe step further ad fid out how to calculate statistical quatities which

More information

Asymptotic Growth of Functions

Asymptotic Growth of Functions CMPS Itroductio to Aalysis of Algorithms Fall 3 Asymptotic Growth of Fuctios We itroduce several types of asymptotic otatio which are used to compare the performace ad efficiecy of algorithms As we ll

More information

Mann-Whitney U 2 Sample Test (a.k.a. Wilcoxon Rank Sum Test)

Mann-Whitney U 2 Sample Test (a.k.a. Wilcoxon Rank Sum Test) No-Parametric ivariate Statistics: Wilcoxo-Ma-Whitey 2 Sample Test 1 Ma-Whitey 2 Sample Test (a.k.a. Wilcoxo Rak Sum Test) The (Wilcoxo-) Ma-Whitey (WMW) test is the o-parametric equivalet of a pooled

More information

Example 2 Find the square root of 0. The only square root of 0 is 0 (since 0 is not positive or negative, so those choices don t exist here).

Example 2 Find the square root of 0. The only square root of 0 is 0 (since 0 is not positive or negative, so those choices don t exist here). BEGINNING ALGEBRA Roots ad Radicals (revised summer, 00 Olso) Packet to Supplemet the Curret Textbook - Part Review of Square Roots & Irratioals (This portio ca be ay time before Part ad should mostly

More information

Research Method (I) --Knowledge on Sampling (Simple Random Sampling)

Research Method (I) --Knowledge on Sampling (Simple Random Sampling) Research Method (I) --Kowledge o Samplig (Simple Radom Samplig) 1. Itroductio to samplig 1.1 Defiitio of samplig Samplig ca be defied as selectig part of the elemets i a populatio. It results i the fact

More information

WHEN IS THE (CO)SINE OF A RATIONAL ANGLE EQUAL TO A RATIONAL NUMBER?

WHEN IS THE (CO)SINE OF A RATIONAL ANGLE EQUAL TO A RATIONAL NUMBER? WHEN IS THE (CO)SINE OF A RATIONAL ANGLE EQUAL TO A RATIONAL NUMBER? JÖRG JAHNEL 1. My Motivatio Some Sort of a Itroductio Last term I tought Topological Groups at the Göttige Georg August Uiversity. This

More information

Approximating Area under a curve with rectangles. To find the area under a curve we approximate the area using rectangles and then use limits to find

Approximating Area under a curve with rectangles. To find the area under a curve we approximate the area using rectangles and then use limits to find 1.8 Approximatig Area uder a curve with rectagles 1.6 To fid the area uder a curve we approximate the area usig rectagles ad the use limits to fid 1.4 the area. Example 1 Suppose we wat to estimate 1.

More information

2-3 The Remainder and Factor Theorems

2-3 The Remainder and Factor Theorems - The Remaider ad Factor Theorems Factor each polyomial completely usig the give factor ad log divisio 1 x + x x 60; x + So, x + x x 60 = (x + )(x x 15) Factorig the quadratic expressio yields x + x x

More information

Tradigms of Astundithi and Toyota

Tradigms of Astundithi and Toyota Tradig the radomess - Desigig a optimal tradig strategy uder a drifted radom walk price model Yuao Wu Math 20 Project Paper Professor Zachary Hamaker Abstract: I this paper the author iteds to explore

More information

COMPARISON OF THE EFFICIENCY OF S-CONTROL CHART AND EWMA-S 2 CONTROL CHART FOR THE CHANGES IN A PROCESS

COMPARISON OF THE EFFICIENCY OF S-CONTROL CHART AND EWMA-S 2 CONTROL CHART FOR THE CHANGES IN A PROCESS COMPARISON OF THE EFFICIENCY OF S-CONTROL CHART AND EWMA-S CONTROL CHART FOR THE CHANGES IN A PROCESS Supraee Lisawadi Departmet of Mathematics ad Statistics, Faculty of Sciece ad Techoology, Thammasat

More information

Lesson 15 ANOVA (analysis of variance)

Lesson 15 ANOVA (analysis of variance) Outlie Variability -betwee group variability -withi group variability -total variability -F-ratio Computatio -sums of squares (betwee/withi/total -degrees of freedom (betwee/withi/total -mea square (betwee/withi

More information

Repeating Decimals are decimal numbers that have number(s) after the decimal point that repeat in a pattern.

Repeating Decimals are decimal numbers that have number(s) after the decimal point that repeat in a pattern. 5.5 Fractios ad Decimals Steps for Chagig a Fractio to a Decimal. Simplify the fractio, if possible. 2. Divide the umerator by the deomiator. d d Repeatig Decimals Repeatig Decimals are decimal umbers

More information

Confidence Intervals

Confidence Intervals Cofidece Itervals Cofidece Itervals are a extesio of the cocept of Margi of Error which we met earlier i this course. Remember we saw: The sample proportio will differ from the populatio proportio by more

More information

INVESTMENT PERFORMANCE COUNCIL (IPC)

INVESTMENT PERFORMANCE COUNCIL (IPC) INVESTMENT PEFOMANCE COUNCIL (IPC) INVITATION TO COMMENT: Global Ivestmet Performace Stadards (GIPS ) Guidace Statemet o Calculatio Methodology The Associatio for Ivestmet Maagemet ad esearch (AIM) seeks

More information

How to read A Mutual Fund shareholder report

How to read A Mutual Fund shareholder report Ivestor BulletI How to read A Mutual Fud shareholder report The SEC s Office of Ivestor Educatio ad Advocacy is issuig this Ivestor Bulleti to educate idividual ivestors about mutual fud shareholder reports.

More information

Properties of MLE: consistency, asymptotic normality. Fisher information.

Properties of MLE: consistency, asymptotic normality. Fisher information. Lecture 3 Properties of MLE: cosistecy, asymptotic ormality. Fisher iformatio. I this sectio we will try to uderstad why MLEs are good. Let us recall two facts from probability that we be used ofte throughout

More information

Descriptive Statistics

Descriptive Statistics Descriptive Statistics We leared to describe data sets graphically. We ca also describe a data set umerically. Measures of Locatio Defiitio The sample mea is the arithmetic average of values. We deote

More information

hp calculators HP 12C Statistics - average and standard deviation Average and standard deviation concepts HP12C average and standard deviation

hp calculators HP 12C Statistics - average and standard deviation Average and standard deviation concepts HP12C average and standard deviation HP 1C Statistics - average ad stadard deviatio Average ad stadard deviatio cocepts HP1C average ad stadard deviatio Practice calculatig averages ad stadard deviatios with oe or two variables HP 1C Statistics

More information

Modified Line Search Method for Global Optimization

Modified Line Search Method for Global Optimization Modified Lie Search Method for Global Optimizatio Cria Grosa ad Ajith Abraham Ceter of Excellece for Quatifiable Quality of Service Norwegia Uiversity of Sciece ad Techology Trodheim, Norway {cria, ajith}@q2s.tu.o

More information

Quadrat Sampling in Population Ecology

Quadrat Sampling in Population Ecology Quadrat Samplig i Populatio Ecology Backgroud Estimatig the abudace of orgaisms. Ecology is ofte referred to as the "study of distributio ad abudace". This beig true, we would ofte like to kow how may

More information

The Stable Marriage Problem

The Stable Marriage Problem The Stable Marriage Problem William Hut Lae Departmet of Computer Sciece ad Electrical Egieerig, West Virgiia Uiversity, Morgatow, WV William.Hut@mail.wvu.edu 1 Itroductio Imagie you are a matchmaker,

More information

, a Wishart distribution with n -1 degrees of freedom and scale matrix.

, a Wishart distribution with n -1 degrees of freedom and scale matrix. UMEÅ UNIVERSITET Matematisk-statistiska istitutioe Multivariat dataaalys D MSTD79 PA TENTAMEN 004-0-9 LÖSNINGSFÖRSLAG TILL TENTAMEN I MATEMATISK STATISTIK Multivariat dataaalys D, 5 poäg.. Assume that

More information

Maximum Likelihood Estimators.

Maximum Likelihood Estimators. Lecture 2 Maximum Likelihood Estimators. Matlab example. As a motivatio, let us look at oe Matlab example. Let us geerate a radom sample of size 00 from beta distributio Beta(5, 2). We will lear the defiitio

More information

UC Berkeley Department of Electrical Engineering and Computer Science. EE 126: Probablity and Random Processes. Solutions 9 Spring 2006

UC Berkeley Department of Electrical Engineering and Computer Science. EE 126: Probablity and Random Processes. Solutions 9 Spring 2006 Exam format UC Bereley Departmet of Electrical Egieerig ad Computer Sciece EE 6: Probablity ad Radom Processes Solutios 9 Sprig 006 The secod midterm will be held o Wedesday May 7; CHECK the fial exam

More information

STA 2023 Practice Questions Exam 2 Chapter 7- sec 9.2. Case parameter estimator standard error Estimate of standard error

STA 2023 Practice Questions Exam 2 Chapter 7- sec 9.2. Case parameter estimator standard error Estimate of standard error STA 2023 Practice Questios Exam 2 Chapter 7- sec 9.2 Formulas Give o the test: Case parameter estimator stadard error Estimate of stadard error Samplig Distributio oe mea x s t (-1) oe p ( 1 p) CI: prop.

More information

Escola Federal de Engenharia de Itajubá

Escola Federal de Engenharia de Itajubá Escola Federal de Egeharia de Itajubá Departameto de Egeharia Mecâica Pós-Graduação em Egeharia Mecâica MPF04 ANÁLISE DE SINAIS E AQUISÇÃO DE DADOS SINAIS E SISTEMAS Trabalho 02 (MATLAB) Prof. Dr. José

More information

Building Blocks Problem Related to Harmonic Series

Building Blocks Problem Related to Harmonic Series TMME, vol3, o, p.76 Buildig Blocks Problem Related to Harmoic Series Yutaka Nishiyama Osaka Uiversity of Ecoomics, Japa Abstract: I this discussio I give a eplaatio of the divergece ad covergece of ifiite

More information

Convexity, Inequalities, and Norms

Convexity, Inequalities, and Norms Covexity, Iequalities, ad Norms Covex Fuctios You are probably familiar with the otio of cocavity of fuctios. Give a twicedifferetiable fuctio ϕ: R R, We say that ϕ is covex (or cocave up) if ϕ (x) 0 for

More information

Project Deliverables. CS 361, Lecture 28. Outline. Project Deliverables. Administrative. Project Comments

Project Deliverables. CS 361, Lecture 28. Outline. Project Deliverables. Administrative. Project Comments Project Deliverables CS 361, Lecture 28 Jared Saia Uiversity of New Mexico Each Group should tur i oe group project cosistig of: About 6-12 pages of text (ca be loger with appedix) 6-12 figures (please

More information

THE ROLE OF EXPORTS IN ECONOMIC GROWTH WITH REFERENCE TO ETHIOPIAN COUNTRY

THE ROLE OF EXPORTS IN ECONOMIC GROWTH WITH REFERENCE TO ETHIOPIAN COUNTRY - THE ROLE OF EXPORTS IN ECONOMIC GROWTH WITH REFERENCE TO ETHIOPIAN COUNTRY BY: FAYE ENSERMU CHEMEDA Ethio-Italia Cooperatio Arsi-Bale Rural developmet Project Paper Prepared for the Coferece o Aual Meetig

More information

GCE Further Mathematics (6360) Further Pure Unit 2 (MFP2) Textbook. Version: 1.4

GCE Further Mathematics (6360) Further Pure Unit 2 (MFP2) Textbook. Version: 1.4 GCE Further Mathematics (660) Further Pure Uit (MFP) Tetbook Versio: 4 MFP Tetbook A-level Further Mathematics 660 Further Pure : Cotets Chapter : Comple umbers 4 Itroductio 5 The geeral comple umber 5

More information

Your organization has a Class B IP address of 166.144.0.0 Before you implement subnetting, the Network ID and Host ID are divided as follows:

Your organization has a Class B IP address of 166.144.0.0 Before you implement subnetting, the Network ID and Host ID are divided as follows: Subettig Subettig is used to subdivide a sigle class of etwork i to multiple smaller etworks. Example: Your orgaizatio has a Class B IP address of 166.144.0.0 Before you implemet subettig, the Network

More information

HOSPITAL NURSE STAFFING SURVEY

HOSPITAL NURSE STAFFING SURVEY 2012 Ceter for Nursig Workforce St udies HOSPITAL NURSE STAFFING SURVEY Vacacy ad Turover Itroductio The Hospital Nurse Staffig Survey (HNSS) assesses the size ad effects of the ursig shortage i hospitals,

More information

AP Calculus BC 2003 Scoring Guidelines Form B

AP Calculus BC 2003 Scoring Guidelines Form B AP Calculus BC Scorig Guidelies Form B The materials icluded i these files are iteded for use by AP teachers for course ad exam preparatio; permissio for ay other use must be sought from the Advaced Placemet

More information

Discrete Mathematics and Probability Theory Spring 2014 Anant Sahai Note 13

Discrete Mathematics and Probability Theory Spring 2014 Anant Sahai Note 13 EECS 70 Discrete Mathematics ad Probability Theory Sprig 2014 Aat Sahai Note 13 Itroductio At this poit, we have see eough examples that it is worth just takig stock of our model of probability ad may

More information

Predictive Modeling Data. in the ACT Electronic Student Record

Predictive Modeling Data. in the ACT Electronic Student Record Predictive Modelig Data i the ACT Electroic Studet Record overview Predictive Modelig Data Added to the ACT Electroic Studet Record With the release of studet records i September 2012, predictive modelig

More information

LECTURE 13: Cross-validation

LECTURE 13: Cross-validation LECTURE 3: Cross-validatio Resampli methods Cross Validatio Bootstrap Bias ad variace estimatio with the Bootstrap Three-way data partitioi Itroductio to Patter Aalysis Ricardo Gutierrez-Osua Texas A&M

More information

Mathematical goals. Starting points. Materials required. Time needed

Mathematical goals. Starting points. Materials required. Time needed Level A1 of challege: C A1 Mathematical goals Startig poits Materials required Time eeded Iterpretig algebraic expressios To help learers to: traslate betwee words, symbols, tables, ad area represetatios

More information

How To Solve The Homewor Problem Beautifully

How To Solve The Homewor Problem Beautifully Egieerig 33 eautiful Homewor et 3 of 7 Kuszmar roblem.5.5 large departmet store sells sport shirts i three sizes small, medium, ad large, three patters plaid, prit, ad stripe, ad two sleeve legths log

More information