THE PROBABLE ERROR OF A MEAN. Introduction



Similar documents
I. Chi-squared Distributions

Hypothesis testing. Null and alternative hypotheses

.04. This means $1000 is multiplied by 1.02 five times, once for each of the remaining sixmonth

Confidence Intervals for One Mean

1. C. The formula for the confidence interval for a population mean is: x t, which was

5: Introduction to Estimation

The following example will help us understand The Sampling Distribution of the Mean. C1 C2 C3 C4 C5 50 miles 84 miles 38 miles 120 miles 48 miles

PSYCHOLOGICAL STATISTICS

GCSE STATISTICS. 4) How to calculate the range: The difference between the biggest number and the smallest number.

Department of Computer Science, University of Otago

CHAPTER 7: Central Limit Theorem: CLT for Averages (Means)

Chapter 7 Methods of Finding Estimators

1 Correlation and Regression Analysis

Case Study. Normal and t Distributions. Density Plot. Normal Distributions

Z-TEST / Z-STATISTIC: used to test hypotheses about. µ when the population standard deviation is unknown

1 Computing the Standard Deviation of Sample Means

Center, Spread, and Shape in Inference: Claims, Caveats, and Insights

Lesson 17 Pearson s Correlation Coefficient

Analyzing Longitudinal Data from Complex Surveys Using SUDAAN

Confidence Intervals. CI for a population mean (σ is known and n > 30 or the variable is normally distributed in the.

University of California, Los Angeles Department of Statistics. Distributions related to the normal distribution

A Mathematical Perspective on Gambling

Basic Elements of Arithmetic Sequences and Series

Determining the sample size

In nite Sequences. Dr. Philippe B. Laval Kennesaw State University. October 9, 2008

Sequences and Series

Week 3 Conditional probabilities, Bayes formula, WEEK 3 page 1 Expected value of a random variable

Measures of Spread and Boxplots Discrete Math, Section 9.4

CHAPTER 3 DIGITAL CODING OF SIGNALS

One-sample test of proportions

Hypergeometric Distributions

BINOMIAL EXPANSIONS In this section. Some Examples. Obtaining the Coefficients

MEI Structured Mathematics. Module Summary Sheets. Statistics 2 (Version B: reference to new book)

Inference on Proportion. Chapter 8 Tests of Statistical Hypotheses. Sampling Distribution of Sample Proportion. Confidence Interval

Trigonometric Form of a Complex Number. The Complex Plane. axis. ( 2, 1) or 2 i FIGURE The absolute value of the complex number z a bi is

Statistical inference: example 1. Inferential Statistics

A probabilistic proof of a binomial identity

SECTION 1.5 : SUMMATION NOTATION + WORK WITH SEQUENCES

Lecture 4: Cauchy sequences, Bolzano-Weierstrass, and the Squeeze theorem

Chapter 6: Variance, the law of large numbers and the Monte-Carlo method

Here are a couple of warnings to my students who may be here to get a copy of what happened on a day that you missed.

Output Analysis (2, Chapters 10 &11 Law)

Incremental calculation of weighted mean and variance

Overview. Learning Objectives. Point Estimate. Estimation. Estimating the Value of a Parameter Using Confidence Intervals


Ekkehart Schlicht: Economic Surplus and Derived Demand

5.4 Amortization. Question 1: How do you find the present value of an annuity? Question 2: How is a loan amortized?

Soving Recurrence Relations

Normal Distribution.

NATIONAL SENIOR CERTIFICATE GRADE 12

The analysis of the Cournot oligopoly model considering the subjective motive in the strategy selection

THE REGRESSION MODEL IN MATRIX FORM. For simple linear regression, meaning one predictor, the model is. for i = 1, 2, 3,, n

CS103A Handout 23 Winter 2002 February 22, 2002 Solving Recurrence Relations

SEQUENCES AND SERIES

Section 11.3: The Integral Test

3. Greatest Common Divisor - Least Common Multiple

Definition. A variable X that takes on values X 1, X 2, X 3,...X k with respective frequencies f 1, f 2, f 3,...f k has mean

Chapter 7: Confidence Interval and Sample Size

Biology 171L Environment and Ecology Lab Lab 2: Descriptive Statistics, Presenting Data and Graphing Relationships

Chapter 7 - Sampling Distributions. 1 Introduction. What is statistics? It consist of three major areas:

MEP Pupil Text 9. The mean, median and mode are three different ways of describing the average.

Asymptotic Growth of Functions

Mann-Whitney U 2 Sample Test (a.k.a. Wilcoxon Rank Sum Test)

Example 2 Find the square root of 0. The only square root of 0 is 0 (since 0 is not positive or negative, so those choices don t exist here).

Research Method (I) --Knowledge on Sampling (Simple Random Sampling)

WHEN IS THE (CO)SINE OF A RATIONAL ANGLE EQUAL TO A RATIONAL NUMBER?

Approximating Area under a curve with rectangles. To find the area under a curve we approximate the area using rectangles and then use limits to find

2-3 The Remainder and Factor Theorems

COMPARISON OF THE EFFICIENCY OF S-CONTROL CHART AND EWMA-S 2 CONTROL CHART FOR THE CHANGES IN A PROCESS

Lesson 15 ANOVA (analysis of variance)

Repeating Decimals are decimal numbers that have number(s) after the decimal point that repeat in a pattern.

Confidence Intervals

INVESTMENT PERFORMANCE COUNCIL (IPC)

How to read A Mutual Fund shareholder report

Properties of MLE: consistency, asymptotic normality. Fisher information.

Descriptive Statistics

hp calculators HP 12C Statistics - average and standard deviation Average and standard deviation concepts HP12C average and standard deviation

Modified Line Search Method for Global Optimization

Quadrat Sampling in Population Ecology

The Stable Marriage Problem

, a Wishart distribution with n -1 degrees of freedom and scale matrix.

Maximum Likelihood Estimators.

UC Berkeley Department of Electrical Engineering and Computer Science. EE 126: Probablity and Random Processes. Solutions 9 Spring 2006

STA 2023 Practice Questions Exam 2 Chapter 7- sec 9.2. Case parameter estimator standard error Estimate of standard error

Escola Federal de Engenharia de Itajubá

Building Blocks Problem Related to Harmonic Series

Convexity, Inequalities, and Norms

Project Deliverables. CS 361, Lecture 28. Outline. Project Deliverables. Administrative. Project Comments

THE ROLE OF EXPORTS IN ECONOMIC GROWTH WITH REFERENCE TO ETHIOPIAN COUNTRY

GCE Further Mathematics (6360) Further Pure Unit 2 (MFP2) Textbook. Version: 1.4

Your organization has a Class B IP address of Before you implement subnetting, the Network ID and Host ID are divided as follows:

AP Calculus BC 2003 Scoring Guidelines Form B

Discrete Mathematics and Probability Theory Spring 2014 Anant Sahai Note 13

Predictive Modeling Data. in the ACT Electronic Student Record

LECTURE 13: Cross-validation

How To Solve The Homewor Problem Beautifully

Transcription:

THE PROBABLE ERROR OF A MEAN By STUDENT Itroductio Ay experimet may he regarded as formig a idividual of a populatio of experimets which might he performed uder the same coditios. A series of experimets is a sample draw from this populatio. Now ay series of experimets is oly of value i so far as it eables us to form a judgmet as to the statistical costats of the populatio to which the experimets belog. I a greater umber of cases the questio fially turs o the value of a mea, either directly, or as the mea differece betwee the two quatities. If the umber of experimets be very large, we may have precise iformatio as to the value of the mea, but if our sample be small, we have two sources of ucertaity: (1) owig to the error of radom samplig the mea of our series of experimets deviates more or less widely from the mea of the populatio, ad () the sample is ot sufficietly large to determie what is the law of distributio of idividuals. It is usual, however, to assume a ormal distributio, because, i a very large umber of cases, this gives a approximatio so close that a small sample will give o real iformatio as to the maer i which the populatio deviates from ormality: sice some law of distributio must he assumed it is better to work with a curve whose area ad ordiates are tabled, ad whose properties are well kow. This assumptio is accordigly made i the preset paper, so that its coclusios are ot strictly applicable to populatios kow ot to be ormally distributed; yet it appears probable that the deviatio from ormality must be very extreme to load to serious error. We are cocered here solely with the first of these two sources of ucertaity. The usual method of determiig the probability that the mea of the populatio lies withi a give distace of the mea of the sample is to assume a ormal distributio about the mea of the sample with a stadard deviatio equal to s/, where s is the stadard deviatio of the sample, ad to use the tables of the probability itegral. But, as we decrease the umber of experimets, the value of the stadard deviatio foud from the sample of experimets becomes itself subject to a icreasig error, util judgmets reached i this way may become altogether misleadig. I routie work there are two ways of dealig with this difficulty: (1) a experimet may he repeated may times, util such a log series is obtaied that the stadard deviatio is determied oce ad for all with sufficiet accuracy. This value ca the he used for subsequet shorter series of similar experimets. () Where experimets are doe i duplicate i the atural course of the work, the mea square of the differece betwee correspodig pairs is equal to the stadard deviatio of the populatio multiplied by. We call thus combie 1

together several series of experimets for the purpose of determiig the stadard deviatio. Owig however to secular chage, the value obtaied is early always too low, successive experimets beig positively correlated. There are other experimets, however, which caot easily be repeated very ofte; i such cases it is sometimes ecessary to judge of the certaity of the results from a very small sample, which itself affords the oly idicatio of the variability. Some chemical, may biological, ad most agricultural ad largescale experimets belog to this class, which has hitherto bee almost outside the rage of statistical iquiry. Agai, although it is well kow that the method of usig the ormal curve is oly trustworthy whe the sample is large, o oe has yet told us very clearly where the limit betwee large ad small samples is to be draw. The aim of the preset paper is to determie the poit at which we may use the tables of the probability itegral i judgig of the sigificace of the mea of a series of experimets, ad to furish alterative tables for use whe the umber of experimets is too few. The paper is divided ito the followig ie sectios: I. The equatio is determied of the curve which represets the frequecy distributio of stadard deviatios of samples draw from a ormal populatio. II. There is show to be o kid of correlatio betwee the mea ad the stadard deviatio of such a sample. III. The equatio is determied of the curve represetig the frequecy distributio of a quatity z, which is obtaied by dividig the distace betwee the mea of a sample ad the mea of the populatio by the stadard deviatio of the sample. IV. The curve foud i I is discussed. V. The curve foud i III is discussed. VI. The two curves are compared with some actual distributios. VII. Tables of the curves foud i III are give for samples of differet size. VIII ad IX. The tables are explaied ad some istaces are give of their use. X. Coclusios. Sectio 1 Samples of idividuals are draw out of a populatio distributed ormally, to fid a equatio which shall represet the frequecy of the stadard deviatios of these samples. If s be the stadard deviatio foud from a sample x 1 x...x (all these beig measured from the mea of the populatio), the s = S(x 1) ( ) S(x1 ) = S(x 1) S(x 1) S(x 1x ).

Summig for all samples ad dividig by the umber of samples we get the moa value of s, which we will write s : s = µ µ = µ ( 1), where µ is the secod momet coefficiet i the origial ormal distributio of x: sice x 1, x, etc. are ot correlated ad the distributio is ormal, products ivolvig odd powers of x 1 vaish o summig, so that S(x1x) is equal to 0. If M R represet the Rth momet coefficiet of the distributio of s about the ed of the rage where s = 0, Agai M 1 ( 1) = µ. { ( )} S(x s 4 = 1 ) S(x1 ) ( ) S(x ( ) ( = 1 ) S(x 1) S(x1 ) S(x1 ) + = S(x4 1) + S(x 1x ) S(X4 1) 3 4S(x 1x ) 3 + S(x4 1) 4 + 6S(x 1x ) 4 +other terms ivolvig odd powers of x 1, etc. which will vaish o summatio. Now S(x 4 1) has terms, buts(x 1x ) has 1 ( 1), hece summig for all samples ad dividig by the umber of samples, we get M = µ 4 ( 1) +µ µ 4 ( 1) µ + µ 4 ( 1) 3 +3µ 3 ) 4 = µ 4 3 { +1}+ µ 3( 1){ +3}. Now sice the distributio of x is ormal, µ 4 = 3µ, hece M = µ ( 1) 3 {3 3+ +3} = µ ( 1)(+1). I a similar tedious way I fid ad M 3 = µ 3 ( 1)(+1)(+3) 3 M 4 = µ 4 ( 1)(+1)(+3)(+5) 4. 3

The law of formatio of these momet coefficiets appears to be a simple oe, but I have ot see my way to a geeral proof. If ow M R be the Rth momet coefficiet of s about its mea, we have M = µ ( 1) { ( 1)(+1)(+3) M 3 = µ 3 3 {(+1) ( 1)} = µ 3 ( 1). 3( 1). (( 1) ( 1)3 3 = µ 3 ( 1) 3 { +4+3 6+6 + 1} = 8µ 3 ( 1) 3, M 4 = µ4 4 { ( 1)(+1)(+3)(+5) 3( 1) 1( 1) 3 ( 1) 4} = µ4 ( 1) 4 { 3 +9 +3+15 3+3 1 +4 1 3 +3 3+1} = 1µ4 ( 1)(+3) 4. Hece β 1 = M 3 M 3 = 8 1, β = M 4 M = 3(+3) 1), β 3β 1 6 = 1 {6(+3) 4 6( 1)} = 0. 1 Cosequetly a curve of Prof. Pearso s Type III may he expected to fit the distributio of s. The equatio referred to a origi at the zero ed of the curve will be where ad y = Cx p e γx, γ = M M 3 = 4µ ( 1) 3 8 µ ( 1) = µ p = 4 β 1 1 = 1 Cosequetly the equatio becomes y = Cx 3 e x µ, 1 = 3. which will give the distributio of s. The area of this curve is C x 3 e x µ dx = I (say). The first momet 0 coefficiet about the ed of the rage will therefore be C [ ] x 1 e x µ dx C µ x 1 e x x= µ 0 x=0 = + C 1 0 µ x 3 e x µ dx. I I I 4 }

The first part vaishes at each limit ad the secod is equal to 1 µ I I = 1 µ. ad we see that the higher momet coefficiets will he formed by multiplyig successively by +1 µ, +3 µ etc., just as appeared to he the law of formatio of M, M 3, M 4, etc. Hece it is probable that the curve foud represets the theoretical distributio of s ; so that although we have o actual proof we shall assume it to do so i what follows. The distributio of s may he foud from this, sice the frequecy of s is equal to that of s ad all that we must do is to compress the base lie suitably. Now if y 1 = φ(s ) be the frequecy curve of s ad y = ψ(s) be the frequecy curve of s, the y 1 d(s ) = y ds, y ds = y 1 sds, y = sy 1. Hece is the distributio of s. This reduces to y = Cs(s ) 3 e s µ. y = Cs e s σ. Hece y = Ax e s µ will give the frequecy distributio of stadard deviatios of samples of, take out of a populatio distributed ormally with stadard deviatio σ. The costat A may he foud by equatig the area of the curve as follows: The Area = A 0 I p = σ = σ x e x σ dx. ( Let I p represet ( e x σ )dx x p 1 d 0 dx ] x= [ x p 1 e x σ = σ (p 1)I p, x=0 sice the first part vaishes at both limits. + σ (p 1) 0 0 ) x p e x σ dx. x p e x σ dx 5

or By cotiuig this process we fid ( σ I = ( σ I = accordig is eve or odd. But I 0 is ad I 1 is Hece if be eve, while is be odd or 0 0 ) ) e x σ dx = xe x sigma dx = [ σ ( 3)( 5)...3.1I 0 ( 3)( 5)...4.I 1 ( π ) σ, ] e x x= σ x=0 = σ. Area A = (π )( ( 3)( 5)...3.1 σ ), 1 A = Area ( 3)( 5)...4. ( σ ) 1 Hece the equatio may be writte ( ) N ( ) 1 y = ( 3)( 5)...3.1 π σ x e x σ ( eve) y = N ( ) 1 ( 3)( 5)...4. σ x e x σ ( odd) where N as usual represets the total frequecy. Sectio II To show that there is o correlatio betwee (a) the distace of the mea of a sample from the mea of the populatio ad (b) the stadard deviatio of a sample with ormal distributio. (1) Clearly positive ad egative positios of the mea of the sample are equally likely, ad hece there caot be correlatio betwee the absolute value of the distace of the mea from the mea of the populatio ad the stadard. 6

deviatio, but () there might be correlatio betwee the square of the distace ad the square of the stadard deviatio. Let ( ) u S(x1 ) = ad s = S(x 1) ( ) S(x1 ). The if m 1, M 1 be the mea values of u ad s z, we have by the precedig part M 1 = µ ( 1) ad m 1 = µ. Now ( ) ( ) 4 u s = S(x 1) S(x1 ) S(x1 ) ( ) S(x = 1 ) + S(x 1x ).S(x 1) 3 S(x4 1) 4 6S(x 1x ) 4 other terms of odd order which will vaish o summatio. Summig for all values ad dividig by the umber of cases we get R u s σ u σ s +m 1 M 1 = µ 4 +µ ( 1) µ 4 3µ ( 1) 3 3, where R u s is the correlatio betwee u ad s. R u s σ u σ s +µ ( 1) = µ ( 1) 3 {3+ 3} = µ ( 1). Hece R u s σ u σ s = 0, or there is o correlatio betwee u ad s. Sectio III To fid the equatio represetig the frequecy distributio of the meas of samples of draw from a ormal populatio, the mea beig expressed i terms of the stadard deviatio of the sample. We have y = C σ s e x 1 σ as the equatio represetig the distributio of s, the stadard deviatio of a sample of, whe the samples are draw from a ormal populatio with stadard deviatio s. Now the meas of these samples of are distributed accordig to the equatio 1 ()N y = e x σ, (π)σ ad we have show that there is o correlatio betwee x, the distace of the mea of the sample, ad s, the stadard deviatio of the sample. 1 Airy, Theory of Errors of Observatios, Part II, 6. 7

Now let us suppose x measured i terms of s, i.e. let us fid the distributio of z = x/s. If we have y 1 = φ(x) ad y = ψ(z) as the equatios represetig the frequecy of x ad of z respectively, the y 1 dx = y dz = y 3 dx s, y = sy 1. Hece y = N ()s e s z σ (π)σ is the equatio represetig the distributio of z for samples of with stadard deviatio s. Now the chace that s lies betwee s ad s+ds is s+ds s 0 C σ 1 s e s σ ds C σ 1 s e s σ ds which represets the N i the above equatio. Hece the distributio of z due to values of s which lie betwee s ad s+ds is y = s+ds s C σ ( π 0 ) s 1 e s (1+z ) σ ds C σ 1 s e s σ ds = ( π ) s+ds s 0 C σ s 1 (1+z ) e s σ ds C σ s e s σ ds ad summig for all values of s we have as a equatio givig the distributio of z ( ) s+ds C π s σ s 1 (1+z ) e s y = σ ds. σ C σ s e s σ ds By what we have already proved this reduces to ad to 0 y = 1 3. 4 5... 5 4.3 (1+z ) 1, if be odd y = 1 3. 4 5... 4 3. 1 (1+z ) 1, if be eve Sice this equatio is idepedet of σ it will give the distributio of the distace of the mea of a sample from the mea of the populatio expressed i terms of the stadard deviatio of the sample for ay ormal populatio. 8

Sectio IV. Some Properties of the Stadard Deviatio Frequecy Curve By a similar method to that adopted for fidig the costat we may fid the mea ad momets: thus the mea is at I 1 /I, which is equal to 3. 4 5... 1 ( π ) σ, if be eve, or 3. 4 5... 3 (π ) σ, if be odd. The secod momet about the ed of the rage is I = ( 1)σ. I The third momet about the ed of the rage is equal to I +1 I = I +1 I 1. I 1 I = σ the mea. The fourth momet about the ed of the rage is equal to I + = ( 1)(+1) I σ 4. If we write the distace of the mea from the ed of the rage Dσ/ ad the momets about the ed of the rage ν 1, ν, etc., the ν 1 = Dσ, ν = 1 σ, ν 3 = Dσ3, ν 4 = N 1 σ 4. From this we get the momets about the mea: µ = σ ( 1 D ), µ 3 = σ3 {D 3( 1)D +D } = σ3 D {D +3}, µ 4 = σ { 1 4D +6( 1)D 3D 4 } = σ4 { 1 D (3D +6)}. It is of iterest to fid out what these become whe is large. 9

is I order to do this we must fid out what is the value of D. Now Wallis s expressio for π derived from the ifiite product value of six π (+1) =.4.6...() 1 3 5...( 1). If we assume a quatity θ ( = a 0 + a1 +etc.) which we may add to the +1 i order to make the expressio approximate more rapidly to the truth, it is easy to show that θ = 1 + 1 16 ( π + 1 + 1 16 etc., ad we get ) =.4.6...() 1 3 5...( 1). From this we fid that whether be eve or odd D approximates to 3 + 1 8 whe is large. Substitutig this value of D we get ( µ = σ 1 1 ) (1, µ = σ3 3 + ) 1 16 4 4, µ 4 = 3σ ( 4 4 1+ 1 1 ) 16. Cosequetly the ( value of the stadard ) deviatio of a stadard deviatio σ which we have foud becomes the same as that foud for () {1 (1/4)} the ormal curve by Prof. Pearso {σ/()} whe is large eough to eglect the 1/4 i compariso with 1. Neglectig terms of lower order tha 1/, we fid β 1 = 3 (4 3), β) = 3 ( 1 1 )( 1+ 1 Cosequetly, as icreases, β very soo approaches the value 3 of the ormal curve, but β 1 vaishes more slowly, so that the curve remais slightly skew. Diagram I shows the theoretical distributio of the stadard deviatios foud from samples of 10. Sectio V. Some Properties of the Curve y = 3. 4 5... ( 4 3. π 5 4.3.1 if be eve if be odd ) ). (1+z ) 1 Writig z = taθ the equatio becomes y = 3. 4 5...etc. cos θ, which affords a easy way of drawig the curve. Also dz = dθ/cos θ. This expressio will be foud to give a much closer approximatio to π tha Wallis s 10

Hece to fid the area of the curve betwee ay limits we must fid 3. 4 5...etc. cos θdθ = { [ 3 cos 3. 4 5...etc. cos 4 3 ]} θsiθ θdθ + = 3. 4 5...etc. cos 4 θdθ + 1 4 3 5...etc.[cos 3 θsiθ], ad by cotiuig the process the itegral may he evaluated. For example, if we wish to fid the area betwee 0 ad θ for = 8 we have Area = 6 5.4 3. 1.1 π = 4 3. π θ 0 θ 0 cos 6 θdθ cos 4 θdθ + 1 5.4 3. π cos5 θsiθ = θ π + 1 π cosθsiθ + 1 3. π cos3 θsiθ + 1 5.4 3. π cos5 θsiθ ad it will be oticed that for = 10 we shall merely have to add to this same expressio the term 1 7.6 5.4 3. π cos7 θsiθ. 11

The tables at the ed of the paper give the area betwee ad z ( or θ = π ) ad θ = ta 1 z. This is the same as 0.5+the area betwee θ = 0, ad θ = ta 1 z, ad as the whole area of the curve is equal to 1, the tables give the probability that the mea of the sample does ot differ by more tha z times the stadard deviatio of the sample from the mea of the populatio. The whole area of the curve is equal to + 1 3. 4 5...etc. π cos θdθ 1 π ad sice all the parts betwee the limits vaish at both limits this reduces to 1. Similarly, the secod momet coefficiet is equal to + 1 3. 4 5...etc. π cos θta θdθ 1 π = 3. 4 5...etc. = 3 1 = 1 3. + 1 π 1 π (cos 4 θ cos θ)dθ Hece the stadard deviatio of the curve is 1/ ( 3). The fourth momet coefficiet is equal to + 1 3. 4 5...etc. π cos θta 4 θdθ 1 π = 3. 4 5...etc. + 1 π 1 π (cos 6 θ cos 4 θ +cos θ)dθ = 3. 4 5 ( ) 3 +1 = 3 ( 3)( 5). The odd momets are of course zero, a the curve is symmetrical, so β 1 = 0, β = 3( 3) 5 = 3+ 6 5. Hece as it icreases the curve approaches the ormal curve whose stadard deviatio is 1/ ( 3). β, however, is always greater tha 3, idicatig that large deviatios are mere commo tha i the ormal curve. I have tabled the area for the ormal curve with stadard deviatio 1/ 7 so as to compare, with my curve for = 10 3. It will be see that odds laid 3 See p. 9 1

accordig to either table would ot seriously differ till we reach z = 0.8, where the odds are about 50 to 1 that the mea is withi that limit: beyod that the ormal curve gives a false feelig of security, for example, accordig to the ormal curve it is 99,986 to 14 (say 7000 to 1) that the mea of the populatio lies betwee ad +1.3s, whereas the real odds are oly 99,819 to 181 (about 550 to 1). Now 50 to 1 correspods to three times the probable error i the ormal curve ad for most purposes it would be cosidered sigificat; for this reaso I have oly tabled my curves for values of ot greater tha 10, but have give the = 9 ad = 10 tables to oe further place of decimals. They ca he used as foudatios for fidig values for larger samples. 4 The table for = ca be readily costructed by lookig out θ = ta 1 z i Chambers s tables ad the 0.5 + θ/π gives the correspodig value. Similarly 1 siθ +0.5 gives the values whe = 3. There are two poits of iterest i the = curve. Here s is equal to half the distace betwee the two observatios, ta 1 s s = π 4, so that betwee +s ad z lies π 4 1 π or half the probability, i.e. if two observatios have bee made ad we have o other iformatio, it is a eve chace that the mea of the (ormal) populatio will lie betwee them. O the other had the secod 4 E.g. if = 11, to the correspodig value for = 9, we add 7 8 5 6 3 4 1 1 cos8 θsiθ: 9 if = 13 we add as well 10 7 8 5 6 3 4 1 1 cos10 θsiθ, ad so o. 13

momet coefficiet is 1 π + 1 π = 1 π ta θdθ = 1 π [taθ π= θ]+1 =, = 1 π or the stadard deviatio is ifiite while the probable error is fiite. Sectio VI. Practical Test of the foregoig Equatios Before I bad succeeded i solvig my problem aalytically, I had edeavoured to do so empirically. The material used was a correlatio table cotaiig the height ad left middle figer measuremets of 3000 crimials, from a paper by W. R. Macdoell (Biometrika, i, p. 19). The measuremets were writte out o 3000 pieces of cardboard, which were the very thoroughly shuffled ad draw at radom. As each card was draw its umbers were writte dow i a book, which thus cotais the measuremets of 3000 crimials i a radom order. Fially, each cosecutive set of 4 was take as a sample 750 i all ad the mea, stadard deviatio, ad correlatio 5 of each sample determied. The differece betwee the mea of each sample ad the mea of the populatio was the divided by the stadard deviatio of the sample, givig us the z of Sectio III. This provides us with two sets of 750 stadard deviatios ad two sets of 750 z s o which to test the theoretical results arrived at. The height ad left middle figer correlatio table was chose because the distributio of both was approximately ormal ad the correlatio was fairly high. Both frequecy curves, however, deviate slightly from ormality, the costats beig for height β 1 = 0.006, β = 3.176, ad for left middle figer legths β 1 = 0.0030, β = 3.140, ad i cosequece there is a tedecy for a certai umber of larger stadard deviatios to occur tha if the distributios wore ormal. This, however, appears to make very little differece to the distributio of z. Aother thig which iterferes with the compariso is the comparatively large groups i which the observatios occur. The heights are arraged i 1 ich groups, the stadard deviatio beig oly.54 iches. while, the figer legths wore origially grouped i millimetres, but ufortuately I did ot at the time see the importace of havig a smaller uit ad codesed them ito millimetre groups, i terms of which the stadard deviatio is.74. Several curious results follow from takig samples of 4 from material disposed i such wide groups. The followig poits may be oticed: (1) The meas oly occur as multiples of 0.5. () The stadard deviatios occur as the square roots of the followig types of umbers:, +0.10, +0.5, +0.50, +0.69, +0.75. (3) A stadard deviatio belogig to oe of these groups ca oly be associated with a mea of a particular kid; thus a stadard deviatio of ca 5 I hope to publish the results of the correlatio work shortly. 14

oly occur if the mea differs by a whole umber from the group we take as origi, while 1.69 will oly occur whe the mea is at ±0.5. (4) All the four idividuals of the sample will occasioally come from the same group, givig a zero value for the stadard deviatio. Now this leads to a ifiite value of z ad is clearly due to too wide a groupig, for although two me may have the same height whe measured by iches, yet the fier the measuremets the more seldom will they he idetical, till fially the chace that four me will have exactly the same height is ifiitely small. If we had smaller groupig the zero values of the stadard deviatio might be expected to icrease, ad a similar cosideratio will show that the smaller values of the stadard deviatio would also be likely to icrease, such as 0.436, whe 3 fall i oe group ad 1 i a adjacet group, or 0.50 whe fall i two adjacet groups. O the other had, whe the idividuals of the sample lie far apart, the argumet of Sheppard s correctio will apply, the real value of the stadard deviatio beig more likely to he smaller tha that foud owig to the frequecy i ay group beig greater o the side earer the mode. These two effects of groupig will ted to eutralize the effect o the mea value of the stadard deviatio, but both will icrease the variability. Accordigly, we fid that the mea value of the stadard deviatio is quite close to that calculated, while i each case the variability is sesibly greater. The fit of the curve is ot good, both for this reaso ad because the frequecy is ot evely distributed owig to effects () ad (3) of groupig. O the other had, the fit of the curve givig the frequecy of z is very good, ad as that is the oly practical poit the compariso may he cosidered satisfactory. The followig are the figures for height: Mea value of stadard deviatios: Calculated.07 ± 0.0 Observed.06 Differece = 0.001 Stadard deviatio of stadard deviatios: Calculated 0.8558 ± 0.015 Observed 0.9066 Differece +0.0510 Compariso of Fit. Theoretical Equatio: y = 16 750 (π)σ x e x σ Scale i terms of stadard deviatios of populatio Calculated frequecy 1 1 10 1 7 45 1 64 1 78 1 87 88 81 1 71 58 45 33 3 15 91 5 1 7 Observed frequecy 3 14 1 41 371 107 67 73 77 77 1 64 1 49 1 35 8 11 9 11 1 7 Differece +1 1 +4 1 8 +41 111 14 11 4 7 51 +41 + +5 1 1 +6 0 Whece χ = 48.06, P = 0.00006 (about). I tablig the observed frequecy, values betwee 0.015 ad 0.0875 were icluded i oe group, while betwee 0.0875 ad 0.01.5 they were divided over thetwogroups. AsaistaceoftheirregularityduetogroupigImaymetio 15

that there were 31 cases of stadard deviatios 1.30 (i terms of the groupig) which is 0.5117 i terms of the stadard deviatio of the populatio, ad they wore therefore divided over the groups 0.4 to 0.5 ad 0.5 to 0.6. Had they all bee couted i groups 0.5 to 0.6 χ would have falle to 0.85 ad P would have rise to 0.03. The χ test presupposes radom samplig from a frequecy followig the give law, but this we have ot got owig to the iterferece of the groupig. Whe, however, we test the z s where the groupig has ot had so much effect, we fid a close correspodece betwee the theory ad the actual result. There were three cases of ifiite values of z which, for the reasos give above, were give the ext largest values which occurred, amely +6 or 6. The rest were divided ito groups of 0.1; 0.04, 0.05 ad 0.06, beig divided betwee the two groups o either side. The calculated value for the stadard deviatio of the frequecy curve was 1 (±0.0171), while the observed was 1.030. The value of the stadard deviatio is really ifiite, as the fourth momet coefficiet is ifiite, but as we have arbitrarily limited the ifiite cases we may take as a approximatio 1/ 1500 from which the value of the probable error give above is obtaied. The fit of the curve is as follows: Compariso of Fit. Theoretical Equatio: y = N π cos4 θ, z = taθ Scale of z Calculated frequecy 5 9 1 131 341 441 781 119 141 781 44 1 341 131 131 9 1 5 Observed frequecy 9 14 1 111 33 431 701 1191 1511 1 67 1 49 6 1 16 10 6 Differece +4 +4 1 1 1 8 +1 +10 1 +3 11 +41 8 +1 +1 Whece χ = 1.44, P = 0.56. This is very satisfactory, especially whe we cosider that as a rule observatios are tested agaist curves fitted from the mea ad oe or more other momets of the observatios, so that cosiderable correspodece is oly to ])c expected; while this curve is exposed to the full errors of radom samplig, its costats havig bee calculated quite apart from the observatios. The left middle figer samples show much the same features as those of the height, but as the groupig is ot so large compared to the variability the curves fit the observatios more closely. Diagrams III 6 ad IV give the stadard deviatios of the z s for the set of samples. The results are as follows: 6 There are three small mistakes i plottig the observed values i Diagram III, which make the fit appear worse tha it really is 16

Mea value of stadard deviatios: Calculated.186 ± 0.03 Observed.179 Differece = 0.007 Stadard deviatio of stadard deviatios: Calculated 0.94 ± 0.016 Observed 0.980 Differece = +0.0578 Compariso of Fit. Theoretical Equatio: y = 16 750 (π)σ x e x σ Scale i terms of stadard deviatios of populatio 1 1 101 7 45 1 641 78 1 87 88 81 1 71 58 45 33 3 15 91 5 1 7 Calculated frequecy 14 7 1 51 64 1 91 94 1 68 1 65 1 73 481 401 41 0 1 1 5 7 1 Observed frequecy + 1 +31 +1 +51 +11 +71 191 16 + 91 41 +91 3 +71 +1 1 +1 Whece χ = 1.80, P = 0.19. Value of stadard deviatio: Calculated 1(±0.017) Observed 0.98 Differece = 0.018 Compariso of Fit. Theoretical Equatio: y = N π cos4 θ, z = taθ Scale of z Calculated frequecy 5 9 1 13 1 341 441 781 119 141 119 781 441 341 131 91 5 Observed frequecy 4 15 1 18 33 1 44 75 1 138 101 71 46 1 36 11 9 6 Differece 1 +6 +4 1 1 1 31 +3 3 +11 71 + +11 1 1 +1 Whece χ = 7.39, P = 0.9. A very close fit. We see the that if the distributio is approximately ormal our theory gives us a satisfactory measure of the certaity to be derived from a small sample i both the cases we have tested; but we have a idicatio that a fie groupig is of advatage. If the distributio is ot ormal, the mea ad the stadard deviatio of a sample will be positively correlated, so although both will have greater variability, yet they will ted to couteract oe aother, a mea derivig largely from the geeral mea tedig to be divided by a larger stadard deviatio. Cosequetly, I believe that the table give i Sectio VII below may be used i estimatig the degree of certaity arrived at by the mea of a few experimets, i the case of most laboratory or biological work where the distributios are as a rule of a cocked hat type ad so sufficietly early ormal 17

18

3. 4 Sectio VII. Tables of ( 3 5....1 odd ) ta 1 z 1.1 π eve 1 π cos θdθ for values of from 4 to 10 iclusive Together with (π) 7 x e 7x dx for compariso whe = 10 z ( = x s) = 4 = 5 = 6 = 7 = 8 = 9 = 10 ( For compariso 7 ) x (π) e 7x dx 0.1 0.5633 0.5745 0.5841 0.598 0.6006 0.60787 0.6146 0.60411 0. 0.641 0.6458 0.6634 0.6798 0.6936 0.70705 0.71846 0.70159 0.3 0.6804 0.7096 0.7340 0.7549 0.7733 0.78961 0.8043 0.78641 0.4 0.7309 0.7657 0.7939 0.8175 0.8376 0.85465 0.86970 0.8550 0.5 0.7749 0.8131 0.848 0.8667 0.8863 0.9051 0.91609 0.90691 0.6 0.815 0.8518 0.8813 0.9040 0.918 0.93600 0.9473 0.94375 0.7 0.8440 0.8830 0.9109 0.9314 0.9468 0.95851 0.96747 0.96799 0.8 0.8701 0.9076 0.933 0.951 0.9640 0.9738 0.98007 0.9853 0.9 0.8915 0.969 0.9498 0.965 0.9756 0.9879 0.98780 0.99137 1.0 0.909 0.9419 0.96 0.9751 0.9834 0.98890 0.995 0.9980 1.1 0.936 0.9537 0.9714 0.981 0.9887 0.9980 0.99539 0.9996 1. 0.9354 0.968 0.978 0.9870 0.99 0.9958 0.99713 0.99971 1.3 0.9451 0.9700 0.983 0.9905 0.9946 0.99688 0.99819 0.99986 1.4 0.9451 0.9756 0.9870 0.9930 0.996 0.99791 0.99885 0.99989 1.5 0.9598 0.9800 0.9899 0.9948 0.9973 0.99859 0.9996 0.99999 1.6 0.9653 0.9836 0.990 0.9961 0.9981 0.99903 0.99951 1.7 0.9699 0.9864 0.9937 0.9970 0.9986 0.99933 0.99968 1.8 0.9737 0.9886 0.9950 0.9977 0.9990 0.99953 0.99978 1.9 0.9970 0.9904 0.9959 0.9983 0.999 0.99967 0.99985.0 0.9797 0.9919 0.9967 0.9986 0.9994 0.99976 0.99990.1 0.981 0.9931 0.9973 0.9989 0.9996 0.99983 0.99993. 0.9841 0.9941 0.9978 0.999 0.9997 0.99987 0.99995.3 0.9858 0.9950 0.998 0.9993 0.9998 0.99991 0.99996.4 0.9873 0.9957 0.9985 0.9995 0.9998 0.99993 0.99997.5 0.9886 0.9963 0.9987 0.9996 0.9998 0.99995 0.99998.6 0.9898 0.9967 0.9989 0.9996 0.9999 0.99996 0.99999.7 0.9908 0.997 0.9989 0.9997 0.9999 0.99997 0.99999.8 0.9916 0.9975 0.9989 0.9998 0.9999 0.99998 0.99999.9 0.994 0.9978 0.9989 0.9998 0.9999 0.99998 0.99999 3.0 0.9931 0.9981 0.9989 0.9998 0.99999 Explaatio of Tables The tables give the probability that the value of the mea, measured from the mea of the populatio, i terms of the stadard deviatio of the sample, will lie betwee ad z. Thus, to take the table for samples of 6, the probability of the mea of the populatio lyig betwee ad oce the stadard deviatio of the sample is 0.96, the odds are about 4 to 1 that the mea of the populatio lies betwee these limits. 19

The probability is therefore 0.0378 that it is greater tha oce the stadard deviatio ad 0.07511 that it lies outside ±1.0 times the stadard deviatio. Illustratio of Method Illustratio I. As a istace of the kid of use which may be made of the tables, I take the followig figures from a table by A. R. Cushy ad A. R. Peebles i the Joural of Physiology for 1904, showig the differet effects of the optical isomers of hyoscyamie hydrobromide i producig sleep. The average umber of hours sleep gaied by the use of the drug is tabulated below. The coclusio arrived at was that i the usual doses was, but 1 was ot, of value as a soporific. Additioal hours sleep gaied by the use of hyoscyamie hydrobromide Patiet 1 (Dextro-) (Laevo-) Differece ( 1) 1 +0.7 +1.9 +1. 1.6 +0.8 +.4 3 0. +1.1 +1.3 4 1. +0.1 +1.3 5 0.1 0.1 0 6 +3.4 +4.4 +1.0 7 +3.7 +5.5 +1.8 8 +0.8 +1.6 +0.8 9 0 +4.6 +4.6 10 +.0 +3.4 +1.4 Mea +0.75 Mea +.33 Mea +1.58 s.d. 1.70 s.d. 1.90 s.d. 1.17 First let us see what is the probability that 1 will o the average give icrease of sleep; i.e. what is the chace that the mea of the populatio of which these experimets are a sample is positive. +0.75/1.70 = 0.44, ad lookig out z = 0.44 i the table for te experimets we fid by iterpolatig betwee 0.8697 ad 0.9161 that 0.44 correspods to 0.8873, or the odds are 0.887 to 0.113 that the mea is positive. That is about 8 to 1, ad would correspod to the ormal curve to about 1.8 times the probable error. It is the very likely that 1 gives a icrease of sleep, but would occasio o surprise if the results were reversed by further experimets. If ow we cosider the chace that is actually a soporific we have the mea iclrease of sleep =.33/1.90 or 1.3 times the s.d. From the table the probability correspodig to this is 0.9974, i.e. the odds are early 400 to 1 that such is the case. This correspods to about 4.15 times the probable error i the ormal curve. But I take it that the real poit of the authors was that is better tha 1. This we must t4est by makig a ew series, subtractig 1 from. The mea values of this series is +1.38, while the s.d. is 1.17, the mea value beig +1.35 times the s.d. From the table, the probability is 0.9985, or the odds are about 666 to oe that is the better soporific. The low value of 0

the s.d. is probably due to the differet drugs reactig similarly o the same patiet, so that there is correlatio betwee the results. Of course odds of this kid make it almost certai that is the better soporific, ad i practical life such a high probability is i most matters cosidered as a certaity. Illustratio II. Cases where the tables will be useful are ot ucommo i agricultural work, ad they would be more umerous if the advatages of beig able to apply statistical reasoig were bore i mid whe plaig the experimets. I take the followig istaces from the accouts of the Wobur farmig experimets published yearly by Dr Voelcker i the Joural of the Agricultural Soceity. A short series of pot culture experimets were coducted i order to determie the casues which lead to the productio of Hard (glutious) wheat or Soft (starchy) wheat. I three successive years a bulk of seed cor of oe variety was picked over by had ad two samples were selected, oe cosistig of hard grais avid the other of soft. Some of each of them were plated i both heavy ad light soil ad the resultig crops wore weighed ad examied for hard ad soft cor. The coclusio draw was that the effect of selectig the seed was egligible compared with the ifluece of the soil. This coclusio was thoroughly justified, theheavy soul producig i each case early 100% of hard cor, but still the effect of selectig the seed could just be traced i each year. But a curious poit, to which Dr Voelcker draws attetio i the secod year s report, is that the soft seeds produced the higher yield of both cor ad straw. I view of the well-kow fact that the varieties which have a high yield ted to produce soft cor, it is iterestig to see how much evidece the experimets afford as to the correlatio betwee softess ad fertility i the same variety. Further, Mr Hooker 7 has show that the yield of wheat i oe year is largely determied by the weather durig the precedig year. Dr Voelcker s results may afford a clue as to the way i which the seed id affected, ad would almost justify the selectio of particillar soils for growig wheat. 8 Th figures are as follows, the yields beig expressed i grammes per pot: Year 1899 1900 1901 Stadard Soil Light Heavy Light Heavy Light Heavy Average deviatio z Yield of cor from soft seed 7.55 8.89 14.81 13.55 7.49 15.39 11.38 Yield of cor from hard seed 7.7 8.3 13.81 13.36 7.97 13.13 10.643 Differece +0.58 +0.57 +1.00 +0.19 0.49 +.6 +0.685 0.778 0.88 Yield of straw from soft seed 1.81 1.87. 0.1 13.97.57 17.44 Yield of straw from hard seed 10.71 1.48 1.64 0.6 11.71 18.96 15.97 Differece +.10 +0.39 +0.78 0.05 +.66 +3.61 +1.515 1.61 1.0 If we wish to laid the odds that the soft seed will give a better yield of cor o the average, we divide, the average differece by the stadard deviatio, 7 Joural of the Royal Statistical Society, 1897 8 Ad perhaps a few experimets to see whether there is a correlatio betwee yield ad mellowess i barley. 1

givig us z = 0.88. Lookig this up i the table for = 6 we fid p = 0.9465 or the odds are 0.9465 to 0.0535 about 18 to 1. Similarly for straw z = 1.0, p = 0.978, ad the odds are about 45 to 1. I order to see whether such odds are sufficiet for a practical ma to draw a defiite coclusio, I take aother act of experimets i which Dr Voelcker compares the effects of differet artificial maures used with potatoes o a large scale. The figures represet the differece betwee the crops grow with the rise of sulphate of potash ad kailit respectively i both 1904 ad 1905: cwt. qr. lb. to cwt. qr. lb. 1904 + 10 3 0 : + 1 10 1 6 (two experimets i each year) 1905 + 6 0 3 : + 13 8 The average gai by the use of sulphate of potash was 15.5 cwt. ad the s.d. 9 cwt., whece, if we wat the odds that the coclusio give below is right, z = 1.7, correspodig, whe = 4,to p = 0.9698 or odds of 3 to 1; this is midway betwee the odds i the former example. Dr Voelcker says: It may ow fairly be cocluded that for the potato crop o light lad 1 cwt. per acre of sulphate of potash is a better dressig tha kailit. Am a example of how the table should be used with cautio, I take the followig pot culture experimets to test whether it made ay differece whether large or small seeds were sow. Illustratio III. I 1899 ad i 1903 head cor ad tail cor were take from the same bulks of barley ad sow i pots. The yields i grammes were as follows: 1899 1903 Large seed... 13.9 7.3 Small seed... 14.4 1.4 +0.5 +1.4 The average gai is thus 0.95 ad the s.d. 0.45, givig z =.1. Now the table for = is ot give, but if we look up the agle whose taget is.1 i Chambers s tables, p = ta 1.1 180 +0.5 = 64 39 180 = 0.859, so that the odds are about 6 to 1 that small cor gives a better yield tha large. These odds 9 are those which would be laid, ad laid rigidly, by a ma whose oly kowledge of the matter was cotaied i the two experimets. Ayoe coversat with pot culture would however kow that the differece betwee the 9 [Through a umerical slip, ow corrected, Studet had give the odds as 33 to 1 ad it is to this figure that the remarks i this paragraph relate.

two results would geerally be greater ad would correspodigly moderate the certaity of his coclusio. I poit of fact a large-scale experimet cofirmed this result, the small cor yieldig shout 15% more tha the large. I will coclude with a example which comes beyod the rage of the tables, there beig eleve experimets. To test whether it is of advatage to kil-dry barley seed before sowig, seve varieties of barley wore sow (both kil-dried ad ot kil-dried i 1899 ad four i 1900; the results are give i the table. Lb. head cor per acre Price of head cor i Cwt. straw per acre Value of crop per acre shilligs per quarter i shilligs N.K.D. N.D. Diff. N.K.D. N.D. Diff. N.K.D. N.D. Diff. N.K.D. N.D. Diff. 1903 009 +106 6 1 1 0 19 1 5 +5 1 140 1 15 +11 1 1935 1915 0 8 6 1 1 1 1 4 +1 1 15 1 145 7 1 1910 011 +101 9 1 8 1 1 3 4 +1 158 1 161 + 1 1899 496 463 33 30 9 1 3 8 +5 04 1 199 1 5 108 180 + 7 7 1 7 1 1 1 0 16 14 + 1961 195 36 6 6 0 19 1 060 1 + 6 9 6 3 4 1 419 1 1 1 1 14 139 1 1 168 155 13 1444 148 + 38 9 1 8 1 1 15 1 16 + 1 118 117 1 1 1900 161 154 70 8 1 8 1 18 17 1 1316 1443 +17 30 9 1 14 1 15 1 1 +1 1 18 1 109 1 11 7 1 116 1 +7 1511 1535 + 4 8 1 8 1 17 17 1 + 1 10 10 1 + 1 Average 1841.5 1875. +33.7 8.45 7.55 0.91 19.95 1.05 +1.10 145.8 144.68 +1.14 Stadard...... 63.1...... 0.79.......5...... 6.67 deviatio Stadard deviatio...... 63.1...... 0.79.......5...... 6.67 8 It will he oticed that the kil-dried seed gave o a average the larger yield. of cor ad straw, but that the quality was almost always iferior. At first sight this might be supposed to be due to superior germiatig power i the kil-dried seed, but my farmig frieds tell me that the effect of this would be that the kil-dried seed would produce the better quality barley. Dr Voelcker draws the coclusio: I such seasos as 1899 ad 1900 there is o particular advatage i kil-dryig before mowig. Our examiatio completely justifies this ad adds ad the quality of the resultig barley is iferior though the yield may be greater. I this case I propose to use the approximatio give by the ormal curve with stadard deviatio s/ 3 ad therefore use Sheppard s tables, lookig up the differece divided by S/ 8. The probability i the case of yield of cor per acre is give by lookig up 33.7/.3 = 1.51 i Sheppard s tables. This gives p = 0.934, or the odds are about 14 to 1 that kil-dried cor gives the higher yield. Similarly 0.91/0.8 = 3.5, correspodig to p = 0.9994, so that the odds are very great that kil-dried seed gives barley of a worse quality tha seed which has ot bee kil-dried. Similarly, it is about 11 to 1 that kil-dried seed gives more straw ad about to 1 that the total value of the crop is less with kil-dried seed. As poited out i Sectio V, the ormal curve gives too large a value for p whe the probability is large. I fid the true value i this case to be p = 0.9976. It matters little, however, to a coclusio of this kid whether the odds i its favour are 1660 to 1 or merely 416 to 1. 3

Sectio X. Coclusios 1. A curve has bee foud represetig the frequecy distributio of stadard deviatios of samples draw from a ormal populatio.. A curve has bee foud represetig the frequecy distributio of the meas of the such samples, whe these values are measured from the mea of the populatio i terms of the stadard deviatio of the sample. 3. It has bee show that the curve represets the facts fairly well eve whe the distributio of the populatio is ot strictly ormal. 4. Tablesaregivebywhichitcabejudgedwhetheraseriesofexperimets, however short, have give a result which coforms to ay required stadard of accuracy or whether it is ecessary to cotiue the ivestigatio. Fially I should like to express my thaks to Prof. Karl Pearso, without whose costat advice ad criticism this paper could ot have bee writte. [Biometrika, 6 (1908), pp. 1 5, reprited o pp. 11 34 i Studet s Collected Papers, Edited by E. S. Pearso ad Joh Wishart with a Foreword by Lauce McMulle, Cambridge Uiversity Press for the Biometrika Trustees, 194.] 4