A Studentized Range Test for the Equivalency of Normal Means under Heteroscedasticity

Size: px
Start display at page:

Download "A Studentized Range Test for the Equivalency of Normal Means under Heteroscedasticity"

Transcription

1 A Studentized Range Test for the Equivalency of Normal Means under Heteroscedasticity Miin-Jye Wen and Hubert J. Chen Department of Statistics National Cheng-Kung University Tainan, Taiwan ABSTRACT A studentized range test using a two-stage and a one-stage sampling procedures, respectively, is proposed for testing the hypothesis that the average deviation of the normal means is falling into a practical indifference zone. Both the level and the power of the proposed test associated with the hypotheses are controllable and they are completely independent of the unknown variances. The two-stage procedure is a design-oriented procedure that satisfies certain probability requirements and simultaneously determines the required sample sizes for an experiment while the one-stage procedure is a data-analysis procedure after the data have been collected, which can supplement the two-stage procedure when the later has to end its experiment sooner than its required experimental process is completed. Tables needed for implementing these procedures are given. Key Words and Phrases : Indifference zone; Least favorable configuration; Power of a test; Studentized range test; t distribution; Heteroscedasticity. 1

2 1. Introduction The problem of statistical hypothesis testing concerning several normal means has long been a major concern for statisticians. In classical hypothesis testing the interest is often to test the null hypothesis that the population means are all equal (e.g., Lehmann (1986)). It is well known that, for a large enough sample size, the classical test will almost always reject the null hypothesis as pointed out by many researchers (e.g., Berger (1985)). In many real world problems, the practical interest is often to examine whether the population means fall into an indifference zone, not just the equality of means. This idea leads to the consideration of equivalency hypothesis stated as H 0 : 1 ki=1 k µ i µ δ vs H a : 1 ki=1 k µ i µ δ > δ, where µ is the grand average of the means µ 1,..., µ k, δ ( 0) is a predetermined indifference zone and δ (> 0) is a detective amount specified in advance and the quantity stated in the null hypothesis is often regarded as the average deviation of µ i s from their grand mean. The constant δ can be interpreted as the average deviation about which we are indifferent and the null hypothesis H 0 can be interpreted as saying that there is little difference among means within a small δ-value or there is practically equivalent among the means. This type of null hypothesis appears to be more useful and meaningful in the analysis of means among several treatment populations under fixed-effect analysis of variance models. When there are only two populations, the equivalency hypotheses are also referenced to as interval hypotheses or bioequivalence in pharmaceutical and medical studies (e.g., see Chow and Liu (1992)) when H 0 and H a are reversed. For the case of testing three or more means, the studentized range test for the equivalency hypotheses was studied by Chen and Lam (1991) and Chen, Xiong and Lam (1993) for the common unknown variances. For the case of testing the null hypothesis of the equality of means H0 : µ 1 =... = µ k against a specific alternative hypothesis, Ha : µ max µ min δ, where δ (> 0) is a prespecified constant, Chen and Chen (2000) proposed a range test when the variances are unknown and possibly unequal. In this paper, the case of H 0 vs. H a under unequal variances is studied. When the variances are known but unequal, one can divide the original random variables by their corresponding standard deviations so that the transformed variables satisfy the assumption of a common 2

3 (known) variance and then Chen, Xiong and Lam s (1993) method can be used to define a range test. Furthermore, in situations where the variances are unknown but possibly unequal, we use the two-stage sampling procedure as described by Bishop and Dudewicz (1978) and the one-stage sampling procedure proposed by Chen and Lam (1989), respectively, to formulate a modified studentized range test for testing the null hypotheses of equivalency H 0 against the alternative H a. In Section 2, we briefly discuss the case when variances are known but unequal. In Section 3, the two-stage sampling procedure is introduced and a modified studentized range test is defined, and the level and the power of the proposed range test are calculated. In Section 4, the numerical calculation of the level and the power is discussed. In Section 5, a numerical example is given to illustrate the use of the studentized range test. In Section 6, the single-stage sampling procedure is defined and the level and the power of the range test are determined. Finally, a summary and conclusion is given in Section Transformed Sampling Procedure and Range Test In a one-way layout model, let X ij (i = 1,..., k; j = 1,..., N) be a random sample of size N ( 2) drawn from the normal distribution π i having a unknown mean µ i and a known but unequal variance σi 2, i = 1,..., k. Since the heteroscedastic variances σ 2 i are known, divide the r.v. s X ij (i = 1,..., k; j = 1,..., N) by σ i, through i = 1,..., k, to obtain X ij = X ij/σ i, i = 1,..., k; j = 1,..., N. The purpose of transforming the original variables is to eliminate the influence of unequal variances. To see this, one can easily calculate the variance of X ij : V ar(x ij ) = V ar(x ij σ i ) = 1, i = 1,..., k, which is a constant and X ij is distributed as N(µ i, 1), where µ i = µ i/σ i. That is, the variance of the transformed variable X ij is now homoscedastic. Our goal is to test the following null hypothesis H 0 : 1 k µ i k µ δ i=1 3

4 against the alternative H a : 1 k µ i µ δ (δ > δ), k i=1 where µ = k i=1 µ i /k, and the quantity 1 ki=1 k µ i µ is interpreted as the average deviation of means from the grand mean µ. A range test is defined as X max X min, and then Chen, Xiong and Lam s (1993) result with respect to the transformed data can be used without difficulty. It can be shown that the probability of rejecting H 0 attains its maximum at the least favorable configuration (LFC) µ 0 = ( µ kδ N/2, µ,..., µ, µ + kδ N/2) or at its permutations when k 3 and at µ 0 = ( µ kδ N/2, µ + kδ N/2) or at its permutation when k = 2. When sample sizes are unequal, one can replace N by the harmonic mean to obtain an approximation. 3. Two-Stage Procedure and Modified Studentized Range Test Similarly, for the one-way layout model, let X ij (i = 1,..., k; j = 1,..., N i ) be a random sample of size N i ( 3) drawn from the normal distribution π i which has a unknown mean µ i and a possibly unknown variance σi 2, i = 1,..., k, where these variances may be highly unequal. Our goal is to test the null hypothesis against the alternative H 0 : 1 k µ i µ δ (1) k i=1 H a : 1 k µ i µ δ (δ > δ), k i=1 where µ = k i=1 µ i /k, and the quantity 1 k ki=1 µ i µ is interpreted as the average deviation of means from the grand mean µ. A modified studentized range test is to be proposed in such a way that both the level and the power of the test are controllable under the proposed two-stage sampling procedure and they are shown to be free of the unknown and unequal variances. The two-stage sampling procedure for this problem is stated as follows. 4

5 From each of the k populations takes an initial random sample of size n 0 (n 0 2, but Bishop and Dudewicz (1978) suggested that the initial sample of size n 0 be 10 or more giving better results. For economic reason, it is usually taken to be 25 or less.), let S 2 i be the usual unbiased estimate of σ 2 i based on the first n 0 observations from the i th population, and define N i = max{n 0 + 1, [ S2 i z ] + 1} (2) where z > 0 is a design constant to be determined by the power of the test under a specified H a and [X] stands for the largest integer less than X. Then take N i n 0 additional observations from the i th population so that we have a total of N i observations denoted by X i1,..., X in0,..., X ini. For each i, set the coefficients a i1,..., a in0,..., a ini, such that a i1 =... = a in0 = 1 (N i n 0 )b i a in0 +1 =... = a ini = 1 N i [1 + and computed the weighted sample mean as n 0 X i = a i X ij + b i j=1 n 0 N i j=n 0 +1 = a i n 0 (N i z Si 2) ] (N i n 0 )Si 2 = b i (3) X ij. It should be noted that these coefficients a ij s are so determined to satisfy the following conditions, N i j=1 a ij = 1, a i1 =... = a in0, and Si 2 Ni j=1 a2 ij = z. It is easy to show that the r.v. s T i = ( X i µ i )/ z, i = 1,..., k, have independent Student s t-distributions, free of the unknown variances σ 2 i (see, e.g., Chen and Chen (1998)). Let X [1]... X [k] be the order statistics of X 1,..., Xk and let the modified studentized range statistic be defined by R = X [k] X [1] z. (4) The null hypothesis H 0 is rejected at level α iff R > γ α (5) where γ α = γ α (δ, z, k, n 0 ) is the level-α critical value such that P δ (γ α ) = sup P (R > γ α H 0 : 1 Ω k 5 k µ i µ δ) = α, (6) i=1

6 where α (0, 1) is a predetermined level, and Ω is the set of all possible configurations of the µ i s and σ i s, and the design constant z is determined by the power of the test such that P δ (γ α ) = inf Ω P (γ > γ α H a : 1 k where P is taken to be a large value in advance in (0, 1). k µ i µ δ > δ) = P, (7) i=1 Here, it is necessary to find a LFC of the means which maximizes the level of the test under H 0 in (6) and a least favorable configuration which minimizes the power of the test under a specified H a in (7) such that the level and the power are not only independent of all means differences but also free of the unknown variances. The LFC s are determined by Theorem 1 of Chen, Xiong and Lam (1993). It is stated as follows without proof. Theorem 1: Let φ(x i θ i ) be the standard normal density of the independent normal r.v. X i with mean θ i, i = 1,..., k and let g c (θ) = P θ (R c) be the tail probability where R is the range of X 1,..., X k and θ = (θ 1,..., θ k ). Assume that the g c (θ) attains its maximun at some θ for all θ A = {θ : θ i θ δ}, where θ = (θ θ k )/k. Then θ is the LFC and θ must be in B 1 where B 1 = {( θ δ/2, θ + δ/2), ( θ + δ/2, θ δ/2)} for k = 2, and B 1 = {(θ 1,..., θ k ) : one of the θ i s is θ + δ/2, one of the θ i s is θ δ/2, and the other θ i s are θ} for k 3. We now find these LFC s of (6) and (7) described below. Let µ [1]... µ [k] be the ordered values of µ 1,..., µ k and let X (j) be associated with µ [j]. We have P Ω (R > γ α ) = P Ω ( X [k] X [1] > γ α z) = 1 P ( X [k] X [1] + γ α z) k = 1 P ( X [k] X [1] + γ α z, X(j) = X [1] ) j=1 k = 1 P ( X (j) X (i) X (j) + γ α z, i = 1,..., k, i j) j=1 k X (j) µ [j] + µ [j] µ [i] X (i) µ [i] X (j) µ [j] + µ [j] µ [i] = 1 P [ j=1 z z z +γ α, i = 1,..., k, i j] 6

7 k = 1 P [T j + δ ji T i T j + δ ji + γ α, i = 1,..., k, i j] j=1 k k = 1 [F (y + δ ji + γ α ) F (y + δ ji )]f(y)dy, (8) j=1 i=1,i j where δ ji = (µ [j] µ [i] )/ z, i, j = 1,..., k, i j, and T i = ( X (i) µ [i] )/ z, i = 1,..., k, are i.i.d. Student s t r.v. s having p.d.f. f( ) and c.d.f. F ( ) with n 0 1 d.f.. As the n 0 becomes large, T i converges to the standard normal, or equivalently X i / z converges to N(µ i / z, 1). Thus, by Theorem 1, the asymptotic LFC of means which maximizes the integral in (8) occurs at µ 0 = ( µ kδ/2, µ,..., µ, µ + kδ/2) for k 3 and at µ 0 = ( µ kδ/2, µ + kδ/2) for k = 2. For computational reason (for detail, see Chen and Chen (1999).), we rewrite µ 0 as µ 0 = (( µ kvw/2, µ,..., µ, µ + kvw/2), where v = δ/δ and w = δ / z. Let the maximum of (8) subject to µ H 0 at the LFC µ 0 be P δ (γ α, w). Then P δ (γ α, w) = 1 { [F (y kvw/2 + γ α ) F (y kvw/2)] k 2 [F (y kvw + γ α ) F (y kvw)]f(y)dy +(k 2) [F (y + kvw/2 + γ α ) F (y + kvw/2)][f (y + γ α ) F (y)] k 3 [F (y kvw/2 + γ α ) F (y kvw/2)]f(y)dy + [F (y + kvw/2 + γ α ) F (y + kvw/2)] k 2 [F (y + kvw + γ α ) F (y + kvw)]f(y)dy}. (9) Note that in the special case where δ = 0, the P δ (γ α, w) reduces to P 0 (γ α, w) = 1 k [F (y + γ α ) F (y)] k 1 f(y)dy = P ( X [k] X [1] > γ α z H0 : µ 1 = = µ k ), (10) which is the range of i.i.d. Student s t r.v. s. by Wilcox(1983). Similarly, as n 0 becomes large, the minimum power in (8) under H a : 1 ki=1 k µ i µ δ > δ is attained at the asymptotic LFC µ 1 = ( µ kw/(2l),..., µ kw/(2l), µ + kw/(2(k l)),..., µ + kw/(2(k l))) or its permutations with the l such ( µ kw/2l) s and k l such ( µ + kw/(2(k l))) s, l = 1,..., k 1. (See Chen, Xiong and Lam (1993).) 7

8 Let β δ (γ α, w) denote the minimum power in (8) at the LFC µ = µ 1. Then β δ (γ α, w) = 1 {l + (k l) where b = k 2 /(2l(k l)). [F (y + γ α ) F (y)] l 1 [F (y bw + γ α ) F (y bw)] k l f(y)dy [F (y + bw + γ α ) F (y + bw)] l [F (y + γ α ) F (y)] k l 1 f(y)dy}, (11) The minimum power of (11) occurs at l = k/2 when k is even, and at l = (k 1)/2 or l = (k + 1)/2 when k is odd. It is trivial when k = 2. When k = 3, the power in (11) is the same for l = 1, 2. However, when k is four or large, it is not trivial to find its minimum by analytical method. By numerical quadrature as illustrated in Table 2 we confirm the above claim, and it is true in general. Table 2. The Behavior of Power Function of Expression (11). k n α P ν γ ω l Power Power Power Power Computation of the level and the power Using a grid-searching method combined with Newton-Raphson s iterations as described by Chen and Chen (1999), the critical values of γ α and the values of w in (9)-(11) are calculated. Given the level α and the required power P, the equivalent constant δ and the dedective difference δ (or the delta ratio = δ/δ ) under H 0 and H a, respectively, for a specified number of treatments k and an initial sample size n 0, one inputs a small guess of 8

9 ω (= δ / z) into (9) and uses Newton-Raphson s iterations to find a critical value γ α such that the absolute error of P δ (γ α, ω) α is bounded by Then, substitute these values into (11) to see if the probability inequality β δ (γ α, ω) P (12) is satisfied. Increase or decrease ω such that the smallest ω (accurate up to two decimal figures) satisfies (12). Tables 3-9 present the critical values γ α (upper entry) and the values of w = δ / z (lower entry) for the levels at 5% and 1%, and the powers at 90% and 95%, when k = 2(1)6, 8, 10, n 0 = 5, 10, 15, 25 and various ratio δ/δ 0.6. For example, if α = 5% with required power being at least 90% and k = 3, n 0 = 10, δ/δ = 0.2, then the γ α =4.52 and w = δ / z=2.54 are found from Table 4. If the δ value in the null hypothesis H 0 : 1 ki=1 k µ i µ δ is set to be 0.2 unit and the δ value in the alternative hypothesis H a : 1 ki=1 k µ i µ δ is set to be 1 units, then solve δ / z=2.54 for z to obtain the z value of A Numerical Example The data in Table 8 come from an experiment reported by Bishop and Dudewicz (1978). The experiment involved testing four types of solvents for their effects on the ability of a fungicide methyl-2-benzimidazole-carbamate to destroy the fungus penicillium expansum. The fungicide was diluted in exactly the same manner in the four different types of solvents and sprayed on the fungus, and the percentage of fungus destroyed was measured. The mean percentage of fungus destroyed by solvent i is denoted by µ i. Wen and Chen (1994) found that the data from solvents 1 and 3 are not normally distributed using SAS PROC Univariate testing procedure. However, Dudewicz and van der Meulen (1983) has shown the robustness results that apply to the two-stage sampling procedure for nonnormal distributions when population variances are not all equal. Chen, Chen and Ding (2000) had conducted a robust modified Levene s test for homogeneity of variances and found a significant difference (p-value < ) among variances. So, the two-stage sampling procedure can be applied. 9

10 Table 3. Percentage Point γ (upper entry) of a Range Test R and its Power- Related Ratio, δ / z (lower entry) when k = 2 and P =.90 and.95. n 0 = 5 n 0 = 10 n 0 = 15 n 0 = 25 level level level level δ/δ P = P =

11 Table 4. Percentage Point γ (upper entry) of a Range Test R and its Power- Related Ratio, δ / z (lower entry) when k = 3 and P =.90 and.95. n 0 = 5 n 0 = 10 n 0 = 15 n 0 = 25 level level level level δ/δ P = P =

12 Table 5. Percentage Point γ (upper entry) of a Range Test R and its Power- Related Ratio, δ / z (lower entry) when k = 4 and P =.90 and.95. n 0 = 5 n 0 = 10 n 0 = 15 n 0 = 25 level level level level δ/δ P = P =

13 Table 6. Percentage Point γ (upper entry) of a Range Test R and its Power- Related Ratio, δ / z (lower entry) when k = 5 and P =.90 and.95. n 0 = 5 n 0 = 10 n 0 = 15 n 0 = 25 level level level level δ/δ P = P =

14 Table 7. Percentage Point γ (upper entry) of a Range Test R and its Power- Related Ratio, δ / z (lower entry) when k = 6 and P =.90 and.95. n 0 = 5 n 0 = 10 n 0 = 15 n 0 = 25 level level level level δ/δ P = P =

15 Table 8. Percentage Point γ (upper entry) of a Range Test R and its Power- Related Ratio, δ / z (lower entry) when k = 8 and P =.90 and.95. n 0 = 5 n 0 = 10 n 0 = 15 n 0 = 25 level level level level δ/δ P = P =

16 Table 9. Percentage Point γ (upper entry) of a Range Test R and its Power- Related Ratio, δ / z (lower entry) when k = 10 and P =.90 and.95. n 0 = 5 n 0 = 10 n 0 = 15 n 0 = 25 level level level level δ/δ P = P =

17 Suppose that the experimenter regards an average deviation of 0.25% (δ value) among the µ i s to be irrelevant and wishes to detect the average deviation to be at least 1.0% (δ value). This can be stated by the null hypothesis H 0 : 1 4i=1 4 µ i µ 0.25 against the alternative hypothesis H a : 1 4i=1 4 µ i µ 1.0. If the experimenter choose the level of the test to be 5% and a power of at least 0.90, and one takes the initial sample of size n 0 = 15 observations from each solvent, then, the critical value of the modified studentized range test produces a γ α = 5.71 and δ / z = 3.04 at δ/δ = 0.25 from Table 5, so z = The remaining N i 15 observations taken at the second stage are given in Table 11. The summary statistics Si 2, a i, b i and N i defined in Section 3 are given in Table 12. The final weighted sample means X 1, X2, X3 and X 4 are given in the last row of Table 12, and finally, the calculated studentized range test statistic using (4) is R=6.68. Hence, H 0 is rejected in asserting H a with a power of being at least Pairwise multiple comparisons given by Chen and Chen (1999) can be employed to detect the differences among mean percentages. 17

18 Table 10. Bacterial Killing Ability Example (First 15 Observations) Solvent 1 Solvent 2 Solvent 3 Solvent Table 11. Bacterial Killing Ability (Second-Stage Observations) Solvent 1 Solvent 2 Solvent 3 Solvent Table 12. Summary Statistics Statistic Solvent 1 Solvent 2 Solvent 3 Solvent 4 Si a i b i N i X i

19 6. Single-Stage Sampling procedure and Modified Studentized Range Test The two-stage sampling procedure discussed in Section 3 is a design-oriented method which determines the necessary sample sizes of N i and to simultaneously meet a prespecified power requirement at a specified level of the test. In some situations where the two-stage experiment is terminated earlier due to some unexpected reasons such as budget shortage or patients drop-out, the required total sample size N i in (2) cannot be achieved, one only has to use the available m i observations ((n 0 + 1) m i N i ) on hand and recalculate the coefficients a ij s according to the so-called one-stage sampling procedure proposed by Chen and Lam (1989) such that the statistical inference theory can still work. The one-stage procedure is briefly described as below. From each of the k populations one uses an initial sample of size n 0 (2 < n 0 < m i ). Calculate the usual unbiased sample mean and unbiased sample variance, respectively, by X i = n0 j=1 X ij n 0 and S 2 i = n0 j=1 (X ij X i ) 2. n 0 1 Then, construct the new coefficients as U i = 1 m i + 1 m i mi n 0 n 0 (m i z /S 2 i 1), V i = 1 m i 1 m i where U i and V i satisfy the following conditions: n0 m i n 0 (m i z /S 2 i 1), n 0 U i + (m i n 0 )V i = 1, S 2 i [ n 0 U 2 i + (m i n 0 )V 2 i ] = z, where z is the maximum of {S 2 i /m i, i = 1,..., k }. Let the final weighted sample mean be defined by n 0 X i = U i X ij + V i j=1 m i n 0 +1 X ij. (13) 19

20 It is known (Chen and Chen (1998)) that ( X i µ i )/ z, i = 1,..., k, have independent Student s t-distributions each with n 0 1 df.. Similarly, the modified studentized range statistic is defined as R = X [k] X [1] z (14) which is used to test the H 0 against H a in (1). The probability statement P Ω (R > γ α ) can be similarly derived by replacing z by z and it is given by P Ω (R > γ α) = P Ω ( X [k] X [1] > γ α z ) = 1 P ( X [k] X [1] + γ α z ) k = 1 P ( X [k] X [1] + γα z, X (j) = X [1] ) j=1 k = 1 P ( X (j) X (i) X (j) + γα z, i = 1,..., k, i j) j=1 k X (j) µ [j] + µ [j] µ [i] X (i) µ [i] X (j) µ [j] + µ [j] µ [i] = 1 P [ j=1 z z z +γα, i = 1,..., k, i j] k = 1 P [T j + δ ji T i T j + δ ji + γα, i = 1,..., k, i j] j=1 k k = 1 [F (y + δ ji + γα ) F (y + δ ji)]f(y)dy, (15) j=1 i=1,i j where δ ji = (µ [j] µ [i] )/ z, i, j = 1,..., k, i j, and T i = ( X (i) µ [i] )/ z, i = 1,..., k are i.i.d. Student s t r.v. s having p.d.f. f( ) and c.d.f. F ( ) with n 0 1 d.f.. As the n 0 becomes large, T i converges to the standard normal, or equivalently X i / z converges to N(µ i / z, 1). Thus, by Theorem 1, we can obtain the asymptotic LFC of means which maximizes the integral in (15) at µ 0 = ( µ kδ/2, µ,..., µ, µ + kδ/2) or its permutations for k 3 and at µ 0 = ( µ kδ/2, µ + kδ/2) for k = 2. For computational reason, let d = δ/ z, the maximum level of (8) subject to µ H 0 occurs at the asymptotic LFC µ 0 and its value can be approximated by 20

21 P δ (γα, d ) = 1 { [F (y kd /2 + γα ) F (y kd /2)] k 2 [F (y kd + γ α ) F (y kd )]f(y)dy +(k 2) [F (y + kd /2 + γ α ) F (y + kd /2)][F (y + γ α ) F (y)]k 3 [F (y kd /2 + γ α ) F (y kd /2)]f(y)dy + [F (y + kd /2 + γ α) F (y + kd /2)] k 2 [F (y + kd + γ α) F (y + kd )]f(y)dy}. (16) Note that in the special case where δ = 0, the above P δ (γ α, d ) reduces to P 0 (γα, d ) = 1 k [F (y + γα) F (y)] k 1 f(y)dy = P (T i > γ α H 0 : µ 1 = = µ k ). (17) Similarly, as n 0 becomes large, the minimum power in (15) under H a : 1 ki=1 k µ i µ δ > δ is attained at the asymptotic LFC µ 1 = ( µ kd /(2l),..., µ kd /(2l), µ+kd /(2(k l)),..., µ+kd /(2(k l))) with the l such (µ kd /2l) s and k l such (µ + kd /(2(k l))) s, l = 1,..., k 1, where d = δ/ z. (See Chen, Xiong and Lam (1993).) Let β δ (γ α, d ) denote the minimum power in (15) at the LFC µ = µ 1. Then, the minimum power can be calculated by β δ (γα, d ) = 1 {l + (k l) where b = k 2 /(2l(k l)). [F (y + γ α ) F (y)]l 1 [F (y bd + γ α ) F (y bd )] k l f(y)dy [F (y + bd + γ α) F (y + bd )] l [F (y + γ α) F (y)] k l 1 f(y)dy} (18) By numerical calculation we confirm that the minimum power of (18) occurs at l = k/2 when k is even, and at l = (k 1)/2 or l = (k + 1)/2 when k is odd. Given the level α and δ at H 0 and d, k, and n 0, the critical value γ α can be obtained by solving the equation g 1 (γ α, d ) = P δ (γ α, d ) α = 0. (19) The solution γ α in (19) is data dependent on d. 21

22 Using Gaussian quadrature to evaluate the integrals in (16) and the Newton-Raphsons iteration one can find the solution γ α to (19) at the nth iteration by the formula γ n = γ n 1 P δ(γn 1, d ) α P δ (γn 1,, (20) d ) where P δ (γ, d ) is the derivative of P δ w.r.t. γ, given by P δ (γ, d ) = {(k 2)[F (y kd /2 + γ ) F (y kd /2)] k 3 f(y kd /2 + γ ) [F (y kd + γ ) F (y kd )] + [F (y kd /2 + γ ) F (y kd /2)] k 2 f(y kd + γ )}f(y)dy (k 2) [F (y + γ ) F (y)] k 4 {f(y + kd /2 + γ )[F (y + γ ) F (y)] [F (y kd /2 + γ ) F (y kd /2)] + (k 3)[F (y + kd /2 + γ ) F (y + kd /2)]f(y + γ ) [F (y kd /2 + γ ) F (y kd /2)] + [F (y + kd /2 + γ ) F (y + kd /2)][F (y + γ ) F (y)] f(y kd /2 + γ )}f(y)dy {(k 2)[F (y + kd /2 + γ ) F (y + kd /2)] k 3 f(y + kd /2 + γ ) [F (y + kd + γ ) F (y + kd )] + [F (y + kd /2 + γ ) F (y + kd /2)] k 2 f(y + kd + γ )}f(y)dy. The solution (20) is unique because P δ (γα, d ) is monotonically decreasing in γ. When the required sample sizes listed in Table 10 cannot be met due to early termination of the experiment, one can only have the total samples m 1 = 19, m 2 = 25, m 3 = 49 and m 4 = 16. At this time the one-stage procedure can be applied. By (13) the weighted sample means are calculated as X 1 = 96.93, X2 = 95.67, X3 = 95.32, X4 = and z = max{si 2/m i} = By (14) the test statistic is γ = Given δ = 0.25, δ = 1.0, k = 4, n 0 = 15 as previously specified, the p-value of γ (= 5.23) is At 10% level 22

23 of significance, the critical value is γ.10 = 4.97 and the estimated power is Therefore, at α = 0.10, we can reject the null hypothesis of all means being within an equivalent average deviation of.25 units in favor of the difference of being at least 1 unit. This calculation was done by a Fortran program named chenwen7.for available from the authors. 7. Summary and Conclusion Testing the null hypothesis of equal treatment means is sometimes impractical in real applications, as pointed out by Berger (1985). An alternative measure to detect the difference among means is the average deviation of the means, which extends the idea of equivalency among means. The test of equivalency receives more attention in health sciences, pharmaceutical industry, and other areas. When the variances are unknown and unequal, a studentized range test using a two-stage and a one-stage sampling procedures, respectively, is proposed for testing the hypothesis that the average deviation of the normal means is falling into a practical indifference zone. Both the level and the power of the proposed test by the two-stage procedure are controllable and they are completely independent of the unknown variances. Statistical tables to implement the procedure are provided for practitioners, and an example is given to demonstrate the use of the procedure. The two-stage procedure is a design-oriented procedure that satisfies certain probability requirements and simultaneously determines the required sample sizes (which can be large at the second stage) in an experiment while the one-stage procedure is a dataanalysis procedure after the data have been collected, which can supplement the two-stage procedure when the later has to end its experiment sooner than its required experimental process is completed. At that time the level and power can be approximated, and the onestage sampling procedure is shown to be quite feasible under heterocedasticity. 23

24 REFERENCES Berger, J. 0. (1985). Statistical Decision Theory, 2nd edition, Springer-Verlag, N.Y. Bishop, T.A., and Dudewicz, E. J. (1978). Exact Analysis of Variance with Unequal Variances : Test Procedures and Tables. Technometrics, 20, Chen, S. Y., and Chen, H. J. (1998). Single-Stage Analysis of Variance under Heteroscedasticity. Communication in Statistics-Theory and Methods, 27(3), Chen, S. Y., and Chen, H. J. (1999). A Range Test for the Equivalency of Means under Unequal Variances. Technometrics, Vol. 41, No. 3, Chen, S. Y., and Chen, H. J. (2000). A Range Test for the Equality of Means When Variances Are Unequal. American Journal of Mathematical and Management Sciences, Vol. 20, Nos. 1&2, Chen, S. Y., Chen, H. J., and Ding, C. G. (2000). An ANOVA Test for the Equivalency of Means under Unequal Variances. Computational Statistics and Data Analysis, 33, Chen, H. J., and Lam, K. (1989). Single-Stage Interval Estimation of the Largest Normal Mean under Heteroscedasticity. Communication in Statistics-Theory and Methods, 18(10), Chen, H. J., and Lam, K. (1991). Percentage Points of a Studentized Range Statistic Arising from Non-identical Normal Random Variables. Communications in Statistics : Simulation and Computation, 20(4), Chen, H. J., Xiong, M., and Lam, K. (1993). Range Tests for the Dispersion of Several Location Parameters. Journal of Statistical Planning and Inference, 36, Chow, S. C., and Liu, J.P. (1992). Design and Analysis of Bioavailability and Bioequivalence Studies, New York: Marcel Dekker. Dudewicz, E. J., and van der Meulen, E. C. (1983). Entropy-Based Statistical Inference II : Selection-of- the-best/complete Ranking for Continuous Distributions on (0,1), with 24

25 Applications to Random Number Generators. Statistics and Decisions, 1, Lehmann, E. L. (1986). Testing Statistical Hypothesis, 2nd edition, Wiley, N. Y. Wen, M. J., and Chen, H. J. (1994). Single-Stage Multiple Comparison Procedures under Heteroscedasticity. American Journal of Mathematical and Management Sciences, 14, Wilcox, R. R. (1983). A Table of Percentage Points of Range of Independent t Variables. Technometrics, 25,

Permutation Tests for Comparing Two Populations

Permutation Tests for Comparing Two Populations Permutation Tests for Comparing Two Populations Ferry Butar Butar, Ph.D. Jae-Wan Park Abstract Permutation tests for comparing two populations could be widely used in practice because of flexibility of

More information

Non Parametric Inference

Non Parametric Inference Maura Department of Economics and Finance Università Tor Vergata Outline 1 2 3 Inverse distribution function Theorem: Let U be a uniform random variable on (0, 1). Let X be a continuous random variable

More information

Exact Nonparametric Tests for Comparing Means - A Personal Summary

Exact Nonparametric Tests for Comparing Means - A Personal Summary Exact Nonparametric Tests for Comparing Means - A Personal Summary Karl H. Schlag European University Institute 1 December 14, 2006 1 Economics Department, European University Institute. Via della Piazzuola

More information

Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model

Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model 1 September 004 A. Introduction and assumptions The classical normal linear regression model can be written

More information

1.5 Oneway Analysis of Variance

1.5 Oneway Analysis of Variance Statistics: Rosie Cornish. 200. 1.5 Oneway Analysis of Variance 1 Introduction Oneway analysis of variance (ANOVA) is used to compare several means. This method is often used in scientific or medical experiments

More information

3.4 Statistical inference for 2 populations based on two samples

3.4 Statistical inference for 2 populations based on two samples 3.4 Statistical inference for 2 populations based on two samples Tests for a difference between two population means The first sample will be denoted as X 1, X 2,..., X m. The second sample will be denoted

More information

MODIFIED PARAMETRIC BOOTSTRAP: A ROBUST ALTERNATIVE TO CLASSICAL TEST

MODIFIED PARAMETRIC BOOTSTRAP: A ROBUST ALTERNATIVE TO CLASSICAL TEST MODIFIED PARAMETRIC BOOTSTRAP: A ROBUST ALTERNATIVE TO CLASSICAL TEST Zahayu Md Yusof, Nurul Hanis Harun, Sharipah Sooad Syed Yahaya & Suhaida Abdullah School of Quantitative Sciences College of Arts and

More information

Introduction to General and Generalized Linear Models

Introduction to General and Generalized Linear Models Introduction to General and Generalized Linear Models General Linear Models - part I Henrik Madsen Poul Thyregod Informatics and Mathematical Modelling Technical University of Denmark DK-2800 Kgs. Lyngby

More information

The Variability of P-Values. Summary

The Variability of P-Values. Summary The Variability of P-Values Dennis D. Boos Department of Statistics North Carolina State University Raleigh, NC 27695-8203 boos@stat.ncsu.edu August 15, 2009 NC State Statistics Departement Tech Report

More information

Introduction to Analysis of Variance (ANOVA) Limitations of the t-test

Introduction to Analysis of Variance (ANOVA) Limitations of the t-test Introduction to Analysis of Variance (ANOVA) The Structural Model, The Summary Table, and the One- Way ANOVA Limitations of the t-test Although the t-test is commonly used, it has limitations Can only

More information

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING In this lab you will explore the concept of a confidence interval and hypothesis testing through a simulation problem in engineering setting.

More information

INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA)

INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA) INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA) As with other parametric statistics, we begin the one-way ANOVA with a test of the underlying assumptions. Our first assumption is the assumption of

More information

How To Check For Differences In The One Way Anova

How To Check For Differences In The One Way Anova MINITAB ASSISTANT WHITE PAPER This paper explains the research conducted by Minitab statisticians to develop the methods and data checks used in the Assistant in Minitab 17 Statistical Software. One-Way

More information

13: Additional ANOVA Topics. Post hoc Comparisons

13: Additional ANOVA Topics. Post hoc Comparisons 13: Additional ANOVA Topics Post hoc Comparisons ANOVA Assumptions Assessing Group Variances When Distributional Assumptions are Severely Violated Kruskal-Wallis Test Post hoc Comparisons In the prior

More information

Fairfield Public Schools

Fairfield Public Schools Mathematics Fairfield Public Schools AP Statistics AP Statistics BOE Approved 04/08/2014 1 AP STATISTICS Critical Areas of Focus AP Statistics is a rigorous course that offers advanced students an opportunity

More information

4. Continuous Random Variables, the Pareto and Normal Distributions

4. Continuous Random Variables, the Pareto and Normal Distributions 4. Continuous Random Variables, the Pareto and Normal Distributions A continuous random variable X can take any value in a given range (e.g. height, weight, age). The distribution of a continuous random

More information

Chapter 5 Analysis of variance SPSS Analysis of variance

Chapter 5 Analysis of variance SPSS Analysis of variance Chapter 5 Analysis of variance SPSS Analysis of variance Data file used: gss.sav How to get there: Analyze Compare Means One-way ANOVA To test the null hypothesis that several population means are equal,

More information

Nonparametric Statistics

Nonparametric Statistics Nonparametric Statistics References Some good references for the topics in this course are 1. Higgins, James (2004), Introduction to Nonparametric Statistics 2. Hollander and Wolfe, (1999), Nonparametric

More information

2 Sample t-test (unequal sample sizes and unequal variances)

2 Sample t-test (unequal sample sizes and unequal variances) Variations of the t-test: Sample tail Sample t-test (unequal sample sizes and unequal variances) Like the last example, below we have ceramic sherd thickness measurements (in cm) of two samples representing

More information

SAS Certificate Applied Statistics and SAS Programming

SAS Certificate Applied Statistics and SAS Programming SAS Certificate Applied Statistics and SAS Programming SAS Certificate Applied Statistics and Advanced SAS Programming Brigham Young University Department of Statistics offers an Applied Statistics and

More information

. (3.3) n Note that supremum (3.2) must occur at one of the observed values x i or to the left of x i.

. (3.3) n Note that supremum (3.2) must occur at one of the observed values x i or to the left of x i. Chapter 3 Kolmogorov-Smirnov Tests There are many situations where experimenters need to know what is the distribution of the population of their interest. For example, if they want to use a parametric

More information

Two-Sample T-Tests Assuming Equal Variance (Enter Means)

Two-Sample T-Tests Assuming Equal Variance (Enter Means) Chapter 4 Two-Sample T-Tests Assuming Equal Variance (Enter Means) Introduction This procedure provides sample size and power calculations for one- or two-sided two-sample t-tests when the variances of

More information

Two-Sample T-Tests Allowing Unequal Variance (Enter Difference)

Two-Sample T-Tests Allowing Unequal Variance (Enter Difference) Chapter 45 Two-Sample T-Tests Allowing Unequal Variance (Enter Difference) Introduction This procedure provides sample size and power calculations for one- or two-sided two-sample t-tests when no assumption

More information

individualdifferences

individualdifferences 1 Simple ANalysis Of Variance (ANOVA) Oftentimes we have more than two groups that we want to compare. The purpose of ANOVA is to allow us to compare group means from several independent samples. In general,

More information

Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means

Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means Lesson : Comparison of Population Means Part c: Comparison of Two- Means Welcome to lesson c. This third lesson of lesson will discuss hypothesis testing for two independent means. Steps in Hypothesis

More information

Chapter 7. One-way ANOVA

Chapter 7. One-way ANOVA Chapter 7 One-way ANOVA One-way ANOVA examines equality of population means for a quantitative outcome and a single categorical explanatory variable with any number of levels. The t-test of Chapter 6 looks

More information

5.1 Identifying the Target Parameter

5.1 Identifying the Target Parameter University of California, Davis Department of Statistics Summer Session II Statistics 13 August 20, 2012 Date of latest update: August 20 Lecture 5: Estimation with Confidence intervals 5.1 Identifying

More information

Randomized Block Analysis of Variance

Randomized Block Analysis of Variance Chapter 565 Randomized Block Analysis of Variance Introduction This module analyzes a randomized block analysis of variance with up to two treatment factors and their interaction. It provides tables of

More information

One-Way Analysis of Variance

One-Way Analysis of Variance One-Way Analysis of Variance Note: Much of the math here is tedious but straightforward. We ll skim over it in class but you should be sure to ask questions if you don t understand it. I. Overview A. We

More information

Rank-Based Non-Parametric Tests

Rank-Based Non-Parametric Tests Rank-Based Non-Parametric Tests Reminder: Student Instructional Rating Surveys You have until May 8 th to fill out the student instructional rating surveys at https://sakai.rutgers.edu/portal/site/sirs

More information

Two Correlated Proportions (McNemar Test)

Two Correlated Proportions (McNemar Test) Chapter 50 Two Correlated Proportions (Mcemar Test) Introduction This procedure computes confidence intervals and hypothesis tests for the comparison of the marginal frequencies of two factors (each with

More information

Optimal Stopping in Software Testing

Optimal Stopping in Software Testing Optimal Stopping in Software Testing Nilgun Morali, 1 Refik Soyer 2 1 Department of Statistics, Dokuz Eylal Universitesi, Turkey 2 Department of Management Science, The George Washington University, 2115

More information

UNDERSTANDING THE TWO-WAY ANOVA

UNDERSTANDING THE TWO-WAY ANOVA UNDERSTANDING THE e have seen how the one-way ANOVA can be used to compare two or more sample means in studies involving a single independent variable. This can be extended to two independent variables

More information

Variables Control Charts

Variables Control Charts MINITAB ASSISTANT WHITE PAPER This paper explains the research conducted by Minitab statisticians to develop the methods and data checks used in the Assistant in Minitab 17 Statistical Software. Variables

More information

A MULTIVARIATE TEST FOR SIMILARITY OF TWO DISSOLUTION PROFILES

A MULTIVARIATE TEST FOR SIMILARITY OF TWO DISSOLUTION PROFILES Journal of Biopharmaceutical Statistics, 15: 265 278, 2005 Copyright Taylor & Francis, Inc. ISSN: 1054-3406 print/1520-5711 online DOI: 10.1081/BIP-200049832 A MULTIVARIATE TEST FOR SIMILARITY OF TWO DISSOLUTION

More information

1 Solving LPs: The Simplex Algorithm of George Dantzig

1 Solving LPs: The Simplex Algorithm of George Dantzig Solving LPs: The Simplex Algorithm of George Dantzig. Simplex Pivoting: Dictionary Format We illustrate a general solution procedure, called the simplex algorithm, by implementing it on a very simple example.

More information

Section 7.1. Introduction to Hypothesis Testing. Schrodinger s cat quantum mechanics thought experiment (1935)

Section 7.1. Introduction to Hypothesis Testing. Schrodinger s cat quantum mechanics thought experiment (1935) Section 7.1 Introduction to Hypothesis Testing Schrodinger s cat quantum mechanics thought experiment (1935) Statistical Hypotheses A statistical hypothesis is a claim about a population. Null hypothesis

More information

MATRIX ALGEBRA AND SYSTEMS OF EQUATIONS

MATRIX ALGEBRA AND SYSTEMS OF EQUATIONS MATRIX ALGEBRA AND SYSTEMS OF EQUATIONS Systems of Equations and Matrices Representation of a linear system The general system of m equations in n unknowns can be written a x + a 2 x 2 + + a n x n b a

More information

Principle of Data Reduction

Principle of Data Reduction Chapter 6 Principle of Data Reduction 6.1 Introduction An experimenter uses the information in a sample X 1,..., X n to make inferences about an unknown parameter θ. If the sample size n is large, then

More information

Comparing Means in Two Populations

Comparing Means in Two Populations Comparing Means in Two Populations Overview The previous section discussed hypothesis testing when sampling from a single population (either a single mean or two means from the same population). Now we

More information

The CUSUM algorithm a small review. Pierre Granjon

The CUSUM algorithm a small review. Pierre Granjon The CUSUM algorithm a small review Pierre Granjon June, 1 Contents 1 The CUSUM algorithm 1.1 Algorithm............................... 1.1.1 The problem......................... 1.1. The different steps......................

More information

Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1)

Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1) Spring 204 Class 9: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.) Big Picture: More than Two Samples In Chapter 7: We looked at quantitative variables and compared the

More information

Maximum Likelihood Estimation

Maximum Likelihood Estimation Math 541: Statistical Theory II Lecturer: Songfeng Zheng Maximum Likelihood Estimation 1 Maximum Likelihood Estimation Maximum likelihood is a relatively simple method of constructing an estimator for

More information

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.436J/15.085J Fall 2008 Lecture 5 9/17/2008 RANDOM VARIABLES

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.436J/15.085J Fall 2008 Lecture 5 9/17/2008 RANDOM VARIABLES MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.436J/15.085J Fall 2008 Lecture 5 9/17/2008 RANDOM VARIABLES Contents 1. Random variables and measurable functions 2. Cumulative distribution functions 3. Discrete

More information

12: Analysis of Variance. Introduction

12: Analysis of Variance. Introduction 1: Analysis of Variance Introduction EDA Hypothesis Test Introduction In Chapter 8 and again in Chapter 11 we compared means from two independent groups. In this chapter we extend the procedure to consider

More information

Chapter 2. Hypothesis testing in one population

Chapter 2. Hypothesis testing in one population Chapter 2. Hypothesis testing in one population Contents Introduction, the null and alternative hypotheses Hypothesis testing process Type I and Type II errors, power Test statistic, level of significance

More information

Life Table Analysis using Weighted Survey Data

Life Table Analysis using Weighted Survey Data Life Table Analysis using Weighted Survey Data James G. Booth and Thomas A. Hirschl June 2005 Abstract Formulas for constructing valid pointwise confidence bands for survival distributions, estimated using

More information

Recall this chart that showed how most of our course would be organized:

Recall this chart that showed how most of our course would be organized: Chapter 4 One-Way ANOVA Recall this chart that showed how most of our course would be organized: Explanatory Variable(s) Response Variable Methods Categorical Categorical Contingency Tables Categorical

More information

Statistics courses often teach the two-sample t-test, linear regression, and analysis of variance

Statistics courses often teach the two-sample t-test, linear regression, and analysis of variance 2 Making Connections: The Two-Sample t-test, Regression, and ANOVA In theory, there s no difference between theory and practice. In practice, there is. Yogi Berra 1 Statistics courses often teach the two-sample

More information

Introduction to Fixed Effects Methods

Introduction to Fixed Effects Methods Introduction to Fixed Effects Methods 1 1.1 The Promise of Fixed Effects for Nonexperimental Research... 1 1.2 The Paired-Comparisons t-test as a Fixed Effects Method... 2 1.3 Costs and Benefits of Fixed

More information

Non-Inferiority Tests for Two Means using Differences

Non-Inferiority Tests for Two Means using Differences Chapter 450 on-inferiority Tests for Two Means using Differences Introduction This procedure computes power and sample size for non-inferiority tests in two-sample designs in which the outcome is a continuous

More information

Stat 411/511 THE RANDOMIZATION TEST. Charlotte Wickham. stat511.cwick.co.nz. Oct 16 2015

Stat 411/511 THE RANDOMIZATION TEST. Charlotte Wickham. stat511.cwick.co.nz. Oct 16 2015 Stat 411/511 THE RANDOMIZATION TEST Oct 16 2015 Charlotte Wickham stat511.cwick.co.nz Today Review randomization model Conduct randomization test What about CIs? Using a t-distribution as an approximation

More information

Nonparametric adaptive age replacement with a one-cycle criterion

Nonparametric adaptive age replacement with a one-cycle criterion Nonparametric adaptive age replacement with a one-cycle criterion P. Coolen-Schrijner, F.P.A. Coolen Department of Mathematical Sciences University of Durham, Durham, DH1 3LE, UK e-mail: Pauline.Schrijner@durham.ac.uk

More information

Tests for Two Proportions

Tests for Two Proportions Chapter 200 Tests for Two Proportions Introduction This module computes power and sample size for hypothesis tests of the difference, ratio, or odds ratio of two independent proportions. The test statistics

More information

Non-Inferiority Tests for Two Proportions

Non-Inferiority Tests for Two Proportions Chapter 0 Non-Inferiority Tests for Two Proportions Introduction This module provides power analysis and sample size calculation for non-inferiority and superiority tests in twosample designs in which

More information

Tests for One Proportion

Tests for One Proportion Chapter 100 Tests for One Proportion Introduction The One-Sample Proportion Test is used to assess whether a population proportion (P1) is significantly different from a hypothesized value (P0). This is

More information

CONTENTS OF DAY 2. II. Why Random Sampling is Important 9 A myth, an urban legend, and the real reason NOTES FOR SUMMER STATISTICS INSTITUTE COURSE

CONTENTS OF DAY 2. II. Why Random Sampling is Important 9 A myth, an urban legend, and the real reason NOTES FOR SUMMER STATISTICS INSTITUTE COURSE 1 2 CONTENTS OF DAY 2 I. More Precise Definition of Simple Random Sample 3 Connection with independent random variables 3 Problems with small populations 8 II. Why Random Sampling is Important 9 A myth,

More information

Nonparametric Tests for Randomness

Nonparametric Tests for Randomness ECE 461 PROJECT REPORT, MAY 2003 1 Nonparametric Tests for Randomness Ying Wang ECE 461 PROJECT REPORT, MAY 2003 2 Abstract To decide whether a given sequence is truely random, or independent and identically

More information

LOGNORMAL MODEL FOR STOCK PRICES

LOGNORMAL MODEL FOR STOCK PRICES LOGNORMAL MODEL FOR STOCK PRICES MICHAEL J. SHARPE MATHEMATICS DEPARTMENT, UCSD 1. INTRODUCTION What follows is a simple but important model that will be the basis for a later study of stock prices as

More information

UNDERSTANDING THE INDEPENDENT-SAMPLES t TEST

UNDERSTANDING THE INDEPENDENT-SAMPLES t TEST UNDERSTANDING The independent-samples t test evaluates the difference between the means of two independent or unrelated groups. That is, we evaluate whether the means for two independent groups are significantly

More information

Simple Linear Regression Inference

Simple Linear Regression Inference Simple Linear Regression Inference 1 Inference requirements The Normality assumption of the stochastic term e is needed for inference even if it is not a OLS requirement. Therefore we have: Interpretation

More information

Chapter 4: Statistical Hypothesis Testing

Chapter 4: Statistical Hypothesis Testing Chapter 4: Statistical Hypothesis Testing Christophe Hurlin November 20, 2015 Christophe Hurlin () Advanced Econometrics - Master ESA November 20, 2015 1 / 225 Section 1 Introduction Christophe Hurlin

More information

Examining Differences (Comparing Groups) using SPSS Inferential statistics (Part I) Dwayne Devonish

Examining Differences (Comparing Groups) using SPSS Inferential statistics (Part I) Dwayne Devonish Examining Differences (Comparing Groups) using SPSS Inferential statistics (Part I) Dwayne Devonish Statistics Statistics are quantitative methods of describing, analysing, and drawing inferences (conclusions)

More information

Chapter 12 Nonparametric Tests. Chapter Table of Contents

Chapter 12 Nonparametric Tests. Chapter Table of Contents Chapter 12 Nonparametric Tests Chapter Table of Contents OVERVIEW...171 Testing for Normality...... 171 Comparing Distributions....171 ONE-SAMPLE TESTS...172 TWO-SAMPLE TESTS...172 ComparingTwoIndependentSamples...172

More information

Lecture Notes Module 1

Lecture Notes Module 1 Lecture Notes Module 1 Study Populations A study population is a clearly defined collection of people, animals, plants, or objects. In psychological research, a study population usually consists of a specific

More information

Tests of Hypotheses Using Statistics

Tests of Hypotheses Using Statistics Tests of Hypotheses Using Statistics Adam Massey and Steven J. Miller Mathematics Department Brown University Providence, RI 0292 Abstract We present the various methods of hypothesis testing that one

More information

The Wilcoxon Rank-Sum Test

The Wilcoxon Rank-Sum Test 1 The Wilcoxon Rank-Sum Test The Wilcoxon rank-sum test is a nonparametric alternative to the twosample t-test which is based solely on the order in which the observations from the two samples fall. We

More information

SAS/STAT. 9.2 User s Guide. Introduction to. Nonparametric Analysis. (Book Excerpt) SAS Documentation

SAS/STAT. 9.2 User s Guide. Introduction to. Nonparametric Analysis. (Book Excerpt) SAS Documentation SAS/STAT Introduction to 9.2 User s Guide Nonparametric Analysis (Book Excerpt) SAS Documentation This document is an individual chapter from SAS/STAT 9.2 User s Guide. The correct bibliographic citation

More information

Non-Inferiority Tests for One Mean

Non-Inferiority Tests for One Mean Chapter 45 Non-Inferiority ests for One Mean Introduction his module computes power and sample size for non-inferiority tests in one-sample designs in which the outcome is distributed as a normal random

More information

E3: PROBABILITY AND STATISTICS lecture notes

E3: PROBABILITY AND STATISTICS lecture notes E3: PROBABILITY AND STATISTICS lecture notes 2 Contents 1 PROBABILITY THEORY 7 1.1 Experiments and random events............................ 7 1.2 Certain event. Impossible event............................

More information

Independent t- Test (Comparing Two Means)

Independent t- Test (Comparing Two Means) Independent t- Test (Comparing Two Means) The objectives of this lesson are to learn: the definition/purpose of independent t-test when to use the independent t-test the use of SPSS to complete an independent

More information

Study Guide for the Final Exam

Study Guide for the Final Exam Study Guide for the Final Exam When studying, remember that the computational portion of the exam will only involve new material (covered after the second midterm), that material from Exam 1 will make

More information

Principles of Hypothesis Testing for Public Health

Principles of Hypothesis Testing for Public Health Principles of Hypothesis Testing for Public Health Laura Lee Johnson, Ph.D. Statistician National Center for Complementary and Alternative Medicine johnslau@mail.nih.gov Fall 2011 Answers to Questions

More information

Lecture 19: Chapter 8, Section 1 Sampling Distributions: Proportions

Lecture 19: Chapter 8, Section 1 Sampling Distributions: Proportions Lecture 19: Chapter 8, Section 1 Sampling Distributions: Proportions Typical Inference Problem Definition of Sampling Distribution 3 Approaches to Understanding Sampling Dist. Applying 68-95-99.7 Rule

More information

Analysis of Variance ANOVA

Analysis of Variance ANOVA Analysis of Variance ANOVA Overview We ve used the t -test to compare the means from two independent groups. Now we ve come to the final topic of the course: how to compare means from more than two populations.

More information

UNDERSTANDING THE DEPENDENT-SAMPLES t TEST

UNDERSTANDING THE DEPENDENT-SAMPLES t TEST UNDERSTANDING THE DEPENDENT-SAMPLES t TEST A dependent-samples t test (a.k.a. matched or paired-samples, matched-pairs, samples, or subjects, simple repeated-measures or within-groups, or correlated groups)

More information

Analysis of Variance (ANOVA) Using Minitab

Analysis of Variance (ANOVA) Using Minitab Analysis of Variance (ANOVA) Using Minitab By Keith M. Bower, M.S., Technical Training Specialist, Minitab Inc. Frequently, scientists are concerned with detecting differences in means (averages) between

More information

Friedman's Two-way Analysis of Variance by Ranks -- Analysis of k-within-group Data with a Quantitative Response Variable

Friedman's Two-way Analysis of Variance by Ranks -- Analysis of k-within-group Data with a Quantitative Response Variable Friedman's Two-way Analysis of Variance by Ranks -- Analysis of k-within-group Data with a Quantitative Response Variable Application: This statistic has two applications that can appear very different,

More information

MATH10212 Linear Algebra. Systems of Linear Equations. Definition. An n-dimensional vector is a row or a column of n numbers (or letters): a 1.

MATH10212 Linear Algebra. Systems of Linear Equations. Definition. An n-dimensional vector is a row or a column of n numbers (or letters): a 1. MATH10212 Linear Algebra Textbook: D. Poole, Linear Algebra: A Modern Introduction. Thompson, 2006. ISBN 0-534-40596-7. Systems of Linear Equations Definition. An n-dimensional vector is a row or a column

More information

T test as a parametric statistic

T test as a parametric statistic KJA Statistical Round pissn 2005-619 eissn 2005-7563 T test as a parametric statistic Korean Journal of Anesthesiology Department of Anesthesia and Pain Medicine, Pusan National University School of Medicine,

More information

Spatial Statistics Chapter 3 Basics of areal data and areal data modeling

Spatial Statistics Chapter 3 Basics of areal data and areal data modeling Spatial Statistics Chapter 3 Basics of areal data and areal data modeling Recall areal data also known as lattice data are data Y (s), s D where D is a discrete index set. This usually corresponds to data

More information

A Primer on Mathematical Statistics and Univariate Distributions; The Normal Distribution; The GLM with the Normal Distribution

A Primer on Mathematical Statistics and Univariate Distributions; The Normal Distribution; The GLM with the Normal Distribution A Primer on Mathematical Statistics and Univariate Distributions; The Normal Distribution; The GLM with the Normal Distribution PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 4: September

More information

Stationary random graphs on Z with prescribed iid degrees and finite mean connections

Stationary random graphs on Z with prescribed iid degrees and finite mean connections Stationary random graphs on Z with prescribed iid degrees and finite mean connections Maria Deijfen Johan Jonasson February 2006 Abstract Let F be a probability distribution with support on the non-negative

More information

Chapter 3 RANDOM VARIATE GENERATION

Chapter 3 RANDOM VARIATE GENERATION Chapter 3 RANDOM VARIATE GENERATION In order to do a Monte Carlo simulation either by hand or by computer, techniques must be developed for generating values of random variables having known distributions.

More information

160 CHAPTER 4. VECTOR SPACES

160 CHAPTER 4. VECTOR SPACES 160 CHAPTER 4. VECTOR SPACES 4. Rank and Nullity In this section, we look at relationships between the row space, column space, null space of a matrix and its transpose. We will derive fundamental results

More information

STATISTICA Formula Guide: Logistic Regression. Table of Contents

STATISTICA Formula Guide: Logistic Regression. Table of Contents : Table of Contents... 1 Overview of Model... 1 Dispersion... 2 Parameterization... 3 Sigma-Restricted Model... 3 Overparameterized Model... 4 Reference Coding... 4 Model Summary (Summary Tab)... 5 Summary

More information

The Relative Worst Order Ratio for On-Line Algorithms

The Relative Worst Order Ratio for On-Line Algorithms The Relative Worst Order Ratio for On-Line Algorithms Joan Boyar 1 and Lene M. Favrholdt 2 1 Department of Mathematics and Computer Science, University of Southern Denmark, Odense, Denmark, joan@imada.sdu.dk

More information

Chapter 1 Introduction. 1.1 Introduction

Chapter 1 Introduction. 1.1 Introduction Chapter 1 Introduction 1.1 Introduction 1 1.2 What Is a Monte Carlo Study? 2 1.2.1 Simulating the Rolling of Two Dice 2 1.3 Why Is Monte Carlo Simulation Often Necessary? 4 1.4 What Are Some Typical Situations

More information

Multivariate Normal Distribution

Multivariate Normal Distribution Multivariate Normal Distribution Lecture 4 July 21, 2011 Advanced Multivariate Statistical Methods ICPSR Summer Session #2 Lecture #4-7/21/2011 Slide 1 of 41 Last Time Matrices and vectors Eigenvalues

More information

Least Squares Estimation

Least Squares Estimation Least Squares Estimation SARA A VAN DE GEER Volume 2, pp 1041 1045 in Encyclopedia of Statistics in Behavioral Science ISBN-13: 978-0-470-86080-9 ISBN-10: 0-470-86080-4 Editors Brian S Everitt & David

More information

1 Nonparametric Statistics

1 Nonparametric Statistics 1 Nonparametric Statistics When finding confidence intervals or conducting tests so far, we always described the population with a model, which includes a set of parameters. Then we could make decisions

More information

Quantitative Methods for Finance

Quantitative Methods for Finance Quantitative Methods for Finance Module 1: The Time Value of Money 1 Learning how to interpret interest rates as required rates of return, discount rates, or opportunity costs. 2 Learning how to explain

More information

Chapter 7 Section 7.1: Inference for the Mean of a Population

Chapter 7 Section 7.1: Inference for the Mean of a Population Chapter 7 Section 7.1: Inference for the Mean of a Population Now let s look at a similar situation Take an SRS of size n Normal Population : N(, ). Both and are unknown parameters. Unlike what we used

More information

The t-test and Basic Inference Principles

The t-test and Basic Inference Principles Chapter 6 The t-test and Basic Inference Principles The t-test is used as an example of the basic principles of statistical inference. One of the simplest situations for which we might design an experiment

More information

CHAPTER 2 Estimating Probabilities

CHAPTER 2 Estimating Probabilities CHAPTER 2 Estimating Probabilities Machine Learning Copyright c 2016. Tom M. Mitchell. All rights reserved. *DRAFT OF January 24, 2016* *PLEASE DO NOT DISTRIBUTE WITHOUT AUTHOR S PERMISSION* This is a

More information

Sensitivity Analysis 3.1 AN EXAMPLE FOR ANALYSIS

Sensitivity Analysis 3.1 AN EXAMPLE FOR ANALYSIS Sensitivity Analysis 3 We have already been introduced to sensitivity analysis in Chapter via the geometry of a simple example. We saw that the values of the decision variables and those of the slack and

More information

Statistical Models in R

Statistical Models in R Statistical Models in R Some Examples Steven Buechler Department of Mathematics 276B Hurley Hall; 1-6233 Fall, 2007 Outline Statistical Models Structure of models in R Model Assessment (Part IA) Anova

More information

QUANTITATIVE METHODS BIOLOGY FINAL HONOUR SCHOOL NON-PARAMETRIC TESTS

QUANTITATIVE METHODS BIOLOGY FINAL HONOUR SCHOOL NON-PARAMETRIC TESTS QUANTITATIVE METHODS BIOLOGY FINAL HONOUR SCHOOL NON-PARAMETRIC TESTS This booklet contains lecture notes for the nonparametric work in the QM course. This booklet may be online at http://users.ox.ac.uk/~grafen/qmnotes/index.html.

More information

Math 55: Discrete Mathematics

Math 55: Discrete Mathematics Math 55: Discrete Mathematics UC Berkeley, Fall 2011 Homework # 5, due Wednesday, February 22 5.1.4 Let P (n) be the statement that 1 3 + 2 3 + + n 3 = (n(n + 1)/2) 2 for the positive integer n. a) What

More information