A Studentized Range Test for the Equivalency of Normal Means under Heteroscedasticity
|
|
- Debra Nichols
- 7 years ago
- Views:
Transcription
1 A Studentized Range Test for the Equivalency of Normal Means under Heteroscedasticity Miin-Jye Wen and Hubert J. Chen Department of Statistics National Cheng-Kung University Tainan, Taiwan ABSTRACT A studentized range test using a two-stage and a one-stage sampling procedures, respectively, is proposed for testing the hypothesis that the average deviation of the normal means is falling into a practical indifference zone. Both the level and the power of the proposed test associated with the hypotheses are controllable and they are completely independent of the unknown variances. The two-stage procedure is a design-oriented procedure that satisfies certain probability requirements and simultaneously determines the required sample sizes for an experiment while the one-stage procedure is a data-analysis procedure after the data have been collected, which can supplement the two-stage procedure when the later has to end its experiment sooner than its required experimental process is completed. Tables needed for implementing these procedures are given. Key Words and Phrases : Indifference zone; Least favorable configuration; Power of a test; Studentized range test; t distribution; Heteroscedasticity. 1
2 1. Introduction The problem of statistical hypothesis testing concerning several normal means has long been a major concern for statisticians. In classical hypothesis testing the interest is often to test the null hypothesis that the population means are all equal (e.g., Lehmann (1986)). It is well known that, for a large enough sample size, the classical test will almost always reject the null hypothesis as pointed out by many researchers (e.g., Berger (1985)). In many real world problems, the practical interest is often to examine whether the population means fall into an indifference zone, not just the equality of means. This idea leads to the consideration of equivalency hypothesis stated as H 0 : 1 ki=1 k µ i µ δ vs H a : 1 ki=1 k µ i µ δ > δ, where µ is the grand average of the means µ 1,..., µ k, δ ( 0) is a predetermined indifference zone and δ (> 0) is a detective amount specified in advance and the quantity stated in the null hypothesis is often regarded as the average deviation of µ i s from their grand mean. The constant δ can be interpreted as the average deviation about which we are indifferent and the null hypothesis H 0 can be interpreted as saying that there is little difference among means within a small δ-value or there is practically equivalent among the means. This type of null hypothesis appears to be more useful and meaningful in the analysis of means among several treatment populations under fixed-effect analysis of variance models. When there are only two populations, the equivalency hypotheses are also referenced to as interval hypotheses or bioequivalence in pharmaceutical and medical studies (e.g., see Chow and Liu (1992)) when H 0 and H a are reversed. For the case of testing three or more means, the studentized range test for the equivalency hypotheses was studied by Chen and Lam (1991) and Chen, Xiong and Lam (1993) for the common unknown variances. For the case of testing the null hypothesis of the equality of means H0 : µ 1 =... = µ k against a specific alternative hypothesis, Ha : µ max µ min δ, where δ (> 0) is a prespecified constant, Chen and Chen (2000) proposed a range test when the variances are unknown and possibly unequal. In this paper, the case of H 0 vs. H a under unequal variances is studied. When the variances are known but unequal, one can divide the original random variables by their corresponding standard deviations so that the transformed variables satisfy the assumption of a common 2
3 (known) variance and then Chen, Xiong and Lam s (1993) method can be used to define a range test. Furthermore, in situations where the variances are unknown but possibly unequal, we use the two-stage sampling procedure as described by Bishop and Dudewicz (1978) and the one-stage sampling procedure proposed by Chen and Lam (1989), respectively, to formulate a modified studentized range test for testing the null hypotheses of equivalency H 0 against the alternative H a. In Section 2, we briefly discuss the case when variances are known but unequal. In Section 3, the two-stage sampling procedure is introduced and a modified studentized range test is defined, and the level and the power of the proposed range test are calculated. In Section 4, the numerical calculation of the level and the power is discussed. In Section 5, a numerical example is given to illustrate the use of the studentized range test. In Section 6, the single-stage sampling procedure is defined and the level and the power of the range test are determined. Finally, a summary and conclusion is given in Section Transformed Sampling Procedure and Range Test In a one-way layout model, let X ij (i = 1,..., k; j = 1,..., N) be a random sample of size N ( 2) drawn from the normal distribution π i having a unknown mean µ i and a known but unequal variance σi 2, i = 1,..., k. Since the heteroscedastic variances σ 2 i are known, divide the r.v. s X ij (i = 1,..., k; j = 1,..., N) by σ i, through i = 1,..., k, to obtain X ij = X ij/σ i, i = 1,..., k; j = 1,..., N. The purpose of transforming the original variables is to eliminate the influence of unequal variances. To see this, one can easily calculate the variance of X ij : V ar(x ij ) = V ar(x ij σ i ) = 1, i = 1,..., k, which is a constant and X ij is distributed as N(µ i, 1), where µ i = µ i/σ i. That is, the variance of the transformed variable X ij is now homoscedastic. Our goal is to test the following null hypothesis H 0 : 1 k µ i k µ δ i=1 3
4 against the alternative H a : 1 k µ i µ δ (δ > δ), k i=1 where µ = k i=1 µ i /k, and the quantity 1 ki=1 k µ i µ is interpreted as the average deviation of means from the grand mean µ. A range test is defined as X max X min, and then Chen, Xiong and Lam s (1993) result with respect to the transformed data can be used without difficulty. It can be shown that the probability of rejecting H 0 attains its maximum at the least favorable configuration (LFC) µ 0 = ( µ kδ N/2, µ,..., µ, µ + kδ N/2) or at its permutations when k 3 and at µ 0 = ( µ kδ N/2, µ + kδ N/2) or at its permutation when k = 2. When sample sizes are unequal, one can replace N by the harmonic mean to obtain an approximation. 3. Two-Stage Procedure and Modified Studentized Range Test Similarly, for the one-way layout model, let X ij (i = 1,..., k; j = 1,..., N i ) be a random sample of size N i ( 3) drawn from the normal distribution π i which has a unknown mean µ i and a possibly unknown variance σi 2, i = 1,..., k, where these variances may be highly unequal. Our goal is to test the null hypothesis against the alternative H 0 : 1 k µ i µ δ (1) k i=1 H a : 1 k µ i µ δ (δ > δ), k i=1 where µ = k i=1 µ i /k, and the quantity 1 k ki=1 µ i µ is interpreted as the average deviation of means from the grand mean µ. A modified studentized range test is to be proposed in such a way that both the level and the power of the test are controllable under the proposed two-stage sampling procedure and they are shown to be free of the unknown and unequal variances. The two-stage sampling procedure for this problem is stated as follows. 4
5 From each of the k populations takes an initial random sample of size n 0 (n 0 2, but Bishop and Dudewicz (1978) suggested that the initial sample of size n 0 be 10 or more giving better results. For economic reason, it is usually taken to be 25 or less.), let S 2 i be the usual unbiased estimate of σ 2 i based on the first n 0 observations from the i th population, and define N i = max{n 0 + 1, [ S2 i z ] + 1} (2) where z > 0 is a design constant to be determined by the power of the test under a specified H a and [X] stands for the largest integer less than X. Then take N i n 0 additional observations from the i th population so that we have a total of N i observations denoted by X i1,..., X in0,..., X ini. For each i, set the coefficients a i1,..., a in0,..., a ini, such that a i1 =... = a in0 = 1 (N i n 0 )b i a in0 +1 =... = a ini = 1 N i [1 + and computed the weighted sample mean as n 0 X i = a i X ij + b i j=1 n 0 N i j=n 0 +1 = a i n 0 (N i z Si 2) ] (N i n 0 )Si 2 = b i (3) X ij. It should be noted that these coefficients a ij s are so determined to satisfy the following conditions, N i j=1 a ij = 1, a i1 =... = a in0, and Si 2 Ni j=1 a2 ij = z. It is easy to show that the r.v. s T i = ( X i µ i )/ z, i = 1,..., k, have independent Student s t-distributions, free of the unknown variances σ 2 i (see, e.g., Chen and Chen (1998)). Let X [1]... X [k] be the order statistics of X 1,..., Xk and let the modified studentized range statistic be defined by R = X [k] X [1] z. (4) The null hypothesis H 0 is rejected at level α iff R > γ α (5) where γ α = γ α (δ, z, k, n 0 ) is the level-α critical value such that P δ (γ α ) = sup P (R > γ α H 0 : 1 Ω k 5 k µ i µ δ) = α, (6) i=1
6 where α (0, 1) is a predetermined level, and Ω is the set of all possible configurations of the µ i s and σ i s, and the design constant z is determined by the power of the test such that P δ (γ α ) = inf Ω P (γ > γ α H a : 1 k where P is taken to be a large value in advance in (0, 1). k µ i µ δ > δ) = P, (7) i=1 Here, it is necessary to find a LFC of the means which maximizes the level of the test under H 0 in (6) and a least favorable configuration which minimizes the power of the test under a specified H a in (7) such that the level and the power are not only independent of all means differences but also free of the unknown variances. The LFC s are determined by Theorem 1 of Chen, Xiong and Lam (1993). It is stated as follows without proof. Theorem 1: Let φ(x i θ i ) be the standard normal density of the independent normal r.v. X i with mean θ i, i = 1,..., k and let g c (θ) = P θ (R c) be the tail probability where R is the range of X 1,..., X k and θ = (θ 1,..., θ k ). Assume that the g c (θ) attains its maximun at some θ for all θ A = {θ : θ i θ δ}, where θ = (θ θ k )/k. Then θ is the LFC and θ must be in B 1 where B 1 = {( θ δ/2, θ + δ/2), ( θ + δ/2, θ δ/2)} for k = 2, and B 1 = {(θ 1,..., θ k ) : one of the θ i s is θ + δ/2, one of the θ i s is θ δ/2, and the other θ i s are θ} for k 3. We now find these LFC s of (6) and (7) described below. Let µ [1]... µ [k] be the ordered values of µ 1,..., µ k and let X (j) be associated with µ [j]. We have P Ω (R > γ α ) = P Ω ( X [k] X [1] > γ α z) = 1 P ( X [k] X [1] + γ α z) k = 1 P ( X [k] X [1] + γ α z, X(j) = X [1] ) j=1 k = 1 P ( X (j) X (i) X (j) + γ α z, i = 1,..., k, i j) j=1 k X (j) µ [j] + µ [j] µ [i] X (i) µ [i] X (j) µ [j] + µ [j] µ [i] = 1 P [ j=1 z z z +γ α, i = 1,..., k, i j] 6
7 k = 1 P [T j + δ ji T i T j + δ ji + γ α, i = 1,..., k, i j] j=1 k k = 1 [F (y + δ ji + γ α ) F (y + δ ji )]f(y)dy, (8) j=1 i=1,i j where δ ji = (µ [j] µ [i] )/ z, i, j = 1,..., k, i j, and T i = ( X (i) µ [i] )/ z, i = 1,..., k, are i.i.d. Student s t r.v. s having p.d.f. f( ) and c.d.f. F ( ) with n 0 1 d.f.. As the n 0 becomes large, T i converges to the standard normal, or equivalently X i / z converges to N(µ i / z, 1). Thus, by Theorem 1, the asymptotic LFC of means which maximizes the integral in (8) occurs at µ 0 = ( µ kδ/2, µ,..., µ, µ + kδ/2) for k 3 and at µ 0 = ( µ kδ/2, µ + kδ/2) for k = 2. For computational reason (for detail, see Chen and Chen (1999).), we rewrite µ 0 as µ 0 = (( µ kvw/2, µ,..., µ, µ + kvw/2), where v = δ/δ and w = δ / z. Let the maximum of (8) subject to µ H 0 at the LFC µ 0 be P δ (γ α, w). Then P δ (γ α, w) = 1 { [F (y kvw/2 + γ α ) F (y kvw/2)] k 2 [F (y kvw + γ α ) F (y kvw)]f(y)dy +(k 2) [F (y + kvw/2 + γ α ) F (y + kvw/2)][f (y + γ α ) F (y)] k 3 [F (y kvw/2 + γ α ) F (y kvw/2)]f(y)dy + [F (y + kvw/2 + γ α ) F (y + kvw/2)] k 2 [F (y + kvw + γ α ) F (y + kvw)]f(y)dy}. (9) Note that in the special case where δ = 0, the P δ (γ α, w) reduces to P 0 (γ α, w) = 1 k [F (y + γ α ) F (y)] k 1 f(y)dy = P ( X [k] X [1] > γ α z H0 : µ 1 = = µ k ), (10) which is the range of i.i.d. Student s t r.v. s. by Wilcox(1983). Similarly, as n 0 becomes large, the minimum power in (8) under H a : 1 ki=1 k µ i µ δ > δ is attained at the asymptotic LFC µ 1 = ( µ kw/(2l),..., µ kw/(2l), µ + kw/(2(k l)),..., µ + kw/(2(k l))) or its permutations with the l such ( µ kw/2l) s and k l such ( µ + kw/(2(k l))) s, l = 1,..., k 1. (See Chen, Xiong and Lam (1993).) 7
8 Let β δ (γ α, w) denote the minimum power in (8) at the LFC µ = µ 1. Then β δ (γ α, w) = 1 {l + (k l) where b = k 2 /(2l(k l)). [F (y + γ α ) F (y)] l 1 [F (y bw + γ α ) F (y bw)] k l f(y)dy [F (y + bw + γ α ) F (y + bw)] l [F (y + γ α ) F (y)] k l 1 f(y)dy}, (11) The minimum power of (11) occurs at l = k/2 when k is even, and at l = (k 1)/2 or l = (k + 1)/2 when k is odd. It is trivial when k = 2. When k = 3, the power in (11) is the same for l = 1, 2. However, when k is four or large, it is not trivial to find its minimum by analytical method. By numerical quadrature as illustrated in Table 2 we confirm the above claim, and it is true in general. Table 2. The Behavior of Power Function of Expression (11). k n α P ν γ ω l Power Power Power Power Computation of the level and the power Using a grid-searching method combined with Newton-Raphson s iterations as described by Chen and Chen (1999), the critical values of γ α and the values of w in (9)-(11) are calculated. Given the level α and the required power P, the equivalent constant δ and the dedective difference δ (or the delta ratio = δ/δ ) under H 0 and H a, respectively, for a specified number of treatments k and an initial sample size n 0, one inputs a small guess of 8
9 ω (= δ / z) into (9) and uses Newton-Raphson s iterations to find a critical value γ α such that the absolute error of P δ (γ α, ω) α is bounded by Then, substitute these values into (11) to see if the probability inequality β δ (γ α, ω) P (12) is satisfied. Increase or decrease ω such that the smallest ω (accurate up to two decimal figures) satisfies (12). Tables 3-9 present the critical values γ α (upper entry) and the values of w = δ / z (lower entry) for the levels at 5% and 1%, and the powers at 90% and 95%, when k = 2(1)6, 8, 10, n 0 = 5, 10, 15, 25 and various ratio δ/δ 0.6. For example, if α = 5% with required power being at least 90% and k = 3, n 0 = 10, δ/δ = 0.2, then the γ α =4.52 and w = δ / z=2.54 are found from Table 4. If the δ value in the null hypothesis H 0 : 1 ki=1 k µ i µ δ is set to be 0.2 unit and the δ value in the alternative hypothesis H a : 1 ki=1 k µ i µ δ is set to be 1 units, then solve δ / z=2.54 for z to obtain the z value of A Numerical Example The data in Table 8 come from an experiment reported by Bishop and Dudewicz (1978). The experiment involved testing four types of solvents for their effects on the ability of a fungicide methyl-2-benzimidazole-carbamate to destroy the fungus penicillium expansum. The fungicide was diluted in exactly the same manner in the four different types of solvents and sprayed on the fungus, and the percentage of fungus destroyed was measured. The mean percentage of fungus destroyed by solvent i is denoted by µ i. Wen and Chen (1994) found that the data from solvents 1 and 3 are not normally distributed using SAS PROC Univariate testing procedure. However, Dudewicz and van der Meulen (1983) has shown the robustness results that apply to the two-stage sampling procedure for nonnormal distributions when population variances are not all equal. Chen, Chen and Ding (2000) had conducted a robust modified Levene s test for homogeneity of variances and found a significant difference (p-value < ) among variances. So, the two-stage sampling procedure can be applied. 9
10 Table 3. Percentage Point γ (upper entry) of a Range Test R and its Power- Related Ratio, δ / z (lower entry) when k = 2 and P =.90 and.95. n 0 = 5 n 0 = 10 n 0 = 15 n 0 = 25 level level level level δ/δ P = P =
11 Table 4. Percentage Point γ (upper entry) of a Range Test R and its Power- Related Ratio, δ / z (lower entry) when k = 3 and P =.90 and.95. n 0 = 5 n 0 = 10 n 0 = 15 n 0 = 25 level level level level δ/δ P = P =
12 Table 5. Percentage Point γ (upper entry) of a Range Test R and its Power- Related Ratio, δ / z (lower entry) when k = 4 and P =.90 and.95. n 0 = 5 n 0 = 10 n 0 = 15 n 0 = 25 level level level level δ/δ P = P =
13 Table 6. Percentage Point γ (upper entry) of a Range Test R and its Power- Related Ratio, δ / z (lower entry) when k = 5 and P =.90 and.95. n 0 = 5 n 0 = 10 n 0 = 15 n 0 = 25 level level level level δ/δ P = P =
14 Table 7. Percentage Point γ (upper entry) of a Range Test R and its Power- Related Ratio, δ / z (lower entry) when k = 6 and P =.90 and.95. n 0 = 5 n 0 = 10 n 0 = 15 n 0 = 25 level level level level δ/δ P = P =
15 Table 8. Percentage Point γ (upper entry) of a Range Test R and its Power- Related Ratio, δ / z (lower entry) when k = 8 and P =.90 and.95. n 0 = 5 n 0 = 10 n 0 = 15 n 0 = 25 level level level level δ/δ P = P =
16 Table 9. Percentage Point γ (upper entry) of a Range Test R and its Power- Related Ratio, δ / z (lower entry) when k = 10 and P =.90 and.95. n 0 = 5 n 0 = 10 n 0 = 15 n 0 = 25 level level level level δ/δ P = P =
17 Suppose that the experimenter regards an average deviation of 0.25% (δ value) among the µ i s to be irrelevant and wishes to detect the average deviation to be at least 1.0% (δ value). This can be stated by the null hypothesis H 0 : 1 4i=1 4 µ i µ 0.25 against the alternative hypothesis H a : 1 4i=1 4 µ i µ 1.0. If the experimenter choose the level of the test to be 5% and a power of at least 0.90, and one takes the initial sample of size n 0 = 15 observations from each solvent, then, the critical value of the modified studentized range test produces a γ α = 5.71 and δ / z = 3.04 at δ/δ = 0.25 from Table 5, so z = The remaining N i 15 observations taken at the second stage are given in Table 11. The summary statistics Si 2, a i, b i and N i defined in Section 3 are given in Table 12. The final weighted sample means X 1, X2, X3 and X 4 are given in the last row of Table 12, and finally, the calculated studentized range test statistic using (4) is R=6.68. Hence, H 0 is rejected in asserting H a with a power of being at least Pairwise multiple comparisons given by Chen and Chen (1999) can be employed to detect the differences among mean percentages. 17
18 Table 10. Bacterial Killing Ability Example (First 15 Observations) Solvent 1 Solvent 2 Solvent 3 Solvent Table 11. Bacterial Killing Ability (Second-Stage Observations) Solvent 1 Solvent 2 Solvent 3 Solvent Table 12. Summary Statistics Statistic Solvent 1 Solvent 2 Solvent 3 Solvent 4 Si a i b i N i X i
19 6. Single-Stage Sampling procedure and Modified Studentized Range Test The two-stage sampling procedure discussed in Section 3 is a design-oriented method which determines the necessary sample sizes of N i and to simultaneously meet a prespecified power requirement at a specified level of the test. In some situations where the two-stage experiment is terminated earlier due to some unexpected reasons such as budget shortage or patients drop-out, the required total sample size N i in (2) cannot be achieved, one only has to use the available m i observations ((n 0 + 1) m i N i ) on hand and recalculate the coefficients a ij s according to the so-called one-stage sampling procedure proposed by Chen and Lam (1989) such that the statistical inference theory can still work. The one-stage procedure is briefly described as below. From each of the k populations one uses an initial sample of size n 0 (2 < n 0 < m i ). Calculate the usual unbiased sample mean and unbiased sample variance, respectively, by X i = n0 j=1 X ij n 0 and S 2 i = n0 j=1 (X ij X i ) 2. n 0 1 Then, construct the new coefficients as U i = 1 m i + 1 m i mi n 0 n 0 (m i z /S 2 i 1), V i = 1 m i 1 m i where U i and V i satisfy the following conditions: n0 m i n 0 (m i z /S 2 i 1), n 0 U i + (m i n 0 )V i = 1, S 2 i [ n 0 U 2 i + (m i n 0 )V 2 i ] = z, where z is the maximum of {S 2 i /m i, i = 1,..., k }. Let the final weighted sample mean be defined by n 0 X i = U i X ij + V i j=1 m i n 0 +1 X ij. (13) 19
20 It is known (Chen and Chen (1998)) that ( X i µ i )/ z, i = 1,..., k, have independent Student s t-distributions each with n 0 1 df.. Similarly, the modified studentized range statistic is defined as R = X [k] X [1] z (14) which is used to test the H 0 against H a in (1). The probability statement P Ω (R > γ α ) can be similarly derived by replacing z by z and it is given by P Ω (R > γ α) = P Ω ( X [k] X [1] > γ α z ) = 1 P ( X [k] X [1] + γ α z ) k = 1 P ( X [k] X [1] + γα z, X (j) = X [1] ) j=1 k = 1 P ( X (j) X (i) X (j) + γα z, i = 1,..., k, i j) j=1 k X (j) µ [j] + µ [j] µ [i] X (i) µ [i] X (j) µ [j] + µ [j] µ [i] = 1 P [ j=1 z z z +γα, i = 1,..., k, i j] k = 1 P [T j + δ ji T i T j + δ ji + γα, i = 1,..., k, i j] j=1 k k = 1 [F (y + δ ji + γα ) F (y + δ ji)]f(y)dy, (15) j=1 i=1,i j where δ ji = (µ [j] µ [i] )/ z, i, j = 1,..., k, i j, and T i = ( X (i) µ [i] )/ z, i = 1,..., k are i.i.d. Student s t r.v. s having p.d.f. f( ) and c.d.f. F ( ) with n 0 1 d.f.. As the n 0 becomes large, T i converges to the standard normal, or equivalently X i / z converges to N(µ i / z, 1). Thus, by Theorem 1, we can obtain the asymptotic LFC of means which maximizes the integral in (15) at µ 0 = ( µ kδ/2, µ,..., µ, µ + kδ/2) or its permutations for k 3 and at µ 0 = ( µ kδ/2, µ + kδ/2) for k = 2. For computational reason, let d = δ/ z, the maximum level of (8) subject to µ H 0 occurs at the asymptotic LFC µ 0 and its value can be approximated by 20
21 P δ (γα, d ) = 1 { [F (y kd /2 + γα ) F (y kd /2)] k 2 [F (y kd + γ α ) F (y kd )]f(y)dy +(k 2) [F (y + kd /2 + γ α ) F (y + kd /2)][F (y + γ α ) F (y)]k 3 [F (y kd /2 + γ α ) F (y kd /2)]f(y)dy + [F (y + kd /2 + γ α) F (y + kd /2)] k 2 [F (y + kd + γ α) F (y + kd )]f(y)dy}. (16) Note that in the special case where δ = 0, the above P δ (γ α, d ) reduces to P 0 (γα, d ) = 1 k [F (y + γα) F (y)] k 1 f(y)dy = P (T i > γ α H 0 : µ 1 = = µ k ). (17) Similarly, as n 0 becomes large, the minimum power in (15) under H a : 1 ki=1 k µ i µ δ > δ is attained at the asymptotic LFC µ 1 = ( µ kd /(2l),..., µ kd /(2l), µ+kd /(2(k l)),..., µ+kd /(2(k l))) with the l such (µ kd /2l) s and k l such (µ + kd /(2(k l))) s, l = 1,..., k 1, where d = δ/ z. (See Chen, Xiong and Lam (1993).) Let β δ (γ α, d ) denote the minimum power in (15) at the LFC µ = µ 1. Then, the minimum power can be calculated by β δ (γα, d ) = 1 {l + (k l) where b = k 2 /(2l(k l)). [F (y + γ α ) F (y)]l 1 [F (y bd + γ α ) F (y bd )] k l f(y)dy [F (y + bd + γ α) F (y + bd )] l [F (y + γ α) F (y)] k l 1 f(y)dy} (18) By numerical calculation we confirm that the minimum power of (18) occurs at l = k/2 when k is even, and at l = (k 1)/2 or l = (k + 1)/2 when k is odd. Given the level α and δ at H 0 and d, k, and n 0, the critical value γ α can be obtained by solving the equation g 1 (γ α, d ) = P δ (γ α, d ) α = 0. (19) The solution γ α in (19) is data dependent on d. 21
22 Using Gaussian quadrature to evaluate the integrals in (16) and the Newton-Raphsons iteration one can find the solution γ α to (19) at the nth iteration by the formula γ n = γ n 1 P δ(γn 1, d ) α P δ (γn 1,, (20) d ) where P δ (γ, d ) is the derivative of P δ w.r.t. γ, given by P δ (γ, d ) = {(k 2)[F (y kd /2 + γ ) F (y kd /2)] k 3 f(y kd /2 + γ ) [F (y kd + γ ) F (y kd )] + [F (y kd /2 + γ ) F (y kd /2)] k 2 f(y kd + γ )}f(y)dy (k 2) [F (y + γ ) F (y)] k 4 {f(y + kd /2 + γ )[F (y + γ ) F (y)] [F (y kd /2 + γ ) F (y kd /2)] + (k 3)[F (y + kd /2 + γ ) F (y + kd /2)]f(y + γ ) [F (y kd /2 + γ ) F (y kd /2)] + [F (y + kd /2 + γ ) F (y + kd /2)][F (y + γ ) F (y)] f(y kd /2 + γ )}f(y)dy {(k 2)[F (y + kd /2 + γ ) F (y + kd /2)] k 3 f(y + kd /2 + γ ) [F (y + kd + γ ) F (y + kd )] + [F (y + kd /2 + γ ) F (y + kd /2)] k 2 f(y + kd + γ )}f(y)dy. The solution (20) is unique because P δ (γα, d ) is monotonically decreasing in γ. When the required sample sizes listed in Table 10 cannot be met due to early termination of the experiment, one can only have the total samples m 1 = 19, m 2 = 25, m 3 = 49 and m 4 = 16. At this time the one-stage procedure can be applied. By (13) the weighted sample means are calculated as X 1 = 96.93, X2 = 95.67, X3 = 95.32, X4 = and z = max{si 2/m i} = By (14) the test statistic is γ = Given δ = 0.25, δ = 1.0, k = 4, n 0 = 15 as previously specified, the p-value of γ (= 5.23) is At 10% level 22
23 of significance, the critical value is γ.10 = 4.97 and the estimated power is Therefore, at α = 0.10, we can reject the null hypothesis of all means being within an equivalent average deviation of.25 units in favor of the difference of being at least 1 unit. This calculation was done by a Fortran program named chenwen7.for available from the authors. 7. Summary and Conclusion Testing the null hypothesis of equal treatment means is sometimes impractical in real applications, as pointed out by Berger (1985). An alternative measure to detect the difference among means is the average deviation of the means, which extends the idea of equivalency among means. The test of equivalency receives more attention in health sciences, pharmaceutical industry, and other areas. When the variances are unknown and unequal, a studentized range test using a two-stage and a one-stage sampling procedures, respectively, is proposed for testing the hypothesis that the average deviation of the normal means is falling into a practical indifference zone. Both the level and the power of the proposed test by the two-stage procedure are controllable and they are completely independent of the unknown variances. Statistical tables to implement the procedure are provided for practitioners, and an example is given to demonstrate the use of the procedure. The two-stage procedure is a design-oriented procedure that satisfies certain probability requirements and simultaneously determines the required sample sizes (which can be large at the second stage) in an experiment while the one-stage procedure is a dataanalysis procedure after the data have been collected, which can supplement the two-stage procedure when the later has to end its experiment sooner than its required experimental process is completed. At that time the level and power can be approximated, and the onestage sampling procedure is shown to be quite feasible under heterocedasticity. 23
24 REFERENCES Berger, J. 0. (1985). Statistical Decision Theory, 2nd edition, Springer-Verlag, N.Y. Bishop, T.A., and Dudewicz, E. J. (1978). Exact Analysis of Variance with Unequal Variances : Test Procedures and Tables. Technometrics, 20, Chen, S. Y., and Chen, H. J. (1998). Single-Stage Analysis of Variance under Heteroscedasticity. Communication in Statistics-Theory and Methods, 27(3), Chen, S. Y., and Chen, H. J. (1999). A Range Test for the Equivalency of Means under Unequal Variances. Technometrics, Vol. 41, No. 3, Chen, S. Y., and Chen, H. J. (2000). A Range Test for the Equality of Means When Variances Are Unequal. American Journal of Mathematical and Management Sciences, Vol. 20, Nos. 1&2, Chen, S. Y., Chen, H. J., and Ding, C. G. (2000). An ANOVA Test for the Equivalency of Means under Unequal Variances. Computational Statistics and Data Analysis, 33, Chen, H. J., and Lam, K. (1989). Single-Stage Interval Estimation of the Largest Normal Mean under Heteroscedasticity. Communication in Statistics-Theory and Methods, 18(10), Chen, H. J., and Lam, K. (1991). Percentage Points of a Studentized Range Statistic Arising from Non-identical Normal Random Variables. Communications in Statistics : Simulation and Computation, 20(4), Chen, H. J., Xiong, M., and Lam, K. (1993). Range Tests for the Dispersion of Several Location Parameters. Journal of Statistical Planning and Inference, 36, Chow, S. C., and Liu, J.P. (1992). Design and Analysis of Bioavailability and Bioequivalence Studies, New York: Marcel Dekker. Dudewicz, E. J., and van der Meulen, E. C. (1983). Entropy-Based Statistical Inference II : Selection-of- the-best/complete Ranking for Continuous Distributions on (0,1), with 24
25 Applications to Random Number Generators. Statistics and Decisions, 1, Lehmann, E. L. (1986). Testing Statistical Hypothesis, 2nd edition, Wiley, N. Y. Wen, M. J., and Chen, H. J. (1994). Single-Stage Multiple Comparison Procedures under Heteroscedasticity. American Journal of Mathematical and Management Sciences, 14, Wilcox, R. R. (1983). A Table of Percentage Points of Range of Independent t Variables. Technometrics, 25,
Permutation Tests for Comparing Two Populations
Permutation Tests for Comparing Two Populations Ferry Butar Butar, Ph.D. Jae-Wan Park Abstract Permutation tests for comparing two populations could be widely used in practice because of flexibility of
More informationNon Parametric Inference
Maura Department of Economics and Finance Università Tor Vergata Outline 1 2 3 Inverse distribution function Theorem: Let U be a uniform random variable on (0, 1). Let X be a continuous random variable
More informationExact Nonparametric Tests for Comparing Means - A Personal Summary
Exact Nonparametric Tests for Comparing Means - A Personal Summary Karl H. Schlag European University Institute 1 December 14, 2006 1 Economics Department, European University Institute. Via della Piazzuola
More informationOverview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model
Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model 1 September 004 A. Introduction and assumptions The classical normal linear regression model can be written
More information1.5 Oneway Analysis of Variance
Statistics: Rosie Cornish. 200. 1.5 Oneway Analysis of Variance 1 Introduction Oneway analysis of variance (ANOVA) is used to compare several means. This method is often used in scientific or medical experiments
More information3.4 Statistical inference for 2 populations based on two samples
3.4 Statistical inference for 2 populations based on two samples Tests for a difference between two population means The first sample will be denoted as X 1, X 2,..., X m. The second sample will be denoted
More informationMODIFIED PARAMETRIC BOOTSTRAP: A ROBUST ALTERNATIVE TO CLASSICAL TEST
MODIFIED PARAMETRIC BOOTSTRAP: A ROBUST ALTERNATIVE TO CLASSICAL TEST Zahayu Md Yusof, Nurul Hanis Harun, Sharipah Sooad Syed Yahaya & Suhaida Abdullah School of Quantitative Sciences College of Arts and
More informationIntroduction to General and Generalized Linear Models
Introduction to General and Generalized Linear Models General Linear Models - part I Henrik Madsen Poul Thyregod Informatics and Mathematical Modelling Technical University of Denmark DK-2800 Kgs. Lyngby
More informationThe Variability of P-Values. Summary
The Variability of P-Values Dennis D. Boos Department of Statistics North Carolina State University Raleigh, NC 27695-8203 boos@stat.ncsu.edu August 15, 2009 NC State Statistics Departement Tech Report
More informationIntroduction to Analysis of Variance (ANOVA) Limitations of the t-test
Introduction to Analysis of Variance (ANOVA) The Structural Model, The Summary Table, and the One- Way ANOVA Limitations of the t-test Although the t-test is commonly used, it has limitations Can only
More informationLAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING
LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING In this lab you will explore the concept of a confidence interval and hypothesis testing through a simulation problem in engineering setting.
More informationINTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA)
INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA) As with other parametric statistics, we begin the one-way ANOVA with a test of the underlying assumptions. Our first assumption is the assumption of
More informationHow To Check For Differences In The One Way Anova
MINITAB ASSISTANT WHITE PAPER This paper explains the research conducted by Minitab statisticians to develop the methods and data checks used in the Assistant in Minitab 17 Statistical Software. One-Way
More information13: Additional ANOVA Topics. Post hoc Comparisons
13: Additional ANOVA Topics Post hoc Comparisons ANOVA Assumptions Assessing Group Variances When Distributional Assumptions are Severely Violated Kruskal-Wallis Test Post hoc Comparisons In the prior
More informationFairfield Public Schools
Mathematics Fairfield Public Schools AP Statistics AP Statistics BOE Approved 04/08/2014 1 AP STATISTICS Critical Areas of Focus AP Statistics is a rigorous course that offers advanced students an opportunity
More information4. Continuous Random Variables, the Pareto and Normal Distributions
4. Continuous Random Variables, the Pareto and Normal Distributions A continuous random variable X can take any value in a given range (e.g. height, weight, age). The distribution of a continuous random
More informationChapter 5 Analysis of variance SPSS Analysis of variance
Chapter 5 Analysis of variance SPSS Analysis of variance Data file used: gss.sav How to get there: Analyze Compare Means One-way ANOVA To test the null hypothesis that several population means are equal,
More informationNonparametric Statistics
Nonparametric Statistics References Some good references for the topics in this course are 1. Higgins, James (2004), Introduction to Nonparametric Statistics 2. Hollander and Wolfe, (1999), Nonparametric
More information2 Sample t-test (unequal sample sizes and unequal variances)
Variations of the t-test: Sample tail Sample t-test (unequal sample sizes and unequal variances) Like the last example, below we have ceramic sherd thickness measurements (in cm) of two samples representing
More informationSAS Certificate Applied Statistics and SAS Programming
SAS Certificate Applied Statistics and SAS Programming SAS Certificate Applied Statistics and Advanced SAS Programming Brigham Young University Department of Statistics offers an Applied Statistics and
More information. (3.3) n Note that supremum (3.2) must occur at one of the observed values x i or to the left of x i.
Chapter 3 Kolmogorov-Smirnov Tests There are many situations where experimenters need to know what is the distribution of the population of their interest. For example, if they want to use a parametric
More informationTwo-Sample T-Tests Assuming Equal Variance (Enter Means)
Chapter 4 Two-Sample T-Tests Assuming Equal Variance (Enter Means) Introduction This procedure provides sample size and power calculations for one- or two-sided two-sample t-tests when the variances of
More informationTwo-Sample T-Tests Allowing Unequal Variance (Enter Difference)
Chapter 45 Two-Sample T-Tests Allowing Unequal Variance (Enter Difference) Introduction This procedure provides sample size and power calculations for one- or two-sided two-sample t-tests when no assumption
More informationindividualdifferences
1 Simple ANalysis Of Variance (ANOVA) Oftentimes we have more than two groups that we want to compare. The purpose of ANOVA is to allow us to compare group means from several independent samples. In general,
More informationLesson 1: Comparison of Population Means Part c: Comparison of Two- Means
Lesson : Comparison of Population Means Part c: Comparison of Two- Means Welcome to lesson c. This third lesson of lesson will discuss hypothesis testing for two independent means. Steps in Hypothesis
More informationChapter 7. One-way ANOVA
Chapter 7 One-way ANOVA One-way ANOVA examines equality of population means for a quantitative outcome and a single categorical explanatory variable with any number of levels. The t-test of Chapter 6 looks
More information5.1 Identifying the Target Parameter
University of California, Davis Department of Statistics Summer Session II Statistics 13 August 20, 2012 Date of latest update: August 20 Lecture 5: Estimation with Confidence intervals 5.1 Identifying
More informationRandomized Block Analysis of Variance
Chapter 565 Randomized Block Analysis of Variance Introduction This module analyzes a randomized block analysis of variance with up to two treatment factors and their interaction. It provides tables of
More informationOne-Way Analysis of Variance
One-Way Analysis of Variance Note: Much of the math here is tedious but straightforward. We ll skim over it in class but you should be sure to ask questions if you don t understand it. I. Overview A. We
More informationRank-Based Non-Parametric Tests
Rank-Based Non-Parametric Tests Reminder: Student Instructional Rating Surveys You have until May 8 th to fill out the student instructional rating surveys at https://sakai.rutgers.edu/portal/site/sirs
More informationTwo Correlated Proportions (McNemar Test)
Chapter 50 Two Correlated Proportions (Mcemar Test) Introduction This procedure computes confidence intervals and hypothesis tests for the comparison of the marginal frequencies of two factors (each with
More informationOptimal Stopping in Software Testing
Optimal Stopping in Software Testing Nilgun Morali, 1 Refik Soyer 2 1 Department of Statistics, Dokuz Eylal Universitesi, Turkey 2 Department of Management Science, The George Washington University, 2115
More informationUNDERSTANDING THE TWO-WAY ANOVA
UNDERSTANDING THE e have seen how the one-way ANOVA can be used to compare two or more sample means in studies involving a single independent variable. This can be extended to two independent variables
More informationVariables Control Charts
MINITAB ASSISTANT WHITE PAPER This paper explains the research conducted by Minitab statisticians to develop the methods and data checks used in the Assistant in Minitab 17 Statistical Software. Variables
More informationA MULTIVARIATE TEST FOR SIMILARITY OF TWO DISSOLUTION PROFILES
Journal of Biopharmaceutical Statistics, 15: 265 278, 2005 Copyright Taylor & Francis, Inc. ISSN: 1054-3406 print/1520-5711 online DOI: 10.1081/BIP-200049832 A MULTIVARIATE TEST FOR SIMILARITY OF TWO DISSOLUTION
More information1 Solving LPs: The Simplex Algorithm of George Dantzig
Solving LPs: The Simplex Algorithm of George Dantzig. Simplex Pivoting: Dictionary Format We illustrate a general solution procedure, called the simplex algorithm, by implementing it on a very simple example.
More informationSection 7.1. Introduction to Hypothesis Testing. Schrodinger s cat quantum mechanics thought experiment (1935)
Section 7.1 Introduction to Hypothesis Testing Schrodinger s cat quantum mechanics thought experiment (1935) Statistical Hypotheses A statistical hypothesis is a claim about a population. Null hypothesis
More informationMATRIX ALGEBRA AND SYSTEMS OF EQUATIONS
MATRIX ALGEBRA AND SYSTEMS OF EQUATIONS Systems of Equations and Matrices Representation of a linear system The general system of m equations in n unknowns can be written a x + a 2 x 2 + + a n x n b a
More informationPrinciple of Data Reduction
Chapter 6 Principle of Data Reduction 6.1 Introduction An experimenter uses the information in a sample X 1,..., X n to make inferences about an unknown parameter θ. If the sample size n is large, then
More informationComparing Means in Two Populations
Comparing Means in Two Populations Overview The previous section discussed hypothesis testing when sampling from a single population (either a single mean or two means from the same population). Now we
More informationThe CUSUM algorithm a small review. Pierre Granjon
The CUSUM algorithm a small review Pierre Granjon June, 1 Contents 1 The CUSUM algorithm 1.1 Algorithm............................... 1.1.1 The problem......................... 1.1. The different steps......................
More informationClass 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1)
Spring 204 Class 9: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.) Big Picture: More than Two Samples In Chapter 7: We looked at quantitative variables and compared the
More informationMaximum Likelihood Estimation
Math 541: Statistical Theory II Lecturer: Songfeng Zheng Maximum Likelihood Estimation 1 Maximum Likelihood Estimation Maximum likelihood is a relatively simple method of constructing an estimator for
More informationMASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.436J/15.085J Fall 2008 Lecture 5 9/17/2008 RANDOM VARIABLES
MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.436J/15.085J Fall 2008 Lecture 5 9/17/2008 RANDOM VARIABLES Contents 1. Random variables and measurable functions 2. Cumulative distribution functions 3. Discrete
More information12: Analysis of Variance. Introduction
1: Analysis of Variance Introduction EDA Hypothesis Test Introduction In Chapter 8 and again in Chapter 11 we compared means from two independent groups. In this chapter we extend the procedure to consider
More informationChapter 2. Hypothesis testing in one population
Chapter 2. Hypothesis testing in one population Contents Introduction, the null and alternative hypotheses Hypothesis testing process Type I and Type II errors, power Test statistic, level of significance
More informationLife Table Analysis using Weighted Survey Data
Life Table Analysis using Weighted Survey Data James G. Booth and Thomas A. Hirschl June 2005 Abstract Formulas for constructing valid pointwise confidence bands for survival distributions, estimated using
More informationRecall this chart that showed how most of our course would be organized:
Chapter 4 One-Way ANOVA Recall this chart that showed how most of our course would be organized: Explanatory Variable(s) Response Variable Methods Categorical Categorical Contingency Tables Categorical
More informationStatistics courses often teach the two-sample t-test, linear regression, and analysis of variance
2 Making Connections: The Two-Sample t-test, Regression, and ANOVA In theory, there s no difference between theory and practice. In practice, there is. Yogi Berra 1 Statistics courses often teach the two-sample
More informationIntroduction to Fixed Effects Methods
Introduction to Fixed Effects Methods 1 1.1 The Promise of Fixed Effects for Nonexperimental Research... 1 1.2 The Paired-Comparisons t-test as a Fixed Effects Method... 2 1.3 Costs and Benefits of Fixed
More informationNon-Inferiority Tests for Two Means using Differences
Chapter 450 on-inferiority Tests for Two Means using Differences Introduction This procedure computes power and sample size for non-inferiority tests in two-sample designs in which the outcome is a continuous
More informationStat 411/511 THE RANDOMIZATION TEST. Charlotte Wickham. stat511.cwick.co.nz. Oct 16 2015
Stat 411/511 THE RANDOMIZATION TEST Oct 16 2015 Charlotte Wickham stat511.cwick.co.nz Today Review randomization model Conduct randomization test What about CIs? Using a t-distribution as an approximation
More informationNonparametric adaptive age replacement with a one-cycle criterion
Nonparametric adaptive age replacement with a one-cycle criterion P. Coolen-Schrijner, F.P.A. Coolen Department of Mathematical Sciences University of Durham, Durham, DH1 3LE, UK e-mail: Pauline.Schrijner@durham.ac.uk
More informationTests for Two Proportions
Chapter 200 Tests for Two Proportions Introduction This module computes power and sample size for hypothesis tests of the difference, ratio, or odds ratio of two independent proportions. The test statistics
More informationNon-Inferiority Tests for Two Proportions
Chapter 0 Non-Inferiority Tests for Two Proportions Introduction This module provides power analysis and sample size calculation for non-inferiority and superiority tests in twosample designs in which
More informationTests for One Proportion
Chapter 100 Tests for One Proportion Introduction The One-Sample Proportion Test is used to assess whether a population proportion (P1) is significantly different from a hypothesized value (P0). This is
More informationCONTENTS OF DAY 2. II. Why Random Sampling is Important 9 A myth, an urban legend, and the real reason NOTES FOR SUMMER STATISTICS INSTITUTE COURSE
1 2 CONTENTS OF DAY 2 I. More Precise Definition of Simple Random Sample 3 Connection with independent random variables 3 Problems with small populations 8 II. Why Random Sampling is Important 9 A myth,
More informationNonparametric Tests for Randomness
ECE 461 PROJECT REPORT, MAY 2003 1 Nonparametric Tests for Randomness Ying Wang ECE 461 PROJECT REPORT, MAY 2003 2 Abstract To decide whether a given sequence is truely random, or independent and identically
More informationLOGNORMAL MODEL FOR STOCK PRICES
LOGNORMAL MODEL FOR STOCK PRICES MICHAEL J. SHARPE MATHEMATICS DEPARTMENT, UCSD 1. INTRODUCTION What follows is a simple but important model that will be the basis for a later study of stock prices as
More informationUNDERSTANDING THE INDEPENDENT-SAMPLES t TEST
UNDERSTANDING The independent-samples t test evaluates the difference between the means of two independent or unrelated groups. That is, we evaluate whether the means for two independent groups are significantly
More informationSimple Linear Regression Inference
Simple Linear Regression Inference 1 Inference requirements The Normality assumption of the stochastic term e is needed for inference even if it is not a OLS requirement. Therefore we have: Interpretation
More informationChapter 4: Statistical Hypothesis Testing
Chapter 4: Statistical Hypothesis Testing Christophe Hurlin November 20, 2015 Christophe Hurlin () Advanced Econometrics - Master ESA November 20, 2015 1 / 225 Section 1 Introduction Christophe Hurlin
More informationExamining Differences (Comparing Groups) using SPSS Inferential statistics (Part I) Dwayne Devonish
Examining Differences (Comparing Groups) using SPSS Inferential statistics (Part I) Dwayne Devonish Statistics Statistics are quantitative methods of describing, analysing, and drawing inferences (conclusions)
More informationChapter 12 Nonparametric Tests. Chapter Table of Contents
Chapter 12 Nonparametric Tests Chapter Table of Contents OVERVIEW...171 Testing for Normality...... 171 Comparing Distributions....171 ONE-SAMPLE TESTS...172 TWO-SAMPLE TESTS...172 ComparingTwoIndependentSamples...172
More informationLecture Notes Module 1
Lecture Notes Module 1 Study Populations A study population is a clearly defined collection of people, animals, plants, or objects. In psychological research, a study population usually consists of a specific
More informationTests of Hypotheses Using Statistics
Tests of Hypotheses Using Statistics Adam Massey and Steven J. Miller Mathematics Department Brown University Providence, RI 0292 Abstract We present the various methods of hypothesis testing that one
More informationThe Wilcoxon Rank-Sum Test
1 The Wilcoxon Rank-Sum Test The Wilcoxon rank-sum test is a nonparametric alternative to the twosample t-test which is based solely on the order in which the observations from the two samples fall. We
More informationSAS/STAT. 9.2 User s Guide. Introduction to. Nonparametric Analysis. (Book Excerpt) SAS Documentation
SAS/STAT Introduction to 9.2 User s Guide Nonparametric Analysis (Book Excerpt) SAS Documentation This document is an individual chapter from SAS/STAT 9.2 User s Guide. The correct bibliographic citation
More informationNon-Inferiority Tests for One Mean
Chapter 45 Non-Inferiority ests for One Mean Introduction his module computes power and sample size for non-inferiority tests in one-sample designs in which the outcome is distributed as a normal random
More informationE3: PROBABILITY AND STATISTICS lecture notes
E3: PROBABILITY AND STATISTICS lecture notes 2 Contents 1 PROBABILITY THEORY 7 1.1 Experiments and random events............................ 7 1.2 Certain event. Impossible event............................
More informationIndependent t- Test (Comparing Two Means)
Independent t- Test (Comparing Two Means) The objectives of this lesson are to learn: the definition/purpose of independent t-test when to use the independent t-test the use of SPSS to complete an independent
More informationStudy Guide for the Final Exam
Study Guide for the Final Exam When studying, remember that the computational portion of the exam will only involve new material (covered after the second midterm), that material from Exam 1 will make
More informationPrinciples of Hypothesis Testing for Public Health
Principles of Hypothesis Testing for Public Health Laura Lee Johnson, Ph.D. Statistician National Center for Complementary and Alternative Medicine johnslau@mail.nih.gov Fall 2011 Answers to Questions
More informationLecture 19: Chapter 8, Section 1 Sampling Distributions: Proportions
Lecture 19: Chapter 8, Section 1 Sampling Distributions: Proportions Typical Inference Problem Definition of Sampling Distribution 3 Approaches to Understanding Sampling Dist. Applying 68-95-99.7 Rule
More informationAnalysis of Variance ANOVA
Analysis of Variance ANOVA Overview We ve used the t -test to compare the means from two independent groups. Now we ve come to the final topic of the course: how to compare means from more than two populations.
More informationUNDERSTANDING THE DEPENDENT-SAMPLES t TEST
UNDERSTANDING THE DEPENDENT-SAMPLES t TEST A dependent-samples t test (a.k.a. matched or paired-samples, matched-pairs, samples, or subjects, simple repeated-measures or within-groups, or correlated groups)
More informationAnalysis of Variance (ANOVA) Using Minitab
Analysis of Variance (ANOVA) Using Minitab By Keith M. Bower, M.S., Technical Training Specialist, Minitab Inc. Frequently, scientists are concerned with detecting differences in means (averages) between
More informationFriedman's Two-way Analysis of Variance by Ranks -- Analysis of k-within-group Data with a Quantitative Response Variable
Friedman's Two-way Analysis of Variance by Ranks -- Analysis of k-within-group Data with a Quantitative Response Variable Application: This statistic has two applications that can appear very different,
More informationMATH10212 Linear Algebra. Systems of Linear Equations. Definition. An n-dimensional vector is a row or a column of n numbers (or letters): a 1.
MATH10212 Linear Algebra Textbook: D. Poole, Linear Algebra: A Modern Introduction. Thompson, 2006. ISBN 0-534-40596-7. Systems of Linear Equations Definition. An n-dimensional vector is a row or a column
More informationT test as a parametric statistic
KJA Statistical Round pissn 2005-619 eissn 2005-7563 T test as a parametric statistic Korean Journal of Anesthesiology Department of Anesthesia and Pain Medicine, Pusan National University School of Medicine,
More informationSpatial Statistics Chapter 3 Basics of areal data and areal data modeling
Spatial Statistics Chapter 3 Basics of areal data and areal data modeling Recall areal data also known as lattice data are data Y (s), s D where D is a discrete index set. This usually corresponds to data
More informationA Primer on Mathematical Statistics and Univariate Distributions; The Normal Distribution; The GLM with the Normal Distribution
A Primer on Mathematical Statistics and Univariate Distributions; The Normal Distribution; The GLM with the Normal Distribution PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 4: September
More informationStationary random graphs on Z with prescribed iid degrees and finite mean connections
Stationary random graphs on Z with prescribed iid degrees and finite mean connections Maria Deijfen Johan Jonasson February 2006 Abstract Let F be a probability distribution with support on the non-negative
More informationChapter 3 RANDOM VARIATE GENERATION
Chapter 3 RANDOM VARIATE GENERATION In order to do a Monte Carlo simulation either by hand or by computer, techniques must be developed for generating values of random variables having known distributions.
More information160 CHAPTER 4. VECTOR SPACES
160 CHAPTER 4. VECTOR SPACES 4. Rank and Nullity In this section, we look at relationships between the row space, column space, null space of a matrix and its transpose. We will derive fundamental results
More informationSTATISTICA Formula Guide: Logistic Regression. Table of Contents
: Table of Contents... 1 Overview of Model... 1 Dispersion... 2 Parameterization... 3 Sigma-Restricted Model... 3 Overparameterized Model... 4 Reference Coding... 4 Model Summary (Summary Tab)... 5 Summary
More informationThe Relative Worst Order Ratio for On-Line Algorithms
The Relative Worst Order Ratio for On-Line Algorithms Joan Boyar 1 and Lene M. Favrholdt 2 1 Department of Mathematics and Computer Science, University of Southern Denmark, Odense, Denmark, joan@imada.sdu.dk
More informationChapter 1 Introduction. 1.1 Introduction
Chapter 1 Introduction 1.1 Introduction 1 1.2 What Is a Monte Carlo Study? 2 1.2.1 Simulating the Rolling of Two Dice 2 1.3 Why Is Monte Carlo Simulation Often Necessary? 4 1.4 What Are Some Typical Situations
More informationMultivariate Normal Distribution
Multivariate Normal Distribution Lecture 4 July 21, 2011 Advanced Multivariate Statistical Methods ICPSR Summer Session #2 Lecture #4-7/21/2011 Slide 1 of 41 Last Time Matrices and vectors Eigenvalues
More informationLeast Squares Estimation
Least Squares Estimation SARA A VAN DE GEER Volume 2, pp 1041 1045 in Encyclopedia of Statistics in Behavioral Science ISBN-13: 978-0-470-86080-9 ISBN-10: 0-470-86080-4 Editors Brian S Everitt & David
More information1 Nonparametric Statistics
1 Nonparametric Statistics When finding confidence intervals or conducting tests so far, we always described the population with a model, which includes a set of parameters. Then we could make decisions
More informationQuantitative Methods for Finance
Quantitative Methods for Finance Module 1: The Time Value of Money 1 Learning how to interpret interest rates as required rates of return, discount rates, or opportunity costs. 2 Learning how to explain
More informationChapter 7 Section 7.1: Inference for the Mean of a Population
Chapter 7 Section 7.1: Inference for the Mean of a Population Now let s look at a similar situation Take an SRS of size n Normal Population : N(, ). Both and are unknown parameters. Unlike what we used
More informationThe t-test and Basic Inference Principles
Chapter 6 The t-test and Basic Inference Principles The t-test is used as an example of the basic principles of statistical inference. One of the simplest situations for which we might design an experiment
More informationCHAPTER 2 Estimating Probabilities
CHAPTER 2 Estimating Probabilities Machine Learning Copyright c 2016. Tom M. Mitchell. All rights reserved. *DRAFT OF January 24, 2016* *PLEASE DO NOT DISTRIBUTE WITHOUT AUTHOR S PERMISSION* This is a
More informationSensitivity Analysis 3.1 AN EXAMPLE FOR ANALYSIS
Sensitivity Analysis 3 We have already been introduced to sensitivity analysis in Chapter via the geometry of a simple example. We saw that the values of the decision variables and those of the slack and
More informationStatistical Models in R
Statistical Models in R Some Examples Steven Buechler Department of Mathematics 276B Hurley Hall; 1-6233 Fall, 2007 Outline Statistical Models Structure of models in R Model Assessment (Part IA) Anova
More informationQUANTITATIVE METHODS BIOLOGY FINAL HONOUR SCHOOL NON-PARAMETRIC TESTS
QUANTITATIVE METHODS BIOLOGY FINAL HONOUR SCHOOL NON-PARAMETRIC TESTS This booklet contains lecture notes for the nonparametric work in the QM course. This booklet may be online at http://users.ox.ac.uk/~grafen/qmnotes/index.html.
More informationMath 55: Discrete Mathematics
Math 55: Discrete Mathematics UC Berkeley, Fall 2011 Homework # 5, due Wednesday, February 22 5.1.4 Let P (n) be the statement that 1 3 + 2 3 + + n 3 = (n(n + 1)/2) 2 for the positive integer n. a) What
More information