Equivalence Tests for Two Proportions

Size: px
Start display at page:

Download "Equivalence Tests for Two Proportions"

Transcription

1 Chapter 5 Equivalence Tests for Two Proportions Introduction This module provides power analysis and sample size calculation for equivalence tests in two-sample designs in which the outcome is binary. Users may choose from among eight popular test statistics commonly used for running the hypothesis test. The power calculations assume that independent, random samples are drawn from two populations. Four Procedures Documented Here There are four procedures in the menus that use the program module described in this chapter. These procedures are identical except for the type of parameterization. The parameterization can be in terms of proportions, differences in proportions, ratios of proportions, and odds ratios. Each of these options is listed separately on the menus. Example An equivalence test example will set the stage for the discussion of the terminology that follows. Suppose that the response rate of the standard treatment of a disease is Unfortunately, this treatment is expensive and occasionally exhibits serious side-effects. A promising new treatment has been developed to the point where it can be tested. One of the first questions that must be answered is whether the new treatment is therapeutically equivalent to the standard treatment. Because of the many benefits of the new treatment, clinicians are willing to adopt the new treatment even if its effectiveness is slightly different from the standard. After thoughtful discussion with several clinicians, it is decided that if the response rate of the new treatment is between 0.63 and 0.77, the new treatment would be adopted. The margin of equivalence is The developers must design an experiment to test the hypothesis that the response rate of the new treatment does not differ from that of the standard treatment by more than The statistical hypothesis to be tested is H0: p p versus H: p p < Technical Details The details of sample size calculation for the two-sample design for binary outcomes are presented in the chapter entitled Two Proportion Non-Null Case, and they will not be duplicated here. Instead, this chapter only discusses those changes necessary for equivalence tests. 5-

2 Approximate sample size formulas for equivalence tests of two proportions are presented in Chow et al. (003), page 88. Only large sample (normal approximation) results are given there. The results available in this module use exact calculations based on the enumeration of all possible values in the binomial distribution. Suppose you have two populations from which dichotomous (binary) responses will be recorded. Assume without loss of generality that higher proportions are better. The probability (or risk) of cure in group (the treatment group) is p and in group (the reference group) is p. Random samples of n and n individuals are obtained from these two groups. The data from these samples can be displayed in a -by- contingency table as follows Group Success Failure Total Treatment a c m Control b d n Totals s f N The following alternative notation is also used. Group Success Failure Total Treatment Control Totals x x m x x m n n N The binomial proportions p and p are estimated from these data using the formulae a x b x p = = and p = = m n n n Let p. 0 represent the group proportion tested by the null hypothesis H 0. The power of a test is computed at a specific value of the proportion which we will call p.. Let δ represent the smallest difference (margin of equivalence) between the two proportions that still results in the conclusion that the new treatment is equivalent to the current treatment. The set of statistical hypotheses that are tested is H 0 : p. 0 p These hypotheses can be rearranged to give H δ versus H : p. 0 p < δ p p δ or p p δ versus H : δl p. 0 p δu 0:. 0 L. 0 This composite hypothesis can be reduced to two one-sided hypotheses as follows H 0L p 0 p L U :. δ versus H L : δ L p. 0 p H 0U : p. 0 p δ U versus H U : δ U p. 0 p There are three common methods of specifying the margin of equivalence. The most direct is to simply give values for p and p. 0. However, it is often more meaningful to give p and then specify p. 0 implicitly by reporting the difference, ratio, or odds ratio. Mathematically, the definitions of these parameterizations are Parameter Computation Alternative Hypotheses Difference δ = p p δ p p δ. 0 H : L. 0 Ratio φ = p. 0 / p H : φl p. 0 / p Odds Ratio ψ = Odds. 0 / Odds H : ψ L o. 0 / o ψ U φ U U 5-

3 Difference The difference is perhaps the most direct method of comparison between two proportions. It is easy to interpret and communicate. It gives the absolute impact of the treatment. However, there are subtle difficulties that can arise with its interpretation. One difficulty arises when the event of interest is rare. If a difference of 0.00 occurs when the baseline probability is 0.40, it would be dismissed as being trivial. However, if the baseline probability of a disease is 0.00, a 0.00 decrease would represent a reduction of 50%. Thus interpretation of the difference depends on the baseline probability of the event. Note that δ L < 0 and δ U > 0. Usually, δ = δ. L U Equivalence using a Difference The following example might help you understand the concept of an equivalence test. Suppose 60% of patients respond to the current treatment method( p = 0. 60). If the response rate of the new treatment is no less than five percentage points better or worse than the existing treatment, it will be considered to be equivalent. Substituting these figures into the statistical hypotheses gives H Using the relationship gives : p. p or p. p versus H : p 0 p H p. 0 = p + δ : p or p versus H : p In this example, when the null hypothesis is rejected, the concluded alternative is that the response rate is between 0.55 and Ratio The ratio,φ = p. 0 / p, gives the relative change in the probability of the response. Testing equivalence uses the formulation H : p. / p φl or p. / p φ U versus H : φl p. 0 / p φu The only subtlety is that for equivalence tests φ L < andφ U >. Usually, φ = / φ. Equivalence using a Ratio The following example might help you understand the concept of equivalence as defined by the ratio. Suppose that 60% of patients ( p = 0. 60) respond to the current treatment method. If the response rate of a new treatment is within 0% of 0.60, it will be considered to be equivalent to the standard treatment. Substituting these figures into the statistical hypotheses gives Using the relationship gives H : p. / p 0. 9 or p. / p. versus H : 0. 9 p. 0 / p H p. = φ p 0 0 : p or p versus H : p L. U 5-3

4 Odds Ratio The odds ratio, ( ) ( p. 0 p. 0 ) ( p ( p) ) ψ = / / /, gives the relative change in the odds (o) of the response. Testing equivalence use the formulation H : o. / o ψ L or o. / o ψ U versus H : ψ L o. 0 / o ψ U The only subtlety is that for equivalence tests ψ L < andψ U >. Usually, ψ = / ψ. Power Calculation The power for a test statistic that is based on the normal approximation can be computed exactly using two binomial distributions. The following steps are taken to compute the power of these tests.. Find the critical values using the standard normal distribution. The critical values z L and z U are chosen as that value of z that leaves exactly the target value of alpha in the appropriate tail of the normal distribution.. Compute the value of the test statistic z t for every combination of x and x. Note that x ranges from 0 to n, and x ranges from 0 to n. A small value (around 0.000) can be added to the zero-cell counts to avoid numerical problems that occur when the cell value is zero. 3. If zt > zl and zt < zu, the combination is in the rejection region. Call all combinations of x and x that lead to a rejection the set A. 4. Compute the power for given values of p. and p as β = A n x p q n p q x n x x n x.. x 5. Compute the actual value of alpha achieved by the design by substituting p. 0L and p. 0U for p. to obtain and α L α U = = n A x n A x p p n x n x x n x. 0 L q. 0L x n x n x x n x. 0 U q. 0U x The value of alpha is then computed as the maximum ofα L andα U. L U Asymptotic Approximations When the values of n and n are large (say over 00), these formulas take a long time to evaluate. In this case, a large sample approximation can be used. The large sample approximation is made by replacing the values of p and p in the z statistic with the corresponding values of p. and p and then computing the results based on the normal distribution. Note that in large samples, the Farrington and Manning statistic is substituted for the Gart and Nam statistic. 5-4

5 Test Statistics Several test statistics have been proposed for testing whether the difference, ratio, or odds ratio are different from a specified value. The main difference among the several test statistics is in the formula used to compute the standard error used in the denominator. These tests are based on the following z-test p p δ0 c zt = σ The constant, c, represents a continuity correction that is applied in some cases. When the continuity correction is not used, c is zero. In power calculations, the values of p and p are not known. The corresponding values of p. and p can be reasonable substitutes. Following is a list of the test statistics available in PASS. The availability of several test statistics begs the question of which test statistic one should use. The answer is simple: one should use the test statistic that will be used to analyze the data. You may choose a method because it is a standard in your industry, because it seems to have better statistical properties, or because your statistical package calculates it. Whatever your reasons for selecting a certain test statistic, you should use the same test statistic when doing the analysis after the data have been collected. Z Test (Pooled) This test was first proposed by Karl Pearson in 900. Although this test is usually expressed directly as a chisquare statistic, it is expressed here as a z statistic so that it can be more easily used for one-sided hypothesis testing. The proportions are pooled (averaged) in computing the standard error. The formula for the test statistic is where p( p) n n σ = + z t = p p δ σ 0 p = n p n + n p + n Z Test (Unpooled) This test statistic does not pool the two proportions in computing the standard error. where = ( p ) p ( p ) p + n n σ z t = p p δ σ 0 5-5

6 Z Test with Continuity Correction (Pooled) This test is the same as Z Test (Pooled), except that a continuity correction is used. Remember that in the null case, the continuity correction makes the results closer to those of Fisher s Exact test. z t = p F p δ + + n n σ 0 p( p) n n σ = + n p p = n + n p + n where F is - for lower-tailed hypotheses and for upper-tailed hypotheses. Z Test with Continuity Correction (Unpooled) This test is the same as the Z Test (Unpooled), except that a continuity correction is used. Remember that in the null case, the continuity correction makes the results closer to those of Fisher s Exact test. z t = p F p δ + n n σ 0 ( p ) p ( p ) p + n n σ = where F is - for lower-tailed hypotheses and for upper-tailed hypotheses. T-Test of Difference Because of a detailed, comparative study of the behavior of several tests, D Agostino (988) and Upton (98) proposed using the usual two-sample t-test for testing whether the two proportions are equal. One substitutes a for a success and a 0 for a failure in the usual, two-sample t-test formula. Miettinen and Nurminen s Likelihood Score Test of the Difference Miettinen and Nurminen (985) proposed a test statistic for testing whether the difference is equal to a specified, non-zero, value, δ 0. The regular MLE s, p and p, are used in the numerator of the score statistic while MLE s p and p, constrained so that p p = δ0, are used in the denominator. A correction factor of N/(N-) is applied to make the variance estimate less biased. The significance level of the test statistic is based on the asymptotic normality of the score statistic. The formula for computing this test statistic is z MND p = p σ MND δ 0 5-6

7 where σ MND N = + n n N p = p + δ 0 L p = Bcos( A) 3L A = π + cos 3 3 C 3 B L L B = sign( C) 9L 3L 3 L L L L0 C = + 3 7L 6L L L Miettinen and Nurminen s Likelihood Score Test of the Ratio Miettinen and Nurminen (985) proposed a test statistic for testing whether the ratio is equal to a specified value, φ 0. The regular MLE s, p and p, are used in the numerator of the score statistic while MLE s p and p, constrained so that p / p = φ0, are used in the denominator. A correction factor of N/(N-) is applied to make the variance estimate less biased. The significance level of the test statistic is based on the asymptotic normality of the score statistic. The formula for computing the test statistic is where p = p φ ( δ ) = x δ [ δ ] L = N N x δ + M 0 0 ( ) L = N + N δ N M L 0 3 = N 0 B B 4AC p = A A = Nφ 0 z MNR = p / p φ0 N + φ0 n n N 5-7

8 [ φ0 φ0 ] B = N + x + N + x C = M Miettinen and Nurminen s Likelihood Score Test of the Odds Ratio Miettinen and Nurminen (985) proposed a test statistic for testing whether the odds ratio is equal to a specified value, ψ 0. Because the approach they used with the difference and ratio does not easily extend to the odds ratio, they used a score statistic approach for the odds ratio. The regular MLE s are p and p. The constrained MLE s are p and p. These estimates are constrained so that ψ = ψ 0. A correction factor of N/(N-) is applied to make the variance estimate less biased. The significance level of the test statistic is based on the asymptotic normality of the score statistic. The formula for computing the test statistic is ( p p ) ( p p) z MNO = N + N N N where p pψ 0 = + p ( ψ ) 0 B B 4AC p = + A A = N( ψ 0 ) B = N ψ + N M ( ψ ) C = M 0 0 Farrington and Manning s Likelihood Score Test of the Difference Farrington and Manning (990) proposed a test statistic for testing whether the difference is equal to a specified value, δ 0. The regular MLE s, p and p, are used in the numerator of the score statistic while MLE s p and p, constrained so that p p = δ0, are used in the denominator. The significance level of the test statistic is based on the asymptotic normality of the score statistic. The formula for computing the test statistic is p p δ0 zfmd = + n n where the estimates p and p are computed as in the corresponding test of Miettinen and Nurminen (985) given above. 5-8

9 Farrington and Manning s Likelihood Score Test of the Ratio Farrington and Manning (990) proposed a test statistic for testing whether the ratio is equal to a specified value, φ 0. The regular MLE s, p and p, are used in the numerator of the score statistic while MLE s p and p, constrained so that p / p = φ0, are used in the denominator. A correction factor of N/(N-) is applied to increase the variance estimate. The significance level of the test statistic is based on the asymptotic normality of the score statistic. The formula for computing the test statistic is z FMR = p / p φ0 + φ0 n n where the estimates p and p are computed as in the corresponding test of Miettinen and Nurminen (985) given above. Farrington and Manning s Likelihood Score Test of the Odds Ratio Farrington and Manning (990) indicate that the Miettinen and Nurminen statistic may be modified by removing the factor N/(N-). The formula for computing this test statistic is z FMO = ( p p ) ( p p ) + N N where the estimates p and p are computed as in the corresponding test of Miettinen and Nurminen (985) given above. Gart and Nam s Likelihood Score Test of the Difference Gart and Nam (990), page 638, proposed a modification to the Farrington and Manning (988) difference test that corrects for skewness. Let z FMD δ stand for the Farrington and Manning difference test statistic described ( ) above. The skewness-corrected test statistic, z GND where 3/ V pq q p γ = 6 n n ( δ ) ( ) ( q p ), is the appropriate solution to the quadratic equation ( ) γ + + δ + γ = 0 ( ) zgnd ( ) zgnd zfmd( ) Gart and Nam s Likelihood Score Test of the Ratio Gart and Nam (988), page 39, proposed a modification to the Farrington and Manning (988) ratio test that corrects for skewness. Let z FMR φ stand for the Farrington and Manning ratio test statistic described above. The skewness-corrected test statistic, ( ) z GNR, is the appropriate solution to the quadratic equation ( ) ϕ + + φ + ϕ = 0 ( ) zgnr ( ) zgnr zfmr( ) 5-9

10 where ϕ = u / 6 3 q q n p q q u = n + p n p n p ( p ) q ( q p ) Procedure Options This section describes the options that are specific to this procedure. These are located on the Design tab. For more information about the options of other tabs, go to the Procedure Window chapter. Design Tab (Common Options) The Design tab contains the parameters associated with this test such as the proportions, sample sizes, alpha, and power. This chapter covers four procedures, each of which has different options. This section documents options that are common to all four procedures. Later, unique options for each procedure will be documented. Solve For Solve For This option specifies the parameter to be solved for using the other parameters. The parameters that may be selected are P. (or D, R, or OR), Alpha, Power, Sample Size (N), and Sample Size (N). Under most situations, you will select either Power or Sample Size (N). Select Sample Size (N)when you want to calculate the sample size needed to achieve a given power and alpha level. Select Power when you want to calculate the power of an experiment. Test Test Type Specify which test statistic is used in searching and reporting. Although the pooled z-test is commonly shown in elementary statistics books, the likelihood score test is arguably the best choice. Note that C.C. is an abbreviation for Continuity Correction. This refers to the adding or subtracting /(n) to (or from) the numerator of the z-value to bring the normal approximation closer to the binomial distribution. Power and Alpha Power This option specifies one or more values for power. Power is the probability of rejecting a false null hypothesis, and is equal to one minus Beta. Beta is the probability of a type-ii error, which occurs when a false null hypothesis is not rejected. In this procedure, a type-ii error occurs when you fail to reject the null hypothesis of unequal proportions when in fact they are equivalent. Values must be between zero and one. Historically, the value of 0.80 (Beta = 0.0) was used for power. Now, 0.90 (Beta = 0.0) is also commonly used. A single value may be entered here or a range of values such as 0.8 to 0.95 by 0.05 may be entered. 5-0

11 Alpha This option specifies one or more values for the probability of a type-i error. A type-i error occurs when a true null hypothesis is rejected. In this procedure, a type-i error occurs when you reject the null hypothesis of unequal proportions when in fact they are not equal. Values must be between zero and one. Historically, the value of 0.05 has been used for alpha. This means that about one test in twenty will falsely reject the null hypothesis. You should pick a value for alpha that represents the risk of a type-i error you are willing to take in your experimental situation. You may enter a range of values such as or 0.0 to 0.0 by 0.0. Sample Size (When Solving for Sample Size) Group Allocation Select the option that describes the constraints on N or N or both. The options are Equal (N = N) This selection is used when you wish to have equal sample sizes in each group. Since you are solving for both sample sizes at once, no additional sample size parameters need to be entered. Enter N, solve for N Select this option when you wish to fix N at some value (or values), and then solve only for N. Please note that for some values of N, there may not be a value of N that is large enough to obtain the desired power. Enter N, solve for N Select this option when you wish to fix N at some value (or values), and then solve only for N. Please note that for some values of N, there may not be a value of N that is large enough to obtain the desired power. Enter R = N/N, solve for N and N For this choice, you set a value for the ratio of N to N, and then PASS determines the needed N and N, with this ratio, to obtain the desired power. An equivalent representation of the ratio, R, is N = R * N. Enter percentage in Group, solve for N and N For this choice, you set a value for the percentage of the total sample size that is in Group, and then PASS determines the needed N and N with this percentage to obtain the desired power. N (Sample Size, Group ) This option is displayed if Group Allocation = Enter N, solve for N N is the number of items or individuals sampled from the Group population. N must be. You can enter a single value or a series of values. N (Sample Size, Group ) This option is displayed if Group Allocation = Enter N, solve for N N is the number of items or individuals sampled from the Group population. N must be. You can enter a single value or a series of values. 5-

12 R (Group Sample Size Ratio) This option is displayed only if Group Allocation = Enter R = N/N, solve for N and N. R is the ratio of N to N. That is, R = N / N. Use this value to fix the ratio of N to N while solving for N and N. Only sample size combinations with this ratio are considered. N is related to N by the formula: where the value [Y] is the next integer Y. N = [R N], For example, setting R =.0 results in a Group sample size that is double the sample size in Group (e.g., N = 0 and N = 0, or N = 50 and N = 00). R must be greater than 0. If R <, then N will be less than N; if R >, then N will be greater than N. You can enter a single or a series of values. Percent in Group This option is displayed only if Group Allocation = Enter percentage in Group, solve for N and N. Use this value to fix the percentage of the total sample size allocated to Group while solving for N and N. Only sample size combinations with this Group percentage are considered. Small variations from the specified percentage may occur due to the discrete nature of sample sizes. The Percent in Group must be greater than 0 and less than 00. You can enter a single or a series of values. Sample Size (When Not Solving for Sample Size) Group Allocation Select the option that describes how individuals in the study will be allocated to Group and to Group. The options are Equal (N = N) This selection is used when you wish to have equal sample sizes in each group. A single per group sample size will be entered. Enter N and N individually This choice permits you to enter different values for N and N. Enter N and R, where N = R * N Choose this option to specify a value (or values) for N, and obtain N as a ratio (multiple) of N. Enter total sample size and percentage in Group Choose this option to specify a value (or values) for the total sample size (N), obtain N as a percentage of N, and then N as N - N. 5-

13 Sample Size Per Group This option is displayed only if Group Allocation = Equal (N = N). The Sample Size Per Group is the number of items or individuals sampled from each of the Group and Group populations. Since the sample sizes are the same in each group, this value is the value for N, and also the value for N. The Sample Size Per Group must be. You can enter a single value or a series of values. N (Sample Size, Group ) This option is displayed if Group Allocation = Enter N and N individually or Enter N and R, where N = R * N. N is the number of items or individuals sampled from the Group population. N must be. You can enter a single value or a series of values. N (Sample Size, Group ) This option is displayed only if Group Allocation = Enter N and N individually. N is the number of items or individuals sampled from the Group population. N must be. You can enter a single value or a series of values. R (Group Sample Size Ratio) This option is displayed only if Group Allocation = Enter N and R, where N = R * N. R is the ratio of N to N. That is, R = N/N Use this value to obtain N as a multiple (or proportion) of N. N is calculated from N using the formula: where the value [Y] is the next integer Y. N=[R x N], For example, setting R =.0 results in a Group sample size that is double the sample size in Group. R must be greater than 0. If R <, then N will be less than N; if R >, then N will be greater than N. You can enter a single value or a series of values. Total Sample Size (N) This option is displayed only if Group Allocation = Enter total sample size and percentage in Group. This is the total sample size, or the sum of the two group sample sizes. This value, along with the percentage of the total sample size in Group, implicitly defines N and N. The total sample size must be greater than one, but practically, must be greater than 3, since each group sample size needs to be at least. You can enter a single value or a series of values. Percent in Group This option is displayed only if Group Allocation = Enter total sample size and percentage in Group. This value fixes the percentage of the total sample size allocated to Group. Small variations from the specified percentage may occur due to the discrete nature of sample sizes. The Percent in Group must be greater than 0 and less than 00. You can enter a single value or a series of values. 5-3

14 Effect Size Reference (Group ) P (Reference Group Proportion) Specify the value of P, the reference, baseline, or control group s proportion. The null hypothesis is that the two proportions differ by no more than a specified amount. Since P is a proportion, these values must be between 0 and. You may enter a range of values such as or 0. to 0.9 by 0.. Design Tab (Proportion) This section documents options that are used when the parameterization is in terms of the values of the two proportions, P and P. P.0 is the value of the P assumed by the null hypothesis and P. is the value of P at which the power is calculated. Effect Size Equivalence Proportions P.0U & P.0L (Upper & Lower Equivalence Proportion) Specify the margin of equivalence directly by giving the upper and lower bounds of P.0. The two groups are assumed to be equivalent when P.0 is between these values. Thus, P.0U should be greater than P and P.0L should be less than P. Note that the values of P.0U and P.0L are used in pairs. Thus, the first values of P.0U and P.0L are used together, then the second values of each are used, and so on. You may enter a range of values such as or 0.0 to 0.05 by 0.0. Proportions must be between 0 and. They cannot take on the values 0 or. These values should surround P. Effect Size Actual Proportion P. (Actual Proportion) This option specifies the value of P., which is the value of the treatment proportion at which the power is to be calculated. Proportions must be between 0 and. They cannot take on the values 0 or. You may enter a range of values such as or 0.0 to 0.05 by 0.0. Design Tab (Difference) This section documents options that are used when the parameterization is in terms of the difference, P P. P.0 is the value of P assumed by the null hypothesis and P. is the value of P at which the power is calculated. Once P, D0, and D are given, the values of P. and P.0 can be calculated. Effect Size Equivalence Differences D0.U & D0.L (Upper & Lower Equivalence Difference) Specify the margin of equivalence by specifying the largest distance above (D0.U) and below (D0.L) P which will still result in the conclusion of equivalence. As long as the actual difference is between these two values, the difference is not considered to be large enough to be of practical importance. The values of D0.U must be positive and the values of D0.L must be negative. D0.L can be set to -D0.U, which is usually what is desired. 5-4

15 The power calculations assume that P.0 is the value of P under the null hypothesis. This value is used with P to calculate the value of P.0 using the formula: P.0U = D0.U + P. You may enter a range of values for D0.U such as or.05 to.0 by.05. Note that if you enter values for D0.L (other than '-D0.U'), they are used in pairs with the values of D0.U. Thus, the first values of D0.U and D0.L are used together, then the second values of each are used, and so on. RANGE: D0.L must be between - and 0. D0.U must be between 0 and. Neither can take on the values -, 0, or. Effect Size Actual Difference D (Actual Difference) This option specifies the actual difference between P. (the actual value of P) and P. This is the value of the difference at which the power is calculated. In equivalence trials, this difference is often set to 0. The power calculations assume that P. is the actual value of the proportion in group (experimental or treatment group). This difference is used with P to calculate the true value of P using the formula: P. = D + P. You may enter a range of values such as or -.05 to.05 by.0. Actual differences must be between - and. They cannot take on the values - or. Design Tab (Ratio) This section documents options that are used when the parameterization is in terms of the ratio, P / P. P.0 is the value of P assumed by the null hypothesis and P. is the value of P at which the power is calculated. Once P, R0, and R are given, the values of P.0 and P. can be calculated. Effect Size Equivalence Ratios R0.U & R0.L (Upper & Lower Equivalence Ratio) Specify the margin of equivalence by specifying the largest ratio (P/P) above (R0.U) and below (R0.L) which will still result in the conclusion of equivalence. As long as the actual ratio is between these two values, the difference between the proportions is not considered to be large enough to be of practical importance. The values of R0.U must be greater than and the values of R0.L must be less than. R0.L can be set to /R0.U, which is often desired. The power calculations assume that P.0 is the value of P under the null hypothesis. This value is used with P to calculate the value of P.0 using the formula: P.0U = R0.U x P. You may enter a range of values for R0.U such as..5.8 or. to. by 0.. Note that if you enter values for R0.L (other than '/R0.U'), they are used in pairs with the values of R0.U. Thus, the first values of R0.U and R0.L are used together, then the second values of each are used, and so on. R0.L must be between 0 and. R0.U must be greater than. Neither can take on the value. 5-5

16 Effect Size Actual Ratio R (Actual Ratio) This option specifies the ratio of P. and P, where P. is the actual proportion in the treatment group. The power calculations assume that P. is the actual value of the proportion in group. This difference is used with P to calculate the value of P using the formula: P. = R x P. In equivalence trials, this ratio is often set to. Ratios must be positive. You may enter a range of values such as or 0.9 to.9 by 0.0. Design Tab (Odds Ratio) This section documents options that are used when the parameterization is in terms of the odds ratios, O. / O and O.0 / O. Note that the odds are defined as O = P / ( P), O.0 = P.0 / ( P.0), etc. P.0 is the value of P assumed by the null hypothesis and P. is the value of P at which the power is calculated. Once P, OR0, and OR are given, the values of P. and P.0 can be calculated. Effect Size Equivalence Odds Ratios OR0.U & OR0.L (Upper & Lower Equivalence Odds Ratio) Specify the margin of equivalence by specifying the largest odds ratio above (OR0.U) and below (OR0.L) which will still result in the conclusion of equivalence. As long as the actual odds ratio is between these two values, the difference between the proportions is not large enough to be of practical importance. The values of OR0.U must be greater than and the values of OR0.L must be less than. OR0.L can be set to /OR0.U, which is often desired. The power calculations assume that P.0 is the value of the P under the null hypothesis. This value is used with P to calculate the value of P.0. You may enter a range of values for OR0.U such as..5.8 or. to. by 0.. Note that if you enter values for OR0.L (other than '/OR0.U'), they are used in pairs with the values of OR0.U. Thus, the first values of OR0.U and OR0.L are used together, next the second values of each are used, and so on. OR0.L must be between 0 and. OR0.U must be greater than. Neither can take on the value. Effect Size Actual Odds Ratio OR (Actual Odds Ratio) This option specifies the odds ratio of P. and P, where P. is the actual proportion in the treatment group. The power calculations assume that P. is the actual value of the proportion in group. This value is used with P to calculate the value of P. In equivalence trials, this odds ratio is often set to. Odds ratios must be positive. You may enter a range of values such as or 0.9 to.9 by

17 Options Tab The Options tab contains various limits and options. Zero Counts Zero Count Adjustment Method Zero cell counts cause many calculation problems. To compensate for this, a small value (called the Zero Count Adjustment Value) can be added either to all cells or to all cells with zero counts. This option specifies whether you want to use the adjustment and which type of adjustment you want to use. We recommend that you use the option Add to zero cells only. Zero cell values often do not occur in practice. However, since power calculations are based on total enumeration, they will occur in power and sample size estimation. Adding a small value is controversial, but can be necessary for computational considerations. Statisticians have recommended adding various fractions to zero counts. We have found that adding seems to work well. Zero Count Adjustment Value Zero cell counts cause many calculation problems when computing power or sample size. To compensate for this, a small value may be added either to all cells or to all zero cells. This value indicates the amount that is added. We have found that works well. Be warned that the value of the ratio and the odds ratio will be affected by the amount specified here! Exact Test Options Maximum N or N for Exact Calculations When either N or N is above this amount, power calculations are based on the normal approximation to the binomial. When the normal approximation to the binomial is used, the actual value of alpha is not calculated. Currently, for three-gigahertz computers, a value near 00 is reasonable. As computers increase in speed, this number may be increased. 5-7

18 Example Finding Power A study is being designed to establish the equivalence of a new treatment compared to the current treatment. Historically, the current treatment has enjoyed a 50% cure rate. The new treatment reduces the seriousness of certain side effects that occur with the current treatment. Thus, the new treatment will be adopted even if it is slightly less effective than the current treatment. The researchers will recommend adoption of the new treatment if its cure rate is within 5% of the standard treatment. The researchers plan to use the Farrington and Manning likelihood score test statistic to analyze the data. They want to study the power of the Farrington and Manning test at group sample sizes ranging from 50 to 500 for detecting a difference inside 5% when the actual cure rate of the new treatment ranges from 50% to 60%. The significance level will be Setup This section presents the values of each of the parameters needed to run this example. First, from the PASS Home window, load the using Differences procedure window by expanding Proportions, then Two Independent Proportions, then clicking on Equivalence, and then clicking on using Differences. You may then make the appropriate entries as listed below, or open Example by going to the File menu and choosing Open Example Template. Option Value Design Tab Solve For... Power Test Type... Likelihood Score (Farr. & Mann.) Alpha Group Allocation... Equal (N = N) Sample Size Per Group to 500 by 50 D0.U (Upper Equivalence Difference) D0.L (Lower Equivalence Difference)... -D0.U D (Actual Difference) P (Reference Group Proportion)

19 Annotated Output Click the Calculate button to perform the calculations and generate the following output. Numeric Results Numeric Results for Equivalence Tests Based on the Difference: P - P H0: P - P D0.L or P - P D0.U. H: D0.L < P - P = D < D0.U. Test Statistic: Score test (Farrington & Manning) Ref. Target Actual Power N N N P P.0L P.0U D0.L D0.U D Alpha Alpha (report continues) Note: Direct Binomial distribution calculations for alpha and power were only used when both N and N were less than 00. In all other cases, Normal approximation was used. Report Definitions Power is the probability of rejecting a false null hypothesis. N and N are the number of items sampled from each population. N is the total sample size, N + N. P is the proportion for Group. This is the standard, reference, or control group. P.0L is the smallest treatment-group response rate that still yields an equivalence conclusion. P.0U is the largest treatment-group response rate that still yields an equivalence conclusion. D0.L is the lowest difference that still results in the conclusion of equivalence. D0.U is the highest difference that still results in the conclusion of equivalence. D is the actual difference, P-P, at which the power is calculated. Target Alpha is the input probability of rejecting a true null hypothesis. Actual Alpha is the value of alpha that is actually achieved. Summary Statements Sample sizes of 50 in the treatment group and 50 in the reference group achieve 0% power to detect equivalence. The margin of equivalence, given in terms of the difference, extends from to The actual difference is The reference group proportion is The calculations assume that two, one-sided likelihood score (Farrington & Manning) tests are used. Although the significance level is targeted at , the level actually achieved is This report shows the values of each of the parameters, one scenario per row. Note that the actual alpha value is blank for sample sizes greater than 00, which was the limit set for exact computation. Most of the report columns have obvious interpretations. Those that may not be obvious are presented here. Prop Grp P This is the value of P, the response rate in the control group. Lower & Upper Equiv. Grp Prop: P.0L & P.0U These are the margin of equivalence for the response rate of the treatment group as specified by the null hypothesis of non-equivalence. Values of P inside these limits are considered equivalent to P. Lower & Upper Equiv. Margin Diff: D0.L & D0.U These set the margin of equivalence for the difference in response rates. Values of the difference outside these limits are considered non-equivalent. Actual Margin Diff D This is the value of D, the difference between the two group proportions at which the power is computed. This is the value of the difference under the alternative hypothesis. 5-9

20 Target Alpha This is the value of alpha that was targeted by the design. Note that the target alpha is not usually achieved exactly. For two-sided tests, this value will usually be Actual Alpha This is the value of alpha that was actually achieved by this design. Note that since the limit on exact calculations was set to 00, and since this value is calculated exactly, it is not shown for values of N greater than 00. The difference between the Target Alpha and the Actual Alpha is caused by the discrete nature of the binomial distribution and the use of the normal approximation to the binomial in determining the critical value of the test statistic. Plots Section 5-0

21 The values from the table are displayed in the above charts. These charts give a quick look at the sample size that will be required for various values of D. 5-

22 Example Finding the Sample Size Continuing with the scenario given in Example, the researchers want to determine the sample size necessary for each value of D to achieve a power of To cut down on the runtime, they decide to look at approximate values whenever N is greater than 00. Setup This section presents the values of each of the parameters needed to run this example. First, from the PASS Home window, load the using Differences procedure window by expanding Proportions, then Two Independent Proportions, then clicking on Equivalence, and then clicking on using Differences. You may then make the appropriate entries as listed below, or open Example by going to the File menu and choosing Open Example Template. Option Value Design Tab Solve For... Sample Size Test Type... Likelihood Score (Farr. & Mann.) Power Alpha Group Allocation... Equal (N = N) D0.U (Upper Equivalence Difference) D0.L (Lower Equivalence Difference)... -D0.U D (Actual Difference) P (Reference Group Proportion) Output Click the Calculate button to perform the calculations and generate the following output. Numeric Results Numeric Results for Equivalence Tests Based on the Difference: P - P H0: P - P D0.L or P - P D0.U. H: D0.L < P - P = D < D0.U. Test Statistic: Score test (Farrington & Manning) Target Actual Ref. Target Actual Power Power N N N P P.0L P.0U D0.L D0.U D Alpha Alpha The required sample size will depend a great deal on the value of D. Any effort spent determining an accurate value for D will be worthwhile. 5-

23 Example 3 Comparing the Power of Several Test Statistics Continuing with Example, the researchers want to determine which of the eight possible test statistics to adopt by using the comparative reports and charts that PASS produces. They decide to compare the powers and actual alphas for various sample sizes between 50 and 00 when D is 0.. Setup This section presents the values of each of the parameters needed to run this example. First, from the PASS Home window, load the using Differences procedure window by expanding Proportions, then Two Independent Proportions, then clicking on Equivalence, and then clicking on using Differences. You may then make the appropriate entries as listed below, or open Example 3 by going to the File menu and choosing Open Example Template. Option Value Design Tab Solve For... Power Test Type... Likelihood Score (Farr. & Mann.) Alpha Group Allocation... Equal (N = N) Sample Size Per Group to 00 by 50 D0.U (Upper Equivalence Difference) D0.L (Lower Equivalence Difference)... -D0.U D (Actual Difference) P (Reference Group Proportion) Options Tab Maximum N or N for Exact Calc Reports Tab Show Numeric Report... Not checked Show Comparative Reports... Checked Show Summary Statements... Not checked Plots Tab Show Plots... Not checked Show Comparative Plots... Checked 5-3

24 Output Click the Calculate button to perform the calculations and generate the following output. Numeric Results and Plots Power Comparison of Equivalence Tests Based on the Difference: P - P H0: P - P D0.L or P - P D0.U. H: D0.L < P - P = D < D0.U. Upper Equiv. Z(P) Z(UnP) Z(P) Z(UnP) T F.M. M.N. G.N. Margin Target Test Test CC Test CC Test Test Score Score Score N/N P D0.U Alpha Power Power Power Power Power Power Power Power 50/ / / / Actual Alpha Comparison of Equivalence Tests Based on the Difference: P - P H0: P - P D0.L or P - P D0.U. H: D0.L < P - P = D < D0.U. Upper Equiv. Z(P) Z(UnP) Z(P) Z(UnP) T F.M. M.N. G.N. Margin Target Test Test CC Test CC Test Test Score Score Score N/N P D0.U Alpha Alpha Alpha Alpha Alpha Alpha Alpha Alpha Alpha 50/ / / / It is interesting to note that the powers of the continuity-corrected test statistics are consistently lower than the other tests. This occurs because the actual alpha achieved by these tests is lower than for the other tests. An interesting finding of this example is that the regular t-test performed about as well as the z-test. 5-4

25 Example 4 Validation using Chow with Equal Sample Sizes Chow et al. (003), page 9, present a sample size study in which P = 0.75, D0.U = 0., D0.L = -0., D = 0.05, alpha = 0.05, and beta = 0.. Using the pooled Z test statistic, they found the sample size to be 96 in each group. Setup This section presents the values of each of the parameters needed to run this example. First, from the PASS Home window, load the using Differences procedure window by expanding Proportions, then Two Independent Proportions, then clicking on Equivalence, and then clicking on using Differences. You may then make the appropriate entries as listed below, or open Example 4 by going to the File menu and choosing Open Example Template. Option Value Design Tab Solve For... Sample Size Test Type... Z Test (Pooled) Power Alpha Group Allocation... Equal (N = N) D0.U (Upper Equivalence Difference) D0.L (Lower Equivalence Difference)... -D0.U D (Actual Difference) P (Reference Group Proportion) Options Tab Maximum N or N for Exact Calc.... (Set low for a rapid search.) Output Click the Calculate button to perform the calculations and generate the following output. Numeric Results Numeric Results for Equivalence Tests Based on the Difference: P - P H0: P - P D0.L or P - P D0.U. H: D0.L < P - P = D < D0.U. Test Statistic: Z test (pooled) Target Actual Ref. Target Actual Power Power N N N P P.0L P.0U D0.L D0.U D Alpha Alpha PASS found the required sample size to be 98 which is slightly larger than the 96 that Chow obtained. This is mainly due to the rounding to two decimal places that Chow did in this example. We used the exact option in PASS and obtained N = 99. Thus, PASS was indeed closer than was Chow. 5-5

26 Example 5 Validation using Tuber-Bitter with Equal Sample Sizes Tuber-Bitter et al. (000), page 7, present a sample size study in which P = 0.; D0.U = 0.0, 0.0, 0.03; D0.L = -D0.U; D = 0.0; alpha = 0.05; and beta = 0.. Using the pooled Z test statistic, they found the sample sizes to be 9484, 487, and 65 in each group. Setup This section presents the values of each of the parameters needed to run this example. First, from the PASS Home window, load the using Differences procedure window by expanding Proportions, then Two Independent Proportions, then clicking on Equivalence, and then clicking on using Differences. You may then make the appropriate entries as listed below, or open Example 5 by going to the File menu and choosing Open Example Template. Option Value Design Tab Solve For... Sample Size Test Type... Z Test (Pooled) Power Alpha Group Allocation... Equal (N = N) D0.U (Upper Equivalence Difference) D0.L (Lower Equivalence Difference)... -D0.U D (Actual Difference) P (Reference Group Proportion) Options Tab Maximum N or N for Exact Calc.... (Set low for a rapid search.) Output Click the Calculate button to perform the calculations and generate the following output. Numeric Results Numeric Results for Equivalence Tests Based on the Difference: P - P H0: P - P D0.L or P - P D0.U. H: D0.L < P - P = D < D0.U. Test Statistic: Z test (pooled) Target Actual Ref. Target Actual Power Power N N N P P.0L P.0U D0.L D0.U D Alpha Alpha PASS found the required sample sizes to within rounding error of Tuber-Bitter. 5-6

27 Example 6 Computing the Power after Completing an Experiment Researchers are testing a generic drug to determine if it is equivalent to the name-brand alternative. Equivalence is declared if the success rate of the generic brand is no more than 5% from that of the name-brand drug. In a study with 000 individuals in each group, they find that 774, or 77.4%, are successfully treated using the namebrand drug, and 700, or 70%, respond to the generic drug. An equivalence test (exact test) with alpha = 0.05 failed to declare that the two drugs are equivalent. The researchers would now like to compute the power for actual differences ranging from 0 to 4%. Suppose that the true value for the response rate for the name-brand drug is 77%. Note that the power is not calculated at the difference observed in the study, 77.4%. In fact, the difference observed in the study is larger than the proposed equivalence difference, 5%. It would make no sense to perform a power calculation for a difference larger than the equivalence difference. It is more informative to study a range of values smaller than or equal to the equivalence difference. Setup This section presents the values of each of the parameters needed to run this example. First, from the PASS Home window, load the using Differences procedure window by expanding Proportions, then Two Independent Proportions, then clicking on Equivalence, and then clicking on using Differences. You may then make the appropriate entries as listed below, or open Example 6 by going to the File menu and choosing Open Example Template. Option Value Design Tab Solve For... Power Test Type... Likelihood Score (Farr. & Mann.) Alpha Group Allocation... Equal (N = N) Sample Size Per Group D0.U (Upper Equivalence Difference) D0.L (Lower Equivalence Difference)... -D0.U D (Actual Difference) to 0.04 by 0.0 P (Reference Group Proportion)

Non-Inferiority Tests for Two Proportions

Non-Inferiority Tests for Two Proportions Chapter 0 Non-Inferiority Tests for Two Proportions Introduction This module provides power analysis and sample size calculation for non-inferiority and superiority tests in twosample designs in which

More information

Tests for Two Proportions

Tests for Two Proportions Chapter 200 Tests for Two Proportions Introduction This module computes power and sample size for hypothesis tests of the difference, ratio, or odds ratio of two independent proportions. The test statistics

More information

Non-Inferiority Tests for Two Means using Differences

Non-Inferiority Tests for Two Means using Differences Chapter 450 on-inferiority Tests for Two Means using Differences Introduction This procedure computes power and sample size for non-inferiority tests in two-sample designs in which the outcome is a continuous

More information

Tests for One Proportion

Tests for One Proportion Chapter 100 Tests for One Proportion Introduction The One-Sample Proportion Test is used to assess whether a population proportion (P1) is significantly different from a hypothesized value (P0). This is

More information

Two-Sample T-Tests Assuming Equal Variance (Enter Means)

Two-Sample T-Tests Assuming Equal Variance (Enter Means) Chapter 4 Two-Sample T-Tests Assuming Equal Variance (Enter Means) Introduction This procedure provides sample size and power calculations for one- or two-sided two-sample t-tests when the variances of

More information

Non-Inferiority Tests for One Mean

Non-Inferiority Tests for One Mean Chapter 45 Non-Inferiority ests for One Mean Introduction his module computes power and sample size for non-inferiority tests in one-sample designs in which the outcome is distributed as a normal random

More information

Two-Sample T-Tests Allowing Unequal Variance (Enter Difference)

Two-Sample T-Tests Allowing Unequal Variance (Enter Difference) Chapter 45 Two-Sample T-Tests Allowing Unequal Variance (Enter Difference) Introduction This procedure provides sample size and power calculations for one- or two-sided two-sample t-tests when no assumption

More information

Tests for Two Survival Curves Using Cox s Proportional Hazards Model

Tests for Two Survival Curves Using Cox s Proportional Hazards Model Chapter 730 Tests for Two Survival Curves Using Cox s Proportional Hazards Model Introduction A clinical trial is often employed to test the equality of survival distributions of two treatment groups.

More information

Point Biserial Correlation Tests

Point Biserial Correlation Tests Chapter 807 Point Biserial Correlation Tests Introduction The point biserial correlation coefficient (ρ in this chapter) is the product-moment correlation calculated between a continuous random variable

More information

Pearson's Correlation Tests

Pearson's Correlation Tests Chapter 800 Pearson's Correlation Tests Introduction The correlation coefficient, ρ (rho), is a popular statistic for describing the strength of the relationship between two variables. The correlation

More information

Confidence Intervals for the Difference Between Two Means

Confidence Intervals for the Difference Between Two Means Chapter 47 Confidence Intervals for the Difference Between Two Means Introduction This procedure calculates the sample size necessary to achieve a specified distance from the difference in sample means

More information

Two Correlated Proportions (McNemar Test)

Two Correlated Proportions (McNemar Test) Chapter 50 Two Correlated Proportions (Mcemar Test) Introduction This procedure computes confidence intervals and hypothesis tests for the comparison of the marginal frequencies of two factors (each with

More information

Binary Diagnostic Tests Two Independent Samples

Binary Diagnostic Tests Two Independent Samples Chapter 537 Binary Diagnostic Tests Two Independent Samples Introduction An important task in diagnostic medicine is to measure the accuracy of two diagnostic tests. This can be done by comparing summary

More information

NCSS Statistical Software

NCSS Statistical Software Chapter 06 Introduction This procedure provides several reports for the comparison of two distributions, including confidence intervals for the difference in means, two-sample t-tests, the z-test, the

More information

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING In this lab you will explore the concept of a confidence interval and hypothesis testing through a simulation problem in engineering setting.

More information

Confidence Intervals for Cp

Confidence Intervals for Cp Chapter 296 Confidence Intervals for Cp Introduction This routine calculates the sample size needed to obtain a specified width of a Cp confidence interval at a stated confidence level. Cp is a process

More information

Confidence Intervals for One Standard Deviation Using Standard Deviation

Confidence Intervals for One Standard Deviation Using Standard Deviation Chapter 640 Confidence Intervals for One Standard Deviation Using Standard Deviation Introduction This routine calculates the sample size necessary to achieve a specified interval width or distance from

More information

Data Analysis Tools. Tools for Summarizing Data

Data Analysis Tools. Tools for Summarizing Data Data Analysis Tools This section of the notes is meant to introduce you to many of the tools that are provided by Excel under the Tools/Data Analysis menu item. If your computer does not have that tool

More information

Confidence Intervals for Exponential Reliability

Confidence Intervals for Exponential Reliability Chapter 408 Confidence Intervals for Exponential Reliability Introduction This routine calculates the number of events needed to obtain a specified width of a confidence interval for the reliability (proportion

More information

NCSS Statistical Software

NCSS Statistical Software Chapter 06 Introduction This procedure provides several reports for the comparison of two distributions, including confidence intervals for the difference in means, two-sample t-tests, the z-test, the

More information

Standard Deviation Estimator

Standard Deviation Estimator CSS.com Chapter 905 Standard Deviation Estimator Introduction Even though it is not of primary interest, an estimate of the standard deviation (SD) is needed when calculating the power or sample size of

More information

Randomized Block Analysis of Variance

Randomized Block Analysis of Variance Chapter 565 Randomized Block Analysis of Variance Introduction This module analyzes a randomized block analysis of variance with up to two treatment factors and their interaction. It provides tables of

More information

Confidence Intervals for Cpk

Confidence Intervals for Cpk Chapter 297 Confidence Intervals for Cpk Introduction This routine calculates the sample size needed to obtain a specified width of a Cpk confidence interval at a stated confidence level. Cpk is a process

More information

Three-Stage Phase II Clinical Trials

Three-Stage Phase II Clinical Trials Chapter 130 Three-Stage Phase II Clinical Trials Introduction Phase II clinical trials determine whether a drug or regimen has sufficient activity against disease to warrant more extensive study and development.

More information

Introduction to. Hypothesis Testing CHAPTER LEARNING OBJECTIVES. 1 Identify the four steps of hypothesis testing.

Introduction to. Hypothesis Testing CHAPTER LEARNING OBJECTIVES. 1 Identify the four steps of hypothesis testing. Introduction to Hypothesis Testing CHAPTER 8 LEARNING OBJECTIVES After reading this chapter, you should be able to: 1 Identify the four steps of hypothesis testing. 2 Define null hypothesis, alternative

More information

t Tests in Excel The Excel Statistical Master By Mark Harmon Copyright 2011 Mark Harmon

t Tests in Excel The Excel Statistical Master By Mark Harmon Copyright 2011 Mark Harmon t-tests in Excel By Mark Harmon Copyright 2011 Mark Harmon No part of this publication may be reproduced or distributed without the express permission of the author. mark@excelmasterseries.com www.excelmasterseries.com

More information

Confidence Intervals for Spearman s Rank Correlation

Confidence Intervals for Spearman s Rank Correlation Chapter 808 Confidence Intervals for Spearman s Rank Correlation Introduction This routine calculates the sample size needed to obtain a specified width of Spearman s rank correlation coefficient confidence

More information

NCSS Statistical Software. One-Sample T-Test

NCSS Statistical Software. One-Sample T-Test Chapter 205 Introduction This procedure provides several reports for making inference about a population mean based on a single sample. These reports include confidence intervals of the mean or median,

More information

CONTINGENCY TABLES ARE NOT ALL THE SAME David C. Howell University of Vermont

CONTINGENCY TABLES ARE NOT ALL THE SAME David C. Howell University of Vermont CONTINGENCY TABLES ARE NOT ALL THE SAME David C. Howell University of Vermont To most people studying statistics a contingency table is a contingency table. We tend to forget, if we ever knew, that contingency

More information

STATISTICA Formula Guide: Logistic Regression. Table of Contents

STATISTICA Formula Guide: Logistic Regression. Table of Contents : Table of Contents... 1 Overview of Model... 1 Dispersion... 2 Parameterization... 3 Sigma-Restricted Model... 3 Overparameterized Model... 4 Reference Coding... 4 Model Summary (Summary Tab)... 5 Summary

More information

Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means

Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means Lesson : Comparison of Population Means Part c: Comparison of Two- Means Welcome to lesson c. This third lesson of lesson will discuss hypothesis testing for two independent means. Steps in Hypothesis

More information

Study Guide for the Final Exam

Study Guide for the Final Exam Study Guide for the Final Exam When studying, remember that the computational portion of the exam will only involve new material (covered after the second midterm), that material from Exam 1 will make

More information

MATRIX ALGEBRA AND SYSTEMS OF EQUATIONS

MATRIX ALGEBRA AND SYSTEMS OF EQUATIONS MATRIX ALGEBRA AND SYSTEMS OF EQUATIONS Systems of Equations and Matrices Representation of a linear system The general system of m equations in n unknowns can be written a x + a 2 x 2 + + a n x n b a

More information

Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1)

Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1) Spring 204 Class 9: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.) Big Picture: More than Two Samples In Chapter 7: We looked at quantitative variables and compared the

More information

Years after 2000. US Student to Teacher Ratio 0 16.048 1 15.893 2 15.900 3 15.900 4 15.800 5 15.657 6 15.540

Years after 2000. US Student to Teacher Ratio 0 16.048 1 15.893 2 15.900 3 15.900 4 15.800 5 15.657 6 15.540 To complete this technology assignment, you should already have created a scatter plot for your data on your calculator and/or in Excel. You could do this with any two columns of data, but for demonstration

More information

Sample Size and Power in Clinical Trials

Sample Size and Power in Clinical Trials Sample Size and Power in Clinical Trials Version 1.0 May 011 1. Power of a Test. Factors affecting Power 3. Required Sample Size RELATED ISSUES 1. Effect Size. Test Statistics 3. Variation 4. Significance

More information

Stat 5102 Notes: Nonparametric Tests and. confidence interval

Stat 5102 Notes: Nonparametric Tests and. confidence interval Stat 510 Notes: Nonparametric Tests and Confidence Intervals Charles J. Geyer April 13, 003 This handout gives a brief introduction to nonparametrics, which is what you do when you don t believe the assumptions

More information

THE FIRST SET OF EXAMPLES USE SUMMARY DATA... EXAMPLE 7.2, PAGE 227 DESCRIBES A PROBLEM AND A HYPOTHESIS TEST IS PERFORMED IN EXAMPLE 7.

THE FIRST SET OF EXAMPLES USE SUMMARY DATA... EXAMPLE 7.2, PAGE 227 DESCRIBES A PROBLEM AND A HYPOTHESIS TEST IS PERFORMED IN EXAMPLE 7. THERE ARE TWO WAYS TO DO HYPOTHESIS TESTING WITH STATCRUNCH: WITH SUMMARY DATA (AS IN EXAMPLE 7.17, PAGE 236, IN ROSNER); WITH THE ORIGINAL DATA (AS IN EXAMPLE 8.5, PAGE 301 IN ROSNER THAT USES DATA FROM

More information

STRUTS: Statistical Rules of Thumb. Seattle, WA

STRUTS: Statistical Rules of Thumb. Seattle, WA STRUTS: Statistical Rules of Thumb Gerald van Belle Departments of Environmental Health and Biostatistics University ofwashington Seattle, WA 98195-4691 Steven P. Millard Probability, Statistics and Information

More information

Chapter 3 RANDOM VARIATE GENERATION

Chapter 3 RANDOM VARIATE GENERATION Chapter 3 RANDOM VARIATE GENERATION In order to do a Monte Carlo simulation either by hand or by computer, techniques must be developed for generating values of random variables having known distributions.

More information

One-Way ANOVA using SPSS 11.0. SPSS ANOVA procedures found in the Compare Means analyses. Specifically, we demonstrate

One-Way ANOVA using SPSS 11.0. SPSS ANOVA procedures found in the Compare Means analyses. Specifically, we demonstrate 1 One-Way ANOVA using SPSS 11.0 This section covers steps for testing the difference between three or more group means using the SPSS ANOVA procedures found in the Compare Means analyses. Specifically,

More information

Introduction to Hypothesis Testing

Introduction to Hypothesis Testing I. Terms, Concepts. Introduction to Hypothesis Testing A. In general, we do not know the true value of population parameters - they must be estimated. However, we do have hypotheses about what the true

More information

How To Check For Differences In The One Way Anova

How To Check For Differences In The One Way Anova MINITAB ASSISTANT WHITE PAPER This paper explains the research conducted by Minitab statisticians to develop the methods and data checks used in the Assistant in Minitab 17 Statistical Software. One-Way

More information

Chapter 2. Hypothesis testing in one population

Chapter 2. Hypothesis testing in one population Chapter 2. Hypothesis testing in one population Contents Introduction, the null and alternative hypotheses Hypothesis testing process Type I and Type II errors, power Test statistic, level of significance

More information

Good luck! BUSINESS STATISTICS FINAL EXAM INSTRUCTIONS. Name:

Good luck! BUSINESS STATISTICS FINAL EXAM INSTRUCTIONS. Name: Glo bal Leadership M BA BUSINESS STATISTICS FINAL EXAM Name: INSTRUCTIONS 1. Do not open this exam until instructed to do so. 2. Be sure to fill in your name before starting the exam. 3. You have two hours

More information

EXCEL Analysis TookPak [Statistical Analysis] 1. First of all, check to make sure that the Analysis ToolPak is installed. Here is how you do it:

EXCEL Analysis TookPak [Statistical Analysis] 1. First of all, check to make sure that the Analysis ToolPak is installed. Here is how you do it: EXCEL Analysis TookPak [Statistical Analysis] 1 First of all, check to make sure that the Analysis ToolPak is installed. Here is how you do it: a. From the Tools menu, choose Add-Ins b. Make sure Analysis

More information

Hypothesis testing - Steps

Hypothesis testing - Steps Hypothesis testing - Steps Steps to do a two-tailed test of the hypothesis that β 1 0: 1. Set up the hypotheses: H 0 : β 1 = 0 H a : β 1 0. 2. Compute the test statistic: t = b 1 0 Std. error of b 1 =

More information

6 3 The Standard Normal Distribution

6 3 The Standard Normal Distribution 290 Chapter 6 The Normal Distribution Figure 6 5 Areas Under a Normal Distribution Curve 34.13% 34.13% 2.28% 13.59% 13.59% 2.28% 3 2 1 + 1 + 2 + 3 About 68% About 95% About 99.7% 6 3 The Distribution Since

More information

Permutation Tests for Comparing Two Populations

Permutation Tests for Comparing Two Populations Permutation Tests for Comparing Two Populations Ferry Butar Butar, Ph.D. Jae-Wan Park Abstract Permutation tests for comparing two populations could be widely used in practice because of flexibility of

More information

6.4 Normal Distribution

6.4 Normal Distribution Contents 6.4 Normal Distribution....................... 381 6.4.1 Characteristics of the Normal Distribution....... 381 6.4.2 The Standardized Normal Distribution......... 385 6.4.3 Meaning of Areas under

More information

Testing Group Differences using T-tests, ANOVA, and Nonparametric Measures

Testing Group Differences using T-tests, ANOVA, and Nonparametric Measures Testing Group Differences using T-tests, ANOVA, and Nonparametric Measures Jamie DeCoster Department of Psychology University of Alabama 348 Gordon Palmer Hall Box 870348 Tuscaloosa, AL 35487-0348 Phone:

More information

C. The null hypothesis is not rejected when the alternative hypothesis is true. A. population parameters.

C. The null hypothesis is not rejected when the alternative hypothesis is true. A. population parameters. Sample Multiple Choice Questions for the material since Midterm 2. Sample questions from Midterms and 2 are also representative of questions that may appear on the final exam.. A randomly selected sample

More information

Paired T-Test. Chapter 208. Introduction. Technical Details. Research Questions

Paired T-Test. Chapter 208. Introduction. Technical Details. Research Questions Chapter 208 Introduction This procedure provides several reports for making inference about the difference between two population means based on a paired sample. These reports include confidence intervals

More information

Module 4 (Effect of Alcohol on Worms): Data Analysis

Module 4 (Effect of Alcohol on Worms): Data Analysis Module 4 (Effect of Alcohol on Worms): Data Analysis Michael Dunn Capuchino High School Introduction In this exercise, you will first process the timelapse data you collected. Then, you will cull (remove)

More information

Crosstabulation & Chi Square

Crosstabulation & Chi Square Crosstabulation & Chi Square Robert S Michael Chi-square as an Index of Association After examining the distribution of each of the variables, the researcher s next task is to look for relationships among

More information

Lin s Concordance Correlation Coefficient

Lin s Concordance Correlation Coefficient NSS Statistical Software NSS.com hapter 30 Lin s oncordance orrelation oefficient Introduction This procedure calculates Lin s concordance correlation coefficient ( ) from a set of bivariate data. The

More information

Chapter 9. Two-Sample Tests. Effect Sizes and Power Paired t Test Calculation

Chapter 9. Two-Sample Tests. Effect Sizes and Power Paired t Test Calculation Chapter 9 Two-Sample Tests Paired t Test (Correlated Groups t Test) Effect Sizes and Power Paired t Test Calculation Summary Independent t Test Chapter 9 Homework Power and Two-Sample Tests: Paired Versus

More information

KSTAT MINI-MANUAL. Decision Sciences 434 Kellogg Graduate School of Management

KSTAT MINI-MANUAL. Decision Sciences 434 Kellogg Graduate School of Management KSTAT MINI-MANUAL Decision Sciences 434 Kellogg Graduate School of Management Kstat is a set of macros added to Excel and it will enable you to do the statistics required for this course very easily. To

More information

MATRIX ALGEBRA AND SYSTEMS OF EQUATIONS. + + x 2. x n. a 11 a 12 a 1n b 1 a 21 a 22 a 2n b 2 a 31 a 32 a 3n b 3. a m1 a m2 a mn b m

MATRIX ALGEBRA AND SYSTEMS OF EQUATIONS. + + x 2. x n. a 11 a 12 a 1n b 1 a 21 a 22 a 2n b 2 a 31 a 32 a 3n b 3. a m1 a m2 a mn b m MATRIX ALGEBRA AND SYSTEMS OF EQUATIONS 1. SYSTEMS OF EQUATIONS AND MATRICES 1.1. Representation of a linear system. The general system of m equations in n unknowns can be written a 11 x 1 + a 12 x 2 +

More information

Guide to Microsoft Excel for calculations, statistics, and plotting data

Guide to Microsoft Excel for calculations, statistics, and plotting data Page 1/47 Guide to Microsoft Excel for calculations, statistics, and plotting data Topic Page A. Writing equations and text 2 1. Writing equations with mathematical operations 2 2. Writing equations with

More information

Using Excel for inferential statistics

Using Excel for inferential statistics FACT SHEET Using Excel for inferential statistics Introduction When you collect data, you expect a certain amount of variation, just caused by chance. A wide variety of statistical tests can be applied

More information

Comparison of frequentist and Bayesian inference. Class 20, 18.05, Spring 2014 Jeremy Orloff and Jonathan Bloom

Comparison of frequentist and Bayesian inference. Class 20, 18.05, Spring 2014 Jeremy Orloff and Jonathan Bloom Comparison of frequentist and Bayesian inference. Class 20, 18.05, Spring 2014 Jeremy Orloff and Jonathan Bloom 1 Learning Goals 1. Be able to explain the difference between the p-value and a posterior

More information

Unit 26 Estimation with Confidence Intervals

Unit 26 Estimation with Confidence Intervals Unit 26 Estimation with Confidence Intervals Objectives: To see how confidence intervals are used to estimate a population proportion, a population mean, a difference in population proportions, or a difference

More information

CHAPTER 2 Estimating Probabilities

CHAPTER 2 Estimating Probabilities CHAPTER 2 Estimating Probabilities Machine Learning Copyright c 2016. Tom M. Mitchell. All rights reserved. *DRAFT OF January 24, 2016* *PLEASE DO NOT DISTRIBUTE WITHOUT AUTHOR S PERMISSION* This is a

More information

Recall this chart that showed how most of our course would be organized:

Recall this chart that showed how most of our course would be organized: Chapter 4 One-Way ANOVA Recall this chart that showed how most of our course would be organized: Explanatory Variable(s) Response Variable Methods Categorical Categorical Contingency Tables Categorical

More information

Statistics 2014 Scoring Guidelines

Statistics 2014 Scoring Guidelines AP Statistics 2014 Scoring Guidelines College Board, Advanced Placement Program, AP, AP Central, and the acorn logo are registered trademarks of the College Board. AP Central is the official online home

More information

Odds ratio, Odds ratio test for independence, chi-squared statistic.

Odds ratio, Odds ratio test for independence, chi-squared statistic. Odds ratio, Odds ratio test for independence, chi-squared statistic. Announcements: Assignment 5 is live on webpage. Due Wed Aug 1 at 4:30pm. (9 days, 1 hour, 58.5 minutes ) Final exam is Aug 9. Review

More information

Factor Analysis. Chapter 420. Introduction

Factor Analysis. Chapter 420. Introduction Chapter 420 Introduction (FA) is an exploratory technique applied to a set of observed variables that seeks to find underlying factors (subsets of variables) from which the observed variables were generated.

More information

HYPOTHESIS TESTING: POWER OF THE TEST

HYPOTHESIS TESTING: POWER OF THE TEST HYPOTHESIS TESTING: POWER OF THE TEST The first 6 steps of the 9-step test of hypothesis are called "the test". These steps are not dependent on the observed data values. When planning a research project,

More information

ABSORBENCY OF PAPER TOWELS

ABSORBENCY OF PAPER TOWELS ABSORBENCY OF PAPER TOWELS 15. Brief Version of the Case Study 15.1 Problem Formulation 15.2 Selection of Factors 15.3 Obtaining Random Samples of Paper Towels 15.4 How will the Absorbency be measured?

More information

Exact Nonparametric Tests for Comparing Means - A Personal Summary

Exact Nonparametric Tests for Comparing Means - A Personal Summary Exact Nonparametric Tests for Comparing Means - A Personal Summary Karl H. Schlag European University Institute 1 December 14, 2006 1 Economics Department, European University Institute. Via della Piazzuola

More information

Statistical tests for SPSS

Statistical tests for SPSS Statistical tests for SPSS Paolo Coletti A.Y. 2010/11 Free University of Bolzano Bozen Premise This book is a very quick, rough and fast description of statistical tests and their usage. It is explicitly

More information

individualdifferences

individualdifferences 1 Simple ANalysis Of Variance (ANOVA) Oftentimes we have more than two groups that we want to compare. The purpose of ANOVA is to allow us to compare group means from several independent samples. In general,

More information

CHAPTER 5 Round-off errors

CHAPTER 5 Round-off errors CHAPTER 5 Round-off errors In the two previous chapters we have seen how numbers can be represented in the binary numeral system and how this is the basis for representing numbers in computers. Since any

More information

CALCULATIONS & STATISTICS

CALCULATIONS & STATISTICS CALCULATIONS & STATISTICS CALCULATION OF SCORES Conversion of 1-5 scale to 0-100 scores When you look at your report, you will notice that the scores are reported on a 0-100 scale, even though respondents

More information

Session 7 Bivariate Data and Analysis

Session 7 Bivariate Data and Analysis Session 7 Bivariate Data and Analysis Key Terms for This Session Previously Introduced mean standard deviation New in This Session association bivariate analysis contingency table co-variation least squares

More information

Statistics courses often teach the two-sample t-test, linear regression, and analysis of variance

Statistics courses often teach the two-sample t-test, linear regression, and analysis of variance 2 Making Connections: The Two-Sample t-test, Regression, and ANOVA In theory, there s no difference between theory and practice. In practice, there is. Yogi Berra 1 Statistics courses often teach the two-sample

More information

Sampling Strategies for Error Rate Estimation and Quality Control

Sampling Strategies for Error Rate Estimation and Quality Control Project Number: JPA0703 Sampling Strategies for Error Rate Estimation and Quality Control A Major Qualifying Project Report Submitted to the faculty of the Worcester Polytechnic Institute in partial fulfillment

More information

Introduction to Statistical Computing in Microsoft Excel By Hector D. Flores; hflores@rice.edu, and Dr. J.A. Dobelman

Introduction to Statistical Computing in Microsoft Excel By Hector D. Flores; hflores@rice.edu, and Dr. J.A. Dobelman Introduction to Statistical Computing in Microsoft Excel By Hector D. Flores; hflores@rice.edu, and Dr. J.A. Dobelman Statistics lab will be mainly focused on applying what you have learned in class with

More information

MONT 107N Understanding Randomness Solutions For Final Examination May 11, 2010

MONT 107N Understanding Randomness Solutions For Final Examination May 11, 2010 MONT 07N Understanding Randomness Solutions For Final Examination May, 00 Short Answer (a) (0) How are the EV and SE for the sum of n draws with replacement from a box computed? Solution: The EV is n times

More information

SPSS Explore procedure

SPSS Explore procedure SPSS Explore procedure One useful function in SPSS is the Explore procedure, which will produce histograms, boxplots, stem-and-leaf plots and extensive descriptive statistics. To run the Explore procedure,

More information

Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model

Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model 1 September 004 A. Introduction and assumptions The classical normal linear regression model can be written

More information

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as... HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1 PREVIOUSLY used confidence intervals to answer questions such as... You know that 0.25% of women have red/green color blindness. You conduct a study of men

More information

Statistics. One-two sided test, Parametric and non-parametric test statistics: one group, two groups, and more than two groups samples

Statistics. One-two sided test, Parametric and non-parametric test statistics: one group, two groups, and more than two groups samples Statistics One-two sided test, Parametric and non-parametric test statistics: one group, two groups, and more than two groups samples February 3, 00 Jobayer Hossain, Ph.D. & Tim Bunnell, Ph.D. Nemours

More information

II. DISTRIBUTIONS distribution normal distribution. standard scores

II. DISTRIBUTIONS distribution normal distribution. standard scores Appendix D Basic Measurement And Statistics The following information was developed by Steven Rothke, PhD, Department of Psychology, Rehabilitation Institute of Chicago (RIC) and expanded by Mary F. Schmidt,

More information

STATISTICAL SIGNIFICANCE OF RANKING PARADOXES

STATISTICAL SIGNIFICANCE OF RANKING PARADOXES STATISTICAL SIGNIFICANCE OF RANKING PARADOXES Anna E. Bargagliotti and Raymond N. Greenwell Department of Mathematical Sciences and Department of Mathematics University of Memphis and Hofstra University

More information

MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS

MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS MSR = Mean Regression Sum of Squares MSE = Mean Squared Error RSS = Regression Sum of Squares SSE = Sum of Squared Errors/Residuals α = Level of Significance

More information

Multiple-Comparison Procedures

Multiple-Comparison Procedures Multiple-Comparison Procedures References A good review of many methods for both parametric and nonparametric multiple comparisons, planned and unplanned, and with some discussion of the philosophical

More information

StatCrunch and Nonparametric Statistics

StatCrunch and Nonparametric Statistics StatCrunch and Nonparametric Statistics You can use StatCrunch to calculate the values of nonparametric statistics. It may not be obvious how to enter the data in StatCrunch for various data sets that

More information

LOGIT AND PROBIT ANALYSIS

LOGIT AND PROBIT ANALYSIS LOGIT AND PROBIT ANALYSIS A.K. Vasisht I.A.S.R.I., Library Avenue, New Delhi 110 012 amitvasisht@iasri.res.in In dummy regression variable models, it is assumed implicitly that the dependent variable Y

More information

Bowerman, O'Connell, Aitken Schermer, & Adcock, Business Statistics in Practice, Canadian edition

Bowerman, O'Connell, Aitken Schermer, & Adcock, Business Statistics in Practice, Canadian edition Bowerman, O'Connell, Aitken Schermer, & Adcock, Business Statistics in Practice, Canadian edition Online Learning Centre Technology Step-by-Step - Excel Microsoft Excel is a spreadsheet software application

More information

Once saved, if the file was zipped you will need to unzip it. For the files that I will be posting you need to change the preferences.

Once saved, if the file was zipped you will need to unzip it. For the files that I will be posting you need to change the preferences. 1 Commands in JMP and Statcrunch Below are a set of commands in JMP and Statcrunch which facilitate a basic statistical analysis. The first part concerns commands in JMP, the second part is for analysis

More information

LAB : THE CHI-SQUARE TEST. Probability, Random Chance, and Genetics

LAB : THE CHI-SQUARE TEST. Probability, Random Chance, and Genetics Period Date LAB : THE CHI-SQUARE TEST Probability, Random Chance, and Genetics Why do we study random chance and probability at the beginning of a unit on genetics? Genetics is the study of inheritance,

More information

research/scientific includes the following: statistical hypotheses: you have a null and alternative you accept one and reject the other

research/scientific includes the following: statistical hypotheses: you have a null and alternative you accept one and reject the other 1 Hypothesis Testing Richard S. Balkin, Ph.D., LPC-S, NCC 2 Overview When we have questions about the effect of a treatment or intervention or wish to compare groups, we use hypothesis testing Parametric

More information

Probability Distributions

Probability Distributions CHAPTER 5 Probability Distributions CHAPTER OUTLINE 5.1 Probability Distribution of a Discrete Random Variable 5.2 Mean and Standard Deviation of a Probability Distribution 5.3 The Binomial Distribution

More information

X X X a) perfect linear correlation b) no correlation c) positive correlation (r = 1) (r = 0) (0 < r < 1)

X X X a) perfect linear correlation b) no correlation c) positive correlation (r = 1) (r = 0) (0 < r < 1) CORRELATION AND REGRESSION / 47 CHAPTER EIGHT CORRELATION AND REGRESSION Correlation and regression are statistical methods that are commonly used in the medical literature to compare two or more variables.

More information

NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( )

NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( ) Chapter 340 Principal Components Regression Introduction is a technique for analyzing multiple regression data that suffer from multicollinearity. When multicollinearity occurs, least squares estimates

More information

Mind on Statistics. Chapter 12

Mind on Statistics. Chapter 12 Mind on Statistics Chapter 12 Sections 12.1 Questions 1 to 6: For each statement, determine if the statement is a typical null hypothesis (H 0 ) or alternative hypothesis (H a ). 1. There is no difference

More information