Size: px
Start display at page:

Download ""

Transcription

1 CHAPTER 7 Comparison of Two Independent Samples 7.1 We first find the standard error of each mean. SE 1 = SE ȳ1 = s 1 n 1 = = SE 2 = SE ȳ2 = s 2 n 2 = = SE (ȳ1 - ȳ 2 ) = SE SE 2 2 = = SE 1 = = ; SE 2 = = = SE 1 = 5 25 = 1; SE 2 = 6 29 = = SE 1 = = 2.907; SE 2 = = = SE 1 = = 2.055; SE 2 = = = = SE (ȳ1 - ȳ 2 ) = SE SE 2 2 = = SE 1 = =.133; SE 2 = = = = 10.2.

2 7.10 Let 1 denote males and let 2 denote females. ȳ 1 = 45.8; SE 1 = 2.8/ 489 =.127. ȳ 2 = 40.6; SE 2 = 2.9/ 469 = The standard error of the difference is SE (ȳ1 - ȳ 2 ) = =.185. The critical value t.025 is determined from Student's t distribution with df = 950. Using df = 1000 (the nearest value given in Table 4), we find that t(1000).025 = The 95% confidence interval is ȳ 1 - ȳ 2 ± t.025 SE (ȳ1 - ȳ 2 ) ( ) ± (1.962)(.185). So the confidence interval is (4.84,5.56) or 4.84 < µ 1 - µ 2 < Let 1 denote dark and let 2 denote photoperiod. SE (ȳ1 - ȳ 2 ) = = (a) (92-115) ± (2.447)(9.192) (df = 6) (-45.5,-.5) or < µ 1 - µ 2 < -.5 nmol/gm. (b) (92-115) ± (1.943)(9.192) (df = 6) (-40.9,-5.1) or < µ 1 - µ 2 < -5.1 nmol/gm (a) Let 1 denote biofeedback and let 2 denote control. SE (ȳ1 - ȳ 2 ) = = ( ) ± (1.977)(1.867) (using df = 140) (6.1,13.5) or 6.1 < µ 1 - µ 2 < 13.5 mm Hg. (b) We are 95% confident that the population mean reduction in systolic blood pressure for those who receive training for eight weeks (µ 1 ) is larger than that for others (µ 2 ) by an amount that might be as small as 6.1 mm Hg or as large as 13.5 mm Hg No. The confidence interval found in Exercise 7.11 is valid even if the distributions are not normal, because the sample sizes are large (a) Let 1 denote antibiotic and let 2 denote control. SE (ȳ1 - ȳ 2 ) = = (25-23) ± (1.740)(4.050) (using df = 17) (-5.0,9.0) or -5 < µ 1 - µ 2 < 9 sec. (b) We are 90% confident that the population mean prothrombin time for rats treated with an antibiotic (µ 1 ) is smaller than that for control rats (µ 2 ) by an amount that might be as much as 5 seconds or is larger than that for control rats (µ 2 ) by an amount that might be as large as 9 seconds.

3 Let 1 denote control and let 2 denote Pargyline. SE (ȳ1 - ȳ 2 ) = = (a) ( ) ± (1.96)(.4286) (df = ) (-32.4,-30.8) or < µ 1 - µ 2 < mg. (b) ( ) ± (2.576)(.4286) (df = ) (-32.7,-30.5) or < µ 1 - µ 2 < mg (a) Let 1 denote successful and let 2 denote unsuccessful. SE (ȳ1 - ȳ 2 ) = = ( ) ± (2.021)(.08763) (using df = 40) (-.12,.24) or -.12 < µ 1 - µ 2 <.24 mm. (b) We are 95% confident that the population mean head width of all females who mate successfully (µ 1 ) is smaller than that for rejected females (µ 2 ) by an amount that might be as much as.12 mm or is larger than that for rejected females (µ 2 ) by an amount that might be as large as.24 mm We are 97.5% confident that the population mean drop in systolic blood pressure of adults placed on a diet rich in fruits and vegetables for eight weeks (µ 1 ) is larger than that for adults placed on a standard diet (µ 2 ) by an amount that might be as small as.9 mm Hg or as large as 4.7 mm Hg We are 97.5% confident that the population mean drop in diastolic blood pressure of adults placed on a diet rich in fruits and vegetables for eight weeks (µ 1 ) is smaller than that for adults placed on a standard diet (µ 2 ) by an amount that might be as much as.3 mm Hg or is larger than that for adults placed on a standard diet (µ 2 ) by an amount that might be as much as 2.4 mm Hg Let 1 denote caffeine and let 2 denote decaf. SE (ȳ1 - ȳ 2 ) = = ( ) ± (1.740)(5.02) (using df = 17) (-7.33,10.13) or < µ 1 - µ 2 < Normal probability plots support the normality condition: c a f f e i n e d e c a f nscores nscores 7.21 Let 1 denote red and let 2 denote green.

4 SE (ȳ1 - ȳ 2 ) = = ( ) ± (2.021)(.509) (using df = 40) (-1.61,0.45) or < µ 1 - µ 2 < No; the sample sizes are moderately large, so the sampling distribution of the difference in means is approximately normal, despite the skewness in the underlying data (a) The observed t statistic is t s = ȳ 1 - ȳ 2 SE (ȳ1 - ȳ 2 ) = = From Table 4 with df = 4, we find the critical values t.02 = and t.01 = Because t s is between t.01 and t.02, the P-value must be between.02 and.04. Thus, the P-value is bracketed as.02 < P <.04. (b) The observed t statistic is t s = ȳ 1 - ȳ 2 SE (ȳ1 - ȳ 2 ) = = From Table 4 with df = 12, we find the critical values t.20 =.873 and t.10 = Because t s is between t.10 and t.20, the P-value must be between.20 and.40. Thus, the P-value is bracketed as.20 < P <.40. (c) The observed t statistic is t s = ȳ 1 - ȳ 2 SE (ȳ1 - ȳ 2 ) = = From Table 4 with df = 30, we find the critical value t.0005 = Because t s is greater than t.0005, the P-value must be less than.001. Thus, the P-value is bracketed as P < (a) t s = ( )/5.7 = t(10).20 =.879; t(10).10 = Thus,.20 < P <.40. (b) t s = ( )/1.9 = t(13).01 = 2.650; t(13).005 = Thus,.01 < P <.02. (c) t s = ( )/.12 = t(19).0005 = Thus, P < (a).085 <.10, which means that the P-value is less than α. Thus, we reject H 0. (b).065 >.05, which means that the P-value is greater than α. Thus, we do not reject H 0. (c) Table 4 gives t(19).005 = and t(19).0005 = 3.883, so.001 < P <.01. Since P < α, we reject H 0. (d) Table 4 gives t(12).05 = and t(12).04 = 1.912, so.08 < P <.10. Since P > α, we do not reject H (a) Because P > α, we do not reject H 0. (b) Because P < α, we reject H 0. (c) t(5).04 = and t(5).03 = 2.422, so.06 < P <.08. Because P < α, we reject H 0.

5 100 (d) t(16).04 = and t(16).03 = 2.024, so.06 < P <.08. Because P > α, we do not reject H 0. Remark concerning tests of hypotheses The answer to a hypothesis testing exercise includes verbal statements of the hypotheses and a verbal statement of the conclusion from the test. In phrasing these statements, we have tried to capture the essence of the biological question being addressed; nevertheless the statements are necessarily oversimplified and they gloss over many issues that in reality might be quite important. For instance, the hypotheses and conclusion may refer to a causal connection between treatment and response; in reality the validity of such a causal interpretation usually depends on a number of factors related to the design of the investigation (such as unbiased allocation of animals to treatment groups) and to the specific experimental procedures (such as the accuracy of assays or measurement techniques). In short, the student should be aware that the verbal statements are intended to clarify the statistical concepts; their biological content may be open to question H 0 : µ HH = µ SH H A : µ HH µ SH SE (ȳ1 - ȳ 2 ) = = 4.09; t s = = df = 71.9, using formula (7.1), or df = 32 using the conservative choice of min{n 1-1,n 2-1}, or df = 82 using n 1 + n 2-2. In any case,.20 < P <.40. Thus, P > α and we do not reject H ȳ 1 - ȳ 2 = =.44. (a) SE (ȳ1 - ȳ 2 ) = =.2530 t s =.44/.2530 = Table 4 gives t(8).10 = and t(8).05 = 1.860, so.10 < P <.20. (b) SE (ȳ1 - ȳ 2 ) = = t s =.44/.1789 = Table 4 gives t(18).02 = and t(18).01 = 2.552, so.02 < P <.04. (c) SE (ȳ1 - ȳ 2 ) = = t s =.44/.1461 = Table 4 gives t(28).005 = and t(28).0005 = 3.674, so.001 < P < (a) The null and alternative hypotheses are H 0 : µ 1 = µ 2 H A : µ 1 µ 2 where 1 denotes heart disease and 2 denotes control. These hypotheses may be stated as H 0 : Mean serotonin concentration is the same in heart patients and in controls H A : Mean serotonin concentration is not the same in heart patients and in controls The test statistic is

6 t s = ȳ 1 - ȳ 2 SE (ȳ1 - ȳ 2 ) = = From Table 4 with df = 14, we find the critical values t.10 = and t.05 = Thus, the P-value is bracketed as.10 < P <.20. Since the P-value is greater than α (.05), H 0 is not rejected. (b) There is insufficient evidence (.10 < P <.20) to conclude that serotonin levels are different in heart patients than in controls. (c) SE (ȳ1 - ȳ 2 ) = = (a) H 0 : mean tibia length does not depend on gender (µ 1 = µ 2 ) H A : mean tibia length depends on gender (µ 1 µ 2 ) SE (ȳ1 - ȳ 2 ) = = t s = ( )/ = df = n 1 + n 2-2 = (Formula (7.1) gives df = 94.3.) Table 4 gives t.005 = and t.0005 = 3.390; thus.001 < P <.01, so we reject H 0. (b) There is sufficient evidence (.001 < P <.01) to conclude that mean tibia length is larger in females than in males. (c) Judging from the means and SDs, the two distributions overlap substantially, so tibia length would be a poor predictor of sex. (d) H 0 and H A are as is part (a). SE (ȳ1 - ȳ 2 ) = = t s = ( )/1.962 = df = n 1 + n 2-2 = 9. (Formula (7.1) gives df = 7.8.) Table 4 gives t.20 =.883 and t.10 = 1.383; thus.20 < P <.40 and we do not reject H (a) H 0 : mean thymus weight is the same at 14 and 15 days (µ 1 = µ 2 ) H A : mean thymus weight is not the same at 14 and 15 days (µ 1 µ 2 ) SE (ȳ1 - ȳ 2 ) = = 5.06 t s = ( )/5.06 =.49. df = n 1 + n 2-2 = 8. (Formula (7.1) gives df = 7.7.) Table 4 gives t.20 =.889; thus P >.40, so we do not reject H 0. There is insufficient evidence (P >.40) to conclude that mean thymus weight is different at 14 and 15 days. (b) According to the P-value found in part (a), the fact that ȳ 1 is greater than ȳ 2 could easily be attributed to chance. [Remark: A student has commented that it would not be surprising if µ 1 were actually less than µ 2, because in certain cases the thymus gland would be expected to shrink during embryonic development.] 7.32 (a) The null and alternative hypotheses are H 0 : µ 1 = µ 2 H A : µ 1 µ 2

7 102 where 1 denotes flooded and 2 denotes control. These hypotheses may be stated as H 0 : Flooding has no effect on ATP H A : Flooding has some effect on ATP The standard error of the difference is SE (ȳ1 - ȳ 2 ) = = The test statistic is t s = ȳ 1 - ȳ 2 SE (ȳ1 - ȳ 2 ) = = From Table 4 with df = n 1 + n 2-2 = 6 (Formula (7.1) yields df = 5.6), we find the critical values t.005 = and t.0005 = Thus, the P-value is bracketed as.001 < P <.01. Since the P-value is less than α (.05), we reject H 0. (b) There is sufficient evidence (.001 < P <.01) to conclude that flooding tends to lower ATP in birch seedlings (a) H 0 : Albumin and polygelatin are equally effective as plasma expanders (µ 1 = µ 2 ) H A : Albumin and polygelatin are not equally effective as plasma expanders (µ 1 µ 2 ) SE (ȳ1 - ȳ 2 ) = = t s = ( )/67.08 = df = n 1 + n 2-2 = and t(40).0005 = Thus, P <.001, so we reject H 0. (b) There is sufficient evidence (P <.001) to conclude that albumin is more effective than polygelatin as a plasma expander H 0 : Mean fall in cholesterol is the same on both diets (µ 1 = µ 2 ) H A : Mean fall in cholesterol is not the same on both diets (µ 1 µ 2 ) SE (ȳ1 - ȳ 2 ) = t s = ( )/13.53 = = df = n 1 + n 2-2 = 18 and t(18).20 =.862. Thus, P >.40, so we do not reject H 0. There is insufficient evidence (P >.40) to conclude that the two diets differ in their effects on cholesterol (a) True. We would reject H 0 because the P-value is less than α. (b) True. We would reject H 0 because the P-value is less than α. (c) True. This follows directly from the definition of a P-value (a) True. We would reject H 0 because the P-value is less than α. (b) False. We do not reject H 0 because the P-value is greater than α. (c) False. The P-value is the probability, under H 0, of getting a result as extreme as, or more extreme than, the result that was actually observed.

8 7.37 (a) H 0 : mean number of colonies is the same for control and soap (µ 1 = µ 2 ) H A : mean number of colonies is not the same for control and soap (µ 1 µ 2 ) 103 SE (ȳ1 - ȳ 2 ) 2 2 = = t s = ( )/10.21 =.92. Using Table 4 with df=10 we have t.20 =.879 and t.10 = Thus.20 < P <.40, so we do not reject H 0. (b) There is insufficient evidence (.20 < P <.40) to conclude that the mean number of colonies differs for control and soap (a) H 0 : mean height is the same for control and fertilized plants (µ 1 = µ 2 ) H A : mean height is not the same for control and fertilized plants (µ 1 µ 2 ) 2 SE (ȳ1 - ȳ 2 ) = =.183 t s = ( )/.183 = Using Table 4 with df=50 we have t.005 = and t.0005 = Thus.001 < P <.01, so we reject H 0. (b) There is strong evidence (.001 < P <.01) to conclude that the mean height of fertilized radish sprouts is less than that of controls See Section III of this Manual If we reject H 0 (i.e., if the drug is approved) then we eliminate the possibility of a Type II error. (But by rejecting H 0 we may have made a Type I error.) 7.43 A type II error may have been made Yes; because zero is outside of the confidence interval, we know that the P-value is less than.05, so we reject the hypothesis that µ 1 - µ 2 = Yes; because zero is outside of the confidence interval, we know that the P-value is less than.05, so the P- value is less than.10. Thus, we reject the hypothesis that µ 1 - µ 2 = (a) The observed t statistic is t s = ȳ 1 - ȳ 2 SE (ȳ1 - ȳ 2 ) = = To check the directionality of the data, we note that ȳ 1 > ȳ 2. Thus, the data do deviate from H 0 in the direction (µ 1 > µ 2 ) specified by H A, and therefore the one-tailed P-value is the area under the t curve beyond From Table 4 with df = 18, we find the critical values t.20 =.862 and t.10 = Because t s is between t.10 and t.20, the one-tailed P-value must be between.10 and.20. Thus, the P-value is bracketed as.10 < P <.20.

9 104 (b) The observed t statistic is t s = ȳ 1 - ȳ 2 SE (ȳ1 - ȳ 2 ) = = To check the directionality of the data, we note that ȳ 1 > ȳ 2. Thus, the data do deviate from H 0 in the direction (µ 1 > µ 2 ) specified by H A, and therefore the one-tailed P-value is the area under the t curve beyond From Table 4 with df = 140 (the closest value to 180), we find the critical values t.04 = and t.03 = Because t s is between t.03 and t.04, the one-tailed P-value must be between.03 and.04. Thus, the P-value is bracketed as.03 < P < (a) t s =.61 =.39. With df = 17, Table 4 gives t.20 =.863. Thus, P > (b) t s = 45 = With df = 8, Table 4 gives t.20 =.889 and t.10 = Thus,.10 < P <.20. (c) t s = = Because ȳ 1 < ȳ 2, the data do not deviate from H 0 in the direction specified by H A. Thus, P > (a) Yes. t s is positive, as predicted by H A. Thus, the P-value is the area under the t curve beyond With df = 19, Table 4 gives t.005 = and t.0005 = Thus,.0005 < P <.005. Since P < α, we reject H 0. (b) Yes. t s is positive, as predicted by H A. Thus, the P-value is the area under the t curve beyond 2.6. With df = 5, Table 4 gives t.025 = and t.02 = Thus,.02 < P <.025. Since P < α, we reject H 0. (c) Yes. t s is positive, as predicted by H A. Thus, the P-value is the area under the t curve beyond 2.1. With df = 7, Table 4 gives t.04 = and t.03 = Thus,.03 < P <.04. Since P < α, we reject H 0. (d) No. t s is positive, as predicted by H A. Thus, the P-value is the area under the t curve beyond 1.8. With df = 7, Table 4 gives t.10 = and t.05 = Thus,.05 < P <.10. Since P > α, we do not reject H (a) No. With df = 23, Table 4 gives t.10 = and t.05 = Thus,.05 < P <.10. Since P > α, we do not reject H 0. (b) Yes. With df = 5, Table 4 gives t.04 = and t.03 = Thus,.03 < P <.04. Since P < α, we reject H 0. (c) No. Because t s > 0, the data do not deviate from H 0 in the direction specified by H A. Thus, P >.50 and we do not reject H 0. (d) Yes. With df = 27, Table 4 gives t.005 = and t.0005 = Thus,.0005 < P <.005. Since P < α, we reject H 0.

10 7.50 Let 1 denote infected and 2 denote noninfected. H 0 : Malaria does not affect red cell count (µ 1 = µ 2 ) H A : Malaria reduces red cell count (µ 1 < µ 2 ) 105 We note that ȳ 1 > ȳ 2, so the data do not deviate from H 0 in the direction specified by H A. Thus, P >.50. (a) H 0 is not rejected. There is no evidence that malaria reduces red cell count in this population. (b) Same as part (a). [Note: If H A were reversed (µ 1 > µ 2 ), then H 0 would be rejected at α =.10. Thus, this exercise illustrates the importance of the directionality check.] 7.51 Let 1 denote experimental (to be hypnotized) and 2 denote control. SE (ȳ1 - ȳ 2 ) = t s = ( )/.3183 = = With df = n 1 + n 2-2 = 14 (Formula (7.1) yields df = 13.97), Table 4 gives t.01 = and t.005 = (a) H 0 : Mean ventilation is the same in the "to be hypnotized" condition and in the "control" condition (µ 1 = µ 2 ) H A : Mean ventilation is different in the "to be hypnotized" condition than in the "control" condition (µ 1 µ 2 ) H 0 is rejected. There is sufficient evidence (.01 < P <.02) to conclude that mean ventilation is higher in the "to be hypnotized" condition than in the "control" condition. (b) H 0 : Mean ventilation is the same in the "to be hypnotized" condition and in the "control" condition (µ 1 = µ 2 ) H A : Mean ventilation is higher in the "to be hypnotized" condition than in the "control" condition (µ 1 > µ 2 ) H 0 is rejected. There is sufficient evidence (.005 < P <.01) to conclude that mean ventilation is higher in the "to be hypnotized" condition than in the "control" condition. (c) The nondirectional alternative (part (a)) is more appropriate. According to the narrative, the researchers formulated the directional alternative in part (b) after they had seen the data. Thus, it would not be legitimate for them (or us) to use a directional alternative Let 1 denote experimental (to be hypnotized) and 2 denote control. H 0 : Extra nitrogen does not enhance plant growth (µ 1 = µ 2 ) H A : Extra nitrogen does enhance plant growth (µ 1 < µ 2 ) SE (ȳ1 - ȳ 2 ) = = t s = ( )/.3848 = With df = n 1 + n 2-2 = 8 (Formula (7.1) yields df = 7.7), Table 4 gives t.10 = and t.05 = H 0 is not rejected. There is insufficient evidence (.05 < P <.10) to conclude that extra nitrogen enhances plant growth under these conditions.

11 The null and alternative hypotheses are H 0 : µ 1 = µ 2 H A : µ 1 < µ 2 where 1 denotes wounded and 2 denotes control. These hypotheses may be stated as H 0 : Wounding the plant has no effect on larval growth H A : Wounding the plant tends to diminish larval growth To check the directionality of the data, we note that ȳ 1 < ȳ 2. Thus, the data do deviate from H 0 in the direction (µ 1 < µ 2 ) specified by H A. We proceed to calculate the test statistic. The standard error of the difference is SE (ȳ1 - ȳ 2 ) = = The test statistic is ȳ 1 - ȳ t s = SE = (ȳ1 - ȳ 2 ) 3.46 = From Table 4 with df = = (Formula (7.1) yields df = 31.8), we find the critical values t.01 = and t.005 = Thus, the P-value is bracketed as.005 < P <.01. Since the P-value is less than α (.05), we reject H 0. There is sufficient evidence (.005 < P <.01) to conclude that wounding the plant tends to diminish larval growth (a) The null and alternative hypotheses are H 0 : µ 1 = µ 2 H A : µ 1 > µ 2 where 1 denotes drug and 2 denotes placebo. These hypotheses may be stated as H 0 : The drug is not effective H A : The drug is effective To check the directionality of the data, we note that ȳ 1 > ȳ 2. Thus, the data do deviate from H 0 in the direction (µ 1 > µ 2 ) specified by H A. We proceed to calculate the test statistic. The standard error of the difference is SE (ȳ1 - ȳ 2 ) = The test statistic is t s = ȳ 1 - ȳ 2 SE (ȳ1 - ȳ 2 ) = = = From Table 4 with df = = (Formula (7.1) yields df = 47.2), we find the critical values t.04 = and t.03 = Thus, the P-value is bracketed as.03 < P <.04. Since the P-value is less than α (.05), we reject H 0. There is sufficient evidence (.03 < P <.04) to conclude that the drug is effective at increasing pain relief. (b) The only change in the calculations from part (a) would be that the one-tailed area would be doubled if the alternative were nondirectional. Thus, the p-value would be between.06 and.08 and at α =.05 we would not reject H 0.

12 < P-value <.05. Thus, we would reject H 0 and conclude that ancy does, indeed, inhibit growth Let 1 denote 250 meters and 2 denote 800 meters. H 0 : Distance from the reef does not affect settler density (µ 1 = µ 2 ) H A : Settler density decreases as distance from the reef increases (µ 1 > µ 2 ) SE (ȳ1 - ȳ 2 ) = =.095. t s = ( )/.095 = 2. With df =80, Table 4 gives t.025 = and t.02 = Thus,.02 < P <.025 and we reject H 0. There is statisitcally significant evidence (.02 < P <.025) to conclude that settler density decreases as distance from the reef increases The proponents are confused. They are speaking as if it is known that µ 1 - µ 2 = 4 lb/acre, whereas the field trial indicates only that ȳ 1 - ȳ 2 = 4 lb/acre. That statistician's data analysis indicates that the trial gives only weak information about µ 1 - µ 2 ; in fact, the results do not even show whether µ 1 - µ 2 is positive, let alone that it is equal to 4 lb/acre The lack of a statistically significant difference in therapeutic responses does not show that the two medications are equally effective. (Such evidence could be obtained from either a confidence interval or an analysis of the power of the test.) 7.59 Let 1 denote male and 2 denote female. SE (ȳ1 - ȳ 2 ) = = ( ) ± (1.977)(.8157) (using df = 140) (-1.6,1.6) or -1.6 < µ 1 - µ 2 < 1.6 beats per minute. We can be 95% confident that the mean difference does not exceed 1.6 beats per minute, which is small and unimportant (in comparison with, for example, ordinary fluctuations in heart rate from one minute to the next.) 7.60 The mean difference in concentration of coumaric acid is µ 1 - µ 2, where 1 denotes dark and 2 denotes photoperiod. We construct a 95% confidence interval for µ 1 - µ 2. SE (ȳ1 - ȳ 2 ) = = The critical value t.025 is found from Student's t distribution with df = n 1 + n 2-2 = 6. (Formula (7.1) gives df = 5.7.) From Table 4, we find t(6).025 = The 95% confidence interval is ȳ 1 - ȳ 2 ± t.025 SE (ȳ1 - ȳ 2 ) ( ) ± (2.447)(17.103) (-37.9,45.9) or < µ 1 - µ 2 < 45.9 nmol/gm. The difference could be larger than 20 nmol/gm or much smaller, so the data do not indicate whether the difference is "important."

13 SE (ȳ1 - ȳ 2 ) = = ( ) ± (1.994)(5.408) (-6.8,14.8) or -6.8 < µ 1 - µ 2 < 14.8 nmol/gm. The difference is less than 20 nmol/gm, so the data do indicate that the difference is "important." 7.62 The mean difference in serum concentration of uric acid is µ 1 - µ 2, where 1 denotes men and 2 denotes women. We construct a 95% confidence interval for µ 1 - µ 2. SE (ȳ1 - ȳ 2 ) = = The critical value t.025 is found from Student's t distribution with df = n 1 + n 2-2 = (Formula (7.1) gives df = ) From Table 4, we find t(1000).025 = The 95% confidence interval is ȳ 1 - ȳ 2 ± t.025 SE (ȳ1 - ȳ 2 ) ( ) ± (1.962)(.00354) (.0841,.0979) or.0841 < µ 1 - µ 2 <.0979 mmol/l. All values in the confidence interval are greater than.08 mmol/l. Therefore, according to the confidence interval the data indicate that the difference is "clinically important." 7.63 SE (ȳ1 - ȳ 2 ) = = ( ) ± (1.984)(.0112) (using df = 100) (.069,.113) or.069 < µ 1 - µ 2 <.113 mmol/l. The difference could be greater than or less than.08 mmol/l, so the data do not indicate whether the difference is "clinically important." 7.64 From the preliminary data, we obtain.3 cm as a guess of σ. (a) If the true difference is.25 cm, then the effect size is µ 1 - µ 2 =.25 σ.3 =.83. We consult Table 5 for a two-tailed test at α =.05 and an effect size of.83.85; to achieve power.80, Table 5 recommends n = 23. (b) If the true difference is.5 cm, then the effect size is µ 1 - µ 2 =.5 σ.3 = We consult Table 5 for a two-tailed test at α =.05 and an effect size of ; to achieve power.95, Table 5 recommends n = µ 1 - µ 2 /σ = 44.4/69.6 =.64. (a) Table 5 gives n = 39.

14 (b) Table 5 gives n = µ 1 - µ 2 /σ = 2/.8 = 2.5. (a) Table 5 gives n = 5. (b) The required conditions are that the sampled populations are normal with equal standard deviations. The condition of normality can be checked from the pilot data. (c) Table 5 gives n = We need to find n to achieve a power of.9. The effect size is µ 1 - µ 2 σ = =.55. We consult Table 5. (a) For a two-tailed test at α =.05, Table 5 gives n = 71. (b) For a two-tailed test at α =.01, Table 5 gives n = 101. (c) For a one-tailed test at α =.05, Table 5 gives n = (a) µ 1 - µ 2 /σ = 2/2.5 =.8; Table 5 gives n = 26. (b) µ 1 - µ 2 /σ = 2/1.25 = 1.6; Table 5 gives n = 8. Yes, the measure would be cost-effective, because the required number of rats would be reduced by more than half. (In fact, the modified experiment would cost only 62% as much, because (2)(8)/26 =.62.) 7.69 The effect size is µ 1 - µ 2 = 4 σ 10 =.4. From Table 5 we find that, for a one-tailed test with n = 35 at significance level α =.05, the power is.50 if the effect size is.4. Thus, the probability that Jones will reject H 0 is equal to µ 1 - µ 2 /σ =.5 = (a) (a) Table 5 gives n = 51. (b) Table 5 gives n = 70. (b)

15 n = µ 1 - µ 2 /σ = (a) Both of the distributions appear to be slightly skewed to the right. (b) Because n 1 and n 2 are quite large, Student's t method is valid even if the populations are nonnormal The sample SDs are: Heart disease: s 1 = (850) 8 = 2404 Controls: s 2 = (640) 12 = 2217 The fact that the mean (3840) for the heart disease patients is only about 1.6 SDs, together with the fact that the observed variable cannot be negative, casts doubt on the normality condition for the population. Because the sample sizes are rather small, such a departure from normality can lead to poor performance by the t test (a) False; the P-value is the probability of data as unusual as those obtained if H 0 is true. (b) True; reject H 0 since the P-value is less than α. (c) False. We should reject H 0, but don t know the chance that we would reject H 0 in repeated experiments we do not know if H 0 is true. (d) True; this is the interpretation of a P-value We consult Table 6 with n = 7 and n' = 5. (a) U s = 26. From Table 6, the smallest critical value is 27, which corresponds to a nondirectional P-value of.20. Since U s < 27, it follows that P >.20. (b) U s = 30. From Table 6, the critical value 30 is under the.05 heading for a nondirectional alternative. The P-value is between this heading and the next one to the right. Thus,.02 < P.05. (c) U s = 35. From Table 6, the largest critical value is 34, which is under the.01 heading for a nondirectional alternative. There is no entry under the.002 heading, which means that the P-value is greater than.002. It follows that.002 < P < n = 8, n' = 4. (a).10 < P <.20. (b).002 < P <.01.

16 (c) The.002 entry is blank, so.002 < P < (a) The null and alternative hypotheses are H 0 : Toluene has no effect on dopamine in rat striatum H A : Toluene has some effect on dopamine in rat striatum Let 1 denote toluene and let 2 denote control. The ordered arrays of observations are as follows: Y 1 : Y 2 : For the K 1 count, we note that there are four Y 2 's less than the first Y 1 ; there are five Y 2 's less than the second Y 1 ; there are five Y 2 's less than the third Y 1 ; and there are six Y 2 's less than the fourth, fifth, and sixth Y 1. Thus, K 1 = = 32. For the K 2 count, we note that there are no Y 1 's less than the first, second, third, or fourth Y 2 ; there is one Y 1 less than the fifth Y 2 ; and there are three Y 1 's less than the sixth Y 2. Thus, K 2 = = 4. To check the counts, we verify that K 1 + K 2 = = 36 = (6)(6) = (n 1 )(n 2 ). The Wilcoxon-Mann-Whitney test statistic is the larger of the two counts K 1 and K 2 ; thus U s = 32. Looking in Table 6 under n = 6 and n' = 6, we find that for a nondirectional alternative, the.05 entry is 31 and the.02 entry is 33. Thus, the P-value is bracketed as.02 < P <.05. At significance level α =.05, we reject H 0, since P <.05. We note that K 1 is larger than K 2, which indicates a tendency for the Y 1 's to be larger than the Y 2 's. Thus, there is sufficient evidence (.02 < P <.05) to conclude that toluene increases dopamine in rat striatum. (b) When conducting a nondirectional test, we must check directionality. In this case, we note that K 1 is larger than K 2, which indicates a tendency for the Y 1 's to be larger than the Y 2 's, which is what the directional alternative predicts. We proceed as in part (a), except that we use the "directional" tail probabilities. Thus,.01 < P <.025. We reject H 0 and conclude that there is sufficient evidence (.01 < P <.025) to conclude that toluene increases dopamine in rat striatum Let 1 denote experimental (to be hypnotized) and 2 denote control. (a) H 0 : Ventilation is not differently affected by the "to be hypnotized" and the "control" conditions H A : Ventilation is differently affected by the "to be hypnotized" and the "control" conditions K 1 = 53, K 2 = 11, U s = 53. With n = 8 and n' = 8, Table 6 gives.02 < P <.05. H 0 is rejected. There is sufficient evidence (.02 < P <.05) to conclude that ventilation rate tends to be higher under the "to be hypnotized" condition than under the "control" condition (a) U s = 9. With n = n' = 3, U s = 9 is under the.10 heading and is the largest entry listed. Thus,.05 < P <.10. (b) U s = 16. With n = n' = 4, U s = 16 is under the.05 heading and is the largest entry listed. Thus,

17 < P <.05. (c) U s = 25. With n = n' = 5, U s = 25 is under the.01 heading and is the largest entry listed. Thus,.002 < P < (a) H 0 : There is no sex difference in preening behavior H A : There is a sex difference in preening behavior For n = n' = 15, the largest critical value is 189, which is under the.001 heading for a nondirectional alternative. It follows that P <.001, so H 0 is rejected. There is sufficient evidence (P <.001) to conclude that females tend to preen longer than males. (b) H 0 : There is no sex difference in preening behavior (µ 1 = µ 2 ) H A : There is a sex difference in preening behavior (µ 1 µ 2 ) t s = ( )/.7933 = With df = n 1 + n 2-2 = 28, Table 4 gives t.01 = and t.005 = 2.763, so that.01 < P <.02. Formula (7.1) yields df = 15.1 and the conservative approach of df = min{n 1-1,n 2-1} gives df = 14. For either of these df values we get.02 < P <.04. In any case, H 0 is not rejected, since P >.01. There is sufficient evidence to conclude that there is a sex difference in preening behavior. (c) Both tests require independent, random samples. The condition required for the t test but not for the Wilcoxon-Mann-Whitney test is that the population distributions are normal. The frequency distribution for the females is highly skewed, due to the two large observations of 10.7 and This casts doubt on the normality condition. (d) K 1 = = 35.5 K 2 = (11.5) + 3(13) (15) = where 1 denotes male and 2 denotes female (a) Let 1 denote singly housed and let 2 denote group-housed. H 0 : There is no difference in benzo(a)pyrene concentrations between singly housed and group-housed mice. H A : Benzo(a)pyrene concentrations tend to be higher in group-housed mice than in singly housed mice K 1 = 0, K 2 = 25, U s = 25, and the shift in the data is in the direction predicted by H A. With n = n' = 5, U s = 25 is under the.005 heading for a directional alternative and is the largest entry listed. Thus,.001 < P <.005 and H 0 is rejected. There is sufficient evidence (.001 < P <.005) to conclude that benzo(a)pyrene concentrations tend to be higher in group-housed mice than in singly housed mice. (b) A directional alternative is valid in this case because the researchers were investigating the hypothesis that licking or biting other mice leads to increase benzo(a)pyrene concentration. If access to other mice affects benzo(a)pyrene concentration, the effect would be to increase the concentration; a decrease in concentration is not plausible Let 1 denote joggers and let 2 denote fitness program entrants. H 0 : There is no difference in resting blood concentration of HBE between joggers fitness program entrants H A : There is a difference in resting blood concentration of HBE between joggers fitness program entrants K 1 = 93.5, K 2 = 71.5, U s = With n = 15 and n' = 11, 108 is under the.20 heading for a nondirectional alternative and is the smallest entry listed. Thus, P >.20 and H 0 is not rejected. There is insufficient evidence (P >.20) to conclude that there is a difference in resting blood concentration of HBE

18 between joggers fitness program entrants (a) SE 1 = 9.6/ 12 = 2.771; SE 2 = 10.2/ 13 = = (b) SE 1 = 2.7/ 22 =.576; SE 2 = 1.9/ 19 = =.722. (c) SE 1 = 1.2/ 5 =.537; SE 2 = 1.4/ 7 = = The null and alternative hypotheses are H 0 : Mean platelet calcium is the same in people with high blood pressure as in people with normal blood pressure (µ 1 = µ 2 ) H A : Mean platelet calcium is different in people with high blood pressure than in people with normal blood pressure (µ 1 µ 2 ) The standard error of the difference is SE (ȳ1 - ȳ 2 ) = The test statistic is t s = ȳ 1 - ȳ 2 SE (ȳ1 - ȳ 2 ) = = = From Table 4 with df = = 81 80, we find the critical value t.0005 = The tail area is doubled for the nondirectional test. Thus, the P-value is bracketed as P <.001. (Formula (7.1) yields df = 67.5, but the P-value is still bracketed as P <.001.) Since the P-value is less than α (.01), we reject H 0. There is sufficient evidence (P <.001) to conclude that mean platelet calcium is higher in people with high blood pressure than in people with normal blood pressure The mean difference in blood pressure is µ 1 - µ 2, where 1 denotes high blood pressure and 2 denotes normal blood pressure. The standard error of the difference is SE (ȳ1 - ȳ 2 ) = = The critical value t.025 is found from Student's t distribution with df given by Formula (7.1) as df = Table 4 gives t(70).025 = The 95% confidence interval is ȳ 1 - ȳ 2 ± t.025 SE (ȳ1 - ȳ 2 ) ( ) ± (1.994)(5.399). So the confidence interval is (49.5,71.1) or 49.5 < µ 1 - µ 2 < 71.1 nm. Alternatively, we could use df = = 81 80, in which case the critical value is t(80).025 = This gives an interval of ( ) ± (1.990)(5.399).

19 114 So the confidence interval is (49.6,71.0) or 49.6 < µ 1 - µ 2 < 71.0 nm No; the t test is valid because the sample sizes are rather large (a) H 0 : Mechanical milking does not produce different cell count than manual milking (µ 1 = µ 2 ) H A : Mechanical milking produces higher cell count than manual milking (µ 1 > µ 2 ) t s = ( )/ = With df = 18, Table 4 gives t.02 = and t.01 = Formula (7.1) yields df = 9.2 9; with df = 9, Table 4 gives t.025 = and t.02 = Using either df value, P <.05 and H 0 is rejected. There is sufficient evidence to conclude that mechanical milking produces higher cell count than manual milking. (The data support the investigator's suspicion.) (b) H 0 : Mechanical milking does not produce different cell count than manual milking H A : Mechanical milking produces higher cell count than manual milking U s = 69. The shift in the data is in the direction predicted by H A. With n = n' = 10, the entry under.10 for a directional alternative is 68 and the entry under.05 for a directional alternative is 73. Thus, we do not reject H 0. There is insufficient evidence (.05 < P <.10) to conclude that mechanical milking produces higher cell count than manual milking. (The data do not support the investigator's suspicion.) [Note that this contradicts the conclusion from part (a).] (c) Both tests require independent, random samples. The condition required for the t test but not for the Wilcoxon-Mann-Whitney test is that the population distributions are normal. The frequency distribution for the mechanical group is highly skewed, with some observations (2996 and 3452) that are much greater than the others. This casts doubt on the normality condition. (d) K 1 = = 69 K 2 = = 31 where 1 denotes mechanical and 2 denotes manual (a) Let 1 denote control and let 2 denote stress. H 0 : Stress has no effect on growth (µ 1 = µ 2 ) H A : Stress tends to retard growth (µ 1 > µ 2 ) SE (ȳ1 - ȳ 2 ) = = t s = ( )/.7611 = With df = 24, Table 4 gives t.005 = and t.0005 = (Formula (7.1) gives df = 23, which makes no appreciable change.) Thus,.0005 < P <.005, so we reject H 0. (b) There is sufficient evidence (.0005 < P <.005) to conclude that stress tends to retard plant growth Let 1 denote control and let 2 denote stress. SE (ȳ1 - ȳ 2 ) = = ( ) ± (2.064)(.7611) (df = 24) (1.24,4.38) or 1.24 < µ 1 - µ 2 < 4.38 cm. (a) The confidence interval indicates that the difference is at least 1 cm and so is "horticulturally important." (b) According to the confidence interval, the difference could be greater or smaller than 2 cm, so the data do not indicate whether the difference is "horticulturally important."

20 (c) The confidence interval indicates that the difference is less than 5 cm and so is not "horticulturally important." The null and alternative hypotheses are H 0 : Stress has no effect on growth H A : Stress tends to retard growth The data are already arrayed in increasing order. We let Y 1 denote control and Y 2 denote stress. For the K 1 count, we note that there is one Y 2 less than the first Y 1 ; there are ten Y 2 's less than the second Y 1 ; there are twelve Y 2 's less than the third, fourth, fifth, sixth, and seventh Y 1 ; there are twelve Y 2 's less than the eighth Y 1 and one equal to it; and there are thirteen Y 2 's less than the ninth through thirteenth Y 1. Thus, K 1 = = For the K 2 count, we note that there are no Y 1 's less than the first Y 2 ; there is one Y 1 less than the second through tenth Y 2 ; there are two Y 1 's less than the eleventh and twelfth Y 2 ; and there are seven Y 1 's less than the thirteenth Y 2 and one equal to it. Thus, K 2 = = To check the counts, we verify that K 1 + K 2 = = 169 = (13)(13) = (n 1 )(n 2 ). To check the directionality of the data, we note that K 1 > K 2, which suggests a tendency for the Y 1 's to be larger than the Y 2 's, which would indicate that stress retards growth. Thus, the data do deviate from H 0 is the direction specified by H A. The Wilcoxon-Mann-Whitney test statistic is the larger of the two counts K 1 and K 2 ; thus U s = Looking in Table 6 under n = 13 and n' = 13, we find that for a directional alternative, the largest entry is 146 under the.0005 heading. Thus, the P-value is bracketed as P < At significance level α =.01, we reject H 0, since P < α. There is sufficient evidence (P <.0005) to conclude that stress tends to retard growth Let 1 denote Vermilion River and let 2 denote Black River. H 0 : The populations from which the two samples were drawn have the same distribution of tree species per plot H A : Biodiversity is greater along the Vermilion River than along the Black River K 1 = 37, K 2 = 80, U s = 80; the data deviate from H 0 is the direction specified by H A. With n = n' = 13 and a directional alternative, the.10 entry in Table 6 is 79 and the.05 entry is 84. Thus, the P-value is between.05 and.10, so we reject H 0. There is sufficient evidence (.05 < P <.10) to conclude that biodiversity is greater along the Vermilion River than along the Black River Let 1 denote positive response and let 2 denote no response. H 0 : Ovarian ph is not related to progesterone response H A : Ovarian ph is related to progesterone response K 1 = 1.5, K 2 = 106.5, U s = With n = 18, n' = 6, and a nondirectional alternative, the largest entry is 100 under the.0005 heading. Thus, the P-value is bracketed as

21 116 P < At significance level α =.05, we reject H 0, since P < α. There is sufficient evidence (P <.0005) to conclude that ovarian ph is lower among responders to progesterone than among nonresponders H 0 : Ovarian ph is not related to progesterone response (µ 1 = µ 2 ) H A : Ovarian ph is related to progesterone response (µ 1 µ 2 ) SE (ȳ1 - ȳ 2 ) = = t s = ( )/ = With df = 22, Table 4 gives t.0005 = (Formula (7.1) gives df = 14.1, which makes no change in the conclusion.) Thus, P <.001, so we reject H 0. There is sufficient evidence (P <.001) to conclude that ovarian ph is lower among responders to progesterone than among nonresponders The lack of a statistically significant difference in weight gain does not show that the new diet is as good as the standard diet. (Evidence to this effect could be obtained from either a confidence interval or an analysis of the power of the test; see the next exercise.) 7.97 If the standard deviation is about 20% of the mean and the difference between the means is about 10%, then µ 1 = µ 2 =.10 σ.20 =.5. To achieve power of.50, Table 5 indicates that 32 animals per group would be required. Since 25 < 32, the power of this study was less than.50. Thus, the study did not have a reasonably good chance of detecting a population mean deficiency of 10% on the new diet (a) Two of the patients contributed two observations each to the data set. Thus, there is hierarchical structure, so the t test is not appropriate. (b) No. The Wilcoxon-Mann-Whitney test, like the t test, requires that the observations within a sample be independent of each other, so the Wilcoxon-Mann-Whitney test is not appropriate Let 1 denote low chromium and let 2 denote normal. H 0 : Low chromium diet does not affect GITH (µ 1 = µ 2 ) H A : Low chromium diet does affect GITH (µ 1 µ 2 ) ȳ 1 = 51.75, s 1 = ȳ 2 = 53.17, s 1 = SE (ȳ1 - ȳ 2 ) = = t s = ( )/1.970 = Formula (7.1) gives df = 21.9; t(22).20 =.858, so P >.40. Using a computer, we get P =.48. Thus, we do not reject H 0. There is insufficient evidence (P =.48) to conclude that low chromium diet affects GITH in rats Let 1 denote low chromium and let 2 denote normal. H 0 : Low chromium diet does not affect GITH H A : Low chromium diet does affect GITH

22 117 U s = n = 14, n' = 10, and a nondirectional alternative, the smallest entry is 93, under the.20 heading. Thus, P >.20 and we do not reject H 0. There is insufficient evidence (P >.20) to conclude that low chromium diet affects GITH in rats (See Exercise 7.99 for basic computations.) (a) ( ) ± (2.074)(1.970) (df = ) (-5.5,2.7) or -5.5 < µ 1 - µ 2 < 2.7 thousand cpm/gm. (b) All values in the confidence interval are smaller in magnitude than 8 thousand cpm/gm; thus the data support the conclusion that the difference is "unimportant." (c) The confidence interval indicates that the difference could be larger in magnitude than 4 thousand cpm/gm or smaller; thus the data do not indicate whether the difference is "unimportant." (a) Let 1 denote infected and let 2 denote uninfected. H 0 : Malaria is not related to stamina (µ 1 = µ 2 ) H A : Malaria is associated with decreased stamina (µ 1 < µ 2 ) ȳ 1 = 26.87, s 1 = 6.815; ȳ 2 = 32.23, s 1 = SE (ȳ1 - ȳ 2 ) = = t s = ( )/2.727 = Formula (7.1) gives df = 27.2; t(27).03 = and t(27).025 = 2.052, so.025 < P <.03. Using a computer, we get P = Thus, we reject H 0. There is sufficient evidence (P =.0298) to conclude that malaria is associated with decreased stamina. (b) H 0 : Malaria is not related to stamina H A : Malaria is associated with decreased stamina U s = 155; the data deviate from H 0 in the direction specified by H A. With n = 15, n' = 15, and a directional alternative, the entry under the.05 heading is 153 and the entry under the.025 heading is 161. Thus,.025 < P <.05 and we reject H 0. There is sufficient evidence (.025 < P <.05) to conclude that malaria is associated with decreased stamina (a) Let 1 denote amphetamine and let 2 denote control. H 0 : Amphetamine is not related to water consumption (µ 1 = µ 2 ) H A : Amphetamine is associated with decreased water consumption (µ 1 < µ 2 ) ȳ 1 = , s 1 = ȳ 2 = 156.0, s 1 = SE (ȳ1 - ȳ 2 ) = = t s = ( )/18.82 = With df = n 1 + n 2-2 = 6 (Formula (7.1) yields df = 5.9), Table 4 gives t.20 = and t.10 = 1.440, so.10 < P <.20. (Using a computer, we get P =.104.) Thus, we do not reject H 0. There is insufficient evidence (10 < P <.20) to conclude that amphetamine is associated with decreased water consumption. (b) H 0 : Amphetamine is not related to water consumption

23 118 H A : Amphetamine is associated with decreased water consumption K 1 = 4, K 2 = 12, U s = 12; the data deviate from H 0 in the direction specified by H A. With n = 4, n' = 4, and a directional alternative, the smallest entry is 13, under the.10 heading. Thus, P >.10 and we do not reject H 0. There is insufficient evidence (P >.10) to conclude that amphetamine is associated with decreased water consumption (a) False. The confidence interval includes zero, so we are not confident that µ 1 and µ 2 are different. (b) True. This is what a confidence interval tells us. (c) False. We know that ȳ 1 - ȳ 2 is exactly 6.9. (d) False. The confidence interval is used to make an inference about the difference between µ 1 and µ 2 ; it does not tell us about individual data points (such as the length of hospitalization for a nitric oxide infant) False. The 95% confidence interval includes zero, which means that the P-value for a nondirectional test is greater than.05. Thus, we would not reject H 0 at the.05 significance level (a) False. The P-value for a test is the probability of getting data at least as extreme as those obtained, if H 0 is true; it is not the probability that the null hypothesis is true. (b) True. The P-value for a test is the probability of getting data at least as extreme as those obtained, if H 0 is true, which is what this statement says. (c) False. The probability that H 0 is rejected depends on the power of the test, which is not known. (If H 0 is true -- and we don't know if it is true or not -- and a new study is done that uses α =.04, then there is a 4% probability that H 0 will be rejected.) (a) False. The confidence interval gives a range that we infer covers µ 1 - µ 2. It does not tells us where the bulk of the data lie. (b) True. This is what a confidence interval tells us. (c) False. The confidence interval is used to make an inference about the difference between µ 1 and µ 2 ; it does not tell us about future values of ȳ 1 and ȳ 2. (If µ 1 - µ 2 were exactly equal to the ȳ 1 - ȳ 2 difference obtained in this study -- which is not at all likely -- then the confidence interval could be used to predict where future ȳ 1 - ȳ 2 differences would fall.) (d) False. The confidence interval is used to make an inference about the difference between µ 1 and µ 2 ; it does not tell us about individual data points (such as the number of calories in a particular entree).

24 7.108 (a) The normal probability plots show very mild skewness: 119 Near Far nscores nscores (b) The following computer output shows that the P-value is Sample t-test of µ1-µ2 Ho: µ1-µ2 = 0 Ha: µ1-µ2 0 near - far : Test Ho: µ(near)-µ(far) = 0 vs Ha: µ(near)-µ(far) 0 Difference Between Means = t-statistic = w/65 df Reject Ho at Alpha = 0.05 p = (c) After taking logarithms of each observation, the normal probability plots look more linear: Log(Near) Log(Far) nscores nscores The P-value for the t-test is now.0156, which is almost the same as the P-value before the transformation:

Comparing Two Groups. Standard Error of ȳ 1 ȳ 2. Setting. Two Independent Samples

Comparing Two Groups. Standard Error of ȳ 1 ȳ 2. Setting. Two Independent Samples Comparing Two Groups Chapter 7 describes two ways to compare two populations on the basis of independent samples: a confidence interval for the difference in population means and a hypothesis test. The

More information

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING In this lab you will explore the concept of a confidence interval and hypothesis testing through a simulation problem in engineering setting.

More information

C. The null hypothesis is not rejected when the alternative hypothesis is true. A. population parameters.

C. The null hypothesis is not rejected when the alternative hypothesis is true. A. population parameters. Sample Multiple Choice Questions for the material since Midterm 2. Sample questions from Midterms and 2 are also representative of questions that may appear on the final exam.. A randomly selected sample

More information

A POPULATION MEAN, CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

A POPULATION MEAN, CONFIDENCE INTERVALS AND HYPOTHESIS TESTING CHAPTER 5. A POPULATION MEAN, CONFIDENCE INTERVALS AND HYPOTHESIS TESTING 5.1 Concepts When a number of animals or plots are exposed to a certain treatment, we usually estimate the effect of the treatment

More information

II. DISTRIBUTIONS distribution normal distribution. standard scores

II. DISTRIBUTIONS distribution normal distribution. standard scores Appendix D Basic Measurement And Statistics The following information was developed by Steven Rothke, PhD, Department of Psychology, Rehabilitation Institute of Chicago (RIC) and expanded by Mary F. Schmidt,

More information

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as... HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1 PREVIOUSLY used confidence intervals to answer questions such as... You know that 0.25% of women have red/green color blindness. You conduct a study of men

More information

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as... HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1 PREVIOUSLY used confidence intervals to answer questions such as... You know that 0.25% of women have red/green color blindness. You conduct a study of men

More information

UNDERSTANDING THE INDEPENDENT-SAMPLES t TEST

UNDERSTANDING THE INDEPENDENT-SAMPLES t TEST UNDERSTANDING The independent-samples t test evaluates the difference between the means of two independent or unrelated groups. That is, we evaluate whether the means for two independent groups are significantly

More information

Name: Date: Use the following to answer questions 3-4:

Name: Date: Use the following to answer questions 3-4: Name: Date: 1. Determine whether each of the following statements is true or false. A) The margin of error for a 95% confidence interval for the mean increases as the sample size increases. B) The margin

More information

BA 275 Review Problems - Week 5 (10/23/06-10/27/06) CD Lessons: 48, 49, 50, 51, 52 Textbook: pp. 380-394

BA 275 Review Problems - Week 5 (10/23/06-10/27/06) CD Lessons: 48, 49, 50, 51, 52 Textbook: pp. 380-394 BA 275 Review Problems - Week 5 (10/23/06-10/27/06) CD Lessons: 48, 49, 50, 51, 52 Textbook: pp. 380-394 1. Does vigorous exercise affect concentration? In general, the time needed for people to complete

More information

Independent t- Test (Comparing Two Means)

Independent t- Test (Comparing Two Means) Independent t- Test (Comparing Two Means) The objectives of this lesson are to learn: the definition/purpose of independent t-test when to use the independent t-test the use of SPSS to complete an independent

More information

Using Excel for inferential statistics

Using Excel for inferential statistics FACT SHEET Using Excel for inferential statistics Introduction When you collect data, you expect a certain amount of variation, just caused by chance. A wide variety of statistical tests can be applied

More information

General Method: Difference of Means. 3. Calculate df: either Welch-Satterthwaite formula or simpler df = min(n 1, n 2 ) 1.

General Method: Difference of Means. 3. Calculate df: either Welch-Satterthwaite formula or simpler df = min(n 1, n 2 ) 1. General Method: Difference of Means 1. Calculate x 1, x 2, SE 1, SE 2. 2. Combined SE = SE1 2 + SE2 2. ASSUMES INDEPENDENT SAMPLES. 3. Calculate df: either Welch-Satterthwaite formula or simpler df = min(n

More information

3.4 Statistical inference for 2 populations based on two samples

3.4 Statistical inference for 2 populations based on two samples 3.4 Statistical inference for 2 populations based on two samples Tests for a difference between two population means The first sample will be denoted as X 1, X 2,..., X m. The second sample will be denoted

More information

Descriptive Statistics

Descriptive Statistics Descriptive Statistics Primer Descriptive statistics Central tendency Variation Relative position Relationships Calculating descriptive statistics Descriptive Statistics Purpose to describe or summarize

More information

An Introduction to Statistics Course (ECOE 1302) Spring Semester 2011 Chapter 10- TWO-SAMPLE TESTS

An Introduction to Statistics Course (ECOE 1302) Spring Semester 2011 Chapter 10- TWO-SAMPLE TESTS The Islamic University of Gaza Faculty of Commerce Department of Economics and Political Sciences An Introduction to Statistics Course (ECOE 130) Spring Semester 011 Chapter 10- TWO-SAMPLE TESTS Practice

More information

Unit 26: Small Sample Inference for One Mean

Unit 26: Small Sample Inference for One Mean Unit 26: Small Sample Inference for One Mean Prerequisites Students need the background on confidence intervals and significance tests covered in Units 24 and 25. Additional Topic Coverage Additional coverage

More information

Hypothesis testing - Steps

Hypothesis testing - Steps Hypothesis testing - Steps Steps to do a two-tailed test of the hypothesis that β 1 0: 1. Set up the hypotheses: H 0 : β 1 = 0 H a : β 1 0. 2. Compute the test statistic: t = b 1 0 Std. error of b 1 =

More information

Statistics 2014 Scoring Guidelines

Statistics 2014 Scoring Guidelines AP Statistics 2014 Scoring Guidelines College Board, Advanced Placement Program, AP, AP Central, and the acorn logo are registered trademarks of the College Board. AP Central is the official online home

More information

Inference for two Population Means

Inference for two Population Means Inference for two Population Means Bret Hanlon and Bret Larget Department of Statistics University of Wisconsin Madison October 27 November 1, 2011 Two Population Means 1 / 65 Case Study Case Study Example

More information

Chapter 7 Section 7.1: Inference for the Mean of a Population

Chapter 7 Section 7.1: Inference for the Mean of a Population Chapter 7 Section 7.1: Inference for the Mean of a Population Now let s look at a similar situation Take an SRS of size n Normal Population : N(, ). Both and are unknown parameters. Unlike what we used

More information

Online 12 - Sections 9.1 and 9.2-Doug Ensley

Online 12 - Sections 9.1 and 9.2-Doug Ensley Student: Date: Instructor: Doug Ensley Course: MAT117 01 Applied Statistics - Ensley Assignment: Online 12 - Sections 9.1 and 9.2 1. Does a P-value of 0.001 give strong evidence or not especially strong

More information

Having a coin come up heads or tails is a variable on a nominal scale. Heads is a different category from tails.

Having a coin come up heads or tails is a variable on a nominal scale. Heads is a different category from tails. Chi-square Goodness of Fit Test The chi-square test is designed to test differences whether one frequency is different from another frequency. The chi-square test is designed for use with data on a nominal

More information

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96 1 Final Review 2 Review 2.1 CI 1-propZint Scenario 1 A TV manufacturer claims in its warranty brochure that in the past not more than 10 percent of its TV sets needed any repair during the first two years

More information

Psychology 60 Fall 2013 Practice Exam Actual Exam: Next Monday. Good luck!

Psychology 60 Fall 2013 Practice Exam Actual Exam: Next Monday. Good luck! Psychology 60 Fall 2013 Practice Exam Actual Exam: Next Monday. Good luck! Name: 1. The basic idea behind hypothesis testing: A. is important only if you want to compare two populations. B. depends on

More information

Describing Populations Statistically: The Mean, Variance, and Standard Deviation

Describing Populations Statistically: The Mean, Variance, and Standard Deviation Describing Populations Statistically: The Mean, Variance, and Standard Deviation BIOLOGICAL VARIATION One aspect of biology that holds true for almost all species is that not every individual is exactly

More information

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression Objectives: To perform a hypothesis test concerning the slope of a least squares line To recognize that testing for a

More information

Mind on Statistics. Chapter 12

Mind on Statistics. Chapter 12 Mind on Statistics Chapter 12 Sections 12.1 Questions 1 to 6: For each statement, determine if the statement is a typical null hypothesis (H 0 ) or alternative hypothesis (H a ). 1. There is no difference

More information

Two-Sample T-Tests Assuming Equal Variance (Enter Means)

Two-Sample T-Tests Assuming Equal Variance (Enter Means) Chapter 4 Two-Sample T-Tests Assuming Equal Variance (Enter Means) Introduction This procedure provides sample size and power calculations for one- or two-sided two-sample t-tests when the variances of

More information

Regression Analysis: A Complete Example

Regression Analysis: A Complete Example Regression Analysis: A Complete Example This section works out an example that includes all the topics we have discussed so far in this chapter. A complete example of regression analysis. PhotoDisc, Inc./Getty

More information

Introduction. Hypothesis Testing. Hypothesis Testing. Significance Testing

Introduction. Hypothesis Testing. Hypothesis Testing. Significance Testing Introduction Hypothesis Testing Mark Lunt Arthritis Research UK Centre for Ecellence in Epidemiology University of Manchester 13/10/2015 We saw last week that we can never know the population parameters

More information

Section 14 Simple Linear Regression: Introduction to Least Squares Regression

Section 14 Simple Linear Regression: Introduction to Least Squares Regression Slide 1 Section 14 Simple Linear Regression: Introduction to Least Squares Regression There are several different measures of statistical association used for understanding the quantitative relationship

More information

5/31/2013. Chapter 8 Hypothesis Testing. Hypothesis Testing. Hypothesis Testing. Outline. Objectives. Objectives

5/31/2013. Chapter 8 Hypothesis Testing. Hypothesis Testing. Hypothesis Testing. Outline. Objectives. Objectives C H 8A P T E R Outline 8 1 Steps in Traditional Method 8 2 z Test for a Mean 8 3 t Test for a Mean 8 4 z Test for a Proportion 8 6 Confidence Intervals and Copyright 2013 The McGraw Hill Companies, Inc.

More information

Principles of Hypothesis Testing for Public Health

Principles of Hypothesis Testing for Public Health Principles of Hypothesis Testing for Public Health Laura Lee Johnson, Ph.D. Statistician National Center for Complementary and Alternative Medicine johnslau@mail.nih.gov Fall 2011 Answers to Questions

More information

Mind on Statistics. Chapter 13

Mind on Statistics. Chapter 13 Mind on Statistics Chapter 13 Sections 13.1-13.2 1. Which statement is not true about hypothesis tests? A. Hypothesis tests are only valid when the sample is representative of the population for the question

More information

Hypothesis Testing: Two Means, Paired Data, Two Proportions

Hypothesis Testing: Two Means, Paired Data, Two Proportions Chapter 10 Hypothesis Testing: Two Means, Paired Data, Two Proportions 10.1 Hypothesis Testing: Two Population Means and Two Population Proportions 1 10.1.1 Student Learning Objectives By the end of this

More information

Pearson's Correlation Tests

Pearson's Correlation Tests Chapter 800 Pearson's Correlation Tests Introduction The correlation coefficient, ρ (rho), is a popular statistic for describing the strength of the relationship between two variables. The correlation

More information

Statistics Review PSY379

Statistics Review PSY379 Statistics Review PSY379 Basic concepts Measurement scales Populations vs. samples Continuous vs. discrete variable Independent vs. dependent variable Descriptive vs. inferential stats Common analyses

More information

Independent samples t-test. Dr. Tom Pierce Radford University

Independent samples t-test. Dr. Tom Pierce Radford University Independent samples t-test Dr. Tom Pierce Radford University The logic behind drawing causal conclusions from experiments The sampling distribution of the difference between means The standard error of

More information

STAT 145 (Notes) Al Nosedal anosedal@unm.edu Department of Mathematics and Statistics University of New Mexico. Fall 2013

STAT 145 (Notes) Al Nosedal anosedal@unm.edu Department of Mathematics and Statistics University of New Mexico. Fall 2013 STAT 145 (Notes) Al Nosedal anosedal@unm.edu Department of Mathematics and Statistics University of New Mexico Fall 2013 CHAPTER 18 INFERENCE ABOUT A POPULATION MEAN. Conditions for Inference about mean

More information

"Statistical methods are objective methods by which group trends are abstracted from observations on many separate individuals." 1

Statistical methods are objective methods by which group trends are abstracted from observations on many separate individuals. 1 BASIC STATISTICAL THEORY / 3 CHAPTER ONE BASIC STATISTICAL THEORY "Statistical methods are objective methods by which group trends are abstracted from observations on many separate individuals." 1 Medicine

More information

Math 108 Exam 3 Solutions Spring 00

Math 108 Exam 3 Solutions Spring 00 Math 108 Exam 3 Solutions Spring 00 1. An ecologist studying acid rain takes measurements of the ph in 12 randomly selected Adirondack lakes. The results are as follows: 3.0 6.5 5.0 4.2 5.5 4.7 3.4 6.8

More information

11. Analysis of Case-control Studies Logistic Regression

11. Analysis of Case-control Studies Logistic Regression Research methods II 113 11. Analysis of Case-control Studies Logistic Regression This chapter builds upon and further develops the concepts and strategies described in Ch.6 of Mother and Child Health:

More information

Standard Deviation Estimator

Standard Deviation Estimator CSS.com Chapter 905 Standard Deviation Estimator Introduction Even though it is not of primary interest, an estimate of the standard deviation (SD) is needed when calculating the power or sample size of

More information

PRACTICE PROBLEMS FOR BIOSTATISTICS

PRACTICE PROBLEMS FOR BIOSTATISTICS PRACTICE PROBLEMS FOR BIOSTATISTICS BIOSTATISTICS DESCRIBING DATA, THE NORMAL DISTRIBUTION 1. The duration of time from first exposure to HIV infection to AIDS diagnosis is called the incubation period.

More information

Two-Sample T-Tests Allowing Unequal Variance (Enter Difference)

Two-Sample T-Tests Allowing Unequal Variance (Enter Difference) Chapter 45 Two-Sample T-Tests Allowing Unequal Variance (Enter Difference) Introduction This procedure provides sample size and power calculations for one- or two-sided two-sample t-tests when no assumption

More information

WISE Power Tutorial All Exercises

WISE Power Tutorial All Exercises ame Date Class WISE Power Tutorial All Exercises Power: The B.E.A.. Mnemonic Four interrelated features of power can be summarized using BEA B Beta Error (Power = 1 Beta Error): Beta error (or Type II

More information

Introduction to Hypothesis Testing OPRE 6301

Introduction to Hypothesis Testing OPRE 6301 Introduction to Hypothesis Testing OPRE 6301 Motivation... The purpose of hypothesis testing is to determine whether there is enough statistical evidence in favor of a certain belief, or hypothesis, about

More information

Nonparametric Two-Sample Tests. Nonparametric Tests. Sign Test

Nonparametric Two-Sample Tests. Nonparametric Tests. Sign Test Nonparametric Two-Sample Tests Sign test Mann-Whitney U-test (a.k.a. Wilcoxon two-sample test) Kolmogorov-Smirnov Test Wilcoxon Signed-Rank Test Tukey-Duckworth Test 1 Nonparametric Tests Recall, nonparametric

More information

Sample Size and Power in Clinical Trials

Sample Size and Power in Clinical Trials Sample Size and Power in Clinical Trials Version 1.0 May 011 1. Power of a Test. Factors affecting Power 3. Required Sample Size RELATED ISSUES 1. Effect Size. Test Statistics 3. Variation 4. Significance

More information

Comparing Means in Two Populations

Comparing Means in Two Populations Comparing Means in Two Populations Overview The previous section discussed hypothesis testing when sampling from a single population (either a single mean or two means from the same population). Now we

More information

Week 4: Standard Error and Confidence Intervals

Week 4: Standard Error and Confidence Intervals Health Sciences M.Sc. Programme Applied Biostatistics Week 4: Standard Error and Confidence Intervals Sampling Most research data come from subjects we think of as samples drawn from a larger population.

More information

Chapter 8: Hypothesis Testing for One Population Mean, Variance, and Proportion

Chapter 8: Hypothesis Testing for One Population Mean, Variance, and Proportion Chapter 8: Hypothesis Testing for One Population Mean, Variance, and Proportion Learning Objectives Upon successful completion of Chapter 8, you will be able to: Understand terms. State the null and alternative

More information

Introduction to Statistics and Quantitative Research Methods

Introduction to Statistics and Quantitative Research Methods Introduction to Statistics and Quantitative Research Methods Purpose of Presentation To aid in the understanding of basic statistics, including terminology, common terms, and common statistical methods.

More information

Testing Hypotheses About Proportions

Testing Hypotheses About Proportions Chapter 11 Testing Hypotheses About Proportions Hypothesis testing method: uses data from a sample to judge whether or not a statement about a population may be true. Steps in Any Hypothesis Test 1. Determine

More information

NCSS Statistical Software

NCSS Statistical Software Chapter 06 Introduction This procedure provides several reports for the comparison of two distributions, including confidence intervals for the difference in means, two-sample t-tests, the z-test, the

More information

Lecture Notes Module 1

Lecture Notes Module 1 Lecture Notes Module 1 Study Populations A study population is a clearly defined collection of people, animals, plants, or objects. In psychological research, a study population usually consists of a specific

More information

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Final Exam Review MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 1) A researcher for an airline interviews all of the passengers on five randomly

More information

Experimental Design. Power and Sample Size Determination. Proportions. Proportions. Confidence Interval for p. The Binomial Test

Experimental Design. Power and Sample Size Determination. Proportions. Proportions. Confidence Interval for p. The Binomial Test Experimental Design Power and Sample Size Determination Bret Hanlon and Bret Larget Department of Statistics University of Wisconsin Madison November 3 8, 2011 To this point in the semester, we have largely

More information

Introduction to Hypothesis Testing. Hypothesis Testing. Step 1: State the Hypotheses

Introduction to Hypothesis Testing. Hypothesis Testing. Step 1: State the Hypotheses Introduction to Hypothesis Testing 1 Hypothesis Testing A hypothesis test is a statistical procedure that uses sample data to evaluate a hypothesis about a population Hypothesis is stated in terms of the

More information

The correlation coefficient

The correlation coefficient The correlation coefficient Clinical Biostatistics The correlation coefficient Martin Bland Correlation coefficients are used to measure the of the relationship or association between two quantitative

More information

The Dummy s Guide to Data Analysis Using SPSS

The Dummy s Guide to Data Analysis Using SPSS The Dummy s Guide to Data Analysis Using SPSS Mathematics 57 Scripps College Amy Gamble April, 2001 Amy Gamble 4/30/01 All Rights Rerserved TABLE OF CONTENTS PAGE Helpful Hints for All Tests...1 Tests

More information

Association Between Variables

Association Between Variables Contents 11 Association Between Variables 767 11.1 Introduction............................ 767 11.1.1 Measure of Association................. 768 11.1.2 Chapter Summary.................... 769 11.2 Chi

More information

DATA INTERPRETATION AND STATISTICS

DATA INTERPRETATION AND STATISTICS PholC60 September 001 DATA INTERPRETATION AND STATISTICS Books A easy and systematic introductory text is Essentials of Medical Statistics by Betty Kirkwood, published by Blackwell at about 14. DESCRIPTIVE

More information

How To Check For Differences In The One Way Anova

How To Check For Differences In The One Way Anova MINITAB ASSISTANT WHITE PAPER This paper explains the research conducted by Minitab statisticians to develop the methods and data checks used in the Assistant in Minitab 17 Statistical Software. One-Way

More information

Math 58. Rumbos Fall 2008 1. Solutions to Review Problems for Exam 2

Math 58. Rumbos Fall 2008 1. Solutions to Review Problems for Exam 2 Math 58. Rumbos Fall 2008 1 Solutions to Review Problems for Exam 2 1. For each of the following scenarios, determine whether the binomial distribution is the appropriate distribution for the random variable

More information

Biostatistics: Types of Data Analysis

Biostatistics: Types of Data Analysis Biostatistics: Types of Data Analysis Theresa A Scott, MS Vanderbilt University Department of Biostatistics theresa.scott@vanderbilt.edu http://biostat.mc.vanderbilt.edu/theresascott Theresa A Scott, MS

More information

How To Test For Significance On A Data Set

How To Test For Significance On A Data Set Non-Parametric Univariate Tests: 1 Sample Sign Test 1 1 SAMPLE SIGN TEST A non-parametric equivalent of the 1 SAMPLE T-TEST. ASSUMPTIONS: Data is non-normally distributed, even after log transforming.

More information

MONT 107N Understanding Randomness Solutions For Final Examination May 11, 2010

MONT 107N Understanding Randomness Solutions For Final Examination May 11, 2010 MONT 07N Understanding Randomness Solutions For Final Examination May, 00 Short Answer (a) (0) How are the EV and SE for the sum of n draws with replacement from a box computed? Solution: The EV is n times

More information

Statistical estimation using confidence intervals

Statistical estimation using confidence intervals 0894PP_ch06 15/3/02 11:02 am Page 135 6 Statistical estimation using confidence intervals In Chapter 2, the concept of the central nature and variability of data and the methods by which these two phenomena

More information

Analysis of Variance ANOVA

Analysis of Variance ANOVA Analysis of Variance ANOVA Overview We ve used the t -test to compare the means from two independent groups. Now we ve come to the final topic of the course: how to compare means from more than two populations.

More information

UNDERSTANDING THE DEPENDENT-SAMPLES t TEST

UNDERSTANDING THE DEPENDENT-SAMPLES t TEST UNDERSTANDING THE DEPENDENT-SAMPLES t TEST A dependent-samples t test (a.k.a. matched or paired-samples, matched-pairs, samples, or subjects, simple repeated-measures or within-groups, or correlated groups)

More information

SOLUTIONS TO BIOSTATISTICS PRACTICE PROBLEMS

SOLUTIONS TO BIOSTATISTICS PRACTICE PROBLEMS SOLUTIONS TO BIOSTATISTICS PRACTICE PROBLEMS BIOSTATISTICS DESCRIBING DATA, THE NORMAL DISTRIBUTION SOLUTIONS 1. a. To calculate the mean, we just add up all 7 values, and divide by 7. In Xi i= 1 fancy

More information

Parametric and non-parametric statistical methods for the life sciences - Session I

Parametric and non-parametric statistical methods for the life sciences - Session I Why nonparametric methods What test to use? Rank Tests Parametric and non-parametric statistical methods for the life sciences - Session I Liesbeth Bruckers Geert Molenberghs Interuniversity Institute

More information

In the past, the increase in the price of gasoline could be attributed to major national or global

In the past, the increase in the price of gasoline could be attributed to major national or global Chapter 7 Testing Hypotheses Chapter Learning Objectives Understanding the assumptions of statistical hypothesis testing Defining and applying the components in hypothesis testing: the research and null

More information

Fairfield Public Schools

Fairfield Public Schools Mathematics Fairfield Public Schools AP Statistics AP Statistics BOE Approved 04/08/2014 1 AP STATISTICS Critical Areas of Focus AP Statistics is a rigorous course that offers advanced students an opportunity

More information

Correlational Research

Correlational Research Correlational Research Chapter Fifteen Correlational Research Chapter Fifteen Bring folder of readings The Nature of Correlational Research Correlational Research is also known as Associational Research.

More information

KSTAT MINI-MANUAL. Decision Sciences 434 Kellogg Graduate School of Management

KSTAT MINI-MANUAL. Decision Sciences 434 Kellogg Graduate School of Management KSTAT MINI-MANUAL Decision Sciences 434 Kellogg Graduate School of Management Kstat is a set of macros added to Excel and it will enable you to do the statistics required for this course very easily. To

More information

LAB : THE CHI-SQUARE TEST. Probability, Random Chance, and Genetics

LAB : THE CHI-SQUARE TEST. Probability, Random Chance, and Genetics Period Date LAB : THE CHI-SQUARE TEST Probability, Random Chance, and Genetics Why do we study random chance and probability at the beginning of a unit on genetics? Genetics is the study of inheritance,

More information

Testing a claim about a population mean

Testing a claim about a population mean Introductory Statistics Lectures Testing a claim about a population mean One sample hypothesis test of the mean Department of Mathematics Pima Community College Redistribution of this material is prohibited

More information

QUANTITATIVE METHODS BIOLOGY FINAL HONOUR SCHOOL NON-PARAMETRIC TESTS

QUANTITATIVE METHODS BIOLOGY FINAL HONOUR SCHOOL NON-PARAMETRIC TESTS QUANTITATIVE METHODS BIOLOGY FINAL HONOUR SCHOOL NON-PARAMETRIC TESTS This booklet contains lecture notes for the nonparametric work in the QM course. This booklet may be online at http://users.ox.ac.uk/~grafen/qmnotes/index.html.

More information

CONTENTS OF DAY 2. II. Why Random Sampling is Important 9 A myth, an urban legend, and the real reason NOTES FOR SUMMER STATISTICS INSTITUTE COURSE

CONTENTS OF DAY 2. II. Why Random Sampling is Important 9 A myth, an urban legend, and the real reason NOTES FOR SUMMER STATISTICS INSTITUTE COURSE 1 2 CONTENTS OF DAY 2 I. More Precise Definition of Simple Random Sample 3 Connection with independent random variables 3 Problems with small populations 8 II. Why Random Sampling is Important 9 A myth,

More information

3. There are three senior citizens in a room, ages 68, 70, and 72. If a seventy-year-old person enters the room, the

3. There are three senior citizens in a room, ages 68, 70, and 72. If a seventy-year-old person enters the room, the TMTA Statistics Exam 2011 1. Last month, the mean and standard deviation of the paychecks of 10 employees of a small company were $1250 and $150, respectively. This month, each one of the 10 employees

More information

Introduction to. Hypothesis Testing CHAPTER LEARNING OBJECTIVES. 1 Identify the four steps of hypothesis testing.

Introduction to. Hypothesis Testing CHAPTER LEARNING OBJECTIVES. 1 Identify the four steps of hypothesis testing. Introduction to Hypothesis Testing CHAPTER 8 LEARNING OBJECTIVES After reading this chapter, you should be able to: 1 Identify the four steps of hypothesis testing. 2 Define null hypothesis, alternative

More information

Module 5: Multiple Regression Analysis

Module 5: Multiple Regression Analysis Using Statistical Data Using to Make Statistical Decisions: Data Multiple to Make Regression Decisions Analysis Page 1 Module 5: Multiple Regression Analysis Tom Ilvento, University of Delaware, College

More information

Section 7.1. Introduction to Hypothesis Testing. Schrodinger s cat quantum mechanics thought experiment (1935)

Section 7.1. Introduction to Hypothesis Testing. Schrodinger s cat quantum mechanics thought experiment (1935) Section 7.1 Introduction to Hypothesis Testing Schrodinger s cat quantum mechanics thought experiment (1935) Statistical Hypotheses A statistical hypothesis is a claim about a population. Null hypothesis

More information

Hypothesis Testing --- One Mean

Hypothesis Testing --- One Mean Hypothesis Testing --- One Mean A hypothesis is simply a statement that something is true. Typically, there are two hypotheses in a hypothesis test: the null, and the alternative. Null Hypothesis The hypothesis

More information

Simple Regression Theory II 2010 Samuel L. Baker

Simple Regression Theory II 2010 Samuel L. Baker SIMPLE REGRESSION THEORY II 1 Simple Regression Theory II 2010 Samuel L. Baker Assessing how good the regression equation is likely to be Assignment 1A gets into drawing inferences about how close the

More information

HYPOTHESIS TESTING: POWER OF THE TEST

HYPOTHESIS TESTING: POWER OF THE TEST HYPOTHESIS TESTING: POWER OF THE TEST The first 6 steps of the 9-step test of hypothesis are called "the test". These steps are not dependent on the observed data values. When planning a research project,

More information

Difference of Means and ANOVA Problems

Difference of Means and ANOVA Problems Difference of Means and Problems Dr. Tom Ilvento FREC 408 Accounting Firm Study An accounting firm specializes in auditing the financial records of large firm It is interested in evaluating its fee structure,particularly

More information

Tutorial 5: Hypothesis Testing

Tutorial 5: Hypothesis Testing Tutorial 5: Hypothesis Testing Rob Nicholls nicholls@mrc-lmb.cam.ac.uk MRC LMB Statistics Course 2014 Contents 1 Introduction................................ 1 2 Testing distributional assumptions....................

More information

Test Positive True Positive False Positive. Test Negative False Negative True Negative. Figure 5-1: 2 x 2 Contingency Table

Test Positive True Positive False Positive. Test Negative False Negative True Negative. Figure 5-1: 2 x 2 Contingency Table ANALYSIS OF DISCRT VARIABLS / 5 CHAPTR FIV ANALYSIS OF DISCRT VARIABLS Discrete variables are those which can only assume certain fixed values. xamples include outcome variables with results such as live

More information

The Wilcoxon Rank-Sum Test

The Wilcoxon Rank-Sum Test 1 The Wilcoxon Rank-Sum Test The Wilcoxon rank-sum test is a nonparametric alternative to the twosample t-test which is based solely on the order in which the observations from the two samples fall. We

More information

Chapter 8 Hypothesis Testing Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing

Chapter 8 Hypothesis Testing Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing Chapter 8 Hypothesis Testing 1 Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing 8-3 Testing a Claim About a Proportion 8-5 Testing a Claim About a Mean: s Not Known 8-6 Testing

More information

Stat 411/511 THE RANDOMIZATION TEST. Charlotte Wickham. stat511.cwick.co.nz. Oct 16 2015

Stat 411/511 THE RANDOMIZATION TEST. Charlotte Wickham. stat511.cwick.co.nz. Oct 16 2015 Stat 411/511 THE RANDOMIZATION TEST Oct 16 2015 Charlotte Wickham stat511.cwick.co.nz Today Review randomization model Conduct randomization test What about CIs? Using a t-distribution as an approximation

More information

Answer: C. The strength of a correlation does not change if units change by a linear transformation such as: Fahrenheit = 32 + (5/9) * Centigrade

Answer: C. The strength of a correlation does not change if units change by a linear transformation such as: Fahrenheit = 32 + (5/9) * Centigrade Statistics Quiz Correlation and Regression -- ANSWERS 1. Temperature and air pollution are known to be correlated. We collect data from two laboratories, in Boston and Montreal. Boston makes their measurements

More information

Chapter 23 Inferences About Means

Chapter 23 Inferences About Means Chapter 23 Inferences About Means Chapter 23 - Inferences About Means 391 Chapter 23 Solutions to Class Examples 1. See Class Example 1. 2. We want to know if the mean battery lifespan exceeds the 300-minute

More information

Chapter 26: Tests of Significance

Chapter 26: Tests of Significance Chapter 26: Tests of Significance Procedure: 1. State the null and alternative in words and in terms of a box model. 2. Find the test statistic: z = observed EV. SE 3. Calculate the P-value: The area under

More information

2 Precision-based sample size calculations

2 Precision-based sample size calculations Statistics: An introduction to sample size calculations Rosie Cornish. 2006. 1 Introduction One crucial aspect of study design is deciding how big your sample should be. If you increase your sample size

More information