3.4 Statistical inference for 2 populations based on two samples

Size: px
Start display at page:

Download "3.4 Statistical inference for 2 populations based on two samples"

Transcription

1 3.4 Statistical inference for 2 populations based on two samples Tests for a difference between two population means The first sample will be denoted as X 1, X 2,..., X m. The second sample will be denoted as Y 1, Y 2,..., Y n. The population means of the populations from which these samples are taken are denoted by µ X and µ Y, respectively. 1 / 63

2 Statistical inference for 2 populations based on two samples We consider 2 cases 1. Dependent samples - in this case we have n pairs of observations (X 1, Y 1 ),..., (X n, Y n ). 2. Two unrelated samples (X 1,..., X m ) and (Y 1,..., Y n ). 2 / 63

3 3.4.1 Tests for the difference between two population means - dependent samples Each pair of observations comes from either one individual under two different conditions e.g. the weights of a group before (X ) and after (Y ) a diet. or from two related sources e.g. the height of a father (X ) and his son (Y ). 3 / 63

4 Tests for the difference between two population means - dependent samples We wish to test the hypothesis H 0 : µ X = µ Y In the first example, this hypothesis states that the diet has no effect on weight. In the second example, this hypothesis states that on average fathers are as tall as their sons. 4 / 63

5 One and two-sided tests As before, we consider both one and two-sided tests. In a two-sided test the alternative is H A : µ X µ Y In the first example this states that the diet has some effect on weight. In the second example this states that the average height of fathers differs from the average height of their sons. 5 / 63

6 One and two-sided tests The alternative in a one-sided test may be of the form H A : µ X > µ Y. In the first example this states that the diet on average causes weight loss. In the second example this states that the average height of fathers is greater than the average height of their sons. 6 / 63

7 One and two-sided tests The alternative may be of the form H A : µ X < µ Y. In the first example this states that the diet on average causes a gain in weight. In the second example, this states that the average height of sons is greater than the average height of their fathers. 7 / 63

8 Testing procedure for 2 dependent samples When the two samples are dependent, we calculate the differences D i = X i Y i. We treat these differences as one sample and carry out the appropriate one sample test. Let µ D be the population mean of this difference. We have µ D = µ X µ Y. 8 / 63

9 Testing procedure for 2 dependent samples The null hypothesis corresponds to The alternatives H 0 : µ X = µ Y H 0 : µ D = 0. H A : µ X µ Y ; H A : µ X > µ Y ; H A : µ X < µ Y correspond to H A : µ D 0; H A : µ D > 0; H A : µ D < 0, respectively. 9 / 63

10 Testing procedure for 2 dependent samples Suppose a company promoting a diet stated that on average a person loses 4kg on this diet, this hypothesis is H 0 : µ X µ Y = 4 H 0 : µ D = 4. The alternative in this case would be that the diet is not as effective as the company states i.e. H A : µ X µ Y < 4 H A : µ D < / 63

11 Testing procedure for 2 dependent samples The test statistic for the null hypothesis H 0 : µ D = µ 0 is T = D µ 0 S.E.(D), where D is the mean of the sample of differences. This is simply the statistic used for a one-sample test for a population mean. If the sample size is large (n > 30), then this statistic has approximately a standard normal distribution. It should be noted that if the sample size is small, then this test assumes that the differences come from a normal distribution. If this condition is satisfied, then the test statistic has a Student t-distribution with n 1 degrees of freedom. 11 / 63

12 Example athletes run 400m both at sea level and at altitude. Their times are given below. Test the hypothesis that altitude does not affect the average times of runners at a significance level of 5%. Runner Sea Level Altitude / 63

13 Example Since two times are given for each athlete, these samples are dependent. We calculate the differences D i = X i Y i, where X i and Y i are the times of the i-th athlete at sea level and altitude, respectively. The sample of differences is given by Runner Sea Level Altitude Difference / 63

14 Example i) The hypotheses are ii) The test statistic is H 0 : µ D = 0 against H A : µ D 0. T = D µ 0 S.E.(D), where µ 0 is the mean difference according to the null hypothesis, here µ 0 = / 63

15 Example iii) We calculate the realisation of the test statistic D= 8 s 2 = 1 n (D i D) 2 n 1 i=1 = 0.2 = ( )2 + ( ) ( ) 2 7 s= Hence, S.E.(D) s n = t= / 63

16 Example iv) Since the sample size is small, if these differences come from a normal distribution, this statistic has a Student t-distribution with 7 degrees of freedom. The test is two sided, thus the critical value is t n 1,α/2 = t 7,0.025 = v) Since t = 2.49 > t n 1,α/2 = 2.365, we reject H 0. We conclude that altitude affects the runners times. 16 / 63

17 Use of duality for two-sided tests We can also use the duality between confidence intervals and two sided tests. In this case since the significance level is 5%, we calculate a 95% confidence interval for the mean difference. Since the samples are dependent, we treat the differences as one sample and use the appropriate formula to calculate a confidence interval for the true mean difference (in the appropriate population, e.g. all those on the given diet). If this interval contains the value from the null hypothesis, then we do not reject H / 63

18 Use of duality for two-sided tests Since the sample is small, this formula is This gives D ± t n 1,α/2 S.E.(D) = D ± st n 1,α/2 n 0.2 ± = 0.2 ± 0.19 = [0.01, 0.39] We are testing the hypothesis H 0 : µ D = 0. Since 0 does not belong to this confidence interval, we reject H 0 at a significance level of 5%. We have evidence that altitude affects runners times. 18 / 63

19 Assumptions of this test It should be noted that this test assumes that the differences come from a normal distribution. It is not clear whether this is satisfied. Since, the realisation of the test statistic is close to the critical value, we should be somewhat sceptical of our conclusion (more data should be collected). 19 / 63

20 3.4.2 Tests for the difference between two population means: independent samples In this case the two samples come from two unrelated populations. e.g. the height of Americans and Irish, the times of two different groups of runners. We consider two cases 1. Large sample tests (both samples have at least 30 observations). 2. Small sample tests. 20 / 63

21 Large Samples Assume that we have samples (X 1,..., X m ) and (Y 1,..., Y n ) from populations with population means µ X and µ Y (where m and n are at least 30). We use the difference between the two sample means, X Y, to estimate the difference between the two population means, µ X µ Y. The standard error of this estimate is σx 2 S.E.(X Y ) = m + σ2 Y n. This is approximated using S.E.(X Y ) s 2 X m + s2 Y n. 21 / 63

22 Large Samples Suppose we wish to test the hypothesis that H 0 : µ X µ Y = d i.e. the difference between the two population means is d. In two-tailed tests the alternative is H A : µ X µ Y d. If the test is two-tailed, we can always label the two samples in such a way that the alternative is H A : µ X µ Y > d 22 / 63

23 Large Samples When both samples are large (m, n > 30), the test statistic is Z = (X Y ) d S.E.(X Y ) This statistic has approximately a standard normal distribution. Critical values and p-values are calculated in the same way as in one sample tests. i.e. The p-value for a two sided test is p = P( Z > t ) = 2P(Z > t ). The p-value for a one sided test is p = P(Z > t). 23 / 63

24 Large samples The critical value for a two sided test is Z α/2 = t,α/2. H 0 is rejected if and only if t > t,α/2. The critical value for a one sided test is Z α = t,α. H 0 is rejected if and only if t > t,α. It should be noted that, as before, the realisation of the test statistic is a measure of the distance between the data and H 0. e.g. when the difference between the sample means is much greater than d, then the realisation of the test statistic will be much greater than / 63

25 Example The average height of 100 Dutch men is 176cm and their standard deviation 12cm. The average height of 50 Japanese men is 169cm and their standard deviation is 10cm. Test at a significance level of 1% the hypothesis that the average heights of Dutch men and Japanese men are equal. 25 / 63

26 Example i) We have H 0 : µ X µ Y = 0; H A : µ X µ Y 0, where µ X is the mean height of all Dutch men and µ Y the mean height of all Japanese men. 26 / 63

27 Example ii) We use the test statistic Z= X Y S.E.(X Y ) s 2 X S.E.(X Y ) m + s2 Y n 12 2 = = The realisation of this test statistic is t = / 63

28 Example iv) From the table for the standard normal distribution, the p-value for this test is p = 2P(Z > 3.77) = v) Since p < α = 0.01, we reject H 0 at a significance level of 1%. Also, since p < 0.001, we have very strong evidence that the mean heights of Dutchmen and Japanese differ. From the data we may state that Dutchmen are taller on average than Japanese men. 28 / 63

29 Example iv) We can also use the appropriate critical value. Since both samples are large and this is a two sided test, this value is given by Z α/2 = t,α/2 = v) Since t = 3.77 > t,α/2 = 2.576, we reject H 0 at a significance level of 1%. We have strong evidence that the mean heights of Dutchmen and Japanese differ. 29 / 63

30 Duality for two independent samples When both samples are large, the 100(1 α)% confidence interval for the difference between two population means is (X Y ) ± t,α/2 S.E.(X Y ) We can use the duality between confidence intervals and two sided tests. 30 / 63

31 Example Calculate a 95% confidence interval for the difference between the mean height of Dutch and Japanese men. Test the hypothesis that on average Dutch men are 10cm taller than Japanese men (data from previous example). We are testing H 0 : µ X µ Y = 10 against H A : µ X µ Y / 63

32 Example Since the samples are large, the confidence interval for the difference between the population means is (X Y )±t,α/2 S.E.(X Y ) t,α/2 =t,0.025 = 1.96 Hence, the confidence interval is ( ) ± = 7 ± 3.64 = [3.36, 10.64] Since 10 [3.36, 10.64], we do not reject H 0 at a significance level of 5%. The is no evidence against the hypothesis that on average Dutchmen are 10cm taller than Japanese men. 32 / 63

33 Small sample tests In the case where at least one of the samples is small, the test for the difference between two population means assumes that the observations come from normal distributions with equal variances (i.e. σ 2 X = σ2 Y = σ2 ). 33 / 63

34 Test for equality of variances Before we carry out the test for a difference between two population means, we should carry out an F test for the equality of two variances. We test H 0 : σ 2 X = σ2 Y against H A : σ 2 X σ2 Y The test statistic, F, is the ratio between the two sample variances. F = max{s2 X, s2 Y } min{s 2 X, s2 Y }. When this ratio is close to one we do not reject the null hypothesis that the population variances are equal. Ratios much greater than 1 indicate that the null hypothesis is not true. 34 / 63

35 Test for equality of variances Suppose the observations in both samples come from a normal distribution. F has an F distribution with j 1 and k 1 degrees of freedom, where j and k are the number of observations in the sample with the largest and smallest variance, respectively. We reject H 0 if and only if the realisation of the test statistic, f, satisfies f > F j 1,k 1,α/2, where P(F > F j 1,k 1,α/2 = α/2). 35 / 63

36 Test for equality of variances Critical values of the F j 1,k 1,p are given in Table 9. j 1 and k 1 correspond to the column and row number, respectively. This test is normally carried out at a significance level of 5%. Each cell contains 4 critical values. The first is for p = 0.05, the second (in brackets) for p = (this is the appropriate value), the third for p = 0.01 and the fourth for p = Note that if the two variances are not equal, the assumptions of the test for a difference between two means, presented below, do not hold. The appropriate procedure in this case is not covered in the course. 36 / 63

37 Test for difference between two means (small samples) Given the hypothesis regarding the equality of variances was not rejected, we use a pooled estimate of the variances, s 2 p, where s 2 p = (m 1)s2 X + (n 1)s2 Y m + n 2 This is a weighted average of the sample variances, in which the sample with the largest number of observations has the largest weight. The standard error of the difference between the sample means is. S.E.(X Y ) = s p 1 m + 1 n 37 / 63

38 Test for difference between two means (small samples) Suppose we wish to test the null hypothesis The test statistic used is H 0 : µ X µ Y = d. T = (X Y ) d S.E.(X Y ). Given the assumptions of the test are satisfied (the observations come from normal distributions with a common variance), then this statistic has a student t distribution with m + n 2 degrees of freedom. A 100(1 α)% confidence interval for the difference between the two population means, µ X µ Y, is given by (X Y ) ± t m+n 2,α/2 S.E.(X Y ). 38 / 63

39 Test for difference between two means (small samples) The critical value for the two sided test with is t m+n 2,α/2. H A : µ X µ Y d. We reject H 0 iff t > t m+n 2,α/2. If the test is two sided, we can always label the two samples such that the alternative is of the form H A : µ X µ Y > d. The critical value for such a test is t m+n 2,α. We reject H 0 iff t > t m+n 2,α. 39 / 63

40 Example The average height of 13 Dutch men is 176cm and their standard deviation 12cm. The average height of 11 Japanese men is 169cm and their standard deviation is 10cm. Test at a significance level of 5% the hypothesis that the average heights of Dutch men and Japanese men are equal. 40 / 63

41 Example Since the sample sizes are small, we first test the assumption that the population variances are equal. i) We have ii) The test statistic is H 0 : σ 2 X = σ2 Y against H A : σ 2 X σ2 Y F = max{s2 X, s2 Y } min{s 2 X, s2 Y }. iii) The realisation of this test statistic is f = max{122, 10 2 } min{12 2, 10 2 } = / 63

42 Example iv) We read the appropriate critical value. Since there are 13 observations in the sample with the greatest variance, j 1 = 12. Similarly, k 1 = 10. Since α = 0.05, the critical value is F 12,10,0.025 = v) Since f < F 12,10,0.025 = 3.62, we do not reject H 0. Hence, we may assume that the two population variances are equal. 42 / 63

43 Example We now proceed to test the hypothesis regarding the equality of the two means. i) We have H 0 : µ X µ Y = 0 against H A : µ X µ Y 0. ii) The test statistic for this test is where T = X Y S.E.(X Y ), S.E.(X Y )=s p 1 m + 1 n s 2 p= (m 1)s2 X + (n 1)s2 Y m + n / 63

44 Example iii) We calculate the realisation of the test statistic. The pooled variance is s 2 p = (13 1) (11 1) = 124. The standard error of the difference between the two sample means is 1 S.E.(X Y )=s p m + 1 n 1 = The realisation of the test statistic is t = / 63

45 Example iv) The critical value for the test is t m+n 2,α/2 = t 22,0.025 = v) Since t = 1.53 < t m+n 2,α/2 = 2.074, we do not reject H 0. There is no evidence that the mean height of Dutchmen differs from the mean height of Japanese. 45 / 63

46 Use of duality It should be noted that we can carry out this test using the duality between two sided tests and confidence intervals. The formula for a 100(1 α)% confidence interval for the difference between two population means when the sample sizes are small is (X Y ) ± t m+n 2,α/2 S.E.(X Y ). Since we are carrying out a test at a significance level of 5%, we calculate a 95% confidence interval. This is given by ( ) ± t 22,0.025 S.E.(X Y )=7 ± =7 ± 9.46 = [ 2.46, 16.46] 46 / 63

47 Use of duality Since we are testing H 0 : µ X µ Y = 0, We reject the null hypothesis if and only if 0 does not belong to this confidence interval. Since 0 belongs to this confidence interval, we do not reject H 0 at a significance level of 5%. There is no evidence that the mean height of Dutchmen differs from the mean height of Japanese. 47 / 63

48 Confidence intervals for the difference between two population proportions Suppose we have two independent, large samples from distinct populations. Suppose the i-th sample has n i observations and the number of individuals showing the trait of interest in the i-th sample is x i. Let the proportion of individuals exhibiting these traits in the i-th population be p i and the proportion of individuals exhibiting these traits in the i-th sample be ˆp i, where ˆp i = x i n i. The difference between the two sample proportions is used to estimate the difference between the two population proportions. 48 / 63

49 Confidence intervals for the difference between two population proportions The standard error of the difference between the two sample proportions is p 1 (1 p 1 ) S.E.(ˆp 1 ˆp 2 ) = + p 2(1 p 2 ) n 1 n 2 As before, the standard error of this difference depends on the (unknown) population proportions. When we calculate a confidence interval for the difference between population proportions, this standard error can be approximated using ˆp 1 (1 ˆp 1 ) S.E.(ˆp 1 ˆp 2 ) = + ˆp 2(1 ˆp 2 ). n 1 n 2 49 / 63

50 Confidence intervals for the difference between two population proportions An approximate 100(1 α)% confidence interval for the difference between two population proportions is given by (ˆp 1 ˆp 2 ) ± t m+n 2,α/2 S.E.(ˆp 1 ˆp 2 ) Note that this procedure should not be used e.g. to calculate a confidence interval for the difference between the level of support of two political parties in a single population. 50 / 63

51 Example of 300 male applicants for an engineering course were accepted and 40 of 80 female applicants. Calculate a 95% confidence interval for the difference in the proportion of males and females accepted for the course. 51 / 63

52 Example The sample proportions are ˆp 1 = = 0.4; ˆp 2 = = 0.5 The estimate of the standard error of the difference between the two proportions is given by ˆp 1 (1 ˆp 1 ) S.E.(ˆp 1 ˆp 2 )= + ˆp 2(1 ˆp 2 ) n 1 n = / 63

53 Example The confidence interval for the difference between the two proportions is (ˆp 1 ˆp 2 ) ± t,α/2 S.E.(ˆp 1 ˆp 2 )=0.1 ± =0.1 ± = [ 0.023, 0.223]. 53 / 63

54 Hypothesis testing Suppose we want to test the hypothesis The test statistic for this test is H 0 : p 1 = p 2. Z = (ˆp 1 ˆp 2 ) S.E.(ˆp 1 ˆp 2 ). This statistic has approximately a standard normal distribution. Let t be the realisation of this statistic. 54 / 63

55 Hypothesis testing In order to estimate the standard error of the difference between the two sample proportions under the null hypothesis, we calculate the pooled proportion p. This is the total number of individuals with the trait in both samples divided by the total number of individuals in both samples p = x 1 + x 2 n 1 + n 2. We have [ 1 S.E.(ˆp 1 ˆp 2 ) p(1 p) + 1 ]. n 1 n 2 55 / 63

56 Hypothesis testing For two tailed tests with H A : p 1 p 2 the critical value is Z α/2 = t,α/2. We reject H 0 if and only if t > t,α/2. The p-value is 2P(Z > t ). 56 / 63

57 Hypothesis testing In the case of one tailed tests, we can always number the samples such that the alternative is The critical value is Z α = t,α. H A : p 1 > p 2. We reject H 0 if and only if t > t,α. The p-value is P(Z > t). 57 / 63

58 Example of 300 male applicants for an engineering course were accepted and 40 of 80 female applicants. Test at a significance level of 5% the hypothesis that the proportion of males accepted equals the proportion of females accepted. 58 / 63

59 Example The hypotheses are ii) The statistic used is H 0 : p 1 = p 2 against H A : p 1 p 2 Z = ˆp 1 ˆp 2 S.E.(ˆp 1 ˆp 2 ). 59 / 63

60 Example iii) We calculate the realisation of the test statistic. The pooled proportion is p = x 1 + x = n 1 + n The estimate of the standard error of the difference between the two sample proportions under H 0 is [ 1 S.E.(ˆp 1 ˆp 2 ) p(1 p) + 1 ] n 1 n 2 [ 1 = ] / 63

61 Example The sample proportions are ˆp 1 = = 0.4; ˆp 2 = = 0.5. The realisation of the test statistic is t = iv) The critical value for this test is t,0.025 = v) Since t = 1.61 < t,0.025 = 1.96, we do not reject H 0 at a significance level of 5%. There is no evidence that the admission rates vary according to sex. 61 / 63

62 Example In iv) we could calculate the p-value p = 2P(Z > t ) = 2P(Z > 1.61) = = Since p > 0.05, we do not reject H 0. There is no evidence that the admission rates vary. 62 / 63

63 Example As in the one sample case, we can use the duality between two-sided hypothesis tests and confidence intervals. Since different approximations are used to estimate the standard error of the difference between the sample proportions, this duality is only approximate. Suppose we wish to test H 0 : p 1 = p 2 (i.e. p1 p2 = 0). In this example the 95% confidence interval for the difference between the two proportions was [-0.023, 0.223]. Since 0 belongs to this confidence interval, we do not reject H 0 at a significance level of approximately 5%. There is no evidence that the admission rates vary. 63 / 63

4. Continuous Random Variables, the Pareto and Normal Distributions

4. Continuous Random Variables, the Pareto and Normal Distributions 4. Continuous Random Variables, the Pareto and Normal Distributions A continuous random variable X can take any value in a given range (e.g. height, weight, age). The distribution of a continuous random

More information

HYPOTHESIS TESTING: POWER OF THE TEST

HYPOTHESIS TESTING: POWER OF THE TEST HYPOTHESIS TESTING: POWER OF THE TEST The first 6 steps of the 9-step test of hypothesis are called "the test". These steps are not dependent on the observed data values. When planning a research project,

More information

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING In this lab you will explore the concept of a confidence interval and hypothesis testing through a simulation problem in engineering setting.

More information

THE FIRST SET OF EXAMPLES USE SUMMARY DATA... EXAMPLE 7.2, PAGE 227 DESCRIBES A PROBLEM AND A HYPOTHESIS TEST IS PERFORMED IN EXAMPLE 7.

THE FIRST SET OF EXAMPLES USE SUMMARY DATA... EXAMPLE 7.2, PAGE 227 DESCRIBES A PROBLEM AND A HYPOTHESIS TEST IS PERFORMED IN EXAMPLE 7. THERE ARE TWO WAYS TO DO HYPOTHESIS TESTING WITH STATCRUNCH: WITH SUMMARY DATA (AS IN EXAMPLE 7.17, PAGE 236, IN ROSNER); WITH THE ORIGINAL DATA (AS IN EXAMPLE 8.5, PAGE 301 IN ROSNER THAT USES DATA FROM

More information

An Introduction to Statistics Course (ECOE 1302) Spring Semester 2011 Chapter 10- TWO-SAMPLE TESTS

An Introduction to Statistics Course (ECOE 1302) Spring Semester 2011 Chapter 10- TWO-SAMPLE TESTS The Islamic University of Gaza Faculty of Commerce Department of Economics and Political Sciences An Introduction to Statistics Course (ECOE 130) Spring Semester 011 Chapter 10- TWO-SAMPLE TESTS Practice

More information

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as... HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1 PREVIOUSLY used confidence intervals to answer questions such as... You know that 0.25% of women have red/green color blindness. You conduct a study of men

More information

Estimation of σ 2, the variance of ɛ

Estimation of σ 2, the variance of ɛ Estimation of σ 2, the variance of ɛ The variance of the errors σ 2 indicates how much observations deviate from the fitted surface. If σ 2 is small, parameters β 0, β 1,..., β k will be reliably estimated

More information

BA 275 Review Problems - Week 6 (10/30/06-11/3/06) CD Lessons: 53, 54, 55, 56 Textbook: pp. 394-398, 404-408, 410-420

BA 275 Review Problems - Week 6 (10/30/06-11/3/06) CD Lessons: 53, 54, 55, 56 Textbook: pp. 394-398, 404-408, 410-420 BA 275 Review Problems - Week 6 (10/30/06-11/3/06) CD Lessons: 53, 54, 55, 56 Textbook: pp. 394-398, 404-408, 410-420 1. Which of the following will increase the value of the power in a statistical test

More information

Chapter 2. Hypothesis testing in one population

Chapter 2. Hypothesis testing in one population Chapter 2. Hypothesis testing in one population Contents Introduction, the null and alternative hypotheses Hypothesis testing process Type I and Type II errors, power Test statistic, level of significance

More information

Hypothesis testing - Steps

Hypothesis testing - Steps Hypothesis testing - Steps Steps to do a two-tailed test of the hypothesis that β 1 0: 1. Set up the hypotheses: H 0 : β 1 = 0 H a : β 1 0. 2. Compute the test statistic: t = b 1 0 Std. error of b 1 =

More information

Two Related Samples t Test

Two Related Samples t Test Two Related Samples t Test In this example 1 students saw five pictures of attractive people and five pictures of unattractive people. For each picture, the students rated the friendliness of the person

More information

Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1)

Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1) Spring 204 Class 9: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.) Big Picture: More than Two Samples In Chapter 7: We looked at quantitative variables and compared the

More information

Practice problems for Homework 12 - confidence intervals and hypothesis testing. Open the Homework Assignment 12 and solve the problems.

Practice problems for Homework 12 - confidence intervals and hypothesis testing. Open the Homework Assignment 12 and solve the problems. Practice problems for Homework 1 - confidence intervals and hypothesis testing. Read sections 10..3 and 10.3 of the text. Solve the practice problems below. Open the Homework Assignment 1 and solve the

More information

Introduction. Hypothesis Testing. Hypothesis Testing. Significance Testing

Introduction. Hypothesis Testing. Hypothesis Testing. Significance Testing Introduction Hypothesis Testing Mark Lunt Arthritis Research UK Centre for Ecellence in Epidemiology University of Manchester 13/10/2015 We saw last week that we can never know the population parameters

More information

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96 1 Final Review 2 Review 2.1 CI 1-propZint Scenario 1 A TV manufacturer claims in its warranty brochure that in the past not more than 10 percent of its TV sets needed any repair during the first two years

More information

General Method: Difference of Means. 3. Calculate df: either Welch-Satterthwaite formula or simpler df = min(n 1, n 2 ) 1.

General Method: Difference of Means. 3. Calculate df: either Welch-Satterthwaite formula or simpler df = min(n 1, n 2 ) 1. General Method: Difference of Means 1. Calculate x 1, x 2, SE 1, SE 2. 2. Combined SE = SE1 2 + SE2 2. ASSUMES INDEPENDENT SAMPLES. 3. Calculate df: either Welch-Satterthwaite formula or simpler df = min(n

More information

Name: Date: Use the following to answer questions 3-4:

Name: Date: Use the following to answer questions 3-4: Name: Date: 1. Determine whether each of the following statements is true or false. A) The margin of error for a 95% confidence interval for the mean increases as the sample size increases. B) The margin

More information

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as... HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1 PREVIOUSLY used confidence intervals to answer questions such as... You know that 0.25% of women have red/green color blindness. You conduct a study of men

More information

Confidence Intervals for the Difference Between Two Means

Confidence Intervals for the Difference Between Two Means Chapter 47 Confidence Intervals for the Difference Between Two Means Introduction This procedure calculates the sample size necessary to achieve a specified distance from the difference in sample means

More information

Mind on Statistics. Chapter 13

Mind on Statistics. Chapter 13 Mind on Statistics Chapter 13 Sections 13.1-13.2 1. Which statement is not true about hypothesis tests? A. Hypothesis tests are only valid when the sample is representative of the population for the question

More information

Hypothesis Testing --- One Mean

Hypothesis Testing --- One Mean Hypothesis Testing --- One Mean A hypothesis is simply a statement that something is true. Typically, there are two hypotheses in a hypothesis test: the null, and the alternative. Null Hypothesis The hypothesis

More information

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4)

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4) Summary of Formulas and Concepts Descriptive Statistics (Ch. 1-4) Definitions Population: The complete set of numerical information on a particular quantity in which an investigator is interested. We assume

More information

Unit 26 Estimation with Confidence Intervals

Unit 26 Estimation with Confidence Intervals Unit 26 Estimation with Confidence Intervals Objectives: To see how confidence intervals are used to estimate a population proportion, a population mean, a difference in population proportions, or a difference

More information

C. The null hypothesis is not rejected when the alternative hypothesis is true. A. population parameters.

C. The null hypothesis is not rejected when the alternative hypothesis is true. A. population parameters. Sample Multiple Choice Questions for the material since Midterm 2. Sample questions from Midterms and 2 are also representative of questions that may appear on the final exam.. A randomly selected sample

More information

Hypothesis Testing: Two Means, Paired Data, Two Proportions

Hypothesis Testing: Two Means, Paired Data, Two Proportions Chapter 10 Hypothesis Testing: Two Means, Paired Data, Two Proportions 10.1 Hypothesis Testing: Two Population Means and Two Population Proportions 1 10.1.1 Student Learning Objectives By the end of this

More information

Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means

Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means Lesson : Comparison of Population Means Part c: Comparison of Two- Means Welcome to lesson c. This third lesson of lesson will discuss hypothesis testing for two independent means. Steps in Hypothesis

More information

Chapter 8 Hypothesis Testing Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing

Chapter 8 Hypothesis Testing Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing Chapter 8 Hypothesis Testing 1 Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing 8-3 Testing a Claim About a Proportion 8-5 Testing a Claim About a Mean: s Not Known 8-6 Testing

More information

Difference of Means and ANOVA Problems

Difference of Means and ANOVA Problems Difference of Means and Problems Dr. Tom Ilvento FREC 408 Accounting Firm Study An accounting firm specializes in auditing the financial records of large firm It is interested in evaluating its fee structure,particularly

More information

Math 251, Review Questions for Test 3 Rough Answers

Math 251, Review Questions for Test 3 Rough Answers Math 251, Review Questions for Test 3 Rough Answers 1. (Review of some terminology from Section 7.1) In a state with 459,341 voters, a poll of 2300 voters finds that 45 percent support the Republican candidate,

More information

Opgaven Onderzoeksmethoden, Onderdeel Statistiek

Opgaven Onderzoeksmethoden, Onderdeel Statistiek Opgaven Onderzoeksmethoden, Onderdeel Statistiek 1. What is the measurement scale of the following variables? a Shoe size b Religion c Car brand d Score in a tennis game e Number of work hours per week

More information

Final Exam Practice Problem Answers

Final Exam Practice Problem Answers Final Exam Practice Problem Answers The following data set consists of data gathered from 77 popular breakfast cereals. The variables in the data set are as follows: Brand: The brand name of the cereal

More information

BA 275 Review Problems - Week 5 (10/23/06-10/27/06) CD Lessons: 48, 49, 50, 51, 52 Textbook: pp. 380-394

BA 275 Review Problems - Week 5 (10/23/06-10/27/06) CD Lessons: 48, 49, 50, 51, 52 Textbook: pp. 380-394 BA 275 Review Problems - Week 5 (10/23/06-10/27/06) CD Lessons: 48, 49, 50, 51, 52 Textbook: pp. 380-394 1. Does vigorous exercise affect concentration? In general, the time needed for people to complete

More information

Two-sample hypothesis testing, II 9.07 3/16/2004

Two-sample hypothesis testing, II 9.07 3/16/2004 Two-sample hypothesis testing, II 9.07 3/16/004 Small sample tests for the difference between two independent means For two-sample tests of the difference in mean, things get a little confusing, here,

More information

Simple Linear Regression Inference

Simple Linear Regression Inference Simple Linear Regression Inference 1 Inference requirements The Normality assumption of the stochastic term e is needed for inference even if it is not a OLS requirement. Therefore we have: Interpretation

More information

5/31/2013. Chapter 8 Hypothesis Testing. Hypothesis Testing. Hypothesis Testing. Outline. Objectives. Objectives

5/31/2013. Chapter 8 Hypothesis Testing. Hypothesis Testing. Hypothesis Testing. Outline. Objectives. Objectives C H 8A P T E R Outline 8 1 Steps in Traditional Method 8 2 z Test for a Mean 8 3 t Test for a Mean 8 4 z Test for a Proportion 8 6 Confidence Intervals and Copyright 2013 The McGraw Hill Companies, Inc.

More information

Chapter 7 Section 1 Homework Set A

Chapter 7 Section 1 Homework Set A Chapter 7 Section 1 Homework Set A 7.15 Finding the critical value t *. What critical value t * from Table D (use software, go to the web and type t distribution applet) should be used to calculate the

More information

Association Between Variables

Association Between Variables Contents 11 Association Between Variables 767 11.1 Introduction............................ 767 11.1.1 Measure of Association................. 768 11.1.2 Chapter Summary.................... 769 11.2 Chi

More information

Chapter 3 RANDOM VARIATE GENERATION

Chapter 3 RANDOM VARIATE GENERATION Chapter 3 RANDOM VARIATE GENERATION In order to do a Monte Carlo simulation either by hand or by computer, techniques must be developed for generating values of random variables having known distributions.

More information

Statistiek II. John Nerbonne. October 1, 2010. Dept of Information Science j.nerbonne@rug.nl

Statistiek II. John Nerbonne. October 1, 2010. Dept of Information Science j.nerbonne@rug.nl Dept of Information Science j.nerbonne@rug.nl October 1, 2010 Course outline 1 One-way ANOVA. 2 Factorial ANOVA. 3 Repeated measures ANOVA. 4 Correlation and regression. 5 Multiple regression. 6 Logistic

More information

Comparing Two Groups. Standard Error of ȳ 1 ȳ 2. Setting. Two Independent Samples

Comparing Two Groups. Standard Error of ȳ 1 ȳ 2. Setting. Two Independent Samples Comparing Two Groups Chapter 7 describes two ways to compare two populations on the basis of independent samples: a confidence interval for the difference in population means and a hypothesis test. The

More information

Introduction to Hypothesis Testing

Introduction to Hypothesis Testing I. Terms, Concepts. Introduction to Hypothesis Testing A. In general, we do not know the true value of population parameters - they must be estimated. However, we do have hypotheses about what the true

More information

Experimental Design. Power and Sample Size Determination. Proportions. Proportions. Confidence Interval for p. The Binomial Test

Experimental Design. Power and Sample Size Determination. Proportions. Proportions. Confidence Interval for p. The Binomial Test Experimental Design Power and Sample Size Determination Bret Hanlon and Bret Larget Department of Statistics University of Wisconsin Madison November 3 8, 2011 To this point in the semester, we have largely

More information

UNDERSTANDING THE INDEPENDENT-SAMPLES t TEST

UNDERSTANDING THE INDEPENDENT-SAMPLES t TEST UNDERSTANDING The independent-samples t test evaluates the difference between the means of two independent or unrelated groups. That is, we evaluate whether the means for two independent groups are significantly

More information

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression Objectives: To perform a hypothesis test concerning the slope of a least squares line To recognize that testing for a

More information

Confidence Intervals for One Standard Deviation Using Standard Deviation

Confidence Intervals for One Standard Deviation Using Standard Deviation Chapter 640 Confidence Intervals for One Standard Deviation Using Standard Deviation Introduction This routine calculates the sample size necessary to achieve a specified interval width or distance from

More information

22. HYPOTHESIS TESTING

22. HYPOTHESIS TESTING 22. HYPOTHESIS TESTING Often, we need to make decisions based on incomplete information. Do the data support some belief ( hypothesis ) about the value of a population parameter? Is OJ Simpson guilty?

More information

STAT 145 (Notes) Al Nosedal anosedal@unm.edu Department of Mathematics and Statistics University of New Mexico. Fall 2013

STAT 145 (Notes) Al Nosedal anosedal@unm.edu Department of Mathematics and Statistics University of New Mexico. Fall 2013 STAT 145 (Notes) Al Nosedal anosedal@unm.edu Department of Mathematics and Statistics University of New Mexico Fall 2013 CHAPTER 18 INFERENCE ABOUT A POPULATION MEAN. Conditions for Inference about mean

More information

Independent t- Test (Comparing Two Means)

Independent t- Test (Comparing Two Means) Independent t- Test (Comparing Two Means) The objectives of this lesson are to learn: the definition/purpose of independent t-test when to use the independent t-test the use of SPSS to complete an independent

More information

Chapter 4 Statistical Inference in Quality Control and Improvement. Statistical Quality Control (D. C. Montgomery)

Chapter 4 Statistical Inference in Quality Control and Improvement. Statistical Quality Control (D. C. Montgomery) Chapter 4 Statistical Inference in Quality Control and Improvement 許 湘 伶 Statistical Quality Control (D. C. Montgomery) Sampling distribution I a random sample of size n: if it is selected so that the

More information

Lecture Notes Module 1

Lecture Notes Module 1 Lecture Notes Module 1 Study Populations A study population is a clearly defined collection of people, animals, plants, or objects. In psychological research, a study population usually consists of a specific

More information

Chapter 7 Notes - Inference for Single Samples. You know already for a large sample, you can invoke the CLT so:

Chapter 7 Notes - Inference for Single Samples. You know already for a large sample, you can invoke the CLT so: Chapter 7 Notes - Inference for Single Samples You know already for a large sample, you can invoke the CLT so: X N(µ, ). Also for a large sample, you can replace an unknown σ by s. You know how to do a

More information

Section 7.1. Introduction to Hypothesis Testing. Schrodinger s cat quantum mechanics thought experiment (1935)

Section 7.1. Introduction to Hypothesis Testing. Schrodinger s cat quantum mechanics thought experiment (1935) Section 7.1 Introduction to Hypothesis Testing Schrodinger s cat quantum mechanics thought experiment (1935) Statistical Hypotheses A statistical hypothesis is a claim about a population. Null hypothesis

More information

Introduction to Analysis of Variance (ANOVA) Limitations of the t-test

Introduction to Analysis of Variance (ANOVA) Limitations of the t-test Introduction to Analysis of Variance (ANOVA) The Structural Model, The Summary Table, and the One- Way ANOVA Limitations of the t-test Although the t-test is commonly used, it has limitations Can only

More information

UNDERSTANDING THE TWO-WAY ANOVA

UNDERSTANDING THE TWO-WAY ANOVA UNDERSTANDING THE e have seen how the one-way ANOVA can be used to compare two or more sample means in studies involving a single independent variable. This can be extended to two independent variables

More information

The Wilcoxon Rank-Sum Test

The Wilcoxon Rank-Sum Test 1 The Wilcoxon Rank-Sum Test The Wilcoxon rank-sum test is a nonparametric alternative to the twosample t-test which is based solely on the order in which the observations from the two samples fall. We

More information

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Sample Practice problems - chapter 12-1 and 2 proportions for inference - Z Distributions Name MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Provide

More information

Two-sample inference: Continuous data

Two-sample inference: Continuous data Two-sample inference: Continuous data Patrick Breheny April 5 Patrick Breheny STA 580: Biostatistics I 1/32 Introduction Our next two lectures will deal with two-sample inference for continuous data As

More information

Name: (b) Find the minimum sample size you should use in order for your estimate to be within 0.03 of p when the confidence level is 95%.

Name: (b) Find the minimum sample size you should use in order for your estimate to be within 0.03 of p when the confidence level is 95%. Chapter 7-8 Exam Name: Answer the questions in the spaces provided. If you run out of room, show your work on a separate paper clearly numbered and attached to this exam. Please indicate which program

More information

Comparing Means Between Groups

Comparing Means Between Groups Comparing Means Between Groups Michael Ash Lecture 6 Summary of Main Points Comparing means between groups is an important method for program evaluation by policy analysts and public administrators. The

More information

Chapter 2 Probability Topics SPSS T tests

Chapter 2 Probability Topics SPSS T tests Chapter 2 Probability Topics SPSS T tests Data file used: gss.sav In the lecture about chapter 2, only the One-Sample T test has been explained. In this handout, we also give the SPSS methods to perform

More information

Psychology 60 Fall 2013 Practice Exam Actual Exam: Next Monday. Good luck!

Psychology 60 Fall 2013 Practice Exam Actual Exam: Next Monday. Good luck! Psychology 60 Fall 2013 Practice Exam Actual Exam: Next Monday. Good luck! Name: 1. The basic idea behind hypothesis testing: A. is important only if you want to compare two populations. B. depends on

More information

Two-Sample T-Tests Assuming Equal Variance (Enter Means)

Two-Sample T-Tests Assuming Equal Variance (Enter Means) Chapter 4 Two-Sample T-Tests Assuming Equal Variance (Enter Means) Introduction This procedure provides sample size and power calculations for one- or two-sided two-sample t-tests when the variances of

More information

1.5 Oneway Analysis of Variance

1.5 Oneway Analysis of Variance Statistics: Rosie Cornish. 200. 1.5 Oneway Analysis of Variance 1 Introduction Oneway analysis of variance (ANOVA) is used to compare several means. This method is often used in scientific or medical experiments

More information

Nonparametric Two-Sample Tests. Nonparametric Tests. Sign Test

Nonparametric Two-Sample Tests. Nonparametric Tests. Sign Test Nonparametric Two-Sample Tests Sign test Mann-Whitney U-test (a.k.a. Wilcoxon two-sample test) Kolmogorov-Smirnov Test Wilcoxon Signed-Rank Test Tukey-Duckworth Test 1 Nonparametric Tests Recall, nonparametric

More information

Non-Parametric Tests (I)

Non-Parametric Tests (I) Lecture 5: Non-Parametric Tests (I) KimHuat LIM lim@stats.ox.ac.uk http://www.stats.ox.ac.uk/~lim/teaching.html Slide 1 5.1 Outline (i) Overview of Distribution-Free Tests (ii) Median Test for Two Independent

More information

How To Compare Birds To Other Birds

How To Compare Birds To Other Birds STT 430/630/ES 760 Lecture Notes: Chapter 7: Two-Sample Inference 1 February 27, 2009 Chapter 7: Two Sample Inference Chapter 6 introduced hypothesis testing in the one-sample setting: one sample is obtained

More information

Calculating P-Values. Parkland College. Isela Guerra Parkland College. Recommended Citation

Calculating P-Values. Parkland College. Isela Guerra Parkland College. Recommended Citation Parkland College A with Honors Projects Honors Program 2014 Calculating P-Values Isela Guerra Parkland College Recommended Citation Guerra, Isela, "Calculating P-Values" (2014). A with Honors Projects.

More information

Statistics. One-two sided test, Parametric and non-parametric test statistics: one group, two groups, and more than two groups samples

Statistics. One-two sided test, Parametric and non-parametric test statistics: one group, two groups, and more than two groups samples Statistics One-two sided test, Parametric and non-parametric test statistics: one group, two groups, and more than two groups samples February 3, 00 Jobayer Hossain, Ph.D. & Tim Bunnell, Ph.D. Nemours

More information

5.1 Identifying the Target Parameter

5.1 Identifying the Target Parameter University of California, Davis Department of Statistics Summer Session II Statistics 13 August 20, 2012 Date of latest update: August 20 Lecture 5: Estimation with Confidence intervals 5.1 Identifying

More information

Introduction to Hypothesis Testing OPRE 6301

Introduction to Hypothesis Testing OPRE 6301 Introduction to Hypothesis Testing OPRE 6301 Motivation... The purpose of hypothesis testing is to determine whether there is enough statistical evidence in favor of a certain belief, or hypothesis, about

More information

Two-Sample T-Tests Allowing Unequal Variance (Enter Difference)

Two-Sample T-Tests Allowing Unequal Variance (Enter Difference) Chapter 45 Two-Sample T-Tests Allowing Unequal Variance (Enter Difference) Introduction This procedure provides sample size and power calculations for one- or two-sided two-sample t-tests when no assumption

More information

Odds ratio, Odds ratio test for independence, chi-squared statistic.

Odds ratio, Odds ratio test for independence, chi-squared statistic. Odds ratio, Odds ratio test for independence, chi-squared statistic. Announcements: Assignment 5 is live on webpage. Due Wed Aug 1 at 4:30pm. (9 days, 1 hour, 58.5 minutes ) Final exam is Aug 9. Review

More information

Is it statistically significant? The chi-square test

Is it statistically significant? The chi-square test UAS Conference Series 2013/14 Is it statistically significant? The chi-square test Dr Gosia Turner Student Data Management and Analysis 14 September 2010 Page 1 Why chi-square? Tests whether two categorical

More information

Statistiek I. Proportions aka Sign Tests. John Nerbonne. CLCG, Rijksuniversiteit Groningen. http://www.let.rug.nl/nerbonne/teach/statistiek-i/

Statistiek I. Proportions aka Sign Tests. John Nerbonne. CLCG, Rijksuniversiteit Groningen. http://www.let.rug.nl/nerbonne/teach/statistiek-i/ Statistiek I Proportions aka Sign Tests John Nerbonne CLCG, Rijksuniversiteit Groningen http://www.let.rug.nl/nerbonne/teach/statistiek-i/ John Nerbonne 1/34 Proportions aka Sign Test The relative frequency

More information

Simple Regression Theory II 2010 Samuel L. Baker

Simple Regression Theory II 2010 Samuel L. Baker SIMPLE REGRESSION THEORY II 1 Simple Regression Theory II 2010 Samuel L. Baker Assessing how good the regression equation is likely to be Assignment 1A gets into drawing inferences about how close the

More information

12.5: CHI-SQUARE GOODNESS OF FIT TESTS

12.5: CHI-SQUARE GOODNESS OF FIT TESTS 125: Chi-Square Goodness of Fit Tests CD12-1 125: CHI-SQUARE GOODNESS OF FIT TESTS In this section, the χ 2 distribution is used for testing the goodness of fit of a set of data to a specific probability

More information

Tests for Two Proportions

Tests for Two Proportions Chapter 200 Tests for Two Proportions Introduction This module computes power and sample size for hypothesis tests of the difference, ratio, or odds ratio of two independent proportions. The test statistics

More information

NCSS Statistical Software. One-Sample T-Test

NCSS Statistical Software. One-Sample T-Test Chapter 205 Introduction This procedure provides several reports for making inference about a population mean based on a single sample. These reports include confidence intervals of the mean or median,

More information

CONTENTS OF DAY 2. II. Why Random Sampling is Important 9 A myth, an urban legend, and the real reason NOTES FOR SUMMER STATISTICS INSTITUTE COURSE

CONTENTS OF DAY 2. II. Why Random Sampling is Important 9 A myth, an urban legend, and the real reason NOTES FOR SUMMER STATISTICS INSTITUTE COURSE 1 2 CONTENTS OF DAY 2 I. More Precise Definition of Simple Random Sample 3 Connection with independent random variables 3 Problems with small populations 8 II. Why Random Sampling is Important 9 A myth,

More information

Social Studies 201 Notes for November 19, 2003

Social Studies 201 Notes for November 19, 2003 1 Social Studies 201 Notes for November 19, 2003 Determining sample size for estimation of a population proportion Section 8.6.2, p. 541. As indicated in the notes for November 17, when sample size is

More information

Point Biserial Correlation Tests

Point Biserial Correlation Tests Chapter 807 Point Biserial Correlation Tests Introduction The point biserial correlation coefficient (ρ in this chapter) is the product-moment correlation calculated between a continuous random variable

More information

t Tests in Excel The Excel Statistical Master By Mark Harmon Copyright 2011 Mark Harmon

t Tests in Excel The Excel Statistical Master By Mark Harmon Copyright 2011 Mark Harmon t-tests in Excel By Mark Harmon Copyright 2011 Mark Harmon No part of this publication may be reproduced or distributed without the express permission of the author. mark@excelmasterseries.com www.excelmasterseries.com

More information

Once saved, if the file was zipped you will need to unzip it. For the files that I will be posting you need to change the preferences.

Once saved, if the file was zipped you will need to unzip it. For the files that I will be posting you need to change the preferences. 1 Commands in JMP and Statcrunch Below are a set of commands in JMP and Statcrunch which facilitate a basic statistical analysis. The first part concerns commands in JMP, the second part is for analysis

More information

The Dummy s Guide to Data Analysis Using SPSS

The Dummy s Guide to Data Analysis Using SPSS The Dummy s Guide to Data Analysis Using SPSS Mathematics 57 Scripps College Amy Gamble April, 2001 Amy Gamble 4/30/01 All Rights Rerserved TABLE OF CONTENTS PAGE Helpful Hints for All Tests...1 Tests

More information

Section 13, Part 1 ANOVA. Analysis Of Variance

Section 13, Part 1 ANOVA. Analysis Of Variance Section 13, Part 1 ANOVA Analysis Of Variance Course Overview So far in this course we ve covered: Descriptive statistics Summary statistics Tables and Graphs Probability Probability Rules Probability

More information

In the general population of 0 to 4-year-olds, the annual incidence of asthma is 1.4%

In the general population of 0 to 4-year-olds, the annual incidence of asthma is 1.4% Hypothesis Testing for a Proportion Example: We are interested in the probability of developing asthma over a given one-year period for children 0 to 4 years of age whose mothers smoke in the home In the

More information

How To Test For Significance On A Data Set

How To Test For Significance On A Data Set Non-Parametric Univariate Tests: 1 Sample Sign Test 1 1 SAMPLE SIGN TEST A non-parametric equivalent of the 1 SAMPLE T-TEST. ASSUMPTIONS: Data is non-normally distributed, even after log transforming.

More information

NCSS Statistical Software

NCSS Statistical Software Chapter 06 Introduction This procedure provides several reports for the comparison of two distributions, including confidence intervals for the difference in means, two-sample t-tests, the z-test, the

More information

Permutation Tests for Comparing Two Populations

Permutation Tests for Comparing Two Populations Permutation Tests for Comparing Two Populations Ferry Butar Butar, Ph.D. Jae-Wan Park Abstract Permutation tests for comparing two populations could be widely used in practice because of flexibility of

More information

CHAPTER 14 NONPARAMETRIC TESTS

CHAPTER 14 NONPARAMETRIC TESTS CHAPTER 14 NONPARAMETRIC TESTS Everything that we have done up until now in statistics has relied heavily on one major fact: that our data is normally distributed. We have been able to make inferences

More information

Mind on Statistics. Chapter 12

Mind on Statistics. Chapter 12 Mind on Statistics Chapter 12 Sections 12.1 Questions 1 to 6: For each statement, determine if the statement is a typical null hypothesis (H 0 ) or alternative hypothesis (H a ). 1. There is no difference

More information

2 Precision-based sample size calculations

2 Precision-based sample size calculations Statistics: An introduction to sample size calculations Rosie Cornish. 2006. 1 Introduction One crucial aspect of study design is deciding how big your sample should be. If you increase your sample size

More information

UNDERSTANDING THE DEPENDENT-SAMPLES t TEST

UNDERSTANDING THE DEPENDENT-SAMPLES t TEST UNDERSTANDING THE DEPENDENT-SAMPLES t TEST A dependent-samples t test (a.k.a. matched or paired-samples, matched-pairs, samples, or subjects, simple repeated-measures or within-groups, or correlated groups)

More information

1. How different is the t distribution from the normal?

1. How different is the t distribution from the normal? Statistics 101 106 Lecture 7 (20 October 98) c David Pollard Page 1 Read M&M 7.1 and 7.2, ignoring starred parts. Reread M&M 3.2. The effects of estimated variances on normal approximations. t-distributions.

More information

Hypothesis Testing. Hypothesis Testing

Hypothesis Testing. Hypothesis Testing Hypothesis Testing Daniel A. Menascé Department of Computer Science George Mason University 1 Hypothesis Testing Purpose: make inferences about a population parameter by analyzing differences between observed

More information

Module 2 Probability and Statistics

Module 2 Probability and Statistics Module 2 Probability and Statistics BASIC CONCEPTS Multiple Choice Identify the choice that best completes the statement or answers the question. 1. The standard deviation of a standard normal distribution

More information

Review #2. Statistics

Review #2. Statistics Review #2 Statistics Find the mean of the given probability distribution. 1) x P(x) 0 0.19 1 0.37 2 0.16 3 0.26 4 0.02 A) 1.64 B) 1.45 C) 1.55 D) 1.74 2) The number of golf balls ordered by customers of

More information

Principles of Hypothesis Testing for Public Health

Principles of Hypothesis Testing for Public Health Principles of Hypothesis Testing for Public Health Laura Lee Johnson, Ph.D. Statistician National Center for Complementary and Alternative Medicine johnslau@mail.nih.gov Fall 2011 Answers to Questions

More information

Describing Populations Statistically: The Mean, Variance, and Standard Deviation

Describing Populations Statistically: The Mean, Variance, and Standard Deviation Describing Populations Statistically: The Mean, Variance, and Standard Deviation BIOLOGICAL VARIATION One aspect of biology that holds true for almost all species is that not every individual is exactly

More information