6. Duality between confidence intervals and statistical tests

Size: px
Start display at page:

Download "6. Duality between confidence intervals and statistical tests"

Transcription

1 6. Duality between confidence intervals and statistical tests Suppose we carry out the following test at a significance level of 100α%. H 0 :µ = µ 0 H A :µ µ 0 Then we reject H 0 if and only if µ 0 does not belong to the 100(1 α)% confidence interval for µ. 1 / 44

2 Intuition of the duality between tests and confidence intervals Intuition: if µ 0 belongs to the appropriate confidence interval, then it is a credible value for the population mean and hence we do not reject H 0. Significance level + confidence level = 100%. 2 / 44

3 Example 6.1 (see Example 5.2) A sample of 100 Irish people were measured. Their mean height was 168cm and the sample standard deviation 12cm. Calculate a 95% confidence interval for the mean height of all Irish people. On the basis of this confidence interval, should we reject the hypothesis that the mean height of Irish people is 170cm? What is the significance level of such a test? 3 / 44

4 Solution to Example 6.1 The sample is large and 100(1 α) = 95 α = The formula for the 95% confidence interval is X ± t,α/2s =168 ± n 10 =168 ± 2.35 = [165.65, ] Since µ 0 = 170 belongs to the 95% confidence interval for µ, we do not reject H 0. 4 / 44

5 Applications of the concept of duality The duality between confidence intervals and tests can easily be extended to tests for the difference of two means for 2 dependent samples. Let D denote the difference between a pair of observations. We reject the null hypothesis H 0 : µ D = 0, at a significance level of 100α%, if and only if 0 does not belong to the 100(1 α)% confidence interval for the mean difference. 5 / 44

6 Example 6.2 (see Example 5.4) A drug for reducing blood pressure is tested on a group of 7 patients. The maximum resting blood pressure of these patients was measured before and after treatment. Patient Blood Pressure - Before Blood Pressure - After Calculate 95% and 99% confidence intervals for the mean fall in blood pressure. Can it be said at a significance level of a) 5% b) 1%, that the drug changes the mean blood pressure? 6 / 44

7 Solution of Example 6.2 It should be noted that this hypothesis states that µ D 0. This cannot be the null hypothesis. The null hypothesis is the complement of this hypothesis i.e. µ D = 0. First we calculate the falls in the blood pressure for each patient. Patient Before (X ) After (Y ) Difference (X Y ) / 44

8 Solution of Example 6.2 Now we calculate the confidence intervals for the mean difference. The sample of differences is small, hence the appropriate formulae are as follows: 95% confidence interval: D ± s Dt n 1,0.025 n 99% confidence interval: D ± s Dt n 1,0.005 n 8 / 44

9 Solution of Example 6.2 We calculate the mean and the standard deviation of the differences D= = 10 7 n sd 2 = i=1 (d i D) 2 n 1 = (10 10)2 + (10 10) (20 10) 2 = s D = 100 = / 44

10 Solution of Example 6.2 We now calculate the 95% confidence interval: D ± s Dt n 1,0.025 =10 ± 10t 6,0.025 n =10 ± = 10 ± 9.25 = [0.75, 19.25] 7 Since 0 does not belong to the 95% confidence interval for µ D, we reject H 0 at a significance level of 5%. 10 / 44

11 Solution of Example 6.2 We now calculate the 99% confidence interval: D ± s Dt n 1,0.005 =10 ± 10t 6,0.005 n =10 ± 7 =10 ± = [ 4.01, 24.01] Since 0 belongs to the 99% confidence interval for µ D, we do not reject H 0 at a significance level of 1%. Hence, we have evidence that the drug affects blood pressure, although this evidence is not very strong. 11 / 44

12 6.1 Confidence intervals for the difference between population means based on 2 independent samples For 2 large samples When both samples are large the 100(1-α)% confidence interval for the difference between 2 population means is: Confidence interval for the difference between population means X Y ± t, α S.E.(X Y ) / 44

13 Confidence intervals for the difference between population means based on 2 independent samples In particular, 95% confidence interval X Y ± t,0.025 S.E.(X Y ) = X Y ± 1.96S.E.(X Y ). 99% confidence interval X Y ± t,0.005 S.E.(X Y ) = X Y ± 2.576S.E.(X Y ), where X and Y are the sample means and S.E.(X Y ) is the standard error of using (X Y ) to estimate µ X µ Y. 13 / 44

14 Standard error of the difference between the sample means When both samples are large, the approximation for the standard error of the difference between the sample means is given by sx 2 S.E.(X Y ) m + s2 Y n, where m and n are the sample sizes. 14 / 44

15 Example 6.3 (see Example 5.5) The average height of 100 Dutch men is 176cm and their standard deviation 12cm. The average height of 50 Japanese men is 169cm and the standard deviation is 10cm. Calculate a 95% confidence interval for the mean difference in height between Dutch and Japanese men. Can it be stated at a significance level of 5% that the mean heights of these nationatilities differ? 15 / 44

16 Solution to Example 6.3 First we estimate the standard error of the difference between the sample means s 2 X S.E.(X Y ) m + s2 Y n 122 = = / 44

17 Solution to Example 6.3 The 95% confidence interval for the difference between population means is X Y ± t,0.025 S.E.(X Y )=( ) ± =7 ± 3.64 = [3.36, 10.64] 17 / 44

18 Solution to Example 6.3 We have H 0 :µ X µ Y = 0 H A :µ X µ Y 0. Since 0 does not belong to this confidence interval, we reject H 0 at a significance level of 5%, we have evidence that the mean heights differ. 18 / 44

19 6.1.2 For small samples If at least one of the samples is small, then we use the following approximation for S.E.(X Y ). S.E.(X Y ) where s 2 p is the pooled variance s 2 p ( 1 m + 1 ), n s 2 p = (m 1)s2 X + (n 1)s2 Y m + n 2 This formula assumes that the population variances are equal. The formula for the confidence interval assumes that if a sample is small then observations come from a normal distribution. 19 / 44

20 Formulae for confidence intervals for the difference between two population means (small samples) The 100(1-α)% confidence interval for the difference between 2 population means is (X Y ) ± t m+n 2, α S.E.(X Y ). 2 (if m + n 2 > 30, we may use the approximation t m+n 2,α/2 t,α/2. In particular, the 95% confidence interval is (X Y ) ± t m+n 2,0.025 S.E.(X Y ). The 99% confidence interval for the difference between 2 population means is (X Y ) ± t m+n 2,0.005 S.E.(X Y ). 20 / 44

21 Example 6.3 (see Example 5.6) The table below gives statistics regarding the weights of Americans and Japanese. Sample Americans Japanese size mean std. dev Calculate a 99% confidence interval for the mean difference in weight between Americans and Japanese. Can it be stated at a significance level of 1% that these mean weights differ? 21 / 44

22 Solution to Example 6.3 First we calculate the pooled variance sp= 2 (m 1)s2 X + (n 1)s2 Y m + n 2 = Now we calculate the standard error ( 1 S.E.(X Y ) sp 2 m + 1 ) n ( ) / 44

23 Solution to Example 6.3 Now we calculate the 99% confidence interval (X Y ) ± t m+n 2,0.005 S.E.(X Y )=(86 72) ± t 23, =14 ± ± 12.9 = [1.1, 26.9]. 23 / 44

24 Solution to Example 6.3 We have H 0 :µ X µ Y = 0 H A :µ X µ Y 0, where µ X and µ Y are the mean wieghts of the populations of Americans and Japanese, respectively. Since 0 does not belong to the 99% confidence interval, we reject H 0 at a significance level of 1%. We have strong evidence that the mean weight of Americans and Japanese differs. 24 / 44

25 Solution to Example 6.3 It should be noted that these calculations assume that weight has a normal distribution. Since weight has a slightly skewed distribution, the calculation of the confidence interval is not totally accurate. The evidence that the mean weights differ is thus not so strong as stated. 25 / 44

26 6.2 Tests for a population proportion Suppose we have a large sample. We wish to test between the following two hypotheses H 0 :p = p 0 H A :p p 0, where p is the proportion of a population showing a given trait. The obvious estimator of this proportion is the sample proportion ˆp = X n, where X is the number of individuals exhibiting the given trait in a sample of n individuals. 26 / 44

27 Testing procedure The test statistic for this test is Z = ˆp p 0 S.E.(ˆp), where S.E.(ˆp) is the standard error of the sample proportion. The p-value for the test is 2P(Z > t ), where t is the realisation of the test statistic. At a significance level of α, we reject H 0 : p = p 0 if t > t, α / 44

28 Standard error of the sample proportion The standard error of this estimate is p(1 p) S.E.(ˆp) =. n Under the null hypothesis p = p 0, p0 (1 p 0 ) S.E.(ˆp) =. n This formula for the standard error is used in the standard testing procedure. 28 / 44

29 Confidence intervals for a population proportion A 100(1-α)% confidence interval for a population proportion is given by ˆp ± t, α 2 S.E.(ˆp). When calculating a confidence interval, the standard error is approximated using the sample proportion i.e. ˆp(1 ˆp) S.E.(ˆp). n 29 / 44

30 Particular confidence intervals for a population proportion In particular, a 95% confidence interval for a population proportion is given by ˆp ± t,0.025 S.E.(ˆp) = ˆp ± 1.96S.E.(ˆp). A 99% confidence interval for a population proportion is given by ˆp ± t,0.005 S.E.(ˆp) = ˆp ± 2.576S.E.(ˆp). 30 / 44

31 Approximate duality between confidence intervals for population proportions and tests It should be noted that when we are testing the hypothesis H 0 : p = p 0, due to the different formulae used to calculate the standard error of ˆp, the duality between confidence intervals and hypothesis tests is only approximate (in the standard testing procedure p = p 0 is used to calculate the standard error, but p ˆp is used in the calculation of the confidence interval). e.g. using a 95% confidence to test the hypothesis H 0 : p = p 0, the significance level would be approximately 5%. 31 / 44

32 Example of 300 randomly picked Irish households have a PC. i) Test the null hypothesis that 55% of all Irish households have a PC. ii) Construct 95% and 99% confidence intervals for the proportion of Irish households owning a PC. 32 / 44

33 Solution to Example 6.4 i) The sample proportion is The test statistic is ˆp = = 0.61 Z = ˆp p 0 S.E.(ˆp) 33 / 44

34 Solution to Example 6.4 The standard error of the sample proportion is p0 (1 p 0 ) S.E.(ˆp)= n = The realisation of the test statistic is t = / 44

35 Conclusion Since t > t,0.025 = 1.96 and t < t,0.005 = 2.576, we reject H 0 at a significance level of 5%, but not at a significance level of 1%. We have evidence that the percentage is higher than 55%. 35 / 44

36 Calculation of confidence interval The approximation of the standard error is ˆp(1 ˆp) S.E.(ˆp) n = / 44

37 Solution to Example 6.4 The 95% confidence interval for the proportion of all Irish households owning a PC is 0.61 ± ± = [0.555, 0.665] Since 0.55 does not belong to this confidence interval, based on the confidence interval we reject the null hypothesis that p = 0.55 at a significance level of approximately 5%. 37 / 44

38 Solution to Example 6.4 The 99% confidence interval for the proportion of all Irish households owning a PC is 0.61 ± ± = [0.537, 0.683] Since 0.55 belongs to this confidence interval, we do not reject the null hypothesis that p = 0.55 at a significance level of approximately 1%. Hence, we reject H 0 : p = 0.55 at the 5% level, but not at the 1% level. We have evidence that the proportion of Irish households owning PCs is greater than 55%, but this evidence is not strong. 38 / 44

39 6.3 Testing for a difference between two population proportions Suppose we have two large samples. We wish to test between the following two hypotheses H 0 :p 1 = p 2 (i.e. p 1 p 2 = 0) H A :p 1 p 2 where p i is the proportion of population i showing a given trait. We will only consider such tests using duality i.e. based on confidence intervals. 39 / 44

40 Confidence intervals for the difference between two population proportions A 100(1-α)% confidence interval for the difference between two population proportions is given by (ˆp 1 ˆp 2 ) ± t, α 2 S.E.(ˆp 1 ˆp 2 ), where ˆp i is the sample proportion for the sample from population i. In particular, a 95% confidence interval for the difference between two population proportions is given by (ˆp 1 ˆp 2 ) ± t,0.025 S.E.(ˆp 1 ˆp 2 ) = (ˆp 1 ˆp 2 ) ± 1.96S.E.(ˆp 1 ˆp 2 ) A 99% confidence interval for the difference between two population proportions is given by (ˆp 1 ˆp 2 ) ± t,0.005 S.E.(ˆp 1 ˆp 2 ) = (ˆp 1 ˆp 2 ) ± 2.576S.E.(ˆp 1 ˆp 2 ) 40 / 44

41 Standard error of the difference between sample proportions The standard error for the difference between two sample proportions is estimated by ˆp 1 (1 ˆp 1 ) S.E.(ˆp 1 ˆp 2 ) + ˆp 2(1 ˆp 2 ), n 1 n 2 where n i is the size of the sample from the population i. Again, the duality between confidence intervals and hypothesis tests is only approximate. 41 / 44

42 Example 6.5 A long term study showed that 64 of 400 smokers contracted lung cancer, while 18 of 900 non-smokers contracted the disease. Calculate a 95% confidence interval for the difference between the proportions of smokers and non-smokers contracting the disease. 42 / 44

43 Solution to Example 6.5 The sample proportions are ˆp 1 = = 0.16 ˆp 2 = = 0.02 The estimate of the standard error for the difference between the sample proportions is ˆp 1 (1 ˆp 1 ) + ˆp 2(1 ˆp 2 ) = + n 1 n / 44

44 Solution to Example 6.5 A 95% confidence interval for the difference between the population proportions is given by (ˆp 1 ˆp 2 ) ± 1.96S.E.(ˆp 1 ˆp 2 )=( ) ± =0.14 ± = [0.103, 0.177] Since 0 does not belong to this 95% confidence interval, we reject the null hypothesis at a significance level of approximately 5%. Hence, we have evidence that the population proportions are not equal. From the data it can be seen that the proportion of smokers contracting lung cancer is higher than the proportion of non-smokers. 44 / 44

3.4 Statistical inference for 2 populations based on two samples

3.4 Statistical inference for 2 populations based on two samples 3.4 Statistical inference for 2 populations based on two samples Tests for a difference between two population means The first sample will be denoted as X 1, X 2,..., X m. The second sample will be denoted

More information

STATISTICS 8, FINAL EXAM. Last six digits of Student ID#: Circle your Discussion Section: 1 2 3 4

STATISTICS 8, FINAL EXAM. Last six digits of Student ID#: Circle your Discussion Section: 1 2 3 4 STATISTICS 8, FINAL EXAM NAME: KEY Seat Number: Last six digits of Student ID#: Circle your Discussion Section: 1 2 3 4 Make sure you have 8 pages. You will be provided with a table as well, as a separate

More information

C. The null hypothesis is not rejected when the alternative hypothesis is true. A. population parameters.

C. The null hypothesis is not rejected when the alternative hypothesis is true. A. population parameters. Sample Multiple Choice Questions for the material since Midterm 2. Sample questions from Midterms and 2 are also representative of questions that may appear on the final exam.. A randomly selected sample

More information

General Method: Difference of Means. 3. Calculate df: either Welch-Satterthwaite formula or simpler df = min(n 1, n 2 ) 1.

General Method: Difference of Means. 3. Calculate df: either Welch-Satterthwaite formula or simpler df = min(n 1, n 2 ) 1. General Method: Difference of Means 1. Calculate x 1, x 2, SE 1, SE 2. 2. Combined SE = SE1 2 + SE2 2. ASSUMES INDEPENDENT SAMPLES. 3. Calculate df: either Welch-Satterthwaite formula or simpler df = min(n

More information

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING In this lab you will explore the concept of a confidence interval and hypothesis testing through a simulation problem in engineering setting.

More information

Chapter 2 Probability Topics SPSS T tests

Chapter 2 Probability Topics SPSS T tests Chapter 2 Probability Topics SPSS T tests Data file used: gss.sav In the lecture about chapter 2, only the One-Sample T test has been explained. In this handout, we also give the SPSS methods to perform

More information

Mind on Statistics. Chapter 13

Mind on Statistics. Chapter 13 Mind on Statistics Chapter 13 Sections 13.1-13.2 1. Which statement is not true about hypothesis tests? A. Hypothesis tests are only valid when the sample is representative of the population for the question

More information

CONTENTS OF DAY 2. II. Why Random Sampling is Important 9 A myth, an urban legend, and the real reason NOTES FOR SUMMER STATISTICS INSTITUTE COURSE

CONTENTS OF DAY 2. II. Why Random Sampling is Important 9 A myth, an urban legend, and the real reason NOTES FOR SUMMER STATISTICS INSTITUTE COURSE 1 2 CONTENTS OF DAY 2 I. More Precise Definition of Simple Random Sample 3 Connection with independent random variables 3 Problems with small populations 8 II. Why Random Sampling is Important 9 A myth,

More information

Name: Date: Use the following to answer questions 3-4:

Name: Date: Use the following to answer questions 3-4: Name: Date: 1. Determine whether each of the following statements is true or false. A) The margin of error for a 95% confidence interval for the mean increases as the sample size increases. B) The margin

More information

Data Mining Techniques Chapter 5: The Lure of Statistics: Data Mining Using Familiar Tools

Data Mining Techniques Chapter 5: The Lure of Statistics: Data Mining Using Familiar Tools Data Mining Techniques Chapter 5: The Lure of Statistics: Data Mining Using Familiar Tools Occam s razor.......................................................... 2 A look at data I.........................................................

More information

Biostatistics: Types of Data Analysis

Biostatistics: Types of Data Analysis Biostatistics: Types of Data Analysis Theresa A Scott, MS Vanderbilt University Department of Biostatistics theresa.scott@vanderbilt.edu http://biostat.mc.vanderbilt.edu/theresascott Theresa A Scott, MS

More information

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96 1 Final Review 2 Review 2.1 CI 1-propZint Scenario 1 A TV manufacturer claims in its warranty brochure that in the past not more than 10 percent of its TV sets needed any repair during the first two years

More information

STAT 145 (Notes) Al Nosedal anosedal@unm.edu Department of Mathematics and Statistics University of New Mexico. Fall 2013

STAT 145 (Notes) Al Nosedal anosedal@unm.edu Department of Mathematics and Statistics University of New Mexico. Fall 2013 STAT 145 (Notes) Al Nosedal anosedal@unm.edu Department of Mathematics and Statistics University of New Mexico Fall 2013 CHAPTER 18 INFERENCE ABOUT A POPULATION MEAN. Conditions for Inference about mean

More information

CHAPTER 13 SIMPLE LINEAR REGRESSION. Opening Example. Simple Regression. Linear Regression

CHAPTER 13 SIMPLE LINEAR REGRESSION. Opening Example. Simple Regression. Linear Regression Opening Example CHAPTER 13 SIMPLE LINEAR REGREION SIMPLE LINEAR REGREION! Simple Regression! Linear Regression Simple Regression Definition A regression model is a mathematical equation that descries the

More information

Two-sample t-tests. - Independent samples - Pooled standard devation - The equal variance assumption

Two-sample t-tests. - Independent samples - Pooled standard devation - The equal variance assumption Two-sample t-tests. - Independent samples - Pooled standard devation - The equal variance assumption Last time, we used the mean of one sample to test against the hypothesis that the true mean was a particular

More information

Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1)

Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1) Spring 204 Class 9: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.) Big Picture: More than Two Samples In Chapter 7: We looked at quantitative variables and compared the

More information

Comparing Two Groups. Standard Error of ȳ 1 ȳ 2. Setting. Two Independent Samples

Comparing Two Groups. Standard Error of ȳ 1 ȳ 2. Setting. Two Independent Samples Comparing Two Groups Chapter 7 describes two ways to compare two populations on the basis of independent samples: a confidence interval for the difference in population means and a hypothesis test. The

More information

4. Continuous Random Variables, the Pareto and Normal Distributions

4. Continuous Random Variables, the Pareto and Normal Distributions 4. Continuous Random Variables, the Pareto and Normal Distributions A continuous random variable X can take any value in a given range (e.g. height, weight, age). The distribution of a continuous random

More information

An Introduction to Statistics Course (ECOE 1302) Spring Semester 2011 Chapter 10- TWO-SAMPLE TESTS

An Introduction to Statistics Course (ECOE 1302) Spring Semester 2011 Chapter 10- TWO-SAMPLE TESTS The Islamic University of Gaza Faculty of Commerce Department of Economics and Political Sciences An Introduction to Statistics Course (ECOE 130) Spring Semester 011 Chapter 10- TWO-SAMPLE TESTS Practice

More information

Introduction to Hypothesis Testing OPRE 6301

Introduction to Hypothesis Testing OPRE 6301 Introduction to Hypothesis Testing OPRE 6301 Motivation... The purpose of hypothesis testing is to determine whether there is enough statistical evidence in favor of a certain belief, or hypothesis, about

More information

BA 275 Review Problems - Week 5 (10/23/06-10/27/06) CD Lessons: 48, 49, 50, 51, 52 Textbook: pp. 380-394

BA 275 Review Problems - Week 5 (10/23/06-10/27/06) CD Lessons: 48, 49, 50, 51, 52 Textbook: pp. 380-394 BA 275 Review Problems - Week 5 (10/23/06-10/27/06) CD Lessons: 48, 49, 50, 51, 52 Textbook: pp. 380-394 1. Does vigorous exercise affect concentration? In general, the time needed for people to complete

More information

Chapter 7 Section 1 Homework Set A

Chapter 7 Section 1 Homework Set A Chapter 7 Section 1 Homework Set A 7.15 Finding the critical value t *. What critical value t * from Table D (use software, go to the web and type t distribution applet) should be used to calculate the

More information

Online 12 - Sections 9.1 and 9.2-Doug Ensley

Online 12 - Sections 9.1 and 9.2-Doug Ensley Student: Date: Instructor: Doug Ensley Course: MAT117 01 Applied Statistics - Ensley Assignment: Online 12 - Sections 9.1 and 9.2 1. Does a P-value of 0.001 give strong evidence or not especially strong

More information

PRACTICE PROBLEMS FOR BIOSTATISTICS

PRACTICE PROBLEMS FOR BIOSTATISTICS PRACTICE PROBLEMS FOR BIOSTATISTICS BIOSTATISTICS DESCRIBING DATA, THE NORMAL DISTRIBUTION 1. The duration of time from first exposure to HIV infection to AIDS diagnosis is called the incubation period.

More information

Testing Hypotheses About Proportions

Testing Hypotheses About Proportions Chapter 11 Testing Hypotheses About Proportions Hypothesis testing method: uses data from a sample to judge whether or not a statement about a population may be true. Steps in Any Hypothesis Test 1. Determine

More information

Chapter 8 Hypothesis Testing Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing

Chapter 8 Hypothesis Testing Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing Chapter 8 Hypothesis Testing 1 Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing 8-3 Testing a Claim About a Proportion 8-5 Testing a Claim About a Mean: s Not Known 8-6 Testing

More information

Regression Analysis: A Complete Example

Regression Analysis: A Complete Example Regression Analysis: A Complete Example This section works out an example that includes all the topics we have discussed so far in this chapter. A complete example of regression analysis. PhotoDisc, Inc./Getty

More information

BA 275 Review Problems - Week 6 (10/30/06-11/3/06) CD Lessons: 53, 54, 55, 56 Textbook: pp. 394-398, 404-408, 410-420

BA 275 Review Problems - Week 6 (10/30/06-11/3/06) CD Lessons: 53, 54, 55, 56 Textbook: pp. 394-398, 404-408, 410-420 BA 275 Review Problems - Week 6 (10/30/06-11/3/06) CD Lessons: 53, 54, 55, 56 Textbook: pp. 394-398, 404-408, 410-420 1. Which of the following will increase the value of the power in a statistical test

More information

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Sample Practice problems - chapter 12-1 and 2 proportions for inference - Z Distributions Name MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Provide

More information

Stat 411/511 THE RANDOMIZATION TEST. Charlotte Wickham. stat511.cwick.co.nz. Oct 16 2015

Stat 411/511 THE RANDOMIZATION TEST. Charlotte Wickham. stat511.cwick.co.nz. Oct 16 2015 Stat 411/511 THE RANDOMIZATION TEST Oct 16 2015 Charlotte Wickham stat511.cwick.co.nz Today Review randomization model Conduct randomization test What about CIs? Using a t-distribution as an approximation

More information

Solutions to Homework 6 Statistics 302 Professor Larget

Solutions to Homework 6 Statistics 302 Professor Larget s to Homework 6 Statistics 302 Professor Larget Textbook Exercises 5.29 (Graded for Completeness) What Proportion Have College Degrees? According to the US Census Bureau, about 27.5% of US adults over

More information

Non-Inferiority Tests for One Mean

Non-Inferiority Tests for One Mean Chapter 45 Non-Inferiority ests for One Mean Introduction his module computes power and sample size for non-inferiority tests in one-sample designs in which the outcome is distributed as a normal random

More information

Introduction. Hypothesis Testing. Hypothesis Testing. Significance Testing

Introduction. Hypothesis Testing. Hypothesis Testing. Significance Testing Introduction Hypothesis Testing Mark Lunt Arthritis Research UK Centre for Ecellence in Epidemiology University of Manchester 13/10/2015 We saw last week that we can never know the population parameters

More information

THE FIRST SET OF EXAMPLES USE SUMMARY DATA... EXAMPLE 7.2, PAGE 227 DESCRIBES A PROBLEM AND A HYPOTHESIS TEST IS PERFORMED IN EXAMPLE 7.

THE FIRST SET OF EXAMPLES USE SUMMARY DATA... EXAMPLE 7.2, PAGE 227 DESCRIBES A PROBLEM AND A HYPOTHESIS TEST IS PERFORMED IN EXAMPLE 7. THERE ARE TWO WAYS TO DO HYPOTHESIS TESTING WITH STATCRUNCH: WITH SUMMARY DATA (AS IN EXAMPLE 7.17, PAGE 236, IN ROSNER); WITH THE ORIGINAL DATA (AS IN EXAMPLE 8.5, PAGE 301 IN ROSNER THAT USES DATA FROM

More information

Recall this chart that showed how most of our course would be organized:

Recall this chart that showed how most of our course would be organized: Chapter 4 One-Way ANOVA Recall this chart that showed how most of our course would be organized: Explanatory Variable(s) Response Variable Methods Categorical Categorical Contingency Tables Categorical

More information

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as... HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1 PREVIOUSLY used confidence intervals to answer questions such as... You know that 0.25% of women have red/green color blindness. You conduct a study of men

More information

An Introduction to Statistical Tests for the SAS Programmer Sara Beck, Fred Hutchinson Cancer Research Center, Seattle, WA

An Introduction to Statistical Tests for the SAS Programmer Sara Beck, Fred Hutchinson Cancer Research Center, Seattle, WA ABSTRACT An Introduction to Statistical Tests for the SAS Programmer Sara Beck, Fred Hutchinson Cancer Research Center, Seattle, WA Often SAS Programmers find themselves in situations where performing

More information

Chapter 2. Hypothesis testing in one population

Chapter 2. Hypothesis testing in one population Chapter 2. Hypothesis testing in one population Contents Introduction, the null and alternative hypotheses Hypothesis testing process Type I and Type II errors, power Test statistic, level of significance

More information

HYPOTHESIS TESTING: POWER OF THE TEST

HYPOTHESIS TESTING: POWER OF THE TEST HYPOTHESIS TESTING: POWER OF THE TEST The first 6 steps of the 9-step test of hypothesis are called "the test". These steps are not dependent on the observed data values. When planning a research project,

More information

Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means

Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means Lesson : Comparison of Population Means Part c: Comparison of Two- Means Welcome to lesson c. This third lesson of lesson will discuss hypothesis testing for two independent means. Steps in Hypothesis

More information

socscimajor yes no TOTAL female 25 35 60 male 30 27 57 TOTAL 55 62 117

socscimajor yes no TOTAL female 25 35 60 male 30 27 57 TOTAL 55 62 117 Review for Final Stat 10 (1) The table below shows data for a sample of students from UCLA. (a) What percent of the sampled students are male? 57/117 (b) What proportion of sampled students are social

More information

University of Chicago Graduate School of Business. Business 41000: Business Statistics

University of Chicago Graduate School of Business. Business 41000: Business Statistics Name: University of Chicago Graduate School of Business Business 41000: Business Statistics Special Notes: 1. This is a closed-book exam. You may use an 8 11 piece of paper for the formulas. 2. Throughout

More information

Independent t- Test (Comparing Two Means)

Independent t- Test (Comparing Two Means) Independent t- Test (Comparing Two Means) The objectives of this lesson are to learn: the definition/purpose of independent t-test when to use the independent t-test the use of SPSS to complete an independent

More information

Factors affecting online sales

Factors affecting online sales Factors affecting online sales Table of contents Summary... 1 Research questions... 1 The dataset... 2 Descriptive statistics: The exploratory stage... 3 Confidence intervals... 4 Hypothesis tests... 4

More information

2 Precision-based sample size calculations

2 Precision-based sample size calculations Statistics: An introduction to sample size calculations Rosie Cornish. 2006. 1 Introduction One crucial aspect of study design is deciding how big your sample should be. If you increase your sample size

More information

Two-sample hypothesis testing, II 9.07 3/16/2004

Two-sample hypothesis testing, II 9.07 3/16/2004 Two-sample hypothesis testing, II 9.07 3/16/004 Small sample tests for the difference between two independent means For two-sample tests of the difference in mean, things get a little confusing, here,

More information

Chapter 9: Two-Sample Inference

Chapter 9: Two-Sample Inference Chapter 9: Two-Sample Inference Chapter 7 discussed methods of hypothesis testing about one-population parameters. Chapter 8 discussed methods of estimating population parameters from one sample using

More information

Lecture Notes Module 1

Lecture Notes Module 1 Lecture Notes Module 1 Study Populations A study population is a clearly defined collection of people, animals, plants, or objects. In psychological research, a study population usually consists of a specific

More information

A POPULATION MEAN, CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

A POPULATION MEAN, CONFIDENCE INTERVALS AND HYPOTHESIS TESTING CHAPTER 5. A POPULATION MEAN, CONFIDENCE INTERVALS AND HYPOTHESIS TESTING 5.1 Concepts When a number of animals or plots are exposed to a certain treatment, we usually estimate the effect of the treatment

More information

12: Analysis of Variance. Introduction

12: Analysis of Variance. Introduction 1: Analysis of Variance Introduction EDA Hypothesis Test Introduction In Chapter 8 and again in Chapter 11 we compared means from two independent groups. In this chapter we extend the procedure to consider

More information

Adverse Impact Ratio for Females (0/ 1) = 0 (5/ 17) = 0.2941 Adverse impact as defined by the 4/5ths rule was not found in the above data.

Adverse Impact Ratio for Females (0/ 1) = 0 (5/ 17) = 0.2941 Adverse impact as defined by the 4/5ths rule was not found in the above data. 1 of 9 12/8/2014 12:57 PM (an On-Line Internet based application) Instructions: Please fill out the information into the form below. Once you have entered your data below, you may select the types of analysis

More information

Simple linear regression

Simple linear regression Simple linear regression Introduction Simple linear regression is a statistical method for obtaining a formula to predict values of one variable from another where there is a causal relationship between

More information

A) 0.1554 B) 0.0557 C) 0.0750 D) 0.0777

A) 0.1554 B) 0.0557 C) 0.0750 D) 0.0777 Math 210 - Exam 4 - Sample Exam 1) What is the p-value for testing H1: µ < 90 if the test statistic is t=-1.592 and n=8? A) 0.1554 B) 0.0557 C) 0.0750 D) 0.0777 2) The owner of a football team claims that

More information

a) Find the five point summary for the home runs of the National League teams. b) What is the mean number of home runs by the American League teams?

a) Find the five point summary for the home runs of the National League teams. b) What is the mean number of home runs by the American League teams? 1. Phone surveys are sometimes used to rate TV shows. Such a survey records several variables listed below. Which ones of them are categorical and which are quantitative? - the number of people watching

More information

Hypothesis Tests for 1 sample Proportions

Hypothesis Tests for 1 sample Proportions Hypothesis Tests for 1 sample Proportions 1. Hypotheses. Write the null and alternative hypotheses you would use to test each of the following situations. a) A governor is concerned about his "negatives"

More information

Standard Deviation Estimator

Standard Deviation Estimator CSS.com Chapter 905 Standard Deviation Estimator Introduction Even though it is not of primary interest, an estimate of the standard deviation (SD) is needed when calculating the power or sample size of

More information

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4)

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4) Summary of Formulas and Concepts Descriptive Statistics (Ch. 1-4) Definitions Population: The complete set of numerical information on a particular quantity in which an investigator is interested. We assume

More information

Nonparametric Two-Sample Tests. Nonparametric Tests. Sign Test

Nonparametric Two-Sample Tests. Nonparametric Tests. Sign Test Nonparametric Two-Sample Tests Sign test Mann-Whitney U-test (a.k.a. Wilcoxon two-sample test) Kolmogorov-Smirnov Test Wilcoxon Signed-Rank Test Tukey-Duckworth Test 1 Nonparametric Tests Recall, nonparametric

More information

Unit 26 Estimation with Confidence Intervals

Unit 26 Estimation with Confidence Intervals Unit 26 Estimation with Confidence Intervals Objectives: To see how confidence intervals are used to estimate a population proportion, a population mean, a difference in population proportions, or a difference

More information

Permutation Tests for Comparing Two Populations

Permutation Tests for Comparing Two Populations Permutation Tests for Comparing Two Populations Ferry Butar Butar, Ph.D. Jae-Wan Park Abstract Permutation tests for comparing two populations could be widely used in practice because of flexibility of

More information

Math 251, Review Questions for Test 3 Rough Answers

Math 251, Review Questions for Test 3 Rough Answers Math 251, Review Questions for Test 3 Rough Answers 1. (Review of some terminology from Section 7.1) In a state with 459,341 voters, a poll of 2300 voters finds that 45 percent support the Republican candidate,

More information

Introduction to Analysis of Variance (ANOVA) Limitations of the t-test

Introduction to Analysis of Variance (ANOVA) Limitations of the t-test Introduction to Analysis of Variance (ANOVA) The Structural Model, The Summary Table, and the One- Way ANOVA Limitations of the t-test Although the t-test is commonly used, it has limitations Can only

More information

Chapter 8 Section 1. Homework A

Chapter 8 Section 1. Homework A Chapter 8 Section 1 Homework A 8.7 Can we use the large-sample confidence interval? In each of the following circumstances state whether you would use the large-sample confidence interval. The variable

More information

DATA INTERPRETATION AND STATISTICS

DATA INTERPRETATION AND STATISTICS PholC60 September 001 DATA INTERPRETATION AND STATISTICS Books A easy and systematic introductory text is Essentials of Medical Statistics by Betty Kirkwood, published by Blackwell at about 14. DESCRIPTIVE

More information

research/scientific includes the following: statistical hypotheses: you have a null and alternative you accept one and reject the other

research/scientific includes the following: statistical hypotheses: you have a null and alternative you accept one and reject the other 1 Hypothesis Testing Richard S. Balkin, Ph.D., LPC-S, NCC 2 Overview When we have questions about the effect of a treatment or intervention or wish to compare groups, we use hypothesis testing Parametric

More information

Statistics I for QBIC. Contents and Objectives. Chapters 1 7. Revised: August 2013

Statistics I for QBIC. Contents and Objectives. Chapters 1 7. Revised: August 2013 Statistics I for QBIC Text Book: Biostatistics, 10 th edition, by Daniel & Cross Contents and Objectives Chapters 1 7 Revised: August 2013 Chapter 1: Nature of Statistics (sections 1.1-1.6) Objectives

More information

Premaster Statistics Tutorial 4 Full solutions

Premaster Statistics Tutorial 4 Full solutions Premaster Statistics Tutorial 4 Full solutions Regression analysis Q1 (based on Doane & Seward, 4/E, 12.7) a. Interpret the slope of the fitted regression = 125,000 + 150. b. What is the prediction for

More information

Math 108 Exam 3 Solutions Spring 00

Math 108 Exam 3 Solutions Spring 00 Math 108 Exam 3 Solutions Spring 00 1. An ecologist studying acid rain takes measurements of the ph in 12 randomly selected Adirondack lakes. The results are as follows: 3.0 6.5 5.0 4.2 5.5 4.7 3.4 6.8

More information

9 Testing the Difference

9 Testing the Difference blu49076_ch09.qxd 5/1/2003 8:19 AM Page 431 c h a p t e r 9 9 Testing the Difference Between Two Means, Two Variances, and Two Proportions Outline 9 1 Introduction 9 2 Testing the Difference Between Two

More information

Introduction to Statistics and Quantitative Research Methods

Introduction to Statistics and Quantitative Research Methods Introduction to Statistics and Quantitative Research Methods Purpose of Presentation To aid in the understanding of basic statistics, including terminology, common terms, and common statistical methods.

More information

Non-Parametric Tests (I)

Non-Parametric Tests (I) Lecture 5: Non-Parametric Tests (I) KimHuat LIM lim@stats.ox.ac.uk http://www.stats.ox.ac.uk/~lim/teaching.html Slide 1 5.1 Outline (i) Overview of Distribution-Free Tests (ii) Median Test for Two Independent

More information

Statistics 100 Sample Final Questions (Note: These are mostly multiple choice, for extra practice. Your Final Exam will NOT have any multiple choice!

Statistics 100 Sample Final Questions (Note: These are mostly multiple choice, for extra practice. Your Final Exam will NOT have any multiple choice! Statistics 100 Sample Final Questions (Note: These are mostly multiple choice, for extra practice. Your Final Exam will NOT have any multiple choice!) Part A - Multiple Choice Indicate the best choice

More information

Statistiek I. Proportions aka Sign Tests. John Nerbonne. CLCG, Rijksuniversiteit Groningen. http://www.let.rug.nl/nerbonne/teach/statistiek-i/

Statistiek I. Proportions aka Sign Tests. John Nerbonne. CLCG, Rijksuniversiteit Groningen. http://www.let.rug.nl/nerbonne/teach/statistiek-i/ Statistiek I Proportions aka Sign Tests John Nerbonne CLCG, Rijksuniversiteit Groningen http://www.let.rug.nl/nerbonne/teach/statistiek-i/ John Nerbonne 1/34 Proportions aka Sign Test The relative frequency

More information

Understanding Confidence Intervals and Hypothesis Testing Using Excel Data Table Simulation

Understanding Confidence Intervals and Hypothesis Testing Using Excel Data Table Simulation Understanding Confidence Intervals and Hypothesis Testing Using Excel Data Table Simulation Leslie Chandrakantha lchandra@jjay.cuny.edu Department of Mathematics & Computer Science John Jay College of

More information

Chapter 7 Section 7.1: Inference for the Mean of a Population

Chapter 7 Section 7.1: Inference for the Mean of a Population Chapter 7 Section 7.1: Inference for the Mean of a Population Now let s look at a similar situation Take an SRS of size n Normal Population : N(, ). Both and are unknown parameters. Unlike what we used

More information

Calculating P-Values. Parkland College. Isela Guerra Parkland College. Recommended Citation

Calculating P-Values. Parkland College. Isela Guerra Parkland College. Recommended Citation Parkland College A with Honors Projects Honors Program 2014 Calculating P-Values Isela Guerra Parkland College Recommended Citation Guerra, Isela, "Calculating P-Values" (2014). A with Honors Projects.

More information

Fixed-Effect Versus Random-Effects Models

Fixed-Effect Versus Random-Effects Models CHAPTER 13 Fixed-Effect Versus Random-Effects Models Introduction Definition of a summary effect Estimating the summary effect Extreme effect size in a large study or a small study Confidence interval

More information

STAT 350 Practice Final Exam Solution (Spring 2015)

STAT 350 Practice Final Exam Solution (Spring 2015) PART 1: Multiple Choice Questions: 1) A study was conducted to compare five different training programs for improving endurance. Forty subjects were randomly divided into five groups of eight subjects

More information

Two-sample inference: Continuous data

Two-sample inference: Continuous data Two-sample inference: Continuous data Patrick Breheny April 5 Patrick Breheny STA 580: Biostatistics I 1/32 Introduction Our next two lectures will deal with two-sample inference for continuous data As

More information

Chapter 5 Analysis of variance SPSS Analysis of variance

Chapter 5 Analysis of variance SPSS Analysis of variance Chapter 5 Analysis of variance SPSS Analysis of variance Data file used: gss.sav How to get there: Analyze Compare Means One-way ANOVA To test the null hypothesis that several population means are equal,

More information

Comparing Means Between Groups

Comparing Means Between Groups Comparing Means Between Groups Michael Ash Lecture 6 Summary of Main Points Comparing means between groups is an important method for program evaluation by policy analysts and public administrators. The

More information

Experimental Design. Power and Sample Size Determination. Proportions. Proportions. Confidence Interval for p. The Binomial Test

Experimental Design. Power and Sample Size Determination. Proportions. Proportions. Confidence Interval for p. The Binomial Test Experimental Design Power and Sample Size Determination Bret Hanlon and Bret Larget Department of Statistics University of Wisconsin Madison November 3 8, 2011 To this point in the semester, we have largely

More information

Two Related Samples t Test

Two Related Samples t Test Two Related Samples t Test In this example 1 students saw five pictures of attractive people and five pictures of unattractive people. For each picture, the students rated the friendliness of the person

More information

SCHOOL OF HEALTH AND HUMAN SCIENCES DON T FORGET TO RECODE YOUR MISSING VALUES

SCHOOL OF HEALTH AND HUMAN SCIENCES DON T FORGET TO RECODE YOUR MISSING VALUES SCHOOL OF HEALTH AND HUMAN SCIENCES Using SPSS Topics addressed today: 1. Differences between groups 2. Graphing Use the s4data.sav file for the first part of this session. DON T FORGET TO RECODE YOUR

More information

Simple Linear Regression Inference

Simple Linear Regression Inference Simple Linear Regression Inference 1 Inference requirements The Normality assumption of the stochastic term e is needed for inference even if it is not a OLS requirement. Therefore we have: Interpretation

More information

Hypothesis Testing: Two Means, Paired Data, Two Proportions

Hypothesis Testing: Two Means, Paired Data, Two Proportions Chapter 10 Hypothesis Testing: Two Means, Paired Data, Two Proportions 10.1 Hypothesis Testing: Two Population Means and Two Population Proportions 1 10.1.1 Student Learning Objectives By the end of this

More information

Name: (b) Find the minimum sample size you should use in order for your estimate to be within 0.03 of p when the confidence level is 95%.

Name: (b) Find the minimum sample size you should use in order for your estimate to be within 0.03 of p when the confidence level is 95%. Chapter 7-8 Exam Name: Answer the questions in the spaces provided. If you run out of room, show your work on a separate paper clearly numbered and attached to this exam. Please indicate which program

More information

Final Exam Practice Problem Answers

Final Exam Practice Problem Answers Final Exam Practice Problem Answers The following data set consists of data gathered from 77 popular breakfast cereals. The variables in the data set are as follows: Brand: The brand name of the cereal

More information

August 2012 EXAMINATIONS Solution Part I

August 2012 EXAMINATIONS Solution Part I August 01 EXAMINATIONS Solution Part I (1) In a random sample of 600 eligible voters, the probability that less than 38% will be in favour of this policy is closest to (B) () In a large random sample,

More information

Sample Size and Power in Clinical Trials

Sample Size and Power in Clinical Trials Sample Size and Power in Clinical Trials Version 1.0 May 011 1. Power of a Test. Factors affecting Power 3. Required Sample Size RELATED ISSUES 1. Effect Size. Test Statistics 3. Variation 4. Significance

More information

Binary Diagnostic Tests Two Independent Samples

Binary Diagnostic Tests Two Independent Samples Chapter 537 Binary Diagnostic Tests Two Independent Samples Introduction An important task in diagnostic medicine is to measure the accuracy of two diagnostic tests. This can be done by comparing summary

More information

Statistics Review PSY379

Statistics Review PSY379 Statistics Review PSY379 Basic concepts Measurement scales Populations vs. samples Continuous vs. discrete variable Independent vs. dependent variable Descriptive vs. inferential stats Common analyses

More information

Review #2. Statistics

Review #2. Statistics Review #2 Statistics Find the mean of the given probability distribution. 1) x P(x) 0 0.19 1 0.37 2 0.16 3 0.26 4 0.02 A) 1.64 B) 1.45 C) 1.55 D) 1.74 2) The number of golf balls ordered by customers of

More information

Hypothesis Testing. Steps for a hypothesis test:

Hypothesis Testing. Steps for a hypothesis test: Hypothesis Testing Steps for a hypothesis test: 1. State the claim H 0 and the alternative, H a 2. Choose a significance level or use the given one. 3. Draw the sampling distribution based on the assumption

More information

How To Check For Differences In The One Way Anova

How To Check For Differences In The One Way Anova MINITAB ASSISTANT WHITE PAPER This paper explains the research conducted by Minitab statisticians to develop the methods and data checks used in the Assistant in Minitab 17 Statistical Software. One-Way

More information

Sample Size Planning, Calculation, and Justification

Sample Size Planning, Calculation, and Justification Sample Size Planning, Calculation, and Justification Theresa A Scott, MS Vanderbilt University Department of Biostatistics theresa.scott@vanderbilt.edu http://biostat.mc.vanderbilt.edu/theresascott Theresa

More information

Difference of Means and ANOVA Problems

Difference of Means and ANOVA Problems Difference of Means and Problems Dr. Tom Ilvento FREC 408 Accounting Firm Study An accounting firm specializes in auditing the financial records of large firm It is interested in evaluating its fee structure,particularly

More information