Chapter 9: Hypothesis Testing GBS221, Class April 15, 2013 Notes Compiled by Nicolas C. Rouse, Instructor, Phoenix College

Similar documents
HYPOTHESIS TESTING: POWER OF THE TEST

3.4 Statistical inference for 2 populations based on two samples

Introduction to. Hypothesis Testing CHAPTER LEARNING OBJECTIVES. 1 Identify the four steps of hypothesis testing.

Hypothesis Testing --- One Mean

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

Section 7.1. Introduction to Hypothesis Testing. Schrodinger s cat quantum mechanics thought experiment (1935)

22. HYPOTHESIS TESTING

Hypothesis Testing. Reminder of Inferential Statistics. Hypothesis Testing: Introduction

Chapter 8 Hypothesis Testing Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing

Chapter 2. Hypothesis testing in one population

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...

An Introduction to Statistics Course (ECOE 1302) Spring Semester 2011 Chapter 10- TWO-SAMPLE TESTS

1 Hypothesis Testing. H 0 : population parameter = hypothesized value:

Testing a claim about a population mean

Introduction to Hypothesis Testing. Hypothesis Testing. Step 1: State the Hypotheses

Introduction to Hypothesis Testing OPRE 6301

How To Test For Significance On A Data Set

Business Statistics, 9e (Groebner/Shannon/Fry) Chapter 9 Introduction to Hypothesis Testing

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...

Hypothesis Testing. Hypothesis Testing

Microsoft Excel 2010 and Tools for Statistical Analysis

Hypothesis testing - Steps

Difference of Means and ANOVA Problems

In the past, the increase in the price of gasoline could be attributed to major national or global

Lesson 9 Hypothesis Testing

A POPULATION MEAN, CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

Bowerman, O'Connell, Aitken Schermer, & Adcock, Business Statistics in Practice, Canadian edition

Introduction to Hypothesis Testing

Mind on Statistics. Chapter 12

Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1)

t Tests in Excel The Excel Statistical Master By Mark Harmon Copyright 2011 Mark Harmon

5/31/2013. Chapter 8 Hypothesis Testing. Hypothesis Testing. Hypothesis Testing. Outline. Objectives. Objectives

Using Microsoft Excel to Analyze Data

Understanding Confidence Intervals and Hypothesis Testing Using Excel Data Table Simulation

Testing Hypotheses About Proportions

Chapter 8: Hypothesis Testing for One Population Mean, Variance, and Proportion

WISE Power Tutorial All Exercises

Module 4 (Effect of Alcohol on Worms): Data Analysis

CHAPTER IV FINDINGS AND CONCURRENT DISCUSSIONS

HYPOTHESIS TESTING WITH SPSS:

Two-Sample T-Tests Assuming Equal Variance (Enter Means)

Comparing Multiple Proportions, Test of Independence and Goodness of Fit

Two-Sample T-Tests Allowing Unequal Variance (Enter Difference)

Two Related Samples t Test

Statistical Functions in Excel

Review #2. Statistics

Describing Populations Statistically: The Mean, Variance, and Standard Deviation

Using Microsoft Excel to Analyze Data from the Disk Diffusion Assay

Test Positive True Positive False Positive. Test Negative False Negative True Negative. Figure 5-1: 2 x 2 Contingency Table

Hypothesis testing. c 2014, Jeffrey S. Simonoff 1

Is it statistically significant? The chi-square test

Chapter 7 TEST OF HYPOTHESIS

AP: LAB 8: THE CHI-SQUARE TEST. Probability, Random Chance, and Genetics

"Statistical methods are objective methods by which group trends are abstracted from observations on many separate individuals." 1

Chapter 23. Two Categorical Variables: The Chi-Square Test

Simulating Chi-Square Test Using Excel

Hypothesis Testing: Two Means, Paired Data, Two Proportions

Calculating P-Values. Parkland College. Isela Guerra Parkland College. Recommended Citation

Estimation of σ 2, the variance of ɛ

MONT 107N Understanding Randomness Solutions For Final Examination May 11, 2010

FAT-FREE OR REGULAR PRINGLES: CAN TASTERS TELL THE DIFFERENCE?

When to use Excel. When NOT to use Excel 9/24/2014

LAB : THE CHI-SQUARE TEST. Probability, Random Chance, and Genetics

6: Introduction to Hypothesis Testing

Regression Analysis: A Complete Example

Unit 26 Estimation with Confidence Intervals

Stats for Strategy Fall 2012 First-Discussion Handout: Stats Using Calculators and MINITAB

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

DDBA 8438: Introduction to Hypothesis Testing Video Podcast Transcript

Comparing Two Groups. Standard Error of ȳ 1 ȳ 2. Setting. Two Independent Samples

Introduction to Analysis of Variance (ANOVA) Limitations of the t-test

Tests for One Proportion

THE FIRST SET OF EXAMPLES USE SUMMARY DATA... EXAMPLE 7.2, PAGE 227 DESCRIBES A PROBLEM AND A HYPOTHESIS TEST IS PERFORMED IN EXAMPLE 7.

Descriptive Statistics

Chapter 2 Probability Topics SPSS T tests

Point Biserial Correlation Tests

Data Mining Techniques Chapter 5: The Lure of Statistics: Data Mining Using Familiar Tools

Math 58. Rumbos Fall Solutions to Review Problems for Exam 2

BA 275 Review Problems - Week 5 (10/23/06-10/27/06) CD Lessons: 48, 49, 50, 51, 52 Textbook: pp

1.5 Oneway Analysis of Variance

12.5: CHI-SQUARE GOODNESS OF FIT TESTS

EXCEL Analysis TookPak [Statistical Analysis] 1. First of all, check to make sure that the Analysis ToolPak is installed. Here is how you do it:

SCHOOL OF HEALTH AND HUMAN SCIENCES DON T FORGET TO RECODE YOUR MISSING VALUES

Math 108 Exam 3 Solutions Spring 00

MULTIPLE REGRESSION EXAMPLE

Tests for Two Proportions

BA 275 Review Problems - Week 6 (10/30/06-11/3/06) CD Lessons: 53, 54, 55, 56 Textbook: pp , ,

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Simple Regression Theory II 2010 Samuel L. Baker

Variables Control Charts

November 08, S8.6_3 Testing a Claim About a Standard Deviation or Variance

Independent samples t-test. Dr. Tom Pierce Radford University

CHAPTER 13. Experimental Design and Analysis of Variance

Non-Inferiority Tests for Two Means using Differences

How To Check For Differences In The One Way Anova

Two Correlated Proportions (McNemar Test)

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4)

Permutation Tests for Comparing Two Populations

Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means

Transcription:

Chapter Objectives 1. Learn how to formulate and test hypotheses about a population mean and a population proportion. 2. Be able to use an Excel worksheet to conduct hypothesis tests about population means and proportions. 3. Understand the types of errors possible when conducting a hypothesis test. 4. Be able to determine the probability of making various errors in hypothesis tests. 5. Know how to compute and interpret p-values. 6. Know the definition of the following terms: null hypothesis, alternative hypothesis, type I error, type II error, critical value, level of significance, one-tailed test, two-tailed test, p-value 1. Developing Null and Alternative Hypotheses Hypothesis testing can be used to determine whether a statement about the value of a population parameter should or should not be rejected. The null hypothesis, denoted by H 0, is a tentative assumption about a population parameter. The alternative hypothesis, denoted by H a. is the opposite of what is stated in the null. Hypothesis testing is similar to a criminal trial. The hypotheses are: o H 0 : The defendant is innocent o H a : The defendant is guilty. Testing the Validity of a Claim o Manufacturers' claims are usually given the benefit of the doubt and stated as the null hypothesis. o The conclusion that the claim is false comes from sample data that contradict the null hypothesis. Testing in Decision-Making Situations o A decision maker might have to choose between two courses of action, one associated with the null hypothesis and another associated with the alternative hypothesis. o Example: Accepting a shipment of goods from a supplier or returning the shipment of goods to the supplier. Summary of Forms for Null and Alternative Hypotheses: Population Mean o The equality part of the hypotheses always appears in the null hypothesis. o In general, a hypothesis test about the value of a population mean must take one of the following three forms (where H 0 is the hypothesized value of the population mean). H 0 : µ > µ 0 H 0 : µ < µ 0 H 0 : µ = µ 0 H a : µ < µ 0 H a : µ > µ 0 H a : µ!= µ 0 2. Using Excel for Hypothesis Tests Excel does not provide built-in routines for the hypothesis tests presented in this chapter. To handle these situations, we present Excel worksheets that we designed to use as templates

for testing hypotheses about a population mean and a population proportion. The worksheets are easy to use and can be modified to handle any sample data. For the population mean when σ is known (Data File: Hyp Sigma Known.xlsx) o We illustrate using the MaxFlight golf ball distance example in Section 9.3. The data are in column A of an Excel worksheet. The population standard deviation σ = 12 is assumed known and the level of significance is α =.05. The following steps can be used to test the hypothesis H 0 : μ = 295 versus H a : μ!= 295. (!= means not equal) o Refer to Figure 9.8 as we describe the procedure. The worksheet in the background shows the cell formulas used to compute the results shown in the foreground worksheet. o The data are entered into cells A2:A51. The following steps are necessary to use the template for this data set. Step 1. Enter the data range A2:A51 into the =COUNT cell formula in cell D4 Step 2. Enter the data range A2:A51 into the =AVERAGE cell formula in cell D5 Step 3. Enter the population standard deviation σ = 12 into cell D6 Step 4. Enter the hypothesized value for the population mean 295 into cell D8 o The remaining cell formulas automatically provide the standard error, the value of the test statistic z, and three p-values. Because the alternative hypothesis ( μ 0!= 295) indicates a two-tailed test, the p-value (Two Tail) in cell D15 is used to make the rejection decision. o With p-value =.1255 = α >.05, the null hypothesis cannot be rejected. The p-values in cells D13 or D14 would be used if the hypotheses involved a one-tailed test. For the population mean when σ is unknown (Data File: Hyp Sigma Unknown.xlsx) o We illustrate using the Heathrow Airport rating example in Section 9.4. The data are in column A of an Excel worksheet. The population standard deviation σ is unknown and will be estimated by the sample standard deviation s. The level of significance is α =.05. The following steps can be used to test the hypothesis H 0 : μ < 7 versus H a : μ > 7. o Refer to Figure 9.9 as we describe the procedure. The background worksheet shows the cell formulas used to compute the results shown in the foreground version of the worksheet. The data are entered into cells A2:A61. The following steps are necessary to use the template for this data set. Step 1. Enter the data range A2:A61 into the =COUNT cell formula in cell D4 Step 2. Enter the data range A2:A61 into the =AVERAGE cell formula in cell D5 Step 3. Enter the data range A2:A61 into the =STDEV cell formula in cell D6 Step 4. Enter the hypothesized value for the population mean 7 into cell D8 o The remaining cell formulas automatically provide the standard error, the value of the test statistic t, the number of degrees of freedom, and three p-values. Because the alternative hypothesis (μ > 7) indicates an upper tail test, the p-value (Upper Tail) in cell D15 is used to make the decision. With p-value =.0353 < α =.05, the null hypothesis is rejected. The p-values in cells D14 or D16 would be used if the hypotheses involved a lower tail test or a two-tailed test.

o This template can be used to make hypothesis testing computations for other applications. For instance, to conduct a hypothesis test for a new data set, enter the new sample data into column A of the worksheet and modify the formulas in cells D4, D5, and D6 to correspond to the new data range. Enter the hypothesized value for the population mean into cell D8 to obtain the results. If the new sample data have already been summarized, the new sample data do not have to be entered into the worksheet. In this case, enter the sample size into cell D4, the sample mean into cell D5, the sample standard deviation into cell D6, and the hypothesized value for the population mean into cell D8 to obtain the results. For population proportion (Data File: Hypothesis p.xlsx) o We illustrate using the Pine Creek golf course survey data presented in Section 9.5. The data of Male or Female golfer are in column A of an Excel worksheet. Refer to Figure 9.10 as we describe the procedure. The background worksheet shows the cell formulas used to compute the results shown in the foreground worksheet. The data are entered into cells A2:A401. The following steps can be used to test the hypothesis H 0 : p <.20 versus H a : p >.20. Step 1. Enter the data range A2:A401 into the =COUNTA cell formula in cell D3 Step 2. Enter Female as the response of interest in cell D4 Step 3. Enter the data range A2:A401 into the =COUNTIF cell formula in cell D5 Step 4. Enter the hypothesized value for the population proportion.20 into cell D8 o The remaining cell formulas automatically provide the standard error, the value of the test statistic z, and three p-values. Because the alternative hypothesis (p 0 >.20) indicates an upper tail test, the p-value (Upper Tail) in cell D14 is used to make the decision. o With p-value =.0062 < α =.05, the null hypothesis is rejected. The p-values in cells D13 or D15 would be used if the hypothesis involved a lower tail test or a two-tailed test. o This template can be used to make hypothesis testing computations for other applications. For instance, to conduct a hypothesis test for a new data set, enter the new sample data into column A of the worksheet. Modify the formulas in cells D3 and D5 to correspond to the new data range. Enter the response of interest into cell D4 and the hypothesized value for the population proportion into cell D8 to obtain the results. If the new sample data have already been summarized, the new sample data do not have to be entered into the worksheet. In this case, enter the sample size into cell D3, the sample proportion into cell D6, and the hypothesized value for the population proportion into cell D8 to obtain the results. 3. and 4. Errors The null and alternative hypotheses are competing statements about the population. Either the null hypothesis H 0 is true or the alternative hypothesis Ha is true, but not both. Ideally the hypothesis testing procedure should lead to the acceptance of H 0 when H 0 is true and the

rejection of H 0 when H a is true. Unfortunately, the correct conclusions are not always possible. Because hypothesis tests are based on sample information, we must allow for the possibility of errors. The first row of Table 9.1 shows what can happen if the conclusion is to accept H 0. If H 0 is true, this conclusion is correct. However, if Ha is true, we make a Type II error; that is, we accept H 0 when it is false. The second row of Table 9.1 shows what can happen if the conclusion is to reject H 0. If H 0 is true, we make a Type I error; that is, we reject H 0 when it is true. However, if Ha is true, rejecting H 0 is correct. The probability of making a Type I error when the null hypothesis is true as an equality is called the level of significance. In practice, the person responsible for the hypothesis test specifies the level of significance. By selecting α, that person is controlling the probability of making a Type I error. If the cost of making a Type I error is high, small values of α are preferred. If the cost of making a Type I error is not too high, larger values of α are typically used. Applications of hypothesis testing that only control for the Type I error are called significance tests. Many applications of hypothesis testing are of this type. 5. p-values A p-value is a probability that provides a measure of the evidence against the null hypothesis provided by the sample. Smaller p-values indicate more evidence against H 0. The p-value is used to determine whether the null hypothesis should be rejected. A small p-value indicates the value of the test statistic is unusual given the assumption that H0 is true. The value of the test statistic is used to compute the p-value. The method used depends on whether the test is a lower tail, an upper tail, or a two-tailed test. For a lower tail test, the p-value is the probability of obtaining a value for the test statistic as small as or smaller than that provided by the sample. Thus, to compute the p-value for the lower tail test in the σ known case, we must find the area under the standard normal curve to the left of the test statistic. After computing the p-value, we must then decide whether it is small enough to reject the null hypothesis; the decision involves comparing the p-value to the level of significance. Rejection rule using p-values o Reject H 0 if p-value < α Critical Value Approach o The critical value approach requires that we first determine a value for the test statistic called the critical value. For a lower tail test, the critical value serves as a benchmark for determining whether the value of the test statistic is small enough to

KEY TERMS reject the null hypothesis. It is the value of the test statistic that corresponds to an area of α (the level of significance) in the lower tail of the sampling distribution of the test statistic. In other words, the critical value is the largest value of the test statistic that will result in the rejection of the null hypothesis. o In the σ known case, the sampling distribution for the test statistic z is a standard normal distribution. o Rejection Rule Reject H 0 if z < -z α where -z α is the critical value; that is, the z value that provides an area of α in the lower tail of the standard normal distribution. Null hypothesis The hypothesis tentatively assumed true in the hypothesis testing procedure. Alternative hypothesis The hypothesis concluded to be true if the null hypothesis is rejected. Type I error The error of rejecting H 0 when it is true. Type II error The error of accepting H 0 when it is false. Level of significance The probability of making a Type I error when the null hypothesis is true as an equality. One-tailed test A hypothesis test in which rejection of the null hypothesis occurs for values of the test statistic in one tail of its sampling distribution. Test statistic A statistic whose value helps determine whether a null hypothesis should be rejected. p-value A probability that provides a measure of the evidence against the null hypothesis provided by the sample. Smaller p-values indicate more evidence against H 0. For a lower tail test, the p-value is the probability of obtaining a value for the test statistic as small as or smaller than that provided by the sample. For an upper tail test, the p-value is the probability of obtaining a value for the test statistic as large as or larger than that provided by the sample. For a two-tailed test, the p-value is the probability of obtaining a value for the test statistic at least as unlikely as or more unlikely than that provided by the sample. Critical value A value that is compared with the test statistic to determine whether H 0 should be rejected. Two-tailed test A hypothesis test in which rejection of the null hypothesis occurs for values of the test statistic in either tail of its sampling distribution.

KEY FORMULAS