Chapter 8 Hypothesis Testing Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing

Similar documents

Hypothesis Testing --- One Mean

Section 7.1. Introduction to Hypothesis Testing. Schrodinger s cat quantum mechanics thought experiment (1935)

Introduction to Hypothesis Testing. Hypothesis Testing. Step 1: State the Hypotheses

HYPOTHESIS TESTING: POWER OF THE TEST

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...

Hypothesis Testing: Two Means, Paired Data, Two Proportions

Hypothesis Testing. Reminder of Inferential Statistics. Hypothesis Testing: Introduction

Mind on Statistics. Chapter 12

Non-Parametric Tests (I)

Introduction to Hypothesis Testing OPRE 6301

An Introduction to Statistics Course (ECOE 1302) Spring Semester 2011 Chapter 10- TWO-SAMPLE TESTS

BA 275 Review Problems - Week 6 (10/30/06-11/3/06) CD Lessons: 53, 54, 55, 56 Textbook: pp , ,

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...

5/31/2013. Chapter 8 Hypothesis Testing. Hypothesis Testing. Hypothesis Testing. Outline. Objectives. Objectives

Hypothesis testing - Steps

C. The null hypothesis is not rejected when the alternative hypothesis is true. A. population parameters.

How To Test For Significance On A Data Set

Experimental Design. Power and Sample Size Determination. Proportions. Proportions. Confidence Interval for p. The Binomial Test

Math 251, Review Questions for Test 3 Rough Answers

Name: (b) Find the minimum sample size you should use in order for your estimate to be within 0.03 of p when the confidence level is 95%.

THE FIRST SET OF EXAMPLES USE SUMMARY DATA... EXAMPLE 7.2, PAGE 227 DESCRIBES A PROBLEM AND A HYPOTHESIS TEST IS PERFORMED IN EXAMPLE 7.

Correlational Research

3.4 Statistical inference for 2 populations based on two samples

Hypothesis testing. c 2014, Jeffrey S. Simonoff 1

Lesson 9 Hypothesis Testing

Introduction to Hypothesis Testing

22. HYPOTHESIS TESTING

Chapter 2. Hypothesis testing in one population

Simple Linear Regression Inference

Comparing Two Groups. Standard Error of ȳ 1 ȳ 2. Setting. Two Independent Samples

Psychology 60 Fall 2013 Practice Exam Actual Exam: Next Monday. Good luck!

UNDERSTANDING THE TWO-WAY ANOVA

Sample Size and Power in Clinical Trials

BA 275 Review Problems - Week 5 (10/23/06-10/27/06) CD Lessons: 48, 49, 50, 51, 52 Textbook: pp

p ˆ (sample mean and sample

WISE Power Tutorial All Exercises

Introduction to. Hypothesis Testing CHAPTER LEARNING OBJECTIVES. 1 Identify the four steps of hypothesis testing.

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Statistics 2014 Scoring Guidelines

1 Hypothesis Testing. H 0 : population parameter = hypothesized value:

5.1 Identifying the Target Parameter

HYPOTHESIS TESTING WITH SPSS:

Understand the role that hypothesis testing plays in an improvement project. Know how to perform a two sample hypothesis test.

In the past, the increase in the price of gasoline could be attributed to major national or global

8 6 X 2 Test for a Variance or Standard Deviation

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

Review. March 21, S7.1 2_3 Estimating a Population Proportion. Chapter 7 Estimates and Sample Sizes. Test 2 (Chapters 4, 5, & 6) Results

Chapter 7 TEST OF HYPOTHESIS

Stat 411/511 THE RANDOMIZATION TEST. Charlotte Wickham. stat511.cwick.co.nz. Oct

research/scientific includes the following: statistical hypotheses: you have a null and alternative you accept one and reject the other

Independent t- Test (Comparing Two Means)

WHERE DOES THE 10% CONDITION COME FROM?

II. DISTRIBUTIONS distribution normal distribution. standard scores

Two-Sample T-Tests Assuming Equal Variance (Enter Means)

Stats Review Chapters 9-10

UNDERSTANDING THE DEPENDENT-SAMPLES t TEST

Estimation of σ 2, the variance of ɛ

Unit 26 Estimation with Confidence Intervals

AP: LAB 8: THE CHI-SQUARE TEST. Probability, Random Chance, and Genetics

Module 2 Probability and Statistics

LAB : THE CHI-SQUARE TEST. Probability, Random Chance, and Genetics

Two-Sample T-Tests Allowing Unequal Variance (Enter Difference)

Business Statistics, 9e (Groebner/Shannon/Fry) Chapter 9 Introduction to Hypothesis Testing

Chapter 8: Hypothesis Testing for One Population Mean, Variance, and Proportion

SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question.

Statistiek II. John Nerbonne. October 1, Dept of Information Science

Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1)

Online 12 - Sections 9.1 and 9.2-Doug Ensley

Chapter 7 Notes - Inference for Single Samples. You know already for a large sample, you can invoke the CLT so:

Introduction. Hypothesis Testing. Hypothesis Testing. Significance Testing

1 Nonparametric Statistics

Premaster Statistics Tutorial 4 Full solutions

ELEMENTARY STATISTICS

Final Exam Practice Problem Answers

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression

November 08, S8.6_3 Testing a Claim About a Standard Deviation or Variance

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

Point and Interval Estimates

Inference for two Population Means

Opgaven Onderzoeksmethoden, Onderdeel Statistiek

Lecture Notes Module 1

" Y. Notation and Equations for Regression Lecture 11/4. Notation:

Testing Hypotheses About Proportions

DDBA 8438: The t Test for Independent Samples Video Podcast Transcript

Two-sample hypothesis testing, II /16/2004

HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION

Introduction. Statistics Toolbox

Review #2. Statistics

Tests for Two Proportions

NONPARAMETRIC STATISTICS 1. depend on assumptions about the underlying distribution of the data (or on the Central Limit Theorem)

Step 6: Writing Your Hypotheses Written and Compiled by Amanda J. Rockinson-Szapkiw

Confidence intervals

Example 1. so the Binomial Distrubtion can be considered normal

Hypothesis Testing for Beginners

Tutorial 5: Hypothesis Testing

Difference of Means and ANOVA Problems

DDBA 8438: Introduction to Hypothesis Testing Video Podcast Transcript

Introduction to Analysis of Variance (ANOVA) Limitations of the t-test

Transcription:

Chapter 8 Hypothesis Testing 1 Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing 8-3 Testing a Claim About a Proportion 8-5 Testing a Claim About a Mean: s Not Known 8-6 Testing a Claim About a Standard Deviation or Variance 1 2 8-1 Overview Hypothesis Definition in statistics, is a claim or statement about a property of a population 3

8-1 Overview Definition Hypothesis Test is a standard procedure for testing a claim about a property of a population 4 Rare Event Rule for Inferential Statistics If, under a given assumption, the probability of a particular observed event is exceptionally small, we conclude that the assumption is probably not correct. 2 5 Example: ProCare ProCare Industries, Ltd., once provided a product called Gender Choice, which, according to advertising claims, allowed couples to increase your chances of having a boy up to 85%, a girl up to 80%. Suppose we conduct an experiment with 100 couples who want to have baby girls, and they all follow the Gender Choice directions in the pink package. For the purpose of testing the claim of an increased likelihood for girls, we will assume that Gender Choice has no effect. Using common sense and no formal statistical methods, what should we conclude about the assumption of no effect from Gender Choice if 100 couples using Gender Choice have 100 babies consisting of a) 52 girls?; b) 97 girls? 6

Example:ProCare Industries, Ltd.: Part a) a) We normally expect around 50 girls in 100 births. The results of 52 girls is close to 50, so we should not conclude that the Gender Choice product is effective. If the 100 couples used no special method of gender selection, the result of 52 girls could easily occur by chance. The assumption of no effect from Gender Choice appears to be correct. There isn t sufficient evidence to say that Gender Choice is effective. 7 Example:ProCare Industries, Ltd.: Part b) b) The result of 97 girls in 100 births is extremely unlikely to occur by chance. We could explain the occurrence of 97 girls in one of two ways: Either an extremelyrare rare event has occurred by chance, or Gender Choice is effective. The extremely low probability of getting 97 girls is strong evidence against the assumption that Gender Choice has no effect. It does appear to be effective. 3 8 8-2 Basics of Hypothesis Testing 9

8-2 Section Objectives Given a claim, identify the null hypothesis, and the alternative hypothesis, and express them both in symbolic form. Given a claim and sample data, calculate the value of the test statistic. Given a significance level, identify the critical value(s). Given a value of the test statistic, identify the P- value. State the conclusion of a hypothesis test in simple, non-technical terms. Identify the type I and type II errors that could be made when testing a given claim. 10 Example: Let Let s again refer to the Gender Choice product that was once distributed by ProCare Industries. ProCareIndustries claimed that couple using the pink packages of Gender Choice would have girls at a rate that is greater than 50% or 0.5. Let s again consider an experiment whereby 100 couples use Gender Choice in an attempt to have a baby girl; let s assume that the 100 babies include exactly 52 girls, and let s formalize some of the analysis. Under normal circumstances the proportion of girls is 0.5, so a claim that Gender Choice is effective can be expressed as p > 0.5. 4 Using a normal distribution as an approximation to the binomial distribution, we find P(52 or more girls in 100 births) = 0.3821. continued 11 Example: Let Let s again refer to the Gender Choice product that was once distributed by ProCare Industries. ProCareIndustries claimed that couple using the pink packages of Gender Choice would have girls at a rate that is greater than 50% or 0.5. Let s again consider an experiment whereby 100 couples use Gender Choice in an attempt to have a baby girl; let s assume that the 100 babies include exactly 52 girls, and let s formalize some of the analysis. Figure 8-1 shows that with a probability of 0.5, the outcome of 52 girls in 100 births is not unusual. continued 12

Figure 8-1 We do not reject random chance as a reasonable explanation. We conclude that the proportion of girls born to couples using Gender Choice is not significantly greater than the number that we would expect by random chance. 13 Key Points Claim: For couples using Gender Choice, the proportion of girls is p > 0.5. Working assumption: The proportion of girls is p = 0.5 (with no effect from Gender Choice). The sample resulted in 52 girls among 100 births, so the sample proportion is ˆ p = 52/100 = 0.52. 5 14 Key Points Assuming that p = 0.5,, we use a normal distribution as an approximation to the binomial distribution to find that P(at least 52 girls in 100 births) = 0.3821. There are two possible explanation for the result of 52 girls in 100 births: Either a random chance event (with probability 0.3821) has occurred, or the proportion of girls born to couples using Gender Choice is greater than 0.5. There isn t sufficient evidence to support Gender Choice s claim. 15

Components of a Formal Hypothesis Test 16 Null Hypothesis: H 0 Statement about value of population parameter that is equal to some claimed value H 0 : p = 0.5 H 0 : m = 98.6 H 0 : s = 15 Test the Null Hypothesis directly Reject H 0 or fail to reject H 0 6 17 Alternative Hypothesis: H 1 the statement that the parameter has a value that somehow differs from the null Must be true if H 0 is false,, <, > 18

Claim: Using math symbols H 0 : Must contain equality H 1 : Will contain,, <, > 19 Note about Identifying H and H 0 1 Figure 8-2 7 20 Note about Forming Your Own Claims (Hypotheses) If you are conducting a study and want to use a hypothesis test to support your claim, the claim must be worded so that it becomes the alternative hypothesis. This means your claim must be expressed using only expressed using only,, <, > 21

Test Statistic The test statistic is a value computed from the sample data, and it is used in making the decision about the rejection of the null hypothesis. /\ z = p - p pq n Test statistic for proportions 22 Test Statistic The test statistic is a value computed from the sample data, and it is used in making the decision about the rejection of the null hypothesis. 8 z = x - µ x s n Test statistic for mean 23 Test Statistic The test statistic is a value computed from the sample data, and it is used in making the decision about the rejection of the null hypothesis. t = x - µ x s n Test statistic for mean 24

Test Statistic The test statistic is a value computed from the sample data, and it is used in making the decision about the rejection of the null hypothesis. c 2 = (n 1)s2 s 2 Test statistic for standard deviation 25 Example: A survey of n = 880 randomly selected adult drivers showed that 56%(or p = 0.56) of those respondents admitted to running red ˆ lights. Find the value of the test statistic for the claim that the majority of all adult drivers admit to running red lights. (In Section 8-3 we will see that there are assumptions that must be verified. For this example, assume that the required assumptions are satisfied and focus on finding the indicated test statistic.) 9 26 Solution: The preceding example showed that the given claim results in the following null and alternative hypotheses: H 0 : p = 0.5 and H 1 : p > 0.5. Because we work under the assumption that the null hypothesis is true with p = 0.5, we get the following test statistic: /\ z = p p pq n = 0.56-0.5 (0.5)(0.5) 880 = 3.56 27

Interpretation: We know from previous chapters that a z score of 3.56 is exceptionally large. It appears that in addition to being more than half, the sample result of 56% is significantly more than 50%. We could show that the sample proportion of 0.56 (from 56%) does fall within the range of values considered to be significant because they are so far above 0.5 that they are not likely to occur by chance (assuming that the population proportion is p = 0.5). 28 10 29 Critical Region (or Rejection Region) Set of all values of the test statistic that would cause a rejection of the null hypothesis 30

Critical Region Set of all values of the test statistic that would cause a rejection of the null hypothesis Critical Region 31 Critical Region Set of all values of the test statistic that would cause a rejection of the null hypothesis Critical Region 11 32 Critical Region Set of all values of the test statistic that would cause a rejection of the null hypothesis Critical Regions 33

Significance Level denoted by a the probability that the test statistic will fall in the critical region when the null hypothesis is actually true. same a introduced in Section 7-2. common choices are 0.05, 0.01, and 0.10 34 Critical Value Any value that separates the critical region (where we reject the null hypothesis) from the values of the test statistic that do not lead to a rejection of the null hypothesis 12 35 Critical Value Any value that separates the critical region (where we reject the null hypothesis) from the values of the test statistic that do not lead to a rejection of the null hypothesis Critical Value ( z score ) 36

Critical Value Any value that separates the critical region (where we reject the null hypothesis) from the values of the test statistic that do not lead to a rejection of the null hypothesis Reject H 0 Fail to reject H 0 Critical Value ( z score ) 37 Two-tailed, Right-tailed, tailed, Left-tailed tailed Tests The tails in a distribution are the extreme regions bounded by critical values. 13 38 Two-tailed Test H : = 0 H 1 : Means less than or greater than a is divided equally between the two tails of the critical region Values that differ significantly from H 0 39

Right-tailed tailed Test H 0 : = H : > 1 Points Right Values that differ significantly from H o 40 Points Left Left-tailed tailed Test H 0 : = H : < 1 14 Values that differ significantly from H o 41 P-Value The P-value (or p-value or probability value) is the probability of getting a value of the test statistic that is at least as extremeas the one representing the sample data, assuming that the null hypothesis is true. The null hypothesis is rejected if the P-value is very small, such as 0.05 or less. 42

Example: Finding P-values Figure 8-6 43 Conclusions in Hypothesis Testing always test the null hypothesis 1. Reject the H 0 15 2. Fail to reject the H 0 44 Decision Criterion Traditional method: Reject H 0 if the test statistic falls within the critical region. Fail to reject H 0 if the test statistic does not fall within the critical region. 45

Decision Criterion P-value method: Reject H 0 if P-value a (where a is the significance level, such as 0.05). Fail to reject H 0 if P-value > a. 46 Decision Criterion Another option: Instead of using a significance level such as 0.05, simply identify the P-value and leave the decision to the reader. 16 47 Decision Criterion Confidence Intervals: Because a confidence interval estimate of a population parameter contains the likely values of that parameter, reject a claim that the population parameter has a value that is not included in the confidence interval. 48

Wording the Final Conclusion 49 Wording of Final Conclusion 17 Figure 8-7 50 Accept versus Fail to Reject Some texts use accept the null hypothesis We are not proving the null hypothesis Sample evidence is not strong enough to warrant rejection (such as not enough evidence to convict a suspect) 51

Type I Error A Type I error is the mistake of rejecting the null hypothesis when it is true. The symbol a (alpha) is used to represent the probability of a type I error. 52 Type II Error A Type II error is the mistake of failing to reject the null hypothesis when it is false. The symbol b (beta) is used to represent the probability of a type II error. 18 53 54

Controlling Type I and Type II Errors For any fixed a,, an increase in the sample size n will cause a decrease in b. For any fixed sample size n, a decrease in a will cause an increase in b.. Conversely, an increase in a will cause a decrease in b. To decrease both a and b,, increase the sample size. 55 Definition Power of a Hypothesis Test The power of a hypothesis test is the probability (1 - b ) of rejecting a false null hypothesis, which is computed by using a particular significance level a and a particular value of the population parameter that is an alternative to the value assumed true in the null hypothesis. 19 56 Comprehensive Hypothesis Test 57

Comprehensive Hypothesis Test 58 Comprehensive Hypothesis Test A confidence interval estimate of a population parameter contains the likely values of that parameter. We should therefore reject a claim that the population parameter has a value that is not included in the confidence interval. 20 59 Comprehensive Hypothesis Test Caution: In some cases, a conclusion based on a confidence interval may be different from a conclusion based on a hypothesis test. See the comments in the individual sections. 60

61 21