8-2 Basics of Hypothesis Testing. Definitions. Rare Event Rule for Inferential Statistics. Null Hypothesis

Similar documents
Chapter 8 Hypothesis Testing Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing

Section 7.1. Introduction to Hypothesis Testing. Schrodinger s cat quantum mechanics thought experiment (1935)

HYPOTHESIS TESTING: POWER OF THE TEST

Hypothesis Testing --- One Mean

Introduction to Hypothesis Testing. Hypothesis Testing. Step 1: State the Hypotheses

5/31/2013. Chapter 8 Hypothesis Testing. Hypothesis Testing. Hypothesis Testing. Outline. Objectives. Objectives

Non-Parametric Tests (I)

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...

Mind on Statistics. Chapter 12

Hypothesis Testing: Two Means, Paired Data, Two Proportions

Introduction to Hypothesis Testing

Hypothesis Testing. Reminder of Inferential Statistics. Hypothesis Testing: Introduction

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...

MAT 155. Key Concept. February 03, S4.1 2_3 Review & Preview; Basic Concepts of Probability. Review. Chapter 4 Probability

Correlational Research

In the past, the increase in the price of gasoline could be attributed to major national or global

Hypothesis testing - Steps

An Introduction to Statistics Course (ECOE 1302) Spring Semester 2011 Chapter 10- TWO-SAMPLE TESTS

Introduction to Hypothesis Testing OPRE 6301

Introduction to. Hypothesis Testing CHAPTER LEARNING OBJECTIVES. 1 Identify the four steps of hypothesis testing.

II. DISTRIBUTIONS distribution normal distribution. standard scores

Math 251, Review Questions for Test 3 Rough Answers

THE FIRST SET OF EXAMPLES USE SUMMARY DATA... EXAMPLE 7.2, PAGE 227 DESCRIBES A PROBLEM AND A HYPOTHESIS TEST IS PERFORMED IN EXAMPLE 7.

How To Test For Significance On A Data Set

Lesson 9 Hypothesis Testing

1 Hypothesis Testing. H 0 : population parameter = hypothesized value:

Chapter 8: Hypothesis Testing for One Population Mean, Variance, and Proportion

3.4 Statistical inference for 2 populations based on two samples

Experimental Design. Power and Sample Size Determination. Proportions. Proportions. Confidence Interval for p. The Binomial Test

Two Correlated Proportions (McNemar Test)

6: Introduction to Hypothesis Testing

p ˆ (sample mean and sample

DDBA 8438: Introduction to Hypothesis Testing Video Podcast Transcript

C. The null hypothesis is not rejected when the alternative hypothesis is true. A. population parameters.

Stats Review Chapters 9-10

Error Type, Power, Assumptions. Parametric Tests. Parametric vs. Nonparametric Tests

Name: (b) Find the minimum sample size you should use in order for your estimate to be within 0.03 of p when the confidence level is 95%.

Chapter 2. Hypothesis testing in one population

BA 275 Review Problems - Week 6 (10/30/06-11/3/06) CD Lessons: 53, 54, 55, 56 Textbook: pp , ,

22. HYPOTHESIS TESTING

HYPOTHESIS TESTING WITH SPSS:

WISE Power Tutorial All Exercises

Calculating P-Values. Parkland College. Isela Guerra Parkland College. Recommended Citation

Principles of Hypothesis Testing for Public Health

Sample Size and Power in Clinical Trials

Testing Hypotheses About Proportions

Online 12 - Sections 9.1 and 9.2-Doug Ensley

Chapter 7 TEST OF HYPOTHESIS

Comparing Two Groups. Standard Error of ȳ 1 ȳ 2. Setting. Two Independent Samples

Descriptive Statistics

Estimation of σ 2, the variance of ɛ

Simple Linear Regression Inference

Final Exam Practice Problem Answers

Example 1. so the Binomial Distrubtion can be considered normal

Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1)

Independent samples t-test. Dr. Tom Pierce Radford University

Hypothesis Testing. Hypothesis Testing

Tests for Two Proportions

Introduction to the Practice of Statistics Fifth Edition Moore, McCabe

Statistiek II. John Nerbonne. October 1, Dept of Information Science

Statistics 2014 Scoring Guidelines

Hypothesis testing. c 2014, Jeffrey S. Simonoff 1

"Statistical methods are objective methods by which group trends are abstracted from observations on many separate individuals." 1

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

Chi Square Tests. Chapter Introduction

Business Statistics, 9e (Groebner/Shannon/Fry) Chapter 9 Introduction to Hypothesis Testing

People have thought about, and defined, probability in different ways. important to note the consequences of the definition:

Chapter 7: One-Sample Inference

Variables Control Charts

Understand the role that hypothesis testing plays in an improvement project. Know how to perform a two sample hypothesis test.

Tests for One Proportion

BA 275 Review Problems - Week 5 (10/23/06-10/27/06) CD Lessons: 48, 49, 50, 51, 52 Textbook: pp

Lecture Notes Module 1

Chi-square test Fisher s Exact test

ELEMENTARY STATISTICS

Two-Sample T-Tests Assuming Equal Variance (Enter Means)

UNDERSTANDING THE TWO-WAY ANOVA

Two Related Samples t Test

Section 8-1 Pg. 410 Exercises 12,13

8 6 X 2 Test for a Variance or Standard Deviation

Premaster Statistics Tutorial 4 Full solutions

Two-Sample T-Tests Allowing Unequal Variance (Enter Difference)

Section 12 Part 2. Chi-square test

Statistical Impact of Slip Simulator Training at Los Alamos National Laboratory

12: Analysis of Variance. Introduction

Introduction. Hypothesis Testing. Hypothesis Testing. Significance Testing

1-3 id id no. of respondents respon 1 responsible for maintenance? 1 = no, 2 = yes, 9 = blank

Review #2. Statistics

Module 2 Probability and Statistics

November 08, S8.6_3 Testing a Claim About a Standard Deviation or Variance

CHANCE ENCOUNTERS. Making Sense of Hypothesis Tests. Howard Fincher. Learning Development Tutor. Upgrade Study Advice Service

Pearson's Correlation Tests

Testing a claim about a population mean

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression

Comparison of frequentist and Bayesian inference. Class 20, 18.05, Spring 2014 Jeremy Orloff and Jonathan Bloom

UNDERSTANDING THE INDEPENDENT-SAMPLES t TEST

Tutorial 5: Hypothesis Testing

Step 6: Writing Your Hypotheses Written and Compiled by Amanda J. Rockinson-Szapkiw

Transcription:

8-2 Basics of Hypothesis Testing Definitions This section presents individual components of a hypothesis test. We should know and understand the following: How to identify the null hypothesis and alternative hypothesis from a given claim, and how to express both in symbolic form How to calculate the value of the test statistic, given a claim and sample data How to choose the sampling distribution that is relevant How to identify the P-value or identify the critical value(s) How to state the conclusion about a claim in simple and nontechnical terms A hypothesis is a claim or statement about a property of a population. A hypothesis test is a procedure for testing a claim about a property of a population. Rare Event Rule for Inferential Statistics Null Hypothesis If, under a given assumption, the probability of a particular observed event is exceptionally small, we conclude that the assumption is probably not correct. The null hypothesis (denoted by H 0 ) is a statement that the value of a population parameter (such as proportion, mean, or standard deviation) is equal to some claimed value. We test the null hypothesis directly in the sense that we assume it is true and reach a conclusion to either reject H 0 or fail to reject H 0. Alternative Hypothesis Note about Forming Your Own Claims (Hypotheses) The alternative hypothesis (denoted by H or H A ) is the statement that the parameter has a value that somehow differs from the null hypothesis. The symbolic form of the alternative hypothesis must use one of these symbols: <, >,. If you are conducting a study and want to use a hypothesis test to support your claim, the claim must be worded so that it becomes the alternative hypothesis.

Steps, 2, 3 Identifying H 0 and H Assume that 00 babies are born to 00 couples treated with the XSORT method of gender selection that is claimed to make girls more likely. We observe 58 girls in 00 babies. Write the hypotheses to test the claim the with the XSORT method, the proportion of girls is greater than the 50% that occurs without any treatment. H H 0 : p = 0.5 : p > 0.5 Step 4 Select the Significance Level α Significance Level 22 The significance level (denoted by α) is the probability that the test statistic will fall in the critical region when the null hypothesis is actually true (making the mistake of rejecting the null hypothesis when it is true). This is the same α introduced in Section 7-2. Common choices for α are 0.05, 0.0, and 0.0. Step 5 Identify the Test Statistic and Determine its Sampling Distribution Test Statistic The test statistic is a value used in making a decision about the null hypothesis, and is found by converting the sample statistic to a score with the assumption that the null hypothesis is true.

Step 6 Find the Value of the Test Statistic, Then Find Either the P-Value or the Critical Value(s) First transform the relevant sample statistic to a standardized score called the test statistic. Then find the P-Value or the critical value(s). Let s again consider the claim that the XSORT method of gender selection increases the likelihood of having a baby girl. Preliminary results from a test of the XSORT method of gender selection involved 00 couples who gave birth to 58 girls and 42 boys. Use the given claim and the preliminary results to calculate the value of the test statistic. Use the format of the test statistic given above, so that a normal distribution is used to approximate a binomial distribution. - Continued The claim that the XSORT method of gender selection increases the likelihood of having a baby girl results in the following null and alternative hypotheses: : p= 0.5 H : p > 0.5 We work under the assumption that the null hypothesis is true with p = 0.5. The sample proportion of 58 girls in 0 births results in: 58 p ˆ = = 0.58 00 33 Convert to the Test Statistic pˆ p 0.58 0.5 z = = =.60 pq ( 0.5)( 0.5) n 00 We know from previous chapters that a z score of.60 is not unusual. At first glance, 58 girls in 00 births does not seem to support the claim that the XSORT method increases the likelihood a having a girl (more than a 50% chance). Types of Hypothesis Tests: Two-tailed, Left-tailed, Right-tailed The tails in a distribution are the extreme regions bounded by critical values. : = H : Two-tailed Test α is divided equally between the two tails of the critical region Determinations of P-values and critical values are affected by whether a critical region is in two tails, the left tail, or the right tail. It, therefore, becomes important to correctly,characterize a hypothesis test as two-tailed, left-tailed, or right-tailed.

Left-tailed Test Right-tailed Test H : = 0 H : < All α in the left tail H : = 0 H : > All α in the right tail P-Value P-Value The P-value (or probability value) is the probability of getting a value of the test statistic that is at least as extreme as the one representing the sample data, assuming that the null hypothesis is true. Critical region in the left tail: P-value = area to the left of the test statistic 44 The null hypothesis is rejected if the P-value is very small, such as 0.05 or less. Critical region in the right tail: Critical region in two tails: P-value = area to the right of the test statistic P-value = twice the area in the tail beyond the test statistic The claim that the XSORT method of gender selection increases the likelihood of having a baby girl results in the following null and alternative hypotheses: : p= 0.5 H : p > 0.5 The test statistic of z =.60 has an area of 0.0548 to its right, so a right-tailed test with test statistic z =.60 has a P-value of 0.0548 The test statistic was : pˆ p 0.58 0.5 z = = =.60 pq ( 0.5)( 0.5) n 00

Procedure for Finding P-Values Critical Region The critical region (or rejection region) is the set of all values of the test statistic that cause us to reject the null hypothesis. For example, see the red-shaded region in the previous figures. Critical Value A critical value is any value that separates the critical region (where we reject the null hypothesis) from the values of the test statistic that do not lead to rejection of the null hypothesis. The critical values depend on the nature of the null hypothesis, the sampling distribution that applies, and the significance level α. 55 For the XSORT birth hypothesis test, the critical value and critical region for an α = 0.05 test are shown below: Caution Don t confuse a P-value with a proportion p. Know this distinction: P-value = probability of getting a test statistic at least as extreme as the one representing sample data Step 7 : Make a Decision: Reject H 0 or Fail to Reject H 0 The methodologies depend on if you are using the P-Value method or the critical value method. p = population proportion

Decision Criterion Decision Criterion P-value Method: Using the significance level α: If P-value α, reject H 0. If P-value > α, fail to reject H 0. Critical Value Method: If the test statistic falls within the critical region, reject H 0. If the test statistic does not fall within the critical region, fail to reject H 0. For the XSORT baby gender test, the test had a test statistic of z =.60 and a P-Value of 0.0548. We tested: : p= 0.5 H : p > 0.5 Using the P-Value method, we would fail to reject the null at the α = 0.05 level. Using the critical value method, we would fail to reject the null because the test statistic of z =.60 does not fall in the rejection region. (You will come to the same decision using either method.) 66 Step 8 : Restate the Decision Using Simple and Nontechnical Terms State a final conclusion that addresses the original claim with wording that can be understood by those without knowledge of statistical procedures. Wording of Final Conclusion For the XSORT baby gender test, there was not sufficient evidence to support the claim that the XSORT method is effective in increasing the probability that a baby girl will be born.

Caution Accept Versus Fail to Reject Never conclude a hypothesis test with a statement of reject the null hypothesis or fail to reject the null hypothesis. Always make sense of the conclusion with a statement that uses simple nontechnical wording that addresses the original claim. Some texts use accept the null hypothesis. We are not proving the null hypothesis. Fail to reject says more correctly that the available evidence is not strong enough to warrant rejection of the null hypothesis. Type I Error Type II Error A Type I error is the mistake of rejecting the null hypothesis when it is actually true. The symbol α is used to represent the probability of a type I error. 77 A Type II error is the mistake of failing to reject the null hypothesis when it is actually false. The symbol β (beta) is used to represent the probability of a type II error. Type I and Type II Errors a) Identify a type I error. b) Identify a type II error. Assume that we are conducting a hypothesis test of the claim that a method of gender selection increases the likelihood of a baby girl, so that the probability of a baby girls is p > 0.5. Here are the null and alternative hypotheses: : p= 0.5 H : p > 0.5

- Continued a) A type I error is the mistake of rejecting a true null hypothesis: We conclude the probability of having a girl is greater than 50%, when in reality, it is not. Our data misled us. b) A type II error is the mistake of failing to reject the null hypothesis when it is false: There is no evidence to conclude the probability of having a girl is greater than 50% (our data misled us), but in reality, the probability is greater than 50%. Controlling Type I and Type II Errors For any fixed α, an increase in the sample size n will cause a decrease in β. For any fixed sample size n, a decrease in α will cause an increase in β. Conversely, an increase in α will cause a decrease in β. To decrease both α and β, increase the sample size. Part 2: Beyond the Basics of Hypothesis Testing: The Power of a Test Definition The power of a hypothesis test is the probability β of rejecting a false null hypothesis. 88 The value of the power is computed by using a particular significance level α and a particular value of the population parameter that is an alternative to the value assumed true in the null hypothesis. Power and the Design of Experiments Just as 0.05 is a common choice for a significance level, a power of at least 0.80 is a common requirement for determining that a hypothesis test is effective. (Some statisticians argue that the power should be higher, such as 0.85 or 0.90.) When designing an experiment, we might consider how much of a difference between the claimed value of a parameter and its true value is an important amount of difference. When designing an experiment, a goal of having a power value of at least 0.80 can often be used to determine the minimum required sample size.