22. HYPOTHESIS TESTING



Similar documents
Hypothesis testing - Steps

Introduction to Hypothesis Testing OPRE 6301

HYPOTHESIS TESTING: POWER OF THE TEST

Hypothesis Testing --- One Mean

Hypothesis testing. c 2014, Jeffrey S. Simonoff 1

An Introduction to Statistics Course (ECOE 1302) Spring Semester 2011 Chapter 10- TWO-SAMPLE TESTS

Practice problems for Homework 12 - confidence intervals and hypothesis testing. Open the Homework Assignment 12 and solve the problems.

Introduction to Hypothesis Testing

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...

Section 7.1. Introduction to Hypothesis Testing. Schrodinger s cat quantum mechanics thought experiment (1935)

Chapter 8 Hypothesis Testing Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing

Introduction to Hypothesis Testing. Hypothesis Testing. Step 1: State the Hypotheses

BA 275 Review Problems - Week 6 (10/30/06-11/3/06) CD Lessons: 53, 54, 55, 56 Textbook: pp , ,

Introduction to. Hypothesis Testing CHAPTER LEARNING OBJECTIVES. 1 Identify the four steps of hypothesis testing.

3.4 Statistical inference for 2 populations based on two samples

Chapter 2. Hypothesis testing in one population

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...

Mind on Statistics. Chapter 12

Chapter 7 TEST OF HYPOTHESIS

Correlational Research

Module 2 Probability and Statistics

Testing Hypotheses About Proportions

C. The null hypothesis is not rejected when the alternative hypothesis is true. A. population parameters.

In the general population of 0 to 4-year-olds, the annual incidence of asthma is 1.4%

Business Statistics, 9e (Groebner/Shannon/Fry) Chapter 9 Introduction to Hypothesis Testing

Two Related Samples t Test

Unit 26 Estimation with Confidence Intervals

HYPOTHESIS TESTING WITH SPSS:

16. THE NORMAL APPROXIMATION TO THE BINOMIAL DISTRIBUTION

THE FIRST SET OF EXAMPLES USE SUMMARY DATA... EXAMPLE 7.2, PAGE 227 DESCRIBES A PROBLEM AND A HYPOTHESIS TEST IS PERFORMED IN EXAMPLE 7.

1 Hypothesis Testing. H 0 : population parameter = hypothesized value:

Experimental Design. Power and Sample Size Determination. Proportions. Proportions. Confidence Interval for p. The Binomial Test

How To Test For Significance On A Data Set

Estimation of σ 2, the variance of ɛ

Hypothesis Testing. Reminder of Inferential Statistics. Hypothesis Testing: Introduction

Testing a claim about a population mean

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Two-Sample T-Tests Assuming Equal Variance (Enter Means)

Lesson 9 Hypothesis Testing

Two-Sample T-Tests Allowing Unequal Variance (Enter Difference)

6: Introduction to Hypothesis Testing

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

Solutions to Questions on Hypothesis Testing and Regression

Independent t- Test (Comparing Two Means)

Comparing Two Groups. Standard Error of ȳ 1 ȳ 2. Setting. Two Independent Samples

Tests for One Proportion

5/31/2013. Chapter 8 Hypothesis Testing. Hypothesis Testing. Hypothesis Testing. Outline. Objectives. Objectives

Practice Problems and Exams

Introduction. Hypothesis Testing. Hypothesis Testing. Significance Testing

1-3 id id no. of respondents respon 1 responsible for maintenance? 1 = no, 2 = yes, 9 = blank

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

BA 275 Review Problems - Week 5 (10/23/06-10/27/06) CD Lessons: 48, 49, 50, 51, 52 Textbook: pp

Introduction to Analysis of Variance (ANOVA) Limitations of the t-test

Hypothesis Testing. Hypothesis Testing

Tutorial 5: Hypothesis Testing

Chapter Study Guide. Chapter 11 Confidence Intervals and Hypothesis Testing for Means

A POPULATION MEAN, CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

Hypothesis Testing for Beginners

Psychology 60 Fall 2013 Practice Exam Actual Exam: Next Monday. Good luck!

Difference of Means and ANOVA Problems

Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means

12.5: CHI-SQUARE GOODNESS OF FIT TESTS

Math 251, Review Questions for Test 3 Rough Answers

Recall this chart that showed how most of our course would be organized:

Name: (b) Find the minimum sample size you should use in order for your estimate to be within 0.03 of p when the confidence level is 95%.

Sample Size and Power in Clinical Trials

Chapter 2 Probability Topics SPSS T tests

Normal distribution. ) 2 /2σ. 2π σ

Using Stata for One Sample Tests

Tests of Hypotheses Using Statistics

t Tests in Excel The Excel Statistical Master By Mark Harmon Copyright 2011 Mark Harmon

Opgaven Onderzoeksmethoden, Onderdeel Statistiek

STAT 350 Practice Final Exam Solution (Spring 2015)

Name: Date: Use the following to answer questions 3-4:

Statistics in Medicine Research Lecture Series CSMC Fall 2014

UNDERSTANDING THE DEPENDENT-SAMPLES t TEST

Non-Parametric Tests (I)

In the past, the increase in the price of gasoline could be attributed to major national or global

Pearson's Correlation Tests

SAMPLE SIZE CONSIDERATIONS

Simple Regression Theory II 2010 Samuel L. Baker

Two-sample hypothesis testing, II /16/2004

Statistiek I. Proportions aka Sign Tests. John Nerbonne. CLCG, Rijksuniversiteit Groningen.

CONTENTS OF DAY 2. II. Why Random Sampling is Important 9 A myth, an urban legend, and the real reason NOTES FOR SUMMER STATISTICS INSTITUTE COURSE

Review #2. Statistics

How Far is too Far? Statistical Outlier Detection

Point Biserial Correlation Tests

Comparing Means in Two Populations

NONPARAMETRIC STATISTICS 1. depend on assumptions about the underlying distribution of the data (or on the Central Limit Theorem)

6.2 Permutations continued

Confidence Intervals for the Difference Between Two Means

The Wilcoxon Rank-Sum Test

socscimajor yes no TOTAL female male TOTAL

Chapter 7 - Practice Problems 1

Hypothesis Testing: Two Means, Paired Data, Two Proportions

p ˆ (sample mean and sample

SCHOOL OF HEALTH AND HUMAN SCIENCES DON T FORGET TO RECODE YOUR MISSING VALUES

3. Mathematical Induction

Transcription:

22. HYPOTHESIS TESTING Often, we need to make decisions based on incomplete information. Do the data support some belief ( hypothesis ) about the value of a population parameter? Is OJ Simpson guilty? (DNA Fingerprinting). Does Viagra work? Does a market analyst have any skill at forecasting the price of Netscape? Is the climate changing? Does a high fiber diet help prevent colon cancer? Are students in this class able to tell the difference between Diet Coke and Diet Pepsi? Do 4 oz bags of M&Ms actually contain 4 ounces of product, on average? Do McDonalds Quarter Pounders actually weigh less than 1/4 pound, before cooking, on the average? Hypothesis testing provides answers to questions like these. It must be emphasized, however, that our conclusions may be wrong, because of sampling variability. Here, we focus attention on methods of making decisions about a population mean, µ.

Eg: Based on a sample of 50 Quarter Pounders, we want to decide if the mean weight µ of all Quarter Pounders is actually less than 0.25 pounds. Statistical hypothesis tests are set up like criminal court cases: the defendant is presumed innocent until proven guilty beyond a reasonable doubt. Thus, we will start with the assumption that µ is in fact 0.25. This is called the null hypothesis, and is written as H 0 : µ = 0.25. This represents the status quo, since McDonald s has been marketing these hamburgers as quarter pounders for years. In most applications, it is hoped that the data will provide enough evidence to allow us to reject the null hypothesis in favor of the alternative hypothesis, which is in this case H A : µ < 0.25. The alternative hypothesis is sometimes called the research hypothesis, since it is often the hypothesis that the researcher wishes to prove. In the above example, a demonstration that µ < 0.25 would be newsworthy. You (the researcher) might be able to win a Pulitzer prize for your investigative reporting. The evidence is provided by a single number called a test statistic, obtained from the observed data. Based on the value of the test statistic, we will decide either to reject, or not to reject, the null hypothesis. Specifically, we reject H 0 whenever the test statistic lies in a pre-specified range of values called the rejection region, or critical region.

As in the courtroom analogy, there are two different kinds of incorrect decisions, or errors. Type I Error: Rejecting H 0 when H 0 is true. Type II Error: Not rejecting H 0 when H 0 is false. Generally, we try to formulate the hypotheses so that a Type I error is considered more serious than a Type II error. Using statistical theory, we can control the probability of making a Type I error. This probability is denoted by α and called the significance level of the test. We are free to choose the value of α. Clearly it is desirable for α to be small, but there is a tradeoff here: the smaller α is, the larger the probability of a Type II error will be. It is customary to use either α = 0.05 or α = 0.01. Examples: What are H 0, H A, Type I errors and Type II errors in the following situations: Murder Trials Smoke Alarms Pregnancy Tests Boy Who Cried Wolf.

Format For Hypothesis Tests (1) Before examining the data, formulate H 0, H A, and select α. (2) Examine the data, evaluate the test statistic, and draw your conclusion. To ensure the validity of the test results, H 0 and H A must be formulated before the data are examined. The null hypothesis always takes the form H 0 : µ = µ 0, so that µ 0 is the value of the population mean if H 0 is true. Depending on what we are trying to prove, we take H A as either: H A : µ µ 0 (A two-sided alternative) H A : µ < µ 0 (A one-sided alternative) H A : µ > µ 0 (A one-sided alternative). Suppose for now that σ is known. Suppose also that either the population is normally distributed or that the sample size is large enough for the Central Limit Theorem to hold. x µ 0 Then the test statistic is z = σ n. The z statistic provides a standardized measure of the difference between the hypothesized value of µ and the actual value of the sample mean x. If H 0 is true, then Z is (approximately) standard normal. (Why?) In hypothesis testing, we simply check whether z is too large or too small to have plausibly come from a standard normal distribution. If so, then we reject H 0.

How To Perform the Test If z lies in the rejection region (given below), reject H 0. Otherwise, do not reject H 0. The rejection region for a level α test depends on the nature of the alternative hypothesis. If H A is two-sided, we get a two-tailed test. If H A is one-sided, we get a one-tailed test. (Note that there are two kinds of one-tailed tests). H A Rejection Region µ µ 0 z > z α/2 µ < µ 0 z < z α µ > µ 0 z > z α For example, to test H 0 : µ = µ 0 against the two-sided alternative H A : µ µ 0, we reject H 0 if either z < z α/2 or z > z α/2. The dividing lines, in this case z α/2 and z α/2, are called critical values. This does give a level α test, since Prob(Type I Error) = Prob(Reject H 0 H 0 is True) = Prob ( Z > z α/2 H 0 is true) = Prob ( Std Normal RV > z α/2 ) = α. Explain why the above 1-tailed test criteria are also correct.

There are four critical values worth memorizing: z 0.05 = 1.645, z 0.025 = 1.96, z 0.01 = 2.326, z 0.005 = 2.576. These correspond to α = 5% (one-tailed and two-tailed), and to α = 1% (one-tailed and two-tailed). Eg 1: For the Quarter Pounders example, test H 0 : µ = 0.25 against H A : µ < 0.25 at the 5% level of significance, assuming that n = 50, σ = 0.035 and x = 024.. Eg 2: The Domino s Pizza closest to NYU advertises that their average delivery time to NYU is at most 20 minutes. A sample of 32 delivery times had an average of 24 minutes. The true standard deviation of delivery times is 10 minutes. Is there sufficient evidence to reject Domino s Pizza s claim at the 1% level of significance? Eg 3: In the Pepsi example, for 100 2-liter bottles, the average amount of Pepsi filled by the machine was x = 1985. liters. The population standard deviation is known to be 0.05 liters. If the expected amount (per bottle) of Pepsi filled by the machine, denoted by µ, is not equal to 2, then the machine is out of control, and must be shut down for repairs. Test whether the machine is out of control, at the 5% level of significance.

The Confidence Interval Approach To Hypothesis Testing For Two-Tailed Tests The 95% confidence interval for the mean in the situation of Example 3 was (1.975, 1.995), which does not contain the null hypothesis value of 2 liters. This would seem to indicate that a mean value of 2 liters is inconsistent with the available information, and indeed we rejected the null hypothesis at level 0.05 in Eg 3. In general, it can be shown that the confidence interval consists of the values of µ 0 which would not be rejected in a 2-tailed hypothesis test of H 0 : µ = µ 0 versus H A : µ µ 0. Thus, if the hypothesized mean falls in the confidence interval, then we cannot reject H 0 in favor of H A. On the other hand, if the hypothesized mean does not fall in the confidence interval, then we can reject H 0 in favor of H A. These properties allow us, if we wish, to perform a 2-tailed test at level α by simply examining the (1 α)100% confidence interval and seeing if the null value falls in the interval.