Intro. to Hypothesis Tests

Similar documents
p ˆ (sample mean and sample

Chapter 8 Hypothesis Testing Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing

Introduction to Hypothesis Testing. Hypothesis Testing. Step 1: State the Hypotheses

MONT 107N Understanding Randomness Solutions For Final Examination May 11, 2010

Business Statistics, 9e (Groebner/Shannon/Fry) Chapter 9 Introduction to Hypothesis Testing

Hypothesis testing - Steps

Sample Size and Power in Clinical Trials

C. The null hypothesis is not rejected when the alternative hypothesis is true. A. population parameters.

Two-sample inference: Continuous data

Chapter 26: Tests of Significance

Week 3&4: Z tables and the Sampling Distribution of X

Introduction to Hypothesis Testing

People have thought about, and defined, probability in different ways. important to note the consequences of the definition:

Comparison of frequentist and Bayesian inference. Class 20, 18.05, Spring 2014 Jeremy Orloff and Jonathan Bloom

22. HYPOTHESIS TESTING

6. Let X be a binomial random variable with distribution B(10, 0.6). What is the probability that X equals 8? A) (0.6) (0.4) B) 8! C) 45(0.6) (0.

Independent samples t-test. Dr. Tom Pierce Radford University

Introduction. Hypothesis Testing. Hypothesis Testing. Significance Testing

University of Chicago Graduate School of Business. Business 41000: Business Statistics

Name: Date: Use the following to answer questions 3-4:

Introduction to Hypothesis Testing OPRE 6301

Fairfield Public Schools

Statistics 2014 Scoring Guidelines

Online 12 - Sections 9.1 and 9.2-Doug Ensley

Stat 411/511 THE RANDOMIZATION TEST. Charlotte Wickham. stat511.cwick.co.nz. Oct

5.1 Identifying the Target Parameter

Hypothesis Testing for Beginners

Recall this chart that showed how most of our course would be organized:

AP: LAB 8: THE CHI-SQUARE TEST. Probability, Random Chance, and Genetics

Comparing Two Groups. Standard Error of ȳ 1 ȳ 2. Setting. Two Independent Samples

CHAPTER 14 NONPARAMETRIC TESTS

LAB : THE CHI-SQUARE TEST. Probability, Random Chance, and Genetics

Mind on Statistics. Chapter 12

Chapter 2. Hypothesis testing in one population

Hypothesis testing. c 2014, Jeffrey S. Simonoff 1

Playing with Numbers

AMS 5 CHANCE VARIABILITY

Understanding Confidence Intervals and Hypothesis Testing Using Excel Data Table Simulation

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Chapter 4. Probability and Probability Distributions

Hypothesis Testing. Steps for a hypothesis test:

In the general population of 0 to 4-year-olds, the annual incidence of asthma is 1.4%

1 Hypothesis Testing. H 0 : population parameter = hypothesized value:

Experimental Design. Power and Sample Size Determination. Proportions. Proportions. Confidence Interval for p. The Binomial Test

HYPOTHESIS TESTING: POWER OF THE TEST

Unit 26 Estimation with Confidence Intervals

Lecture Notes Module 1

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...

Correlational Research

6: Introduction to Hypothesis Testing

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...

Chapter 6: Probability

Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

Question: What is the probability that a five-card poker hand contains a flush, that is, five cards of the same suit?

Introduction to. Hypothesis Testing CHAPTER LEARNING OBJECTIVES. 1 Identify the four steps of hypothesis testing.

Normal distribution. ) 2 /2σ. 2π σ

CONTENTS OF DAY 2. II. Why Random Sampling is Important 9 A myth, an urban legend, and the real reason NOTES FOR SUMMER STATISTICS INSTITUTE COURSE

Having a coin come up heads or tails is a variable on a nominal scale. Heads is a different category from tails.

Good luck! BUSINESS STATISTICS FINAL EXAM INSTRUCTIONS. Name:

Odds ratio, Odds ratio test for independence, chi-squared statistic.

Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1)

QUANTITATIVE METHODS BIOLOGY FINAL HONOUR SCHOOL NON-PARAMETRIC TESTS

Inclusion and Exclusion Criteria

How To Test For Significance On A Data Set

Lesson 9 Hypothesis Testing

HYPOTHESIS TESTING WITH SPSS:

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

Math 251, Review Questions for Test 3 Rough Answers

MATH 140 Lab 4: Probability and the Standard Normal Distribution

Elementary Statistics and Inference. Elementary Statistics and Inference. 17 Expected Value and Standard Error. 22S:025 or 7P:025.

SCHOOL OF HEALTH AND HUMAN SCIENCES DON T FORGET TO RECODE YOUR MISSING VALUES

8 6 X 2 Test for a Variance or Standard Deviation

Study Guide for the Final Exam

CHAPTER 6: Continuous Uniform Distribution: 6.1. Definition: The density function of the continuous random variable X on the interval [A, B] is.

2 GENETIC DATA ANALYSIS

2 Sample t-test (unequal sample sizes and unequal variances)

Understand the role that hypothesis testing plays in an improvement project. Know how to perform a two sample hypothesis test.

University of Chicago Graduate School of Business. Business 41000: Business Statistics Solution Key

REPEATED TRIALS. The probability of winning those k chosen times and losing the other times is then p k q n k.

Density Curve. A density curve is the graph of a continuous probability distribution. It must satisfy the following properties:

Week 4: Standard Error and Confidence Intervals

8. THE NORMAL DISTRIBUTION

Chapter 7 Section 7.1: Inference for the Mean of a Population

Section 7.1. Introduction to Hypothesis Testing. Schrodinger s cat quantum mechanics thought experiment (1935)

4. Continuous Random Variables, the Pareto and Normal Distributions

A) B) C) D)

" Y. Notation and Equations for Regression Lecture 11/4. Notation:

Review #2. Statistics

Planning sample size for randomized evaluations

BA 275 Review Problems - Week 6 (10/30/06-11/3/06) CD Lessons: 53, 54, 55, 56 Textbook: pp , ,

Unit 27: Comparing Two Means

STATISTICS 8, FINAL EXAM. Last six digits of Student ID#: Circle your Discussion Section:

Simple Regression Theory II 2010 Samuel L. Baker

Module 2 Probability and Statistics

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression

Chapter 7 Review. Confidence Intervals. MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Point and Interval Estimates

Transcription:

Intro. to Hypothesis Tests Two of the most common types of statistical inference: 1. Confidence intervals Goal is to estimate (and communicate uncertainty in our estimate of) a population parameter. 2. Tests of Significance Goal is to assess the evidence provided by the data about some claim concerning the population. 1 Basic Idea of Tests of Significance Example: Each day Tom and Heather decide who pays for lunch based on a toss of Tom s favorite quarter. Heads - Tom pays Tails - Heather pays Tom claims that heads and tails are equally likely outcomes for this quarter. Heather thinks she pays more often. 2 1

Heather steals the quarter, tosses it 10 times, and gets 7 tails (70% tails). She is furious and claims that the coin is not fair. There are two possibilities: 1. Tom is telling truth the chance of tails is 50% and the observation of 7 tails out of 10 tosses was only due to sampling variability. 2. Tom is lying the chance of tails is greater than 50%. 3 Suppose they call you to decide between the two possibilities. To be fair to both of them, you toss the quarter 25 times. Suppose you get 21 tails. What would you conclude? Why? => The coin is probably not fair. Even with sampling variability it is unlikely that a fair coin would result give such a high percentage of tails. (The actual probability of getting 21 or more tails in 25 tosses is 0.000455 if the coin is fair.) 4 2

Moral of the story: an outcome that would rarely happen if a claim were true is good evidence that the claim is in fact not true. This is the idea behind Hypothesis Testing. 5 A hypothesis is a statement about the parameters in a population; we will be making statements about in Section 6.2. A hypothesis test (or significance test) is a formal procedure for comparing observed data with a hypothesis whose truth we want to assess. The results of a test are expressed in terms of a probability that measures how well the data and hypothesis agree. 6 3

Performing a Hypothesis Test 1. State Hypotheses State your research question as two hypotheses - the null and the alternative hypotheses. These hypotheses are written in terms of the population parameters. The null hypothesis (H 0 ) is the statement being tested. This is assumed true and compared to the data to see if there is evidence against it. A null hypothesis that we will see often is that the mean µ is equal to some standard value. Usually, null hypotheses give a statement of no difference or no effect. 7 Suppose we want to test the null hypothesis that µ is some specified value, say µ 0. Then H 0 : = µ 0 Note: We will always express H 0 using an equality sign. 8 4

The alternative hypothesis (H a ) is the statement about the population parameter that we hope or suspect is true. We are interested in seeing if the data support this hypothesis. H a can be one-sided: H a : >µ 0 or H a : < µ 0 H a can be two-sided: H a : µ 0 9 Example: Strawberry Bars Kellogg s says that its strawberry bars weigh, on average, 16 oz. 10TV s consumer reporter is suspicious that the bars weigh less than what is claimed. In order to check his suspicion, he weighs the contents of 20 randomly chosen bars. These 20 bars have an average weight of 15.6 oz. Assume that the weights follow a normal distribution with a standard deviation of 0.7 oz. Is there evidence that the reporter s suspicion is correct? 10 5

The hypotheses are: H 0 : = 16 H a : < 16 Is this a one-sided test or a two-sided test? This is a one sided test. The reporter thought the bars were smaller than 16 oz. 11 2) Calculate P-value We ask: Does the sample give evidence against the null hypothesis? P-value: The probability that the sample mean would take a value as extreme or more extreme than the one we actually observed assuming H 0 is true. 12 6

In the strawberry bar example, this means: Is the sample average much less than we would expect to see if really is 16? We need to find the probability that we get a sample of 20 bars whose mean is 15.6 or less given that = 16. Question: Why 15.6 or less? This is more extreme evidence against our null hypothesis and in support of the alterative hypothesis. 13 What is the distribution of x, the average weight of the bars sampled if the null hypothesis is true? N (16, 0.7/20) What area under the Normal (16, 0.7/20) curve corresponds to the p-value? The area to the left of 15.6. 14 7

Calculate the test statistic (z-score) 15.6 16 z = 0.7 20 = 2.56 Calculate the P-value: Using Table A, the area to the left of -2.56 is 0.0052. => The probability of getting more extreme evidence against H 0 : = 16, given that = 16, is 0.0052. 15 More on p-values: P-values correspond to tail areas under the density curve. For the onesided test in the strawberry bar example, the p-value was the area to the left of the test statistic. What area corresponds to the p-value of a two-sided test? In the case of two-sided tests, we could observe something as extreme or more extreme in either direction. The p-value includes two tail areas. 16 8

P-values in terms of the test statistic: H a P - v a lu e A r e a u n d e r c u r v e µ < µ 0 P ( Z z ) µ > µ 0 P ( Z z ) µ µ 0 2 P ( Z z ) Left of z Right of z Tails where z is the observed value of the test statistic and the probabilities are found using the standard normal distribution given in Table A. 17 A P-value is exact if the population distribution is normal. If the population is not normal, the P-value approximates the true probability for large n because of the Central Limit Theorem. 18 9

3) State Your Conclusions The final step is to decide if there is a strong amount of evidence to reject H 0 in favor of H a. This is accomplished using the P-value. In our example, we got a P-value =0.0052. What does this tell us? If H 0 is true (i.e., true mean weight is 16 oz), then the chance of getting a sample whose mean weight is 15.6 oz or less is 0.52% 19 Does it give evidence against H 0? Yes, it is very unlikely that we would observe a sample mean as low as we did if H 0 is true. Conclusion: We reject the null hypothesis. 20 10

A small P-value is strong evidence against H 0. Such a P-value says that if H 0 is true, then the observed data are unlikely to occur just by chance. The smaller the P-value, the stronger the evidence against the null. 21 Question: How small does the P-value need to be? Prior to testing, it is determined how small the P-value must be to be considered decisive evidence against H 0. This value is called the significance level and is usually represented as α. Typical α levels used are 0.1, 0.05 and 0.01. If the P-value α, reject the null hypothesis. If the P-value > α, do not reject the null hypothesis. 22 11

If we use an α level of 0.01 and the p-value is smaller than α, we can say that there is a less than one in one-hundred chance that we would observe data as extreme or more extreme than what we saw if the null hypothesis is true. If the P-value α we say the data are statistically significant at level α. Note: When we do not reject H 0, we are not claiming H 0 is true. We are just concluding there is not sufficient evidence to reject it. 23 Example: Ameritech Suppose last year Ameritech s repair service took an average of 3.1 days to fix customer complaints. One of the managers is assigned to check if this year s data show a different average time to fix problems. He collects a random sample of 30 customer complaints and finds that the average time taken to fix them is 2.1 days. Assume that the standard deviation of the time taken to fix the complaints is 2.5 days. Is this good evidence that the average time taken to fix the complaints is not 3.1 days? 24 12

State the hypotheses: H 0 :=3.1 H a : 3.1 Calculate the test statistic: z = x µ σ n 2.1 3.1 = 2.5 30 0 = 2.19 25 Calculate P-value: Since this is a two-sided test, we find the area to the left of -2.19 and then double it to get the p-value. P-value = 2 x 0.0143 = 0.0286 What is your conclusion at the 0.05 level? Since the p-value is less than 0.05, the test is significant at the 0.05 level. We reject the null. What is your conclusion at the 0.01 level? The p-value is larger than 0.01, so the test is not significant at this level. We would not reject the null at this level. 26 13

Example: EPA The EPA limit on concentration of PCB in drinking water is 5 ppm. Wells are regularly tested to make sure they are not over the limit. A random sample of 100 water specimens from a well was collected and has an average PCB 5.1 ppm. Is there evidence at 5% level that the well is over the limit? Assume that the PCB concentration varies with standard deviation 0.8. 27 State the hypotheses: H 0 : = 5 H a : > 5 Calculate the test statistic: z = x µ σ n 5.1 5 0.8 100 0 = = 1.25 28 14

Calculate the P-value: The P-value is the area under the normal curve to the right of 1.25 P-value = 0.1056 Since the p-value is larger than 0.05, we do not have evidence at the 5% level that the PCB level exceeds the limit. We do not reject the null hypothesis. 29 Tests from Confidence Intervals We have covered two types of statistical inference procedures for the population mean µ: Confidence Interval (CI) and Tests of Significance Question: Is there any relationship between hypothesis tests and confidence intervals? Answer: Yes, a level α two-sided test rejects a hypothesis H 0 :µ = µ 0 exactly when the value µ 0 falls outside a level (1-α) CI for µ. 30 15

Example: Concrete Block Bud s Home Center sells concrete blocks. Bud wants to estimate the average weight of all blocks in stock. A sample of 64 blocks has a mean weight of 65.5 lbs. Assume that the weights of blocks vary with standard deviation 4.6 lbs. 31 Construct a 95% CI for the mean weight of all blocks. σ 4.6 x ± z * = 65.5 ± (1.96 ) n 64 => (64.373, 66.627) is a 95% CI for µ. 32 16

Bud is interested in knowing if the mean weight of all blocks is 68 lbs or not. State the appropriate hypotheses. Using the above CI, what do you conclude? H 0 :=68 H a :68 Our confidence interval was 64.373 to 66.627. Since 68 does not fall in this interval, we reject H 0. The significance level of the above test is 5% (0.05), because we used a 95% confidence interval. 33 Example: Jupiter The diameter of Jupiter is measured 100 times independently using a new unbiased process. Using these 100 measurements, a 99% CI for the true diameter is computed to be (88,707 miles to 88,733 miles). Is there evidence at 1% level that the true diameter of Jupiter is not 88,720 miles? 34 17

What would the hypotheses be? H 0 : = 88,720 H a :88,720 What do you conclude? Since 88,720 falls in the 99% confidence interval (88,707 miles, 88,733 miles), we do not reject the null hypothesis when testing at the 1% level. 35 More on stating your conclusions: When working with hypothesis tests there are many ways to state your conclusion. The following four statements convey the same conclusion 1. The test is significant. 2. Reject the null hypothesis. 3. The data show strong evidence against H 0. 4. The p-value is smaller than. 36 18

These four statements also convey the same conclusion: 1. The test is not significant. 2. Do not reject the null hypotheses. 3. The data do not show evidence against H 0. 4. The p-value is larger than. Usually 1,2, or 3 are given as the conclusion and 4 is given as the reason for or explanation of the conclusion. 37 19