Statistical Tests (Hypothesis Testing)

Similar documents
5/31/2013. Chapter 8 Hypothesis Testing. Hypothesis Testing. Hypothesis Testing. Outline. Objectives. Objectives

Hypothesis Testing --- One Mean

Section 7.1. Introduction to Hypothesis Testing. Schrodinger s cat quantum mechanics thought experiment (1935)

Chapter 8 Hypothesis Testing Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing

Introduction to Hypothesis Testing OPRE 6301

HYPOTHESIS TESTING: POWER OF THE TEST

Section 8-1 Pg. 410 Exercises 12,13

A POPULATION MEAN, CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

C. The null hypothesis is not rejected when the alternative hypothesis is true. A. population parameters.

WISE Power Tutorial All Exercises

Hypothesis testing - Steps

Chapter 8: Hypothesis Testing for One Population Mean, Variance, and Proportion

Name: (b) Find the minimum sample size you should use in order for your estimate to be within 0.03 of p when the confidence level is 95%.

Introduction to Hypothesis Testing

Name: Date: Use the following to answer questions 3-4:

Online 12 - Sections 9.1 and 9.2-Doug Ensley

In the past, the increase in the price of gasoline could be attributed to major national or global

Introduction to. Hypothesis Testing CHAPTER LEARNING OBJECTIVES. 1 Identify the four steps of hypothesis testing.

6: Introduction to Hypothesis Testing

Regression Analysis: A Complete Example

Outline. Definitions Descriptive vs. Inferential Statistics The t-test - One-sample t-test

General Method: Difference of Means. 3. Calculate df: either Welch-Satterthwaite formula or simpler df = min(n 1, n 2 ) 1.

DDBA 8438: Introduction to Hypothesis Testing Video Podcast Transcript

p ˆ (sample mean and sample

t Tests in Excel The Excel Statistical Master By Mark Harmon Copyright 2011 Mark Harmon

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...

BA 275 Review Problems - Week 6 (10/30/06-11/3/06) CD Lessons: 53, 54, 55, 56 Textbook: pp , ,

Descriptive Statistics

An Introduction to Statistics Course (ECOE 1302) Spring Semester 2011 Chapter 10- TWO-SAMPLE TESTS

Introduction to Hypothesis Testing. Hypothesis Testing. Step 1: State the Hypotheses

SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question.

FAT-FREE OR REGULAR PRINGLES: CAN TASTERS TELL THE DIFFERENCE?

9 Testing the Difference

STATISTICS 8, FINAL EXAM. Last six digits of Student ID#: Circle your Discussion Section:

Independent samples t-test. Dr. Tom Pierce Radford University

Stats Review Chapters 9-10

"Statistical methods are objective methods by which group trends are abstracted from observations on many separate individuals." 1

Bivariate Statistics Session 2: Measuring Associations Chi-Square Test

8 6 X 2 Test for a Variance or Standard Deviation

CONTENTS OF DAY 2. II. Why Random Sampling is Important 9 A myth, an urban legend, and the real reason NOTES FOR SUMMER STATISTICS INSTITUTE COURSE

Psychology 60 Fall 2013 Practice Exam Actual Exam: Next Monday. Good luck!

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...

3. Mathematical Induction

Lecture Notes Module 1

Part 3. Comparing Groups. Chapter 7 Comparing Paired Groups 189. Chapter 8 Comparing Two Independent Groups 217

Having a coin come up heads or tails is a variable on a nominal scale. Heads is a different category from tails.

Test Positive True Positive False Positive. Test Negative False Negative True Negative. Figure 5-1: 2 x 2 Contingency Table

MONT 107N Understanding Randomness Solutions For Final Examination May 11, 2010

Fat Content in Ground Meat: A statistical analysis

Unit 26 Estimation with Confidence Intervals

3.4 Statistical inference for 2 populations based on two samples

Two-sample hypothesis testing, II /16/2004

research/scientific includes the following: statistical hypotheses: you have a null and alternative you accept one and reject the other

STAT 360 Probability and Statistics. Fall 2012

Statistics 100 Sample Final Questions (Note: These are mostly multiple choice, for extra practice. Your Final Exam will NOT have any multiple choice!

Hypothesis testing. c 2014, Jeffrey S. Simonoff 1

Introduction. Statistics Toolbox

Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1)

University of Chicago Graduate School of Business. Business 41000: Business Statistics Solution Key

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression

Chi-square test Fisher s Exact test

Chapter 7 TEST OF HYPOTHESIS

SCHOOL OF HEALTH AND HUMAN SCIENCES DON T FORGET TO RECODE YOUR MISSING VALUES

IB Practice Chi Squared Test of Independence

Math 108 Exam 3 Solutions Spring 00

November 08, S8.6_3 Testing a Claim About a Standard Deviation or Variance

Formula for linear models. Prediction, extrapolation, significance test against zero slope.

Sample Size and Power in Clinical Trials

Using Microsoft Excel to Analyze Data

Chapter 2. Hypothesis testing in one population

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

Fairfield Public Schools

STA-201-TE. 5. Measures of relationship: correlation (5%) Correlation coefficient; Pearson r; correlation and causation; proportion of common variance

Study Guide for the Final Exam

Chapter 7: One-Sample Inference

LAB : THE CHI-SQUARE TEST. Probability, Random Chance, and Genetics

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Understand the role that hypothesis testing plays in an improvement project. Know how to perform a two sample hypothesis test.

Recall this chart that showed how most of our course would be organized:

University of Chicago Graduate School of Business. Business 41000: Business Statistics

1 Hypothesis Testing. H 0 : population parameter = hypothesized value:

Hypothesis Testing. Steps for a hypothesis test:

SIMON FRASER UNIVERSITY

Mind on Statistics. Chapter 12

individualdifferences

Chapter 4. Hypothesis Tests

A Short Guide to Significant Figures

12.5: CHI-SQUARE GOODNESS OF FIT TESTS

Testing Hypotheses About Proportions

CHAPTER IV FINDINGS AND CONCURRENT DISCUSSIONS

Testing a claim about a population mean

UNDERSTANDING THE TWO-WAY ANOVA

Tutorial 5: Hypothesis Testing

Section 13, Part 1 ANOVA. Analysis Of Variance

Topic #6: Hypothesis. Usage

Chapter 26: Tests of Significance

Experimental Design. Power and Sample Size Determination. Proportions. Proportions. Confidence Interval for p. The Binomial Test

Basic Concepts in Research and Data Analysis

Two Related Samples t Test

Chapter 9. Two-Sample Tests. Effect Sizes and Power Paired t Test Calculation

Transcription:

Outlines December 2, 2004

Outlines Introduction Introduction Examples Homework Much of this material is from Allan Bluman s Elementary Statistics: A Brief Version, Second Edition.

Part I

Example A medical researcher is interested in finding out whether a new medication will have any undesirable side effects, particularly with respect to heart rate. Will a patient s pulse rate, increase, decrease, or remain unchanged after taking this medication?

Example A medical researcher is interested in finding out whether a new medication will have any undesirable side effects, particularly with respect to heart rate. Will a patient s pulse rate, increase, decrease, or remain unchanged after taking this medication? After conducting controlled tests on a sample of patients, the researcher finds that the mean pulse rate of the group taking the new medicine is higher than average. But is this increase significant, or simply due to random chance?

Example A medical researcher is interested in finding out whether a new medication will have any undesirable side effects, particularly with respect to heart rate. Will a patient s pulse rate, increase, decrease, or remain unchanged after taking this medication? After conducting controlled tests on a sample of patients, the researcher finds that the mean pulse rate of the group taking the new medicine is higher than average. But is this increase significant, or simply due to random chance? Statistical tests (or hypothesis testing) can be used to answer that question and give the conclusion some certainty.

Define a population under study.

Define a population under study. State the particular hypotheses that will be investigated, and give the significance level.

Define a population under study. State the particular hypotheses that will be investigated, and give the significance level. Select a sample from the population, and collect data.

Define a population under study. State the particular hypotheses that will be investigated, and give the significance level. Select a sample from the population, and collect data. Perform calculations for the statistical test, and reach a conclusion.

Define a population under study Recall that a population is the collection of all possible objects that have a common measurable or otherwise observable feature or characteristic.

Define a population under study Recall that a population is the collection of all possible objects that have a common measurable or otherwise observable feature or characteristic. For the purposes of hypothesis testing, we re often concerned with knowing the population s mean and standard deviation.

Define a population under study Recall that a population is the collection of all possible objects that have a common measurable or otherwise observable feature or characteristic. For the purposes of hypothesis testing, we re often concerned with knowing the population s mean and standard deviation. In the case of the medical researcher, the population is all adult men, and the mean pulse rate was 70 beats per minute (bpm), with a standard deviation of 8 bpm.

State the hypotheses to be investigated A statistical hypothesis is a conjecture about a population parameter. The conjecture may or may not be true.

State the hypotheses to be investigated A statistical hypothesis is a conjecture about a population parameter. The conjecture may or may not be true. There are two types of statistical hypotheses in each test: the null hypothesis and the alternative hypothesis.

State the hypotheses to be investigated A statistical hypothesis is a conjecture about a population parameter. The conjecture may or may not be true. There are two types of statistical hypotheses in each test: the null hypothesis and the alternative hypothesis. The null hypothesis (H 0 ), is a hypothesis that states there is no difference between a parameter and a specific value, or that there is no difference between two parameters.

State the hypotheses to be investigated A statistical hypothesis is a conjecture about a population parameter. The conjecture may or may not be true. There are two types of statistical hypotheses in each test: the null hypothesis and the alternative hypothesis. The null hypothesis (H 0 ), is a hypothesis that states there is no difference between a parameter and a specific value, or that there is no difference between two parameters. The alternative hypothesis (H 1 ) is a hypothesis that states the existance of difference between a parameter and a specific value, or states that there is a difference between two parameters.

Hypothesis examples H 0 : There is no difference in pulse between these patients and the nationwide population (µ = 70). H 1 : The pulse of patients taking this medication is substantially different than the nationwide population (µ 70).

Hypothesis examples H 0 : There is no difference in pulse between these patients and the nationwide population (µ = 70). H 1 : The pulse of patients taking this medication is substantially different than the nationwide population (µ 70). H 0 : This new additive does nothing to increase the life of automobile batteries (µ 36). H 1 : This additive significantly increases battery life (µ > 36).

Hypothesis examples H 0 : There is no difference in pulse between these patients and the nationwide population (µ = 70). H 1 : The pulse of patients taking this medication is substantially different than the nationwide population (µ 70). H 0 : This new additive does nothing to increase the life of automobile batteries (µ 36). H 1 : This additive significantly increases battery life (µ > 36). H 0 : A new type of insulation does not help decrease monthly heating costs (µ 78). H 1 : This new insulation reduces monthly heating costs (µ < 78).

Types of Statistical Tests and Hypotheses The null and alternative hypotheses are always stated together.

Types of Statistical Tests and Hypotheses The null and alternative hypotheses are always stated together. Written mathematically, the null hypothesis always contains an equal sign.

Types of Statistical Tests and Hypotheses The null and alternative hypotheses are always stated together. Written mathematically, the null hypothesis always contains an equal sign. The mathematical form of the null and alternative hypotheses for different types of tests are as follows:

Types of Statistical Tests and Hypotheses The null and alternative hypotheses are always stated together. Written mathematically, the null hypothesis always contains an equal sign. The mathematical form of the null and alternative hypotheses for different types of tests are as follows: Two-tailed test Right-tailed test Left-tailed test H 0 : µ = k H 0 : µ k H 0 : µ k H 1 : µ k H 1 : µ > k H 1 : µ < k

Designing the study, defining the significance level The simplest design of the medication study would be to give the drug to a sample of patients, wait long enough for the drug to be absorbed, and measure the patients heart rates.

Designing the study, defining the significance level The simplest design of the medication study would be to give the drug to a sample of patients, wait long enough for the drug to be absorbed, and measure the patients heart rates. However, regardless of the medication or the sampling, the mean heart rate of the sampled patients will almost never be exactly 70 bpm.

Designing the study, defining the significance level The simplest design of the medication study would be to give the drug to a sample of patients, wait long enough for the drug to be absorbed, and measure the patients heart rates. However, regardless of the medication or the sampling, the mean heart rate of the sampled patients will almost never be exactly 70 bpm. Two possibilities exist:

Designing the study, defining the significance level The simplest design of the medication study would be to give the drug to a sample of patients, wait long enough for the drug to be absorbed, and measure the patients heart rates. However, regardless of the medication or the sampling, the mean heart rate of the sampled patients will almost never be exactly 70 bpm. Two possibilities exist: Either the null hypothesis is true and any differences between sample mean and population mean are purely due to chance,

Designing the study, defining the significance level The simplest design of the medication study would be to give the drug to a sample of patients, wait long enough for the drug to be absorbed, and measure the patients heart rates. However, regardless of the medication or the sampling, the mean heart rate of the sampled patients will almost never be exactly 70 bpm. Two possibilities exist: Either the null hypothesis is true and any differences between sample mean and population mean are purely due to chance, or the null hypothesis is false and the observed difference in heart rate is due to side effects of the medication.

Designing the study, defining the significance level Qualitatively, we would expect that if the sample mean pulse rate is 71 or 72 bpm, that the difference is due to random chance.

Designing the study, defining the significance level Qualitatively, we would expect that if the sample mean pulse rate is 71 or 72 bpm, that the difference is due to random chance. Similarly, we would expect that if the sample mean pulse rate is 80 or 85 bpm, that the difference is due to the medicine.

Designing the study, defining the significance level Qualitatively, we would expect that if the sample mean pulse rate is 71 or 72 bpm, that the difference is due to random chance. Similarly, we would expect that if the sample mean pulse rate is 80 or 85 bpm, that the difference is due to the medicine. But these decisions cannot be made qualitatively, there must be a quantifiable standard limits the probability that differences are due to chance.

Designing the study, defining the significance level Qualitatively, we would expect that if the sample mean pulse rate is 71 or 72 bpm, that the difference is due to random chance. Similarly, we would expect that if the sample mean pulse rate is 80 or 85 bpm, that the difference is due to the medicine. But these decisions cannot be made qualitatively, there must be a quantifiable standard limits the probability that differences are due to chance. That is where the level of significance appears.

Types of Errors There are four possible outcomes from a hypothesis test: The null hypothesis is actually true, but the sample data does not support it and we decide to reject the null hypothesis.

Types of Errors There are four possible outcomes from a hypothesis test: The null hypothesis is actually true, but the sample data does not support it and we decide to reject the null hypothesis. The null hypothesis is actually false, but the sample data leads us to believe that we should not reject the null hypothesis.

Types of Errors There are four possible outcomes from a hypothesis test: The null hypothesis is actually true, but the sample data does not support it and we decide to reject the null hypothesis. The null hypothesis is actually false, but the sample data leads us to believe that we should not reject the null hypothesis. The null hypothesis is actually true, and the sample data supports this fact.

Types of Errors There are four possible outcomes from a hypothesis test: The null hypothesis is actually true, but the sample data does not support it and we decide to reject the null hypothesis. The null hypothesis is actually false, but the sample data leads us to believe that we should not reject the null hypothesis. The null hypothesis is actually true, and the sample data supports this fact. The null hypothesis is actually false, and the sample data helps us reject it.

Types of Errors There are four possible outcomes from a hypothesis test: The null hypothesis is actually true, but the sample data does not support it and we decide to reject the null hypothesis. The null hypothesis is actually false, but the sample data leads us to believe that we should not reject the null hypothesis. The null hypothesis is actually true, and the sample data supports this fact. The null hypothesis is actually false, and the sample data helps us reject it. Two of these situations result in a correct decision, and two of them result in an incorrect decision. H 0 True H 0 False Reject H 0 Type I Error Correct decision Do not reject H 0 Correct decision Type II Error

Level of Significance (finally) The level of significance is defined as the maximum probability of committing a type I error, and is normally shown as α. For example, if α = 0.05, there is a 5% chance that we will reject a true null hypothesis. If α = 0.01, there is a 1% chance that we will reject a true null hypothesis. (The probability of committing a type II error is normally shown as β, and is normally not easily computed. However, for a given testing situation, lowering α increases β, and raising α lowers β.)

Perform calculations for a statistical test The significance level α and the type of test (two-tailed, left-tailed, right-tailed) lead to the selection of a critical value from statistical tables. The critical value(s) separates the critical region of the probability distribution curve from the noncritical region.

Perform calculations for a statistical test The significance level α and the type of test (two-tailed, left-tailed, right-tailed) lead to the selection of a critical value from statistical tables. The critical value(s) separates the critical region of the probability distribution curve from the noncritical region. The critical region is the range of values of the test value that indicate a significant difference from the population and that the null hypothesis should be rejected.

Perform calculations for a statistical test The significance level α and the type of test (two-tailed, left-tailed, right-tailed) lead to the selection of a critical value from statistical tables. The critical value(s) separates the critical region of the probability distribution curve from the noncritical region. The critical region is the range of values of the test value that indicate a significant difference from the population and that the null hypothesis should be rejected. The noncritical region is the range of values of the test value that indicate any differences from the population are probably due to chance, and that the null hypothesis should not be rejected.

Example Critical Values: Right-Tailed Test For a right-tailed test on a normal distribution, the critical value is the z value that gives Φ(z > z crit ) = α. 0.4 0.35 z * =2.33 α=φ(z>z * )=0.01 0.3 0.25 f Z (z) 0.2 0.15 0.1 0.05 0 3 2 1 0 1 2 3 z

Example Critical Values: Left-Tailed Test For a left-tailed test on a normal distribution, the critical value is the z value that gives Φ(z < z crit ) = α. 0.4 0.35 z * = 2.33 α=φ(z<z * )=0.01 0.3 0.25 f Z (z) 0.2 0.15 0.1 0.05 0 3 2 1 0 1 2 3 z

Example Critical Values: Two-Tailed Test For a two-tailed test on a normal distribution, the critical value is the z value that gives Φ(z > z crit ) + Φ(z < z crit ) = α. 0.4 0.35 z * =±2.58 α=φ(z<z * )=0.005 α=φ(z>z * )=0.005 0.3 0.25 f Z (z) 0.2 0.15 0.1 0.05 0 3 2 1 0 1 2 3 z

Introduction Examples Homework The z test is a statistical test for the mean of a population. It can be used whenever: the sample size n 30, or

Introduction Examples Homework The z test is a statistical test for the mean of a population. It can be used whenever: the sample size n 30, or if the population is known to be normally distributed and the population standard deviation σ is known (regardless of n).

Introduction Examples Homework The z test is a statistical test for the mean of a population. It can be used whenever: the sample size n 30, or if the population is known to be normally distributed and the population standard deviation σ is known (regardless of n). The formula for the z test is z = X µ σ/ n

Introduction Examples Homework The z test is a statistical test for the mean of a population. It can be used whenever: the sample size n 30, or if the population is known to be normally distributed and the population standard deviation σ is known (regardless of n). The formula for the z test is z = X µ σ/ n If this z value falls into the critical region, we reject the null hypothesis, and conclude that there is not enough evidence to support the idea that there is a significant difference between the sample and the overall population.

Introduction Examples Homework Example 1 Let s revisit our medical researcher. The mean pulse rate of the entire population of adult men is 70 beats per minute (bpm), with a standard deviation of 8 bpm. A sample of 10 adult male patients is given a new drug, and after several minutes, their mean pulse rate is measured at 75 bpm. With a level of significance of α = 0.01, is this change in pulse rate due to random chance?

Introduction Examples Homework Example 1 Solution (1) State the null hypothesis: There is no significant difference in pulse between these patients and the population as a whole. (H 0 : µ = 70)

Introduction Examples Homework Example 1 Solution (1) State the null hypothesis: There is no significant difference in pulse between these patients and the population as a whole. (H 0 : µ = 70) State the alternative hypothesis: There is a significant difference in pulse between these patients and the population as a whole. (H 1 : µ 70).

Introduction Examples Homework Example 1 Solution (1) State the null hypothesis: There is no significant difference in pulse between these patients and the population as a whole. (H 0 : µ = 70) State the alternative hypothesis: There is a significant difference in pulse between these patients and the population as a whole. (H 1 : µ 70). The null hypothesis is given in the form corresponding to a two-tailed test (H 0 : µ = k, H 1 : µ k), and for α = 0.01, the critical z values are z = ±2.58.

Introduction Examples Homework Example 1 Solution (2) Calculate the test z value: z = X µ σ/ n with X = 75, µ = 70, σ = 8, n = 10.

Introduction Examples Homework Example 1 Solution (2) Calculate the test z value: z = X µ σ/ n with X = 75, µ = 70, σ = 8, n = 10. In this case, z = 1.98. Since z falls within the noncritical region, our conclusion is to not reject the null hypothesis.

Introduction Examples Homework Example 1 Solution (2) Calculate the test z value: z = X µ σ/ n with X = 75, µ = 70, σ = 8, n = 10. In this case, z = 1.98. Since z falls within the noncritical region, our conclusion is to not reject the null hypothesis. We conclude that there is not sufficient evidence to indicate that the change in pulse rate is due to anything other than random chance.

Introduction Examples Homework Example 2 Our medical researcher continues with the pulse tests. A larger sample of 50 adult male patients has been tested identically to the first group, and their mean pulse rate is also measured at 75 bpm. With a level of significance of α = 0.01, is this change in pulse rate due to random chance?

Introduction Examples Homework Example 2 Solution (1) State the null hypothesis: There is no significant difference in pulse between these patients and the population as a whole. (H 0 : µ = 70)

Introduction Examples Homework Example 2 Solution (1) State the null hypothesis: There is no significant difference in pulse between these patients and the population as a whole. (H 0 : µ = 70) State the alternative hypothesis: There is a significant difference in pulse between these patients and the population as a whole. (H 1 : µ 70).

Introduction Examples Homework Example 2 Solution (1) State the null hypothesis: There is no significant difference in pulse between these patients and the population as a whole. (H 0 : µ = 70) State the alternative hypothesis: There is a significant difference in pulse between these patients and the population as a whole. (H 1 : µ 70). The null hypothesis is given in the form corresponding to a two-tailed test (H 0 : µ = k, H 1 : µ k), and for α = 0.01, the critical z values are z = ±2.58.

Introduction Examples Homework Example 2 Solution (2) Calculate the test z value: z = X µ σ/ n with X = 75, µ = 70, σ = 8, n = 50.

Introduction Examples Homework Example 2 Solution (2) Calculate the test z value: z = X µ σ/ n with X = 75, µ = 70, σ = 8, n = 50. In this case, z = 4.42. Since z falls within the critical region, our conclusion is to reject the null hypothesis.

Introduction Examples Homework Example 2 Solution (2) Calculate the test z value: z = X µ σ/ n with X = 75, µ = 70, σ = 8, n = 50. In this case, z = 4.42. Since z falls within the critical region, our conclusion is to reject the null hypothesis. We conclude that there is sufficient evidence to indicate that the change in pulse rate is due to something other than random chance.

Introduction Examples Homework Homework (hand in at final exam time) A manufacturer states that the average lifetime of its lightbulbs is 36 months, with a standard deviation of 10 months. Thirty bulbs are selected, and their average lifetime is found to be 32 months. Should the manufacturer s statement be rejected at α = 0.01? Should it be rejected if you tested 50 bulbs and found an average lifetime of 32 months?