Examples of Hypothesis Testing

Similar documents
HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...

Difference of Means and ANOVA Problems

An Introduction to Statistics Course (ECOE 1302) Spring Semester 2011 Chapter 10- TWO-SAMPLE TESTS

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

How To Test For Significance On A Data Set

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

Hypothesis Testing. Steps for a hypothesis test:

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...

3.4 Statistical inference for 2 populations based on two samples

Comparing Means in Two Populations

Final Exam Practice Problem Answers

Chapter 7 Section 1 Homework Set A

Lecture Notes Module 1

2. Filling Data Gaps, Data validation & Descriptive Statistics

Introduction. Hypothesis Testing. Hypothesis Testing. Significance Testing

Hypothesis testing - Steps

Dongfeng Li. Autumn 2010

Independent t- Test (Comparing Two Means)

Two-Sample T-Tests Assuming Equal Variance (Enter Means)

Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means

NCSS Statistical Software. One-Sample T-Test

Chapter 7 Notes - Inference for Single Samples. You know already for a large sample, you can invoke the CLT so:

Two-Sample T-Tests Allowing Unequal Variance (Enter Difference)

Chapter 8 Hypothesis Testing Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing

Using Stata for One Sample Tests

Chapter 7 Section 7.1: Inference for the Mean of a Population

KSTAT MINI-MANUAL. Decision Sciences 434 Kellogg Graduate School of Management

Statistics I for QBIC. Contents and Objectives. Chapters 1 7. Revised: August 2013

5/31/2013. Chapter 8 Hypothesis Testing. Hypothesis Testing. Hypothesis Testing. Outline. Objectives. Objectives

Chapter 2 Probability Topics SPSS T tests

NCSS Statistical Software

THE FIRST SET OF EXAMPLES USE SUMMARY DATA... EXAMPLE 7.2, PAGE 227 DESCRIBES A PROBLEM AND A HYPOTHESIS TEST IS PERFORMED IN EXAMPLE 7.

Quantitative Methods for Finance

t Tests in Excel The Excel Statistical Master By Mark Harmon Copyright 2011 Mark Harmon

22. HYPOTHESIS TESTING

Section 7.1. Introduction to Hypothesis Testing. Schrodinger s cat quantum mechanics thought experiment (1935)

Confidence Intervals for Cp

Introduction to Analysis of Variance (ANOVA) Limitations of the t-test

Math 108 Exam 3 Solutions Spring 00

Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1)

2 Precision-based sample size calculations

Recall this chart that showed how most of our course would be organized:

Good luck! BUSINESS STATISTICS FINAL EXAM INSTRUCTIONS. Name:

Two Related Samples t Test

Name: Date: Use the following to answer questions 3-4:

A POPULATION MEAN, CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

Two-sample hypothesis testing, II /16/2004

NCSS Statistical Software

Part 3. Comparing Groups. Chapter 7 Comparing Paired Groups 189. Chapter 8 Comparing Two Independent Groups 217

Chapter 2. Hypothesis testing in one population

Estimation of σ 2, the variance of ɛ

Regression Analysis: A Complete Example

Chapter 8: Hypothesis Testing for One Population Mean, Variance, and Proportion

Confidence Intervals for the Difference Between Two Means

HYPOTHESIS TESTING WITH SPSS:

The Wilcoxon Rank-Sum Test

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4)

Hypothesis Testing: Two Means, Paired Data, Two Proportions

UNDERSTANDING THE INDEPENDENT-SAMPLES t TEST

HYPOTHESIS TESTING: POWER OF THE TEST

Mean = (sum of the values / the number of the value) if probabilities are equal

Objectives. 6.1, 7.1 Estimating with confidence (CIS: Chapter 10) CI)

1.5 Oneway Analysis of Variance

General Method: Difference of Means. 3. Calculate df: either Welch-Satterthwaite formula or simpler df = min(n 1, n 2 ) 1.

C. The null hypothesis is not rejected when the alternative hypothesis is true. A. population parameters.

Paired T-Test. Chapter 208. Introduction. Technical Details. Research Questions

Chapter Study Guide. Chapter 11 Confidence Intervals and Hypothesis Testing for Means

UNDERSTANDING THE DEPENDENT-SAMPLES t TEST

Guide to Microsoft Excel for calculations, statistics, and plotting data

Simple linear regression

Statistics Review PSY379

STATISTICS 8, FINAL EXAM. Last six digits of Student ID#: Circle your Discussion Section:

Chapter 26: Tests of Significance

How To Compare Birds To Other Birds

Understand the role that hypothesis testing plays in an improvement project. Know how to perform a two sample hypothesis test.

AP STATISTICS (Warm-Up Exercises)

Descriptive Statistics

Introduction. Statistics Toolbox

Simple Linear Regression Inference

Opgaven Onderzoeksmethoden, Onderdeel Statistiek

Experimental Design. Power and Sample Size Determination. Proportions. Proportions. Confidence Interval for p. The Binomial Test

Nonparametric Two-Sample Tests. Nonparametric Tests. Sign Test

Data Mining Techniques Chapter 5: The Lure of Statistics: Data Mining Using Familiar Tools

Psychology 60 Fall 2013 Practice Exam Actual Exam: Next Monday. Good luck!

Testing Research and Statistical Hypotheses

1 Nonparametric Statistics

How Does My TI-84 Do That

Point and Interval Estimates

Using R for Linear Regression

Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics.

Chicago Booth BUSINESS STATISTICS Final Exam Fall 2011

II. DISTRIBUTIONS distribution normal distribution. standard scores

Principles of Hypothesis Testing for Public Health

2 Sample t-test (unequal sample sizes and unequal variances)

MONT 107N Understanding Randomness Solutions For Final Examination May 11, 2010

STAT 350 Practice Final Exam Solution (Spring 2015)

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Math 251, Review Questions for Test 3 Rough Answers

Chapter 7. One-way ANOVA

Comparing Two Groups. Standard Error of ȳ 1 ȳ 2. Setting. Two Independent Samples

Transcription:

Overview Examples of Hypothesis Testing Dr Tom Ilvento Department of Food and Resource Economics Let s continue with some examples of hypothesis tests introduce computer output compare hypothesis test to confidence intervals see what happens if we use a t versus a z for the Critical Value See what happens with an outlier Introduce hypothesis tests for proportions 2 Example: Test for Humerus Bones Humerus Bones Example Humerus bones from the same species have the same length to width ratio, so they are often used as a means to identify bones by archeologists It is known that a Species A exhibits a mean ratio of 8.5 Suppose 4 fossils of humerus bones were unearthed in East Africa Test whether the mean ratio from this sample differs from Species A (µ = 8.5). Use!=.0 3 The length to width ratio was calculated for the sample and resulted in the following univariate statistics n= 4 Mean = 9.26 s=.20 Min value = 6.23 Max value=2.00 4

Humerus Bones Hypothesis Test The Components of a Hypothesis Test Set up the Null Hypothesis µ =??? µ = 8.5 Set up the Alternative Hypothesis It takes up one of three forms The problem asked to Test whether the mean ratio from this sample differs from Species A µ! 8.5 Two-tailed? If large sample, > 30, use s as estimate of sigma and use a t-value 5 µ = 8.5 µ! 8.5 2-tailed n= 4, " unknown, use t t* = (9.258 8.5)/.88 # =.0,.0/2, 40 d.f., t = ± 2.704 t* = 4.032 t* > t.0/2, 40 df 4.032 > 2.704 Reject Ho: µ = 8.5 6 Here s how it Looks in Pictures What would have happened if I used a z-test in place of the t-test? Our critical values were -2.704 and 2.704 Our test statistic was 4.032 the test statistic is in the rejection region on the right hand side I can also see it from the output from JMP -2.704 2.704 t*=4.03 7 µ = 8.5 µ! 8.5 2-tailed n= 4, " unknown, large sample use z z* = (9.258 8.5)/.88 # =.0,.0/2, z = ± 2.575 z* = 4.032 z* > z.0/2 4.032 > 2.575 Reject Ho: µ = 8.5 8

Peanut Package Problem Peanut Package Problem A peanut company sells a package product of 6 oz of salted peanuts through an automated process Not all packages contain exactly 6 oz of peanuts they shoot for an average of 6 oz with a standard deviation of.8 oz. They routinely take random samples of 40 packages and weigh them They want to see if each sample is different from the package claim at!=. 9 If the manufacturing process overfills the packages, even by a little, they lose profit If the manufacturing process under-fills the packages they risk angry customers and fines from government They are interested in a twotailed test, a priori Let s say they take a sample of 40 packages and get a mean value of 6.42 Does this sample result warrant checking the manufacturing process? Note: this is a problem where we can view " as being known: " =.8 SE=.8/40.5 =.265 Use the z-distribution 0 The Components of a Hypothesis Test 90% Confidence Interval µ = 6.0 µ! 6.0 2-tailed n= 40, " =.8, use z z* = (6.42 6.00)/.265 # =.0,.0/2, z = ±.645 z* = 3.32 z* > z.0/2 3.32 >.645 Reject Ho: µ = 6.0 Calculate the 90% confidence interval for this problem 6.42 ±.645[.8/(40).5 ] 6.42 ±.208 6.2 to 6.63 Note that 6 is NOT in the 90% C.I. A similar (-!) C.I. will generate the same result as a two-tailed hypothesis test If the the Null value is in the C.I. You cannot reject Ho 2

Another way to approach this problem using C.I. Another way to use a confidence interval: Calculate the C.I. Around 6 oz 6 ±.645[.8/(40).5 ] 6 ±.208 5.792 to 6.208 Any sample that falls outside of this interval will cause them to reject the null hypothesis (based on two-tailed test and! =.) Note: Type I Error =. They can expect to wrongly reject Ho: 0 of 00 times 3 Let s try a problem together The Body mass index (BMI) is a measure of body fat based on height and weight that applies to both adult men and women. A BMI > than 30 is considered obese. A random sample of adults participated in a health study, and 3 of them had a BMI > 30. We will look at the systolic blood pressure reading, which represents the maximum pressure exerted when the heart contracts. Assume the systolic blood pressure follows something like a normal distribution and an unhealthy reading is greater than 20. We want to test to see if people with BMI > 30 tend to have a systolic blood pressure reading greater than 20. Use # =.0 Systolic Blood Pressure for patients with BMI >30 Hypothesis Test for Sys BP Here are the results from JMP You take the relevant information SYS BP Quantiles maximum 00.0% 99.5% 97.5% 90.0% 75.0% 50.0% 25.0% 0.0% 2.5% 0.5% 0.0% quartile median quartile minimum 8.00 8.00 8.00 70.60 32.00 25.00 3.50 08.20 07.00 07.00 07.00 Moments Mean Std Dev Std Err Mean upper 95% Mean lower 95% Mean N Sum Wgt Sum Variance Skewness Kurtosis CV N Missing 27.65 20.304 5.63 39.885 5.346 3.000 3.000 659.000 42.256.780 3.424 5.90 0.000 Stem and Leaf Stem Leaf Count 8 7 7 6 6 5 5 5 3 3 3 2 2 556 3 2 3 6 034 3 0 7 0 7 represents 07 5 6

Hypothesis Test for Sys BP JMP output for the Hypothesis Test µ = 20 µ > 20 -tailed upper n= 3, " unknown, use t t* = (27.65 20)/5.63 # =.0, 2 d.f., t =.356 t* =.352 t* < t.0, 2 df.352 <.356 Cannot Reject Ho: µ = 20 7 JMP shows the same output, but not the t-value for the Critical Value Instead it gives a p-value This is the probability of finding a value greater than the test statistic into the tail as either a one-tail or two-tail test We would compare the p-value SYS BP Test Mean=value Hypothesized Value Actual Estimate df Std Dev Test Statistic Prob > t Prob > t Prob < t t Test.3523 0.202 0.006 0.8994 20 27.65 2 20.304 00 0 20 30 0 for the appropriate test to! 8 One value was Extreme, 8, what happens if we remove it? Hypothesis Tests for Proportions SYS BP Moments Mean Std Dev Std Err Mean upper 95% Mean lower 95% Mean N Sum Wgt Sum Variance Skewness Kurtosis CV N Missing 23.67 3.002 3.753 3.428.905 2.000 2.000 78.000 69.06.242 2.356 0.557 0.000 Stem and Leaf Stem Leaf 5 5 5 3 3 3 2 556 2 3 6 034 0 7 Count 2 3 3 0 7 represents 07 The data are better behaved. The changes: the mean is lower, but so is the standard deviation and ultimately the standard error; we lose a degree of freedom µ = 20 µ > 20 -tailed upper n= 2, " unknown, use t t* = (23.67 20)/3.753 # =.0, d.f., t =.363 t* =.8439 t* < t.0, df.8439 <.363 Cannot Reject Ho: µ = 20 9 The Pepsi Challenge asked soda drinkers to compare Diet Coke and Diet Pepsi in a blind taste test. Pepsi claimed that more than # of Diet Coke drinkers said they preferred Diet Pepsi (P=.5) Suppose we take a random sample of 00 Diet Coke Drinkers and we found that 56 preferred Diet Pepsi. Use # =.05 level to test if we have enough evidence to conclude that more than half of Diet Coke Drinkers will prefer Pepsi. 20

Hypothesis Test for a Proportion Hypothesis test for proportions is the same It must be based on a large sample We have an estimate of the population parameter, P, from a sample - p We use the same strategy of comparing our sample estimate to the theoretical sampling distribution And the same formulas But, with one slight twist! 2 Remember, if we have additional information we should use it With proportions we have a slightly different approach to the standard error Remember, the variance, std dev, and standard error of a proportion is tied to P or p " 2 = PQ " = (PQ/n).5 If we hypothesize that P =.5 under a null hypothesis Then we ought to use the hypothesized P and Q as the components for the standard error of the sampling distribution! P = (.5 ".5) /00! P =.25/00 =.05 22 Pepsi Challenge Hypothesis Test Here s how it Looks in Pictures P =.5 P >.5 -tailed, upper n= 00, " =.25, binomial = normal z* = (.56.5)/.05 # =.05, z =.645 z* =.20 z* < z.05.20 <.645 Cannot Reject Ho: P =.5 23 Our critical value was.645 Our test statistic was.20 the test statistic is not in the rejection region on the right hand side z*=.2.645 24

Summary I hope you are getting more comfortable with the mechanics of a hypothesis test Take it step by step Determine if the problem is dealing with a proportion or a mean And if a mean, do we know sigma? Is the sample size large? Can we reasonable assume the population variable follows a normal distribution? 25