Hypothesis Testing. Hypothesis Testing



Similar documents
Hypothesis Testing --- One Mean

HYPOTHESIS TESTING: POWER OF THE TEST

An Introduction to Statistics Course (ECOE 1302) Spring Semester 2011 Chapter 10- TWO-SAMPLE TESTS

Section 7.1. Introduction to Hypothesis Testing. Schrodinger s cat quantum mechanics thought experiment (1935)

Introduction to Hypothesis Testing. Hypothesis Testing. Step 1: State the Hypotheses

Estimation of σ 2, the variance of ɛ

Correlational Research

Chapter 2. Hypothesis testing in one population

Chapter 7 Notes - Inference for Single Samples. You know already for a large sample, you can invoke the CLT so:

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...

Introduction to Hypothesis Testing OPRE 6301

Two-Sample T-Tests Allowing Unequal Variance (Enter Difference)

3.4 Statistical inference for 2 populations based on two samples

Two-Sample T-Tests Assuming Equal Variance (Enter Means)

Hypothesis Testing. Reminder of Inferential Statistics. Hypothesis Testing: Introduction

Pearson's Correlation Tests

BA 275 Review Problems - Week 6 (10/30/06-11/3/06) CD Lessons: 53, 54, 55, 56 Textbook: pp , ,

Hypothesis testing - Steps

5/31/2013. Chapter 8 Hypothesis Testing. Hypothesis Testing. Hypothesis Testing. Outline. Objectives. Objectives

Independent t- Test (Comparing Two Means)

Difference of Means and ANOVA Problems

Experimental Design. Power and Sample Size Determination. Proportions. Proportions. Confidence Interval for p. The Binomial Test

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...

Chapter 8 Hypothesis Testing Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing

Regression Analysis: A Complete Example

Simple Linear Regression Inference

Two Related Samples t Test

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

Point Biserial Correlation Tests

Chapter Study Guide. Chapter 11 Confidence Intervals and Hypothesis Testing for Means

Stats Review Chapters 9-10

Permutation Tests for Comparing Two Populations

BA 275 Review Problems - Week 5 (10/23/06-10/27/06) CD Lessons: 48, 49, 50, 51, 52 Textbook: pp

How To Test For Significance On A Data Set

Final Exam Practice Problem Answers

Module 2 Probability and Statistics

22. HYPOTHESIS TESTING

12.5: CHI-SQUARE GOODNESS OF FIT TESTS

Descriptive Statistics

DDBA 8438: Introduction to Hypothesis Testing Video Podcast Transcript

Name: (b) Find the minimum sample size you should use in order for your estimate to be within 0.03 of p when the confidence level is 95%.

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Applied Data Analysis. Fall 2015

Outline. Topic 4 - Analysis of Variance Approach to Regression. Partitioning Sums of Squares. Total Sum of Squares. Partitioning sums of squares

Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means

Testing a claim about a population mean

Chapter 4 Statistical Inference in Quality Control and Improvement. Statistical Quality Control (D. C. Montgomery)

Lesson 9 Hypothesis Testing

Chapter 8: Hypothesis Testing for One Population Mean, Variance, and Proportion

Week TSX Index

Introduction to. Hypothesis Testing CHAPTER LEARNING OBJECTIVES. 1 Identify the four steps of hypothesis testing.

General Method: Difference of Means. 3. Calculate df: either Welch-Satterthwaite formula or simpler df = min(n 1, n 2 ) 1.

t Tests in Excel The Excel Statistical Master By Mark Harmon Copyright 2011 Mark Harmon

Interaction between quantitative predictors

Practice Problems and Exams

Calculating P-Values. Parkland College. Isela Guerra Parkland College. Recommended Citation

KSTAT MINI-MANUAL. Decision Sciences 434 Kellogg Graduate School of Management

THE FIRST SET OF EXAMPLES USE SUMMARY DATA... EXAMPLE 7.2, PAGE 227 DESCRIBES A PROBLEM AND A HYPOTHESIS TEST IS PERFORMED IN EXAMPLE 7.

Simple Regression Theory II 2010 Samuel L. Baker

Non-Inferiority Tests for One Mean

Hypothesis Testing for Beginners

Business Statistics, 9e (Groebner/Shannon/Fry) Chapter 9 Introduction to Hypothesis Testing

Comparing Two Groups. Standard Error of ȳ 1 ȳ 2. Setting. Two Independent Samples

Understand the role that hypothesis testing plays in an improvement project. Know how to perform a two sample hypothesis test.

NONPARAMETRIC STATISTICS 1. depend on assumptions about the underlying distribution of the data (or on the Central Limit Theorem)

Introduction. Hypothesis Testing. Hypothesis Testing. Significance Testing

Psychology 60 Fall 2013 Practice Exam Actual Exam: Next Monday. Good luck!

WISE Power Tutorial All Exercises

A POPULATION MEAN, CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

Study Guide for the Final Exam

Confidence Intervals for the Difference Between Two Means

6: Introduction to Hypothesis Testing

NCSS Statistical Software. One-Sample T-Test

Statistics I for QBIC. Contents and Objectives. Chapters 1 7. Revised: August 2013

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression

Part 2: Analysis of Relationship Between Two Variables

SAMPLE SIZE CONSIDERATIONS

Using R for Linear Regression

II. DISTRIBUTIONS distribution normal distribution. standard scores

1.5 Oneway Analysis of Variance

Descriptive Analysis

Non-Inferiority Tests for Two Means using Differences

This can dilute the significance of a departure from the null hypothesis. We can focus the test on departures of a particular form.

1-3 id id no. of respondents respon 1 responsible for maintenance? 1 = no, 2 = yes, 9 = blank

Hypothesis Testing: Two Means, Paired Data, Two Proportions

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4)

Math 251, Review Questions for Test 3 Rough Answers

Chapter 7 Section 1 Homework Set A

SPSS/Excel Workshop 2 Semester One, 2010

UNDERSTANDING THE DEPENDENT-SAMPLES t TEST

2 Precision-based sample size calculations

Introduction to Analysis of Variance (ANOVA) Limitations of the t-test

Chapter 4: Statistical Hypothesis Testing

Practice problems for Homework 12 - confidence intervals and hypothesis testing. Open the Homework Assignment 12 and solve the problems.

COMPARISONS OF CUSTOMER LOYALTY: PUBLIC & PRIVATE INSURANCE COMPANIES.

Bowerman, O'Connell, Aitken Schermer, & Adcock, Business Statistics in Practice, Canadian edition

Statistiek II. John Nerbonne. October 1, Dept of Information Science

Transcription:

Hypothesis Testing Daniel A. Menascé Department of Computer Science George Mason University 1 Hypothesis Testing Purpose: make inferences about a population parameter by analyzing differences between observed sample statistics and the results one expects to obtain if some underlying assumption is true. Null hypothesis: H 0 : µ = x Alternative hypothesis: H1 : µ x If the null hypothesis is rejected then the alternative hypothesis is accepted. 2 1

0.0 µ rejection non-rejection rejection critical values Test statistic: Z X µ = σ n 3 Risks in Decision Making Type I Error occurs if H o is rejected when it is true. Pr [H o is rejected true] = α Type II Error occurs if H o is not rejected when it is false. Pr[H o is not rejected false] = β Confidence coefficient: Pr [H o not rejected true]= 1- α Power of the test: Pr[H o is rejected false]= 1-β 4 2

Actual Situation Accept H o H o true Correct decision Confidence=1-α H o false Type II Error: Pr[Type II]=β Reject H o Type I Error P[Type I]=α Correct Decision Power=1-β 5 Example of Hypothesis Testing A sample of 50 files from a file system is selected. The sample mean is 12.3Kbytes. The standard deviation is known to be 0.5 Kbytes. Ho: µ = 12.5 Κbytes H 1 : µ 12.5 Κbytes Confidence: 0.95 6 3

rejection=0.025 rejection= 0.025-2.83 0-1.96 1.96 non-rejection = 0.95 NORMINV(1-0.05/2,0,1) 12.3 12.5 Z = = 2.83 0.5 50 Reject H o 7 Z Test of Hypothesis for the Mean Null Hypothesis m= 12.5 Level of Significance 0.05 Population Standard Deviation 0.5 Sample Size 50 Sample Mean 12.3 Standard Error of the Mean 0.070710678 Z Test Statistic -2.828427125 Two-Tailed Test Lower Critical Value -1.959961082 Upper Critical Value 1.959961082 p-value 0.00467786 Reject the null hypothesis 8 4

Steps in Hypothesis Testing 1. State the null and alternative hypothesis. 2. Choose the level of significance α. 3. Choose the sample size n. Larger samples allow us to detect even small differences between sample statistics and true population parameters. For a given α, increasing n decreases β. 4. Choose the appropriate statistical technique and test statistic to use (Z or t). 9 Steps in Hypothesis Testing 5. Determine the critical values that divide the regions of acceptance and nonacceptance. 6. Collect the data and compute the sample mean and the appropriate test statistic (e.g., Z). 7. If the test statistic falls in the non-reject region, H o cannot be rejected. Else H o is rejected. 10 5

The p-value Approach p-value: observed level of significance. Defined as the probability that the test statistic is equal to or more extreme than the result obtained from the sample data, given that H o is true. 11 p/2 p/2 0 -Z Z p/2=f(-z) =NORMDIST(-z,0,1,true) critical values If p α then do not reject H o, else reject H o. 12 6

p/2 p/2 0 -Z Z p/2=f(-z) =NORMDIST(-z,0,1,true) critical values Do not reject H o 13 p/2 p/2 0 -Z Z p/2=f(-z) =NORMDIST(-z,0,1,true) critical values Reject H o 14 7

NORMDIST(-2.828427125,0,1,TRUE) Computing p-values Z Test of Hypothesis for the Mean Null Hypothesis m= 12.5 Level of Significance 0.05 Population Standard Deviation 0.5 Sample Size 50 Sample Mean 12.3 Standard Error of the Mean 0.070710678 Z Test Statistic -2.828427125 Two-Tailed Test Lower Critical Value -1.959961082 Upper Critical Value 1.959961082 p-value 0.00467786 Reject the null hypothesis The null hypothesis is rejected because p (0.0047) is less than the level of significance (0.05). 15 Steps in Determining the p-value. 1. State the null and alternative hypothesis. 2. Choose the level of significance α. 3. Choose the sample size n. Larger samples allow us to detect even small differences between sample statistics and true population parameters. For a given α, increasing n decreases β. 4. Choose the appropriate statistical technique and test statistic to use (Z or t). 16 8

Steps in Determining the p-value. 5. Collect the data and compute the sample mean and the appropriate test statistic (e.g., Z). 6. Calculate the p-value based on the test statistic 7. Compare the p-value to α. 8. If p α then do not reject H o, else reject H o. 17 One-tailed Tests Null hypothesis is an inequality. H o 3.5 H 1 < 3.5 18 9

α 1-α 0.0 µ rejection non-rejection critical value Test statistic: Z X µ = σ n 19 Example of One-Tailed Test A sample of 50 files from a file system is selected. The sample mean is 12.35Kbytes. The standard deviation is known to be 0.5 Kbytes. Ho: µ 12.3 Κbytes H 1 : µ < 12.3 Κbytes Confidence: 0.95 20 10

Example of One-Tailed Test Z X µ = σ / n 12.35 12.3 = = 0.707 0.5/ 50 (test statistic) Critical value =NORMINV(0.05,0,1)= -1.645. Region of non-rejection: Z 1.645. So, do not reject H o. (Z exceeds critical value) 21 0.05 (α) 0.95 (1 α) 0.0 µ rejection non-rejection 0.707 (test statistic) -1.645 (critical value) Test statistic: Z X µ = σ n 22 11

One-tailed Test Z Test of Hypothesis for the Mean Null Hypothesis m= 12.3 Level of Significance 0.05 Population Standard Deviation 0.5 Sample Size 50 Sample Mean 12.35 Standard Error of the Mean 0.070710678 Z Test Statistic 0.707106781 Lower-Tail Test Lower Critical Value -1.644853 p-value 0.760250013 Do not reject the null hypothesis 23 Hypothesis Tests with Unknown σ If the population is assumed to be normally distributed the sampling distribution for the mean follows a t distribution with n-1 degrees of freedom. t statistic for unknown σ: t = X µ s n Use sample standard deviation 24 12

Example of Hypothesis Testing A sample of 50 files from a file system is selected. The sample mean is 12.3Kbytes. The sample standard deviation is 0.5 Kbytes. Ho: µ = 12.35 Κbytes H 1 : µ 12.35 Κbytes Confidence: 0.95 25 t Test of Hypothesis for the Mean Null Hypothesis m= 12.35 Level of Significance 0.05 Sample Size 50 Sample Mean 12.3 Sample Standard Deviation 0.5 Standard Error of the Mean 0.070710678 Degrees of Freedom 49 t Test Statistic -0.707106781 TINV(0.05,49) Two-Tailed Test Lower Critical Value -2.009574018 Upper Critical Value 2.009574018 p -Value 0.482849571 Do not reject the null hypothesis The t test statistic is between the lower and critical values. So, do not reject the null hypothesis. 26 13