How To Test For A Hypothesis Test

Similar documents
Chapter 7 Notes - Inference for Single Samples. You know already for a large sample, you can invoke the CLT so:

1 Sufficient statistics

Chapter 8 Hypothesis Testing Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...

3.4 Statistical inference for 2 populations based on two samples

An Introduction to Statistics Course (ECOE 1302) Spring Semester 2011 Chapter 10- TWO-SAMPLE TESTS

Introduction to Hypothesis Testing. Hypothesis Testing. Step 1: State the Hypotheses

Hypothesis Testing for Beginners

HYPOTHESIS TESTING: POWER OF THE TEST

Experimental Design. Power and Sample Size Determination. Proportions. Proportions. Confidence Interval for p. The Binomial Test

Independent t- Test (Comparing Two Means)

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...

Lecture 8. Confidence intervals and the central limit theorem

Section 7.1. Introduction to Hypothesis Testing. Schrodinger s cat quantum mechanics thought experiment (1935)

THE FIRST SET OF EXAMPLES USE SUMMARY DATA... EXAMPLE 7.2, PAGE 227 DESCRIBES A PROBLEM AND A HYPOTHESIS TEST IS PERFORMED IN EXAMPLE 7.

BA 275 Review Problems - Week 5 (10/23/06-10/27/06) CD Lessons: 48, 49, 50, 51, 52 Textbook: pp

Exact Confidence Intervals

Calculating P-Values. Parkland College. Isela Guerra Parkland College. Recommended Citation

Chapter 4 Statistical Inference in Quality Control and Improvement. Statistical Quality Control (D. C. Montgomery)

How To Test For Significance On A Data Set

Hypothesis testing - Steps

Module 2 Probability and Statistics

Name: Date: Use the following to answer questions 3-4:

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4)

Chapter 2. Hypothesis testing in one population

Statistics I for QBIC. Contents and Objectives. Chapters 1 7. Revised: August 2013

C. The null hypothesis is not rejected when the alternative hypothesis is true. A. population parameters.

BA 275 Review Problems - Week 6 (10/30/06-11/3/06) CD Lessons: 53, 54, 55, 56 Textbook: pp , ,

In the general population of 0 to 4-year-olds, the annual incidence of asthma is 1.4%

Principle of Data Reduction

12.5: CHI-SQUARE GOODNESS OF FIT TESTS

Tests of Hypotheses Using Statistics

CHI-SQUARE: TESTING FOR GOODNESS OF FIT

Introduction to Hypothesis Testing

General Method: Difference of Means. 3. Calculate df: either Welch-Satterthwaite formula or simpler df = min(n 1, n 2 ) 1.

Estimation of σ 2, the variance of ɛ

Stats Review Chapters 9-10

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

Chapter 7 Section 1 Homework Set A

4. Continuous Random Variables, the Pareto and Normal Distributions

Non-Parametric Tests (I)

Introduction to Analysis of Variance (ANOVA) Limitations of the t-test

Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means

Chapter 4: Statistical Hypothesis Testing

Online 12 - Sections 9.1 and 9.2-Doug Ensley

1. How different is the t distribution from the normal?

Statistiek I. Proportions aka Sign Tests. John Nerbonne. CLCG, Rijksuniversiteit Groningen.

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

Bivariate Statistics Session 2: Measuring Associations Chi-Square Test

Basics of Statistical Machine Learning

CHAPTER 6: Continuous Uniform Distribution: 6.1. Definition: The density function of the continuous random variable X on the interval [A, B] is.

Maximum Likelihood Estimation

Simple Linear Regression Inference

Lecture 8: More Continuous Random Variables

Practice problems for Homework 12 - confidence intervals and hypothesis testing. Open the Homework Assignment 12 and solve the problems.

Math 251, Review Questions for Test 3 Rough Answers

Opgaven Onderzoeksmethoden, Onderdeel Statistiek

Introduction. Hypothesis Testing. Hypothesis Testing. Significance Testing

The Normal distribution

Mind on Statistics. Chapter 12

Understand the role that hypothesis testing plays in an improvement project. Know how to perform a two sample hypothesis test.

MATH4427 Notebook 2 Spring MATH4427 Notebook Definitions and Examples Performance Measures for Estimators...

Parametric and non-parametric statistical methods for the life sciences - Session I

Correlational Research

Hypothesis Testing. Hypothesis Testing

Practice problems for Homework 11 - Point Estimation

Dongfeng Li. Autumn 2010

AP STATISTICS (Warm-Up Exercises)

1 Nonparametric Statistics

Difference of Means and ANOVA Problems

Statistics 2014 Scoring Guidelines

Two-sample hypothesis testing, II /16/2004

Hypothesis Testing. Reminder of Inferential Statistics. Hypothesis Testing: Introduction

Stat 5102 Notes: Nonparametric Tests and. confidence interval

Section 13, Part 1 ANOVA. Analysis Of Variance

Introduction to Hypothesis Testing OPRE 6301

UNDERSTANDING THE INDEPENDENT-SAMPLES t TEST

Principles of Hypothesis Testing for Public Health

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

A) B) C) D)

5.1 Identifying the Target Parameter

Week 3&4: Z tables and the Sampling Distribution of X

Topic 8. Chi Square Tests

Multivariate normal distribution and testing for means (see MKB Ch 3)

BUS/ST 350 Exam 3 Spring 2012

Two Related Samples t Test

Review #2. Statistics

Statistics Review PSY379

Study Guide for the Final Exam

Hypothesis Testing --- One Mean

Unit 26 Estimation with Confidence Intervals

Descriptive Statistics

Chapter 3: DISCRETE RANDOM VARIABLES AND PROBABILITY DISTRIBUTIONS. Part 3: Discrete Uniform Distribution Binomial Distribution

Interaction between quantitative predictors

Comparing Two Groups. Standard Error of ȳ 1 ȳ 2. Setting. Two Independent Samples

Unit 26: Small Sample Inference for One Mean

Likelihood Approaches for Trial Designs in Early Phase Oncology


Two-Sample T-Tests Assuming Equal Variance (Enter Means)

Transcription:

Introduction to Hypothesis Testing Point estimation and confidence intervals are useful statistical inference procedures. Another type of inference is used frequently used concerns tests of hypotheses. We are interested in a random variable X which has density f(x; θ) where θ Ω. We would like to know whether θ ω 0 or θ ω 1, where ω 0 ω 1 = Ω.

Hypothesis testing: H 0 : θ ω 0 versus H 1 : θ ω 1. The hypothesis H 0 is referred to as the null hypothesis while H 1 is referred as the alternative hypothesis. Often the null hypothesis represents no change or no difference from the past while the alternative hypothesis represents change or difference. The decision rule to take H 0 or H 1 is based on a sample X 1, X 2,..., X n from the distribution of X. We may make mistakes.

Table 1: 2 2 Decision Table for a Test of Hypothesis Decision H 0 is True H 1 is True Reject H 0 Type I Error Correct Decision Accept H 0 Correct Decision Type II Error Type I Error = P ( Reject H 0 H 0 is true) Type II Error = P ( Accept H 0 H 1 is true)

An Example Let µ 1 be the mean score of midterm for UMBC male students and µ 2 be the mean score for female students. Hypothesis testing: H 0 : µ 1 µ 2 0 vs. H 1 : µ 1 µ 2 > 0 H 0 : µ 1 µ 2 = 0 vs. H 1 : µ 1 µ 2 0 H 0 : µ 1 75 vs. H 1 : µ 1 > 75

Some Definitions Critical Region C: reject H 0 (accept H 1 ): if (X 1,..., X n ) C accept H 0 (reject H 1 ): if (X 1,..., X n ) C c. A Type I error occurs if H 0 is rejected when it is true while a Type II error occurs if H 0 is accepted when H 1 is true. The goal is to select a critical region from all possible critical regions which minimizes the probabilities of these errors. In general, this is impossible. For example, let C =, type I error is 0 but the type II error is 1. Often we consider type I error to be the worse of the two errors. We select critical region which bound the type I error and minimize the type II error.

We say a critical region C is of size α (significance level) if α = max θ ω 0 P θ [(X 1,..., X n ) C]. Over all critical regions of size α, we consider critical regions which have lower probabilities of Type II error. For θ ω 1, we want to maximize 1 P θ [Type II error] = P θ [(X 1,..., X n ) C]. The probability on the right hand side is called the Power of the test at θ. The power function is defined as γ C (θ) = P θ [(X 1,..., X n ) C] = P ( accept H 1 H 1 is true )

Testing for a Binomial Proportion of Success Let X be a Bernoulli random variable with probability of success p. Suppose we want to test H 0 : p = p 0 vs. H 1 : p < p 0. Let X 1,..., X n be a random sample from the distribution of X and let S = n i=1 X i. An intuitive decision rule (critical region) is Reject H 0 in favor of H 1 if S k, where k is such that α = P H0 [S k]. Since S is Binomial, we may find k which solve this equation. For example, n = 20, p 0 = 0.7 and α = 0.15, then, S bin(20, 0.7) and k. = 11.

The power function is γ(p) = P p [S k], p < p 0. See Figure 5.5.1 for the picture of γ(p). Note that the function is decreasing. The power is higher to detect the alternative p = 0.2 than p = 0.6. Simple Hypothesis: completely specifies the underlying distribution, e.g., H 0 : p = p 0. Composite hypotheses: compose of many simple hypothesis, e.g., H 1 : p < p 0.

Large Sample Tests for the Mean The test in the last example is based on the exact distribution of its test statistics, i.e., the binomial distribution. Often we cannot get the distribution of test statistics in closed form. Use central limit theorem. Let X be a random variable with mean µ and variance σ 2. We want to test where µ 0 is specified. H 0 : µ = µ 0 vs. H 1 : µ > µ 0, For example: µ 0 is the mean level on a standardized test of students who have been taught by a standard method of teaching. Let X 1,..., X n be a random sample from the distribution of X.

Because X p µ, an intuitive decision rule is given by Reject H 0 in favor of H 1 if X is much larger than µ0. Using central limit theorem, X µ S/ n p Z. Using this, we obtain a test with an approximate size α, if Reject H 0 in favor of H 1 if X µ S/ n z α. The test is intuitive. To reject H 0, X must exceed µ0 by at least z α S/ n.

The power function is approximated by γ(µ) = P µ ( X µ 0 + z α σ/ n). n(µ0 µ) = 1 Φ(z α + ) σ n(µ0 µ) = Φ( z α ), σ which is an increasing function of µ.

Let X N(µ, σ 2 ). Consider Tests for µ under Normality H 0 : µ = µ 0 vs. H 1 : µ > µ 0. Under H 0, the test statistics T = ( X µ)/(s/ n) has a t-distribution with n 1 degrees of freedom. The decision rule is Reject H 0 in favor of H 1 if T = X µ S/ n t α,n 1.

p-value p-value: The p-value is the observed tail probability of a statistics being at least as extreme as the particular observed value when H 0 is true. If Y = u(x 1,..., X n ) is the statistics to be used in a test of H 0 and if the critical region is of the form u(x 1,..., x n ) c, an observed value u(x 1,..., x n ) = d would mean that the p-value = P (Y d; H 0 ). If p-value is small, we need reject the null hypothesis.

Let X 1,..., X 25 be a random sample from N(µ, 2 2 ). To test H 0 : µ = 77 vs. H 1 : µ < 77. Say we observe the 25 values and determine that x = 76.1. We know that Z = ( X 77)/ 4/25 is N(0, 1) provided that µ = 77. Since the observed statistics is z = (76.1 77)/0.4 = 2.25, the p-value of the test is Φ( 2.25) = 1 0.998 = 0.012. Accordingly, if we use a significance level of α = 0.05, we would reject H 0 and accept µ < 77.