Chapter 9 Testing Hypothesis and Assesing Goodness of Fit

Similar documents
1 Sufficient statistics

Chapter 4: Statistical Hypothesis Testing

Triangle deletion. Ernie Croot. February 3, 2010

Introduction to Detection Theory

Practice problems for Homework 11 - Point Estimation

Chapter 3 RANDOM VARIATE GENERATION

HYPOTHESIS TESTING: POWER OF THE TEST

You flip a fair coin four times, what is the probability that you obtain three heads.

Experimental Design. Power and Sample Size Determination. Proportions. Proportions. Confidence Interval for p. The Binomial Test

CHAPTER 6: Continuous Uniform Distribution: 6.1. Definition: The density function of the continuous random variable X on the interval [A, B] is.

Department of Mathematics, Indian Institute of Technology, Kharagpur Assignment 2-3, Probability and Statistics, March Due:-March 25, 2015.

Chapter 8 Hypothesis Testing Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing

Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1)

Principle of Data Reduction

PSTAT 120B Probability and Statistics

MATH 10: Elementary Statistics and Probability Chapter 5: Continuous Random Variables

How To Calculate The Power Of A Cluster In Erlang (Orchestra)

Overview of Monte Carlo Simulation, Probability Review and Introduction to Matlab

I. Pointwise convergence

Lecture 9: Bayesian hypothesis testing

Practice with Proofs

Homework # 3 Solutions

Handout #1: Mathematical Reasoning

5. Continuous Random Variables

Notes on metric spaces

Section 5.1 Continuous Random Variables: Introduction

Maximum Likelihood Estimation

I. GROUPS: BASIC DEFINITIONS AND EXAMPLES

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4)

1 Inner Products and Norms on Real Vector Spaces

Undergraduate Notes in Mathematics. Arkansas Tech University Department of Mathematics

User Guide Thank you for purchasing the DX90

Statistics 100A Homework 7 Solutions

MATH4427 Notebook 2 Spring MATH4427 Notebook Definitions and Examples Performance Measures for Estimators...

Math 461 Fall 2006 Test 2 Solutions

3.4. The Binomial Probability Distribution. Copyright Cengage Learning. All rights reserved.

Lecture 7: Continuous Random Variables

Pearson's Correlation Tests

Generating Random Numbers Variance Reduction Quasi-Monte Carlo. Simulation Methods. Leonid Kogan. MIT, Sloan , Fall 2010

Math 151. Rumbos Spring Solutions to Assignment #22

Exact Nonparametric Tests for Comparing Means - A Personal Summary

6.2 Permutations continued

Math 425 (Fall 08) Solutions Midterm 2 November 6, 2008

Lecture Notes on Elasticity of Substitution

Lecture 13 - Basic Number Theory.

Chapter 2. Hypothesis testing in one population

Chapter 6: Point Estimation. Fall Probability & Statistics

Lecture 2: Universality

**BEGINNING OF EXAMINATION** The annual number of claims for an insured has probability function: , 0 < q < 1.

UNIT I: RANDOM VARIABLES PART- A -TWO MARKS

Hypothesis Testing for Beginners

1 if 1 x 0 1 if 0 x 1

Mathematical Induction

1.1 Introduction, and Review of Probability Theory Random Variable, Range, Types of Random Variables CDF, PDF, Quantiles...

Chapter 3: DISCRETE RANDOM VARIABLES AND PROBABILITY DISTRIBUTIONS. Part 3: Discrete Uniform Distribution Binomial Distribution

Cartesian Products and Relations

Lecture 13: Martingales

ECE302 Spring 2006 HW5 Solutions February 21,

Cardinality. The set of all finite strings over the alphabet of lowercase letters is countable. The set of real numbers R is an uncountable set.

2WB05 Simulation Lecture 8: Generating random variables

For a partition B 1,..., B n, where B i B j = for i. A = (A B 1 ) (A B 2 ),..., (A B n ) and thus. P (A) = P (A B i ) = P (A B i )P (B i )

Lecture 6: Discrete & Continuous Probability and Random Variables

CHI-SQUARE: TESTING FOR GOODNESS OF FIT

Chapter 7 - Roots, Radicals, and Complex Numbers

Tests for Two Proportions

Introduction to Hypothesis Testing

IEOR 6711: Stochastic Models I Fall 2012, Professor Whitt, Tuesday, September 11 Normal Approximations and the Central Limit Theorem

Point Biserial Correlation Tests

Valuation of Lease Contracts In Continuous Time With Stochastic Asset Values

Note on some explicit formulae for twin prime counting function

Lecture 22: November 10

Non-Inferiority Tests for Two Proportions

Solutions for Practice problems on proofs

THE BANACH CONTRACTION PRINCIPLE. Contents

Math 370, Actuarial Problemsolving Spring 2008 A.J. Hildebrand. Practice Test, 1/28/2008 (with solutions)

INDISTINGUISHABILITY OF ABSOLUTELY CONTINUOUS AND SINGULAR DISTRIBUTIONS

Metric Spaces. Chapter Metrics

Section 4.4 Inner Product Spaces

BNG 202 Biomechanics Lab. Descriptive statistics and probability distributions I

Page 331, 38.4 Suppose a is a positive integer and p is a prime. Prove that p a if and only if the prime factorization of a contains p.

VISUALIZATION OF DENSITY FUNCTIONS WITH GEOGEBRA

Lecture 8: More Continuous Random Variables

Notes on Complexity Theory Last updated: August, Lecture 1

Notes on Continuous Random Variables

Tests for Two Survival Curves Using Cox s Proportional Hazards Model

X X X a) perfect linear correlation b) no correlation c) positive correlation (r = 1) (r = 0) (0 < r < 1)

Lecture 4: BK inequality 27th August and 6th September, 2007

6. Define log(z) so that π < I log(z) π. Discuss the identities e log(z) = z and log(e w ) = w.

. (3.3) n Note that supremum (3.2) must occur at one of the observed values x i or to the left of x i.

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.436J/15.085J Fall 2008 Lecture 5 9/17/2008 RANDOM VARIABLES

ON SEQUENTIAL CONTINUITY OF COMPOSITION MAPPING. 0. Introduction

88 CHAPTER 2. VECTOR FUNCTIONS. . First, we need to compute T (s). a By definition, r (s) T (s) = 1 a sin s a. sin s a, cos s a

Inference on the parameters of the Weibull distribution using records

Finite dimensional topological vector spaces

Math 370/408, Spring 2008 Prof. A.J. Hildebrand. Actuarial Exam Practice Problem Set 3 Solutions

Non Parametric Inference

E3: PROBABILITY AND STATISTICS lecture notes

Unit 26 Estimation with Confidence Intervals

The Exponential Distribution

Math 370, Spring 2008 Prof. A.J. Hildebrand. Practice Test 2 Solutions

Transcription:

Chapter 9 Testing Hypothesis and Assesing Goodness of Fit 9.1-9.3 The Neyman-Pearson Paradigm and Neyman Pearson Lemma We begin by reviewing some basic terms in hypothesis testing. Let H 0 denote the null hypothesis to be tested against the alternative hypothesis H A. Definition: Let C be the subset of the sample space which leads to rejection of the hypothesis under consideration. C is called the critical region or rejection region of the test. Based on our analysis of the data and resulting conclusion, there are two types of errors that can be made: type I error is where the null hypothesis is rejected when it is, in fact, true. This is usually denoted by α and called the significance level of the test. type II error is where the null hypothesis is accepted (or not rejected) when it is, in fact, false.

The probability tha H 0 is rejected when it is false is called the power of the test and is denoted by 1 β. We will first be concerned with testing a simple null hypothesis H 0 : θ = θ against a simple alternative hypothesis H A : θ = θ. Definition: Let C denote a subset of the sample space. Then C is called a best critical region of size α for testing H 0 : θ = θ against H A : θ = θ if, for every subset A of the sample space for which P [A : H 0 ] = α, the following are true (a) P [C; H 0 ] = α and (b) P [C; H 1 ] P [A; H 1 ].

Example 1 Consider a random variable X that has a binomial distribution with n = 5 and p = θ. We wish to test H 0 : θ = 1/2 versus H A : θ = 3/4, at significance level α = 1/32. X f(x;1/2) f(x;3/4) f(x;12)/f(x;34) 0 1/32 1/1024 32 1 5/32 15/1024 32/3 2 10/32 90/1024 32/9 3 10/32 270/1024 32/27 4 5/32 405/1024 32/81 5 1/32 243/1024 32/243 To satisfy the significance level α = 1/32 we select the critical region as either A 1 = {x : x = 0} or A 2 = {x : x = 5}. In fact, these are the only two possible ways of selecting a critical region that has probability no greater than 1/32 under the null hypothesis H 0 : θ = 1/2. Thus, at least one of them must be a best critical region. However, note that P [A 1 ; H A ] = 1 1024 < 243 1024 = P [A 2; H A ] Thus, A 2 is the unique best critical region.

Lemma (Neyman-Pearson): Let X 1, X 2,..., X n, where n is a fixed positive integer, denote a random sample from a distribution with pdf f(x; θ). Then the joint pdf of X 1, X 2,..., X n is L(θ; x 1,..., x n ) = f(x 1 ; θ)f(x 2 ; θ) f(x n ; θ). Let θ and θ be distinct values of the parameter space Ω = {θ : θ = θ, θ }, and let k be a positive number. Define C as the subset of the sample space such that (a) L(θ ;x 1,...,x n ) L(θ ;x 1,...,x n ) k, for (x 1,..., x n ) C. (b) L(θ ;x 1,...,x n ) L(θ ;x 1,...,x n ) > k, for (x 1,..., x n ) C. (c) α = P [C; H 0 ]. Then C is a best critical region of size α for testing the simple null hypothesis H 0 : θ = θ against the simple alternative hypothesis H A : θ = θ.

Example 2: Let X 1, X 2,..., X n denote a random sample from a distribution with pdf f(x; θ) = 1 (x θ)2 exp 2π 2 < x <. We wish to test H 0 : θ = 0 versus H 1 : θ = 1. L(0; x 1,..., x n ) L(1; x 1,..., x n ) = (1/ [ 2π) n exp (1/ 2π) n exp n i=1 x 2 ] i 2 [ n i=1 (x i 1) 2 For k > 0 the set of points (x 1,..., x n ) satisfying exp n 2 n i=1 2 x i k ] = exp is a best critical region. Notice that this inequality is equivalent to n 2 n i=1 x i or n 2 n x i ln(k) i=1

n i=1 x i n 2 ln(k) = c. Given a particular significance level α, we can find c. Note that under H 0, X i has a N(0, n) distribution. Under H 0, α = P [ X i c] = P [Z c/ n] = 1 P [Z c/ n] where Z is a standard normal random variable. Thus, and P [Z c/ n] = 1 α c = nφ 1 (1 α) where Φ denotes the cdf of a standard normal distribution.

Now, we will extend the notion of best critical regions (most powerful tests), to the case where the alternative hypothesis is a composite hypothesis. Example 3: Suppose we know that the distribution of a random variablex has the form f(x; θ) = 1 θ e x/θ for 0 < x <, where the parameter space Ω = {θ : θ 2}. We wish to test H 0 : θ = 2 versus H A : θ > 2. We can see that X has an exponential distribution or equivalently a Gamma distribution with α = 1 and β = θ. Thus, E[X; θ] = θ. Based on this, we would have evidence in favor of H A for large values of the sample mean or X 1 + X 2. We will take a random sample X 1, X 2 of size 2 and test H 0 versus H A according to the critical region C = {(x 1, x 2 ) : 9.5 x 1 + x 2 < }.

Find the significance level of this test. significance level (size)=p [C; θ = 2]. When H 0 is true, the joint pdf of X 1 and X 2 is f(x 1, x 2 ; θ = 2) = f(x 1 ; 2)f(x 2 ; 2) = 1 4 e (x 1+x 2 )/2 and P [C; θ = 2] = 1 P [C ; θ = 2] = 1 9.5 0 9.5 x 2 0 1 4 e (x 1+x 2 )/2 dx 1 dx 2 0.05. In fact, using the previous theory, we can see that for any θ > 2, C is a best critical region of size 0.05 for testing the simple hypothesis H 0 : θ = θ = 2, versus the simple alternative H A : θ = θ. [ ] x 1 +x 2 2 L(2; x 1, x 2 ) L(θ ; x 1, x 2 ) + (1/2)2 exp (1/θ ) 2 exp [ x ] 1+x 2 k θ

This implies 2 ln(1/2) 2 ln(1/θ ) + (x 1 + x 2 )( 1/2 + 1/θ ) Equivalently, ln(k) (x 1 + x 2 ) [ln(k) 2 ln(1/2) + 2 ln(1/θ )]/( 1/2 + 1/θ ) = c Thus, we would reject H 0 when x 1 + x 2 is greater than c. For a given significance level, we solve for c to find the best critical region C. For example if, α = 0.05, we see that c = 9.5 is the correct choice, no matter the value of θ > 2. Because this is true for all θ, it is also the best critical region of size 0.05, when the alternative hypothesis is a composite hypothesis H A : θ > 2.

Let K(θ) = P [C; θ] denote the power function. K(θ) = 1 9.5 0 = 9.5 x 2 0 θ + 9.5 θ 1 θ 2e (x 1+x 2 )/θ dx 1 dx 2 e 9.5/θ

Figure 1: Plot of K(θ) versus θ power 0.0 0.2 0.4 0.6 0.8 1.0 5 10 15 20 25 theta

Definition: The critical region C is a uniformly most powerful critical region of size α for testing a simple null hypothesis H 0 against a composite alternative hypothesis H A if C is a best critical region of size α for testing H 0 against each simple hypothesis in H A. Definition: A test defined by a uniformly most powerful critical region is called a uniformly most powerful test, with significance level α, for testing the simple null hypothesis H 0 against the composite alternative hypothesis H A. Uniformly most powerful tests do not always exist. However, when they do, using the Neyman-Pearson Lemma as in the previous example is a useful method for finding them.