Extending Hypothesis Testing. p-values & confidence intervals

Similar documents
p ˆ (sample mean and sample

Online 12 - Sections 9.1 and 9.2-Doug Ensley

Hypothesis testing - Steps

Chapter 7 Notes - Inference for Single Samples. You know already for a large sample, you can invoke the CLT so:

Hypothesis testing. c 2014, Jeffrey S. Simonoff 1

Point and Interval Estimates

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Mind on Statistics. Chapter 12

Two-Sample T-Tests Assuming Equal Variance (Enter Means)

Introduction to Hypothesis Testing

Simple Linear Regression Inference

Two-Sample T-Tests Allowing Unequal Variance (Enter Difference)

Stat 411/511 THE RANDOMIZATION TEST. Charlotte Wickham. stat511.cwick.co.nz. Oct

Lecture Notes Module 1

" Y. Notation and Equations for Regression Lecture 11/4. Notation:

Chapter 8 Hypothesis Testing Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing

Comparing Means in Two Populations

BA 275 Review Problems - Week 6 (10/30/06-11/3/06) CD Lessons: 53, 54, 55, 56 Textbook: pp , ,

BA 275 Review Problems - Week 5 (10/23/06-10/27/06) CD Lessons: 48, 49, 50, 51, 52 Textbook: pp

HYPOTHESIS TESTING: POWER OF THE TEST

Descriptive Statistics

Unit 26 Estimation with Confidence Intervals

1 Hypothesis Testing. H 0 : population parameter = hypothesized value:

Experimental Design. Power and Sample Size Determination. Proportions. Proportions. Confidence Interval for p. The Binomial Test

Analysis of Variance ANOVA

Chapter 2. Hypothesis testing in one population

Hypothesis Testing for Beginners

Study Guide for the Final Exam

3.4 Statistical inference for 2 populations based on two samples

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

Introduction to Hypothesis Testing OPRE 6301

Chicago Booth BUSINESS STATISTICS Final Exam Fall 2011

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression

Testing Hypotheses About Proportions

5/31/2013. Chapter 8 Hypothesis Testing. Hypothesis Testing. Hypothesis Testing. Outline. Objectives. Objectives

WISE Power Tutorial All Exercises

C. The null hypothesis is not rejected when the alternative hypothesis is true. A. population parameters.

Two-sample inference: Continuous data

Multinomial and Ordinal Logistic Regression

A POPULATION MEAN, CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

Estimation of σ 2, the variance of ɛ

Non-Parametric Tests (I)

Section 13, Part 1 ANOVA. Analysis Of Variance

CONTENTS OF DAY 2. II. Why Random Sampling is Important 9 A myth, an urban legend, and the real reason NOTES FOR SUMMER STATISTICS INSTITUTE COURSE

Introduction to Hypothesis Testing. Hypothesis Testing. Step 1: State the Hypotheses

Principles of Hypothesis Testing for Public Health

MONT 107N Understanding Randomness Solutions For Final Examination May 11, 2010

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

Nonlinear Regression Functions. SW Ch 8 1/54/

General Method: Difference of Means. 3. Calculate df: either Welch-Satterthwaite formula or simpler df = min(n 1, n 2 ) 1.

2 Precision-based sample size calculations

Sample Size Planning, Calculation, and Justification

Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means

Final Exam Practice Problem Answers

Calculating P-Values. Parkland College. Isela Guerra Parkland College. Recommended Citation

Statistics I for QBIC. Contents and Objectives. Chapters 1 7. Revised: August 2013

Paired 2 Sample t-test

Correlational Research

Comparing Two Groups. Standard Error of ȳ 1 ȳ 2. Setting. Two Independent Samples

A) B) C) D)

Introduction. Hypothesis Testing. Hypothesis Testing. Significance Testing

Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1)

An Introduction to Statistics Course (ECOE 1302) Spring Semester 2011 Chapter 10- TWO-SAMPLE TESTS

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...

Simple Regression Theory II 2010 Samuel L. Baker

Premaster Statistics Tutorial 4 Full solutions

Two-sample hypothesis testing, II /16/2004

Math 251, Review Questions for Test 3 Rough Answers

2 Sample t-test (unequal sample sizes and unequal variances)

SIMPLE LINEAR CORRELATION. r can range from -1 to 1, and is independent of units of measurement. Correlation can be done on two dependent variables.

Confidence Intervals for the Difference Between Two Means

NCSS Statistical Software

Introduction to. Hypothesis Testing CHAPTER LEARNING OBJECTIVES. 1 Identify the four steps of hypothesis testing.

12.5: CHI-SQUARE GOODNESS OF FIT TESTS

Regression Analysis: A Complete Example

Two Correlated Proportions (McNemar Test)

Week 4: Standard Error and Confidence Intervals

Chapter 3 RANDOM VARIATE GENERATION

1.5 Oneway Analysis of Variance

Chi-square test Fisher s Exact test

August 2012 EXAMINATIONS Solution Part I

Confidence Intervals for Spearman s Rank Correlation

Understanding Confidence Intervals and Hypothesis Testing Using Excel Data Table Simulation

NCSS Statistical Software. One-Sample T-Test

CALCULATIONS & STATISTICS

Practice problems for Homework 12 - confidence intervals and hypothesis testing. Open the Homework Assignment 12 and solve the problems.

Review. March 21, S7.1 2_3 Estimating a Population Proportion. Chapter 7 Estimates and Sample Sizes. Test 2 (Chapters 4, 5, & 6) Results

Hypothesis Testing --- One Mean

How To Test For Significance On A Data Set

Crosstabulation & Chi Square

Lesson 9 Hypothesis Testing

Stats Review Chapters 9-10

Review #2. Statistics

The Variability of P-Values. Summary

Lesson Outline Outline

Pearson's Correlation Tests

Confidence Intervals

Stats for Strategy Fall 2012 First-Discussion Handout: Stats Using Calculators and MINITAB

Transcription:

Extending Hypothesis Testing p-values & confidence intervals

So far: how to state a question in the form of two hypotheses (null and alternative), how to assess the data, how to answer the question by using a statistic and an associated measure of the probability of observing our statistic, given the current state or null hypothesis.

Next: We will use the p-value to: make inferences about the population assign a level of confidence

Review the Steps Phase 1: State the Question 1. Evaluate and describe the data 2. Review assumptions 3. State the question-in the form of hypotheses Phase 2: Decide How to Answer the Question 4. Decide on a summary number-a statistic-that reflects the question 5. How could random variation affect that statistic? 6. State a decision rule, using the statistic, to answer the question

Detailed Steps (cont) Phase 3: Answer the Question 7. Calculate the statistic 8. Make a statistical decision 9. State the substantive conclusion Phase 4: Communicate the Answer to the Question 10. Document our understanding with text, tables, or figures

Clarify & Generalize the steps Step 2: Assumptions: Representative: Is the observed data representative of the population? Independence: Are the observations (responses of interest) independent? Size: Is the size of the sample large enough to make generalizations to the population at large?

Size Assumption So, how large is large enough? Rule-of-thumb: N large enough to expect to see five of each of the two outcomes Both of the following must be true: p 0 n > 5 (1 p 0 ) n > 5

In the CPR study p 0 = 0.06 and n = 278, so: p 0 n = 0.06 278 = 16.68 > 5 (1 p 0 ) n = (1 0.06) 278 = 261.31 > 5

Common Mistakes Using the observed proportion rather than the hypothesized proportion Compare the observed number of events of interest to five Why Not? We always operate under the assumption that the null hypothesis is true so use the null proportion!

Step 2 is particularly important: If the data do not meet the assumptions, then the statistical tests applied to test the hypothesis will not be valid Only proceed to steps 3 10 if the assumptions are met

Step 4 For the CPR example, we used a specific statistic, the proportion p The statistic and decision rules can be more generally defined and applied to all situations for testing a proportion.

Review CPR Simulation H0: The population survival proportion is 0.06 or less if the observed proportion p 0.083 (x = 23 survivors or less). HA: The population survival proportion is larger than 0.06 if the observed proportion p > 0.086 (x = 24 or more survivors).

Recall for the CPR simulations, the results looked similar to a normal distribution

Applied vs. Theoretical The smooth curve is the theoretical distribution of a normal curve under the null hypothesis Centered on the population value (p0 = 0.06) with proportions farther away from this center being less likely to occur Use the theoretical distribution to determine if our observed proportion is different from our assumed proportion

General Test Statistic observed p - hypothesized p standard error of the hypothesized p z = pˆ p0 p0 0 n ( 1 p )

observed proportion assumed proportion z = standard error of p 0 p pˆ p0 ( 1 ) 0 p0 n

Why p 0? Calculate the test statistic under the assumption that the null hypothesis is true. We are not concerned about how the variability of the observed data will affect our hypothesis testing result We believe the null hypothesis and the variability in the observed data should be assumed to be the same as the variability under the null hypothesis.

Z-score Using the z-score allows us to use a decision rule based on the standard normal distribution, rather than the proportion, p. The standard normal distribution ~N(0,1) The cut-off for the decision rule does not change for different values of p, n, and p 0.

For an a = 0.05, the z value is 1.645, ( 5% of the N(0,1) values are greater than 1.645)

General Decision Rule H 0 : proportion p p 0. Choose this if z z critical and p-value α. H A : proportion p > p 0. Choose this if z > z critical and p-value < α.

Clarify Steps w/ CPR Example 1. Evaluate and describe the data We observed n = 278 CPR patients who received instructions by phone, of whom x = 29 survived to hospital discharge. The characteristic of interest is survival proportion, p = 29/278 = 0.104. The intent is to compare the outcomes in this study to a = 0.06 survival rate presumed to be typical.

2. Review assumptions There are three assumptions: Representativeness: From the design of the study, it is clear that subjects are representative of cardiacarrest victims in cities with a quick-response emergency system. Independence: The response of one cardiac-arrest victim does not depend on the response of others. The subjects are independent. Sufficient size: Since, n = 0.06 278 = 16.68 > 5, and (1 ) n = (1 0.06) 278 = 261.31 > 5, this assumption is valid.

3. State the question in the form of hypotheses The intent is to show that phone-cpr is superior to doing nothing. Thus, the alternative hypothesis is that there are higher than 6% survival rates: H 0 : p 0.06 H A : p > 0.06.

4. Decide on a summary number a statistic that reflects the question We ll use the z-score: z = pˆ p0 ( 1 p ) p0 0 n

5. How could random variation affect that statistic? If the null hypothesis is true, then z is zero. Since the assumptions are met, z is normally distributed. Large values of z reflect higher survival proportions and thus favor the alternative hypothesis.

6. State a decision rule, using the statistic, to answer the question General Choose to believe (at α = 0.05): H 0 : Choose this if p-value α H A : Choose this if p-value < α For CPR Example, for an α = 0.05: H 0 : p 0.06 Choose this if p-value 0.05 H A : p > 0.06. Choose this if p-value < 0.05

7. Calculate the statistic z pˆ p0 0.104 0.06 = = p0 0 n 278 ( 1 p ) 0.06( 1 0.06) 0.044 = = 0.0142 3.09 Recall that a z-value to the right of 3 is unlikely. In fact, the associated p-value is p = 0.0010 (we ll talk about calculating p-values later).

8. Make a statistical decision Reject the null hypothesis since p-value < 0.05. The observed value of the summary statistic is larger than what is expected by chance alone.

9. State the substantive conclusion We conclude that the survival proportion is larger than 0.06.

10. Document our understanding with text, tables, or figures Does dispatcher-instructed bystander-administered CPR improve the chances of survival? Without this intervention it is presumed that the survival probability will be unchanged (at 6%). From this study, which used n = 278 patients, we observed p = 0.1040 (x = 29 survived until hospital discharge). The observed rate was compared to the hypothesized rate using the z test statistic. We reject the hypothesis p 0.06 in favor of the alternative hypothesis that the survival probability is larger than 6% (z = 3.09, p-value = 0.0010).

Universal Decision Rule H 0 : null-hypothesis. Choose this if p-value α (usually 0.05). H A : alternative-hypothesis. Choose this if p-value < α (usually 0.05).

How do we determine p-values? p-values can be determined from standard normal tables, such as Table A.1 in the Statistical Sleuth.,715 Tedious and you need to be careful what the table gives as the proportion it could be the opposite of what you are looking for! Use a calculator

Calculation note: Software might return a p-value as 0 or 0.000 not possible Determine the number of decimal places the calculator reports (when it will return a 0 value) Then report p < 0.001 or p < 0.0001

Confidence Intervals Often, researchers want to use a less rigid approach to hypothesis testing by estimating the parameter and placing upper and lower bounds (or limits) on the estimate. The interval is called a confidence interval.

The confidence interval approach allows us to make statements about a population parameter without referring to hypotheses Also gives a range of values that reflects our degree of certainty.

Definitions Inference: An inference is a conclusion that patterns observed in the data are present in the broader population. Statistical Inference: A statistical inference is an inference justified by a probability model (distribution) linking the data to the broader population. Parameter: A parameter is an unknown numerical value describing a feature of a distribution.

More Definitions Statistic: A statistic is any value that can be calculated from the observed data. Estimate: An estimate is a statistic used as a guess (or estimate) of a parameter.

General Definition estimate ± (reliability coefficient) (standard error) Estimating a parameter with an interval involves three components: The point estimate. The standard error of the estimate. This describes how much variability we expect. A reliability coefficient. This describes our degree of certainty.

Estimate Calculate the observed proportion: p = x / n In the CPR case p = 0.104.

Standard Error The standard error we use here is different from that used in hypothesis testing. Recall that earlier we were in the mind-set of hypothesis testing. Here we are not doing hypothesis testing here. We re just estimating a confidence interval based upon the observed data

Standard Error of p-hat SE p ˆ = pˆ ( 1 pˆ) n Note that the standard error of the estimate gets smaller as n gets larger. We expect less variability in an estimate if we use more data to make the estimate.

CPR Example For n = 278 and = 0.104, the associated standard error is: SE p ˆ pˆ ˆ = = n ( 1 p) 0.104( 1 0.104) 278 = 0.0183

Reliability coefficient The reliability coefficient reflects how sure we want to be: 95% sure 90% sure 99% sure Based on the standard normal for those proportions

Reliability Coefficients Commonly Used For 90% confidence, use z = 1.645. For 95% confidence, use z = 1.96. For 99% confidence, use z = 2.575.

Confidence Interval pˆ z SE p ˆ ± ( 1 α 2) ( ) 0.104 ± 1.96 0.0183 ( 0.068, 0.140)

Using a sentence: In the first case study there were 29 survivors (out of n = 278 studied) yielding a 95% confidence interval on the population survival proportion of [0.068, 0.140]. That is, We re 95% confident that the survival proportion is between 0.068 and 0.140.

Is there a 100% CI? Yes, it is [0,1] But this is a silly answer and doesn t make a conclusive statement about the population estimate. This is the same for all proportions!

Using the 10 Steps The Changes to the 10 Steps are minimal: 3. State the question (CI) 4. Decide on a summary statistic that reflects the question (CI formula). 5. How could random variation affect that statistic? (If the assumptions are met, then this interval will cover the population proportion 95% of the time )

6. Determine the reliability coefficient and standard error to be used in the CI 7. Calculate the interval 8. Compare the interval to comparison value (If there is a comparison value, does the interval include it?)

9. State the substantive conclusion: Something like: We estimate the population proportion of to be [lower, upper] with 95% confidence perhaps which does not include the hypothesized value of. 10. Document our understanding with text

Summary We have looked at several methods to assess and describe data and underlying populations. We can use simulations, z-scores, p-values, or confidence intervals about an estimate to make conclusions about observed data and broader populations. Next, we ll look at sample size and precision of estimates and the design of a study to estimate population proportions.