T-test in SPSS Hypothesis tests of proportions Confidence Intervals (End of chapter 6 material)

Similar documents
THE FIRST SET OF EXAMPLES USE SUMMARY DATA... EXAMPLE 7.2, PAGE 227 DESCRIBES A PROBLEM AND A HYPOTHESIS TEST IS PERFORMED IN EXAMPLE 7.

Two Related Samples t Test

Odds ratio, Odds ratio test for independence, chi-squared statistic.

Hypothesis testing - Steps

t Tests in Excel The Excel Statistical Master By Mark Harmon Copyright 2011 Mark Harmon

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

Independent t- Test (Comparing Two Means)

Chapter 8 Hypothesis Testing Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing

Section 7.1. Introduction to Hypothesis Testing. Schrodinger s cat quantum mechanics thought experiment (1935)

Chapter 2 Probability Topics SPSS T tests

C. The null hypothesis is not rejected when the alternative hypothesis is true. A. population parameters.

An SPSS companion book. Basic Practice of Statistics

Chapter Study Guide. Chapter 11 Confidence Intervals and Hypothesis Testing for Means

Introduction to Hypothesis Testing. Hypothesis Testing. Step 1: State the Hypotheses

Two-sample hypothesis testing, II /16/2004

Two-sample t-tests. - Independent samples - Pooled standard devation - The equal variance assumption

Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means

Opgaven Onderzoeksmethoden, Onderdeel Statistiek

Testing a claim about a population mean

HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION

HYPOTHESIS TESTING WITH SPSS:

An Introduction to Statistics Course (ECOE 1302) Spring Semester 2011 Chapter 10- TWO-SAMPLE TESTS

Hypothesis testing. c 2014, Jeffrey S. Simonoff 1

How To Test For Significance On A Data Set

Mind on Statistics. Chapter 12

The Dummy s Guide to Data Analysis Using SPSS

Lesson 9 Hypothesis Testing

Good luck! BUSINESS STATISTICS FINAL EXAM INSTRUCTIONS. Name:

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...

Introduction to Hypothesis Testing

KSTAT MINI-MANUAL. Decision Sciences 434 Kellogg Graduate School of Management

Psychology 60 Fall 2013 Practice Exam Actual Exam: Next Monday. Good luck!

Independent samples t-test. Dr. Tom Pierce Radford University

6: Introduction to Hypothesis Testing

Analysis of Variance ANOVA

Two-Sample T-Tests Assuming Equal Variance (Enter Means)

CHAPTER 14 NONPARAMETRIC TESTS

Stat 411/511 THE RANDOMIZATION TEST. Charlotte Wickham. stat511.cwick.co.nz. Oct

Comparing Two Groups. Standard Error of ȳ 1 ȳ 2. Setting. Two Independent Samples

General Method: Difference of Means. 3. Calculate df: either Welch-Satterthwaite formula or simpler df = min(n 1, n 2 ) 1.

HYPOTHESIS TESTING: POWER OF THE TEST

1 Hypothesis Testing. H 0 : population parameter = hypothesized value:

Experimental Design. Power and Sample Size Determination. Proportions. Proportions. Confidence Interval for p. The Binomial Test

Introduction. Hypothesis Testing. Hypothesis Testing. Significance Testing

Using Microsoft Excel to Analyze Data from the Disk Diffusion Assay

7. Comparing Means Using t-tests.

Using Microsoft Excel to Analyze Data

Testing for differences I exercises with SPSS

22. HYPOTHESIS TESTING

Chapter 26: Tests of Significance

Chapter 2. Hypothesis testing in one population

DDBA 8438: Introduction to Hypothesis Testing Video Podcast Transcript

Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1)

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...

Study Guide for the Final Exam

Chapter 5 Analysis of variance SPSS Analysis of variance

Descriptive Statistics

Chapter 8: Hypothesis Testing for One Population Mean, Variance, and Proportion

Data Analysis Tools. Tools for Summarizing Data

WISE Power Tutorial All Exercises

Two-Sample T-Tests Allowing Unequal Variance (Enter Difference)

Week 4: Standard Error and Confidence Intervals

Chapter 7 Section 7.1: Inference for the Mean of a Population

3.4 Statistical inference for 2 populations based on two samples

Comparing Means in Two Populations

SPSS Explore procedure

Formula for linear models. Prediction, extrapolation, significance test against zero slope.

SCHOOL OF HEALTH AND HUMAN SCIENCES DON T FORGET TO RECODE YOUR MISSING VALUES

1.5 Oneway Analysis of Variance

Chapter 9. Two-Sample Tests. Effect Sizes and Power Paired t Test Calculation

Pearson's Correlation Tests

Name: Date: Use the following to answer questions 3-4:

Introduction to Hypothesis Testing OPRE 6301

QUANTITATIVE METHODS BIOLOGY FINAL HONOUR SCHOOL NON-PARAMETRIC TESTS

Tutorial 5: Hypothesis Testing

Hypothesis Test for Mean Using Given Data (Standard Deviation Known-z-test)

1 Why is multiple testing a problem?

Describing Populations Statistically: The Mean, Variance, and Standard Deviation

Lesson 7 Z-Scores and Probability

A full analysis example Multiple correlations Partial correlations

Simple Regression Theory II 2010 Samuel L. Baker

SPSS/Excel Workshop 3 Summer Semester, 2010

This chapter discusses some of the basic concepts in inferential statistics.

Testing Group Differences using T-tests, ANOVA, and Nonparametric Measures

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

Comparison of frequentist and Bayesian inference. Class 20, 18.05, Spring 2014 Jeremy Orloff and Jonathan Bloom

Hypothesis Testing. Reminder of Inferential Statistics. Hypothesis Testing: Introduction

Regression Analysis: A Complete Example

NCSS Statistical Software

Tests for One Proportion

Lecture Notes Module 1

We are often interested in the relationship between two variables. Do people with more years of full-time education earn higher salaries?

Paired T-Test. Chapter 208. Introduction. Technical Details. Research Questions

Point Biserial Correlation Tests

Introduction to. Hypothesis Testing CHAPTER LEARNING OBJECTIVES. 1 Identify the four steps of hypothesis testing.

Statistics 2014 Scoring Guidelines

p ˆ (sample mean and sample

Hypothesis Testing: Two Means, Paired Data, Two Proportions

Transcription:

T-test in SPSS Hypothesis tests of proportions Confidence Intervals (End of chapter 6 material)

Definition of p-value: The probability of getting evidence as strong as you did assuming that the null hypothesis is true. A smaller p-value means that it s less likely you would get a sample like this if the null hypothesis were true. A smaller p-value means stronger evidence against that null hypothesis.

Definition of alpha, the level of significance: The highest acceptable p-value that we will use to reject the null hypothesis. The default alpha is 0.05. A smaller alpha means less of a chance of falsely rejecting the null. (Also called a Type I error) A smaller alpha means we want to be more certain about something before rejecting the null.

If the p-value is smaller than the alpha, we reject the null hypothesis. (Enough evidence to reject) If the p-value is larger than the null, we fail to reject the null hypothesis. (Not yet enough evidence to reject)

SPSS Example: Milk dataset. In the milk data set, good milk had a calcium level of 20 mg/l. Bad milk, for our cases, will have something other than 20. Step 1: Load the milk dataset. First we re going to try goodcalc4, which is a sample of 25 bottles.

Our null hypothesis is that the milk is good (calcium is 20mg/L) The alternative is that the milk has a different calcium value. We re comparing the mean of one sample to a specific value (20), so we ll go to Analyze Compare Means One-Sample T Test

Choose good sample, n=25, 3 rd. Move this variable to the right by dragging or using the move arrow. Then, change the test value to 20. (The hypothesized mean)

Click OK You ll get two tables. The first is a data summary like what we ve seen before.

The sample size is 25 The sample mean is 18.79 The sample standard deviation is 5.41115 The standard error of the mean is 1.082 (t-score = -1.12 if we were doing this by hand.) The second table is the result of the t-test

Sig. (2-tailed) is the p-value for a two sided test. We re doing a two-sided test (not equal to 20), so this works. p-value =.275 >.050 (default level of significance) So we fail to reject the null.

Let s try some bad milk, as found in lowclac2, and this time let s say we don t care if the milk has too much calcium.

Another possibility is to include the upper area in the null hypothesis. The analysis is exactly the same. (Both ways acceptable)

(SPSS steps are the same, but with bad sample, n=25) SPSS gives us the two-sided significance, but we only want one side. By symmetry, we use half of two-sided p-value to get the one-sided value. p-value =.016 / 2 =.008 <.05.

We reject the null hypothesis. The milk is bad. Two-tailed means two-sided. The tails are terms for the ends of a distribution.

Having two tails isn t normal.

But in statistics, the Normal has two tails. Everything we ve done so far with means (z-scores, t-tests) we can do with proportions as well. The only difference is in the standard error. P is the sample proportion.

For proportions we always use the z-score (and not the t- table). is population proportion.

For proportions we always use the z-score (and not the t- table). Z is still the number of standard deviations about the mean.

Example: ICBC (Insurance Corporation of British Columbia) is concerned about people not wearing seatbelts in cars. If fewer than 90% of drivers are wearing seatbelts, ICBC will start a campaign to pick that number back up.

They surveyed 500 people (by hidden camera to prevent response bias) and find 461 wear seatbelts. First, get the sample proportion P, and its standard error. P = 461 / 500 = 0.922

Next, get the z-score. We want the chance of getting a sample of this proportion or less if the true proportion is.90. This area will include the mean, so we ll use area between and then add 50%. Total Area = 96.78%, or.9678

Why look at area less than z=1.85? The alternative hypothesis was that seatbeat usage is less than.900.. We found seatbeat usage was actually HIGHER than.900, so there should be very little evidence against the null. Area less than 1.85:.9678 P-value is.9678 >.05 (default level of significance), So we fail to reject the null. Real life terms: No seatbeat intervention is necessary.

Always wear your seatbelt, you never know who s on the road.

Confidence intervals From our sample, the proportion of people using their seatbelts was P = 0.922. It would be dishonest to say that the true proportion that uses seatbelts is = 0.922. But that s our best guess from our sample, so it would be even more dishonest to say it s any other value either.

Instead of a single value, it would be a lot more honest to give an interval that captured most of that uncertainty of the value and say We re 95% confident that the true parameter (mean, proportion, whatever) is in this interval. The interval we gave would be the 95% confidence interval. (90%, 99% or other values are possible, but 95% is default, just like 5% significance.)

The standard confidence interval is centered about the sample proportion. means plus or minus. To get the upper end of the interval, use the plus. To get the lower end of the interval, use the minus.

is the true proportion and P is the sample proportion as before.

is the standard error. In this case of the proportion.

Z* Stands for the critical z-score. For a 95% interval, it s whatever z-score would put 95% in the middle. (Or 2.5% on either side)

Two ways to find z*. 1. Look for the appropriate area beyond in the z-table. 2. Look at the bottom of the t-table for the appropriate significance at df. (One-tailed) For 95%, the area beyond is 2.5%. Both ways will give you a critical z-score z* = 1.96.

In the case the seatbelts, the 95% confidence interval would be:

=.922 (1.96)(.0119).922 (1.96)(.0119) =.922-0.023 =.899.922 + (1.96)(.0119) =.922 + 0.023 =.945 The 95% confidence interval is (.899 to.945) Or.922.023, 19 times out of 20.

Subtle note: This doesn t mean there s 95% chance that the true proportion is in the interval (.899 to.945). is a fixed value, it s either in there or it isn t. We ve set an interval that has a 95% chance to contain the parameter.

Each blue vertical line is a confidence interval. The red dotted line horizontally across them represents the parameter value. Most blue lines cross the red line (include the parameter), but not all of them.

Confidence interval: Milk example. Find the 95% confidence interval of the calcium level in the good milk. From SPSS, we know the sample mean is 18.79 and that the standard error of the mean is 1.082

The milk example uses a t* critical, because we re using the t- distribution. In t-table: One-tailed,.025 significance, df=24, the critical value is: 2.064. (.050 sig. in two-tailed gives the same value)

We re 95% confidence that the true calcium level is between... 18.79 (2.064)(1.082) = 18.79 2.23 =16.56 18.79 + (2.064)(1.082) = 18.79 + 2.23 = 21.02 is between 16.56 and 21.02. Since the hypothesized value of 20 is within the confidence interval, 20 is a plausible value for the parameter. Just as before, we fail to reject the null hypothesis.

We can use the confidence intervals to do two-sided hypothesis tests as well. By-hand confidence interval rule: If the confidence interval contains the value given in the null hypothesis, we fail to reject. Otherwise, reject. 16.56 to 21.02 contains 20, so we fail to reject

In SPSS, doing a one-sample t-test will automatically give you a confidence interval as well (defaults to 95%). The values given in SPSS are in relation to 20. -3.44 means 20 3.44 = 16.56 1.02 means 20 + 1.02 = 21.02

SPSS does this because then the reject/fail to reject rule is simplified a bit: If the confidence interval includes zero, we fail to reject. Otherwise, reject. (-3.44 to 1.02 contains zero, so we fail to reject)

Final point: The confidence interval only works for two-sided tests. Why? The interval cuts off at both ends. It has a lower limit and an upper limit. That means if the sample value is too far above or too far below the null hypothesis value, it will be rejected. By definition that s a two-sided test.

On Monday, we do two-sample t-tests (start of chapter 7). Assignment 3 coming tonight or on the weekend.