Paired 2 Sample t-test



Similar documents
2 Sample t-test (unequal sample sizes and unequal variances)

How To Test For Significance On A Data Set

Online 12 - Sections 9.1 and 9.2-Doug Ensley

Stats for Strategy Fall 2012 First-Discussion Handout: Stats Using Calculators and MINITAB

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

How To Run Statistical Tests in Excel

Data Transforms: Natural Logarithms and Square Roots

Point Biserial Correlation Tests

Independent t- Test (Comparing Two Means)

Two Related Samples t Test

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

Unit 26: Small Sample Inference for One Mean

Unit 27: Comparing Two Means

Skewed Data and Non-parametric Methods

Understand the role that hypothesis testing plays in an improvement project. Know how to perform a two sample hypothesis test.

A) B) C) D)

Chapter 23 Inferences About Means

Chapter 7. One-way ANOVA

Regression Analysis: A Complete Example

Comparing Means in Two Populations

Good luck! BUSINESS STATISTICS FINAL EXAM INSTRUCTIONS. Name:

Chapter 7 Section 7.1: Inference for the Mean of a Population

Copyright 2007 by Laura Schultz. All rights reserved. Page 1 of 5

How To Check For Differences In The One Way Anova

Pearson's Correlation Tests

1.5 Oneway Analysis of Variance

One-Way Analysis of Variance

Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means

t Tests in Excel The Excel Statistical Master By Mark Harmon Copyright 2011 Mark Harmon

Analysis of Variance ANOVA

Hypothesis testing - Steps

3.4 Statistical inference for 2 populations based on two samples

Case Study Call Centre Hypothesis Testing

Two-Sample T-Tests Assuming Equal Variance (Enter Means)

Testing Group Differences using T-tests, ANOVA, and Nonparametric Measures

UNDERSTANDING THE DEPENDENT-SAMPLES t TEST

Projects Involving Statistics (& SPSS)

The Dummy s Guide to Data Analysis Using SPSS

Independent samples t-test. Dr. Tom Pierce Radford University

Hypothesis Testing. Steps for a hypothesis test:

Analysis of Variance. MINITAB User s Guide 2 3-1

Statistiek II. John Nerbonne. October 1, Dept of Information Science

13: Additional ANOVA Topics. Post hoc Comparisons

Nonparametric Two-Sample Tests. Nonparametric Tests. Sign Test

Estimation of σ 2, the variance of ɛ

Chapter 2 Simple Comparative Experiments Solutions

Difference of Means and ANOVA Problems

Factors affecting online sales

Two-sample hypothesis testing, II /16/2004

THE FIRST SET OF EXAMPLES USE SUMMARY DATA... EXAMPLE 7.2, PAGE 227 DESCRIBES A PROBLEM AND A HYPOTHESIS TEST IS PERFORMED IN EXAMPLE 7.

12: Analysis of Variance. Introduction

Multiple Linear Regression

Stat 411/511 THE RANDOMIZATION TEST. Charlotte Wickham. stat511.cwick.co.nz. Oct

Final Exam Practice Problem Answers

Module 5: Statistical Analysis

12.5: CHI-SQUARE GOODNESS OF FIT TESTS

General Method: Difference of Means. 3. Calculate df: either Welch-Satterthwaite formula or simpler df = min(n 1, n 2 ) 1.

Data Analysis Tools. Tools for Summarizing Data

Using Excel for inferential statistics

EXCEL Analysis TookPak [Statistical Analysis] 1. First of all, check to make sure that the Analysis ToolPak is installed. Here is how you do it:

1. The parameters to be estimated in the simple linear regression model Y=α+βx+ε ε~n(0,σ) are: a) α, β, σ b) α, β, ε c) a, b, s d) ε, 0, σ

Chapter 9. Two-Sample Tests. Effect Sizes and Power Paired t Test Calculation

Predictor Coef StDev T P Constant X S = R-Sq = 0.0% R-Sq(adj) = 0.

Chapter 5 Analysis of variance SPSS Analysis of variance

NCSS Statistical Software. One-Sample T-Test

Recall this chart that showed how most of our course would be organized:

Introduction. Statistics Toolbox

SIMPLE LINEAR CORRELATION. r can range from -1 to 1, and is independent of units of measurement. Correlation can be done on two dependent variables.

KSTAT MINI-MANUAL. Decision Sciences 434 Kellogg Graduate School of Management

Descriptive Analysis


Chapter 7 Section 1 Homework Set A

Exercise 1.12 (Pg )

Mind on Statistics. Chapter 13

= $96 = $24. (b) The degrees of freedom are. s n For the mean monthly rent, the 95% confidence interval for µ is

Analysing Questionnaires using Minitab (for SPSS queries contact -)

Binary Diagnostic Tests Two Independent Samples

Chapter 7 Notes - Inference for Single Samples. You know already for a large sample, you can invoke the CLT so:

Descriptive Statistics

MULTIPLE REGRESSION EXAMPLE

INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA)

Elementary Statistics Sample Exam #3

Paired T-Test. Chapter 208. Introduction. Technical Details. Research Questions

Simple Regression Theory II 2010 Samuel L. Baker

Section 13, Part 1 ANOVA. Analysis Of Variance

Biostatistics: DESCRIPTIVE STATISTICS: 2, VARIABILITY

Chapter 8 Hypothesis Testing Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing

HYPOTHESIS TESTING WITH SPSS:

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression

Chapter 23. Inferences for Regression

One-Way Analysis of Variance (ANOVA) Example Problem

Using Microsoft Excel to Analyze Data from the Disk Diffusion Assay

Part 3. Comparing Groups. Chapter 7 Comparing Paired Groups 189. Chapter 8 Comparing Two Independent Groups 217

Statistics courses often teach the two-sample t-test, linear regression, and analysis of variance

SCHOOL OF HEALTH AND HUMAN SCIENCES DON T FORGET TO RECODE YOUR MISSING VALUES

An Introduction to Statistics Course (ECOE 1302) Spring Semester 2011 Chapter 10- TWO-SAMPLE TESTS

CHAPTER 13. Experimental Design and Analysis of Variance

Lecture Notes Module 1

NCSS Statistical Software

Transcription:

Variations of the t-test: Paired 2 Sample 1 Paired 2 Sample t-test Suppose we are interested in the effect of different sampling strategies on the quality of data we recover from archaeological field surveys. For example, we would hypothesize that the width of survey transects between individuals effects the density estimate of artifacts per unit area, however, we wish to test this hypothesis formally. To test this hypothesis we set up an experiment on a known productive archaeological landscape. We establish twenty five 5 meter by 5 meter plots, and walk them with a survey crew at two different transect widths, the first at 5 meter intervals, the second 1 meter intervals. Each time artifacts are encountered they are recorded, but left in place and the results are tabulated. As this is a manipulated experiment we can no longer treat the two samples as independent, as we are interested in the mean difference between both methods on each plot. These samples are dependent and so we use a paired t-test. The paired test is similar to the independent 2 sample tests but with some important differences. As we are interested in the difference between the two techniques at each plot, we treat each sample plot as a pair of results (Y i5, Y i1 ), and calculate the difference (d i ), that is d = Y Y, i5 i1 where Yij = the recorded density of artifacts on plot i under treatment j. These differences give us a new distribution of twenty five values that has it s own sample statistics; we can then treat this new distribution as a 1 sample t-test, against the null hypothesis that the mean = zero. Therefore, let d = the mean value of the differences between the two treatments. We state the null hypothesis at the a =.5 level (95%): H H O A : d = : d The data is: 5 meter 1 meter Difference 1 9.998 15.384-5.385 2 14.228 54.83 85.425 3 4.669 38.913 1.756 4 4.3 1.875 29.155 5 86.292 1.733 75.559 6 76.255 36.444 39.811 7 16.689 34.774-18.85 8 115.963 49.252 66.712 9 161.497 51.759 19.737 1 29.529.643 28.886 11 37.765.98 36.857 12 16.919 114.969-98.5 13 22.415 42.673-2.259 14 4.496 43.37-38.874 15 22.272 27.73-4.81 ij

Variations of the t-test: Paired 2 Sample 2 16 73.61 19.343 53.718 17 57.477 27.489 29.988 18 64.188 2.88 61.38 19 4.958 2.994 37.963 2 1.224 95.575-85.351 21 91.245 53.564 37.682 22 38.836 32.265 6.571 23 16.985 42.12 118.883 24 1.452 3.544-2.93 25 29.54 11.333 198.27 The important difference with the paired t-test is that the assumption of normality refers to the distribution of the differences, not the original data themselves. Therefore, when we run or calculate our descriptive statistics we are only concerned with the 1 sample of d i s. The MINTAB output and boxplot for the descriptive statistics looks like: Descriptive Statistics Variable N Mean Median Tr Mean StDev SE Mean Differen 25 29.8 3. 28.1 62.9 12.6 Variable Min Max Q1 Q3 Differen -98. 198.2-5.1 64. And the boxplot: Boxplot of Differen -1 1 2 Difference

Variations of the t-test: Paired 2 Sample 3 ooking at the output, we see the mean is very close to the median, and the boxplot shows the distribution is pretty symmetrical, therefore we conclude it is normal and can proceed with the t-test. In fact, our parent distributions are both non-normal as a boxplot demonstrates, but as we are interested in a paired test, this is of no consequence. 2 5 meter 1 5 meter 1 meter Our degrees of freedom in this test are v = 24, so our t CRIT (v = 24, a =.5) = 2.64. To calculate the t STAT we plug our numbers into the equation as usual: t STAT = d se d = 29.8 12.6 = 2.37 As out t STAT > t CRIT we reject the null hypothesis that there is no difference between the results of the sampling strategies in favor of the alternative. To establish our confidence limits at the 95% level: U = 29.8 2.64 12.6 = 29.8 + 2.64 12.6 ( ) = 3.79 ( ) = 55. 81 As these bounds are both positive, they do not encompass zero and we reject the null hypothesis at the 95% level in favor of the alternative. Notice that this is a two-tailed test, and that even though the alternative hypothesis states simply that there is difference between the two samples, we can tell that treatment 1 (5 meter transects) is significantly greater than treatment 2 (1 meter transects) as both the bounds are positive.

Variations of the t-test: Paired 2 Sample 4 The procedure in MINITAB is a little different for the paired test. MINITAB does not have a set function for a paired t-test, but we trick it into doing one for us. In this test we are interested in the differences between the two samples and so we calculate this distribution in either EXCE or MINITAB and transfer the single column of data into MINITAB and run a 1 sample t-test on this data. To calculate the confidence limits: >STAT >BASIC STATISTICS >1 SAMPE t >Click CONFIDENCE INTERVA set at 95% >GRAPHS; select BOXPOT The output for this test looks like: Confidence Intervals Variable N Mean StDev SE Mean 95. % CI Differen 25 29.8 62.9 12.6 (3.8, 55.8) And checking our manual calculations, we see our confidence limits agree with MINITAB s if we were to round them to one decimal place. The boxplot shows: Boxplot of Differen (with Ho and 95% t-confidence interval for the mean) [ _ ] Ho X -1 1 2 Difference

Variations of the t-test: Paired 2 Sample 5 The boxplot shows us the confidence interval around d (x on the boxplot) at the 95% level and the position of our null value, zero. As it falls outside the confidence interval we reject the null hypothesis in favor of the alternative. For the actual t-test: >STAT >BASIC STATISTICS >1 SAMPE t >Click TEST MEAN, set at >ATERNATIVE should be NOT EQUA And the output is: T-Test of the Mean Test of mu =. vs mu not =. Variable N Mean StDev SE Mean T P Differen 25 29.8 62.9 12.6 2.37.26 We find that the T value MINITAB gives us is the same as the one we calculated 2.37. The final p =.26, and as this is less that our stated a =.5, we reject the null hypothesis of no difference in favor of the alternative. Now, notice that our p =.26, and as we are dealing with a two-tailed test, we might think we need to compare this to a/2 =.25, in which case we would fail to reject the null hypothesis. However, t tables are (almost) always based on two-tailed tests and so our a level takes into account both sides of the distribution such that there is no need for us to divide a by 2; it is already done for us.