Supplement 16B: Small Sample Wilcoxon Rank Sum Test

Hypothesis Testing Steps

When the samples have fewer than 10 observations, it may not be appropriate to use the large-sample Wilcoxon Rank Sum (Mann-Whitney) test. However, tables of critical values allow us to perform this test safely for small samples. As in the large-sample case, the hypotheses are:

H0: Populations are the same
H1: Populations are not the same

If the analyst is willing to assume that the populations differ only in location (i.e., the center of the distributions is shifted) and are otherwise the same, we can view this as a test of two medians. For a two-sided test the hypotheses would then be:

H0: M1 = M2 (medians are the same)
H1: M1 ≠ M2 (medians are not the same)

The hypothesis testing procedure is similar to the large-sample case until we get to Step 4.

Step 1: Combine the two samples.

Step 2: Calculate the ranks for the combined samples, ranking from smallest to largest. Be careful to average the ranks if there are tied data values.

Warning: If you are using Excel, use the Excel 2010 function =RANK.AVG(X,Array,1) to rank from smallest to largest. Be sure to specify the third argument 1 because the default is to rank from largest to smallest. That is, if you were to use the function =RANK.AVG(X,Array,0) or if the third argument is omitted as in =RANK.AVG(X,Array,) your data will be ranked from largest to smallest (the opposite of the test format shown here). Also, beware of the old Excel 2007 function =RANK(X,Array) and the new Excel 2010 function =RANK.EQ(X,Array), which do not average the ranks of tied data values.

Step 3: Separate the ranks into the original groups and sum the ranks for each group. Denote the rank sums T1 and T2.

Step 4: The test statistic is the sum of the ranks from the smaller sample (the sample with fewer observations). To avoid confusion, it is best to list the smaller sample first, so that the test statistic can be denoted T1. Table 16.B1 shows the critical values for a two-tailed test at α = .05 (upper and lower 2.5% critical values). Reject H0 if T1 ≤ W_Lower or if T1 ≥ W_Upper.
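
Steps 1 through 3 can be sketched in a few lines of Python for readers who prefer a script to a spreadsheet. This is only an illustrative sketch: the function names average_ranks and rank_sums are hypothetical, and the toy data are invented for the example rather than taken from the text. The ranking is ascending with tied values sharing their average rank, which is the behavior the warning above asks for from =RANK.AVG(X,Array,1).

```python
def average_ranks(values):
    """Return ascending ranks for values; tied values share their average rank."""
    # Indices of the values, ordered from smallest to largest value.
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        # Extend the block while the next sorted value equals the current one.
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        # Sorted positions start..end occupy ranks start+1..end+1; use their mean.
        mean_rank = (start + 1 + end + 1) / 2
        for k in range(start, end + 1):
            ranks[order[k]] = mean_rank
        start = end + 1
    return ranks


def rank_sums(sample_1, sample_2):
    """Steps 1-3: combine the samples, rank them, and sum the ranks by group."""
    combined = list(sample_1) + list(sample_2)
    ranks = average_ranks(combined)
    t1 = sum(ranks[:len(sample_1)])
    t2 = sum(ranks[len(sample_1):])
    return t1, t2


# Hypothetical toy data (not from the text); the value 20 is tied across groups.
small = [20, 10]
large = [20, 30, 40]
print(average_ranks(small + large))  # [2.5, 1.0, 2.5, 4.0, 5.0]
print(rank_sums(small, large))       # (3.5, 11.5)
```

Listing the smaller sample first, as Step 4 recommends, means the first value returned by rank_sums is the test statistic T1.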

Illustration: Computer Repair Claims Warranty

Baldr Electronics Emporium is a medium-sized electronics retailer that offers a one-year parts and labor warranty on laptop computers that it sells. During the month of October, there were 15 claims for warranty repairs for its top two brands of laptops (6 claims for brand A and 9 claims for brand B). The store noted the number of days the laptops had been owned prior to coming in for repair. Is there a difference in the days owned prior to repairs? There is doubt about whether the data are normally distributed, so we will perform the Wilcoxon rank sum test to compare the medians. The color-coded data are:

Step 1: Combine the two samples.

Brand A   Brand B
  225        83
   79        52
  225       113
   52        67
   29       165
   98       132
             48
            230
            255

Step 2: Calculate the ranks for the combined samples, sorting from smallest to largest. Be careful to average the ranks if there are tied data values. For example, here the value 52 occurs twice, as does the value 225. Color coding helps you keep track of data in the combined samples.

Combined and Sorted   Rank     Brand A   Rank     Brand B   Rank
         29            1          29      1          48      2
         48            2          52      3.5        52      3.5
         52            3.5        79      6          67      5
         52            3.5        98      8          83      7
         67            5         225     12.5       113      9
         79            6         225     12.5       132     10
         83            7                            165     11
         98            8                            230     14
        113            9                            255     15
        132           10
        165           11
        225           12.5
        225           12.5
        230           14
        255           15

Brand A: Sum T1 = 43.5, n1 = 6, Median 1 = 88.5
Brand B: Sum T2 = 76.5, n2 = 9, Median 2 = 113.0
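
The rank sums and medians in the table above can be reproduced with a short Python sketch. The helper name midrank is hypothetical, and the order in which the observations are listed is an assumption read off the extracted data table (the order does not affect the ranks). Instead of sorting, the sketch uses the fact that the average rank of a value equals the number of smaller values plus half of (the number of tied copies plus one).

```python
from statistics import median

# Days owned before the warranty claim (listing order is an assumption).
brand_a = [225, 79, 225, 52, 29, 98]
brand_b = [83, 52, 113, 67, 165, 132, 48, 230, 255]
combined = brand_a + brand_b


def midrank(x, data):
    """Ascending rank of x within data; tied values share their average rank."""
    below = sum(1 for v in data if v < x)   # values strictly smaller than x
    tied = sum(1 for v in data if v == x)   # copies of x, including x itself
    return below + (tied + 1) / 2


t1 = sum(midrank(x, combined) for x in brand_a)
t2 = sum(midrank(x, combined) for x in brand_b)

print(t1, t2)                            # 43.5 76.5
print(median(brand_a), median(brand_b))  # 88.5 113
```

For example, the value 52 has two smaller values (29 and 48) and occurs twice, so its rank is 2 + (2 + 1)/2 = 3.5, matching the table.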

Step 3: Separate the ranks into the original groups and sum them for each group. The test statistic is the sum of the ranks from the smaller sample (the sample with fewer observations). If you wish, you can check your sums by adding; the ranks must sum to n(n+1)/2 where n = n1 + n2. In this case, n = n1 + n2 = 6 + 9 = 15, so the ranks must sum to n(n+1)/2 = 15(15+1)/2 = 120. Checking our rank sums, we get T1 + T2 = 43.5 + 76.5 = 120, which confirms our rank calculations.

Step 4: Table 16.B1 shows the critical values for a two-tailed test at α = .05 (upper and lower 2.5% critical values). We would reject H0 if T1 ≤ W_Lower or if T1 ≥ W_Upper. For our data, n1 = 6 and n2 = 9, so the decision rule is:

Reject H0 if T1 ≤ 31 or if T1 ≥ 65

Because our test statistic is T1 = 43.5, we cannot reject H0. Although there is a difference in the sample medians, it is not great enough to conclude unequal population medians.

TABLE 16.B1 Lower 2.5% and Upper 2.5% Critical Values for Wilcoxon Rank Sum Test

        n1
n2      4        5        6        7        8        9        10       11       12
 4      10,26
 5      11,29    17,38
 6      12,32    18,42    26,52
 7      13,35    20,45    27,57    36,69
 8      14,38    21,49    29,61    38,74    49,87
 9      14,42    22,53    31,65    40,79    51,93    62,109
10      15,45    23,57    32,70    42,84    53,99    65,115   78,132
11      16,48    24,61    34,74    44,89    55,105   68,121   81,139   96,157
12      17,51    26,64    35,79    46,94    58,110   71,127   84,146   99,165   115,185

Decision Rule: Reject the null hypothesis if T1 ≤ W_Lower or if T1 ≥ W_Upper, where T1 is the rank sum from the smaller sample.

Source: F. Wilcoxon and R.A. Wilcox, Some Rapid Approximate Statistical Procedures, Lederle Laboratories, 1964. Used with permission of the American Cyanamid Company.

Step 5: No action is required. However, the retailer may wish to continue accumulating data on the length of time before each warranty claim for these two top-selling brands. It is possible that in a larger sample, significant differences might be detected.

Computer Software

There are many reasons to prefer using a computer for this type of test.
First, the calculations are easier. Second, you don't need tables. Third, tables become awkwardly large for this test when sample sizes become larger. Table 16.B1, for example, is abbreviated. If you have sample sizes between 13 and 20, you would need a larger table. Figure 16.B1 shows the output from Minitab, which confirms our calculations and our decision not to reject H0 at α = .05. Note that Minitab also provides a confidence interval for the difference of medians as well as a p-value (0.6367), which shows that the observed difference in medians is within the realm of chance.

FIGURE 16.B1 Minitab Results for Wilcoxon Rank Sum/Mann-Whitney Test

    Mann-Whitney Test and CI: Brand A, Brand B

              N   Median
    Brand A   6     88.5
    Brand B   9    113.0

    Point estimate for ETA1-ETA2 is -15.0
    96.1 Percent CI for ETA1-ETA2 is (-113.0,112.0)
    W = 43.5
    Test of ETA1 = ETA2 vs ETA1 not = ETA2 is significant at 0.6374
    The test is significant at 0.6367 (adjusted for ties)

Section Exercises

Note: * Indicates optional exercises based on the large-sample z-test or using software that may not be available to students.

16B.1 A trucking company wants to compare the number of miles driven by two delivery truck drivers in one week on different days (n1 = 5 days, n2 = 7 days). Do not assume that distances driven are normally distributed. (a) Use Table 16.B1 to test the hypothesis of equal medians at α = .05. Show the steps in your analysis. (b*) If possible, check your work using Minitab or another computer package. (c*) Perform a large-sample test using the z-test. Is your conclusion the same? Delivery

Driver 1   Driver 2
  128         97
  102        158
   78        112
   40        112
   76        216
             316
             112

16B.2 Below are data for two different regions, showing the number of days that kidney transplant patients had to wait before a donor was found (n1 = 6 patients, n2 = 8 patients). Do not assume a normal distribution of waiting times. (a) Use Table 16.B1 to test the hypothesis of equal medians at α = .05. Show the steps in your analysis. (b*) If possible, check your work using Minitab or another computer package. (c*) Perform a large-sample test using the z-test. Is your conclusion the same? Kidneys

East Region   West Region
    109           137
    248            93
     85            52
    107           191
     28           236
     67           205
                   92
                  133
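
As an example of the kind of computer check that parts (b*) and (c*) ask for, the sketch below applies the usual large-sample normal approximation to the warranty illustration (not to the exercise data). The function name normal_approx_p_value is hypothetical. The sketch assumes a continuity correction of 0.5 and the standard tie adjustment to the variance; that Minitab computes its p-values this way is an assumption, although under these choices the results come out close to the two values shown in Figure 16.B1.

```python
import math


def normal_approx_p_value(t1, n1, n2, tie_sizes=()):
    """Two-tailed p-value for rank sum t1 using the normal approximation.

    tie_sizes lists the size of each group of tied values (e.g. (2, 2)
    when two different values each occur twice). An empty tuple means
    no tie adjustment is applied.
    """
    n = n1 + n2
    mean_t1 = n1 * (n + 1) / 2
    # Tie adjustment: subtract sum(t^3 - t) / (n(n-1)) from (n + 1).
    tie_term = sum(t**3 - t for t in tie_sizes) / (n * (n - 1))
    var_t1 = n1 * n2 / 12 * ((n + 1) - tie_term)
    # Continuity correction (assumed): move t1 half a unit toward its mean.
    z = max(abs(t1 - mean_t1) - 0.5, 0.0) / math.sqrt(var_t1)
    # 2 * (1 - Phi(z)) written with the complementary error function.
    return math.erfc(z / math.sqrt(2))


# Warranty illustration: T1 = 43.5, n1 = 6, n2 = 9; values 52 and 225 each occur twice.
p_plain = normal_approx_p_value(43.5, 6, 9)
p_ties = normal_approx_p_value(43.5, 6, 9, tie_sizes=(2, 2))
print(round(p_plain, 3), round(p_ties, 3))
```

Because both samples here have fewer than 10 observations, the table-based decision rule remains the primary method; the approximation is shown only as a cross-check.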