Nonparametric tests these test hypotheses that are not statements about population parameters (e.g.,



Similar documents
NONPARAMETRIC STATISTICS 1. depend on assumptions about the underlying distribution of the data (or on the Central Limit Theorem)

EPS 625 INTERMEDIATE STATISTICS FRIEDMAN TEST

Tutorial 5: Hypothesis Testing

Non-Parametric Tests (I)

Nonparametric Two-Sample Tests. Nonparametric Tests. Sign Test

Rank-Based Non-Parametric Tests

NCSS Statistical Software

Difference tests (2): nonparametric

Nonparametric Statistics

THE KRUSKAL WALLLIS TEST

CHAPTER 14 NONPARAMETRIC TESTS

Exact Nonparametric Tests for Comparing Means - A Personal Summary

Testing Group Differences using T-tests, ANOVA, and Nonparametric Measures

Name: Date: Use the following to answer questions 3-4:

Parametric and non-parametric statistical methods for the life sciences - Session I

1 Nonparametric Statistics

Research Methodology: Tools

NCSS Statistical Software

Non-Inferiority Tests for One Mean

Permutation Tests for Comparing Two Populations

NCSS Statistical Software. One-Sample T-Test

The Friedman Test with MS Excel. In 3 Simple Steps. Kilem L. Gwet, Ph.D.

MEASURES OF LOCATION AND SPREAD

Statistics. One-two sided test, Parametric and non-parametric test statistics: one group, two groups, and more than two groups samples

HYPOTHESIS TESTING WITH SPSS:

QUANTITATIVE METHODS BIOLOGY FINAL HONOUR SCHOOL NON-PARAMETRIC TESTS

Chapter G08 Nonparametric Statistics

Nonparametric Statistics

Come scegliere un test statistico

The Wilcoxon Rank-Sum Test

Stat 5102 Notes: Nonparametric Tests and. confidence interval

SCHOOL OF HEALTH AND HUMAN SCIENCES DON T FORGET TO RECODE YOUR MISSING VALUES

Two-Sample T-Tests Assuming Equal Variance (Enter Means)

Two-Sample T-Tests Allowing Unequal Variance (Enter Difference)

Biostatistics: Types of Data Analysis

CHAPTER 12 TESTING DIFFERENCES WITH ORDINAL DATA: MANN WHITNEY U

Likert Scales. are the meaning of life: Dane Bertram

Introduction. Chapter 14: Nonparametric Tests

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

Recall this chart that showed how most of our course would be organized:

Multivariate Analysis of Ecological Data

t Tests in Excel The Excel Statistical Master By Mark Harmon Copyright 2011 Mark Harmon

Statistical tests for SPSS

Projects Involving Statistics (& SPSS)

13: Additional ANOVA Topics. Post hoc Comparisons

BA 275 Review Problems - Week 6 (10/30/06-11/3/06) CD Lessons: 53, 54, 55, 56 Textbook: pp , ,

Parametric and Nonparametric: Demystifying the Terms

NAG C Library Chapter Introduction. g08 Nonparametric Statistics

Overview of Non-Parametric Statistics PRESENTER: ELAINE EISENBEISZ OWNER AND PRINCIPAL, OMEGA STATISTICS

Part 3. Comparing Groups. Chapter 7 Comparing Paired Groups 189. Chapter 8 Comparing Two Independent Groups 217

Paired T-Test. Chapter 208. Introduction. Technical Details. Research Questions

STATISTICAL SIGNIFICANCE OF RANKING PARADOXES

Terminating Sequential Delphi Survey Data Collection

Non-Inferiority Tests for Two Means using Differences

One-Way Analysis of Variance (ANOVA) Example Problem

Introduction to Statistics and Quantitative Research Methods

TABLE OF CONTENTS. About Chi Squares What is a CHI SQUARE? Chi Squares Hypothesis Testing with Chi Squares... 2

UNIVERSITY OF NAIROBI

The Variability of P-Values. Summary

Chapter 7 Section 7.1: Inference for the Mean of a Population

The Statistics Tutor s Quick Guide to

1.5 Oneway Analysis of Variance

Intro to Data Analysis, Economic Statistics and Econometrics

Study Guide for the Final Exam

Nonparametric statistics and model selection

Testing for differences I exercises with SPSS

Permutation & Non-Parametric Tests

Dongfeng Li. Autumn 2010

The Chi-Square Test. STAT E-50 Introduction to Statistics

Comparing Means in Two Populations

Normality Testing in Excel

II. DISTRIBUTIONS distribution normal distribution. standard scores

Friedman's Two-way Analysis of Variance by Ranks -- Analysis of k-within-group Data with a Quantitative Response Variable

Quantitative Methods for Finance

THE CERTIFIED SIX SIGMA BLACK BELT HANDBOOK

A Survey Report on Non-Parametric Hypothesis Testing Including Kruskal-Wallis ANOVA and Kolmogorov Smirnov Goodness-Fit-Test

StatCrunch and Nonparametric Statistics

UNDERSTANDING THE INDEPENDENT-SAMPLES t TEST

How To Test For Significance On A Data Set

IBM SPSS Statistics for Beginners for Windows

Research Methods & Experimental Design

SPSS TUTORIAL & EXERCISE BOOK

3.4 Statistical inference for 2 populations based on two samples

General Method: Difference of Means. 3. Calculate df: either Welch-Satterthwaite formula or simpler df = min(n 1, n 2 ) 1.

Non Parametric Inference

Nonparametric and Distribution- Free Statistical Tests

Chapter 5 Analysis of variance SPSS Analysis of variance

One-Way Analysis of Variance

Data Mining and Data Warehousing. Henryk Maciejewski. Data Mining Predictive modelling: regression

Analysis of Questionnaires and Qualitative Data Non-parametric Tests

SPSS Tests for Versions 9 to 13

The Kruskal-Wallis test:

Hypothesis testing - Steps

In the general population of 0 to 4-year-olds, the annual incidence of asthma is 1.4%

Unit 27: Comparing Two Means

8 INTERPRETATION OF SURVEY RESULTS

P(every one of the seven intervals covers the true mean yield at its location) = 3.

Transcription:

CHAPTER 13 Nonparametric and Distribution-Free Statistics Nonparametric tests these test hypotheses that are not statements about population parameters (e.g., 2 tests for goodness of fit and independence). Distribution-Free tests tests that make no assumptions about the sampled populations. Note. These terms, in practice, tend to be used interchangably, most often with both under the umbrella of nonparametric statistics. Advantages of nonparametric statistics (1) Allow for the testing of hypotheses that are not statements about population parameter values. (2) May be used when the form of the sampled population is unknown. (3) Tend to be computationally easier and more quickly applied than parametric procedures. (4) May be applied when the data being analyzed consist merely of rankings or classifications. Disadvantages of nonparametric statistics (1) The use of nonparametric procedures with data that can be handled with parametric procedures is a waste of time. (2) The application of some nonparametric tests may be laborious for large samples. 134

13. NONPARAMETRIC AND DISTRIBUTION-FREE STATISTICS 135 Wilcoxon Signed-Rank Test for Location for testing a null hypothesis about a population mean where neither z (small sample [n < 30] from a population that is grossly nonnormally distributed, so Central Limit Theorem does not apply) nor t (the sampled population does not su ciently approximate a normal population) is an appropriate test statistic. Test assumptions on the data: 1) Random variable 2) Continuous variable 3) Population symmetric about µ 4) At least an interval scale Calculations: 1) For H 0 : µ = µ 0, let d i = x i µ 0 For H 0 : µ 1 = µ 2, let d i = x 1,i x 2,i Eliminate cases where d i = 0, reducing n accordingly. 2) Rank the usable d i from the smallest absolute value to the largest absolute value. If two or more of the d i are equal, assign each tied value the mean of the rank positions the tied values occupy. 3) Assign each rank the sign of the d i that yields that rank. 4) Find T +, the sum of the ranks with positive signs, and T, the sum of the ranks with negative signs. Test statistic T : For H A : µ 6= µ 0, T is the smaller of T + and T For H A : µ < µ 0, T = T + For H A : µ > µ 0, T = T

136 13. NONPARAMETRIC AND DISTRIBUTION-FREE STATISTICS Example (13.4.1). Using the data on page 683 of the text, we test the hypotheses H 0 : µ = 5.05 H A : µ 6= 5.05 at the = 0.05 level of significance. The critical value in Table K for 2 =.025 and n = 15 is, by using.024, T = 25. We compute T + = 86 and T = 34, yielding T = 34. Since 34 > 25, we are unable to reject H 0. From Table K, we also get that p = 2(.0757) =.1514. This same data is used on pages 39-40 of my SPSS manual, yielding the following output:

13. NONPARAMETRIC AND DISTRIBUTION-FREE STATISTICS 137 The Z is the standardized normal approximation to the test statistic, with p =.140 here. Mann-Whitney test for equal medians for two independent samples Assumptions: (1) Independent random samples of size m and n (2) At least ordinal scales (3) Continuous variables (4) Populations di er only with respect to medians We test the hypothesis against H 0 : M 1 = M 2 H A : M 1 6= M 2, H A : M 1 > M 2, or H A : M1 < M 2 with level of significance =.05. If the two populations are symmetric, so that within each population the mean and the median are the same, the conclusions we reach regarding the two population medians will also apply to the two populations means. Example (13.6.1). We will consider three cases, all with level of significance =.05: (a) H 0 : M X M Y H A : M X < M Y (b) H 0 : M X apple M Y H A : M X > M Y (c) H 0 : M X = M Y H A : M X 6= M Y

138 13. NONPARAMETRIC AND DISTRIBUTION-FREE STATISTICS Calculations: (1) Rank all variables from smallest to largest, yet keeping them separate. Handle ties as in Wilcoxon. See Table 13.6.2 on page 692 of the text. (2) The test statistic is n(n + 1) T = S 2 where n is the number of sample X observations and S is the sum of the ranks assigned to the sample observations from the population X. The choice of which sample s values we label X is arbitrary. In this example, 15(16) T = 145 = 25. 2 (3) For (a), we reject H 0 since 25 < 45 with 45 the critical value obtained from Table L. For (b), we would reject H 0 if T > nm critical value = 150 45 = 105. That is not the case here. For (c), we use =.025 in Table L to get critical values of 40 and 150 40 = 110. We reject H 0 if T < 40 or T > 110. In our case, T < 40, so we reject H 0. (4) Since 22 < 25 < 30, we have.001 < p <.005 for the one-sided tests and.002 < p <.01 for the two-sided test. (5) Table L does not work for n or m greater than 20. In general, if nm apple 400 and mn + min(n, m) apple 220, the exact significance level is based on an 2 algorithm of Dineen and Blakesley. This is what is in Table L. Otherwise, we compute T mn/2 z = p, nm(n + m + 1)/12 which is distributed approximately as a standard normal distribution.

13. NONPARAMETRIC AND DISTRIBUTION-FREE STATISTICS 139 The SPSS output for this example is given below. The Asymp. Sig. is based on Z.