COMPARING DATA ANALYSIS TECHNIQUES FOR EVALUATION DESIGNS WITH NON -NORMAL POFULP_TIOKS Elaine S. Jeffers, University of Maryland, Eastern Shore*



Similar documents
Statistical tests for SPSS

UNIVERSITY OF NAIROBI

Nonparametric Statistics

EPS 625 INTERMEDIATE STATISTICS FRIEDMAN TEST

Chapter G08 Nonparametric Statistics

NAG C Library Chapter Introduction. g08 Nonparametric Statistics

Non-Parametric Tests (I)

Likert Scales. are the meaning of life: Dane Bertram

CHAPTER 14 ORDINAL MEASURES OF CORRELATION: SPEARMAN'S RHO AND GAMMA

SPSS Explore procedure

Difference tests (2): nonparametric

Nonparametric Two-Sample Tests. Nonparametric Tests. Sign Test

Terminating Sequential Delphi Survey Data Collection

WHAT IS A JOURNAL CLUB?

NCSS Statistical Software

Descriptive Statistics

II. DISTRIBUTIONS distribution normal distribution. standard scores

The Dummy s Guide to Data Analysis Using SPSS

Research Methods & Experimental Design

T test as a parametric statistic

Association Between Variables

Chapter 12 Nonparametric Tests. Chapter Table of Contents

Overview of Non-Parametric Statistics PRESENTER: ELAINE EISENBEISZ OWNER AND PRINCIPAL, OMEGA STATISTICS

THE KRUSKAL WALLLIS TEST

Biostatistics: Types of Data Analysis


NONPARAMETRIC STATISTICS 1. depend on assumptions about the underlying distribution of the data (or on the Central Limit Theorem)

Permutation Tests for Comparing Two Populations

The Kendall Rank Correlation Coefficient

Come scegliere un test statistico

QUANTITATIVE METHODS BIOLOGY FINAL HONOUR SCHOOL NON-PARAMETRIC TESTS

CALCULATIONS & STATISTICS

Standard Deviation Estimator

DATA ANALYSIS. QEM Network HBCU-UP Fundamentals of Education Research Workshop Gerunda B. Hughes, Ph.D. Howard University

DATA COLLECTION AND ANALYSIS

Skewed Data and Non-parametric Methods

MASTER COURSE SYLLABUS-PROTOTYPE PSYCHOLOGY 2317 STATISTICAL METHODS FOR THE BEHAVIORAL SCIENCES

Two-Sample T-Tests Assuming Equal Variance (Enter Means)

Rank-Based Non-Parametric Tests

Additional sources Compilation of sources:

Introduction to Quantitative Methods

Sample Size and Power in Clinical Trials

Introduction to Analysis of Variance (ANOVA) Limitations of the t-test

Nonparametric tests these test hypotheses that are not statements about population parameters (e.g.,

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression

Tutorial 5: Hypothesis Testing

X X X a) perfect linear correlation b) no correlation c) positive correlation (r = 1) (r = 0) (0 < r < 1)

Statistics. Measurement. Scales of Measurement 7/18/2012

UNDERSTANDING THE DEPENDENT-SAMPLES t TEST

MEASURES OF LOCATION AND SPREAD

Statistics for Sports Medicine

Two-Sample T-Tests Allowing Unequal Variance (Enter Difference)

DESCRIPTIVE STATISTICS. The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses.

Parametric and non-parametric statistical methods for the life sciences - Session I

Projects Involving Statistics (& SPSS)

The correlation coefficient

Recall this chart that showed how most of our course would be organized:

NCSS Statistical Software

STA-201-TE. 5. Measures of relationship: correlation (5%) Correlation coefficient; Pearson r; correlation and causation; proportion of common variance

SOCIOLOGY 7702 FALL, 2014 INTRODUCTION TO STATISTICS AND DATA ANALYSIS

Chapter 7 Section 7.1: Inference for the Mean of a Population

Curriculum Map Statistics and Probability Honors (348) Saugus High School Saugus Public Schools

Descriptive Analysis

CHAPTER 12 TESTING DIFFERENCES WITH ORDINAL DATA: MANN WHITNEY U

Chi-square test Fisher s Exact test

Types of Data, Descriptive Statistics, and Statistical Tests for Nominal Data. Patrick F. Smith, Pharm.D. University at Buffalo Buffalo, New York

The Statistics Tutor s Quick Guide to

Statistics. One-two sided test, Parametric and non-parametric test statistics: one group, two groups, and more than two groups samples

business statistics using Excel OXFORD UNIVERSITY PRESS Glyn Davis & Branko Pecar

Categorical Data Analysis

SCHOOL OF HEALTH AND HUMAN SCIENCES DON T FORGET TO RECODE YOUR MISSING VALUES

AC - Clinical Trials

Introduction to Statistics and Quantitative Research Methods

6.4 Normal Distribution

NCSS Statistical Software. One-Sample T-Test

Quantitative Methods for Finance

Guided Reading 9 th Edition. informed consent, protection from harm, deception, confidentiality, and anonymity.

Simple linear regression

Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics.

Part 3. Comparing Groups. Chapter 7 Comparing Paired Groups 189. Chapter 8 Comparing Two Independent Groups 217

Correlational Research. Correlational Research. Stephen E. Brock, Ph.D., NCSP EDS 250. Descriptive Research 1. Correlational Research: Scatter Plots

TABLE OF CONTENTS. About Chi Squares What is a CHI SQUARE? Chi Squares Hypothesis Testing with Chi Squares... 2

T-test & factor analysis

Non Parametric Inference

KADI SARVA VISHWA VIDYALAYA GANDHINAGAR. Ph.D. Course Work SOCIAL WORK

Section 12 Part 2. Chi-square test

Mathematics within the Psychology Curriculum

The Friedman Test with MS Excel. In 3 Simple Steps. Kilem L. Gwet, Ph.D.

Testing Group Differences using T-tests, ANOVA, and Nonparametric Measures

Study Guide for the Final Exam

Testing Gaussian and Non-Gaussian Break Point Models: V4 Stock Markets

CHAPTER 13 SIMPLE LINEAR REGRESSION. Opening Example. Simple Regression. Linear Regression

Economic Statistics (ECON2006), Statistics and Research Design in Psychology (PSYC2010), Survey Design and Analysis (SOCI2007)

Error Type, Power, Assumptions. Parametric Tests. Parametric vs. Nonparametric Tests

Using Excel for inferential statistics

Section Format Day Begin End Building Rm# Instructor. 001 Lecture Tue 6:45 PM 8:40 PM Silver 401 Ballerini

Exact Nonparametric Tests for Comparing Means - A Personal Summary

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

Statistics in Medicine Research Lecture Series CSMC Fall 2014

Analyzing Research Data Using Excel

Transcription:

COMPARING DATA ANALYSIS TECHNIQUES FOR EVALUATION DESIGNS WITH NON -NORMAL POFULP_TIOKS Elaine S. Jeffers, University of Maryland, Eastern Shore* The data collection phases for evaluation designs may involve analysis of data from distributions that are skewed, restricted in range, lacking in variability or otherwise possessing characteristics rendering it difficult to assume normality of the mean and variance of the parent population. When assumptions have been violated, the following are some examples of transformations that can be used to result in a closer approximation to the normal distribution: 1. The square -root transformation This transformation is useable when data are in the form of frequency counts as in "yes" "no" responses. The transformed score (X') would be: X' =. The arc sine transformation This transformation is useable when scores are in proportions, as in "percentage correct responses." The transformed score (X') would be: X' = arc 3. The logarithmic transformation This transformation is useable when the data are decidedly skewed. The transformed score (X') would be: X' = log X If the measures are small, the transformation might be: X' = log (X 1) can be used for fairly complex designs. They are also useable when the researcher wants to present a more elaborate analysis which would extend beyond a classical analysis of variance designs. Transformed data, however, changes the relationships between the statistics observed from the original data. Hence, when generali.zi.ng results, caution may need to be used in relating the results of the converted measures to the original measures. Non -parametric and other non -traditional tests are other alternatives that have been used when assumptions of a normal distribution have not been met. May and Konklin developed a simplified version of the nonparametric counterpart to the F test for trend (4). Cooper used a non - parametric test for increasing trend by making comparisons of all possible pairs of observations (1). Lewis and Johnson illustrated the use of Kendall's Coefficient of Concordance for the evaluation of the extent of agreement among a set of judges, each of whom ranks in entirety a set of objects (3). These authors demonstrated that critical values of "W" are given only by approximation and only for large numbers. It was shown that Kendall's Coefficient of Concordance can be used to test more specific hypothesis about agreement on the criteria of choice within a group of judges (3). Simms and Collons illustrated the use of an algorithm for the Coefficient of Concordance which allows for the identification of the particular combination of individuals in a group, and all possible combinations of such individuals who exhibit or display the greatest amount of consensus (7). Reynolds has stressed that indiscriminate use of tests of significance contributes to theoretical inadequacies in terms of scope and explanatory power (5). Young has stressed that a large part of the data which social scientists must deal with is not distributed according to known mathematical functions; non- parametric techniques (order statistics) offer the opportunity to test hypothesis which require no assumption about the form of the distribution of the population (8). Siegel has offered a variety of such non -parametric techniques which are particularly useable at the nominal and ordinal level of measurement (6). Non - parametric tests can be used for pilot testing of instruments within an evaluation design, where assumption of normality cannot be taken for grsx.ted. In an abbreviated pilottested needs analysis procedure, first and second year university students ranked their preferences for the following goals for an educational research center: 1) Develop skills in evaluation of Educational Programs. ) Develop skills in behavioral objectives and criterion -referenced testing- - applications and development. 3) Develop knowledge of Instructional Systems Development. 4) Improve personal proficiency with Instructional Systems Development. 5) Make contacts with happenings in Educational Research. 6) Earn a PHD. 7) Build an Academic Recommendation. 8) Obtain tools needed for Educational Research- -math, statistics, computer operations and research design. 9) Become familiar with Professional Societies, and Journals of Educational Research 10) Take courses in other centers to keep abreast of most recent trends in Education. 11) Understanding of knowledge of diffusion. 1) Developing Skills in Curriculum Development. 440

13) Further knowledge in concept of Creativity and Research. 14) Gain realistic and practical experience in Evaluation. 15) Relate Evaluation and Teaching Skills to Humanistic Education Field. The sign -rank test was used to test the hypothesis of no differences in ranking of these goals by first and second year university students as follows: Ho (Null Hypothesis).. There is no difference in the ranking of these goals as perceived by first and second year students; alpha level equal.05- H1 (Alternative Hypothesis).. There is a difference in the ranking of these goals as perceived by first and second year students. Table I presents an application of the sign - rank test used to test differences for these purposes, The null hypothesis tested by the sign test is that: p(xa?)cb) = p(xagxb) = Another way of stating the null hypothesis is that the median difference is zero. Focus is placed on the direction of the differences rather than the size of the differences. Reference is made to Table D in Siegel - -Table of Probabilities Associated with Values as Small, as cbserved Values of x in the Binomial Test (B). The probability of getting this result for a two -tailed test is.180. This is outside the rejection region and the decision is to accept the null hypothesis. quence of the Type I or the Type II error. This is a decision that should be made based on the type of variables that are being investigated and the sampling environment. Hence, in researching the effects of a treatment to reduce the cause of cancer or in evaluating the effects of an educational program, it is important that Type II errors not be made. The power of non - parametric methods relative to Type I and Type II errors have been summarized in detail by Festinger and Katz (). A Type I error is made when the decision is made to reject the null hypothesis when it should be accepted; a Type II error is made when the decision is made to accept the null hypothesis when it should be rejected. Power = 1 minus the probability of a Type II error; stated alternatively, power is the probability of appropriately rejecting the null hypothesis. Within evaluation setting, generalizability of results is not necessarily a goal. However, it is very important that results be valid for a particular decision - making setting. Failure to adequately consider the importance of this ffctcr results in the possibility of collecting data that lacks decision -maker validity. Hence, in an evaluation setting, the decision to use a parametric or a non -parametric test would depend on the relative consequences of the power of that test on decision -making accuracy. *The author wishes to acknowledge appreciation of the University of Mass. and Dr. N. Nagarajan formerly of the University of Maryland, Eastern Shore, for data presented. Non -parametric tests can be used in the preliminary stages of validating an instrument that will later be used for evaluation purposes. Table II illustrates the use of the Coefficient of Concordance to test the degree of agreement among Freshman, Sophomore, Junior and Senior students, concerning the quality of library services. Comparable groups of students from each of these four levels were asked to rank the quality of library services in four different testing situations using different instruments. The results (W =.99) indicated a high degree of agreement among the four groups of students. Table III lists a variety of non -parametric methods which can be used at various levels of measurement (nominal, ordinal and interval). A researcher who decides on the use of a nonparametric test must decide on the most appropriate test to use in terms of the level of measurement. This is an important consideration since the use of a non -parametric test yields less powerful results when a parametric tests can be used for the same purpose. The researcher must decide upon the relative conse- 441

Table I An Application of the Sign -Rank Test of Differences in 1st and nd Year Student Responses to Group Goals nd Year (a) 1st Year (B) nd yr -1st yr. Direction of Difference Signs (A-D) 6 5 1 XB 4 XA XE 4 XA 0 XA = XB 0 7 5 XA 'XB 6 4 XA 5 0 5 XA 7 8-1 -YE - 7 4 3 XA' XB 7 5 3 4-1 XA XB - 3-1 XA - 4 3 1 XA XS 5 6-1 XA- XB - 4 XA 'XB x number of minority signs = 4 N -Dumber of non -zero ranks = 14 Table II Coefficient of Concordance Groups Conditions IV A 4 3 1 4 3 1 4 3 1 D 4 3 1 16 11 9 4 Sum or ranks 40 Mean of R = 10 = W =.-0.99 1 K (N3 - N) 1 s = Sum of squares of observed deviations from mean of R. k = n = 1 1X # of sets of rankings --# of judges (i.e. A,B,C,D) # of entities (objects or individuals) ranked (i.e. Conditions I -IV) 3 -n) = max. possible sum of squared deviations, i.e. sum "s" which would occur with perfect agreement among K ranks 44

TABLE III (From Siegel's Non -Parametric Statistics) One -sample case (Chap. 4) NONPARAMETRIC STATISTICAL TEST ** Two -sample case Related samples Independent samples Related Samples (Chap. 5) (Chap. 6) (Chap. 7) k- sample case Independent Samples (Chap. 8) NONPARAMETRIC MEASURE OF CORRELATION (Chap. 9) Binomial test, pp. 36-4 x one -sample test, pp. 4-47 Kolmogorov -Smirnov one -sample test, pp. 47-5 One -sample runs tests pp. 5-58 test for the significance of changes, pp. 63-67 Sign test, pp. 68-75 Wilcoxon matchedpairs signed-ranks tests** pp. 75-83 Fisher exact probability test, pp. 6-104 x test for two independent samples, pp. 104-111 measurement) Median test, pp. 111-116 Mann -Whitney U test, pp. 116-17 Kolmogorov- Smirnov two -sample test, pp. 17-136 Wald -Wolfowitz runs test, pp. 136-145 Moses test of extreme reactions, pp. 145-15 Cochran Q test, pp. 161-166 Friedman two -way analysis of variance, pp. 166-17 x test for k independent samples, pp. 175-179 Extension of the median test, pp. 179-184 Kruskal- Wallis one - way analysis of variance, pp. 184-193 Contingency coefficient C, pp. 196-0 Spearman rank correlation coefficient: re, pp. 0-13 Kendall rank correlation coefficient: r, pp.13-3 Kendall partial rank correlation coefficient: 3 rzy,z, -9 Kendall coefficient of concordance: W, pp. 9-38 (Ordinal measurement Walsh test, pp. 83-87 Randomization test for matched pairs, pp. 88-9 (Interval measurement) ar.docrization test for two independent samples, pp. 15-156 (Interval * *Each column liss, cumulatively downward, the tests applicable to the given level of measurement. For exar.ple, in the case of k related samples, when ordinal measurement has been achieved both the Friedman two -way analysis of variance and the Cochran Q test are applicable.' ** *The Wilcoxon te requires ordinal measurement not only within pairs, as is required for the sign test, but also of the differences between pairs. -'ee the discussion on pp. 75-76.

References (1) Cooper, M. "A Non- Parametric Test for Increasing Trend ". Educational and Psychological Measurement, Vol. 35, 1975, pp. 303-306. () Festinger, L. and Katz, D. Research Methods in the Behavioral Sciences. New York: Holt, Rinehart and Winston, Inc., 1966, pp. 536-577. (3) Lewis, G. H. and Johnson, R. G. "Kendall's Coefficient of Concordance for Sociometric Rankings with Self Excluded ". Sociometry, 1971, Vol. 34, No. 4, December 1971, pp. 496-503. (4) May, R. and Konkin, P. "A Non -Parametric Test of an Ordered Hypothesis for K Independent Samples." Educational and Psychological Measurement, Vol. 30, 1970, pp. 51-58. (5) Reynolds, R. Jr. "Replication and Substantive Import: A Critique on the Use of Statistical Inference in Social Research ". Sociology and Social Research, Vol. 53, No. 3, April 1969, pp. 99-310. (6) Siegel, S. Non- arametric Statistics for The Behavioral Sciences. New York: McGraw -Hill Book Co., 1956. (7) Simms, C. and Collons, R. "Sociological Research Applications of an Algorithm for the Coefficient of Concordance ". American Sociologist, Vol. 4, No. 4, November 1969, pp. 31-34. (8) Young, P. Scientific Social Surveys and Research. Englewood Cliffs, New Jersey: Prentice -Hall, Inc., 1966. pp. 340-341. 444