TABLE OF CONTENTS. About Chi Squares... 1. What is a CHI SQUARE?... 1. Chi Squares... 1. Hypothesis Testing with Chi Squares... 2



Similar documents
Additional sources Compilation of sources:

Recommend Continued CPS Monitoring. 63 (a) 17 (b) 10 (c) (d) 20 (e) 25 (f) 80. Totals/Marginal

First-year Statistics for Psychology Students Through Worked Examples

Calculating, Interpreting, and Reporting Estimates of Effect Size (Magnitude of an Effect or the Strength of a Relationship)

UNDERSTANDING THE INDEPENDENT-SAMPLES t TEST

Descriptive Analysis

Chapter 13. Chi-Square. Crosstabs and Nonparametric Tests. Specifically, we demonstrate procedures for running two separate

Chapter 23. Two Categorical Variables: The Chi-Square Test

Research Methods & Experimental Design

SCHOOL OF HEALTH AND HUMAN SCIENCES DON T FORGET TO RECODE YOUR MISSING VALUES

Descriptive Statistics

EPS 625 INTERMEDIATE STATISTICS FRIEDMAN TEST

IBM SPSS Statistics 20 Part 4: Chi-Square and ANOVA

Is it statistically significant? The chi-square test

Chapter 5 Analysis of variance SPSS Analysis of variance

Bivariate Statistics Session 2: Measuring Associations Chi-Square Test

Nonparametric Statistics

Having a coin come up heads or tails is a variable on a nominal scale. Heads is a different category from tails.

CHAPTER 14 ORDINAL MEASURES OF CORRELATION: SPEARMAN'S RHO AND GAMMA

Projects Involving Statistics (& SPSS)

Study Guide for the Final Exam

The Chi-Square Test. STAT E-50 Introduction to Statistics

Association Between Variables

January 26, 2009 The Faculty Center for Teaching and Learning

UNDERSTANDING THE TWO-WAY ANOVA

Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1)

SPSS Explore procedure

The Statistics Tutor s Quick Guide to

Data analysis process

Statistics. One-two sided test, Parametric and non-parametric test statistics: one group, two groups, and more than two groups samples

Odds ratio, Odds ratio test for independence, chi-squared statistic.

SPSS Tests for Versions 9 to 13

TRAINING PROGRAM INFORMATICS

Recall this chart that showed how most of our course would be organized:

Nonparametric Tests. Chi-Square Test for Independence

Statistics Review PSY379

Rank-Based Non-Parametric Tests


ANALYSING LIKERT SCALE/TYPE DATA, ORDINAL LOGISTIC REGRESSION EXAMPLE IN R.

An introduction to IBM SPSS Statistics

Chi Square Distribution

CHAPTER IV FINDINGS AND CONCURRENT DISCUSSIONS

INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA)

MASTER COURSE SYLLABUS-PROTOTYPE PSYCHOLOGY 2317 STATISTICAL METHODS FOR THE BEHAVIORAL SCIENCES

Elementary Statistics

business statistics using Excel OXFORD UNIVERSITY PRESS Glyn Davis & Branko Pecar

THE KRUSKAL WALLLIS TEST

One-Way Analysis of Variance (ANOVA) Example Problem

II. DISTRIBUTIONS distribution normal distribution. standard scores

UNDERSTANDING THE DEPENDENT-SAMPLES t TEST

This chapter discusses some of the basic concepts in inferential statistics.

Testing differences in proportions

Using Stata for Categorical Data Analysis

" Y. Notation and Equations for Regression Lecture 11/4. Notation:

in nigerian companies.

DDBA 8438: Introduction to Hypothesis Testing Video Podcast Transcript

Chi Square Tests. Chapter Introduction

A COMPARISON OF LOW PERFORMING STUDENTS ACHIEVEMENTS IN FACTORING CUBIC POLYNOMIALS USING THREE DIFFERENT STRATEGIES

ROCHESTER INSTITUTE OF TECHNOLOGY COURSE OUTLINE FORM COLLEGE OF SCIENCE. School of Mathematical Sciences

CHAPTER 5 COMPARISON OF DIFFERENT TYPE OF ONLINE ADVERTSIEMENTS. Table: 8 Perceived Usefulness of Different Advertisement Types

DPLS 722 Quantitative Data Analysis

Crosstabulation & Chi Square

Mind on Statistics. Chapter 15

IBM SPSS Statistics for Beginners for Windows

Directions for using SPSS

Consider a study in which. How many subjects? The importance of sample size calculations. An insignificant effect: two possibilities.

Tutorial 5: Hypothesis Testing

Using Excel for inferential statistics

Chi-square test Fisher s Exact test

Experimental Designs (revisited)

Terminating Sequential Delphi Survey Data Collection

Chapter 7 Section 7.1: Inference for the Mean of a Population

The Dummy s Guide to Data Analysis Using SPSS

Correlational Research. Correlational Research. Stephen E. Brock, Ph.D., NCSP EDS 250. Descriptive Research 1. Correlational Research: Scatter Plots

Two Related Samples t Test

Analysis of categorical data: Course quiz instructions for SPSS

Statistical tests for SPSS

CHAPTER 12. Chi-Square Tests and Nonparametric Tests LEARNING OBJECTIVES. USING T.C. Resort Properties

Nonparametric tests these test hypotheses that are not statements about population parameters (e.g.,

Chi Squared and Fisher's Exact Tests. Observed vs Expected Distributions

Instructions for SPSS 21

Section Format Day Begin End Building Rm# Instructor. 001 Lecture Tue 6:45 PM 8:40 PM Silver 401 Ballerini

Categorical Data Analysis

Contingency Tables and the Chi Square Statistic. Interpreting Computer Printouts and Constructing Tables

Foundation of Quantitative Data Analysis

One-Way ANOVA using SPSS SPSS ANOVA procedures found in the Compare Means analyses. Specifically, we demonstrate

UNIVERSITY OF NAIROBI

Chapter 7. One-way ANOVA

SPSS TUTORIAL & EXERCISE BOOK

Analysing Questionnaires using Minitab (for SPSS queries contact -)

UNDERSTANDING ANALYSIS OF COVARIANCE (ANCOVA)

CHAPTER 11 CHI-SQUARE AND F DISTRIBUTIONS

STATISTICA Formula Guide: Logistic Regression. Table of Contents

Introduction to Statistics with GraphPad Prism (5.01) Version 1.1

Multivariate Analysis of Variance. The general purpose of multivariate analysis of variance (MANOVA) is to determine

Data Mining Techniques Chapter 5: The Lure of Statistics: Data Mining Using Familiar Tools

ANOVA ANOVA. Two-Way ANOVA. One-Way ANOVA. When to use ANOVA ANOVA. Analysis of Variance. Chapter 16. A procedure for comparing more than two groups

Friedman's Two-way Analysis of Variance by Ranks -- Analysis of k-within-group Data with a Quantitative Response Variable

WHAT IS A JOURNAL CLUB?

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression

Transcription:

About Chi Squares

TABLE OF CONTENTS About Chi Squares... 1 What is a CHI SQUARE?... 1 Chi Squares... 1 Goodness of fit test (One-way χ 2 )... 1 Test of Independence (Two-way χ 2 )... 2 Hypothesis Testing with Chi Squares... 2 STEP 1: Establish the hypotheses... 2 STEP 2: Calculate the Chi-square... 2 STEP 3: Assess the significance level... 3 STEP 4: Make a decision... 4 Post hoc Testing... 4 METHOD 1... 4 METHOD 2... 5 Glossary... 6 References... 7

About Chi Squares What is a CHI SQUARE? A chi square is a non-parametric test used to explore the relationships between categorical variables. Parametric statistics are concerned with interval or ratio data. Sometimes, however, the researcher wishes to use nominal or ordinal to examine the frequency of certain categories. As you might expect, these types of analyses are called non-parametric statistics. Chi Squares Sometimes researchers are interested in determining differences in the frequency of events; however, we are limited in what we can do statistically with categorical data because we are unable to calculate the mean (which is a foundation of many statistical tests). The Chi square statistic allows us to test hypotheses about categorical variables by examining if there is a relationship between 2 or more categorical variables. Chi squares are most often used to test the relationship between 2 variables. They can also be used to make a multivariate assessment of the relationship between 3 or more variables. Goodness of fit test (One-way χ 2 ) In this case, only 1 variable is being examined. Goodness of fit tests are often used for examining fairness in games of chance, such as cards, roulette and dice games. 1

Test of Independence (Two-way χ 2 ) Two variables with at least two levels each are being examined. A level is a subcategory of the variable. The goal of this test is to learn if the variables are independent of each other. Hypothesis Testing with Chi Squares STEP 1: Establish the hypotheses The hypotheses help state the purpose of the study, by indicating what it is we are trying to conclude from conducting the particular study. The null hypothesis is that the two variables are independent: H 0 = 0 The alternative hypothesis is that the two variables are related: H 1 0 STEP 2: Calculate the Chi-square There are 3 things that need to be known before the Chi square can be calculated: The first is the observed count. This is the observed frequency of each variable. The second is the expected count. This is the value that we would expect to find if there was no error in our study. It is calculated with: E = (row total) x (column total) table total 2

Finally, we calculate the Chi square (χ ). It is calculated with: χ 2 = Σ (O E) 2 E It is easier to conceptualize these 3 steps in a chart : The cell in the bottom right is the one that gives us the value of χ 2. O E (O-E) 2 (O-E) 2 /E Σ(O - E) 2 / E = χ 2 STEP 3: Assess the significance level In order to conclude if the relationship we find between the two variables is statistically significant, we have to set the alpha level. If the value that we calculated is higher than the alpha level then we can conclude that the relationship is statistically significant. Usually the alpha level is set at α =.05, which means that there is less than a 5% possibility that the result occurred by chance. The alpha level and the degrees of freedom allow us to find the critical value from a statistical table, and we compare this number to the observed value in order to determine statistical significance. Degrees of freedom for goodness of fit tests are calculated with: df = k 1, where k = number of levels Degrees of freedom for tests of independence are calculated with: df = (number of rows 1)(number of columns 1) 3

STEP 4: Make a decision If the obtained χ 2 value is larger than the critical χ 2 value, then we can conclude that there is a significant relationship between the variables. If the obtained χ 2 value is smaller than the critical χ 2 value, then we can conclude that there is no relationship between the variables. Post hoc Testing When you conduct a Chi-square test of independence with variables that have more than 2 levels and find a significant result, post hoc tests need to be performed in order to determine where the significance lies. METHOD 1 Post hoc means after the fact. A post hoc test is performed after a significant result is found. The post hoc test compares each of the levels and allows you to determine which pairs of cells (levels) are significantly different. This involves simply looking at the table. By examining the residual levels (the observed level the expected level), you can make a judgment on which pairs of cells are most significant. The percentage of contribution can be calculated and compared to see which pairs contribute the most to the χ 2 value. The percentage is calculated with: % = [(O - E) 2 / χ 2 ] x 100 E 4

This is not the most accurate method of post hoc testing because it is based mostly on visual interpretation. METHOD 2 This involves extracting a 2x2 table of interest from the original table. It is easy to collapse ordinal variables into a dichotomy because the data is rank ordered. This is much more difficult for nominal variables because they are distinct and unordered categories. NOTE: you are not removing data, but rather combining the cells in a meaningful way. This will produce a new χ 2 value that might be a significant result. We need to recognize that this value is part of the larger table and that it doesn t relate directly to the original analysis. This inflates Type I error. In order to counter the inflated Type I error we need to adopt a conservative alpha level by performing a Bonferroni adjustment (k): k = r! x c! 2!(r-1)! 2!(c-1)! Where r is the number of rows, c is the number of columns and! refers to a factorial, which is a series of multiplications. For example, 2! = (2)(1), 3! = (3)(2)(1), 4! = (4)(3)(2)(1), etc. Next, we divide our original alpha level by k to determine the conservative alpha level. We compare our new χ 2 value to the critical value at this alpha level to determine if the result is significant. 5

Glossary Bonferroni adjustment: Categorical variable: Expected count: Non-parametric test: Observed count: Post hoc test: Type I error: a statistical procedure that corrects for Type I error inflation when the cells in a chi-square table have been collapsed. one that we can place into categories, but these categories may not have any logical ordering (Nominal or ordinal). the expected frequency. a type of test that does not assume that the variables follow a specific distribution, like a normal distribution. the recorded frequency. a test that is performed after a significant result is found. When there are more than two levels, this test shows which level is significant. occurs when you reject the null hypothesis even though it is true. In other words, it is concluding that there is a significant relationship when there actually isn t one. 6

References Cohen, R. J., & Swerdlik, M. E. (2005). Psychological testing and assessment: An introduction to tests and measurements. (6th ed.) NY: McGraw-Hill. Gardner, R. C. (2001). Psychological statistics using SPSS for Windows. New Jersey: Prentice Hall. Hamilton, L. C. (1996). Data analysis for social scientists. Scarborough: Duxbury Press. Quantitative Methods in Social Sciences e-lessons. (n.d.). The chi square. Retrieved on May 30, 2007 from http://ccnmtl.columbia.edu/projects/qmss/chi_test.htmlm Utts, J. (2005). Seeing through the statistics. (3rd ed.). Toronto: Thomson Brooks/Cole. Wilson, J. H. (2005). Essential Statistics. New Jersey: Pearson Prentice Hall. 7