Is it statistically significant? The chi-square test

Similar documents
Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1)

Chapter 23. Two Categorical Variables: The Chi-Square Test

CHAPTER IV FINDINGS AND CONCURRENT DISCUSSIONS

Adverse Impact Ratio for Females (0/ 1) = 0 (5/ 17) = Adverse impact as defined by the 4/5ths rule was not found in the above data.

The Chi-Square Test. STAT E-50 Introduction to Statistics

Recommend Continued CPS Monitoring. 63 (a) 17 (b) 10 (c) (d) 20 (e) 25 (f) 80. Totals/Marginal

Mind on Statistics. Chapter 15


Simulating Chi-Square Test Using Excel

Comparing Multiple Proportions, Test of Independence and Goodness of Fit

Independent t- Test (Comparing Two Means)

Bivariate Statistics Session 2: Measuring Associations Chi-Square Test

Using Stata for Categorical Data Analysis

Odds ratio, Odds ratio test for independence, chi-squared statistic.

Chi Squared and Fisher's Exact Tests. Observed vs Expected Distributions

Use of the Chi-Square Statistic. Marie Diener-West, PhD Johns Hopkins University

Chi-square test Fisher s Exact test

Association Between Variables

Having a coin come up heads or tails is a variable on a nominal scale. Heads is a different category from tails.

STAT 145 (Notes) Al Nosedal Department of Mathematics and Statistics University of New Mexico. Fall 2013

Elementary Statistics

An Introduction to Statistics Course (ECOE 1302) Spring Semester 2011 Chapter 10- TWO-SAMPLE TESTS

Elementary Statistics Sample Exam #3

SUGI 29 Statistics and Data Analysis

Section 12 Part 2. Chi-square test

3.4 Statistical inference for 2 populations based on two samples

Final Exam Practice Problem Answers

TABLE OF CONTENTS. About Chi Squares What is a CHI SQUARE? Chi Squares Hypothesis Testing with Chi Squares... 2

Calculating P-Values. Parkland College. Isela Guerra Parkland College. Recommended Citation

Inferential Statistics. What are they? When would you use them?

Row vs. Column Percents. tab PRAYER DEGREE, row col

Chi Square Distribution

Statistics. One-two sided test, Parametric and non-parametric test statistics: one group, two groups, and more than two groups samples

American Journal Of Business Education July/August 2012 Volume 5, Number 4

Chapter 5 Analysis of variance SPSS Analysis of variance

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

Math 58. Rumbos Fall Solutions to Review Problems for Exam 2

AP STATISTICS (Warm-Up Exercises)

Enrollment Data Undergraduate Programs by Race/ethnicity and Gender (Fall 2008) Summary Data Undergraduate Programs by Race/ethnicity

IB Practice Chi Squared Test of Independence

Contingency Tables and the Chi Square Statistic. Interpreting Computer Printouts and Constructing Tables

Test Positive True Positive False Positive. Test Negative False Negative True Negative. Figure 5-1: 2 x 2 Contingency Table

Nonparametric Statistics

CHAPTER 11 CHI-SQUARE AND F DISTRIBUTIONS

Stats Review Chapters 9-10

Examining Differences (Comparing Groups) using SPSS Inferential statistics (Part I) Dwayne Devonish

General Method: Difference of Means. 3. Calculate df: either Welch-Satterthwaite formula or simpler df = min(n 1, n 2 ) 1.

Nonparametric Tests. Chi-Square Test for Independence

SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question.

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...

Math 108 Exam 3 Solutions Spring 00

Testing differences in proportions

2014 Only Influencers Marketing Salary Guide

This chapter discusses some of the basic concepts in inferential statistics.

Solutions to Homework 10 Statistics 302 Professor Larget

Advanced Statistical Analysis of Mortality. Rhodes, Thomas E. and Freitas, Stephen A. MIB, Inc. 160 University Avenue. Westwood, MA 02090

Additional sources Compilation of sources:

PERCEPTION OF SENIOR CITIZEN RESPONDENTS AS TO REVERSE MORTGAGE SCHEME

Statistical Impact of Slip Simulator Training at Los Alamos National Laboratory

>

Binary Logistic Regression

CHAPTER 12. Chi-Square Tests and Nonparametric Tests LEARNING OBJECTIVES. USING T.C. Resort Properties

Mind on Statistics. Chapter 13

11. Analysis of Case-control Studies Logistic Regression

The Effects of Demographics on Consumer Perceptions of Identity Theft in Rural and Urban Settings

Unit 12 Logistic Regression Supplementary Chapter 14 in IPS On CD (Chap 16, 5th ed.)

DIRECTIONS. Exercises (SE) file posted on the Stats website, not the textbook itself. See How To Succeed With Stats Homework on Notebook page 7!

DESCRIPTIVE STATISTICS & DATA PRESENTATION*

First-year Statistics for Psychology Students Through Worked Examples

How to set the main menu of STATA to default factory settings standards

Introduction to Analysis of Variance (ANOVA) Limitations of the t-test

Chi Square Tests. Chapter Introduction

MULTIPLE REGRESSION EXAMPLE

ISyE 2028 Basic Statistical Methods - Fall 2015 Bonus Project: Big Data Analytics Final Report: Time spent on social media

How to Make APA Format Tables Using Microsoft Word

Crosstabulation & Chi Square

Need for Sampling. Very large populations Destructive testing Continuous production process

2 GENETIC DATA ANALYSIS

Difference of Means and ANOVA Problems

Categorical Data Analysis

ESTIMATION OF THE EFFECTIVE DEGREES OF FREEDOM IN T-TYPE TESTS FOR COMPLEX DATA

Chapter 7 Section 7.1: Inference for the Mean of a Population

Multinomial and Ordinal Logistic Regression

November 08, S8.6_3 Testing a Claim About a Standard Deviation or Variance

Tutorial Segmentation and Classification

MARKETING EDUCATION: ONLINE VS TRADITIONAL

Common Univariate and Bivariate Applications of the Chi-square Distribution

SCHOOL OF HEALTH AND HUMAN SCIENCES DON T FORGET TO RECODE YOUR MISSING VALUES

statistics Chi-square tests and nonparametric Summary sheet from last time: Hypothesis testing Summary sheet from last time: Confidence intervals

IBM SPSS Statistics 20 Part 4: Chi-Square and ANOVA

IBM SPSS Statistics for Beginners for Windows

Testing Research and Statistical Hypotheses

Main Effects and Interactions

12.5: CHI-SQUARE GOODNESS OF FIT TESTS

Chapter 19 The Chi-Square Test

Student Placement in Mathematics Courses by Demographic Group Background

Transcription:

UAS Conference Series 2013/14 Is it statistically significant? The chi-square test Dr Gosia Turner Student Data Management and Analysis 14 September 2010 Page 1

Why chi-square? Tests whether two categorical variables are independent (no relationship) sex and proportion of First divisions and degree classification ethnicity and proportion of Firsts Categorical variables: sex, school type, ethnicity, classified exam result (1 st, 2.1, 2.2, 3) but not age, average exam mark Page 2

Observed vs. expected There are 4,000 finalists The probability of getting First is 50% 2,000 males and 2,000 females How many males and how many females would you expect to get First? 1,000 males and 1,000 females (50% of 2,000) But the observed values are 1,200 and 800 Is it significant? Page 3

Steps to follow State the hypothesis Calculate the expected values Use the observed and expected values to calculate the chi-square test statistic Establish the significance level you need (usually 95% p = 0.05) and the number of degrees of freedom Compare the chi-square statistic with the critical value from the table Make a decision about your hypothesis Page 4

Hypothesis Hypothesis Observed values Expected values Calculation of chi-square Degrees of freedom Compare with the table Conclusion H 0 : There is no association between gender and proportion of Firsts (the proportion is the same for males and females) H 1 : There is an association between gender and proportion of Firsts (the proportion is different for males and females) Page 5

Observed values Hypothesis Observed values Expected values Calculation of chi-square Degrees of freedom Compare with the table Conclusion Male Female Total No First 6202 (68.5%) 6429 (76.7%) 12631 (72.5%) First 2851 (31.5%) 1952 (23.2%) 4803 (27.5%) Total 9053 8381 17434 (100%) Proportion of students with First: 4803/17434 = 0.275 (*100%) = 27.5% Proportion of males with First: 2851/9053 = 0.315 (*100%) = 31% Proportion of females with First: 1952/8381 = 0.232 (*100%) = 23% Page 6

Expected values Hypothesis Observed values Expected values Calculation of chi-square Degrees of freedom Compare with the table Conclusion No First 6202 6558.93 First 2851 2494.06 Male Female Total 6429 6072.06 1952 2308.93 12631 4803 Total 9053 8381 17343 Proportion with Firsts: 0.275 Proportion with no Firsts: (1 0.273) = 0.725 Males with no Firsts: 9053*0.725 = 6558.93 Females with no Firsts: 8381*0.725 = 6072.07 Males with Firsts: 9053*0.275 = 2494.07 Females with Firsts: 8381*0.275 = 2308.93 Page 7

Calculation Hypothesis Observed values Expected values Calculation of chi-square Degrees of freedom Compare with the table Conclusion χ 2 = 19.42 + 20.98 + 51.08 + 55.17 = 146.67 Page 8

Degrees of freedom Hypothesis Observed values Expected values Calculation of chi-square Degrees of freedom Compare with the table Conclusion Number of degrees of freedom is calculated by multiplying the number of rows minus 1 by the number of columns minus 1. df = (rows-1)x(columns-1) For a 2x2 table that is (2-1)x(2-1) = 1 Page 9

Table of critical values Hypothesis Observed values Expected values Calculation of chi-square Degrees of freedom Compare with the table Conclusion Look up the critical chi-square statistic value for p = 0.05 (95% confidence level) with 1 degree of freedom 3.84 Page 10

Is it significant? Hypothesis Observed values Expected values Calculation of chi-square Degrees of freedom Compare with the table Conclusion Test value > table value then REJECT H 0 146.67 > 3.84 We reject H 0 the proportion of Firsts is the same for males and females Instead the H 1 is true that the proportion of firsts is different for males and females Page 11

Steps in Excel Create table with observed values Table with expected values calculated Use Excel CHITEST function to calculate the p value If p 0.05 statistically significant If p > 0.05 not statistically significant Page 12

Limitations of chi-square If the test comes out significant that means there is some association. No further information. Tests only two variables at one time Some cells may have small values. Each cell should have at least a value of 1, and no more than 20% of cells can have values lower than 5. Potential problem with small courses when compared between many categories of ethnicity Page 13

Limitations of chi-square (2) Degree classification Fee status First 2.1 2.2 Total Home 21 91 2 114 EU 0 5 1 6 Overseas 5 1 0 6 Total 26 97 3 126 Degree classification Fee status First No First Total Home 21 93 114 International 5 7 12 Total 26 100 126 Page 14