Having a coin come up heads or tails is a variable on a nominal scale. Heads is a different category from tails.



Similar documents
Odds ratio, Odds ratio test for independence, chi-squared statistic.

Testing Research and Statistical Hypotheses

Bivariate Statistics Session 2: Measuring Associations Chi-Square Test

Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1)

LAB : THE CHI-SQUARE TEST. Probability, Random Chance, and Genetics

Independent samples t-test. Dr. Tom Pierce Radford University

AP: LAB 8: THE CHI-SQUARE TEST. Probability, Random Chance, and Genetics

Is it statistically significant? The chi-square test

This chapter discusses some of the basic concepts in inferential statistics.

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression

CHAPTER IV FINDINGS AND CONCURRENT DISCUSSIONS

Recommend Continued CPS Monitoring. 63 (a) 17 (b) 10 (c) (d) 20 (e) 25 (f) 80. Totals/Marginal

Association Between Variables

CHAPTER 11 CHI-SQUARE AND F DISTRIBUTIONS

The Chi-Square Test. STAT E-50 Introduction to Statistics

Chapter 13. Chi-Square. Crosstabs and Nonparametric Tests. Specifically, we demonstrate procedures for running two separate

Nonparametric Tests. Chi-Square Test for Independence

Calculating P-Values. Parkland College. Isela Guerra Parkland College. Recommended Citation

12.5: CHI-SQUARE GOODNESS OF FIT TESTS

Part 3. Comparing Groups. Chapter 7 Comparing Paired Groups 189. Chapter 8 Comparing Two Independent Groups 217

TABLE OF CONTENTS. About Chi Squares What is a CHI SQUARE? Chi Squares Hypothesis Testing with Chi Squares... 2

Simple Regression Theory II 2010 Samuel L. Baker

Hypothesis testing - Steps

Chi-square test Fisher s Exact test

UNDERSTANDING THE DEPENDENT-SAMPLES t TEST

Topic 8. Chi Square Tests

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...

Independent t- Test (Comparing Two Means)

Week 4: Standard Error and Confidence Intervals

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means

The Dummy s Guide to Data Analysis Using SPSS

Two Related Samples t Test

Crosstabulation & Chi Square

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

UNDERSTANDING THE TWO-WAY ANOVA

SCHOOL OF HEALTH AND HUMAN SCIENCES DON T FORGET TO RECODE YOUR MISSING VALUES

Categorical Data Analysis

" Y. Notation and Equations for Regression Lecture 11/4. Notation:

One-Way ANOVA using SPSS SPSS ANOVA procedures found in the Compare Means analyses. Specifically, we demonstrate

Elementary Statistics

Using Stata for Categorical Data Analysis

Chapter 9. Two-Sample Tests. Effect Sizes and Power Paired t Test Calculation

Lesson 3: Calculating Conditional Probabilities and Evaluating Independence Using Two-Way Tables

Chapter 5 Analysis of variance SPSS Analysis of variance

DDBA 8438: The t Test for Independent Samples Video Podcast Transcript

Final Exam Practice Problem Answers

3.4 Statistical inference for 2 populations based on two samples

MATH 140 Lab 4: Probability and the Standard Normal Distribution

Understand the role that hypothesis testing plays in an improvement project. Know how to perform a two sample hypothesis test.

Comparing Multiple Proportions, Test of Independence and Goodness of Fit

Introduction to Analysis of Variance (ANOVA) Limitations of the t-test

Descriptive Statistics

CONTINGENCY TABLES ARE NOT ALL THE SAME David C. Howell University of Vermont

C. The null hypothesis is not rejected when the alternative hypothesis is true. A. population parameters.

Math 58. Rumbos Fall Solutions to Review Problems for Exam 2

Stats Review Chapters 9-10

Math 108 Exam 3 Solutions Spring 00

First-year Statistics for Psychology Students Through Worked Examples

Solutions to Homework 10 Statistics 302 Professor Larget

How to Make APA Format Tables Using Microsoft Word

WISE Power Tutorial All Exercises

Using Excel for inferential statistics

Fairfield Public Schools

Chapter 7. Comparing Means in SPSS (t-tests) Compare Means analyses. Specifically, we demonstrate procedures for running Dependent-Sample (or

November 08, S8.6_3 Testing a Claim About a Standard Deviation or Variance

Hypothesis testing. c 2014, Jeffrey S. Simonoff 1

2 Sample t-test (unequal sample sizes and unequal variances)

Psychology 60 Fall 2013 Practice Exam Actual Exam: Next Monday. Good luck!

Analysis of Variance ANOVA

Section 12 Part 2. Chi-square test

Statistiek I. Proportions aka Sign Tests. John Nerbonne. CLCG, Rijksuniversiteit Groningen.

Chapter 23. Two Categorical Variables: The Chi-Square Test

THE FIRST SET OF EXAMPLES USE SUMMARY DATA... EXAMPLE 7.2, PAGE 227 DESCRIBES A PROBLEM AND A HYPOTHESIS TEST IS PERFORMED IN EXAMPLE 7.

American Journal Of Business Education July/August 2012 Volume 5, Number 4

5.5. Solving linear systems by the elimination method

Introduction to Hypothesis Testing

Students' Opinion about Universities: The Faculty of Economics and Political Science (Case Study)

Statistical Impact of Slip Simulator Training at Los Alamos National Laboratory

UNDERSTANDING THE INDEPENDENT-SAMPLES t TEST

Statistical tests for SPSS

COMPARISONS OF CUSTOMER LOYALTY: PUBLIC & PRIVATE INSURANCE COMPANIES.

Chi Square Distribution

Common Univariate and Bivariate Applications of the Chi-square Distribution

MULTIPLE REGRESSION WITH CATEGORICAL DATA

Testing differences in proportions

INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA)

Chapter 23 Inferences About Means

AP STATISTICS (Warm-Up Exercises)

An Introduction to Statistics Course (ECOE 1302) Spring Semester 2011 Chapter 10- TWO-SAMPLE TESTS

II. DISTRIBUTIONS distribution normal distribution. standard scores

statistics Chi-square tests and nonparametric Summary sheet from last time: Hypothesis testing Summary sheet from last time: Confidence intervals

Chapter 8 Hypothesis Testing Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing

2 GENETIC DATA ANALYSIS

Experimental Designs (revisited)

Unit 26 Estimation with Confidence Intervals

Nonparametric Statistics

Test Positive True Positive False Positive. Test Negative False Negative True Negative. Figure 5-1: 2 x 2 Contingency Table

CALCULATIONS & STATISTICS

Biostatistics: DESCRIPTIVE STATISTICS: 2, VARIABILITY

Transcription:

Chi-square Goodness of Fit Test The chi-square test is designed to test differences whether one frequency is different from another frequency. The chi-square test is designed for use with data on a nominal scale, when all you know about people is the category they're in. The chi-square test gives you a way of comparing the results that you should get if the number of passengers has no effect on the frequency of stopping to the results you actually got. For example: Having a coin come up heads or tails is a variable on a nominal scale. Heads is a different category from tails. If we flipped a coin 20 times how many times should it come up tails? [10] Say the actual number of tails was 18. Is this number far enough away from 10 for you to be suspicious that the coin was rigged. The chi-square test tests whether the odds are less than 5% of getting 18 tails when you know that 10 tails is what you would expect to get. What we're going to do is to calculate a value for the chi-square statistic and then compare it to a value for chi-square that we can look up in the back of the stats book. That number will tell you how large the calculated value has to be in order to be confident that the result was not just due to chance. To calculate the value for chi-square for the coin data you need to know what frequencies you would expect to get. 10 heads and 10 tails. You know that the actual values were 2 heads and 18 tails. To get chi-square: 1. For every possible outcome, subtract the expected frequency from the observed, or actual, frequency. 2-10 = -8 18-10 = 8 2. Now square each of these differences: 64 64 3. Divide each of these squared differences by the original expected frequency for that cell:

64/10 = 6.4 64/10 = 6.4 4. Add these numbers up. This gives you the value for chi-square. [12.8] The equation for calculating Chi-square is thus Χ 2 = The steps for calculating in table for are f(o) f(e) f(o) - f(e) [f(o) - f(e)] 2 [f(o) - f(e)] 2 / f(e) ------------------------------------------------------------------------------------- Heads 2 10-8 64 6.4 Tails 18 10 +8 64 6.4 ------------------------------------------------------------------------------------- Χ 2 = 12.8 To get the critical value for Chi-square: 1. You need to know the number of degrees of freedom. This is equal to number of possible categories minus one. [2-1 = 1] 2. You need to know which column to look the value up in. If you are going to use 5% as the odds of the frequencies being different by chance, look the number up in the.05 column. [3,48] In this example, the number for chi-square that we calculated was larger than the comparison number we looked up in the book. This tells us that there is less a 5% chance that the percentages are different by chance. So you might decide that the coin is rigged. APA format for writing the conclusion statement is: The frequencies of the tails option in the coin flips is significantly greater from what one would expect to get by chance, Χ 2 (1, N = 20) = 12.8, p <.05. A Chi-square test when there is one variable is often referred to as a Goodness of Fit Test.

Chi-Square Test of Independence Now let s say we observe drivers at an intersection with a stop sign. We record two pieces of information: whether the driver stops at the stop sign and whether the driver is talking on a cell phone. The question we want to answer is whether drivers are less likely to come to come to a complete stop at the stop sign when talking on a cell phone compared to when drivers are not talking on a cell phone. Here are the data Cell phone No cell phone ---------------- ---------------- stop 5 15 20 -- ---------------- ---------------- don't stop 20 10 30 ---------------- ---------------- 25 25 N=50 The question is really whether the percentage of drivers who stopped while talking on a cell phone is significantly different from the percentage of drivers who stopped while not talking on a cell phone. Obviously, 5/25 or 20% of drivers with no passengers came to a complete stop. 15/25, or 60% of drivers with one or more passengers came to a complete stop. The chi-square test will still tell us if the observed frequencies are significantly different from the expected frequencies. We can use the numbers in the row and column margins to help us to compute the expected frequencies. The expected frequency for each cell is computed using the following formula: f(e) = [row total)(column total)] / N For the top left cell, the expected frequency is: (20)(25) / 50 = 500/50 = 10 top right: (20)(25) / 50 = 500/50 = 10

bottom left: (30)(25) / 50 = 750/50 = 15 bottom right: (30)(25) / 50 = 750/50 = 15 - Once you have the expected frequencies, the calculation for chi-square is exactly the same except that you have four rows in the calculations instead of two. f(o) - f(e) [f(o) - f(e)] 2 [f(o) - f(e)] 2 / f(e) --------------------------------------------------------------- Stop no pass 5-10 = -5 25 2.5 Stop -- pass 5-10 = 5 25 2.5 No stop no pass 20-15 = 5 25 1.67 No Stop -- pass 10-15 = -5 25 1.67 ---------------------------------------------------------------- 8.34 You compare this observed value for chi-square against a comparison value you look up in the chi-square table. You still use the.05 column because we're using 5% as the odds we're using to make our decision. The number of degrees of freedom to use is found from the following equation: df = (# of rows - 1)(# of columns - 1) In this case we have (2-1)(2-1) = (1)(1) = 1, giving us a comparison value of 3.84. We can say that the percentages found in the first column are significantly different from the percentages found in the second column. Drivers on a cell phone are significantly less likely to stop at a stop sign than drivers not talking on a cell phone, Χ 2 (1, N = 50) = 8.34, p <.05.

Chi-Square Practice Problems For each of the problems below, a. Please state both the null and alternative hypotheses for this question. b. Provide the decision rule for making this decision. Use an alpha level of.05. c. Show all of the work necessary to calculate the appropriate statistic. d. What conclusion are you allowed to draw? Write a conclusions sentence in APA format (i.e., it includes the appropriate statistical information). 1. From Thorne (1989): At a state university, the student population is approximately onethird male and two-thirds female. Over a two-day period the gender of each student entering the student union is recorded, with the following results: males, 452; females, 1548. Determine whether males and females are as likely to enter the union as would be predicted on the basis of their percentage in the student population. 2. A chimpanzee was trained to make same-different judgments about pairs of stimuli. For pictures of objects, the animal was correct on 23 out of 24 trials. Did the chimp perform significantly better than chance? 3. From Sprinthall (1990). A researcher is interested in whether or not a significant trend exists regarding the popularity of certain work shifts among police officers. A random sample of 60 police officers is selected from a large metropolitan police force. The officers are asked to indicate which of three work shifts they preferred. The results show that 40 officers prefer the first shift, 10 prefer the second shirt, and 10 prefer the third shift. Do the results deviate significantly from what would be expected due to chance? 4. A professor wants to determine whether her department should keep the requirement of college algebra as a prerequisite for an Introductory Statistics course. Accordingly, she allows some students to register for the course on a pass-fail basis regardless of whether or not they have had the prerequisite. Of the 70 students in the class, 40 have had algebra and 30 have not. At the end of the semester, the professor compares the number of students passing or failing the class with whether or not they had algebra. The results are presented blow. Are students more likely to pass the course if they have taken college algebra? Pass Fail ----------------------- Algebra 34 6 No algebra 12 18 5. In a study of intraspecific aggression (aggression directed toward other members of the

same species), an experimenter finds that 16 out of 23 animals tested in species A exhibit aggression, wile only 6 of 25 are aggressive in species B. Do the two species differ significantly in terms of intraspecific aggression? 6. In a study of the effectiveness of an antipsychotic drug, patients treated with the drug were compared to patients receiving a placebo. In terms of the number relapsing, 698 of 1,068 patients relapsed after taking the placebo while 639 out of 2,127 patients relapsed after taking the antipsychotic drug. Test the prediction that the antipsychotic is significantly more effective in preventing relapse than the placebo. Answers: 1. HO: Males and females are equally likely to enter the student union. H1. Males and females are not equally likely to enter the student union. Conclusion sentence: Males and females are not equally likely to enter the student union, Χ 2 (1, N = 2000) = 103.68, p <.05. 2. HO: The chimp did not perform significantly better than chance. H1: The chimp performed significantly better than chance. Conclusion sentence: The chimp performed significantly better than chance, Χ 2 (1, N = 24) = 20.16, p <.05. 3. HO: Police officers do not have a preference in terms of the shift they work H1: Police officers have a preference in terms of the shift they work Decision rule: If Χ 2 +5.99 or if Χ 2-5.99, reject HO. Conclusion sentence: Police officers have a preference in terms of the shift they work, Χ 2 (1, N = 60) = 30.0, p <.05. 4. HO: Students are not more likely to pass the course if they have taken college algebra H1: Students are more likely to pass the course if they have taken college algebra

Conclusion sentence: Students are more likely to pass the course if they have taken college algebra, Χ 2 (1, N = 70) = 15.39, p <.05. 5. HO: The two species do not differ significantly in terms of intraspecific aggression. H1: The two species differ significantly in terms of intraspecific aggression. Conclusion sentence: The two species differ significantly in terms of intraspecific aggression, Χ 2 (1, N = 48) = 10.02, p <.05. 6. HO: The antipsychotic is not significantly more effective in preventing relapse than the placebo. H1: The antipsychotic is significantly more effective in preventing relapse than the placebo. Conclusion sentence: The antipsychotic is significantly more effective in preventing relapse than the placebo, Χ 2 (1, N = 3195) = 364.32, p <.05.