# Chi Square Distribution

 To view this video please enable JavaScript, and consider upgrading to a web browser that supports HTML5 video
Save this PDF as:

Size: px
Start display at page:

## Transcription

1 17. Chi Square A. Chi Square Distribution B. One-Way Tables C. Contingency Tables D. Exercises Chi Square is a distribution that has proven to be particularly useful in statistics. The first section describes the basics of this distribution. The following two sections cover the most common statistical tests that make use of the Chi Square distribution. The section One-Way Tables shows how to use the Chi Square distribution to test the difference between theoretically expected and observed frequencies. The section Contingency Tables shows how to use Chi Square to test the association between two nominal variables. This use of Chi Square is so common that it is often referred to as the Chi Square Test. 599

2 Chi Square Distribution by David M. Lane Prerequisites Chapter 1: Distributions Chapter 7: Standard Normal Distribution Chapter 10: Degrees of Freedom Learning Objectives 1. Define the Chi Square distribution in terms of squared normal deviates 2. Describe how the shape of the Chi Square distribution changes as its degrees of freedom increase A standard normal deviate is a random sample from the standard normal distribution. The Chi Square distribution is the distribution of the sum of squared standard normal deviates. The degrees of freedom of the distribution is equal to the number of standard normal deviates being summed. Therefore, Chi Square with one degree of freedom, written as χ 2 (1), is simply the distribution of a single normal deviate squared. The area of a Chi Square distribution below 4 is the same as the area of a standard normal distribution below 2, since 4 is 2 2. Consider the following problem: you sample two scores from a standard normal distribution, square each score, and sum the squares. What is the probability that the sum of these two squares will be six or higher? Since two scores are sampled, the answer can be found using the Chi Square distribution with two degrees of freedom. A Chi Square calculator can be used to find that the probability of a Chi Square (with 2 df) being six or higher is The mean of a Chi Square distribution is its degrees of freedom. Chi Square distributions are positively skewed, with the degree of skew decreasing with increasing degrees of freedom. As the degrees of freedom increases, the Chi Square distribution approaches a normal distribution. Figure 1 shows density functions for three Chi Square distributions. Notice how the skew decreases as the degrees of freedom increase. 600

3 Density df = df = 4 Density Density df = Figure 1. Chi Square distributions with 2, 4, and 6 degrees of freedom. The Chi Square distribution is very important because many test statistics are approximately distributed as Chi Square. Two of the more common tests using the 601

4 Chi Square distribution are tests of deviations of differences between theoretically expected and observed frequencies (one-way tables) and the relationship between categorical variables (contingency tables). Numerous other tests beyond the scope of this work are based on the Chi Square distribution. 602

5 One-Way Tables (Testing Goodness of Fit) by David M. Lane Prerequisites Chapter 5: Basic Concepts of Probability Chapter 11: Significance Testing Chapter 17: Chi Square Distribution Learning Objectives 1. Describe what it means for there to be theoretically-expected frequencies 2. Compute expected frequencies 3. Compute Chi Square 4. Determine the degrees of freedom The Chi Square distribution can be used to test whether observed data differ significantly from theoretical expectations. For example, for a fair six-sided die, the probability of any given outcome on a single roll would be 1/6. The data in Table 1 were obtained by rolling a six-sided die 36 times. However, as can be seen in Table 1, some outcomes occurred more frequently than others. For example, a 3 came up nine times, whereas a 4 came up only two times. Are these data consistent with the hypothesis that the die is a fair die? Naturally, we do not expect the sample frequencies of the six possible outcomes to be the same since chance differences will occur. So, the finding that the frequencies differ does not mean that the die is not fair. One way to test whether the die is fair is to conduct a significance test. The null hypothesis is that the die is fair. This hypothesis is tested by computing the probability of obtaining frequencies as discrepant or more discrepant from a uniform distribution of frequencies as obtained in the sample. If this probability is sufficiently low, then the null hypothesis that the die is fair can be rejected. Table 1. Outcome Frequencies from a Six-Sided Die Outcome Frequency

6 The first step in conducting the significance test is to compute the expected frequency for each outcome given that the null hypothesis is true. For example, the expected frequency of a 1 is 6, since the probability of a 1 coming up is 1/6 and there were a total of 36 rolls of the die. Expected frequency = (1/6)(36) = 6 ables Note that the expected frequencies are expected only in a theoretical sense. We do not really expect the observed frequencies to match the expected frequencies exactly. The calculation continues as follows. Letting E be the expected frequency of an outcome and O be the observed frequency of that outcome, compute ( ) for each outcome. Table 2 shows these calculations. ( ) Table 2. Outcome Frequencies = 5.33 from a Six-Sided Die Outcome E O (E-O) 2 /E ay Tables ( ) ( ) = Next we add up all the values in Column 4 of Table 2. ( ( ) ) = = = 5.33 cy Tables ( ) 604

7 ncy Tables cy Tables ( ) = 5.33 ( ) = 5.33 This sampling distribution of ( ) ( ) is approximately distributed as Chi Square with k-1 degrees of freedom, where k is the number of categories. Therefore, for this problem the test statistic is = = ( ) = = ( ) = = which means the value of Chi Square with 5 degrees of freedom is From a Chi Square calculator it can be determined that the probability of a Chi Square of or larger is Therefore, the null hypothesis that the die is fair cannot be rejected. The Chi Square test can also be used to test other deviations between expected and observed frequencies. The following example shows a test of whether the variable University GPA in the SAT and College GPA case study is normally distributed. The first column, = in Table 3 shows the normal distribution divided into five ranges. The second, = column shows the proportions of a normal distribution falling in the ranges specified in the first column. The expected frequencies (E) are calculated by multiplying the number of scores (105) by the proportion. The final column shows the observed number of scores in each range. It is clear that the observed frequencies vary greatly from the expected frequencies. Note that if the distribution were normal, then there would have been only about 35 scores between 0 and 1, whereas 60 were observed. ( ) = = ( ) = = Table 3. Expected and Observed Scores for 105 University GPA Scores. Range Proportion E O Above to to Below

8 = The test of whether the observed scores deviate significantly from the expected scores is computed using the familiar calculation. ( ) = = The subscript 3 means there are three degrees of freedom. As before, the degrees of freedom is the number of outcomes minus one, which is three in this example. The Chi Square distribution calculator shows that p < for this Chi Square. Therefore, the null hypothesis that the scores are normally distributed can be rejected. ntingency Tables, = ( ) = =

9 Contingency Tables by David M. Lane Prerequisites Chapter 17: Chi Square Distribution Chapter 17: One-Way Tables Learning Objectives 1. State the null hypothesis tested concerning contingency tables 2. Compute expected cell frequencies 3. Compute Chi Square and df This section shows how to use Chi Square to test the relationship between nominal variables for significance. For example, Table 1 shows the data from the Mediterranean Diet and Health case study. Table 1. Frequencies for Diet and Health Study Outcome Diet Cancers Fatal Heart Disease Non-Fatal Heart Disease Healthy Total AHA Mediterranean Total The question is whether there is a significant relationship between diet and outcome. The first step is to compute the expected frequency for each cell based on the assumption that there is no relationship. These expected frequencies are computed from the totals as follows. We begin by computing the expected frequency for the AHA Diet/Cancers combination. Note that 22/605 subjects developed cancer. The proportion who developed cancer is therefore If there were no relationship between diet and outcome, then we would expect of those on the AHA diet to develop cancer. Since 303 subjects were on the AHA diet, we would expect (0.0364)(303) = cancers on the AHA diet. Similarly, we would expect (0.0364)(302) = cancers on the Mediterranean diet. In 607

10 = = cy Tables ( ) = 5.33 general, the expected frequency for a cell in the i th row and the j th column is equal to, = ( ) where E i,j is the expected frequency for cell i,j, T i is the total for the i th row, T j is the total for the j th column, and T is the total number of observations. For the AHA Diet/Cancers cell, i = 1, j = 1, T i = 303, T j = 22, and T = 605. Table 2 shows the expected frequencies (in parenthesis) for each cell in the experiment. Table 2 shows the expected frequencies (in parenthesis) for each cell in the experiment. ( ) ( ) = = Table 2. Observed = and Expected = Frequencies for Diet and Health Study Outcome Diet ntingency Tables AHA Cancers 15 (11.02) Fatal Heart Disease 24 (19.03) Non-Fatal Heart Disease Healthy Total 25 (16.53) 239 (256.42) Mediterranean (10.98) (18.97) (16.47) ( , = Total The significance test is conducted by computing Chi Square as follows. ( ) = = The degrees of freedom is equal to (r-1)(c-1), where r is the number of rows and c is the number of columns. For this example, the degrees of freedom is (2-1)(4-1) = 3. The Chi Square calculator can be used to determine that the probability value for a Chi Square of with three degrees of freedom is equal to Therefore, the null hypothesis of no relationship between diet and outcome can be rejected. A key assumption of this Chi Square test is that each subject contributes data to only one cell. Therefore, the sum of all cell frequencies in the table must be the same as the number of subjects in the experiment. Consider an experiment in 608

11 which each of 16 subjects attempted two anagram problems. The data are shown in Table 3. Table 3. Anagram problem data Anagram 1 Anagram 2 Solved 10 4 Did not Solve 6 12 It would not be valid to use the Chi Square test on these data since each subject contributed data to two cells: one cell based on their performance on Anagram 1 and one cell based on their performance on Anagram 2. The total of the cell frequencies in the table is 32, but the total number of subjects is only 16. The formula for Chi Square yields a statistic that is only approximately a Chi Square distribution. In order for the approximation to be adequate, the total number of subjects should be at least 20. Some authors claim that the correction for continuity should be used whenever an expected cell frequency is below 5. Research in statistics has shown that this practice is not advisable. For example, see: Bradley, Bradley, McGrath, & Cutcomb (1979). The correction for continuity when applied to 2 x 2 contingency tables is called the Yates correction. 609

12 Statistical Literacy by David M. Lane Prerequisites Chapter 17: Contingency Tables An experiment was conducted to test whether the spice saffron can inhibit liver cancer. Two groups of rats were tested. Both groups were injected with chemicals known to increase the chance of liver cancer. The experimental group was fed saffron (n = 24) whereas the control group was not (n = 8). The experiment is described here. Only 4 of the 24 subjects in the saffron group developed cancer as compared to 6 of the 8 subjects in the control group. What do you think? What method could be used to test whether this difference between the experimental and control groups is statistically significant? The Chi Square test of contingency tables could be used. It yields a Chi Squared (df = 1) of 9.50 which has an associated p of % 610

13 References Bradley, D. R., Bradley, T. D., McGrath, S. G., & Cutcomb, S. D. (1979) Type I error rate of the chi square test of independence in r x c tables that have small expected frequencies. Psychological Bulletin, 86,

14 Exercises Prerequisites All material presented in the Chi Square Chapter 1. Which of the two Chi Square distributions shown below (A or B) has the larger degrees of freedom? How do you know? 2. Twelve subjects were each given two flavors of ice cream to taste and then were asked whether they liked them. Two of the subjects liked the first flavor and nine of them liked the second flavor. Is it valid to use the Chi Square test to determine whether this difference in proportions is significant? Why or why not? 3. A die is suspected of being biased. It is rolled 25 times with the following result: 612

15 Conduct a significance test to see if the die is biased. (a) What Chi Square value do you get and how many degrees of freedom does it have? (b) What is the p value? 4. A recent experiment investigated the relationship between smoking and urinary incontinence. Of the 322 subjects in the study who were incontinent, 113 were smokers, 51 were former smokers, and 158 had never smoked. Of the 284 control subjects who were not in- continent, 68 were smokers, 23 were former smokers, and 193 had never smoked. a. Create a table displaying this data. b. What is the expected frequency in each cell? c. Conduct a significance test to see if there is a relationship between smoking and incontinence. What Chi Square value do you get? What p value do you get? d. What do you conclude? 5. At a school pep rally, a group of sophomore students organized a free raffle for prizes. They claim that they put the names of all of the students in the school in the basket and that they randomly drew 36 names out of this basket. Of the prize winners, 6 were freshmen, 14 were sophomores, 9 were juniors, and 7 were seniors. The results do not seem that random to you. You think it is a little fishy that sophomores organized the raffle and also won the most prizes. Your school is composed of 30% freshmen, 25% sophomores, 25% juniors, and 20% seniors. a. What are the expected frequencies of winners from each class? b. Conduct a significance test to determine whether the winners of the prizes were distributed throughout the classes as would be expected based on the percentage of students in each group. Report your Chi Square and p values. c. What do you conclude? 613

16 6. Some parents of the West Bay little leaguers think that they are noticing a pattern. There seems to be a relationship between the number on the kids jerseys and their position. These parents decide to record what they see. The hypothetical data appear below. Conduct a Chi Square test to determine if the parents suspicion that there is a relationship between jersey number and position is right. Report your Chi Square and p values. 7. True/false: A Chi Square distribution with 2 df has a larger mean than a Chi Square distribution with 12 df. 8. True/false: A Chi Square test is often used to determine if there is a significant relationship between two continuous variables. 9. True/false: Imagine that you want to determine if the spinner shown below is biased. You spin it 50 times and write down how many times the arrow lands in each section. You will reject the null hypothesis at the.05 level and determine that this spinner is biased if you calculate a Chi Square value of 7.82 or higher. Questions from Case Studies SAT and GPA (SG) case study 614

17 10. (SG) Answer these items to determine if the math SAT scores are normally distributed. You may want to first standardize the scores. a. If these data were normally distributed, how many scores would you expect there to be in each of these brackets: (i) smaller than 1 SD below the mean, (ii) in between the mean and 1 SD below the mean, (iii) in between the mean and 1 SD above the mean, (iv) greater than 1 SD above the mean? b. How many scores are actually in each of these brackets? c. Conduct a Chi Square test to determine if the math SAT scores are normally distributed based on these expected and observed frequencies. Diet and Health (DH) case study 11. (DH) Conduct a Pearson Chi Square test to determine if there is any relationship between diet and outcome. Report the Chi Square and p values and state your conclusions. The following questions are from ARTIST (reproduced with permission) 12. A study compared members of a medical clinic who filed complaints with a random sample of members who did not complain. The study divided the complainers into two subgroups: those who filed complaints about medical treatment and those who filed nonmedical complaints. Here are the data on the total number in each group and the number who voluntarily left the medical clinic. Set up a two-way table. Analyze these data to see if there is a relationship between complaint (no, yes - medical, yes - nonmedical) and leaving the clinic (yes or no). 615

18 13. Imagine that you believe there is a relationship between a person s eye color and where he or she prefers to sit in a large lecture hall. You decide to collect data from a random sample of individuals and conduct a chi-square test of independence. What would your two-way table look like? Use the information to construct such a table, and be sure to label the different levels of each category. 14. A geologist collects hand-specimen sized pieces of limestone from a particular area. A qualitative assessment of both texture and color is made with the following results. Is there evidence of association between color and texture for these limestones? Explain your answer. 15. Suppose that college students are asked to identify their preferences in political affiliation (Democrat, Republican, or Independent) and in ice cream (chocolate, vanilla, or straw- berry). Suppose that their responses are represented in the following two-way table (with some of the totals left for you to calculate). a. What proportion of the respondents prefer chocolate ice cream? b. What proportion of the respondents are Independents? c. What proportion of Independents prefer chocolate ice cream? d. What proportion of those who prefer chocolate ice cream are Independents? 616

19 e. Analyze the data to determine if there is a relationship between political party preference and ice cream preference. 16. NCAA collected data on graduation rates of athletes in Division I in the mid-1980s. Among 2,332 men, 1,343 had not graduated from college, and among 959 women, 441 had not graduated. a. Set up a two-way table to examine the relationship between gender and graduation. b. Identify a test procedure that would be appropriate for analyzing the relationship between gender and graduation. Carry out the procedure and state your conclusion 617

### Recommend Continued CPS Monitoring. 63 (a) 17 (b) 10 (c) 90. 35 (d) 20 (e) 25 (f) 80. Totals/Marginal 98 37 35 170

Work Sheet 2: Calculating a Chi Square Table 1: Substance Abuse Level by ation Total/Marginal 63 (a) 17 (b) 10 (c) 90 35 (d) 20 (e) 25 (f) 80 Totals/Marginal 98 37 35 170 Step 1: Label Your Table. Label

### Elementary Statistics

lementary Statistics Chap10 Dr. Ghamsary Page 1 lementary Statistics M. Ghamsary, Ph.D. Chapter 10 Chi-square Test for Goodness of fit and Contingency tables lementary Statistics Chap10 Dr. Ghamsary Page

### MATH 10: Elementary Statistics and Probability Chapter 11: The Chi-Square Distribution

MATH 10: Elementary Statistics and Probability Chapter 11: The Chi-Square Distribution Tony Pourmohamad Department of Mathematics De Anza College Spring 2015 Objectives By the end of this set of slides,

### CHI SQUARE DISTRIBUTION

CI SQUARE DISTRIBUTION 1 Introduction to the Chi Square Test of Independence This test is used to analyse the relationship between two sets of discrete data. Contingency tables are used to examine the

### 13.2 The Chi Square Test for Homogeneity of Populations The setting: Used to compare distribution of proportions in two or more populations.

13.2 The Chi Square Test for Homogeneity of Populations The setting: Used to compare distribution of proportions in two or more populations. Data is organized in a two way table Explanatory variable (Treatments)

### CHAPTER 11 CHI-SQUARE: NON-PARAMETRIC COMPARISONS OF FREQUENCY

CHAPTER 11 CHI-SQUARE: NON-PARAMETRIC COMPARISONS OF FREQUENCY The hypothesis testing statistics detailed thus far in this text have all been designed to allow comparison of the means of two or more samples

### Inferential Statistics

Inferential Statistics Sampling and the normal distribution Z-scores Confidence levels and intervals Hypothesis testing Commonly used statistical methods Inferential Statistics Descriptive statistics are

### Bivariate Statistics Session 2: Measuring Associations Chi-Square Test

Bivariate Statistics Session 2: Measuring Associations Chi-Square Test Features Of The Chi-Square Statistic The chi-square test is non-parametric. That is, it makes no assumptions about the distribution

### When to use a Chi-Square test:

When to use a Chi-Square test: Usually in psychological research, we aim to obtain one or more scores from each participant. However, sometimes data consist merely of the frequencies with which certain

### The Chi Square Test. Diana Mindrila, Ph.D. Phoebe Balentyne, M.Ed. Based on Chapter 23 of The Basic Practice of Statistics (6 th ed.

The Chi Square Test Diana Mindrila, Ph.D. Phoebe Balentyne, M.Ed. Based on Chapter 23 of The Basic Practice of Statistics (6 th ed.) Concepts: Two-Way Tables The Problem of Multiple Comparisons Expected

### Chi Square for Contingency Tables

2 x 2 Case Chi Square for Contingency Tables A test for p 1 = p 2 We have learned a confidence interval for p 1 p 2, the difference in the population proportions. We want a hypothesis testing procedure

### Is it statistically significant? The chi-square test

UAS Conference Series 2013/14 Is it statistically significant? The chi-square test Dr Gosia Turner Student Data Management and Analysis 14 September 2010 Page 1 Why chi-square? Tests whether two categorical

### Math 58. Rumbos Fall 2008 1. Solutions to Review Problems for Exam 2

Math 58. Rumbos Fall 2008 1 Solutions to Review Problems for Exam 2 1. For each of the following scenarios, determine whether the binomial distribution is the appropriate distribution for the random variable

### PASS Sample Size Software

Chapter 250 Introduction The Chi-square test is often used to test whether sets of frequencies or proportions follow certain patterns. The two most common instances are tests of goodness of fit using multinomial

### CHI-Squared Test of Independence

CHI-Squared Test of Independence Minhaz Fahim Zibran Department of Computer Science University of Calgary, Alberta, Canada. Email: mfzibran@ucalgary.ca Abstract Chi-square (X 2 ) test is a nonparametric

### Chi-square test Testing for independeny The r x c contingency tables square test

Chi-square test Testing for independeny The r x c contingency tables square test 1 The chi-square distribution HUSRB/0901/1/088 Teaching Mathematics and Statistics in Sciences: Modeling and Computer-aided

### Research Methods 1 Handouts, Graham Hole,COGS - version 1.0, September 2000: Page 1:

Research Methods 1 Handouts, Graham Hole,COGS - version 1.0, September 000: Page 1: CHI-SQUARE TESTS: When to use a Chi-Square test: Usually in psychological research, we aim to obtain one or more scores

### Chi Square Tests. Chapter 10. 10.1 Introduction

Contents 10 Chi Square Tests 703 10.1 Introduction............................ 703 10.2 The Chi Square Distribution.................. 704 10.3 Goodness of Fit Test....................... 709 10.4 Chi Square

### Crosstabulation & Chi Square

Crosstabulation & Chi Square Robert S Michael Chi-square as an Index of Association After examining the distribution of each of the variables, the researcher s next task is to look for relationships among

### 9. Sampling Distributions

9. Sampling Distributions Prerequisites none A. Introduction B. Sampling Distribution of the Mean C. Sampling Distribution of Difference Between Means D. Sampling Distribution of Pearson's r E. Sampling

### Chi Squared and Fisher's Exact Tests. Observed vs Expected Distributions

BMS 617 Statistical Techniques for the Biomedical Sciences Lecture 11: Chi-Squared and Fisher's Exact Tests Chi Squared and Fisher's Exact Tests This lecture presents two similarly structured tests, Chi-squared

### TABLE OF CONTENTS. About Chi Squares... 1. What is a CHI SQUARE?... 1. Chi Squares... 1. Hypothesis Testing with Chi Squares... 2

About Chi Squares TABLE OF CONTENTS About Chi Squares... 1 What is a CHI SQUARE?... 1 Chi Squares... 1 Goodness of fit test (One-way χ 2 )... 1 Test of Independence (Two-way χ 2 )... 2 Hypothesis Testing

### CHAPTER 11. GOODNESS OF FIT AND CONTINGENCY TABLES

CHAPTER 11. GOODNESS OF FIT AND CONTINGENCY TABLES The chi-square distribution was discussed in Chapter 4. We now turn to some applications of this distribution. As previously discussed, chi-square is

### Calculating P-Values. Parkland College. Isela Guerra Parkland College. Recommended Citation

Parkland College A with Honors Projects Honors Program 2014 Calculating P-Values Isela Guerra Parkland College Recommended Citation Guerra, Isela, "Calculating P-Values" (2014). A with Honors Projects.

### Module 9: Nonparametric Tests. The Applied Research Center

Module 9: Nonparametric Tests The Applied Research Center Module 9 Overview } Nonparametric Tests } Parametric vs. Nonparametric Tests } Restrictions of Nonparametric Tests } One-Sample Chi-Square Test

### Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1)

Spring 204 Class 9: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.) Big Picture: More than Two Samples In Chapter 7: We looked at quantitative variables and compared the

### Lecture 7: Binomial Test, Chisquare

Lecture 7: Binomial Test, Chisquare Test, and ANOVA May, 01 GENOME 560, Spring 01 Goals ANOVA Binomial test Chi square test Fisher s exact test Su In Lee, CSE & GS suinlee@uw.edu 1 Whirlwind Tour of One/Two

### Mind on Statistics. Chapter 15

Mind on Statistics Chapter 15 Section 15.1 1. A student survey was done to study the relationship between class standing (freshman, sophomore, junior, or senior) and major subject (English, Biology, French,

### Unit 29 Chi-Square Goodness-of-Fit Test

Unit 29 Chi-Square Goodness-of-Fit Test Objectives: To perform the chi-square hypothesis test concerning proportions corresponding to more than two categories of a qualitative variable To perform the Bonferroni

### Chi Square (χ 2 ) Statistical Instructions EXP 3082L Jay Gould s Elaboration on Christensen and Evans (1980)

Chi Square (χ 2 ) Statistical Instructions EXP 3082L Jay Gould s Elaboration on Christensen and Evans (1980) For the Driver Behavior Study, the Chi Square Analysis II is the appropriate analysis below.

### Chapter 23. Two Categorical Variables: The Chi-Square Test

Chapter 23. Two Categorical Variables: The Chi-Square Test 1 Chapter 23. Two Categorical Variables: The Chi-Square Test Two-Way Tables Note. We quickly review two-way tables with an example. Example. Exercise

### SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question.

Ch. 10 Chi SquareTests and the F-Distribution 10.1 Goodness of Fit 1 Find Expected Frequencies Provide an appropriate response. 1) The frequency distribution shows the ages for a sample of 100 employees.

### Chi-Square Test (χ 2 )

Chi Square Tests Chi-Square Test (χ 2 ) Nonparametric test for nominal independent variables These variables, also called "attribute variables" or "categorical variables," classify observations into a

### Lecture 42 Section 14.3. Tue, Apr 8, 2008

the Lecture 42 Section 14.3 Hampden-Sydney College Tue, Apr 8, 2008 Outline the 1 2 the 3 4 5 the The will compute χ 2 areas, but not χ 2 percentiles. (That s ok.) After performing the χ 2 test by hand,

### Elementary Statistics Sample Exam #3

Elementary Statistics Sample Exam #3 Instructions. No books or telephones. Only the supplied calculators are allowed. The exam is worth 100 points. 1. A chi square goodness of fit test is considered to

### Contingency Tables and the Chi Square Statistic. Interpreting Computer Printouts and Constructing Tables

Contingency Tables and the Chi Square Statistic Interpreting Computer Printouts and Constructing Tables Contingency Tables/Chi Square Statistics What are they? A contingency table is a table that shows

### Introduction to Analysis of Variance (ANOVA) Limitations of the t-test

Introduction to Analysis of Variance (ANOVA) The Structural Model, The Summary Table, and the One- Way ANOVA Limitations of the t-test Although the t-test is commonly used, it has limitations Can only

### CHAPTER IV FINDINGS AND CONCURRENT DISCUSSIONS

CHAPTER IV FINDINGS AND CONCURRENT DISCUSSIONS Hypothesis 1: People are resistant to the technological change in the security system of the organization. Hypothesis 2: information hacked and misused. Lack

### Having a coin come up heads or tails is a variable on a nominal scale. Heads is a different category from tails.

Chi-square Goodness of Fit Test The chi-square test is designed to test differences whether one frequency is different from another frequency. The chi-square test is designed for use with data on a nominal

### Categorical Data Analysis

Richard L. Scheaffer University of Florida The reference material and many examples for this section are based on Chapter 8, Analyzing Association Between Categorical Variables, from Statistical Methods

### CHAPTER 11 CHI-SQUARE AND F DISTRIBUTIONS

CHAPTER 11 CHI-SQUARE AND F DISTRIBUTIONS CHI-SQUARE TESTS OF INDEPENDENCE (SECTION 11.1 OF UNDERSTANDABLE STATISTICS) In chi-square tests of independence we use the hypotheses. H0: The variables are independent

### In the past, the increase in the price of gasoline could be attributed to major national or global

Chapter 7 Testing Hypotheses Chapter Learning Objectives Understanding the assumptions of statistical hypothesis testing Defining and applying the components in hypothesis testing: the research and null

### Recall this chart that showed how most of our course would be organized:

Chapter 4 One-Way ANOVA Recall this chart that showed how most of our course would be organized: Explanatory Variable(s) Response Variable Methods Categorical Categorical Contingency Tables Categorical

### Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics.

Business Course Text Bowerman, Bruce L., Richard T. O'Connell, J. B. Orris, and Dawn C. Porter. Essentials of Business, 2nd edition, McGraw-Hill/Irwin, 2008, ISBN: 978-0-07-331988-9. Required Computing

### ANOVA ANOVA. Two-Way ANOVA. One-Way ANOVA. When to use ANOVA ANOVA. Analysis of Variance. Chapter 16. A procedure for comparing more than two groups

ANOVA ANOVA Analysis of Variance Chapter 6 A procedure for comparing more than two groups independent variable: smoking status non-smoking one pack a day > two packs a day dependent variable: number of

### Topic 8. Chi Square Tests

BE540W Chi Square Tests Page 1 of 5 Topic 8 Chi Square Tests Topics 1. Introduction to Contingency Tables. Introduction to the Contingency Table Hypothesis Test of No Association.. 3. The Chi Square Test

### Chi-Square Test. Contingency Tables. Contingency Tables. Chi-Square Test for Independence. Chi-Square Tests for Goodnessof-Fit

Chi-Square Tests 15 Chapter Chi-Square Test for Independence Chi-Square Tests for Goodness Uniform Goodness- Poisson Goodness- Goodness Test ECDF Tests (Optional) McGraw-Hill/Irwin Copyright 2009 by The

### The Chi-Square Test. STAT E-50 Introduction to Statistics

STAT -50 Introduction to Statistics The Chi-Square Test The Chi-square test is a nonparametric test that is used to compare experimental results with theoretical models. That is, we will be comparing observed

### Testing on proportions

Testing on proportions Textbook Section 5.4 April 7, 2011 Example 1. X 1,, X n Bernolli(p). Wish to test H 0 : p p 0 H 1 : p > p 0 (1) Consider a related problem The likelihood ratio test is where c is

### Pearson s 2x2 Chi-Square Test of Independence -- Analysis of the Relationship between two Qualitative

Pearson s 2x2 Chi-Square Test of Independence -- Analysis of the Relationship between two Qualitative or (Binary) Variables Analysis of 2-Between-Group Data with a Qualitative (Binary) Response Variable

### Chapter 19 The Chi-Square Test

Tutorial for the integration of the software R with introductory statistics Copyright c Grethe Hystad Chapter 19 The Chi-Square Test In this chapter, we will discuss the following topics: We will plot

### Introduction to Hypothesis Testing. Copyright 2014 Pearson Education, Inc. 9-1

Introduction to Hypothesis Testing 9-1 Learning Outcomes Outcome 1. Formulate null and alternative hypotheses for applications involving a single population mean or proportion. Outcome 2. Know what Type

### Hypothesis Testing & Data Analysis. Statistics. Descriptive Statistics. What is the difference between descriptive and inferential statistics?

2 Hypothesis Testing & Data Analysis 5 What is the difference between descriptive and inferential statistics? Statistics 8 Tools to help us understand our data. Makes a complicated mess simple to understand.

### 15. Analysis of Variance

15. Analysis of Variance A. Introduction B. ANOVA Designs C. One-Factor ANOVA (Between-Subjects) D. Multi-Factor ANOVA (Between-Subjects) E. Unequal Sample Sizes F. Tests Supplementing ANOVA G. Within-Subjects

### The Goodness-of-Fit Test

on the Lecture 49 Section 14.3 Hampden-Sydney College Tue, Apr 21, 2009 Outline 1 on the 2 3 on the 4 5 Hypotheses on the (Steps 1 and 2) (1) H 0 : H 1 : H 0 is false. (2) α = 0.05. p 1 = 0.24 p 2 = 0.20

### Chi-Square Analysis (Ch.8) Purpose. Purpose. Examples

Chi-Square Analysis (Ch.8) Chi-square test of association (contingency) x tables rxc tables Post-hoc Interpretation Running SPSS Windows CROSSTABS Chi-square test of goodness of fit Purpose Chi-square

### Test Positive True Positive False Positive. Test Negative False Negative True Negative. Figure 5-1: 2 x 2 Contingency Table

ANALYSIS OF DISCRT VARIABLS / 5 CHAPTR FIV ANALYSIS OF DISCRT VARIABLS Discrete variables are those which can only assume certain fixed values. xamples include outcome variables with results such as live

### Course Text. Required Computing Software. Course Description. Course Objectives. StraighterLine. Business Statistics

Course Text Business Statistics Lind, Douglas A., Marchal, William A. and Samuel A. Wathen. Basic Statistics for Business and Economics, 7th edition, McGraw-Hill/Irwin, 2010, ISBN: 9780077384470 [This

### Sample Size Determination

Sample Size Determination Population A: 10,000 Population B: 5,000 Sample 10% Sample 15% Sample size 1000 Sample size 750 The process of obtaining information from a subset (sample) of a larger group (population)

### CHAPTER 15 NOMINAL MEASURES OF CORRELATION: PHI, THE CONTINGENCY COEFFICIENT, AND CRAMER'S V

CHAPTER 15 NOMINAL MEASURES OF CORRELATION: PHI, THE CONTINGENCY COEFFICIENT, AND CRAMER'S V Chapters 13 and 14 introduced and explained the use of a set of statistical tools that researchers use to measure

### Chi-Square Tests. In This Chapter BONUS CHAPTER

BONUS CHAPTER Chi-Square Tests In the previous chapters, we explored the wonderful world of hypothesis testing as we compared means and proportions of one, two, three, and more populations, making an educated

### Testing Research and Statistical Hypotheses

Testing Research and Statistical Hypotheses Introduction In the last lab we analyzed metric artifact attributes such as thickness or width/thickness ratio. Those were continuous variables, which as you

### MATH Chapter 23 April 15 and 17, 2013 page 1 of 8 CHAPTER 23: COMPARING TWO CATEGORICAL VARIABLES THE CHI-SQUARE TEST

MATH 1342. Chapter 23 April 15 and 17, 2013 page 1 of 8 CHAPTER 23: COMPARING TWO CATEGORICAL VARIABLES THE CHI-SQUARE TEST Relationships: Categorical Variables Chapter 21: compare proportions of successes

### Chi-square test Fisher s Exact test

Lesson 1 Chi-square test Fisher s Exact test McNemar s Test Lesson 1 Overview Lesson 11 covered two inference methods for categorical data from groups Confidence Intervals for the difference of two proportions

### Math 108 Exam 3 Solutions Spring 00

Math 108 Exam 3 Solutions Spring 00 1. An ecologist studying acid rain takes measurements of the ph in 12 randomly selected Adirondack lakes. The results are as follows: 3.0 6.5 5.0 4.2 5.5 4.7 3.4 6.8

### Hypothesis Testing for a Proportion

Math 122 Intro to Stats Chapter 6 Semester II, 2015-16 Inference for Categorical Data Hypothesis Testing for a Proportion In a survey, 1864 out of 2246 randomly selected adults said texting while driving

### Data Mining Techniques Chapter 5: The Lure of Statistics: Data Mining Using Familiar Tools

Data Mining Techniques Chapter 5: The Lure of Statistics: Data Mining Using Familiar Tools Occam s razor.......................................................... 2 A look at data I.........................................................

### CHAPTER 12. Chi-Square Tests and Nonparametric Tests LEARNING OBJECTIVES. USING STATISTICS @ T.C. Resort Properties

CHAPTER 1 Chi-Square Tests and Nonparametric Tests USING STATISTICS @ T.C. Resort Properties 1.1 CHI-SQUARE TEST FOR THE DIFFERENCE BETWEEN TWO PROPORTIONS (INDEPENDENT SAMPLES) 1. CHI-SQUARE TEST FOR

### A POPULATION MEAN, CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

CHAPTER 5. A POPULATION MEAN, CONFIDENCE INTERVALS AND HYPOTHESIS TESTING 5.1 Concepts When a number of animals or plots are exposed to a certain treatment, we usually estimate the effect of the treatment

### Descriptive Statistics

Descriptive Statistics Primer Descriptive statistics Central tendency Variation Relative position Relationships Calculating descriptive statistics Descriptive Statistics Purpose to describe or summarize

### 4. Descriptive Statistics: Measures of Variability and Central Tendency

4. Descriptive Statistics: Measures of Variability and Central Tendency Objectives Calculate descriptive for continuous and categorical data Edit output tables Although measures of central tendency and

### χ 2 = (O i E i ) 2 E i

Chapter 24 Two-Way Tables and the Chi-Square Test We look at two-way tables to determine association of paired qualitative data. We look at marginal distributions, conditional distributions and bar graphs.

### 1. Comparing Two Means: Dependent Samples

1. Comparing Two Means: ependent Samples In the preceding lectures we've considered how to test a difference of two means for independent samples. Now we look at how to do the same thing with dependent

### Chapter 11 Chi square Tests

11.1 Introduction 259 Chapter 11 Chi square Tests 11.1 Introduction In this chapter we will consider the use of chi square tests (χ 2 tests) to determine whether hypothesized models are consistent with

### Using Stata for Categorical Data Analysis

Using Stata for Categorical Data Analysis NOTE: These problems make extensive use of Nick Cox s tab_chi, which is actually a collection of routines, and Adrian Mander s ipf command. From within Stata,

### 3.2 Roulette and Markov Chains

238 CHAPTER 3. DISCRETE DYNAMICAL SYSTEMS WITH MANY VARIABLES 3.2 Roulette and Markov Chains In this section we will be discussing an application of systems of recursion equations called Markov Chains.

### NUMB3RS Activity: Candy Pieces. Episode: End of Watch

Teacher Page 1 NUMB3RS Activity: Candy Pieces Topic: Chi-square test for goodness-of-fit Grade Level: 11-12 Objective: Use a chi-square test to determine if there is a significant difference between the

### AP Statistics for Friday 4/15/16. 2) turn in work; hand back work. 3) lesson on Chi Square tests for Association with handouts

AP Statistics for Friday 4/15/16 1) go over schedule 2) turn in work; hand back work 3) lesson on Chi Square tests for Association with handouts 4) Assign #105 p 631 / 19, 23, 31 schedule... Normal Distribution

### 4) The goodness of fit test is always a one tail test with the rejection region in the upper tail. Answer: TRUE

Business Statistics, 9e (Groebner/Shannon/Fry) Chapter 13 Goodness of Fit Tests and Contingency Analysis 1) A goodness of fit test can be used to determine whether a set of sample data comes from a specific

### Sydney Roberts Predicting Age Group Swimmers 50 Freestyle Time 1. 1. Introduction p. 2. 2. Statistical Methods Used p. 5. 3. 10 and under Males p.

Sydney Roberts Predicting Age Group Swimmers 50 Freestyle Time 1 Table of Contents 1. Introduction p. 2 2. Statistical Methods Used p. 5 3. 10 and under Males p. 8 4. 11 and up Males p. 10 5. 10 and under

### CONTINGENCY TABLES ARE NOT ALL THE SAME David C. Howell University of Vermont

CONTINGENCY TABLES ARE NOT ALL THE SAME David C. Howell University of Vermont To most people studying statistics a contingency table is a contingency table. We tend to forget, if we ever knew, that contingency

### One-Way Analysis of Variance (ANOVA) Example Problem

One-Way Analysis of Variance (ANOVA) Example Problem Introduction Analysis of Variance (ANOVA) is a hypothesis-testing technique used to test the equality of two or more population (or treatment) means

### UNDERSTANDING THE TWO-WAY ANOVA

UNDERSTANDING THE e have seen how the one-way ANOVA can be used to compare two or more sample means in studies involving a single independent variable. This can be extended to two independent variables

### Chi Square Test. Paul E. Johnson Department of Political Science. 2 Center for Research Methods and Data Analysis, University of Kansas

Chi Square Test Paul E. Johnson 1 2 1 Department of Political Science 2 Center for Research Methods and Data Analysis, University of Kansas 2011 What is this Presentation? Chi Square Distribution Find

### 12.5: CHI-SQUARE GOODNESS OF FIT TESTS

125: Chi-Square Goodness of Fit Tests CD12-1 125: CHI-SQUARE GOODNESS OF FIT TESTS In this section, the χ 2 distribution is used for testing the goodness of fit of a set of data to a specific probability

### An Introduction to Statistical Tests for the SAS Programmer Sara Beck, Fred Hutchinson Cancer Research Center, Seattle, WA

ABSTRACT An Introduction to Statistical Tests for the SAS Programmer Sara Beck, Fred Hutchinson Cancer Research Center, Seattle, WA Often SAS Programmers find themselves in situations where performing

### Statistics Review PSY379

Statistics Review PSY379 Basic concepts Measurement scales Populations vs. samples Continuous vs. discrete variable Independent vs. dependent variable Descriptive vs. inferential stats Common analyses

### Module 5 Hypotheses Tests: Comparing Two Groups

Module 5 Hypotheses Tests: Comparing Two Groups Objective: In medical research, we often compare the outcomes between two groups of patients, namely exposed and unexposed groups. At the completion of this

### 1. Rejecting a true null hypothesis is classified as a error. 2. Failing to reject a false null hypothesis is classified as a error.

1. Rejecting a true null hypothesis is classified as a error. 2. Failing to reject a false null hypothesis is classified as a error. 8.5 Goodness of Fit Test Suppose we want to make an inference about

### CHAPTER 11: FACTS ABOUT THE CHI- SQUARE DISTRIBUTION

CHAPTER 11: FACTS ABOUT THE CHI- SQUARE DISTRIBUTION Exercise 1. If the number of degrees of freedom for a chi-square distribution is 25, what is the population mean and standard deviation? mean = 25 and

### First-year Statistics for Psychology Students Through Worked Examples

First-year Statistics for Psychology Students Through Worked Examples 1. THE CHI-SQUARE TEST A test of association between categorical variables by Charles McCreery, D.Phil Formerly Lecturer in Experimental

### Experimental Designs (revisited)

Introduction to ANOVA Copyright 2000, 2011, J. Toby Mordkoff Probably, the best way to start thinking about ANOVA is in terms of factors with levels. (I say this because this is how they are described

### Association Between Variables

Contents 11 Association Between Variables 767 11.1 Introduction............................ 767 11.1.1 Measure of Association................. 768 11.1.2 Chapter Summary.................... 769 11.2 Chi

### Chi-square (χ 2 ) Tests

Math 442 - Mathematical Statistics II May 5, 2008 Common Uses of the χ 2 test. 1. Testing Goodness-of-fit. Chi-square (χ 2 ) Tests 2. Testing Equality of Several Proportions. 3. Homogeneity Test. 4. Testing

### A Basic Guide to Analyzing Individual Scores Data with SPSS

A Basic Guide to Analyzing Individual Scores Data with SPSS Step 1. Clean the data file Open the Excel file with your data. You may get the following message: If you get this message, click yes. Delete