1. Chi-Squared Tests

Size: px

Start display at page:

Download "1. Chi-Squared Tests"

Dana Walsh
7 years ago
Views:

1 1. Chi-Squared Tests We'll now look at how to test statistical hypotheses concerning nominal data, and specifically when nominal data are summarized as tables of frequencies. The tests we will considered are generically called chi-squared (or chi-square) tests. ach test involves computing a test statistic, and then calculating the area in the tail of a theoretical distribution called the chisquared (χ²) distribution. The χ² distribution, like the t distribution, is actually a family of distributions each one corresponding to a certain number of degrees of freedom: However in the case of the χ² distribution, we are almost always concerned with upper-tail probabilities. That is, chi-squared tests are usually 1-tailed.

2 Hypothetical Data Various Outcomes to Arterial Stent Placement Outcome Observed (O) xpected () Rejected days > 100 days Replaced 0 5 Total 8 8 Our observed frequencies come from data on 8 patients who receive the treatment. Our expected frequencies may come from theoretical models or from estimates of probabilities derived from some larger reference population. Our null hypothesis is that the observed frequencies do not differ from the expected frequencies by more than is expected than chance. Or: H0: Our sample comes from some specified reference population. To test the null hypothesis, we may use either of two test statistics. Pearson X-squared statistic Likelihood ratio statistic X = All cells ( O ) L = All cells O O ln Both of these test statistics follow a theoretical χ²-distribution. They are typically, (though not necessarily always), close in value to each other. Note that in the former case the test statistic is denoted X. This should be called "ex-squared". It is not the same as the theoretical distribution, χ² (chi-squared). Most textbooks mistakenly call the test statistic (X ) "chi-squared." That is, the name "chi-squared" test comes from the distribution used to test the hypothesis (χ² distribution), and not the test statistic itself.

3 We perform our test by computing X. Our calculations for the example data are shown below: Hypothetical Data Various Outcomes to Arterial Stent Placement Outcome Observed (O) xpected () ( O ) (O ) Rejected days > 100 days Replaced Total 8 8 Sum = X = The area of the χ² distribution (with 4 1 = 3 df) above is vanishingly small (p = ). ven assuming a low α (e.g., α = 0.001) then p < α, so we reject the H0 which asserted that our data came from the reference population. That is, our sample comes from some other population, with probabilities of each level that are different from the reference population. We can check our results here: As mentioned briefly in the last lecture, our expected frequencies in an analysis like this would come from estimates of the probabilities of observations falling in each category. Getting xpected Frequencies from Probability or Proportion stimates Outcome Observed (O) Population Probability (π) xpected () Rejected days > 100 days Replaced Total

4 These probabilities might come from a theoretical model or from knowledge about the composition of the population. In any case, we would get the expected frequencies for each category (i) by multiplying each probability times the number of cases (n) in our sample: i = πi One common application of the above method is to perform a goodness-of-fit test. Suppose, for example, that we have a continuous variable and we wish to know if it's distribution is, for example, normal (or Poisson, or some other known shape). Our null and alternative hypotheses are as follows: H0: Our data follow the hypothezied distributional form. H1: Our data do not follow the hypothesized distributional form. We conduct the test as follows: Divide the continuous variable into discrete ranges. Observed frequencies are the numbers of observations that fall in each range. Probabilities (π) are what we would expect if the variable had the hypothesized distributional form (e.g., obtained from integral of the normal distribution over each range). For expected frequencies, we multiply the probabilities times the n of our sample size. We then calculate the X test statistic and consult the χ² distribution with k 1 df (where k is the number of levels or categories). Our p-value is the area of the distribution above the calculated value of X. If p < α, we reject the null hypothesis that our data are normally distributed. Note that in this case, unlike other applications, we typically *do not* want to reject the null hypothesis (i.e., we wish to conclude that the variable has the predicted distributional shape). For this reason, in a goodness-of-fit test, α is often set higher than usual, e.g., 0.1. Video: Pearson's Chi Square Test (Goodness of Fit) n

5 . Chi-Squared Tests for One Variable in xcel 1. Place level names in Column A. In Column B, place observed frequencies (O) for each level. 3. In Column C, place expected probabilities (π) for each level. 4. Multiply probabilities times sample size (n) to produce expected frequencies (); place in Column D. 5. In Column, calculate (O ) / for each row. 6. Sum results of Column. This is your X test statistic. 7. Compute p-value as area of χ² distribution (with k 1 df, where k is the number of levels) above X. If p < α (e.g., p < 0.05), reject null hypothesis that your O and frequencies come from the same population or distribution. Use function: =CHIDIST(x-square, df) where is the value of X and df = k 1.

6 3. Chi-Squared Tests for Two-way Tables Another, more common use of chi-squared statistics is to test whether two (or more) nominal variables are statistically independent. Two nominal variables are statistically independent if the level of one variable has no influence on or predictive value for the second variable. Our null and alternative hypotheses are as follows: H0: The two variables are statistically independent. H1: The two variables are not statistically independent. We will illustrate the method using two variables with two levels each, but the same principles can be applied to variables with more than two levels. Let two nominal variables be measured on the same sample of n subjects. We can summarize the data as a two-way table of frequencies (cross-classification table), where O ij is the number of cases observed with level i of variable 1 and level j of variable. Suppose for example we have measured presence/absence of two symptoms on a set of patients: Table: Cross-classification Frequencies for Presence/Absence of Two Symptoms Symptom Symptom 1 Absent Present Total Absent O 11 O 1 r 1 Present O 1 O r Total c 1 c N The numbers along the edges (bottom and right), are called the marginal totals (also called marginal frequencies, or sometimes just marginals). These are simply row (r 1 and r ) and column totals (c 1 and c ). We use the row and column marginal totals to compute the expected frequencies of each cell. Under the assumption of statistical independence, the probability of a randomly selected case falling in cell (i,j) is the probability of falling in row i times the probability of falling in column j. We estimate these row and column probabilities from the marginal frequencies of our table. For example, r 1/N estimates the probability of a case falling in row 1, and c 1/N estimates the probability of a case falling on column 1. The expected of cases falling in cell (i, j) is therefore estimated as follows: ri = N N c j ij = N r c i N j

7 If our null hypothesis is correct, then the observed frequencies should differ more than is expected by random sampling variability from the expected frequencies. To test this, we measure the discrepancy of observed and expected frequencies using our previous formula: Or, more precisely: X X = All cells ( O ) ij ) ( O = ij i j ij where, for our example above, summation is over i, j = 1, Homework: Use xcel to reproduce the results in section, using the data

Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1)

Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1) Spring 204 Class 9: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.) Big Picture: More than Two Samples In Chapter 7: We looked at quantitative variables and compared the