Unit 29: Inference for Two-Way Tables


Faculty Guide

Prerequisites

Unit 13, Two-Way Tables, is a prerequisite for this unit. In addition, students need some background in significance tests, which was introduced in Unit 25.

Additional Topic Coverage

Additional coverage of inference for two-way tables can be found in The Basic Practice of Statistics, Chapter 23, Two Categorical Variables: The Chi-Square Test.

Activity Description

Students should work in small groups on this activity. The activity consists of three parts. The first part provides a justification for the formula for computing the expected cell counts for chi-square tables. Students can work on Part I on their own, or it could be part of a lecture/class discussion. Parts II and III involve two different structures for datasets, both of which are appropriate for the chi-square analysis covered in this unit. Here are the two data structures: (1) subjects from a single sample are classified according to two categorical variables and (2) subjects from multiple samples (drawn from different populations) are classified according to a single categorical variable. In the latter case, "which sample" can be thought of as the second categorical variable. In the first case, a chi-square test for independence is performed; in the second case, a chi-square test for homogeneity is performed. The chi-square test statistics and the analyses are the same for both situations. So, in this unit, we have put little emphasis on distinguishing between these two situations.
Materials

For Part III, bags of at least two different types of M&Ms are needed. Large-sized bags were used for the sample data, with the exception of the M&Ms minis, for which a medium bag was purchased. In addition, students will need paper plates or bowls to contain the M&Ms while they are being counted.

Part I: Introduction - Assumption of Independence and Expected Count Formula

Part I provides an explanation of the expected counts formula used in a chi-square test of independence. Students need to be familiar with the Multiplication Rule from Unit 19, Probability Models. This part could be approached either as an activity or as part of an informal lecture that introduces the topic of this activity. It could also be skipped and students could move directly to Part II.

Part II: Single Sample, Classified on Two Categorical Variables

For this part, students will need to collect data from people. The class could serve as the sample, or perhaps combine this class with another class, or have students add their friends to the sample. Students will need to classify each individual in the sample by gender and eye color. An easy way to collect the data is to draw a table on the board. Each student should come up to the board and put a tally line in the appropriate box for gender and eye color. After students have completed their entries, numbers can replace the tally marks. Students can then copy the table from the board and begin work on Part II.

Part III: Multiple Samples, Classified on One Categorical Variable

Students should work in groups to collect the data on the M&Ms colors. Again, you may want to put a chart on the board and have students enter their results for each color as they finish sorting their M&Ms into colors. Once the data are collected, groups will need a copy of the class data. Since the resulting two-way table is quite large, group members should be encouraged to divide up the work of computing the expected cell counts. The color distribution of M&Ms differs by types and has changed over the years. You can write to Mars, the makers of M&Ms, for the latest color distribution in its candies.
The Video Solutions

1. Dr. Pardis Sabeti investigates the nonstop evolutionary arms race between our bodies and the infectious microorganisms that invade and inhabit them. In other words, she investigates connections between genotypes and protections from infectious diseases. Her work on Lassa fever is still in its early stages.

2. Sickle cell anemia hemoglobin mutation, HbS.

3. H0: No association between malaria and HbS.
   Ha: Association between malaria and HbS.

4. Expected count = (row total)(column total) / (grand total).

5. We reject the null hypothesis and conclude that there is an association between the HbS gene and malaria.
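The expected count formula in item 4 can be checked numerically. The sketch below uses the totals that appear in Part I of the activity solutions (row total 196 for DEM, column totals 246 for female and 254 for male). The grand total is not stated in the surviving text, so the code assumes it equals the sum of the two gender columns; treat that value as an assumption rather than the guide's figure. The function name `expected_count` is chosen here for illustration.

```python
# Totals quoted in Part I of the activity solutions.
row_total_dem = 196
col_total_female = 246
col_total_male = 254

# Assumption: the two gender columns cover every subject, so the grand
# total is their sum. The guide's own value is not legible in the text.
grand_total = col_total_female + col_total_male


def expected_count(row_total, col_total, total):
    """Expected cell count under independence: (row total)(column total) / grand total."""
    return row_total * col_total / total


# Multiplication Rule route: n * P(DEM) * P(female), valid under independence.
via_probability = (
    grand_total * (row_total_dem / grand_total) * (col_total_female / grand_total)
)

dem_female = expected_count(row_total_dem, col_total_female, grand_total)
dem_male = expected_count(row_total_dem, col_total_male, grand_total)

print(round(dem_female, 3), round(dem_male, 3))
```

Both routes give the same number, which is the point of Part I: the formula is the Multiplication Rule for independent events, scaled by the sample size. The two expected counts in a row also add back up to the row total.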
Unit Activity: Associations With Color Solutions

Part I: Introduction - Assumption of Independence and Expected Count Formula

1. a. P(DEM and female) = P(DEM) P(female) = [...]
   b. Expected number = [...] = (196)(246) / [...]
   c. Expected count = (196)(246) / [...] ≈ [...]
   d. P(DEM and male) = P(DEM) P(male) = [...]
      Expected number = [...] = (196)(254) / [...]
      Expected count = (196)(254) / [...] ≈ [...]

[Table: expected counts for Political Preference Color (DEM (Blue), GOP (Red), IND (White)) by gender (Male, Female); cell values not recoverable]

   b. χ² = [...]; df = (3 - 1)(2 - 1) = 2; p [...]
   c. There is sufficient evidence to reject the null hypothesis. There is an association between these two variables. In other words, they are dependent.

3. a. Sample data will be used to provide sample answers.

[Table: observed counts for Gender (Male, Female) by Eye Color (Blue, Brown, Other); cell values not recoverable]

   b. H0: No association between gender and eye color.
      Ha: Association between gender and eye color.
   c. Sample answer:

[Table: expected counts for Gender (Male, Female) by Eye Color (Blue, Brown, Other); cell values not recoverable]

   d. Sample answer: χ² = [...]; df = 2; p [...]
      There is insufficient evidence to reject the null hypothesis. In other words, there is no strong evidence to suggest that there is an association between eye color and gender.

4. a. Sample data (will be used for sample answers):
[Table: observed counts for Color (Green, Blue, Yellow, Orange, Red, Brown) by M&M type (Type 1 Dark, Type 2 Regular, Type 3 Peanut, Type 4 Mini); cell values not recoverable]

   b. H0: No association between M&M type and color distribution.
      Ha: Association between M&M type and color distribution.
   c. Sample answer:

[Table: expected counts for Color (Green, Blue, Yellow, Orange, Red, Brown) by M&M type (Type 1 Dark, Type 2 Regular, Type 3 Peanut, Type 4 Mini); cell values not recoverable]

   d. χ² = [...]; df = (6 - 1)(4 - 1) = 15; p ≈ 0
      There is an association between M&Ms type and color distribution. In other words, different types of M&Ms have different color distributions.
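Because the class tables in Parts II and III are large, it may help to see the whole calculation in one place. The following is a minimal sketch, not the guide's own worksheet: the function name `chi_square_test` and both data tables are made up for illustration, since the sample counts in the solutions above are not legible.

```python
def chi_square_test(observed):
    """Return (expected counts, chi-square statistic, degrees of freedom).

    `observed` is a list of rows of counts from a two-way table.
    """
    row_totals = [sum(row) for row in observed]
    col_totals = [sum(col) for col in zip(*observed)]
    grand_total = sum(row_totals)

    # Expected count = (row total)(column total) / grand total for each cell.
    expected = [[r * c / grand_total for c in col_totals] for r in row_totals]

    # Chi-square statistic: sum of (observed - expected)^2 / expected over all cells.
    statistic = sum(
        (obs - exp) ** 2 / exp
        for obs_row, exp_row in zip(observed, expected)
        for obs, exp in zip(obs_row, exp_row)
    )

    # df = (number of rows - 1)(number of columns - 1).
    df = (len(observed) - 1) * (len(observed[0]) - 1)
    return expected, statistic, df


# Hypothetical 2 x 2 table, invented for this example.
small_table = [[30, 20], [20, 30]]
expected, statistic, df = chi_square_test(small_table)

# Hypothetical 6 x 4 table (six colors by four candy types) with identical
# counts in every cell, used only to confirm the degrees of freedom.
flat_table = [[10, 10, 10, 10] for _ in range(6)]
_, flat_statistic, flat_df = chi_square_test(flat_table)
```

In the 6 x 4 case the degrees of freedom come out to 15, matching (6 - 1)(4 - 1) in item 4d. Groups that split up the expected-count work by hand can use a function like this to check their cells.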
Exercise Solutions

1. a. There were two cells with expected counts less than 1. The guidelines call for all expected counts to be greater than 1. Also, there were 7 cells with expected counts below 5. That means that around 39% of the cells have expected counts under 5. The guidelines state that no more than 20% of the cells should have expected counts less than 5.
   b. See solution to (c).
   c. Based on the completed table below, all expected counts were greater than 1. Two expected counts were below 5, which is just under 17% of the cells. So, the expected counts in the table below meet the guidelines.

[Table: observed and expected counts for Environment (Farm, Country, City) by Energy Drinks (None, One, Two, Three); cell values not recoverable]

   d. This is a 4 × 3 table; df = (4 - 1)(3 - 1) = 6. The chi-square test statistic is calculated below:
      χ² = sum over the 12 cells of (observed - expected)² / expected = [...]
   e. p [...] (See area under density curve below.) There is insufficient evidence to reject the null hypothesis. We found no clear evidence of an association between 12th-grade students' consumption of energy drinks and their growing-up environment.
[Figure: Chi-square Density Curve, df = [...]; horizontal axis χ²]

2. a. Gender is the explanatory variable. We would like to use gender to explain how students rate their intelligence compared to their peers.
   b. H0: No association between gender and intelligence rating.
      Ha: Association between gender and intelligence rating.
   c.

[Table: counts for Gender (Female, Male) by Intelligence (Below Average, Average, Above Average); cell values not recoverable]

   d. df = (2 - 1)(3 - 1) = 2
      χ² = [...]
      (Answers may vary somewhat depending on the number of decimals used in the expected cell count.)
   e. p ≈ 0. Reject the null hypothesis. There is a statistically significant difference between how males and females rate their intelligence compared to their peers. (In other words, there is an association between gender and intelligence rating.)
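The expected-count guidelines applied in exercise 1 (no expected count below 1, and no more than 20% of cells with an expected count below 5) can be sketched as a small check. The function name, its parameters, and the table of expected counts are all hypothetical; the exercise's own expected counts are not legible.

```python
def check_expected_count_guidelines(expected, minimum=1.0, small=5.0, max_share_small=0.20):
    """Check the two expected-count guidelines for a chi-square test.

    Returns a dict with the number of cells below `minimum`, the share of
    cells below `small`, and whether both guidelines are met.
    """
    cells = [value for row in expected for value in row]
    cells_below_minimum = sum(1 for value in cells if value < minimum)
    share_below_small = sum(1 for value in cells if value < small) / len(cells)
    return {
        "cells_below_minimum": cells_below_minimum,
        "share_below_small": share_below_small,
        "ok": cells_below_minimum == 0 and share_below_small <= max_share_small,
    }


# Hypothetical expected counts for a 3 x 4 table: two of the twelve cells
# fall below 5, mirroring the "just under 17%" situation in exercise 1(c).
hypothetical_expected = [
    [12.4, 8.1, 4.2, 6.3],
    [20.5, 13.0, 7.7, 3.8],
    [30.1, 18.9, 9.5, 5.6],
]
result = check_expected_count_guidelines(hypothetical_expected)
```

The same arithmetic explains the percentages in the solution: 7 cells out of 18 is about 39%, which fails the 20% guideline, while 2 cells out of 12 is about 16.7%, which passes.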
3. a. H0: No association between intelligence rating and average grades.
      Ha: Association between intelligence rating and average grades.
   b.

[Table: counts for Intelligence (Above Average, Average, Below Average) by Average Grade (A, B, C or Below); cell values not recoverable]

   c. df = (3 - 1)(3 - 1) = 4
      χ² = [...] ≈ 6.35
      As shown below, p ≈ 0.174.

[Figure: Chi-Square Density Curve, df = 4; horizontal axis χ²]

   d. We would expect to see a value from a chi-square distribution with df = 4 as or more extreme than 6.35 roughly 17.4% of the time. So, this is a somewhat common occurrence. It does not provide strong evidence against the null hypothesis. Generally, strong evidence means that the percentage should be below 5%.

4. a. H0: No association between gender and hours worked/week.
      Ha: Association between gender and hours worked/week.
   b. χ² = [...]; p = [...] < [...]. Therefore, the results are significant. There is an association between gender and hours worked per week. (Note: The practical significance is another matter and cannot be determined by a p-value.)
   c. The biggest discrepancy in work patterns is that a higher percentage of males did not work (43.52%) compared to females (40.59%). Furthermore, in every category of hours worked/week, there is a higher percentage of females than males.
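The p-value quoted in exercise 3(d), about 17.4% for a chi-square value of 6.35 with df = 4, can be reproduced without statistical software. The sketch below uses the closed-form upper-tail probability that exists when the degrees of freedom are even; it is a simplified stand-in for a software routine, and the function name is chosen here for illustration. It does not handle odd df (such as df = 3 or df = 15 elsewhere in this unit).

```python
import math


def chi_square_upper_tail_even_df(x, df):
    """P(X >= x) for a chi-square distribution with positive, even df.

    For df = 2m the upper tail equals exp(-x/2) * sum_{k=0}^{m-1} (x/2)^k / k!.
    """
    if df <= 0 or df % 2 != 0:
        raise ValueError("this sketch only handles positive, even df")
    half = x / 2.0
    term = 1.0   # the k = 0 term of the sum
    total = 1.0
    for k in range(1, df // 2):
        term *= half / k   # builds (x/2)^k / k! incrementally
        total += term
    return math.exp(-half) * total


# Exercise 3(d): chi-square value 6.35 with df = 4.
p_value = chi_square_upper_tail_even_df(6.35, 4)
print(round(p_value, 3))  # roughly 0.174
```

Since this probability is well above 5%, the result agrees with the conclusion in 3(d) that the data do not provide strong evidence against the null hypothesis.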
Review Questions Solutions

1. a. H0: No association between habitat use and eel species.
      Ha: Association between habitat use and eel species.
   b.

[Table: counts for Habitat Use (G, S, B) by eel species (Spotted, Purplemouth); cell values not recoverable]

   c. Here are the calculations for the chi-square test statistic:
      χ² = sum over the 6 cells of (observed - expected)² / expected = [...]
      The degrees of freedom are: df = (3 - 1)(2 - 1) = 2. Using software, p [...]
      Since p < 0.05, we reject the null hypothesis and conclude that there is an association between habitat use and moray eel species.
   d. Column percentages are more appropriate. The explanatory variable is the eel species. So, we should compare the conditional distributions of habitat use for each species of moray eel.

      Habitat Use    Spotted    Purplemouth
      G              25.9%      33.7%
      S              20.2%      19.5%
      B              53.9%      46.8%
      Total          100%       100%

      We learn that a majority (53.9%) of the spotted moray eels were found in border habitats compared to only 46.8% of the purplemouth moray eels.
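Column percentages of the kind used in item 1(d) can be computed as follows. This is a sketch with invented counts, since the eel counts themselves are not legible; the function name `column_percentages` is hypothetical. The final lines only confirm that the purplemouth percentages quoted in 1(d) form a complete conditional distribution.

```python
def column_percentages(observed):
    """Convert counts to percentages within each column of a two-way table."""
    col_totals = [sum(col) for col in zip(*observed)]
    return [
        [100 * count / total for count, total in zip(row, col_totals)]
        for row in observed
    ]


# Hypothetical counts (rows: three habitat categories; columns: two species).
hypothetical_counts = [[10, 30], [20, 30], [70, 40]]
percentages = column_percentages(hypothetical_counts)

# Each column of percentages is a conditional distribution, so it sums to 100.
column_sums = [sum(col) for col in zip(*percentages)]

# The purplemouth percentages quoted in 1(d) also sum to 100.
purplemouth_quoted = [33.7, 19.5, 46.8]
quoted_sum = sum(purplemouth_quoted)
```

Comparing columns this way puts both species on the same scale even when the two samples differ in size, which is why the solution prefers column percentages when species is the explanatory variable.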
2. a. Educational attainment is the explanatory variable and voting is the response variable. We expect that a person's highest educational attainment will shed light on whether or not they voted in the 2012 elections.
   b. H0: No association between education and voting.
      Ha: Association between education and voting.
   c.

[Table: observed and expected counts for Highest Educational Attainment (Not HS Grad; HS Grad/No College; Some College/Associate's; Bachelor's or Higher) by Voted Nov. (Yes, No); cell values not recoverable]

   d. χ² = [... 84.5 ...]
      df = (4 - 1)(2 - 1) = 3; p [...]
      Since p < 0.05, the results are significant. There is a relationship between these two variables.
   e. Since the explanatory variable is highest educational attainment, the chart below represents graphically the conditional distributions of voting for each level of highest educational attainment.

[Figure: bar chart of percent Voted Nov. (No, Yes) within levels of Highest Educational Attainment (Not HS Grad; HS Grad/No College; Some College/Assoc.; Bachelor's or higher)]

      As the level of highest educational attainment increases, so does the participation in voting. More educated people are more likely to vote than those who are less educated.
3. a.

[Table: counts for Energy Shots Consumed Per Day (None; Less than one; One; Two; Three; Four; Five or Six; Seven or more) by gender (Female, Male); cell values not recoverable]

   b. No, the guidelines are not satisfied. There are two cells that have expected counts below 1 (0.48 and [...]). In addition, there are 4 cells with expected counts less than 5, which is 25% of the cells.
   c. Sample answer (students may decide to combine different categories):

[Table: counts for Energy Shots Consumed Per Day (None; One or Less; Two or Three; Four or more) by gender (Female, Male); cell values not recoverable]

   d. Sample answer is based on sample answer to (c): χ² = [...]; p [...]
      There is insufficient evidence to reject the null hypothesis. There is insufficient evidence to indicate that there is a linkage between amounts of energy drink shots consumed and gender.
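Combining categories, as in the sample answer to 3(c), amounts to adding rows of the two-way table. The sketch below shows one way to do it; the counts are invented for illustration (the real counts are not legible), and the grouping mirrors the sample answer's four categories. The function name `combine_rows` is hypothetical.

```python
def combine_rows(observed, groups):
    """Add together the rows listed in each group of row indices."""
    n_cols = len(observed[0])
    return [
        [sum(observed[i][j] for i in group) for j in range(n_cols)]
        for group in groups
    ]


# Hypothetical counts for eight consumption categories (columns: Female, Male).
hypothetical_counts = [
    [50, 40],  # None
    [12, 10],  # Less than one
    [9, 11],   # One
    [6, 7],    # Two
    [3, 4],    # Three
    [2, 3],    # Four
    [1, 2],    # Five or Six
    [0, 1],    # Seven or more
]

# Grouping that mirrors the sample answer: None; One or Less; Two or Three;
# Four or more.
groups = [[0], [1, 2], [3, 4], [5, 6, 7]]
combined = combine_rows(hypothetical_counts, groups)
```

Combining rows leaves the column totals and the grand total unchanged, so only the expected counts in the merged rows grow. Students who choose a different grouping can change `groups` and recheck the expected-count guidelines.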