Chapter. ThreeWay ANOVA CONCEPTUAL FOUNDATION. A Simple ThreeWay Example. 688 Chapter 22 ThreeWay ANOVA


 Christine Watts
 1 years ago
 Views:
Transcription
1 Cohen_Chapter22.j.qxd 8/23/02 11:56 M Page Chapter 22 ThreeWay ANOVA ThreeWay ANOVA 22 Chapter A CONCEPTUAL FOUNDATION 688 You will need to use the following from previous chapters: Symbols k: Number of independent groups in a oneway ANOVA c: Number of levels (i.e., conditions) of an RM factor n: Number of subjects in each cell of a factorial ANOVA N T : Total number of observations in an experiment Formulas Formula 16.2: SS inter (by subtraction) also Formulas 16.3, 16.4, 16.5 Formula 14.3: SS bet or one of its components Concepts Advantages and disadvantages of the RM ANOVA SS components of the oneway RM ANOVA SS components of the twoway ANOVA Interaction of factors in a twoway ANOVA So far I have covered two types of twoway factorial ANOVAs: twoway independent (Chapter 14) and the mixed design ANOVA (Chapter 16). There is only one more simple twoway ANOVA to describe: the twoway repeated measures design. [There are other twoway designs, such as those including randomeffects or nested factors, but they are not commonly used see Hays (1994) for a description of some of these.] Just as the oneway RM ANOVA can be described in terms of a twoway independentgroups ANOVA, the twoway RM ANOVA can be described in terms of a threeway independentgroups ANOVA. This gives me a reason to describe the latter design next. Of course, the threeway factorial ANOVA is interesting in its own right, and its frequent use in the psychological literature makes it an important topic to cover, anyway. I will deal with the threeway independentgroups ANOVA and the twoway RM ANOVA in this section and the two types of threeway mixed designs in Section B. Computationally, the threeway ANOVA adds nothing new to the procedure you learned for the twoway; the same basic formulas are used a greater number of times to extract a greater number of SS components from SS total (eight SSs for the threeway as compared with four for the twoway). However, anytime you include three factors, you can have a threeway interaction, and that is something that can get quite complicated, as you will see. To give you a manageable view of the complexities that may arise when dealing with three factors, I ll start with a description of the simplest case: the ANOVA. A Simple ThreeWay Example At the end of Section B in Chapter 14, I reported the results of a published study, which was based on a 2 2 ANOVA. In that study one factor contrasted subjects who had an alcoholdependent parent with those who did not. I ll call this the alcohol factor and its two levels, at risk (of codependency) and control. The other factor (the experimenter factor) also had two levels; in one level subjects were told that the experimenter was an exploitive person, and in the other level the experimenter was described as a nurturing person. All of the subjects were women. If we imagine that the experiment was replicated using equalsized groups of men and women, the original
2 Cohen_Chapter22.j.qxd 8/23/02 11:56 M Page 689 Section A Conceptual Foundation 689 twoway design becomes a threeway design with gender as the third factor. We will assume that all eight cells of the design contain the same number of subjects. As in the case of the twoway ANOVA, unbalanced threeway designs can be difficult to deal with both computationally and conceptually and therefore will not be discussed in this chapter (see Chapter 18, section A). The cell means for a threefactor experiment are often displayed in published articles in the form of a table, such as Table Nurturing Exploitive Row Mean Control: Men Women Mean Table 22.1 At risk: Men Women Mean Column mean Graphing Three Factors The easiest way to see the effects of this experiment is to graph the cell means. However, putting all of the cell means on a single graph would not be an easy way to look at the threeway interaction. It is better to use two graphs side by side, as shown in Figure With a twoway design one has to decide which factor is to be placed along the horizontal axis, leaving the other to be represented by different lines on the graph. With a threeway design one chooses both the factor to be placed along the horizontal axis and the factor to be represented by different lines, leaving the third factor to be represented by different graphs. These decisions result in six different ways that the cell means of a threeway design can be presented. Let us look again at Figure The graph for the women shows the twoway interaction you would expect from the study on which it is based. The graph for the men shows the same kind of interaction, but to a considerably lesser extent (the lines for the men are closer to being parallel). This difference 80 Women At risk 80 Men Figure 22.1 Graph of Cell Means for Data in Table At risk Control 20 Control 20 0 Nurturing Exploitive 0 Nurturing Exploitive
3 Cohen_Chapter22.j.qxd 8/23/02 11:56 M Page Chapter 22 ThreeWay ANOVA in amount of twoway interaction for men and women constitutes a threeway interaction. If the two graphs had looked exactly the same, the F ratio for the threeway interaction would have been zero. However, that is not a necessary condition. A main effect of gender could raise the lines on one graph relative to the other without contributing to a threeway interaction. Moreover, an interaction of gender with the experimenter factor could rotate the lines on one graph relative to the other, again without contributing to the threeway interaction. As long as the difference in slopes (i.e., the amount of twoway interaction) is the same in both graphs, the threeway interaction will be zero. Simple Interaction Effects A threeway interaction can be defined in terms of simple effects in a way that is analogous to the definition of a twoway interaction. A twoway interaction is a difference in the simple main effects of one of the variables as you change levels of the other variable (if you look at just the graph of the women in Figure 22.1, each line is a simple main effect). In Figure 22.1 each of the two graphs can be considered a simple effect of the threeway design more specifically, a simple interaction effect. Each graph depicts the twoway interaction of alcohol and experimenter at one level of the gender factor. The threeway interaction can be defined as the difference between these two simple interaction effects. If the simple interaction effects differ significantly, the threeway interaction will be significant. Of course, it doesn t matter which of the three variables is chosen as the one whose different levels are represented as different graphs if the threeway interaction is statistically significant, there will be significant differences in the simple interaction effects in each case. Varieties of Threeway Interactions Just as there are many patterns of cell means that lead to twoway interactions (e.g., one line is flat while the other goes up or down, the two lines go in opposite directions, or the lines go in the same direction but with different slopes), there are even more distinct patterns in a threeway design. Perhaps the simplest is when all of the means are about the same, except for one, which is distinctly different. For instance, in our present example the results might have shown no effect for the men (all cell means about 40), no difference for the control women (both means about 40), and a mean of 40 for atrisk women exposed to the nice experimenter. Then, if the mean for atrisk women with the exploitive experimenter were well above 40, there would be a strong threeway interaction. This is a situation in which all three variables must be at the right level simultaneously to see the effect in this variation of our example the subject must be female and raised by an alcoholdependent parent and exposed to the exploitive experimenter to attain a high score. Not only might the threeway interaction be significant, but one cell mean might be significantly different from all of the other cell means, making an even stronger case that all three variables must be combined properly to see any effect (if you were sure that this pattern were going to occur, you could test a contrast comparing the average of seven cell means to the one you expect to be different and not bother with the ANOVA at all). More often the results are not so clearcut, but there is one cell mean that is considerably higher than the others (as in Figure 22.1). This kind of pattern is analogous to the ordinal interaction in the twoway case and tends to cause all of the effects to be significant. On the other hand, a threeway interaction could arise because the twoway interaction reverses its pattern when changing levels of the third variable (e.g., imagine that in Figure 22.1
4 Cohen_Chapter22.j.qxd 8/23/02 11:56 M Page 691 Section A Conceptual Foundation 691 the labels of the two lines were reversed for the graph of men but not for the women). This is analogous to the disordinal interaction in the twoway case. Or, the twoway interaction could be strong at one level of the third variable and much weaker (or nonexistent) at another level. Of course, there are many other possible variations. And consider how much more complicated the threeway interaction can get when each factor has more than two levels (we will deal with a greater number of levels in Section B). Fortunately, threeway (betweensubjects) ANOVAs with many levels for each factor are not common. One reason is a practical one: the number of subjects required. Even a design as simple as a has 24 cells (to find the number of cells, you just multiply the numbers of levels). If you want to have at least 5 subjects per cell, 120 subjects are required. This is not an impractical study, but you can see how quickly the addition of more levels would result in a required sample size that could be prohibitive. Main Effects In addition to the threeway interaction there are three main effects to look at, one for each factor. To look at the gender main effect, for instance, just take the average of the scores for all of the men and compare it to the average of all of the women. If you have the cell means handy and the design is balanced, you can average all of the cell means involving men and then all of the cell means involving women. In Table 22.1, you can average the four cell means for the men (40, 28, 36, 48) to get 38 (alternatively, you could use the row means in the extreme right column and average 34 and 42 to get the same result). The average for the women (30, 22, 40, 88) is 45. The means for the other main effects have already been included in Table Looking at the bottom row you can see that the mean for the nurturing experimenter is 36.5 as compared to 46.5 for the exploitive one. In the extreme right column you ll find that the mean for the control subjects is 30, as compared to 53 for the atrisk subjects. TwoWay Interactions in ThreeWay ANOVAs Further complicating the threeway ANOVA is that, in addition to the threeway interaction and the three main effects, there are three twoway interactions to consider. In terms of our example there are the gender by experimenter, gender by alcohol, and experimenter by alcohol interactions. We will look at the last of these first. Before graphing a twoway interaction in a threefactor design, you have to collapse (i.e., average) your scores over the variable that is not involved in the twoway interaction. To graph the alcohol by experimenter (A B) interaction you need to average the men with the women for each combination of alcohol and experimenter levels (i.e., each cell of the A B matrix). These means have also been included in Table The graph of these cell means is shown in Figure If you compare this overall twoway interaction with the twoway interactions for the men and women separately (see Figure 22.1), you will see that the overall interaction looks like an average of the two separate interactions; the amount of interaction seen in Figure 22.2 is midway between the amount of interaction for the men and that amount for the women. Does it make sense to average the interactions for the two genders into one overall interaction? It does if they are not very different. How different is too different? The size of the threeway interaction tells us how different these two twoway interactions are. A statistically significant threeway interaction suggests that we should be cautious in interpreting any of the twoway interactions. Just as a significant twoway interaction tells us to look carefully at, and possible test, the
5 Cohen_Chapter22.j.qxd 8/23/02 11:56 M Page Chapter 22 ThreeWay ANOVA Figure 22.2 Graph of Cell Means in Table 22.1 after Averaging Across Gender Average of Men and Women At risk Control 0 Nurturing Exploitive simple main effects (rather than the overall main effects), a significant threeway interaction suggests that we focus on the simple interaction effects the twoway interactions at each level of the third variable (which of the three independent variables is treated as the third variable is a matter of convenience). Even if the threeway interaction falls somewhat short of significance, I would recommend caution in interpreting the twoway interactions and the main effects, as well, whenever the simple interaction effects look completely different and, perhaps, show opposite patterns. So far I have been focusing on the twoway interaction of alcohol and experimenter in our example, but this choice is somewhat arbitrary. The two genders are populations that we are likely to have theories about, so it is often meaningful to compare them. However, I can just as easily graph the threeway interaction using alcohol as the third factor, as I have done in Figure 22.3a. To graph the overall twoway interaction of gender and experimenter, you can go back to Table 22.1 and average across the alcohol factor. For instance, the mean for men in the nurturing condition is found by averaging the mean for control group men in the nurturing condition (40) with Figure 22.3a Control At Risk Graph of Cell Means in Table 22.1 Using the Alcohol Factor to Distinguish the Panels Women Men Men Women Nurturing Exploitive 0 Nurturing Exploitive
6 Cohen_Chapter22.j.qxd 8/23/02 11:56 M Page 693 Section A Conceptual Foundation Average of Control and at Risk Women Men Figure 22.3b Graph of Cell Means in Table 22.1 after Averaging Across the Alcohol Factor 0 Nurturing Exploitive the mean for atrisk men in the nurturing condition (36), which is 38. The overall twoway interaction of gender and experimenter is shown in Figure 22.3b. Note that once again the twoway interaction is a compromise. (Actually, the two twoway interactions are not as different as they look; in both cases the slope of the line for the women is more positive or at least less negative). For completeness, I have graphed the threeway interaction using experimenter as the third variable, and the overall twoway interaction of gender and alcohol in Figures 22.4a and 22.4b. An Example of a Disordinal ThreeWay Interaction In the threefactor example I have been describing, it looks like all three main effects and all three twoway interactions, as well as the threeway interaction, could easily be statistically significant. However, it is important to note that in a balanced design all seven of these effects are independent; the seven F ratios do share the same error term (i.e., denominator), but the sizes of the numerators are entirely independent. It is quite possible to have Nurturing Exploitive Figure 22.4a Women Graph of Cell Means in Table 22.1 Using the Experimenter Factor to Distinguish the Panels Women Men Men Control At risk 0 Control At risk
7 Cohen_Chapter22.j.qxd 8/23/02 11:56 M Page Chapter 22 ThreeWay ANOVA Figure 22.4b Average of Nurturing and Exploitive Graph of Cell Means in Table 22.1 after Averaging Across the Experimenter Factor Women Men 0 Control At risk a large threeway interaction while all of the other effects are quite small. By changing the means only for the men in our example, I will illustrate a large, disordinal interaction that obliterates two of the twoway interactions and two of the main effects. You can see in Figure 22.5a that this new threeway interaction is caused by a reversal of the alcohol by experimenter interaction from one gender to the other. In Figure 22.5b, you can see that the overall interaction of alcohol by gender is now zero (the lines are parallel); the gender by experimenter interaction is also zero (not shown). On the other hand, the large gender by alcohol interaction very nearly obliterates the main effects of both gender and alcohol (see Figure 22.5c). The main effect of experimenter is, however, large, as can be seen in Figure 22.5b. An Example in which the ThreeWay Interaction Equals Zero Finally, I will change the means for the men once more to create an example in which the threeway interaction is zero, even though the graphs for the Figure 22.5a Women Men Rearranging the Cell Means of Table 22.1 to Depict a Disordinal 3Way Interaction At risk Control Control At risk 0 Nurturing Expoitive 0 Nurturing Expoitive
8 Cohen_Chapter22.j.qxd 8/23/02 11:56 M Page 695 Section A Conceptual Foundation Average of men and women At risk Control Figure 22.5b Regraphing Figure 22.5a after Averaging Across Gender 0 Nurturing Exploitive Average of Nurturing and Exploitive Figure 22.5c Men Women Regraphing Figure 22.5a after Averaging Across the Experimenter Factor Control At risk two genders do not look the same. In Figure 22.6, I created the means for the men by starting out with the women s means and subtracting 10 from each (this creates a main effect of gender); then I added 30 only to the men s means that involved the nurturing condition. The latter change creates a twoway interaction between experimenter and gender, but because it affects both the men/nurturing means equally, it does not produce any threeway interaction. One way to see that the threeway interaction is zero in Figure 22.6 is to subtract the slopes of the two lines for each gender. For the women the slope of the atrisk line is positive: = 48. The slope of the control line is negative: = 8. The difference of the slopes is 48 ( 8) = 56. If we do the same for the men, we get slopes of 18 and 38, whose difference is also 56. You may recall that a 2 2 interaction has only one df, and can be summarized by a single number, L, that forms the basis of a simple linear contrast. The same is true for a interaction or any higherorder interaction in which all of the factors have two levels. Of course, quantifying a threeway interaction gets considerably more complicated when the factors have more than two levels, but it is safe to say that if the two (or more) graphs are exactly the same, there will be no threeway interaction (they will continue to be identical, even if a different factor is chosen to distinguish the
9 Cohen_Chapter22.j.qxd 8/23/02 11:56 M Page Chapter 22 ThreeWay ANOVA Figure 22.6 Women Men Rearranging the Cell Means of Table 22.1 to Depict a Zero Amount of ThreeWay Interaction At risk At risk Control Control 0 Nurturing Expoitive 0 Nurturing Expoitive graphs). Bear in mind, however, that even if the graphs do not look the same, the threeway interaction will be zero if the amount of twoway interaction is the same for every graph. Calculating the ThreeWay ANOVA Calculating a threeway independentgroups ANOVA is a simple extension of the method for a twoway independentgroups ANOVA, using the same basic formulas. In particular, there is really nothing new about calculating MS W (the error term for all the F ratios); it is just the ordinary average of the cell variances when the design is balanced. (It is hard to imagine that anyone would calculate an unbalanced threeway ANOVA with a calculator rather than a computer, so I will not consider that possibility. The analysis of unbalanced designs is described in general in Chapter 18, Section A). Rather than give you all of the cell standard deviations or variances for the example in Table 22.1, I ll just tell you that SS W equals 6,400; later I ll divide this by df W to obtain MS W. (If you had all of the raw scores, you would also have the option of obtaining SS W by calculating SS total and subtracting SS betweencells as defined in the following.) Main Effects The calculation of the main effects is also the same as in the twoway ANOVA; the SS for a main effect is just the biased variance of the relevant group means multiplied by the total N. Let us say that each of the eight cells in our example contains five subjects, so N T equals 40. Then the SS for the experimenter factor (SS exper ) is 40 times the biased variance of 36.5 and 46.5 (the nurturing and exploitive means from Table 22.1), which equals 40(25) = 1000 (the shortcut for finding the biased variance of two numbers is to take the square of the difference between them and then divide by 4). Similarly, SS alcohol = 40(132.25) = 5290, and SS gender = 40(12.25) = 490. The TwoWay Interactions When calculating the twoway ANOVA, the SS for the twoway interaction is found by subtraction; it is the amount of the SS betweencells that is left after sub
10 Cohen_Chapter22.j.qxd 8/23/02 11:56 M Page 697 Section A Conceptual Foundation 697 tracting the SSs for the main effects. Similarly, the threeway interaction SS is the amount left over after subtracting the SSs for the main effects and the SSs for all the twoway interactions from the overall SS betweencells. However, finding the SSs for the twoway interactions in a threeway design gets a little tricky. In addition to the overall SS betweencells, we must also calculate some intermediate twoway SS between terms. To keep track of these I will have to introduce some new subscripts. The overall SS betweencells is based on the variance of all the cell means, so no factors are collapsed, or averaged over. Representing gender as G, alcohol as A, and experimenter as E, the overall SS betweencells will be written as SS GAE. We will also need to calculate an SS between after averaging over gender. This is based on the four means (included in Table 22.1) I used to graph the alcohol by experimenter interaction and will be represented by SS AE. Because the design is balanced, you can take the simple average of the appropriate male cell mean and female cell mean in each case. Note that SS AE is not the SS for the alcohol by experimenter interaction because it also includes the main effects of those two factors. In similar fashion, we need to find SS GA from the means you get after averaging over the experimenter factor and SS GE by averaging over the alcohol factor. Once we have calculated these four SS between terms, all of the SSs we need for the threeway ANOVA can be found by subtraction. Let s begin with the calculation of SS GAE ; the biased variance of the eight cell means is , so SS GAE = 40(366.75) = 14,670. The means for SS AE are 35, 25, 38, 68, and their biased variance equals , so SS AE = 10,290. SS GA is based on the following means: 34, 26, 42, 64, so SS GA = 40(200.75) = 8,030. Finally, SS GE, based on means of 38, 38, 35, 55, equals 2,490. Next we find the SSs for each twoway interaction: SS A E = SS AE SS alcohol SS exper = 10,290 5,290 1,000 = 4,000 SS G A = SS GA SS gender SS alcohol = 8, ,290 = 2,250 SS G E = SS GE SS gender SS exper = 2, ,000 = 1,000 Finally, the SS for the threeway interaction (SS G A E ) equals SS GAE SS A E SS G A SS G E SS gender SS alcohol SS exper = 14,670 4,000 2,250 1, ,290 1,000 = 640 Formulas for the General Case It is traditional to assign the letters A, B, and, C to the three independent variables in the general case; variables D, E, and so forth, can then be added to represent a fourway, fiveway, or higher ANOVA. I ll assume that the following components have already been calculated using Formula 14.3 applied to the appropriate means: SS A, SS B, SS C, SS AB, SS AC, SS BC, SS ABC. In addition, I ll assume that SS W has also been calculated, either by averaging the cell variances and multiplying by df W or by subtracting SS ABC from SS total. The remaining SS components are found by Formula 22.1: a. SS A B = SS AB SS A SS B Formula 22.1 b. SS A C = SS AC SS A SS C c. SS B C = SS BC SS B SS C d. SS A B C = SS ABC SS A B SS B C SS A C SS A SS B SS C At the end of the analysis, SS total (whether or not it has been calculated separately) has been divided into eight components: SS A, SS B, SS C, the four interactions listed in Formula 22.1, and SS W. Each of these is divided by its corresponding df to form a variance estimate, MS. Using a to represent the
11 Cohen_Chapter22.j.qxd 8/23/02 11:56 M Page Chapter 22 ThreeWay ANOVA number of levels of the A factor, b for the B factor, c for the C factor, and n for the number of subjects in each cell, the formulas for the df components are as follows: a. df A = a 1 Formula 22.2 b. df B = b 1 c. df C = c 1 d. df A B = (a 1)(b 1) e. df A C = (a 1)(c 1) f. df B C = (b 1)(c 1) g. df A B C = (a 1)(b 1)(c 1) h. df W = abc (n 1) Completing the Analysis for the Example Because each factor in the example has only two levels, all of the numerator df s are equal to 1, which means that all of the MS terms are equal to their corresponding SS terms except, of course, for the error term. The df for the error term (i.e., df W ) equals the number of cells (abc) times one less than the number of subjects per cell (this gives the same value as N T minus the number of cells); in this case df W = 8(4) = 32. MS W = SS W /df W ; therefore, MS W = 6400/32 = 200. (Reminder: I gave the value of SS W to you to reduce the amount of calculation.) Now we can complete the threeway ANOVA by calculating all of the possible F ratios and testing each for statistical significance: MS gender 490 F gender = = = 2.45 MSW 200 MS alcohol 5,290 F alcohol = = = MSW 200 MS exper 1,000 F exper = = = 5 MSW 200 MS A E 4,000 F A E = = = 20 MSW 200 MS G A 2,250 F G A = = = MSW 200 MS G E 1000 F G E = = = 5 MSW 200 MS G A E 640 F G A E = = = 3.2 MSW 200 Because the df happens to be 1 for all of the numerator terms, the critical F for all seven tests is F.05 (1,32), which is equal (approximately) to Except for the main effect of gender, and the threeway interaction, all of the F ratios exceed the critical value (4.15) and are therefore significant at the.05 level. FollowUp Tests for the ThreeWay ANOVA Decisions concerning followup comparisons for a factorial ANOVA are made in a topdown fashion. First, one checks the highestorder interaction
12 Cohen_Chapter22.j.qxd 8/23/02 11:56 M Page 699 Section A Conceptual Foundation 699 for significance; in a threeway ANOVA it is the threeway interaction. (Twoway interactions are the simplest possible interactions and are called firstorder interactions; threeway interactions are known as secondorder interactions, etc.) If the highest interaction is significant, the post hoc tests focus on the various simple effects or interaction contrasts, followed by appropriate celltocell comparisons. In a threeway ANOVA in which the threeway interaction is not significant, as in the present example, attention turns to the three twoway interactions. Although all of the twoway interactions are significant in our example, the alcohol by experimenter interaction is the easiest to interpret because it replicates previous results. It would be appropriate to follow up the significant alcohol by experimenter interaction with four t tests (e.g., one of the relevant t tests would determine whether atrisk subjects differ significantly from controls in the exploitive condition). Given the disordinal nature of the interaction (see Figure 22.2), it is likely that the main effects would simply be ignored. A similar approach would be taken to the two other significant twoway interactions. Thus, all three main effects would be regarded with caution. Note that because all of the factors are dichotomous, there would be no followup tests to perform on significant main effects, even if none of the interactions were significant. With more than two levels for some or all of the factors, it becomes possible to test partial interactions, and significant main effects for factors not involved in significant interactions can be followed by pairwise or complex comparisons, as described in Chapter 14, Section C. I will illustrate some of the complex planned and post hoc comparisons for the threeway design in Section B. Types of ThreeWay Designs Cases involving significant threeway interactions and factors with more than two levels will be considered in the context of mixed designs in Section B. However, before we turn to mixed designs, let us look at some of the typical situations in which threeway designs with no repeated measures arise. One situation involves three experimental manipulations for which repeated measures are not feasible. For instance, subjects perform a repetitive task in one of two conditions: They are told that their performance is being measured or that it is not. In each condition half of the subjects are told that performance on the task is related to intelligence, and the other half are told that it is not. Finally, within each of the four groups just described, half the subjects are treated respectfully and half are treated rudely. The work output of each subject can then be analyzed by a ANOVA. Another possibility involves three grouping variables, each of which involves selecting subjects whose group is already determined. For instance, a group of people who exercise regularly and an equalsized group of those who don t are divided into those high and those relatively low on selfesteem (by a median split). If there are equal numbers of men and women in each of the four cells, we have a balanced design. More commonly one or two of the variables involve experimental manipulations and two or one involve grouping variables. The example calculated earlier in this section involved two grouping variables (gender and having an alcoholdependent parent or not) and one experimental variable (nurturing vs. exploitive experimenter). To devise an interesting example with two experimental manipulations and one grouping variable, start with two experimental factors that are expected to interact (e.g., one factor is whether or not the subjects are told that performance on the experimental task is related to intelligence, and the other factor is whether or not the group of subjects run together will know
13 Cohen_Chapter22.j.qxd 8/23/02 11:56 M Page Chapter 22 ThreeWay ANOVA each other s final scores). Then, add a grouping variable by comparing subjects who are either high or low on selfesteem, need for achievement, or some other relevant aspect of personality. If the twoway interaction differs significantly between the two groups of subjects, the threeway interaction will be significant. The TwoWay RM ANOVA One added benefit of learning how to calculate a threeway ANOVA is that you now know how to calculate a twoway ANOVA in which both factors involve repeated measures. In Chapter 15, I showed you that the SS components of a oneway RM design are calculated as though the design were a twoway independentgroups ANOVA with no withincell variability. Similarly, a twoway RM ANOVA is calculated just as shown in the preceding for the threeway independentgroups ANOVA, with the following modifications: (1) One of the three factors is the subjects factor each subject represents a different level of the subjects factor, (2) the main effect of subjects is not tested, and there is no MS W error term, (3) each of the two main effects that is tested uses the interaction of that factor with the subjects factor as the error term, and (4) the interaction of the two factors of interest is tested by using as the error term the interaction of all three factors (i.e., including the subjects factor). If one RM factor is labeled Q and the other factor, R, and we use S to represent the subjects factor, the equations for the three F ratios can be written as follows: MS Q MS R MS Q R F Q =, F R = F Q R = MSQ S MSR S MXQ R S HigherOrder ANOVA This text will not cover factorial designs of higher order than the threeway ANOVA. Although higherorder ANOVAs can be difficult to interpret, no new principles are introduced. The fourway ANOVA produces 15 different F ratios to test: four main effects, 6 twoway interactions, 4 threeway interactions, and 1 fourway interaction. Testing each of these 15 effects at the.05 level raises serious concerns about the increased risk of Type I errors. Usually, all of the F ratios are not tested; specific hypotheses should guide the selection of particular effects to test. Of course, the potential for an inflated rate of Type I errors only increases as factors are added. In general, an Nway ANOVA produces 2 N 1 F ratios that can be tested for significance. In the next section I will delve into more complex varieties of the threeway ANOVA in particular those that include repeated measures on one or two of the factors. A SUMMARY 1. To display the cell means of a threeway factorial design, it is convenient to create twoway graphs for each level of the third variable and place these graphs side by side (you have to decide which of the three variables will distinguish the graphs and which of the two remaining variables will be placed along the X axis of each graph). Each twoway graph depicts a simple interaction effect; if the simple interaction effects are significantly different from each other, the threeway interaction will be significant. 2. Threeway interactions can occur in a variety of ways. The interaction of two of the factors can be strong at one level of the third factor and close
14 Cohen_Chapter22.j.qxd 8/23/02 11:56 M Page 701 Section A Conceptual Foundation 701 to zero at a different level (or even stronger at a different level). The direction of the twoway interaction can reverse from one level of the third variable to another. Also, a threeway interaction can arise when all of the cell means are similar except for one. 3. The main effects of the threeway ANOVA are based on the means at each level of one of the factors, averaging across the other two. A twoway interaction is the average of the separate twoway interactions (simple interaction effects) at each level of the third factor. A twoway interaction is based on a twoway table of means created by averaging across the third factor. 4. The error term for the threeway ANOVA, MS W, is a simple extension of the error term for a twoway ANOVA; in a balanced design, it is the simple average of all of the cell variances. All of the SS between components are found by Formula 14.3, or by subtraction using Formula There are seven F ratios that can be tested for significance: the three main effects, three twoway interactions, and the threeway interaction. 5. Averaging simple interaction effects together to create a twoway interaction is reasonable only if these effects do not differ significantly. If they do differ, followup tests usually focus on the simple interaction effects themselves or particular 2 2 interaction contrasts. If the threeway interaction is not significant, but a twoway interaction is, the significant twoway interaction is explored as in a twoway ANOVA with simple main effects or interaction contrasts. Also, when the threeway interaction is not significant, any significant main effect can be followed up in the usual way if that variable is not involved in a significant twoway interaction. 6. All three factors in a threeway ANOVA can be grouping variables (i.e., based on intact groups), but this is rare. It is more common to have just one grouping variable and compare the interaction of two experimental factors among various subgroups of the population. Of course, all three factors can involve experimental manipulations. 7. The twoway ANOVA in which both factors involve repeated measures is analyzed as a threeway ANOVA, with the different subjects serving as the levels of the third factor. The error term for each RM factor is the interaction of that factor with the subject factor; the error term for the interaction of the two RM factors is the threeway interaction. 8. In an Nway factorial ANOVA, there are 2 N 1 F ratios that can be tested. The twoway interaction is called a firstorder interaction, the threeway is a secondorder interaction, and so forth. EXERCISES 1. Imagine an experiment in which each subject is required to use his or her memories to create one emotion: either happiness, sadness, anger, or fear. Within each emotion group, half of the subjects participate in a relaxation exercise just before the emotion condition, and half do not. Finally, half the subjects in each emotion/relaxation condition are run in a dark, soundproof chamber, and the other half are run in a normally lit room. The dependent variable is the subject s systolic blood pressure when the subject signals that the emotion is fully present. The design is balanced, with a total of 128 subjects. The results of the threeway ANOVA for this hypothetical experiment are as follows: SS emotion = 223.1, SS relax = 64.4, SS dark = 31.6, SS emo rel = 167.3, SS emo dark = 51.5; SS rel dark = 127.3, and SS emo rel dark = The total sum of squares is 2,344. a. Calculate the seven F ratios, and test each for significance.
15 Cohen_Chapter22.j.qxd 8/23/02 11:56 M Page Chapter 22 ThreeWay ANOVA b. Calculate partial eta squared for each of the three main effects (use Formula 14.9). Are any of these effects at least moderate in size? 2. In this exercise there are 20 subjects in each cell of a design. The levels of the first factor (location) are urban, suburban, and rural. The levels of the second factor are no siblings, one or two siblings, and more than two siblings. The third factor has only two levels: presently married and not presently married. The dependent variable is the number of close friends that each subject reports having. The cell means are as follows: Urban Suburban Rural No Siblings Married Not Married or 2 Siblings Married Not Married or more Siblings Married Not Married a. Given that SS W equals 1,094, complete the threeway ANOVA, and present your results in a summary table. b. Draw a graph of the means for Location Number of Siblings (averaging across marital status). Describe the nature of the interaction. c. Using the means from part b, test the simple effect of number of siblings at each location. 3. Seventytwo patients with agoraphobia are randomly assigned to one of four drug conditions: SSRI (e.g., Prozac), tricyclic antidepressant (e.g., Elavil), antianxiety (e.g., Xanax), or a placebo (offered as a new drug for agoraphobia). Within each drug condition, a third of the patients are randomly assigned to each of three types of psychotherapy: psychodynamic, cognitive/behavioral, and group. The subjects are assigned so that half the subjects in each drug/therapy group are also depressed, and half are not. After 6 months of treatment, the severity of agoraphobia is measured for each subject (30 is the maximum possible phobia score); the cell means (n = 3) are as follows: a. Given that SS W equals 131, complete the threeway ANOVA, and present your results in a summary table. SSRI Tricyclic Antianxiety Placebo Psychodynamic Not Depressed Depressed Cog/Behav Not Depressed Depressed Group Not Depressed Depressed b. Draw a graph of the cell means, with separate panels for depressed and not depressed. Describe the nature of the therapy drug interaction in each panel. Does there appear to be a threeway interaction? Explain. c. Given your results in part a, describe a set of followup tests that would be justifiable. d. Optional: Test the interaction contrast that results from deleting Group therapy and the SSRI and placebo conditions from the analysis (extend the techniques of Chapter 13, Section B, and Chapter 14, Section C). 4. An industrial psychologist is studying the relation between motivation and productivity. Subjects are told to perform as many repetitions of a given clerical task as they can in a 1hour period. The dependent variable is the number of tasks correctly performed. Sixteen subjects participated in the experiment for credit toward a requirement of their introductory psychology course (credit group). Another 16 subjects were recruited from other classes and paid $10 for the hour (money group). All subjects performed a small set of similar clerical tasks as practice before the main study; in each group (credit or money) half the subjects (selected randomly) were told they had performed unusually well on the practice trials (positive feedback), and half were told they had performed poorly (negative feedback). Finally, within each of the four groups created by the manipulations just described, half of the subjects (at random) were told that performing the tasks quickly and accurately was correlated with other important job skills (self motivation), whereas the other half were told that good performance would help the experiment (other motivation). The data appear in the following table:
16 Cohen_Chapter22.j.qxd 8/23/02 11:56 M Page 703 Section B Basic Statistical Procedures 703 CREDIT SUBJECTS PAID SUBJECTS Positive Negative Positive Negative Feedback Feedback Feedback Feedback Self Other a. Perform a threeway ANOVA on the data. Test all seven F ratios for significance, and present your results in a summary table. b. Use graphs of the cell means to help you describe the pattern underlying each effect that was significant in part a. c. Based on the results in part a, what post hoc tests would be justified? 5. Imagine that subjects are matched in blocks of three based on height, weight, and other physical characteristics; six blocks are formed in this way. Then the subjects in each block are randomly assigned to three different weightloss programs. Subjects are measured before the diet, at the end of the diet program, 3 months later, and 6 months later. The results of the twoway RM ANOVA for this hypothetical experiment are given in terms of the SS components, as follows: SS diet = 403.1, SS time = 316.8, SS diet time = 52, SS diet S = 295.7, SS time S = 174.1, and SS diet time S = 230. a. Calculate the three F ratios, and test each for significance. b. Find the conservatively adjusted critical F for each test. Will any of your conclusions be affected if you do not assume that sphericity exists in the population? 6. A psychologist wants to know how both the affective valence (happy vs. sad vs. neutral) and the imageability (low, medium, high) of words affect their recall. A list of 90 words is prepared with 10 words from each combination of factors (e.g., happy, low imagery: promotion; sad, high imagery: cemetery) randomly mixed together. The number of words recalled in each category by each of the six subjects in the study is given in the following table: SAD NEUTRAL HAPPY Subject No. Low Medium High Low Medium High Low Medium High a. Perform a twoway RM ANOVA on the data. Test the three F ratios for significance, and present your results in a summary table. b. Find the conservatively adjusted critical F for each test. Will any of your conclusions be affected if you do not assume that sphericity exists in the population? c. Draw a graph of the cell means, and describe any trend toward an interaction that you can see. d. Based on the variables in this exercise, and the results in part a, what post hoc tests would be justified and meaningful? An important way in which one threefactor design can differ from another is the number of factors that involve repeated measures (or matching). The design in which none of the factors involve repeated measures was covered in Section A. The design in which all three factors are RM factors will not be covered in this text; however, the threeway RM design is a straightforward extension of the twoway RM design described at the end of Section A. This section will focus on threeway designs with either one or two RM factors (i.e., mixed designs), and it will also elaborate on the general principles of dealing with threeway ANOVAs, as introduced in Section A, and consider B BASIC STATISTICAL PROCEDIRES
17 Cohen_Chapter22.j.qxd 8/23/02 11:56 M Page Chapter 22 ThreeWay ANOVA the complexities of interactions and post hoc tests when the factors have more than two levels each. One RM Factor I will begin with a threefactor design in which there are repeated measures on only one of the factors. The ANOVA for this design is not much more complicated than the twoway mixed ANOVA described in the previous chapter for instance, there are only two different error terms. Such designs arise frequently in psychological research. One simple way to arrive at such a design is to start with a twoway ANOVA with no repeated measures. For instance, patients with two different types of anxiety disorders (generalized anxiety vs. specific phobias) are treated with two different forms of psychotherapy (psychodynamic vs. behavioral). The third factor is added by measuring the patients anxiety at several points in time (e.g., beginning of therapy, end of therapy, several months after therapy has stopped); I will refer to this factor simply as time. To illustrate the analysis of this type of design I will take the twoway ANOVA from Section B of Chapter 14 and add time as an RM factor. You may recall that that example involved four levels of sleep deprivation and three levels of stimulation. Performance was measured only once after 4 days in the sleep lab. Now imagine that performance on the simulated truck driving task is measured three times: after 2, 4, and 6 days in the sleep lab. The raw data for the threefactor study are given in Table 22.2, along with the various means we will need to graph and analyze the results; note that the data for Day 4 are identical to the data for the corresponding twoway ANOVA in Chapter 14. To see what we may expect from the results of a threeway ANOVA on these data, the cell means have been graphed so that we can look at the sleep by stimulation interaction at each time period (see Figure 22.7). You can see from Figure 22.7 that the sleep stimulation interaction, which was not quite significant for Day 4 alone (see Chapter 14, section B), increases over time, perhaps enough so as to produce a threeway interaction. We can also see that the main effects of stimulation and sleep, significant at Day 4, are likely to be significant in the threeway analysis. The general decrease in scores from Day 2 to Day 4 to Day 6 is also likely to yield a significant main effect for time. Without regraphing the data, it is hard to see whether the interactions of time with either sleep or stimulation are large or small. However, because these interactions are less interesting in the context of this experiment, I won t bother to present the two other possible sets of graphs. To present general formulas for analyzing the kind of experiment shown in Table 22.2, I will adopt the following notation. The two betweensubject factors will be labeled A and B. Of course, it is arbitrary which factor is called A and which B; in this example the sleep deprivation factor will be A, and the stimulation factor will be B. The lowercase letters a and b will stand for the number of levels of their corresponding factors in this case, 4 and 3, respectively. The withinsubject factor will be labeled R, and its number of levels, c, to be consistent with previous chapters. Let us begin with the simplest SS components: SS total, and the SSs for the numerators of each main effect. SS total is based on the total number of observations, N T, which for any balanced threeway factorial ANOVA is equal to abcn, where n is the number of different subjects in each cell of the A B table. So, N T = = 180. The biased variance obtained by entering all 180 scores is , so SS total = = 7, SS A is based
18 Cohen_Chapter22.j.qxd 8/23/02 11:56 M Page 705 Table 22.2 PLACEBO MOTIVATION CAFFEINE Subject Subject Subject Row Day 2 Day 4 Day 6 Means Day 2 Day 4 Day 6 Means Day 2 Day 4 Day 6 Means Means None AB means Jet Lag AB means Interrupt AB means Total AB means Column means
19 Cohen_Chapter22.j.qxd 8/23/02 11:56 M Page Chapter 22 ThreeWay ANOVA Figure Graph of the Cell Means in Table Motivation Caffeine Day 2 15 Placebo None JetLag Interrupt Total Caffeine Motivation Placebo None JetLag Interrupt Total 30 Day Caffeine Day 6 Motivation 10 7 Placebo 0 None JetLag Interrupt Total on the means for the four sleep deprivation levels, which can be found in the rightmost column of the table, labeled row means. SS B is based on the means for the three stimulation levels, which are found where the bottom row of the table (Column Means), intersects the columns labeled Subject Means (these are averaged over the three days, as well as the sleep levels). The means for the three different days are not in the table but can be found by averaging the three Column Means for Day 2, the three for Day 4, and similarly for Day 6. The SSs for the main effects are as follows: SS A =σ 2 (25.58, 23.51, 17.35, 16.53) 180 = = 2, SS B =σ 2 (17.77, 21.38, 23.08) 180 = = SS R =σ 2 (23.63, 21.22, 17.38) = = 1,192.0 As in Section A, we will need the SS based on the cell means, SS ABR, and the SSs for each twoway table of means: SS AB, SS AR, and SS BR. In addition, because one factor has repeated measures we will also need to find the means for each subject (averaging their scores for Day 2, Day 4, and Day 6) and the SS based on those means, SS betweensubjects.
20 Cohen_Chapter22.j.qxd 8/23/02 11:56 M Page 707 Section B Basic Statistical Procedures 707 The cell means we need for SS ABR are given in Table 22.2, under Day 2, Day 4, and Day 6, in each of the rows labeled AB Means; there are 36 of them (a b c). The biased variance of these cell means is , so SS ABR = = 5, The means for SS AB are found by averaging across the 3 days for each combination of sleep and stimulation levels and are found in the rows for AB Means under Subject Means. The biased variance of these 12 (i.e., a b) means equals , so SS AB = 3,974. The nine means for SS BR are the column means of Table 22.2, except for the columns labeled Subject Means. SS BR =σ 2 (20.3, 19.0, 14.0, 25.5, 21.0, 17.65, 25.1, 23.65, 20.5) 180 = 2, Unfortunately, there was no convenient place in Table 22.2 to put the means for SS AR. They are found by averaging the (AB) means for each day and level of sleep deprivation over the three stimulation levels. SS AR =σ 2 (27.13, 25.6, 24, 25.6, 24.47, 20.47, 21.27, 18.07, 12.73, 20.53, 16.73, 12.33) 180 = 4, Finally, we need to calculate SS betweensubjects for the 60 (a b n) subject means found in Table 22.2 under Subject Means (ignoring the entries in the rows labeled AB Means and Column Means, of course). SS betweensubjects = = 5, Now we can get the rest of the SS components we need by subtraction. The SSs for the twoway interactions are found just as in Section A from Formula 22.1a, b, and c (except that factor C has been changed to R): SS A B = SS AB SS A SS B SS A R = SS AR SS A SS R SS B R = SS BR SS B SS R Plugging in the SSs for the present example, we get SS A B = 3,974 2, = SS A R = 4, , ,192 = SS B R = 2, ,192 = The threeway interaction is found by subtracting from SS ABR the SSs for three twoway interactions and the three main effects (Formula 22.1d). SS A B R = SS ABR SS A B SS A R SS B R SS A SS B SS R SS A B R = 5, , = As in the twoway mixed design there are two different error terms. One of the error terms involves subjecttosubject variability within each group or, in the case of the present design, within each cell formed by the two betweengroup factors. This is the error component you have come to know as SS W, and I will continue to call it that. The total variability from one subject to another (averaging across the RM factor) is represented by a term we have already calculated: SS betweensubjects, or SS bets, for short. In the oneway RM ANOVA this source of variability was called the subjects factor (SS sub ), or the main effect of subjects, and because it did not play a useful role, we ignored it. In the mixed design of the previous chapter it was simply divided between SS groups and SS W. Now that we have two betweengroup factors, that source of variability can be divided into four components, as follows: SS bets = SS A + SS B + SS A B + SS W This relation can be expressed more simply as SS bets = SS AB + SS W The error portion, SS W, is found most easily by subtraction: SS W = SS bets SS AB Formula 22.3
Experimental Designs (revisited)
Introduction to ANOVA Copyright 2000, 2011, J. Toby Mordkoff Probably, the best way to start thinking about ANOVA is in terms of factors with levels. (I say this because this is how they are described
More informationUNDERSTANDING THE TWOWAY ANOVA
UNDERSTANDING THE e have seen how the oneway ANOVA can be used to compare two or more sample means in studies involving a single independent variable. This can be extended to two independent variables
More information1/27/2013. PSY 512: Advanced Statistics for Psychological and Behavioral Research 2
PSY 512: Advanced Statistics for Psychological and Behavioral Research 2 Introduce moderated multiple regression Continuous predictor continuous predictor Continuous predictor categorical predictor Understand
More informationMain Effects and Interactions
Main Effects & Interactions page 1 Main Effects and Interactions So far, we ve talked about studies in which there is just one independent variable, such as violence of television program. You might randomly
More information15. Analysis of Variance
15. Analysis of Variance A. Introduction B. ANOVA Designs C. OneFactor ANOVA (BetweenSubjects) D. MultiFactor ANOVA (BetweenSubjects) E. Unequal Sample Sizes F. Tests Supplementing ANOVA G. WithinSubjects
More informationStatistics Review PSY379
Statistics Review PSY379 Basic concepts Measurement scales Populations vs. samples Continuous vs. discrete variable Independent vs. dependent variable Descriptive vs. inferential stats Common analyses
More information" Y. Notation and Equations for Regression Lecture 11/4. Notation:
Notation: Notation and Equations for Regression Lecture 11/4 m: The number of predictor variables in a regression Xi: One of multiple predictor variables. The subscript i represents any number from 1 through
More informationRecall this chart that showed how most of our course would be organized:
Chapter 4 OneWay ANOVA Recall this chart that showed how most of our course would be organized: Explanatory Variable(s) Response Variable Methods Categorical Categorical Contingency Tables Categorical
More informationTesting Group Differences using Ttests, ANOVA, and Nonparametric Measures
Testing Group Differences using Ttests, ANOVA, and Nonparametric Measures Jamie DeCoster Department of Psychology University of Alabama 348 Gordon Palmer Hall Box 870348 Tuscaloosa, AL 354870348 Phone:
More informationChapter 7. Oneway ANOVA
Chapter 7 Oneway ANOVA Oneway ANOVA examines equality of population means for a quantitative outcome and a single categorical explanatory variable with any number of levels. The ttest of Chapter 6 looks
More informationAn analysis method for a quantitative outcome and two categorical explanatory variables.
Chapter 11 TwoWay ANOVA An analysis method for a quantitative outcome and two categorical explanatory variables. If an experiment has a quantitative outcome and two categorical explanatory variables that
More information10. Analysis of Longitudinal Studies Repeatmeasures analysis
Research Methods II 99 10. Analysis of Longitudinal Studies Repeatmeasures analysis This chapter builds on the concepts and methods described in Chapters 7 and 8 of Mother and Child Health: Research methods.
More informationTwoWay ANOVA Lab: Interactions
Name TwoWay ANOVA Lab: Interactions Perhaps the most complicated situation that you face in interpreting a twoway ANOVA is the presence of an interaction. This brief lab is intended to give you additional
More informationJanuary 26, 2009 The Faculty Center for Teaching and Learning
THE BASICS OF DATA MANAGEMENT AND ANALYSIS A USER GUIDE January 26, 2009 The Faculty Center for Teaching and Learning THE BASICS OF DATA MANAGEMENT AND ANALYSIS Table of Contents Table of Contents... i
More informationOneWay Analysis of Variance
OneWay Analysis of Variance Note: Much of the math here is tedious but straightforward. We ll skim over it in class but you should be sure to ask questions if you don t understand it. I. Overview A. We
More informationAssociation Between Variables
Contents 11 Association Between Variables 767 11.1 Introduction............................ 767 11.1.1 Measure of Association................. 768 11.1.2 Chapter Summary.................... 769 11.2 Chi
More informationSTATISTICS FOR PSYCHOLOGISTS
STATISTICS FOR PSYCHOLOGISTS SECTION: STATISTICAL METHODS CHAPTER: REPORTING STATISTICS Abstract: This chapter describes basic rules for presenting statistical results in APA style. All rules come from
More informationAnalysis of Data. Organizing Data Files in SPSS. Descriptive Statistics
Analysis of Data Claudia J. Stanny PSY 67 Research Design Organizing Data Files in SPSS All data for one subject entered on the same line Identification data Betweensubjects manipulations: variable to
More informationANOVA ANOVA. TwoWay ANOVA. OneWay ANOVA. When to use ANOVA ANOVA. Analysis of Variance. Chapter 16. A procedure for comparing more than two groups
ANOVA ANOVA Analysis of Variance Chapter 6 A procedure for comparing more than two groups independent variable: smoking status nonsmoking one pack a day > two packs a day dependent variable: number of
More informationOneWay ANOVA using SPSS 11.0. SPSS ANOVA procedures found in the Compare Means analyses. Specifically, we demonstrate
1 OneWay ANOVA using SPSS 11.0 This section covers steps for testing the difference between three or more group means using the SPSS ANOVA procedures found in the Compare Means analyses. Specifically,
More informationNCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( )
Chapter 340 Principal Components Regression Introduction is a technique for analyzing multiple regression data that suffer from multicollinearity. When multicollinearity occurs, least squares estimates
More informationTesting and Interpreting Interactions in Regression In a Nutshell
Testing and Interpreting Interactions in Regression In a Nutshell The principles given here always apply when interpreting the coefficients in a multiple regression analysis containing interactions. However,
More informationChapter 14: Repeated Measures Analysis of Variance (ANOVA)
Chapter 14: Repeated Measures Analysis of Variance (ANOVA) First of all, you need to recognize the difference between a repeated measures (or dependent groups) design and the between groups (or independent
More information9. Sampling Distributions
9. Sampling Distributions Prerequisites none A. Introduction B. Sampling Distribution of the Mean C. Sampling Distribution of Difference Between Means D. Sampling Distribution of Pearson's r E. Sampling
More informationIntroduction to. Hypothesis Testing CHAPTER LEARNING OBJECTIVES. 1 Identify the four steps of hypothesis testing.
Introduction to Hypothesis Testing CHAPTER 8 LEARNING OBJECTIVES After reading this chapter, you should be able to: 1 Identify the four steps of hypothesis testing. 2 Define null hypothesis, alternative
More informationExploratory data analysis (Chapter 2) Fall 2011
Exploratory data analysis (Chapter 2) Fall 2011 Data Examples Example 1: Survey Data 1 Data collected from a Stat 371 class in Fall 2005 2 They answered questions about their: gender, major, year in school,
More informationHow to Get More Value from Your Survey Data
Technical report How to Get More Value from Your Survey Data Discover four advanced analysis techniques that make survey research more effective Table of contents Introduction..............................................................2
More informationCase Study in Data Analysis Does a drug prevent cardiomegaly in heart failure?
Case Study in Data Analysis Does a drug prevent cardiomegaly in heart failure? Harvey Motulsky hmotulsky@graphpad.com This is the first case in what I expect will be a series of case studies. While I mention
More information2. Simple Linear Regression
Research methods  II 3 2. Simple Linear Regression Simple linear regression is a technique in parametric statistics that is commonly used for analyzing mean response of a variable Y which changes according
More informationThe F distribution and the basic principle behind ANOVAs. Situating ANOVAs in the world of statistical tests
Tutorial The F distribution and the basic principle behind ANOVAs Bodo Winter 1 Updates: September 21, 2011; January 23, 2014; April 24, 2014; March 2, 2015 This tutorial focuses on understanding rather
More informationHow To Run Statistical Tests in Excel
How To Run Statistical Tests in Excel Microsoft Excel is your best tool for storing and manipulating data, calculating basic descriptive statistics such as means and standard deviations, and conducting
More information1 Theory: The General Linear Model
QMIN GLM Theory  1.1 1 Theory: The General Linear Model 1.1 Introduction Before digital computers, statistics textbooks spoke of three procedures regression, the analysis of variance (ANOVA), and the
More informationChapter 9. TwoSample Tests. Effect Sizes and Power Paired t Test Calculation
Chapter 9 TwoSample Tests Paired t Test (Correlated Groups t Test) Effect Sizes and Power Paired t Test Calculation Summary Independent t Test Chapter 9 Homework Power and TwoSample Tests: Paired Versus
More informationUsing Excel for Statistical Analysis
Using Excel for Statistical Analysis You don t have to have a fancy pants statistics package to do many statistical functions. Excel can perform several statistical tests and analyses. First, make sure
More informationThis chapter will demonstrate how to perform multiple linear regression with IBM SPSS
CHAPTER 7B Multiple Regression: Statistical Methods Using IBM SPSS This chapter will demonstrate how to perform multiple linear regression with IBM SPSS first using the standard method and then using the
More information4.1 Exploratory Analysis: Once the data is collected and entered, the first question is: "What do the data look like?"
Data Analysis Plan The appropriate methods of data analysis are determined by your data types and variables of interest, the actual distribution of the variables, and the number of cases. Different analyses
More informationProfile analysis is the multivariate equivalent of repeated measures or mixed ANOVA. Profile analysis is most commonly used in two cases:
Profile Analysis Introduction Profile analysis is the multivariate equivalent of repeated measures or mixed ANOVA. Profile analysis is most commonly used in two cases: ) Comparing the same dependent variables
More informationSimple Tricks for Using SPSS for Windows
Simple Tricks for Using SPSS for Windows Chapter 14. Followup Tests for the TwoWay Factorial ANOVA The Interaction is Not Significant If you have performed a twoway ANOVA using the General Linear Model,
More informationUsing Excel (Microsoft Office 2007 Version) for Graphical Analysis of Data
Using Excel (Microsoft Office 2007 Version) for Graphical Analysis of Data Introduction In several upcoming labs, a primary goal will be to determine the mathematical relationship between two variable
More informationStudy Guide for the Final Exam
Study Guide for the Final Exam When studying, remember that the computational portion of the exam will only involve new material (covered after the second midterm), that material from Exam 1 will make
More informationRevised Version of Chapter 23. We learned long ago how to solve linear congruences. ax c (mod m)
Chapter 23 Squares Modulo p Revised Version of Chapter 23 We learned long ago how to solve linear congruences ax c (mod m) (see Chapter 8). It s now time to take the plunge and move on to quadratic equations.
More informationEXPONENTS. To the applicant: KEY WORDS AND CONVERTING WORDS TO EQUATIONS
To the applicant: The following information will help you review math that is included in the Paraprofessional written examination for the Conejo Valley Unified School District. The Education Code requires
More information6.4 Normal Distribution
Contents 6.4 Normal Distribution....................... 381 6.4.1 Characteristics of the Normal Distribution....... 381 6.4.2 The Standardized Normal Distribution......... 385 6.4.3 Meaning of Areas under
More informationCOMPARISONS OF CUSTOMER LOYALTY: PUBLIC & PRIVATE INSURANCE COMPANIES.
277 CHAPTER VI COMPARISONS OF CUSTOMER LOYALTY: PUBLIC & PRIVATE INSURANCE COMPANIES. This chapter contains a full discussion of customer loyalty comparisons between private and public insurance companies
More informationAP Statistics 2011 Scoring Guidelines
AP Statistics 2011 Scoring Guidelines The College Board The College Board is a notforprofit membership association whose mission is to connect students to college success and opportunity. Founded in
More informationTwoWay Independent ANOVA Using SPSS
TwoWay Independent ANOVA Using SPSS Introduction Up to now we have looked only at situations in which a single independent variable was manipulated. Now we move onto more complex designs in which more
More informationData Analysis Tools. Tools for Summarizing Data
Data Analysis Tools This section of the notes is meant to introduce you to many of the tools that are provided by Excel under the Tools/Data Analysis menu item. If your computer does not have that tool
More informationData Analysis in SPSS. February 21, 2004. If you wish to cite the contents of this document, the APA reference for them would be
Data Analysis in SPSS Jamie DeCoster Department of Psychology University of Alabama 348 Gordon Palmer Hall Box 870348 Tuscaloosa, AL 354870348 Heather Claypool Department of Psychology Miami University
More informationMeasurement and Measurement Scales
Measurement and Measurement Scales Measurement is the foundation of any scientific investigation Everything we do begins with the measurement of whatever it is we want to study Definition: measurement
More informationRandomized Block Analysis of Variance
Chapter 565 Randomized Block Analysis of Variance Introduction This module analyzes a randomized block analysis of variance with up to two treatment factors and their interaction. It provides tables of
More informationindividualdifferences
1 Simple ANalysis Of Variance (ANOVA) Oftentimes we have more than two groups that we want to compare. The purpose of ANOVA is to allow us to compare group means from several independent samples. In general,
More information10. Comparing Means Using Repeated Measures ANOVA
10. Comparing Means Using Repeated Measures ANOVA Objectives Calculate repeated measures ANOVAs Calculate effect size Conduct multiple comparisons Graphically illustrate mean differences Repeated measures
More informationBasic Concepts in Research and Data Analysis
Basic Concepts in Research and Data Analysis Introduction: A Common Language for Researchers...2 Steps to Follow When Conducting Research...3 The Research Question... 3 The Hypothesis... 4 Defining the
More informationBiostatistics: DESCRIPTIVE STATISTICS: 2, VARIABILITY
Biostatistics: DESCRIPTIVE STATISTICS: 2, VARIABILITY 1. Introduction Besides arriving at an appropriate expression of an average or consensus value for observations of a population, it is important to
More informationThe Math. P (x) = 5! = 1 2 3 4 5 = 120.
The Math Suppose there are n experiments, and the probability that someone gets the right answer on any given experiment is p. So in the first example above, n = 5 and p = 0.2. Let X be the number of correct
More informationSPSS Explore procedure
SPSS Explore procedure One useful function in SPSS is the Explore procedure, which will produce histograms, boxplots, stemandleaf plots and extensive descriptive statistics. To run the Explore procedure,
More informationMinitab Tutorials for Design and Analysis of Experiments. Table of Contents
Table of Contents Introduction to Minitab...2 Example 1 OneWay ANOVA...3 Determining Sample Size in Oneway ANOVA...8 Example 2 Twofactor Factorial Design...9 Example 3: Randomized Complete Block Design...14
More information1.5 Oneway Analysis of Variance
Statistics: Rosie Cornish. 200. 1.5 Oneway Analysis of Variance 1 Introduction Oneway analysis of variance (ANOVA) is used to compare several means. This method is often used in scientific or medical experiments
More informationAnalysis of Variance. MINITAB User s Guide 2 31
3 Analysis of Variance Analysis of Variance Overview, 32 OneWay Analysis of Variance, 35 TwoWay Analysis of Variance, 311 Analysis of Means, 313 Overview of Balanced ANOVA and GLM, 318 Balanced
More informationReporting Statistics in Psychology
This document contains general guidelines for the reporting of statistics in psychology research. The details of statistical reporting vary slightly among different areas of science and also among different
More informationPRINCIPAL COMPONENT ANALYSIS
1 Chapter 1 PRINCIPAL COMPONENT ANALYSIS Introduction: The Basics of Principal Component Analysis........................... 2 A Variable Reduction Procedure.......................................... 2
More informationChapter 6: The Information Function 129. CHAPTER 7 Test Calibration
Chapter 6: The Information Function 129 CHAPTER 7 Test Calibration 130 Chapter 7: Test Calibration CHAPTER 7 Test Calibration For didactic purposes, all of the preceding chapters have assumed that the
More informationExperiment #1, Analyze Data using Excel, Calculator and Graphs.
Physics 182  Fall 2014  Experiment #1 1 Experiment #1, Analyze Data using Excel, Calculator and Graphs. 1 Purpose (5 Points, Including Title. Points apply to your lab report.) Before we start measuring
More informationUNDERSTANDING ANALYSIS OF COVARIANCE (ANCOVA)
UNDERSTANDING ANALYSIS OF COVARIANCE () In general, research is conducted for the purpose of explaining the effects of the independent variable on the dependent variable, and the purpose of research design
More informationMINITAB ASSISTANT WHITE PAPER
MINITAB ASSISTANT WHITE PAPER This paper explains the research conducted by Minitab statisticians to develop the methods and data checks used in the Assistant in Minitab 17 Statistical Software. OneWay
More informationMath Review. for the Quantitative Reasoning Measure of the GRE revised General Test
Math Review for the Quantitative Reasoning Measure of the GRE revised General Test www.ets.org Overview This Math Review will familiarize you with the mathematical skills and concepts that are important
More informationChi Square Distribution
17. Chi Square A. Chi Square Distribution B. OneWay Tables C. Contingency Tables D. Exercises Chi Square is a distribution that has proven to be particularly useful in statistics. The first section describes
More informationModule 3: Correlation and Covariance
Using Statistical Data to Make Decisions Module 3: Correlation and Covariance Tom Ilvento Dr. Mugdim Pašiƒ University of Delaware Sarajevo Graduate School of Business O ften our interest in data analysis
More informationOneWay Analysis of Variance (ANOVA) Example Problem
OneWay Analysis of Variance (ANOVA) Example Problem Introduction Analysis of Variance (ANOVA) is a hypothesistesting technique used to test the equality of two or more population (or treatment) means
More informationHow to Make APA Format Tables Using Microsoft Word
How to Make APA Format Tables Using Microsoft Word 1 I. Tables vs. Figures  See APA Publication Manual p. 147175 for additional details  Tables consist of words and numbers where spatial relationships
More informationSydney Roberts Predicting Age Group Swimmers 50 Freestyle Time 1. 1. Introduction p. 2. 2. Statistical Methods Used p. 5. 3. 10 and under Males p.
Sydney Roberts Predicting Age Group Swimmers 50 Freestyle Time 1 Table of Contents 1. Introduction p. 2 2. Statistical Methods Used p. 5 3. 10 and under Males p. 8 4. 11 and up Males p. 10 5. 10 and under
More informationChapter Seven. Multiple regression An introduction to multiple regression Performing a multiple regression on SPSS
Chapter Seven Multiple regression An introduction to multiple regression Performing a multiple regression on SPSS Section : An introduction to multiple regression WHAT IS MULTIPLE REGRESSION? Multiple
More informationChapter 7 Section 7.1: Inference for the Mean of a Population
Chapter 7 Section 7.1: Inference for the Mean of a Population Now let s look at a similar situation Take an SRS of size n Normal Population : N(, ). Both and are unknown parameters. Unlike what we used
More informationSimple Random Sampling
Source: Frerichs, R.R. Rapid Surveys (unpublished), 2008. NOT FOR COMMERCIAL DISTRIBUTION 3 Simple Random Sampling 3.1 INTRODUCTION Everyone mentions simple random sampling, but few use this method for
More informationRules for Sources of Variance, Degrees of Freedom, and F ratios
1 Rules for Sources of Variance, Degrees of Freedom, and F ratios 1. If necessary, identify the subsections of the table. In the source column list each factor, including subjects (S) (a) If there are
More informationEffect size and eta squared James Dean Brown (University of Hawai i at Manoa)
Shiken: JALT Testing & Evaluation SIG Newsletter. 1 () April 008 (p. 3843) Statistics Corner Questions and answers about language testing statistics: Effect size and eta squared James Dean Brown (University
More informationWhen to use Excel. When NOT to use Excel 9/24/2014
Analyzing Quantitative Assessment Data with Excel October 2, 2014 Jeremy Penn, Ph.D. Director When to use Excel You want to quickly summarize or analyze your assessment data You want to create basic visual
More informationCHAPTER 13. Experimental Design and Analysis of Variance
CHAPTER 13 Experimental Design and Analysis of Variance CONTENTS STATISTICS IN PRACTICE: BURKE MARKETING SERVICES, INC. 13.1 AN INTRODUCTION TO EXPERIMENTAL DESIGN AND ANALYSIS OF VARIANCE Data Collection
More informationBelow is a very brief tutorial on the basic capabilities of Excel. Refer to the Excel help files for more information.
Excel Tutorial Below is a very brief tutorial on the basic capabilities of Excel. Refer to the Excel help files for more information. Working with Data Entering and Formatting Data Before entering data
More informationThe Dummy s Guide to Data Analysis Using SPSS
The Dummy s Guide to Data Analysis Using SPSS Mathematics 57 Scripps College Amy Gamble April, 2001 Amy Gamble 4/30/01 All Rights Rerserved TABLE OF CONTENTS PAGE Helpful Hints for All Tests...1 Tests
More informationData analysis process
Data analysis process Data collection and preparation Collect data Prepare codebook Set up structure of data Enter data Screen data for errors Exploration of data Descriptive Statistics Graphs Analysis
More information4. Descriptive Statistics: Measures of Variability and Central Tendency
4. Descriptive Statistics: Measures of Variability and Central Tendency Objectives Calculate descriptive for continuous and categorical data Edit output tables Although measures of central tendency and
More informationTwosample hypothesis testing, II 9.07 3/16/2004
Twosample hypothesis testing, II 9.07 3/16/004 Small sample tests for the difference between two independent means For twosample tests of the difference in mean, things get a little confusing, here,
More informationHLM software has been one of the leading statistical packages for hierarchical
Introductory Guide to HLM With HLM 7 Software 3 G. David Garson HLM software has been one of the leading statistical packages for hierarchical linear modeling due to the pioneering work of Stephen Raudenbush
More informationSTATISTICA. Clustering Techniques. Case Study: Defining Clusters of Shopping Center Patrons. and
Clustering Techniques and STATISTICA Case Study: Defining Clusters of Shopping Center Patrons STATISTICA Solutions for Business Intelligence, Data Mining, Quality Control, and Webbased Analytics Table
More informationSection 13, Part 1 ANOVA. Analysis Of Variance
Section 13, Part 1 ANOVA Analysis Of Variance Course Overview So far in this course we ve covered: Descriptive statistics Summary statistics Tables and Graphs Probability Probability Rules Probability
More informationIntroduction to Analysis of Variance (ANOVA) Limitations of the ttest
Introduction to Analysis of Variance (ANOVA) The Structural Model, The Summary Table, and the One Way ANOVA Limitations of the ttest Although the ttest is commonly used, it has limitations Can only
More information8. MultiFactor Designs. Chapter 8. Experimental Design II: Factorial Designs
8. MultiFactor Designs Chapter 8. Experimental Design II: Factorial Designs 1 Goals Identify, describe and create multifactor (a.k.a. factorial ) designs Identify and interpret main effects and interaction
More informationReview of Fundamental Mathematics
Review of Fundamental Mathematics As explained in the Preface and in Chapter 1 of your textbook, managerial economics applies microeconomic theory to business decision making. The decisionmaking tools
More informationLecture 14. Chapter 7: Probability. Rule 1: Rule 2: Rule 3: Nancy Pfenning Stats 1000
Lecture 4 Nancy Pfenning Stats 000 Chapter 7: Probability Last time we established some basic definitions and rules of probability: Rule : P (A C ) = P (A). Rule 2: In general, the probability of one event
More informationSection 14 Simple Linear Regression: Introduction to Least Squares Regression
Slide 1 Section 14 Simple Linear Regression: Introduction to Least Squares Regression There are several different measures of statistical association used for understanding the quantitative relationship
More informationPURSUITS IN MATHEMATICS often produce elementary functions as solutions that need to be
Fast Approximation of the Tangent, Hyperbolic Tangent, Exponential and Logarithmic Functions 2007 Ron Doerfler http://www.myreckonings.com June 27, 2007 Abstract There are some of us who enjoy using our
More informationContrast Coding in Multiple Regression Analysis: Strengths, Weaknesses, and Utility of Popular Coding Structures
Journal of Data Science 8(2010), 6173 Contrast Coding in Multiple Regression Analysis: Strengths, Weaknesses, and Utility of Popular Coding Structures Matthew J. Davis Texas A&M University Abstract: The
More informationCOLLEGE ALGEBRA. Paul Dawkins
COLLEGE ALGEBRA Paul Dawkins Table of Contents Preface... iii Outline... iv Preliminaries... Introduction... Integer Exponents... Rational Exponents... 9 Real Exponents...5 Radicals...6 Polynomials...5
More informationImportant Financial Concepts
Part 2 Important Financial Concepts Chapter 4 Time Value of Money Chapter 5 Risk and Return Chapter 6 Interest Rates and Bond Valuation Chapter 7 Stock Valuation 130 LG1 LG2 LG3 LG4 LG5 LG6 Chapter 4 Time
More informationMultivariate Analysis of Variance (MANOVA)
Multivariate Analysis of Variance (MANOVA) Aaron French, Marcelo Macedo, John Poulsen, Tyler Waterson and Angela Yu Keywords: MANCOVA, special cases, assumptions, further reading, computations Introduction
More informationINTRODUCTION TO MULTIPLE CORRELATION
CHAPTER 13 INTRODUCTION TO MULTIPLE CORRELATION Chapter 12 introduced you to the concept of partialling and how partialling could assist you in better interpreting the relationship between two primary
More informationHYPOTHESIS TESTING: CONFIDENCE INTERVALS, TTESTS, ANOVAS, AND REGRESSION
HYPOTHESIS TESTING: CONFIDENCE INTERVALS, TTESTS, ANOVAS, AND REGRESSION HOD 2990 10 November 2010 Lecture Background This is a lightning speed summary of introductory statistical methods for senior undergraduate
More informationCompensation Basics  Bagwell. Compensation Basics. C. Bruce Bagwell MD, Ph.D. Verity Software House, Inc.
Compensation Basics C. Bruce Bagwell MD, Ph.D. Verity Software House, Inc. 2003 1 Intrinsic or Autofluorescence p2 ac 1,2 c 1 ac 1,1 p1 In order to describe how the general process of signal crossover
More informationCHAPTER 11 CHISQUARE AND F DISTRIBUTIONS
CHAPTER 11 CHISQUARE AND F DISTRIBUTIONS CHISQUARE TESTS OF INDEPENDENCE (SECTION 11.1 OF UNDERSTANDABLE STATISTICS) In chisquare tests of independence we use the hypotheses. H0: The variables are independent
More information