I L L I N O I S UNIVERSITY OF ILLINOIS AT URBANA-CHAMPAIGN

Size: px
Start display at page:

Download "I L L I N O I S UNIVERSITY OF ILLINOIS AT URBANA-CHAMPAIGN"

Transcription

1 : Part 2 Profile Analsysis and 2 way Edps/Soc 584 and Psych 594 Applied Multivariate Statistics Carolyn J. Anderson Department of Educational Psychology I L L I N O I S UNIVERSITY OF ILLINOIS AT URBANA-CHAMPAIGN c Board of Trustees, University of Illinois : Part 2 Slide 1 of 75

2 Outline Extensions of 1 Way : Two groups Two or more groups Two-Way 2 way Review 2-way ANOVA 2-way Example Reading: Johnson & Wichern pages xx xx. : Part 2 Slide 2 of 75

3 Example of Profiles for Two Groups Two-Way Profile analysis is an extension of 1 way involving p response variables administered to g groups of individuals or cases. e.g., A battery of personality tests (sub-scales) where the p test scores are measured in the same units (or at least similar or commensurate units). gives you more specific hypotheses to test than the standard H o : µ 1 = µ 2 = = µ g. Consider p 1 mean vector for each group µ l = (µ l1,µ l2,...,µ lp ) and plot the means as profiles for each group. : Part 2 Slide 3 of 75

4 Example of Profiles for Two Groups 15 Example of Profiles for Two Groups Mean Value 10 5 Group 1 Group 2 Two-Way One Two Three...p Variable : Part 2 Slide 4 of 75

5 : Question 2 Question 3 Testing Question 1: Parallel Question 1 continued Finishing Question 1 Testing Question 2: Coincident profiles Question 2 continued Finishing Up Question 2 Question 3: Flat Finishing up Question 3 Example: WAIS data WAIS Profiles : The usual null hypothesis is H o : µ 1 = µ 2, but in profile analysis, we ll also consider both a more general and more restrictive hypotheses. The Three Questions (Steps): Question 1: Are the profiles parallel? or Are successive (adjacent) differences between means equal? i.e., H o1 : µ 1i µ 1,i 1 = µ 2i µ 2,i 1 for all i = 2,...,p Mean Value µ 11 µ 21 µ 12 µ Variables Group 1 Group 2 : Part 2 Slide 5 of 75

6 Question 2 Assuming that profiles are parallel, are they coincident? or Group 1 : Question 2 Question 3 Testing Question 1: Parallel Question 1 continued Finishing Question 1 Testing Question 2: Coincident profiles Question 2 continued Finishing Up Question 2 Question 3: Flat Finishing up Question 3 Example: WAIS data WAIS Profiles Are the population means equal for the two groups? i.e., H o2 : µ 1i = µ 2i for all i = 1,...p; that is, H o2: µ 1 = µ 2 Mean Value µ 21 = µ 22 = µ 11 µ 12 Group Variables : Part 2 Slide 6 of 75

7 Question 3 : Question 2 Question 3 Testing Question 1: Parallel Question 1 continued Finishing Question 1 Testing Question 2: Coincident profiles Question 2 continued Finishing Up Question 2 Question 3: Flat Finishing up Question 3 Example: WAIS data WAIS Profiles Assuming that the profiles are coincident, are they Level (flat)? or Are all the means equal? (over groups and variables). H o3 : µ 11 = µ 12 = = µ 1p = µ 21 = µ 22 = = µ 2p Mean Value Group 1 µ µ 1i 2i = Group Variables : Part 2 Slide 7 of 75

8 Testing Question 1: Parallel Assuming X 1j N(µ 1,Σ) and X 2j N(µ 2,Σ) : Question 2 Question 3 Testing Question 1: Parallel Question 1 continued Finishing Question 1 Testing Question 2: Coincident profiles Question 2 continued Finishing Up Question 2 Question 3: Flat Finishing up Question 3 Example: WAIS data WAIS Profiles for j = 1,...,n l, and independent. Question 1: Are the profiles parallel? We can write the hypothesis are H o1 : Cµ 1 = Cµ 2 where C (p 1) p contrast matrix. e.g., C (p 1) p = What we re doing is linearly transforming our original p variables into (p 1) new variables. : Part 2 Slide 8 of 75

9 : Question 2 Question 3 Testing Question 1: Parallel Question 1 continued Finishing Question 1 Testing Question 2: Coincident profiles Question 2 continued Finishing Up Question 2 Question 3: Flat Finishing up Question 3 Example: WAIS data WAIS Profiles Question 1 continued When we take CX = X 1 +X 2 X 2 +X 3... X p 1 +X p, which are measured on both groups (populations, etc). Random variable CX 1j N p 1 (Cµ 1,CΣC ), and Random variable CX 2j N p 1 (Cµ 2,CΣC ). To estimate the covariance matrix Σ, use S pool, S pool (n 1 1)S 1 +(n 2 1)S 2 n 1 +n 2 2 and the estimate of CΣC equals CS pool C. : Part 2 Slide 9 of 75

10 Finishing Question 1 To test H o1 : Cµ 1 Cµ 2 = C(µ 1 µ 2 ) = 0, we can just use Hotelling s T 2 for Two independent samples. : Question 2 Question 3 Testing Question 1: Parallel Question 1 continued Finishing Question 1 Testing Question 2: Coincident profiles Question 2 continued Finishing Up Question 2 Question 3: Flat Finishing up Question 3 Example: WAIS data WAIS Profiles Reject H o1 if [( T 2 = ( X 1 X 1 2 ) C + 1 ) ] 1 CS pool C C( n 1 n X 1 X 2 ) > c 2 2 where c 2 = (n 1 +n 2 2)(p 1) n 1 +n 2 p F (p 1),(n1 +n 2 p)(α) If you reject H o1 STOP.... You can do any follow-up to examine differences. If you retain H o2 Conclude profiles are parallel and PROCEED to next question. : Part 2 Slide 10 of 75

11 Testing Question 2: Coincident profiles Assuming that the the profiles are parallel, are the profiles coincident? If profiles are parallel, then one will be above the other for all i = 1,...,p; that is, : Question 2 Question 3 Testing Question 1: Parallel Question 1 continued Finishing Question 1 Testing Question 2: Coincident profiles Question 2 continued Finishing Up Question 2 Question 3: Flat Finishing up Question 3 Example: WAIS data WAIS Profiles µ 1j > µ 2j for all i = 1,...,p or µ 1j < µ 2j for all i = 1,...,p So, profiles will be coincident only if the total heights are equal (µ 11 +µ µ 1p ) = (µ 21 +µ µ 2p ) 1 pµ 1 = 1 pµ 2 where 1 p is a (p 1) vector of ones. : Part 2 Slide 11 of 75

12 Question 2 continued The null hypothesis for question 2 is H o2 : 1 µ 1 = 1 µ 2 : Question 2 Question 3 Testing Question 1: Parallel Question 1 continued Finishing Question 1 Testing Question 2: Coincident profiles Question 2 continued Finishing Up Question 2 Question 3: Flat Finishing up Question 3 Example: WAIS data WAIS Profiles We are forming a new variable, 1 X, which is a simple sum and test whether this variable is equal for the two groups. We estimate 1 X 1j N 1 (1 µ 1,1 Σ1) j = 1,...,n 1 1 X 2j N 1 (1 µ 2,1 Σ1) j = 1,...,n 2 1 µ 1 by 1 x 1 1 µ 2 by 1 x 2 and 1 Σ1 by 1 S pool 1 : Part 2 Slide 12 of 75

13 Finishing Up Question 2 : Question 2 Question 3 Testing Question 1: Parallel Question 1 continued Finishing Question 1 Testing Question 2: Coincident profiles Question 2 continued Finishing Up Question 2 Question 3: Flat Finishing up Question 3 Example: WAIS data WAIS Profiles To test H o2: 1 µ 1 = 1 µ 2, we can do a simple univariate 2 independent sample t-test. We will reject H o2 at the α-level if t = 1 ( x 1 x 2 ) ( > t n1 +n 2 2(α) 1 n n 2 )1 S pool 1 If Reject H o2 STOP and conclude that the profiles are parallel but not coincident. If Retain H o2 PROCEED and test whether profiles are flat. : Part 2 Slide 13 of 75

14 Question 3: Flat If the profiles are coincident, do they all have the same mean? i.e., : Question 2 Question 3 Testing Question 1: Parallel Question 1 continued Finishing Question 1 Testing Question 2: Coincident profiles Question 2 continued Finishing Up Question 2 Question 3: Flat Finishing up Question 3 Example: WAIS data WAIS Profiles H o3 : µ 11 = µ 12 = = µ 1p = µ 21 = µ 22 = = µ 2p We can test this using a contrast matrix C, such as the one we used for Question 1 (testing parallel profiles) H o3 : C(µ 1 +µ 2 ) = 0 Note that we are adding rather than subtracting as we did in H o1. This tests (µ 12 µ 11 )+(µ 22 µ 21 ) (µ 13 µ 12 )+(µ 23 µ 22 ). (µ 1p µ 1,p 1 )+(µ 2p µ 2,p 1 ) Cµ 1 +Cµ 2 = 0 = 0 : Part 2 Slide 14 of 75

15 Question 3 continued Testing : Question 2 Question 3 Testing Question 1: Parallel Question 1 continued Finishing Question 1 Testing Question 2: Coincident profiles Question 2 continued Finishing Up Question 2 Question 3: Flat Finishing up Question 3 Example: WAIS data WAIS Profiles Cµ 1 +Cµ 2 = 0 (µ 12 µ 11 )+(µ 22 µ 21 ) (µ 13 µ 12 )+(µ 23 µ 22 ). = 0 (µ 1p µ 1,p 1 )+(µ 2p µ 2,p 1 ) Is the same as this, which is what we want to test, (µ 12 +µ 22 ) = (µ 11 +µ 21 ) (µ 13 +µ 23 ) = (µ 12 +µ 22 ). (µ 1p +µ 2p ) = (µ 1,p 1 +µ 2,p 1 ) Because If the profiles are coincident, then µ 1i = µ 2i for all i = 1,...,p. So if H o3 is true, then C(µ 1 +µ 2 ) = 0.. : Part 2 Slide 15 of 75

16 Question 3 continued For H o3 : C(µ 1 +µ 2 ) = 0, flat profiles, we are taking linear combinations in two ways simultaneously Over variables via the use of C. : Question 2 Question 3 Testing Question 1: Parallel Question 1 continued Finishing Question 1 Testing Question 2: Coincident profiles Question 2 continued Finishing Up Question 2 Question 3: Flat Finishing up Question 3 Example: WAIS data WAIS Profiles Over groups via an additive function. The mean C(µ 1 +µ 2 ) is estimated by taking the grand mean n1 j=1 X = X 1j + n 2 j=1 X 2j = n X 1 1 +n 2 X2 n 1 +n 2 n 1 +n 2 and the distribution is of C X is C X N p 1 ( C(µ 1 +µ 2 ),CΣC ( How do we get this? 1 n 1 +n 2 )) : Part 2 Slide 16 of 75

17 Finishing up Question 3 : Question 2 Question 3 Testing Question 1: Parallel Question 1 continued Finishing Question 1 Testing Question 2: Coincident profiles Question 2 continued Finishing Up Question 2 Question 3: Flat Finishing up Question 3 Example: WAIS data WAIS Profiles Reject H o3 : C(µ 1 +µ 2 ) = 0 (i.e., that profiles are flat) at α-level if (n 1 +n 2 ) X C (CSC) 1 C X > (n 1 +n 2 1)(p 1) n 1 +n 2 p+1 where S is the total sample covariance matrix 1 n 1 S = (X 1j n 1 +n 2 1 X)(X 1j X) + j=1 n 2 j=1 Note that we use the total sample mean in computing S. F (p 1),(n1 +n 2 p+1(α) (X 2j X)(X 2j X) : Part 2 Slide 17 of 75

18 Example: WAIS data This example is from Morrison (2005). : Question 2 Question 3 Testing Question 1: Parallel Question 1 continued Finishing Question 1 Testing Question 2: Coincident profiles Question 2 continued Finishing Up Question 2 Question 3: Flat Finishing up Question 3 Example: WAIS data WAIS Profiles Fourty nine elderly men participating in an interdisciplinary study of human aging were classified into the diagnostic categories senile factor present and no senile factor on the basis of an intensive psychiatric examination. The Wechsler Adult Intelligence Scale (WAIS) was administered to all subjects by an independent investigator. Below are mean scores by group on some of the WAIS subtests. Not Senile (n = 37) Senile (n = 12) Sub-Test x std dev x std dev Information Similarities Note: Arithmetic Picture My results differ slightly from Morrison s. There is either a typo in the text or in the data set. (no way to find out which). : Part 2 Slide 18 of 75

19 WAIS Profiles 15 : Question 2 Question 3 Testing Question 1: Parallel Question 1 continued Finishing Question 1 Testing Question 2: Coincident profiles Question 2 continued Finishing Up Question 2 Question 3: Flat Finishing up Question 3 Example: WAIS data WAIS Profiles Sub-test Group Mean 10 5 Picture completion Arithmetic Similarities Information WAIS Sub-Tests No Senile Factor Senile Factor : Part 2 Slide 19 of 75

20 WAIS: Are Profiles Parallel? The within group (pooled) sample covariance matrix: : Question 2 Question 3 Testing Question 1: Parallel Question 1 continued Finishing Question 1 Testing Question 2: Coincident profiles Question 2 continued Finishing Up Question 2 Question 3: Flat Finishing up Question 3 Example: WAIS data WAIS Profiles S pool = = 1 n 1 +n 2 2 ((n 1 1)S 1 +(n 2 1)S 2 ) Test for parallel profiles: The hypothesis is that H o : µ 1i µ 1,i 1 = µ 2,i µ 2,i 1 for i = 2,3,4; that is, simultaneously, the following three qualities hold in the population, µ 12 µ 11 = µ 22 µ 21 µ 13 µ 12 = µ 23 µ 22 µ 14 µ 13 = µ 24 µ 23 : Part 2 Slide 20 of 75

21 WAIS: Are Profiles Parallel? : Question 2 Question 3 Testing Question 1: Parallel Question 1 continued Finishing Question 1 Testing Question 2: Coincident profiles Question 2 continued Finishing Up Question 2 Question 3: Flat Finishing up Question 3 Example: WAIS data WAIS Profiles To test this hypothesis, we need a (3 4) contrast matrix C C = Our hypothesis can now be,expressed as H o : Cµ 1 = Cµ 2, or H o : C(µ 1 µ 2 ) = 0. This hypothesis is tested by Hotelling s T 2, and we need and CS pool C = ( x n x y ) = (3.818,4.153,3.014,3.223), (C( x n x y )) = (0.336, 1.140,0.209), : Part 2 Slide 21 of 75

22 WAIS: Are Profiles Parallel? Putting all of this into our equation for T 2 gives us : Question 2 Question 3 Testing Question 1: Parallel Question 1 continued Finishing Question 1 Testing Question 2: Coincident profiles Question 2 continued Finishing Up Question 2 Question 3: Flat Finishing up Question 3 Example: WAIS data WAIS Profiles T 2 n 1 n 2 = ( x 1 x 2 ) C (CS pool C ) 1 C( x 1 x 2 ) n 1 +n 2 = and compare the following statistic to the F distribution, F = n 1 +n 2 p (n 1 +n 2 2)(p 1) T2 = = (47)(3) Since F 3,45 (α =.05) = is greater than our observed statistic (or the p-value of F equals.76), we do not reject the hypothesis that the profiles are parallel. Since we Retained the null, we will PROCEED to test whether the profiles are coincident. : Part 2 Slide 22 of 75

23 WAIS: Are Profiles Test for Equal (coincident) Profiles. We now test H o : µ 1i = µ 2i for i = 1,2,3,4 variables. Since we concluded that the profiles are parallel, then we can test this hypothesis by testing whether : Question 2 Question 3 Testing Question 1: Parallel Question 1 continued Finishing Question 1 Testing Question 2: Coincident profiles Question 2 continued Finishing Up Question 2 Question 3: Flat Finishing up Question 3 Example: WAIS data WAIS Profiles (µ 11 +µ 12 +µ 13 +µ 14 ) = (µ 21 +µ 22 +µ 23 +µ 24 ) 1 µ 1 = 1 µ 2 where 1 is a (4 1) vector of ones. This is just a univariate, 2 independent sample t. To test H o : 1 (µ 1 µ 2 ) = 0, we need for each group the sums of the means over all variables, which are 1 x 1 = and 1 x 2 = We also need an estimate of the variance of the difference between means, 1 S pool 1 = : Part 2 Slide 23 of 75

24 WAIS: Are Profiles : Question 2 Question 3 Testing Question 1: Parallel Question 1 continued Finishing Question 1 Testing Question 2: Coincident profiles Question 2 continued Finishing Up Question 2 Question 3: Flat Finishing up Question 3 Example: WAIS data WAIS Profiles Putting all of these statistics together for our test statistics gives us t = ( ) = Since t 47 (α =.025) = (or the p-value of t 47 = 4.15 is <.001), we reject the null hypothesis. The profiles are not coincident. At this point we STOP and should not go on to test whether profiles are level (flat). : Part 2 Slide 24 of 75

25 Profiles Analysis for g To deal with two or more groups, we follow the same logic: 1. Using assumptions (including results from earlier tests), determine what the hypothesis implies for population parameters. This has implications for data. Profiles Analysis forg 2 Groups Two or More: Parallel GLM: Parallel and WAIS Two or More: Two or More: Level Profiles Two-Way 2. Compute a statistic that reflects the implications for data. 3. Find a transformation of the statistic with a known sampling distribution. It is easier to do profile analysis using the GLM framework. We will use X = AB+ǫ, and test hypotheses of the form H o : LBM = 0 Appropriate definitions of L and M will lead to tests for parallelism, coincidence, and level means. : Part 2 Slide 25 of 75

26 Two or More: Parallel Parallel profiles imply H o1 : (µ 1i µ 1,i 1 ) = (µ 2i µ 2,i 1 ) = = (µ gi µ g,i 1 ) Profiles Analysis forg 2 Groups Two or More: Parallel GLM: Parallel and WAIS Two or More: Two or More: Level Profiles Two-Way for all i = 2,...,p. This will be a on differences. This hypothesis takes linear combinations of the means within groups and then compare the groups. For the linear transformations of the regression parameters (i.e., the τ or µ s), define M p (p 1) = For the comparisons of groups on these new variables, define L = C (g 1) g contrast matrix. : Part 2 Slide 26 of 75

27 Profiles Analysis forg 2 Groups Two or More: Parallel GLM: Parallel and WAIS Two or More: Two or More: Level Profiles Two-Way GLM: Parallel and WAIS For the WAIS example, H o1 is LBM = (0, 1, 1) = (0,1, 1) β 01 β 02 β 03 β 04 β 11 β 12 β 13 β 14 β 21 β 22 β 23 β (β 01 β 02 ) (β 02 β 03 ) (β 03 β 04 ) (β 11 β 12 ) (β 12 β 13 ) (β 13 β 14 ) (β 21 β 22 ) (β 22 β 23 ) (β 23 β 24 ) = [(β 11 β 12 ) (β 21 β 22 ),(β 12 β 13 ) (β 22 β 23 ), So we have ( (β 11 β 12 ) (β 21 β 22 ) (β 12 β 13 ) (β 22 β 23 ) (β 13 β 14 ) (β 23 β 24 )] = (0,0) ) = ( (µ 11 µ 12 ) (µ 21 µ 22 ) (µ 12 µ 13 ) (µ 22 µ 23 ) = 0 = 0 ) = ( 0 0 ) : Part 2 Slide 27 of 75

28 Two or More: Assuming parallel, then coincidence implies Profiles Analysis forg 2 Groups Two or More: Parallel GLM: Parallel and WAIS Two or More: Two or More: Level Profiles Two-Way H o2 : 1 µ 1 = 1 µ 2 = = 1 µ g So we re testing equivalence of a single variable over g groups: 1-way ANOVA. The linear transformation of the variables is given by For the groups, M p 1 = See SAS for WAIS example L = I : Part 2 Slide 28 of 75

29 Two or More: Level Profiles If profiles are coincidence, then flat or level profiles means H o3 : µ 11 = µ 12 = = µ 1p = µ 21 = = µ gp Profiles Analysis forg 2 Groups Two or More: Parallel GLM: Parallel and WAIS Two or More: Two or More: Level Profiles Two-Way That is, sums of two variables are the same for pairs of groups. The required linear transformation, M (p (p 1) = The required contrasts of the groups, L = C (g 1) g which is the same one used for testing parallel profiles. : Part 2 Slide 29 of 75

30 Mini Outline: 1. Review 2 way ANOVA 2. 2 way for balanced data 3. Example: Distributed versus /instruction 4. Unbalanced designs. 5. Multivariate GLM and further extensions (MANCOVA, longitudinal) Two-Way : Part 2 Slide 30 of 75

31 2 way ANOVA 2 Factors (qualitative variables). ANOVA Model for Two Factors Examples: Interaction & No Interaction Observation as Sum of Means Decomposition of Sums of Squares Sums of Squares Decomposition ANOVA Summary Table Hypothesis Tests Two-Way Notation: Factor B 1 k b 1 X 11r X 1kr X 1br n observations in.... each of the gb Factor A l X l1r X lkr X lbr combinations of levels.... of the factors. g X gkr X gkr X gbr Observation: X lkr = the r th observation at level l of Factor A and level k of Factor B. Factor A: l = 1,...,g Factor B: k = 1,...,b Replications: r = 1,...,n For now, a balanced design. : Part 2 Slide 31 of 75

32 ANOVA Model for Two Factors X lkr = µ }{{} overall level + τ l }{{} fixed effect factor A at level l + β k }{{} fixed effect factor B at level k + γ lk }{{} interaction between factors A &B at levels l &,k + ǫ lkr }{{} residual ANOVA Model for Two Factors Examples: Interaction & No Interaction Observation as Sum of Means Decomposition of Sums of Squares Sums of Squares Decomposition ANOVA Summary Table Hypothesis Tests where g b g b τ g = β k + γ lk = γ lk = 0 l=1 k=1 l=1 k=1 and ǫ lkr N(0,σ 2 ) and all independent. Model for the mean: Two-Way E(X lkr ) = µ lk = µ+τ l +β k +γ lk TThe effects are not additive; there is an interaction. : Part 2 Slide 32 of 75

33 Examples: Interaction & No Interaction ANOVA Model for Two Factors Examples: Interaction & No Interaction Observation as Sum of Means Decomposition of Sums of Squares Sums of Squares Decomposition ANOVA Summary Table Hypothesis Tests Cell Mean Factor B b 3 b 1 b 2 a 1 a 2 a 3 a 4 Factor A Factor B b 1 a 1 a 2 a 3 a 4 Factor A b 2 b 3 Two-Way : Part 2 Slide 33 of 75

34 Observation as Sum of Means ANOVA Model for Two Factors Examples: Interaction & No Interaction Observation as Sum of Means Decomposition of Sums of Squares Sums of Squares Decomposition ANOVA Summary Table Hypothesis Tests x lkr = }{{} x +( x l x) +( x k x) }{{}}{{} ˆµ ˆτ l ˆβ k where x = overall sample mean. +( x lk x l x k x) }{{} ˆγ lk x l = mean for l th level of Factor A (or row mean). x k = mean for k th level of Factor B (or column mean). x lk = mean for the l th level of Factor A and k th level of Factor B (or the cell mean). The decomposition of the observations above shows the estimates of the population effects. +(x lkr x lk ) }{{} ˆǫ lkr Two-Way : Part 2 Slide 34 of 75

35 Decomposition of Sums of Squares x lkr = x }{{} ˆµ +( x l x) }{{} ˆτ l +( x k x) }{{} ˆβ k +( x lk x l x k x) }{{} ˆγ lk +(x lkr x lk ) }{{} ˆǫ lkr ANOVA Model for Two Factors Examples: Interaction & No Interaction Observation as Sum of Means Decomposition of Sums of Squares Sums of Squares Decomposition ANOVA Summary Table Hypothesis Tests Two-Way To get the sums of squares: 1. Subtract x from both sides. 2. Square both sides. 3. Sum over observations within cells, levels of Factor A, and levels of Factor B. 4. After a little algebra, g b n (x lkr x) 2 = l=1 k=1 r=1 g n( x l x) 2 + l=1 + + g l=1 g l=1 b gn( x k x) 2 k=1 b n( x lk x l x k + x) 2 k=1 b k=1 r=1 n (x lkr x lk ) 2 : Part 2 Slide 35 of 75

36 Sums of Squares Decomposition We get an orthogonal decomposition of the sums of squares: SS total = SS A +SS B +SS AB +SS residual Note: SS total is corrected for the grand mean. ANOVA Model for Two Factors Examples: Interaction & No Interaction Observation as Sum of Means Decomposition of Sums of Squares Sums of Squares Decomposition ANOVA Summary Table Hypothesis Tests Also, we get decomposition of degrees of freedom, (gbn 1) }{{} total = (g 1) + (b 1) }{{}}{{} Factor A Factor B +(g 1)(b 1) }{{} Interaction +gb(n 1) }{{} Residual Two-Way : Part 2 Slide 36 of 75

37 ANOVA Summary Table ANOVA Model for Two Factors Examples: Interaction & No Interaction Observation as Sum of Means Decomposition of Sums of Squares Sums of Squares Decomposition ANOVA Summary Table Hypothesis Tests Source of Degrees SS variation of freedom Sums of Squared Factor A (g 1) SS A = g l=1 n( x l x) 2 Factor B (b 1) SS B = b k=1 gn( x k x) 2 Interaction (g 1)(b 1) SS AB = g b l=1 k=1 n( x lk x l x k + x) 2 Residual gb(n 1) SS res = g b n l=1 k=1 r=1 (x lkr x lk ) 2 (error) Total gbn 1 SS total = g b n l=1 k=1 r=1 (x lkr x) 2 Two-Way : Part 2 Slide 37 of 75

38 Hypothesis Tests Interaction: H o : γ 11 = γ 12 = = γ gb = 0. If the null is true and all assumptions valid, F = SS AB/((g 1)(b 1)) SS res /(gb(n 1)) F (g 1)(b 1),gb(n 1) ANOVA Model for Two Factors Examples: Interaction & No Interaction Observation as Sum of Means Decomposition of Sums of Squares Sums of Squares Decomposition ANOVA Summary Table Hypothesis Tests Two-Way Factor A: H o : τ 1 = τ 2 =... = τ g. If the null is true and all assumptions valid, F = SS A /(g 1) SS res /(gb(n 1)) F (g 1),gb(n 1) Factor B: H o : β 1 = β 2 = = β b. If the null is true and all assumptions valid, = SS B /(b 1) SS res /(gb(n 1)) F (b 1),gb(n 1) We should start with the Interaction before considering whether to test the main effects. : Part 2 Slide 38 of 75

39 Two-Way For p variables, X lkr is a p 1 vector of measurements on p variables. The model X lkr = µ+τ l +β k +γ lk +ǫ lkr and ǫ lkr N p (0,Σ) and independent. where l = 1,...,g Two-Way Two-Way Decomposition of Observation Decomposition of SSCP k = 1,...,b r = 1,...,n lk Decomposition of SSCP Hypothesis Testing: Interaction First Distribution of Wilk s Lambda Λ Hypothesis Testing: Interaction First Hypothesis Testing: Main Effects Following Rejection And for identification g τ l = l=1 b β k = k=1 g γ lk = l=1 b γ lk = 0 k=1 : Part 2 Slide 39 of 75

40 Decomposition of Observation We decompose the observation vector (x lkr x) into sums of various vectors, Two-Way Two-Way Decomposition of Observation Decomposition of SSCP Decomposition of SSCP Hypothesis Testing: Interaction First Distribution of Wilk s Lambda Λ Hypothesis Testing: Interaction First Hypothesis Testing: Main Effects Following Rejection (x lkr x) = ( x l x) +( x k x) }{{}}{{} ˆτ l ˆβ k +(x lkr x lk ) }{{} ˆǫ lkr +( x lk x l x k + x) }{{} ˆγ lk If we take the sums of squares and cross-products of this, we obtain the SSCP decomposition, ( x l x)( x l x) =... l k r } {{ } SSCP total which will take more space than is left on this slide : Part 2 Slide 40 of 75

41 Decomposition of SSCP ( x l x)( x l x) l k r } {{ } SSCP total Two-Way Two-Way Decomposition of Observation Decomposition of SSCP Decomposition of SSCP Hypothesis Testing: Interaction First Distribution of Wilk s Lambda Λ Hypothesis Testing: Interaction First Hypothesis Testing: Main Effects Following Rejection = bn( x l x)( x l x) l }{{} SSCP FactorA + gn( x k x)( x k x) k }{{} SSCP FactorB + n( x lk x l x k + x)( x lk x l x k + x) l k }{{} SSCP interaction + (x lkr x lk )(x lkr x lk ) l k r }{{} SSCP residual : Part 2 Slide 41 of 75

42 Decomposition of SSCP So, SSCP total = SSCP FactorA +SSCP FactorB +SSCP Interaction +SSCP residual which is orthogonal so long as the design is balanced (i.e., n lk = n) or proportional... what to do with unbalanced is discussed a bit later. And Two-Way Two-Way Decomposition of Observation Decomposition of SSCP Decomposition of SSCP Hypothesis Testing: Interaction First Distribution of Wilk s Lambda Λ Hypothesis Testing: Interaction First Hypothesis Testing: Main Effects Following Rejection df total = df FactorA +df FactorB +df Interaction +df residual (gbn 1) = (g 1) + (b 1) + (g 1)(b 1)+gb(n 1) : Part 2 Slide 42 of 75

43 Hypothesis Testing: Interaction First H o : γ 11 = γ 12 = = γ gp Test Statistic is Wilk s Lambda Λ = det(sscp residual ) det(sscp residual +SSCP interaction ) = det(e) det(e +H) Two-Way Two-Way Decomposition of Observation Decomposition of SSCP Decomposition of SSCP Hypothesis Testing: Interaction First Distribution of Wilk s Lambda Λ Hypothesis Testing: Interaction First Hypothesis Testing: Main Effects Following Rejection ν h = ν interaction = (g 1)(n 1) ν e = ν residual +gb(n 1) Distribution of Λ are given on the following slide for various cases. If you don t have p 2 or ν h 2, but you have large n then the approximate sampling distribution of ( ν e p+1 ν ) h ln(λ ) is χ 2 pν 2 h : Part 2 Slide 43 of 75

44 Distribution of Wilk s Lambda Λ Wilk s Λ = SSCP e SSCP e +SSCP h Number df for variables Hypothesis Sampling distribution for multivariate data Two-Way Two-Way Decomposition of Observation Decomposition of SSCP Decomposition of SSCP Hypothesis Testing: Interaction First Distribution of Wilk s Lambda Λ Hypothesis Testing: Interaction First Hypothesis Testing: Main Effects Following Rejection p = 1 ν h 1 p = 2 ν h 1 p 1 ν h = 1 p 2 ν h = 2 ( νe ( ν h )( 1 Λ Λ ) F νh,ν e )( ν e 1 1 ) Λ ν h Λ F 2νh,2(ν e 1) ( )( ) νe +ν h p 1 Λ p Λ F p,(νe +ν h p) ( νe +ν h p 1 p )( 1 Λ Λ ) F 2p,2(νe +ν h p 1) where ν h = degrees of freedom for hypothesis, and ν e = degrees of freedom for error (residual). : Part 2 Slide 44 of 75

45 Hypothesis Testing: Interaction First You should test the interaction first. If H o for the interaction is rejected, then you don t so tests for main effects because the Factor main effects do not have a clear interpretation. J&W recommend doing p-univariate 2-way ANOVAs to find out where the interactions exist. (i.e., are there interactions for just some of the p variables and not others?). Two-Way Two-Way Decomposition of Observation Decomposition of SSCP Decomposition of SSCP Hypothesis Testing: Interaction First Distribution of Wilk s Lambda Λ Hypothesis Testing: Interaction First Hypothesis Testing: Main Effects Following Rejection For variables with no interaction, you can interpret main effects. Note: In some cases you can go on to test main effects even though the interaction is significant (we ll do this in our example). In general, if you retain the null hypothesis for interaction, we continue and test main effects... : Part 2 Slide 45 of 75

46 Hypothesis Testing: Main Effects For Factor A: H o : τ 1 = τ 2 = = τ g = 0 versus H a : at least one τ l 0 Two-Way Two-Way Decomposition of Observation Decomposition of SSCP Decomposition of SSCP Hypothesis Testing: Interaction First Distribution of Wilk s Lambda Λ Hypothesis Testing: Interaction First Hypothesis Testing: Main Effects Following Rejection Λ A = det(sscp residual ) det(sscp residual +SSCP FactorA ) = E E +H A and ν h = g 1. For p 2 or ν A 2 (i.e., g 3), use the exact distribution given in the table. For Factor B: H o : β 1 = β 2 = = β b = 0 versus H a : at least one β k 0 Λ B = det(sscp residual ) det(sscp residual +SSCP FactorB ) = E E +H B and ν h = b 1. For p 2 or ν B 2 (i.e., v 3), use the exact distribution given in the table. For Large n, you can use ( ν e p+1 ν ) h ln(λ 2 h) χ 2 pν h : Part 2 Slide 46 of 75

47 Following Rejection Two-Way Two-Way Decomposition of Observation Decomposition of SSCP Decomposition of SSCP Hypothesis Testing: Interaction First Distribution of Wilk s Lambda Λ Hypothesis Testing: Interaction First Hypothesis Testing: Main Effects Following Rejection If a null hypothesis is rejected, you should perform additional analyses to figure out where the effects are. These could include Multivariate T 2 ; that is, generalized squared distances D 2 (X) lk l k = ( X lk X l k ) S 1 pool ( X lk X l k ) Simultaneous confidence statements/intervals. When interactions are not significant, concentrate on contrasts of levels of Factor A and of Factor B (if they re significant). These are the same as those described in 1-way notes. For example, Use Bonferroni if planned. (τ li τ mi ) or (µ li µ mi ) 2-way ANOVAs with multiple comparisons. Discriminant Analysis. : Part 2 Slide 47 of 75

48 1 Way : Data from Tatsuoka (1988), Multivariate Analysis: Techniques for Educational and Psychological Research, pp An experiment was conducted for comparing 2 methods (A & B) of teaching shorthand computer programing to 60 female seniors in a vocational high school (a dated example). Also of interest were the effects of distributed versus massed practice Two-Way The Design and Data Cell Means &95% Confidence Ellipses Factor Means &95% Confidence Ellipses Test for Interaction Test for Method Main Effect Test for Condition Main Effect Interpretation & Conclusion C 1 : C 2 : C 3 : 2 hours of instruction/day for 6 weeks 3 hours of instruction/day for 4 weeks 4 hours of instruction/day for 3 weeks So each subject received a total of 12 hours of instruction. Note: n l = 10 per cell of the design Two variables (dependent measures): X 1 = speed X 2 = accuracy : Part 2 Slide 48 of 75

49 The Design and Data Various Means: C 1 C 2 C 3 Two-Way The Design and Data Cell Means &95% Confidence Ellipses Factor Means &95% Confidence Ellipses Test for Interaction Test for Method Main Effect Test for Condition Main Effect Interpretation & Conclusion X 1 X 2 X 1 X 2 X 1 X 2 X 1 X 2 A B Notes: Cell means (black) based on n lk = 10. Means for method (green) are based on n l = 30 Means for condition (blue) are based on n k = 20. Total sample means (red) are based on n = 60. : Part 2 Slide 49 of 75

50 Cell Means & 95% Confidence Ellipses Two-Way The Design and Data Cell Means &95% Confidence Ellipses Factor Means &95% Confidence Ellipses Test for Interaction Test for Method Main Effect Test for Condition Main Effect Interpretation & Conclusion X2 (Accuracy) x AC3 x AC X 1 (Speed) x BC3 x AC1 x BC2 x BC1 : Part 2 Slide 50 of 75

51 Factor Means & 95% Confidence Ellipses Two-Way The Design and Data Cell Means &95% Confidence Ellipses Factor Means &95% Confidence Ellipses Test for Interaction Test for Method Main Effect Test for Condition Main Effect Interpretation & Conclusion X2 (Accuracy) x A X 1 (Speed) x B x C3 x C2 x C1 : Part 2 Slide 51 of 75

52 Test for Interaction H o : γ AC1 = γ AC2 = γ AC3 = γ BC1 = γ BC2 = γ BC3 = 0 H a : not all γ lk = 0 W = E = ( ) H AB = ( ) Two-Way The Design and Data Cell Means &95% Confidence Ellipses Factor Means &95% Confidence Ellipses Test for Interaction Test for Method Main Effect Test for Condition Main Effect Interpretation & Conclusion and Λ = F = det(e) det(e +H AB ) = =.6361 ( 53 2 ) ( ) = 6.73 Comparing F = 6.73 to the F 4,106 distribution, F has p <.001. : Part 2 Slide 52 of 75

53 Test for Method Main Effect... For purposes of illustration... H o : τ A = τ B = 0 versus H a : not all γ l = 0 W = E = ( ) H Method = ( ) Two-Way The Design and Data Cell Means &95% Confidence Ellipses Factor Means &95% Confidence Ellipses Test for Interaction Test for Method Main Effect Test for Condition Main Effect Interpretation & Conclusion and Λ = det(e) det(e +H Method ) = =.9077 F = ( 53 1 ) ( ) = 2.63 Comparing F = 2.63 to the F 2,106 distribution, F has p =.08. : Part 2 Slide 53 of 75

54 Test for Condition Main Effect H o : β C1 = β C2 = β C3 = 0 versus H a : not all β k = 0 W = E = ( ) H Condition = ( ) Two-Way Λ = det(e) det(e +H Condition ) = =.1341 The Design and Data Cell Means &95% Confidence Ellipses Factor Means &95% Confidence Ellipses Test for Interaction Test for Method Main Effect Test for Condition Main Effect Interpretation & Conclusion and F = ( 53 2 ) ( ) = Comparing F = to the F 4,106 distribution, F has p <.001. : Part 2 Slide 54 of 75

55 Interpretation & Conclusion Two-Way The effectiveness of the teaching methods (A and B) depends on which of the three conditions of distributed practice that a student used. (i.e., There is a significant interaction). The method by condition cell means indicate that method B is more effective in terms of both speed and accuracy under conditions C 1 and C 2, but A is more effective in terms of speed and accuracy under condition C 3. The Design and Data Cell Means &95% Confidence Ellipses Factor Means &95% Confidence Ellipses Test for Interaction Test for Method Main Effect Test for Condition Main Effect Interpretation & Conclusion Statistical tests of these conclusions should be performed in follow up analyses. Potentially useful supplement analyses include ANOVAs for each of the dependent variables, simultaneous confidence intervals for various treatment effects, and discriminant analysis. : Part 2 Slide 55 of 75

56 The assumptions are Multivariate normality. Equality of Σ s. Independence between and within groups. Two-Way Shorthand Example: Scatter Plot Shorthand Example: QQ Plot 1 Shorthand Example: QQ Plot 2 Equality of Covariance Matrices Testing Equality of Covariance Matrices Example: Box s Test of Equality ofσ lk Example of Box s Test continued Box s Test the easier Checks on multivariate normality: Scatter plots of residuals. QQ plots of residuals. Scatter & QQ plots of principal components. Tests of univariate normality for each residuals. Tests of multivariate normality (see text). method Checking Independence : Part 2 Slide 56 of 75 Assumption

57 Shorthand Example: Scatter Plot Two-Way Shorthand Example: Scatter Plot Shorthand Example: QQ Plot 1 Shorthand Example: QQ Plot 2 Equality of Covariance Matrices Testing Equality of Covariance Matrices Example: Box s Test of Equality ofσ lk Example of Box s Test continued Box s Test the easier method Checking Independence : Part 2 Slide 57 of 75 Assumption

58 Shorthand Example: QQ Plot 1 Two-Way Shorthand Example: Scatter Plot Shorthand Example: QQ Plot 1 Shorthand Example: QQ Plot 2 Equality of Covariance Matrices Testing Equality of Covariance Matrices Example: Box s Test of Equality ofσ lk Example of Box s Test continued Box s Test the easier method Checking Independence : Part 2 Slide 58 of 75 Assumption

59 Shorthand Example: QQ Plot 2 Two-Way Shorthand Example: Scatter Plot Shorthand Example: QQ Plot 1 Shorthand Example: QQ Plot 2 Equality of Covariance Matrices Testing Equality of Covariance Matrices Example: Box s Test of Equality ofσ lk Example of Box s Test continued Box s Test the easier method Checking Independence : Part 2 Slide 59 of 75 Assumption

60 Equality of Covariance Matrices When there are equal n lk s (balanced), violating equality of Σ lk s probably doesn t hurt too much. Two-Way When there are unequal n lk s and det(σ lk s differ substantially, then will tend to make more errors. Where you make more Type I or Type II errors depends on how different the n lk s and Σ lk s are from each other. Test of Equality of Σ s this gets into ideas of tests and inference for covariance matrices, which leads into topics on canonical correlation and discriminate analysis. Shorthand Example: Scatter Plot Shorthand Example: QQ Plot 1 Shorthand Example: QQ Plot 2 Equality of Covariance Matrices Testing Equality of Covariance Matrices Example: Box s Test of Equality ofσ lk Example of Box s Test continued Box s Test the easier Assume: X lj N p (µ l,σ l ) for l = 1,...,g and j = 1,...,n l and independent over the g groups and with groups. The null hypothesis: H o : Σ 1 = Σ 2 = = Σ g Σ method Checking Independence : Part 2 Slide 60 of 75 Assumption

61 Testing Equality of Covariance Matrices If H o : Σ 1 = Σ 2 = = Σ g Σ is true, then ( ) g 1 S pool = S = g l=1 ν ν l ln( S i ) l where ν l = (n i 1) = df(s i ). When H o is true, then S pool is an unbiased estimate of Σ. l=1 Two-Way Shorthand Example: Scatter Plot Shorthand Example: QQ Plot 1 Shorthand Example: QQ Plot 2 Equality of Covariance Matrices Testing Equality of Covariance Matrices Example: Box s Test of Equality ofσ lk Example of Box s Test continued Box s Test the easier Box s test (1949ish): ( g ) M = ν l l=1 Test statistic is [ 1 2p2 +3p 1 6(p+1)(g 1) ln( S }{{} S pool ) ( g ( ) 1 l=1 g ν l ln( S l ) method Checking Independence : Part 2 Slide 61 of 75 Assumption ν l l=1 )] 1 g l=1 ν l } {{ } Correction factor i.e., asymptotically M χ 2 (g 1)p(p+1)/2

62 Example: Box s Test of Equality of Σ lk H o : Σ A,C1 = Σ A,C2 = Σ A,C3 = Σ B,C1 = Σ B,C2 = Σ B,C3 versus H a : not all Σ lk are equal. The method by condition sample covariance matrices: ( ) ( S A,C1 = S B,C1 = ) Two-Way Shorthand Example: Scatter Plot Shorthand Example: QQ Plot 1 Shorthand Example: QQ Plot 2 Equality of Covariance Matrices Testing Equality of Covariance Matrices Example: Box s Test of Equality ofσ lk Example of Box s Test continued Box s Test the easier S A,C2 = S A,C3 = ( ( Pooled Covariance Matrix: S pool = method Checking Independence : Part 2 Slide 62 of 75 Assumption ( ) ) S B,C2 = S B,C3 = ( ( ) ) )

63 Example of Box s Test continued M = ( l,k (n lk 1))ln S pool l,k (n lk 1) S lk Two-Way Shorthand Example: Scatter Plot Shorthand Example: QQ Plot 1 Shorthand Example: QQ Plot 2 Equality of Covariance Matrices Testing Equality of Covariance Matrices Example: Box s Test of Equality ofσ lk Example of Box s Test continued Box s Test the easier = 6(10 1)ln(69.14) (10 1)(ln( ln(268.82)+ln(20.30) + ln(85.07) + ln(12.73) + ln(13.04)) = = Since n lk = 10 for all K = (2)(3) = 6 combinations of method and condition (i.e., equal cell sizes), the correction factor simplifies to 1 (2p2 +3p 1)(g +1) 6(p+1)gn = 1 (2(2)2 +3(2) 1)(6+1) 6(2 + 1)10(6) =.9157 Test statistic =.9158(23.41) = with df = (K 1)p(p+1)/2 = (6 1)2(2+1)/2 = 15, which using the χ 2 distribution has p-value =.123. Retain null hypothesis; that is, the equal covariance matrix assumption is reasonable. method Checking Independence : Part 2 Slide 63 of 75 Assumption

64 Box s Test the easier method data shorthand; input speed accuracy method $ conditin $; if conditin= C1 and method= A then group=1; else if conditin= C1 and method= B then group=2; else if conditin= C2 and method= A then group=3; else if conditin= C2 and method= B then group=4; else if conditin= C3 and method= A then group=5; else if conditin= C3 and method= B then group=6; Two-Way Shorthand Example: Scatter Plot Shorthand Example: QQ Plot 1 Shorthand Example: QQ Plot 2 Equality of Covariance Matrices Testing Equality of Covariance Matrices Example: Box s Test of Equality ofσ lk Example of Box s Test continued Box s Test the easier proc discrim simple pool=test Wcov Pcov list; class group; var speed accuracy; run;... and the output... method Checking Independence : Part 2 Slide 64 of 75 Assumption

65 Notation: K P N = Number of Groups = Number of Variables = Total Number of Observations - Number of Groups N(i) = Number of Observations in the i th Group - 1 N(i)/2 Within SS Matrix(i) V = N/2 Pooled SS Matrix P + 3P - 1 RHO = SUM _ N(i) N _ 6(P+1)(K-1) DF =.5(K-1)P(P+1) PN/2 N V Under the null hypothesis: -2 RHO ln PN(i)/2 _ N(i) _ is distributed approximately as Chi-Square(DF). Chi-Square DF Pr > ChiSq Since the Chi-Square value is not significant at the 0.1 level, a pooled covariance matrix will be used in the discriminant function. Reference: Morrison, D.F. (1976) Multivariate Statistical Methods p252 Note: There is a more recent edition of Morrison (I think). 64-1

66 Checking Independence Assumption Two-Way is generally relatively robust to violations of multivariate normality and equal covariance matrices; however, it is not robust to violations of Independence of observations within groups. Independence of observations between groups. Violation of independence generally increases the Type I error rate (i.e., it s higher than what you think it is, and possibly much larger). You can check for possible violation by computing the intra-class correlation for each of the variables using Shorthand Example: Scatter Plot Shorthand Example: QQ Plot 1 Shorthand Example: QQ Plot 2 Equality of Covariance Matrices Testing Equality of Covariance Matrices Example: Box s Test of Equality ofσ lk Example of Box s Test continued Box s Test the easier r intra = MS between MS error MS between +(n 1)MS error You can also consider the data collection procedure/method, because this is where dependence can slip in. It s the independence within that is most likely to be violated. In some cases this can be dealt with. method Checking Independence : Part 2 Slide 65 of 75 Assumption

67 Similar to 1-way ANOVA. With balanced designs (or proportional) Two-Way SSCP total = SSCP A +SSCP B +SSCP AB +SSCP residual or SSCP model = SSCP A +SSCP B +SSCP AB and interpretation is straight forward. With unbalanced designs, the partitioning of the is not longer unique. It depends on Order in which SSCP are extracted Type I SSCP: Sequential SSCP Example Type I SSCP: Sequential SSCP Type I: Sequential SSCP continued Type II or Partial SSCP Example: Type II or Partial SSCP Type III & Type IV SSCP Type IV and When to Use Which : Part 2 Slide 66 of 75 The model. Various sub-models of model as specified by the order in which various SSCP are extracted.... These notes are based on Khattree & Naik.

68 Order in which SSCP are extracted Suppose X lkr = µ+τ l +β k +γ lk +ǫ lkr where l = 1,...,g, k = 1,...,b, and r = 1,...,n lk. We could partition total SSCP as SSCP total corrected = SSCP A µ +SSCP B µ,a +SSCP AB µ,a,b or Two-Way SSCP total corrected = SSCP B µ +SSCP A µ,b +SSCP AB µ,a,b Order in which SSCP are extracted Type I SSCP: Sequential SSCP Example Type I SSCP: Sequential SSCP Type I: Sequential SSCP continued Type II or Partial SSCP Example: Type II or Partial SSCP Type III & Type IV SSCP Type IV and When to Use Which : Part 2 Slide 67 of 75 When the design is unbalanced, SSCP A µ SSCP A µ,b There are 4 different ways of computing SSCP s, which can lead to different results and different interpretation.

69 Type I SSCP: Sequential SSCP The partitioning of the model SSCP into component SSCP due to each variable or effect (including interactions) as it s added sequentially to the model as given in the model statement in proc glm in SAS. e.g., Two-Way Order in which SSCP are extracted Type I SSCP: Sequential SSCP Example Type I SSCP: Sequential SSCP Type I: Sequential SSCP continued Type II or Partial SSCP Example: Type II or Partial SSCP Type III & Type IV SSCP Type IV and When to Use Which : Part 2 Slide 68 of 75 model x1 x2 = A B C A B A C B C A B C That is all main effects, 2 way interactions and a 3-way interaction. The SSCP for A C = SSCP(A C µ,a,b,c,a B); that is, SSCP is adjusted for all previously entered terms, model x1 x2 = A B C A B }{{} already entered A C B C A B C

70 Example Type I SSCP: Sequential SSCP Two-Way To demonstrate this, I deleted some observations from the shorthand data set to make it a bit unbalanced. The sample sizes are now Practice Condition Method C1 C2 C3 Total A B Total Below are the hypothesis SSCP matrices for two different orders: Order in which SSCP are extracted Type I SSCP: Sequential SSCP Example Type I SSCP: Sequential SSCP Type I: Sequential SSCP continued Type II or Partial SSCP Example: Type II or Partial SSCP Type III & Type IV SSCP Type IV and When to Use Which : Part 2 Slide 69 of 75 Order: M C C M C M C M H m = ( ) H m = ( )

71 Type I: Sequential SSCP continued Two-Way The sum of H m +H c will be equivalent regardless of whether method order condition was entered first. With Type I SSCP, the sum of SSCP matrices of all the effects in the model will sum to the total for the model, regardless of the order in which they are entered into the model. The total SSCP is completely partitioned. Order in which SSCP are extracted Type I SSCP: Sequential SSCP Example Type I SSCP: Sequential SSCP Type I: Sequential SSCP continued Type II or Partial SSCP Example: Type II or Partial SSCP Type III & Type IV SSCP Type IV and When to Use Which : Part 2 Slide 70 of 75

72 Type II or Partial SSCP With Type II, an SSCP matrix equals the increase in model SSCP for a particular variable (effect). e.g., Two-Way Order in which SSCP are extracted Type I SSCP: Sequential SSCP Example Type I SSCP: Sequential SSCP Type I: Sequential SSCP continued Type II or Partial SSCP Example: Type II or Partial SSCP Type III & Type IV SSCP Type IV and When to Use Which : Part 2 Slide 71 of 75 model x1 x2 = A B C A B A C B C A B C That is all main effects, 2 way interactions and a 3-way interaction. Consider the SSCP for A C. The SSCP for A C is the increase in model SSCP by adding A C to the model that has A, B, A C and B C in it. The three-way interaction A B C contains A C in it, so the SSCP for A C is in the model. The effects A B C and A C are correlated. With Type II, the sum of the SSCP s do not equal the total model SSCP.

73 Two-Way Order in which SSCP are extracted Type I SSCP: Sequential SSCP Example Type I SSCP: Sequential SSCP Type I: Sequential SSCP continued Type II or Partial SSCP Example: Type II or Partial SSCP Type III & Type IV SSCP Type IV and When to Use Which : Part 2 Slide 72 of 75 Example: Type II or Partial SSCP Below are the SSCP matrices for Type I and Type II and the totals: Type I: Sequential H m H c H cm Sum M C C M C M C M Type II: Partial

74 Two-Way Order in which SSCP are extracted Type I SSCP: Sequential SSCP Example Type I SSCP: Sequential SSCP Type I: Sequential SSCP continued Type II or Partial SSCP Example: Type II or Partial SSCP Type III & Type IV SSCP Type IV and When to Use Which : Part 2 Slide 73 of 75 Type IV and When to Use Which Type IV are designed for situations where there are empty cells. For balanced designs, the 4 ways to compute SSCP yield the same results. For unbalanced designs, the 4 ways can lead to different conclusions. Type III is appropriate when interest is in comparing the effects of experimental variables. For model building purposes (finding a good model for prediction), Types I and/or II might be preferable. If there are design parameters, make sure that tests do not involve these. The choice should be made on a case-by-case basis with thought given to what it is you want to test.

75 Any design that you can form for univariate case can be formed for the multivariate situation. Two-Way This can be done by creating an appropriate design matrix in the general linear model context. Models can include fixed (as we have done), but also random effects (as in the univariate case). It is possible to reformulated as a linear mixed model, and hence provides a natural way to handle missing observations (be sure there are systematic reasons for missingness).... see Goldstein. : Part 2 Slide 74 of 75

Multivariate Analysis of Variance (MANOVA)

Multivariate Analysis of Variance (MANOVA) Chapter 415 Multivariate Analysis of Variance (MANOVA) Introduction Multivariate analysis of variance (MANOVA) is an extension of common analysis of variance (ANOVA). In ANOVA, differences among various

More information

Profile analysis is the multivariate equivalent of repeated measures or mixed ANOVA. Profile analysis is most commonly used in two cases:

Profile analysis is the multivariate equivalent of repeated measures or mixed ANOVA. Profile analysis is most commonly used in two cases: Profile Analysis Introduction Profile analysis is the multivariate equivalent of repeated measures or mixed ANOVA. Profile analysis is most commonly used in two cases: ) Comparing the same dependent variables

More information

Multivariate Normal Distribution

Multivariate Normal Distribution Multivariate Normal Distribution Lecture 4 July 21, 2011 Advanced Multivariate Statistical Methods ICPSR Summer Session #2 Lecture #4-7/21/2011 Slide 1 of 41 Last Time Matrices and vectors Eigenvalues

More information

Multivariate Analysis of Variance. The general purpose of multivariate analysis of variance (MANOVA) is to determine

Multivariate Analysis of Variance. The general purpose of multivariate analysis of variance (MANOVA) is to determine 2 - Manova 4.3.05 25 Multivariate Analysis of Variance What Multivariate Analysis of Variance is The general purpose of multivariate analysis of variance (MANOVA) is to determine whether multiple levels

More information

DEPARTMENT OF PSYCHOLOGY UNIVERSITY OF LANCASTER MSC IN PSYCHOLOGICAL RESEARCH METHODS ANALYSING AND INTERPRETING DATA 2 PART 1 WEEK 9

DEPARTMENT OF PSYCHOLOGY UNIVERSITY OF LANCASTER MSC IN PSYCHOLOGICAL RESEARCH METHODS ANALYSING AND INTERPRETING DATA 2 PART 1 WEEK 9 DEPARTMENT OF PSYCHOLOGY UNIVERSITY OF LANCASTER MSC IN PSYCHOLOGICAL RESEARCH METHODS ANALYSING AND INTERPRETING DATA 2 PART 1 WEEK 9 Analysis of covariance and multiple regression So far in this course,

More information

Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1)

Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1) Spring 204 Class 9: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.) Big Picture: More than Two Samples In Chapter 7: We looked at quantitative variables and compared the

More information

Multivariate Analysis of Variance (MANOVA): I. Theory

Multivariate Analysis of Variance (MANOVA): I. Theory Gregory Carey, 1998 MANOVA: I - 1 Multivariate Analysis of Variance (MANOVA): I. Theory Introduction The purpose of a t test is to assess the likelihood that the means for two groups are sampled from the

More information

Section 13, Part 1 ANOVA. Analysis Of Variance

Section 13, Part 1 ANOVA. Analysis Of Variance Section 13, Part 1 ANOVA Analysis Of Variance Course Overview So far in this course we ve covered: Descriptive statistics Summary statistics Tables and Graphs Probability Probability Rules Probability

More information

Chapter 5 Analysis of variance SPSS Analysis of variance

Chapter 5 Analysis of variance SPSS Analysis of variance Chapter 5 Analysis of variance SPSS Analysis of variance Data file used: gss.sav How to get there: Analyze Compare Means One-way ANOVA To test the null hypothesis that several population means are equal,

More information

An analysis method for a quantitative outcome and two categorical explanatory variables.

An analysis method for a quantitative outcome and two categorical explanatory variables. Chapter 11 Two-Way ANOVA An analysis method for a quantitative outcome and two categorical explanatory variables. If an experiment has a quantitative outcome and two categorical explanatory variables that

More information

Introduction to Analysis of Variance (ANOVA) Limitations of the t-test

Introduction to Analysis of Variance (ANOVA) Limitations of the t-test Introduction to Analysis of Variance (ANOVA) The Structural Model, The Summary Table, and the One- Way ANOVA Limitations of the t-test Although the t-test is commonly used, it has limitations Can only

More information

A Primer on Mathematical Statistics and Univariate Distributions; The Normal Distribution; The GLM with the Normal Distribution

A Primer on Mathematical Statistics and Univariate Distributions; The Normal Distribution; The GLM with the Normal Distribution A Primer on Mathematical Statistics and Univariate Distributions; The Normal Distribution; The GLM with the Normal Distribution PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 4: September

More information

Multivariate Analysis of Variance (MANOVA)

Multivariate Analysis of Variance (MANOVA) Multivariate Analysis of Variance (MANOVA) Aaron French, Marcelo Macedo, John Poulsen, Tyler Waterson and Angela Yu Keywords: MANCOVA, special cases, assumptions, further reading, computations Introduction

More information

MISSING DATA TECHNIQUES WITH SAS. IDRE Statistical Consulting Group

MISSING DATA TECHNIQUES WITH SAS. IDRE Statistical Consulting Group MISSING DATA TECHNIQUES WITH SAS IDRE Statistical Consulting Group ROAD MAP FOR TODAY To discuss: 1. Commonly used techniques for handling missing data, focusing on multiple imputation 2. Issues that could

More information

Outline. Topic 4 - Analysis of Variance Approach to Regression. Partitioning Sums of Squares. Total Sum of Squares. Partitioning sums of squares

Outline. Topic 4 - Analysis of Variance Approach to Regression. Partitioning Sums of Squares. Total Sum of Squares. Partitioning sums of squares Topic 4 - Analysis of Variance Approach to Regression Outline Partitioning sums of squares Degrees of freedom Expected mean squares General linear test - Fall 2013 R 2 and the coefficient of correlation

More information

SIMPLE LINEAR CORRELATION. r can range from -1 to 1, and is independent of units of measurement. Correlation can be done on two dependent variables.

SIMPLE LINEAR CORRELATION. r can range from -1 to 1, and is independent of units of measurement. Correlation can be done on two dependent variables. SIMPLE LINEAR CORRELATION Simple linear correlation is a measure of the degree to which two variables vary together, or a measure of the intensity of the association between two variables. Correlation

More information

One-Way Analysis of Variance

One-Way Analysis of Variance One-Way Analysis of Variance Note: Much of the math here is tedious but straightforward. We ll skim over it in class but you should be sure to ask questions if you don t understand it. I. Overview A. We

More information

SAS Software to Fit the Generalized Linear Model

SAS Software to Fit the Generalized Linear Model SAS Software to Fit the Generalized Linear Model Gordon Johnston, SAS Institute Inc., Cary, NC Abstract In recent years, the class of generalized linear models has gained popularity as a statistical modeling

More information

How To Understand Multivariate Models

How To Understand Multivariate Models Neil H. Timm Applied Multivariate Analysis With 42 Figures Springer Contents Preface Acknowledgments List of Tables List of Figures vii ix xix xxiii 1 Introduction 1 1.1 Overview 1 1.2 Multivariate Models

More information

Data Analysis Tools. Tools for Summarizing Data

Data Analysis Tools. Tools for Summarizing Data Data Analysis Tools This section of the notes is meant to introduce you to many of the tools that are provided by Excel under the Tools/Data Analysis menu item. If your computer does not have that tool

More information

Notes on Applied Linear Regression

Notes on Applied Linear Regression Notes on Applied Linear Regression Jamie DeCoster Department of Social Psychology Free University Amsterdam Van der Boechorststraat 1 1081 BT Amsterdam The Netherlands phone: +31 (0)20 444-8935 email:

More information

ANOVA. February 12, 2015

ANOVA. February 12, 2015 ANOVA February 12, 2015 1 ANOVA models Last time, we discussed the use of categorical variables in multivariate regression. Often, these are encoded as indicator columns in the design matrix. In [1]: %%R

More information

Chapter 13 Introduction to Linear Regression and Correlation Analysis

Chapter 13 Introduction to Linear Regression and Correlation Analysis Chapter 3 Student Lecture Notes 3- Chapter 3 Introduction to Linear Regression and Correlation Analsis Fall 2006 Fundamentals of Business Statistics Chapter Goals To understand the methods for displaing

More information

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96 1 Final Review 2 Review 2.1 CI 1-propZint Scenario 1 A TV manufacturer claims in its warranty brochure that in the past not more than 10 percent of its TV sets needed any repair during the first two years

More information

Analysis of Variance. MINITAB User s Guide 2 3-1

Analysis of Variance. MINITAB User s Guide 2 3-1 3 Analysis of Variance Analysis of Variance Overview, 3-2 One-Way Analysis of Variance, 3-5 Two-Way Analysis of Variance, 3-11 Analysis of Means, 3-13 Overview of Balanced ANOVA and GLM, 3-18 Balanced

More information

Regression Analysis: A Complete Example

Regression Analysis: A Complete Example Regression Analysis: A Complete Example This section works out an example that includes all the topics we have discussed so far in this chapter. A complete example of regression analysis. PhotoDisc, Inc./Getty

More information

Multiple Linear Regression

Multiple Linear Regression Multiple Linear Regression A regression with two or more explanatory variables is called a multiple regression. Rather than modeling the mean response as a straight line, as in simple regression, it is

More information

Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm

Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm Mgt 540 Research Methods Data Analysis 1 Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm http://web.utk.edu/~dap/random/order/start.htm

More information

Elementary Statistics Sample Exam #3

Elementary Statistics Sample Exam #3 Elementary Statistics Sample Exam #3 Instructions. No books or telephones. Only the supplied calculators are allowed. The exam is worth 100 points. 1. A chi square goodness of fit test is considered to

More information

UNDERSTANDING THE TWO-WAY ANOVA

UNDERSTANDING THE TWO-WAY ANOVA UNDERSTANDING THE e have seen how the one-way ANOVA can be used to compare two or more sample means in studies involving a single independent variable. This can be extended to two independent variables

More information

Simple Linear Regression Inference

Simple Linear Regression Inference Simple Linear Regression Inference 1 Inference requirements The Normality assumption of the stochastic term e is needed for inference even if it is not a OLS requirement. Therefore we have: Interpretation

More information

Statistiek II. John Nerbonne. October 1, 2010. Dept of Information Science j.nerbonne@rug.nl

Statistiek II. John Nerbonne. October 1, 2010. Dept of Information Science j.nerbonne@rug.nl Dept of Information Science j.nerbonne@rug.nl October 1, 2010 Course outline 1 One-way ANOVA. 2 Factorial ANOVA. 3 Repeated measures ANOVA. 4 Correlation and regression. 5 Multiple regression. 6 Logistic

More information

NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( )

NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( ) Chapter 340 Principal Components Regression Introduction is a technique for analyzing multiple regression data that suffer from multicollinearity. When multicollinearity occurs, least squares estimates

More information

Multivariate Statistical Inference and Applications

Multivariate Statistical Inference and Applications Multivariate Statistical Inference and Applications ALVIN C. RENCHER Department of Statistics Brigham Young University A Wiley-Interscience Publication JOHN WILEY & SONS, INC. New York Chichester Weinheim

More information

Part 2: Analysis of Relationship Between Two Variables

Part 2: Analysis of Relationship Between Two Variables Part 2: Analysis of Relationship Between Two Variables Linear Regression Linear correlation Significance Tests Multiple regression Linear Regression Y = a X + b Dependent Variable Independent Variable

More information

Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means

Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means Lesson : Comparison of Population Means Part c: Comparison of Two- Means Welcome to lesson c. This third lesson of lesson will discuss hypothesis testing for two independent means. Steps in Hypothesis

More information

GLM I An Introduction to Generalized Linear Models

GLM I An Introduction to Generalized Linear Models GLM I An Introduction to Generalized Linear Models CAS Ratemaking and Product Management Seminar March 2009 Presented by: Tanya D. Havlicek, Actuarial Assistant 0 ANTITRUST Notice The Casualty Actuarial

More information

Analysing Questionnaires using Minitab (for SPSS queries contact -) Graham.Currell@uwe.ac.uk

Analysing Questionnaires using Minitab (for SPSS queries contact -) Graham.Currell@uwe.ac.uk Analysing Questionnaires using Minitab (for SPSS queries contact -) Graham.Currell@uwe.ac.uk Structure As a starting point it is useful to consider a basic questionnaire as containing three main sections:

More information

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression Objectives: To perform a hypothesis test concerning the slope of a least squares line To recognize that testing for a

More information

Statistics Review PSY379

Statistics Review PSY379 Statistics Review PSY379 Basic concepts Measurement scales Populations vs. samples Continuous vs. discrete variable Independent vs. dependent variable Descriptive vs. inferential stats Common analyses

More information

One-Way Analysis of Variance (ANOVA) Example Problem

One-Way Analysis of Variance (ANOVA) Example Problem One-Way Analysis of Variance (ANOVA) Example Problem Introduction Analysis of Variance (ANOVA) is a hypothesis-testing technique used to test the equality of two or more population (or treatment) means

More information

Two-sample hypothesis testing, II 9.07 3/16/2004

Two-sample hypothesis testing, II 9.07 3/16/2004 Two-sample hypothesis testing, II 9.07 3/16/004 Small sample tests for the difference between two independent means For two-sample tests of the difference in mean, things get a little confusing, here,

More information

Module 5: Multiple Regression Analysis

Module 5: Multiple Regression Analysis Using Statistical Data Using to Make Statistical Decisions: Data Multiple to Make Regression Decisions Analysis Page 1 Module 5: Multiple Regression Analysis Tom Ilvento, University of Delaware, College

More information

Chapter 4 and 5 solutions

Chapter 4 and 5 solutions Chapter 4 and 5 solutions 4.4. Three different washing solutions are being compared to study their effectiveness in retarding bacteria growth in five gallon milk containers. The analysis is done in a laboratory,

More information

INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA)

INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA) INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA) As with other parametric statistics, we begin the one-way ANOVA with a test of the underlying assumptions. Our first assumption is the assumption of

More information

Introduction to Longitudinal Data Analysis

Introduction to Longitudinal Data Analysis Introduction to Longitudinal Data Analysis Longitudinal Data Analysis Workshop Section 1 University of Georgia: Institute for Interdisciplinary Research in Education and Human Development Section 1: Introduction

More information

MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS

MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS MSR = Mean Regression Sum of Squares MSE = Mean Squared Error RSS = Regression Sum of Squares SSE = Sum of Squared Errors/Residuals α = Level of Significance

More information

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING In this lab you will explore the concept of a confidence interval and hypothesis testing through a simulation problem in engineering setting.

More information

1 Simple Linear Regression I Least Squares Estimation

1 Simple Linear Regression I Least Squares Estimation Simple Linear Regression I Least Squares Estimation Textbook Sections: 8. 8.3 Previously, we have worked with a random variable x that comes from a population that is normally distributed with mean µ and

More information

Introduction to General and Generalized Linear Models

Introduction to General and Generalized Linear Models Introduction to General and Generalized Linear Models General Linear Models - part I Henrik Madsen Poul Thyregod Informatics and Mathematical Modelling Technical University of Denmark DK-2800 Kgs. Lyngby

More information

CHAPTER 13. Experimental Design and Analysis of Variance

CHAPTER 13. Experimental Design and Analysis of Variance CHAPTER 13 Experimental Design and Analysis of Variance CONTENTS STATISTICS IN PRACTICE: BURKE MARKETING SERVICES, INC. 13.1 AN INTRODUCTION TO EXPERIMENTAL DESIGN AND ANALYSIS OF VARIANCE Data Collection

More information

10. Analysis of Longitudinal Studies Repeat-measures analysis

10. Analysis of Longitudinal Studies Repeat-measures analysis Research Methods II 99 10. Analysis of Longitudinal Studies Repeat-measures analysis This chapter builds on the concepts and methods described in Chapters 7 and 8 of Mother and Child Health: Research methods.

More information

Chapter 7 Section 7.1: Inference for the Mean of a Population

Chapter 7 Section 7.1: Inference for the Mean of a Population Chapter 7 Section 7.1: Inference for the Mean of a Population Now let s look at a similar situation Take an SRS of size n Normal Population : N(, ). Both and are unknown parameters. Unlike what we used

More information

Service courses for graduate students in degree programs other than the MS or PhD programs in Biostatistics.

Service courses for graduate students in degree programs other than the MS or PhD programs in Biostatistics. Course Catalog In order to be assured that all prerequisites are met, students must acquire a permission number from the education coordinator prior to enrolling in any Biostatistics course. Courses are

More information

Recall this chart that showed how most of our course would be organized:

Recall this chart that showed how most of our course would be organized: Chapter 4 One-Way ANOVA Recall this chart that showed how most of our course would be organized: Explanatory Variable(s) Response Variable Methods Categorical Categorical Contingency Tables Categorical

More information

Chapter 7: Simple linear regression Learning Objectives

Chapter 7: Simple linear regression Learning Objectives Chapter 7: Simple linear regression Learning Objectives Reading: Section 7.1 of OpenIntro Statistics Video: Correlation vs. causation, YouTube (2:19) Video: Intro to Linear Regression, YouTube (5:18) -

More information

Part 3. Comparing Groups. Chapter 7 Comparing Paired Groups 189. Chapter 8 Comparing Two Independent Groups 217

Part 3. Comparing Groups. Chapter 7 Comparing Paired Groups 189. Chapter 8 Comparing Two Independent Groups 217 Part 3 Comparing Groups Chapter 7 Comparing Paired Groups 189 Chapter 8 Comparing Two Independent Groups 217 Chapter 9 Comparing More Than Two Groups 257 188 Elementary Statistics Using SAS Chapter 7 Comparing

More information

t Tests in Excel The Excel Statistical Master By Mark Harmon Copyright 2011 Mark Harmon

t Tests in Excel The Excel Statistical Master By Mark Harmon Copyright 2011 Mark Harmon t-tests in Excel By Mark Harmon Copyright 2011 Mark Harmon No part of this publication may be reproduced or distributed without the express permission of the author. mark@excelmasterseries.com www.excelmasterseries.com

More information

CS 147: Computer Systems Performance Analysis

CS 147: Computer Systems Performance Analysis CS 147: Computer Systems Performance Analysis One-Factor Experiments CS 147: Computer Systems Performance Analysis One-Factor Experiments 1 / 42 Overview Introduction Overview Overview Introduction Finding

More information

Projects Involving Statistics (& SPSS)

Projects Involving Statistics (& SPSS) Projects Involving Statistics (& SPSS) Academic Skills Advice Starting a project which involves using statistics can feel confusing as there seems to be many different things you can do (charts, graphs,

More information

Simple Regression Theory II 2010 Samuel L. Baker

Simple Regression Theory II 2010 Samuel L. Baker SIMPLE REGRESSION THEORY II 1 Simple Regression Theory II 2010 Samuel L. Baker Assessing how good the regression equation is likely to be Assignment 1A gets into drawing inferences about how close the

More information

Association Between Variables

Association Between Variables Contents 11 Association Between Variables 767 11.1 Introduction............................ 767 11.1.1 Measure of Association................. 768 11.1.2 Chapter Summary.................... 769 11.2 Chi

More information

STATISTICA Formula Guide: Logistic Regression. Table of Contents

STATISTICA Formula Guide: Logistic Regression. Table of Contents : Table of Contents... 1 Overview of Model... 1 Dispersion... 2 Parameterization... 3 Sigma-Restricted Model... 3 Overparameterized Model... 4 Reference Coding... 4 Model Summary (Summary Tab)... 5 Summary

More information

Factor Analysis. Principal components factor analysis. Use of extracted factors in multivariate dependency models

Factor Analysis. Principal components factor analysis. Use of extracted factors in multivariate dependency models Factor Analysis Principal components factor analysis Use of extracted factors in multivariate dependency models 2 KEY CONCEPTS ***** Factor Analysis Interdependency technique Assumptions of factor analysis

More information

DISCRIMINANT FUNCTION ANALYSIS (DA)

DISCRIMINANT FUNCTION ANALYSIS (DA) DISCRIMINANT FUNCTION ANALYSIS (DA) John Poulsen and Aaron French Key words: assumptions, further reading, computations, standardized coefficents, structure matrix, tests of signficance Introduction Discriminant

More information

Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model

Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model 1 September 004 A. Introduction and assumptions The classical normal linear regression model can be written

More information

Regression step-by-step using Microsoft Excel

Regression step-by-step using Microsoft Excel Step 1: Regression step-by-step using Microsoft Excel Notes prepared by Pamela Peterson Drake, James Madison University Type the data into the spreadsheet The example used throughout this How to is a regression

More information

Introduction to Regression and Data Analysis

Introduction to Regression and Data Analysis Statlab Workshop Introduction to Regression and Data Analysis with Dan Campbell and Sherlock Campbell October 28, 2008 I. The basics A. Types of variables Your variables may take several forms, and it

More information

Chapter 6: Multivariate Cointegration Analysis

Chapter 6: Multivariate Cointegration Analysis Chapter 6: Multivariate Cointegration Analysis 1 Contents: Lehrstuhl für Department Empirische of Wirtschaftsforschung Empirical Research and und Econometrics Ökonometrie VI. Multivariate Cointegration

More information

Linear Models in STATA and ANOVA

Linear Models in STATA and ANOVA Session 4 Linear Models in STATA and ANOVA Page Strengths of Linear Relationships 4-2 A Note on Non-Linear Relationships 4-4 Multiple Linear Regression 4-5 Removal of Variables 4-8 Independent Samples

More information

ECON 142 SKETCH OF SOLUTIONS FOR APPLIED EXERCISE #2

ECON 142 SKETCH OF SOLUTIONS FOR APPLIED EXERCISE #2 University of California, Berkeley Prof. Ken Chay Department of Economics Fall Semester, 005 ECON 14 SKETCH OF SOLUTIONS FOR APPLIED EXERCISE # Question 1: a. Below are the scatter plots of hourly wages

More information

POLYNOMIAL AND MULTIPLE REGRESSION. Polynomial regression used to fit nonlinear (e.g. curvilinear) data into a least squares linear regression model.

POLYNOMIAL AND MULTIPLE REGRESSION. Polynomial regression used to fit nonlinear (e.g. curvilinear) data into a least squares linear regression model. Polynomial Regression POLYNOMIAL AND MULTIPLE REGRESSION Polynomial regression used to fit nonlinear (e.g. curvilinear) data into a least squares linear regression model. It is a form of linear regression

More information

How To Run Statistical Tests in Excel

How To Run Statistical Tests in Excel How To Run Statistical Tests in Excel Microsoft Excel is your best tool for storing and manipulating data, calculating basic descriptive statistics such as means and standard deviations, and conducting

More information

Correlation and Simple Linear Regression

Correlation and Simple Linear Regression Correlation and Simple Linear Regression We are often interested in studying the relationship among variables to determine whether they are associated with one another. When we think that changes in a

More information

" Y. Notation and Equations for Regression Lecture 11/4. Notation:

 Y. Notation and Equations for Regression Lecture 11/4. Notation: Notation: Notation and Equations for Regression Lecture 11/4 m: The number of predictor variables in a regression Xi: One of multiple predictor variables. The subscript i represents any number from 1 through

More information

KSTAT MINI-MANUAL. Decision Sciences 434 Kellogg Graduate School of Management

KSTAT MINI-MANUAL. Decision Sciences 434 Kellogg Graduate School of Management KSTAT MINI-MANUAL Decision Sciences 434 Kellogg Graduate School of Management Kstat is a set of macros added to Excel and it will enable you to do the statistics required for this course very easily. To

More information

Applied Statistics. J. Blanchet and J. Wadsworth. Institute of Mathematics, Analysis, and Applications EPF Lausanne

Applied Statistics. J. Blanchet and J. Wadsworth. Institute of Mathematics, Analysis, and Applications EPF Lausanne Applied Statistics J. Blanchet and J. Wadsworth Institute of Mathematics, Analysis, and Applications EPF Lausanne An MSc Course for Applied Mathematicians, Fall 2012 Outline 1 Model Comparison 2 Model

More information

1/27/2013. PSY 512: Advanced Statistics for Psychological and Behavioral Research 2

1/27/2013. PSY 512: Advanced Statistics for Psychological and Behavioral Research 2 PSY 512: Advanced Statistics for Psychological and Behavioral Research 2 Introduce moderated multiple regression Continuous predictor continuous predictor Continuous predictor categorical predictor Understand

More information

Multivariate normal distribution and testing for means (see MKB Ch 3)

Multivariate normal distribution and testing for means (see MKB Ch 3) Multivariate normal distribution and testing for means (see MKB Ch 3) Where are we going? 2 One-sample t-test (univariate).................................................. 3 Two-sample t-test (univariate).................................................

More information

One-Way Analysis of Variance: A Guide to Testing Differences Between Multiple Groups

One-Way Analysis of Variance: A Guide to Testing Differences Between Multiple Groups One-Way Analysis of Variance: A Guide to Testing Differences Between Multiple Groups In analysis of variance, the main research question is whether the sample means are from different populations. The

More information

One-Way ANOVA using SPSS 11.0. SPSS ANOVA procedures found in the Compare Means analyses. Specifically, we demonstrate

One-Way ANOVA using SPSS 11.0. SPSS ANOVA procedures found in the Compare Means analyses. Specifically, we demonstrate 1 One-Way ANOVA using SPSS 11.0 This section covers steps for testing the difference between three or more group means using the SPSS ANOVA procedures found in the Compare Means analyses. Specifically,

More information

Univariate Regression

Univariate Regression Univariate Regression Correlation and Regression The regression line summarizes the linear relationship between 2 variables Correlation coefficient, r, measures strength of relationship: the closer r is

More information

Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics.

Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics. Business Course Text Bowerman, Bruce L., Richard T. O'Connell, J. B. Orris, and Dawn C. Porter. Essentials of Business, 2nd edition, McGraw-Hill/Irwin, 2008, ISBN: 978-0-07-331988-9. Required Computing

More information

business statistics using Excel OXFORD UNIVERSITY PRESS Glyn Davis & Branko Pecar

business statistics using Excel OXFORD UNIVERSITY PRESS Glyn Davis & Branko Pecar business statistics using Excel Glyn Davis & Branko Pecar OXFORD UNIVERSITY PRESS Detailed contents Introduction to Microsoft Excel 2003 Overview Learning Objectives 1.1 Introduction to Microsoft Excel

More information

How to calculate an ANOVA table

How to calculate an ANOVA table How to calculate an ANOVA table Calculations by Hand We look at the following example: Let us say we measure the height of some plants under the effect of different fertilizers. Treatment Measures Mean

More information

SPSS Tests for Versions 9 to 13

SPSS Tests for Versions 9 to 13 SPSS Tests for Versions 9 to 13 Chapter 2 Descriptive Statistic (including median) Choose Analyze Descriptive statistics Frequencies... Click on variable(s) then press to move to into Variable(s): list

More information

Study Guide for the Final Exam

Study Guide for the Final Exam Study Guide for the Final Exam When studying, remember that the computational portion of the exam will only involve new material (covered after the second midterm), that material from Exam 1 will make

More information

Least Squares Estimation

Least Squares Estimation Least Squares Estimation SARA A VAN DE GEER Volume 2, pp 1041 1045 in Encyclopedia of Statistics in Behavioral Science ISBN-13: 978-0-470-86080-9 ISBN-10: 0-470-86080-4 Editors Brian S Everitt & David

More information

1 Theory: The General Linear Model

1 Theory: The General Linear Model QMIN GLM Theory - 1.1 1 Theory: The General Linear Model 1.1 Introduction Before digital computers, statistics textbooks spoke of three procedures regression, the analysis of variance (ANOVA), and the

More information

Applied Regression Analysis and Other Multivariable Methods

Applied Regression Analysis and Other Multivariable Methods THIRD EDITION Applied Regression Analysis and Other Multivariable Methods David G. Kleinbaum Emory University Lawrence L. Kupper University of North Carolina, Chapel Hill Keith E. Muller University of

More information

Multiple regression - Matrices

Multiple regression - Matrices Multiple regression - Matrices This handout will present various matrices which are substantively interesting and/or provide useful means of summarizing the data for analytical purposes. As we will see,

More information

Chapter 7. One-way ANOVA

Chapter 7. One-way ANOVA Chapter 7 One-way ANOVA One-way ANOVA examines equality of population means for a quantitative outcome and a single categorical explanatory variable with any number of levels. The t-test of Chapter 6 looks

More information

Testing Group Differences using T-tests, ANOVA, and Nonparametric Measures

Testing Group Differences using T-tests, ANOVA, and Nonparametric Measures Testing Group Differences using T-tests, ANOVA, and Nonparametric Measures Jamie DeCoster Department of Psychology University of Alabama 348 Gordon Palmer Hall Box 870348 Tuscaloosa, AL 35487-0348 Phone:

More information

Descriptive Statistics

Descriptive Statistics Descriptive Statistics Primer Descriptive statistics Central tendency Variation Relative position Relationships Calculating descriptive statistics Descriptive Statistics Purpose to describe or summarize

More information

12: Analysis of Variance. Introduction

12: Analysis of Variance. Introduction 1: Analysis of Variance Introduction EDA Hypothesis Test Introduction In Chapter 8 and again in Chapter 11 we compared means from two independent groups. In this chapter we extend the procedure to consider

More information

Final Exam Practice Problem Answers

Final Exam Practice Problem Answers Final Exam Practice Problem Answers The following data set consists of data gathered from 77 popular breakfast cereals. The variables in the data set are as follows: Brand: The brand name of the cereal

More information

Example: Credit card default, we may be more interested in predicting the probabilty of a default than classifying individuals as default or not.

Example: Credit card default, we may be more interested in predicting the probabilty of a default than classifying individuals as default or not. Statistical Learning: Chapter 4 Classification 4.1 Introduction Supervised learning with a categorical (Qualitative) response Notation: - Feature vector X, - qualitative response Y, taking values in C

More information

SPSS Guide: Regression Analysis

SPSS Guide: Regression Analysis SPSS Guide: Regression Analysis I put this together to give you a step-by-step guide for replicating what we did in the computer lab. It should help you run the tests we covered. The best way to get familiar

More information

Nonlinear Regression:

Nonlinear Regression: Zurich University of Applied Sciences School of Engineering IDP Institute of Data Analysis and Process Design Nonlinear Regression: A Powerful Tool With Considerable Complexity Half-Day : Improved Inference

More information