Stat 101. Unit 11: Analysis of Variance. Textbook Chapter 12

Size: px
Start display at page:

Download "Stat 101. Unit 11: Analysis of Variance. Textbook Chapter 12"

Transcription

1 Stat 101 Unit 11: Analysis of Variance Textbook Chapter 1

2 Unit 11 Outline Analysis of Variance (ANOVA) General format and ANOVA s F-test Link to binary regression Contrast testing Multiple comparisons (& the Bonferroni correction) Two-way ANOVA (and Multi-way ANOVA)

3 Recall the Jumping Rats Example As always, first visualize the data: Groups 1 No jumping 30 cm jump 3 60 cm jump Means & SD s We d like to do a t-test, but there s no formula for 3 groups Solution: Analysis of Variance (ANOVA) 3

4 Analysis of Variance (ANOVA) ANOVA extends hypotheses that we have seen already With one population: H 0 : μ = μ 0 With two populations: H 0 : μ 1 = μ Now consider inferences for g = 3 or more populations (technically, g > ): H 0 : μ 1 = μ =... = μ g Analysis technique for this situation is ANOVA 4

5 Main concept of ANOVA With I populations (groups), there are two types of variability in the data: (A) Variation of individual values around their group means ( groups (variability within (B) Variation of group means around the overall mean ( groups (variability between Main concept: If (A) is small relative to (B), this implies the group means are different. ANOVA determines whether variability in data is mainly from variation within groups or variation between groups. That is, does labelling the data into these groups lead to more than chance differences in the group means Think Boxplots split by groups 5 5

6 Within group variability relatively large, adds noise, obscures group differences Within group variability relatively small, group differences not obscured by noise 6 6

7 One-way ANOVA the model The one-way ANOVA model for a quantitative variable X is x ij = μ i + ε ij for i = 1,,..., g groups (subpopulations)for a specific factor and j = 1,,..., n i individuals sampled in each group ( σ The ε ij are assumed to be ~ N(0, Model parameters are μ 1, μ,..., μ g and σ (one common SD!) Model can be written as (just like regression!) DATA = MODEL + RESIDUAL Note: we used the symbol x here to represent the measured variable, but we could have chosen y instead (to mimic regression s response variable) 7 7

8 Model for one-way ANOVA with g = 3 groups ( σ Residuals ε ij are assumed to be ~ N(0, So only difference among the groups are differing means 8 8

9 One-way ANOVA the data SRS s from each of the I populations (groups): Sample size n 1 n n g Observations x 1j x j x gj for j = 1,,, n i Sample means x1 x xg Sample SD s s 1 s s g Practical rule for examining SD s in ANOVA (checking assumption of a common σ.( If largest s is less than twice the smallest s, the ANOVA results will be approximately correct. Check to see if the ratio holds: s s largest smallest 9 9

10 Variance within groups Since we assumed equal σ s for all g groups, all sample SD s should be estimating the common σ Thus we combine them in a pooled estimate (same idea as for ( the -sample pooled t-test which we skipped Pooled estimate of σ (the variance within groups) is s W ( n 1 1) s SSError dferror 1 ( n MSE 1) s n The subscript W refers to this within groups estimate g... ( n 1) s Also known as the mean square within groups (MSW) and mean square error (MSE) g g g i1 ( n i n 1) s g i

11 Variance between groups If the null hypothesis, H 0 : μ 1 = μ =... = μ g, is true then examining individual sample means is as if we are sampling g times from the same population, with mean μ and standard deviation σ Recall from the sampling distribution of sample means: s so under H 0, σ can be estimated by B n 1 ( x1 x) n( x x)... ng ( xg x) i1 g 1 SSGroups dfgroups MSG The subscript B (on s B ) refers to this between groups estimate Also known as the mean square between groups (MSB or MSG) or mean square of the model (MSM) Only a valid estimate of σ if H 0 : true, otherwise inflated x g ~ n N, i ( x g i 1 n x)

12 Concept behind the test If H 0 : μ 1 = μ =... = μ g is true then s W and s B both estimate σ and should be of similar magnitude If H 0 : is not true, the between groups estimate of σ will, in general, be larger than the within groups estimate of σ Therefore a test of H 0 : μ 1 = μ =... = μ g can be based on a comparison (ratio) of the between groups and within groups estimates of σ Examine this ratio of variance estimates as an F test F s s B W MS between groups MS within groups MSGroups MSError 1 1

13 One-way ANOVA: F-test To test H 0 : μ 1 = μ =... = μ g H A : not all μ i are equal, use the F statistic F MSG MSE SSG SSE / / df df G E This test statistic has an F distribution with g 1 and N g degrees of freedom when H 0 is true Reject H 0 : for large values of F; if F > F* α, g 1, N - g or the corresponding p-value is less than α

14 Analysis of variance (ANOVA) As for simple and multiple linear regression, for one-way ANOVA the total sum of squares is decomposed by: SST = SSG + SSE The degrees of freedom are now partitioned DFT = DFG + DFE (n - 1) = (g - 1) + (n - g) The mean squares (MS) are formed the same way MS SS df Sum of Squares degrees of freedom 14 14

15 One-way ANOVA table Source SS DF MS F Groups (Between) s B SSG g 1 = SSG/DFG F = MSG/MSE Error s W (Within) SSE n g = SSE/DFE Total SST n 1 SST/DFT The F statistic tests if there is a difference among any of the g population means MSE is still our estimate of σ : the variance of the residuals or the average variance of the observations from their group means (sometime called the pooled variance estimate). 15

16 Solution: H 0 : μ 1 = μ = μ 3 H A : at least one μ i is different g SSG ni i1 10( ) 10( ) 10( ) g SSE ( ni i1 ( x x) MSG i 1) s i MSE SSG / df ( G SSE / df /(3 1) ) ( E ) ( F = / = F* = Since our F > F*, we reject H 0. The rat s have differing mean bone densities in the different treatment groups /( ) )

17 ANOVA Results from SPSS In SPSS: Analyze Compare Means One-Way ANOVA 17 17

18 Unit 11 Outline Analysis of Variance (ANOVA) General format and ANOVA s F-test Link to binary regression Contrast testing Multiple comparisons (& the Bonferroni correction) Two-way ANOVA (and Multi-way ANOVA) 18 18

19 The link between ANOVA and Regression ANOVA treats the groups as `nominal variables, i.e., variables whose coding gives the groups a name but no numerical meaning A naive application of regression here might use the codes no jump = 1, low jump =, high jump = 3 in a single predictor, but this is mathematically incorrect! A correct way to apply regression here is to create (g 1) binary variables (variables coded (0 or 1), sometimes called dummy variables) that recreate the groups: 19 19

20 A few thoughts about ANOVA ANOVA is easy to calculate However, regression is more general because additional continuous variables can be added ANOVA generalizes to several categorical predictor variables/factors (Ch 13), in two-way and higher ANOVA. Note: the term ANOVA is thrown around a lot. It can refer to the ANOVA setting/procedure where you are looking at a quantitative variable across multiple groups (or treatments). It can also refer to the ANOVA table (that is calculated under both regression and ANOVA settings). 0 0

21 Unit 11 Outline Analysis of Variance (ANOVA) General format and ANOVA s F-test Link to binary regression Contrast testing Multiple comparisons (& the Bonferroni correction) Two-way ANOVA (and Multi-way ANOVA) 1 1

22 Steps in a Complete ANOVA Procedure 1. Examine the data, checking assumptions. If possible, formulate in advance some working (alternative) hypotheses about how the population group means might differ. 3. Check the evidence against the global null hypothesis of no differences among the groups by calculating the F-test 4. If the F-test is significant (that is, if it leads to a rejection of the null hypothesis of equal population group means): Test the individual hypotheses specified at the second step If there was not enough information to formulate working hypotheses, test all pairwise comparisons of means, adjusting for multiple comparisons (example also coming)

23 Contrasts After the omnibus F test has shown overall significance investigate other comparisons of groups using contrasts can be performed A contrast is a linear combination of μ i s i a i ( i ) where a 0 The corresponding contrast of sample means is (ψ is the Greek psi ). In this rats example, what might be an interesting comparison of the 3 groups involved (no jump, low jump, high jump)? For example, consider the bone density study. To compare control (group 1) versus treatment (groups and 3) use. ψ = μ 1 (μ + μ 3 )/ = (1)μ 1 +(- 1 / )μ + (- 1 / )μ 3 To compare levels of jumping (group versus group 3) use ψ = μ μ 3 = (0)μ 1 +(1)μ + ( 1)μ c a i x ) ( i

24 Contrasts testing hypotheses To test the hypothesis H versus 0 : 0 H A : 0 we use a t-test much like the pooled t-test: t s p a x i i a n i i This t-test has a t distribution with degrees of freedom associated with s p under H 0 (test can be 1-sided or -sided) 4 4

25 Example bone density Analysis of variance for bone density s p Reject H 0 : μ 1 = μ = μ 3 if our calculated F > F* 0.05,df=,7 = 3.35 Omnibus test: P = 0.00, reject H 0 and conclude there is a difference in bone density among the three groups. Where does that difference lie? Let s look at the comparisons (via contrasts): (Group 1) versus (Groups and 3) (Group ) versus (Group 3) 5 5

26 Example bone density Does jumping (any level) affect bone density? Set-up hypotheses: H H 0 A : : Control Control Calculate t-test t s p a i ( x 0.5( i a n i i lowjump lowjump highjump highjump Calculate the p-value: df = N I = 30 3 = 7. p-value = *P(t<-.93) < (0.005) = So reject H 0 and conclude any kind of jumping improves bone density over no jumping at all ) ) (1)601.1 ( 0.5)61.5 ( 0.5) ( 0.5) 10 ( 0.5)

27 Example bone density Does the level of jumping affect bone density? Look at group versus group 3 using the contrast H H 0 A : (0) : (0) Control Control (1) (1) lowjump lowjump ( 1) ( 1) highjump highjump 0 0 t s p a i i x i a n i (0)601.1 (1)61.5 ( 1) (1) 10 ( 1) 10 So reject H 0 (since our p-value < 0.0 < α) and conclude higher jumping affects bone density differently than moderate jumping (in fact, it makes bones denser). 7 7

28 Unit 11 Outline Analysis of Variance (ANOVA) General format and ANOVA s F-test Link to binary regression Contrast testing Multiple comparisons (& the Bonferroni correction) Two-way ANOVA (and Multi-way ANOVA) 8 8

29 The multiple comparisons problem To test H 0 : μ 1 = μ =... = μ g, why not simply conduct multiple twosample t-tests? For example, with g = 9 there are 36 possible pair-wise t-tests. What is the probability of rejecting a true H 0 at least once? P(Type I error) = 1 ( ) = 0.84 This inflated Type I error is due to multiple comparisons: we have looked at multiple tests at once (each with α = 0.05), and thus it will lead to significant results that are not truly there (simply by chance). In general, we would like the probability of a type I error to be some fixed value α (e.g., α = 0.05) This is accomplished using the overall ANOVA F-test What happens when we start using multiple contrasts? If we don t have any contrasts pre-specified, we can just look at all the pairwise two-sample t-tests, but adjust α. 9 9

30 The Bonferroni correction A solution to the multiple comparisons problem is the adjustment of α levels using the Bonferroni correction This correction is a conservative solution Suppose we wish to perform all possible pairs of comparisons among g groups: There are g g!!( g )! g( g 1) such comparisons The Bonferroni correction - to protect the overall level of α we must perform each individual test at level α * * g 30 30

31 Example - Bonferroni correction Suppose we wish to perform pair-wise comparisons among 3 groups but still maintain an overall α = 0.05 If g = 3, there are 3 3!!(1)! possible comparisons: (Group 1 versus ), (1 versus 3), and ( versus 3) The Bonferroni correction says that if we want an overall α level of < 0.05, then we do each of the 3 tests at the α* = 0.05 / 3 = level Thus, with each test at level α* = , this Bonferroni correction gives an overall α <

32 Bonferroni Correction in SPSS Then the pairwise comparisons can be made when you run the one-way ANOVA. Just click on the Post Hoc button, and select Bonferroni : * x1 x tdf 7sex ( 13.4, 36.04) x 11.4 Note: these confidence intervals are wider than those from the regression (or if you did them separately based on the t-stats for comparing two means). Here they use (gotten from an online calculator): * t /, df

33 Using Multiple Comparisons to your Advantage (bad use of statistics!!!) 33

34 Unit 11 Outline Analysis of Variance (ANOVA) General format and ANOVA s F-test Link to binary regression Contrast testing Multiple comparisons (& the Bonferroni correction) Two-way ANOVA (and Multi-way ANOVA) 34 34

35 Extension to Two-way ANOVA (and beyond!) We can expand ANOVA to include an additional factor (a second grouping variable) call it two-way ANOVA For example, we could predict text messaging based on class year (with g = 4 groups) and sex (with g = groups), and even a third variable like house (g = 1?) or a fourth like concentration (with g =???). Algebraically, things get a little more complicated, but we won t focus on that in this course Having multiple grouping variables (just like predictor variables) also allows us to include interaction terms as well Easy to perform in SPSS!!! Analyze General Linear Model Univariate 35 35

36 Two-way ANOVA in SPSS The example we will use is trying to predict the ln_text messages by Harvard College students based on their class year and sex. In SPSS: Analyze General Linear Model Univariate Click on the Model button, select Custom, and drag the grouping variables over into the model. Click Continue then OK

37 Two-way ANOVA in SPSS (with interaction) Here are the results in SPSS for this sample if n = 169 students What should be the 3 different sets of degrees of freedoms here? The Corrected Model row tests for the model overall, and the gender and classyear rows tests for each variable separately. Note: you can ignore the Intercept and Total rows, and always compare to the Corrected Total row. SS s will not add up

38 Two-way ANOVA in SPSS (with interaction) We can also consider the interaction between the two grouping variables here (sex and class year) What would the interaction term represent? The effect of class year on # text messages sent may be different for men than women Equivalent to saying that the effect of women on # text messages sent may be different in the 4 class years Maybe men s average decreases over the 4 years, but women s average stays the same What should be the degrees of freedom for the interaction between these two variables? (g 1-1)*(g -1) = (4-1)*(-1) =

39 Two-way ANOVA in SPSS (with interaction) Here are the results in SPSS for this sample if we include the interaction between the grouping variables as well (which is sometimes called the full factorial ANOVA model this is the default in SPSS)

40 ANOVA: Main Points Simple idea behind ANOVA: significant difference among or more groups if variability between groups (differences among the group means) is significantly larger than variability within groups (differences from mean within each group) [F-test]. If there is evidence of difference among groups, then an a priori hypothesis can be tested via a contrast t-test, or some care must be taken in searching for the group pairs that are significantly different (using Bonferroni correction). ANOVA can be expanded to include multiple grouping variables (Multi-way ANOVA). Algebra is hard, but SPSS does the work for us. Interactions can then be considered

Statistiek II. John Nerbonne. October 1, 2010. Dept of Information Science j.nerbonne@rug.nl

Statistiek II. John Nerbonne. October 1, 2010. Dept of Information Science j.nerbonne@rug.nl Dept of Information Science j.nerbonne@rug.nl October 1, 2010 Course outline 1 One-way ANOVA. 2 Factorial ANOVA. 3 Repeated measures ANOVA. 4 Correlation and regression. 5 Multiple regression. 6 Logistic

More information

Chapter 5 Analysis of variance SPSS Analysis of variance

Chapter 5 Analysis of variance SPSS Analysis of variance Chapter 5 Analysis of variance SPSS Analysis of variance Data file used: gss.sav How to get there: Analyze Compare Means One-way ANOVA To test the null hypothesis that several population means are equal,

More information

12: Analysis of Variance. Introduction

12: Analysis of Variance. Introduction 1: Analysis of Variance Introduction EDA Hypothesis Test Introduction In Chapter 8 and again in Chapter 11 we compared means from two independent groups. In this chapter we extend the procedure to consider

More information

1.5 Oneway Analysis of Variance

1.5 Oneway Analysis of Variance Statistics: Rosie Cornish. 200. 1.5 Oneway Analysis of Variance 1 Introduction Oneway analysis of variance (ANOVA) is used to compare several means. This method is often used in scientific or medical experiments

More information

Section 13, Part 1 ANOVA. Analysis Of Variance

Section 13, Part 1 ANOVA. Analysis Of Variance Section 13, Part 1 ANOVA Analysis Of Variance Course Overview So far in this course we ve covered: Descriptive statistics Summary statistics Tables and Graphs Probability Probability Rules Probability

More information

Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1)

Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1) Spring 204 Class 9: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.) Big Picture: More than Two Samples In Chapter 7: We looked at quantitative variables and compared the

More information

One-Way Analysis of Variance: A Guide to Testing Differences Between Multiple Groups

One-Way Analysis of Variance: A Guide to Testing Differences Between Multiple Groups One-Way Analysis of Variance: A Guide to Testing Differences Between Multiple Groups In analysis of variance, the main research question is whether the sample means are from different populations. The

More information

1 Basic ANOVA concepts

1 Basic ANOVA concepts Math 143 ANOVA 1 Analysis of Variance (ANOVA) Recall, when we wanted to compare two population means, we used the 2-sample t procedures. Now let s expand this to compare k 3 population means. As with the

More information

INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA)

INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA) INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA) As with other parametric statistics, we begin the one-way ANOVA with a test of the underlying assumptions. Our first assumption is the assumption of

More information

UNDERSTANDING THE TWO-WAY ANOVA

UNDERSTANDING THE TWO-WAY ANOVA UNDERSTANDING THE e have seen how the one-way ANOVA can be used to compare two or more sample means in studies involving a single independent variable. This can be extended to two independent variables

More information

Introduction to Analysis of Variance (ANOVA) Limitations of the t-test

Introduction to Analysis of Variance (ANOVA) Limitations of the t-test Introduction to Analysis of Variance (ANOVA) The Structural Model, The Summary Table, and the One- Way ANOVA Limitations of the t-test Although the t-test is commonly used, it has limitations Can only

More information

One-Way Analysis of Variance (ANOVA) Example Problem

One-Way Analysis of Variance (ANOVA) Example Problem One-Way Analysis of Variance (ANOVA) Example Problem Introduction Analysis of Variance (ANOVA) is a hypothesis-testing technique used to test the equality of two or more population (or treatment) means

More information

Recall this chart that showed how most of our course would be organized:

Recall this chart that showed how most of our course would be organized: Chapter 4 One-Way ANOVA Recall this chart that showed how most of our course would be organized: Explanatory Variable(s) Response Variable Methods Categorical Categorical Contingency Tables Categorical

More information

Association Between Variables

Association Between Variables Contents 11 Association Between Variables 767 11.1 Introduction............................ 767 11.1.1 Measure of Association................. 768 11.1.2 Chapter Summary.................... 769 11.2 Chi

More information

HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION

HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION HOD 2990 10 November 2010 Lecture Background This is a lightning speed summary of introductory statistical methods for senior undergraduate

More information

CHAPTER 14 NONPARAMETRIC TESTS

CHAPTER 14 NONPARAMETRIC TESTS CHAPTER 14 NONPARAMETRIC TESTS Everything that we have done up until now in statistics has relied heavily on one major fact: that our data is normally distributed. We have been able to make inferences

More information

Chapter 7. One-way ANOVA

Chapter 7. One-way ANOVA Chapter 7 One-way ANOVA One-way ANOVA examines equality of population means for a quantitative outcome and a single categorical explanatory variable with any number of levels. The t-test of Chapter 6 looks

More information

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression Objectives: To perform a hypothesis test concerning the slope of a least squares line To recognize that testing for a

More information

An analysis method for a quantitative outcome and two categorical explanatory variables.

An analysis method for a quantitative outcome and two categorical explanatory variables. Chapter 11 Two-Way ANOVA An analysis method for a quantitative outcome and two categorical explanatory variables. If an experiment has a quantitative outcome and two categorical explanatory variables that

More information

ABSORBENCY OF PAPER TOWELS

ABSORBENCY OF PAPER TOWELS ABSORBENCY OF PAPER TOWELS 15. Brief Version of the Case Study 15.1 Problem Formulation 15.2 Selection of Factors 15.3 Obtaining Random Samples of Paper Towels 15.4 How will the Absorbency be measured?

More information

UNDERSTANDING ANALYSIS OF COVARIANCE (ANCOVA)

UNDERSTANDING ANALYSIS OF COVARIANCE (ANCOVA) UNDERSTANDING ANALYSIS OF COVARIANCE () In general, research is conducted for the purpose of explaining the effects of the independent variable on the dependent variable, and the purpose of research design

More information

One-Way ANOVA using SPSS 11.0. SPSS ANOVA procedures found in the Compare Means analyses. Specifically, we demonstrate

One-Way ANOVA using SPSS 11.0. SPSS ANOVA procedures found in the Compare Means analyses. Specifically, we demonstrate 1 One-Way ANOVA using SPSS 11.0 This section covers steps for testing the difference between three or more group means using the SPSS ANOVA procedures found in the Compare Means analyses. Specifically,

More information

Analysis of Variance ANOVA

Analysis of Variance ANOVA Analysis of Variance ANOVA Overview We ve used the t -test to compare the means from two independent groups. Now we ve come to the final topic of the course: how to compare means from more than two populations.

More information

CS 147: Computer Systems Performance Analysis

CS 147: Computer Systems Performance Analysis CS 147: Computer Systems Performance Analysis One-Factor Experiments CS 147: Computer Systems Performance Analysis One-Factor Experiments 1 / 42 Overview Introduction Overview Overview Introduction Finding

More information

ANOVA ANOVA. Two-Way ANOVA. One-Way ANOVA. When to use ANOVA ANOVA. Analysis of Variance. Chapter 16. A procedure for comparing more than two groups

ANOVA ANOVA. Two-Way ANOVA. One-Way ANOVA. When to use ANOVA ANOVA. Analysis of Variance. Chapter 16. A procedure for comparing more than two groups ANOVA ANOVA Analysis of Variance Chapter 6 A procedure for comparing more than two groups independent variable: smoking status non-smoking one pack a day > two packs a day dependent variable: number of

More information

" Y. Notation and Equations for Regression Lecture 11/4. Notation:

 Y. Notation and Equations for Regression Lecture 11/4. Notation: Notation: Notation and Equations for Regression Lecture 11/4 m: The number of predictor variables in a regression Xi: One of multiple predictor variables. The subscript i represents any number from 1 through

More information

SPSS Explore procedure

SPSS Explore procedure SPSS Explore procedure One useful function in SPSS is the Explore procedure, which will produce histograms, boxplots, stem-and-leaf plots and extensive descriptive statistics. To run the Explore procedure,

More information

Regression step-by-step using Microsoft Excel

Regression step-by-step using Microsoft Excel Step 1: Regression step-by-step using Microsoft Excel Notes prepared by Pamela Peterson Drake, James Madison University Type the data into the spreadsheet The example used throughout this How to is a regression

More information

Module 5: Multiple Regression Analysis

Module 5: Multiple Regression Analysis Using Statistical Data Using to Make Statistical Decisions: Data Multiple to Make Regression Decisions Analysis Page 1 Module 5: Multiple Regression Analysis Tom Ilvento, University of Delaware, College

More information

13: Additional ANOVA Topics. Post hoc Comparisons

13: Additional ANOVA Topics. Post hoc Comparisons 13: Additional ANOVA Topics Post hoc Comparisons ANOVA Assumptions Assessing Group Variances When Distributional Assumptions are Severely Violated Kruskal-Wallis Test Post hoc Comparisons In the prior

More information

Experimental Designs (revisited)

Experimental Designs (revisited) Introduction to ANOVA Copyright 2000, 2011, J. Toby Mordkoff Probably, the best way to start thinking about ANOVA is in terms of factors with levels. (I say this because this is how they are described

More information

Descriptive Statistics

Descriptive Statistics Descriptive Statistics Primer Descriptive statistics Central tendency Variation Relative position Relationships Calculating descriptive statistics Descriptive Statistics Purpose to describe or summarize

More information

Simple Regression Theory II 2010 Samuel L. Baker

Simple Regression Theory II 2010 Samuel L. Baker SIMPLE REGRESSION THEORY II 1 Simple Regression Theory II 2010 Samuel L. Baker Assessing how good the regression equation is likely to be Assignment 1A gets into drawing inferences about how close the

More information

TABLE OF CONTENTS. About Chi Squares... 1. What is a CHI SQUARE?... 1. Chi Squares... 1. Hypothesis Testing with Chi Squares... 2

TABLE OF CONTENTS. About Chi Squares... 1. What is a CHI SQUARE?... 1. Chi Squares... 1. Hypothesis Testing with Chi Squares... 2 About Chi Squares TABLE OF CONTENTS About Chi Squares... 1 What is a CHI SQUARE?... 1 Chi Squares... 1 Goodness of fit test (One-way χ 2 )... 1 Test of Independence (Two-way χ 2 )... 2 Hypothesis Testing

More information

Linear Models in STATA and ANOVA

Linear Models in STATA and ANOVA Session 4 Linear Models in STATA and ANOVA Page Strengths of Linear Relationships 4-2 A Note on Non-Linear Relationships 4-4 Multiple Linear Regression 4-5 Removal of Variables 4-8 Independent Samples

More information

When to use Excel. When NOT to use Excel 9/24/2014

When to use Excel. When NOT to use Excel 9/24/2014 Analyzing Quantitative Assessment Data with Excel October 2, 2014 Jeremy Penn, Ph.D. Director When to use Excel You want to quickly summarize or analyze your assessment data You want to create basic visual

More information

MULTIPLE REGRESSION WITH CATEGORICAL DATA

MULTIPLE REGRESSION WITH CATEGORICAL DATA DEPARTMENT OF POLITICAL SCIENCE AND INTERNATIONAL RELATIONS Posc/Uapp 86 MULTIPLE REGRESSION WITH CATEGORICAL DATA I. AGENDA: A. Multiple regression with categorical variables. Coding schemes. Interpreting

More information

Simple Tricks for Using SPSS for Windows

Simple Tricks for Using SPSS for Windows Simple Tricks for Using SPSS for Windows Chapter 14. Follow-up Tests for the Two-Way Factorial ANOVA The Interaction is Not Significant If you have performed a two-way ANOVA using the General Linear Model,

More information

One-Way Analysis of Variance

One-Way Analysis of Variance One-Way Analysis of Variance Note: Much of the math here is tedious but straightforward. We ll skim over it in class but you should be sure to ask questions if you don t understand it. I. Overview A. We

More information

SCHOOL OF HEALTH AND HUMAN SCIENCES DON T FORGET TO RECODE YOUR MISSING VALUES

SCHOOL OF HEALTH AND HUMAN SCIENCES DON T FORGET TO RECODE YOUR MISSING VALUES SCHOOL OF HEALTH AND HUMAN SCIENCES Using SPSS Topics addressed today: 1. Differences between groups 2. Graphing Use the s4data.sav file for the first part of this session. DON T FORGET TO RECODE YOUR

More information

This chapter will demonstrate how to perform multiple linear regression with IBM SPSS

This chapter will demonstrate how to perform multiple linear regression with IBM SPSS CHAPTER 7B Multiple Regression: Statistical Methods Using IBM SPSS This chapter will demonstrate how to perform multiple linear regression with IBM SPSS first using the standard method and then using the

More information

2. Simple Linear Regression

2. Simple Linear Regression Research methods - II 3 2. Simple Linear Regression Simple linear regression is a technique in parametric statistics that is commonly used for analyzing mean response of a variable Y which changes according

More information

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96 1 Final Review 2 Review 2.1 CI 1-propZint Scenario 1 A TV manufacturer claims in its warranty brochure that in the past not more than 10 percent of its TV sets needed any repair during the first two years

More information

Estimation of σ 2, the variance of ɛ

Estimation of σ 2, the variance of ɛ Estimation of σ 2, the variance of ɛ The variance of the errors σ 2 indicates how much observations deviate from the fitted surface. If σ 2 is small, parameters β 0, β 1,..., β k will be reliably estimated

More information

January 26, 2009 The Faculty Center for Teaching and Learning

January 26, 2009 The Faculty Center for Teaching and Learning THE BASICS OF DATA MANAGEMENT AND ANALYSIS A USER GUIDE January 26, 2009 The Faculty Center for Teaching and Learning THE BASICS OF DATA MANAGEMENT AND ANALYSIS Table of Contents Table of Contents... i

More information

Multiple-Comparison Procedures

Multiple-Comparison Procedures Multiple-Comparison Procedures References A good review of many methods for both parametric and nonparametric multiple comparisons, planned and unplanned, and with some discussion of the philosophical

More information

NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( )

NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( ) Chapter 340 Principal Components Regression Introduction is a technique for analyzing multiple regression data that suffer from multicollinearity. When multicollinearity occurs, least squares estimates

More information

Statistical Models in R

Statistical Models in R Statistical Models in R Some Examples Steven Buechler Department of Mathematics 276B Hurley Hall; 1-6233 Fall, 2007 Outline Statistical Models Structure of models in R Model Assessment (Part IA) Anova

More information

Study Guide for the Final Exam

Study Guide for the Final Exam Study Guide for the Final Exam When studying, remember that the computational portion of the exam will only involve new material (covered after the second midterm), that material from Exam 1 will make

More information

Multiple Linear Regression

Multiple Linear Regression Multiple Linear Regression A regression with two or more explanatory variables is called a multiple regression. Rather than modeling the mean response as a straight line, as in simple regression, it is

More information

3.4 Statistical inference for 2 populations based on two samples

3.4 Statistical inference for 2 populations based on two samples 3.4 Statistical inference for 2 populations based on two samples Tests for a difference between two population means The first sample will be denoted as X 1, X 2,..., X m. The second sample will be denoted

More information

Data Analysis Tools. Tools for Summarizing Data

Data Analysis Tools. Tools for Summarizing Data Data Analysis Tools This section of the notes is meant to introduce you to many of the tools that are provided by Excel under the Tools/Data Analysis menu item. If your computer does not have that tool

More information

THE KRUSKAL WALLLIS TEST

THE KRUSKAL WALLLIS TEST THE KRUSKAL WALLLIS TEST TEODORA H. MEHOTCHEVA Wednesday, 23 rd April 08 THE KRUSKAL-WALLIS TEST: The non-parametric alternative to ANOVA: testing for difference between several independent groups 2 NON

More information

Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm

Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm Mgt 540 Research Methods Data Analysis 1 Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm http://web.utk.edu/~dap/random/order/start.htm

More information

AP: LAB 8: THE CHI-SQUARE TEST. Probability, Random Chance, and Genetics

AP: LAB 8: THE CHI-SQUARE TEST. Probability, Random Chance, and Genetics Ms. Foglia Date AP: LAB 8: THE CHI-SQUARE TEST Probability, Random Chance, and Genetics Why do we study random chance and probability at the beginning of a unit on genetics? Genetics is the study of inheritance,

More information

Unit 31: One-Way ANOVA

Unit 31: One-Way ANOVA Unit 31: One-Way ANOVA Summary of Video A vase filled with coins takes center stage as the video begins. Students will be taking part in an experiment organized by psychology professor John Kelly in which

More information

Statistics Review PSY379

Statistics Review PSY379 Statistics Review PSY379 Basic concepts Measurement scales Populations vs. samples Continuous vs. discrete variable Independent vs. dependent variable Descriptive vs. inferential stats Common analyses

More information

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING In this lab you will explore the concept of a confidence interval and hypothesis testing through a simulation problem in engineering setting.

More information

Crosstabulation & Chi Square

Crosstabulation & Chi Square Crosstabulation & Chi Square Robert S Michael Chi-square as an Index of Association After examining the distribution of each of the variables, the researcher s next task is to look for relationships among

More information

Outline. Topic 4 - Analysis of Variance Approach to Regression. Partitioning Sums of Squares. Total Sum of Squares. Partitioning sums of squares

Outline. Topic 4 - Analysis of Variance Approach to Regression. Partitioning Sums of Squares. Total Sum of Squares. Partitioning sums of squares Topic 4 - Analysis of Variance Approach to Regression Outline Partitioning sums of squares Degrees of freedom Expected mean squares General linear test - Fall 2013 R 2 and the coefficient of correlation

More information

Final Exam Practice Problem Answers

Final Exam Practice Problem Answers Final Exam Practice Problem Answers The following data set consists of data gathered from 77 popular breakfast cereals. The variables in the data set are as follows: Brand: The brand name of the cereal

More information

CHAPTER 13. Experimental Design and Analysis of Variance

CHAPTER 13. Experimental Design and Analysis of Variance CHAPTER 13 Experimental Design and Analysis of Variance CONTENTS STATISTICS IN PRACTICE: BURKE MARKETING SERVICES, INC. 13.1 AN INTRODUCTION TO EXPERIMENTAL DESIGN AND ANALYSIS OF VARIANCE Data Collection

More information

SPSS Guide: Regression Analysis

SPSS Guide: Regression Analysis SPSS Guide: Regression Analysis I put this together to give you a step-by-step guide for replicating what we did in the computer lab. It should help you run the tests we covered. The best way to get familiar

More information

Two Related Samples t Test

Two Related Samples t Test Two Related Samples t Test In this example 1 students saw five pictures of attractive people and five pictures of unattractive people. For each picture, the students rated the friendliness of the person

More information

STAT 350 Practice Final Exam Solution (Spring 2015)

STAT 350 Practice Final Exam Solution (Spring 2015) PART 1: Multiple Choice Questions: 1) A study was conducted to compare five different training programs for improving endurance. Forty subjects were randomly divided into five groups of eight subjects

More information

The ANOVA for 2x2 Independent Groups Factorial Design

The ANOVA for 2x2 Independent Groups Factorial Design The ANOVA for 2x2 Independent Groups Factorial Design Please Note: In the analyses above I have tried to avoid using the terms "Independent Variable" and "Dependent Variable" (IV and DV) in order to emphasize

More information

Testing Group Differences using T-tests, ANOVA, and Nonparametric Measures

Testing Group Differences using T-tests, ANOVA, and Nonparametric Measures Testing Group Differences using T-tests, ANOVA, and Nonparametric Measures Jamie DeCoster Department of Psychology University of Alabama 348 Gordon Palmer Hall Box 870348 Tuscaloosa, AL 35487-0348 Phone:

More information

MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS

MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS MSR = Mean Regression Sum of Squares MSE = Mean Squared Error RSS = Regression Sum of Squares SSE = Sum of Squared Errors/Residuals α = Level of Significance

More information

Projects Involving Statistics (& SPSS)

Projects Involving Statistics (& SPSS) Projects Involving Statistics (& SPSS) Academic Skills Advice Starting a project which involves using statistics can feel confusing as there seems to be many different things you can do (charts, graphs,

More information

Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means

Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means Lesson : Comparison of Population Means Part c: Comparison of Two- Means Welcome to lesson c. This third lesson of lesson will discuss hypothesis testing for two independent means. Steps in Hypothesis

More information

Notes on Applied Linear Regression

Notes on Applied Linear Regression Notes on Applied Linear Regression Jamie DeCoster Department of Social Psychology Free University Amsterdam Van der Boechorststraat 1 1081 BT Amsterdam The Netherlands phone: +31 (0)20 444-8935 email:

More information

Chapter 14: Repeated Measures Analysis of Variance (ANOVA)

Chapter 14: Repeated Measures Analysis of Variance (ANOVA) Chapter 14: Repeated Measures Analysis of Variance (ANOVA) First of all, you need to recognize the difference between a repeated measures (or dependent groups) design and the between groups (or independent

More information

11. Analysis of Case-control Studies Logistic Regression

11. Analysis of Case-control Studies Logistic Regression Research methods II 113 11. Analysis of Case-control Studies Logistic Regression This chapter builds upon and further develops the concepts and strategies described in Ch.6 of Mother and Child Health:

More information

2013 MBA Jump Start Program. Statistics Module Part 3

2013 MBA Jump Start Program. Statistics Module Part 3 2013 MBA Jump Start Program Module 1: Statistics Thomas Gilbert Part 3 Statistics Module Part 3 Hypothesis Testing (Inference) Regressions 2 1 Making an Investment Decision A researcher in your firm just

More information

General Method: Difference of Means. 3. Calculate df: either Welch-Satterthwaite formula or simpler df = min(n 1, n 2 ) 1.

General Method: Difference of Means. 3. Calculate df: either Welch-Satterthwaite formula or simpler df = min(n 1, n 2 ) 1. General Method: Difference of Means 1. Calculate x 1, x 2, SE 1, SE 2. 2. Combined SE = SE1 2 + SE2 2. ASSUMES INDEPENDENT SAMPLES. 3. Calculate df: either Welch-Satterthwaite formula or simpler df = min(n

More information

Chapter 2 Probability Topics SPSS T tests

Chapter 2 Probability Topics SPSS T tests Chapter 2 Probability Topics SPSS T tests Data file used: gss.sav In the lecture about chapter 2, only the One-Sample T test has been explained. In this handout, we also give the SPSS methods to perform

More information

Regression Analysis: A Complete Example

Regression Analysis: A Complete Example Regression Analysis: A Complete Example This section works out an example that includes all the topics we have discussed so far in this chapter. A complete example of regression analysis. PhotoDisc, Inc./Getty

More information

Premaster Statistics Tutorial 4 Full solutions

Premaster Statistics Tutorial 4 Full solutions Premaster Statistics Tutorial 4 Full solutions Regression analysis Q1 (based on Doane & Seward, 4/E, 12.7) a. Interpret the slope of the fitted regression = 125,000 + 150. b. What is the prediction for

More information

Statistics courses often teach the two-sample t-test, linear regression, and analysis of variance

Statistics courses often teach the two-sample t-test, linear regression, and analysis of variance 2 Making Connections: The Two-Sample t-test, Regression, and ANOVA In theory, there s no difference between theory and practice. In practice, there is. Yogi Berra 1 Statistics courses often teach the two-sample

More information

Introduction. Hypothesis Testing. Hypothesis Testing. Significance Testing

Introduction. Hypothesis Testing. Hypothesis Testing. Significance Testing Introduction Hypothesis Testing Mark Lunt Arthritis Research UK Centre for Ecellence in Epidemiology University of Manchester 13/10/2015 We saw last week that we can never know the population parameters

More information

Main Effects and Interactions

Main Effects and Interactions Main Effects & Interactions page 1 Main Effects and Interactions So far, we ve talked about studies in which there is just one independent variable, such as violence of television program. You might randomly

More information

The Dummy s Guide to Data Analysis Using SPSS

The Dummy s Guide to Data Analysis Using SPSS The Dummy s Guide to Data Analysis Using SPSS Mathematics 57 Scripps College Amy Gamble April, 2001 Amy Gamble 4/30/01 All Rights Rerserved TABLE OF CONTENTS PAGE Helpful Hints for All Tests...1 Tests

More information

Directions for using SPSS

Directions for using SPSS Directions for using SPSS Table of Contents Connecting and Working with Files 1. Accessing SPSS... 2 2. Transferring Files to N:\drive or your computer... 3 3. Importing Data from Another File Format...

More information

Multivariate Analysis of Variance. The general purpose of multivariate analysis of variance (MANOVA) is to determine

Multivariate Analysis of Variance. The general purpose of multivariate analysis of variance (MANOVA) is to determine 2 - Manova 4.3.05 25 Multivariate Analysis of Variance What Multivariate Analysis of Variance is The general purpose of multivariate analysis of variance (MANOVA) is to determine whether multiple levels

More information

IBM SPSS Statistics 20 Part 4: Chi-Square and ANOVA

IBM SPSS Statistics 20 Part 4: Chi-Square and ANOVA CALIFORNIA STATE UNIVERSITY, LOS ANGELES INFORMATION TECHNOLOGY SERVICES IBM SPSS Statistics 20 Part 4: Chi-Square and ANOVA Summer 2013, Version 2.0 Table of Contents Introduction...2 Downloading the

More information

Chapter Seven. Multiple regression An introduction to multiple regression Performing a multiple regression on SPSS

Chapter Seven. Multiple regression An introduction to multiple regression Performing a multiple regression on SPSS Chapter Seven Multiple regression An introduction to multiple regression Performing a multiple regression on SPSS Section : An introduction to multiple regression WHAT IS MULTIPLE REGRESSION? Multiple

More information

Introduction to Regression and Data Analysis

Introduction to Regression and Data Analysis Statlab Workshop Introduction to Regression and Data Analysis with Dan Campbell and Sherlock Campbell October 28, 2008 I. The basics A. Types of variables Your variables may take several forms, and it

More information

II. DISTRIBUTIONS distribution normal distribution. standard scores

II. DISTRIBUTIONS distribution normal distribution. standard scores Appendix D Basic Measurement And Statistics The following information was developed by Steven Rothke, PhD, Department of Psychology, Rehabilitation Institute of Chicago (RIC) and expanded by Mary F. Schmidt,

More information

Research Methods & Experimental Design

Research Methods & Experimental Design Research Methods & Experimental Design 16.422 Human Supervisory Control April 2004 Research Methods Qualitative vs. quantitative Understanding the relationship between objectives (research question) and

More information

COMPARISONS OF CUSTOMER LOYALTY: PUBLIC & PRIVATE INSURANCE COMPANIES.

COMPARISONS OF CUSTOMER LOYALTY: PUBLIC & PRIVATE INSURANCE COMPANIES. 277 CHAPTER VI COMPARISONS OF CUSTOMER LOYALTY: PUBLIC & PRIVATE INSURANCE COMPANIES. This chapter contains a full discussion of customer loyalty comparisons between private and public insurance companies

More information

Fairfield Public Schools

Fairfield Public Schools Mathematics Fairfield Public Schools AP Statistics AP Statistics BOE Approved 04/08/2014 1 AP STATISTICS Critical Areas of Focus AP Statistics is a rigorous course that offers advanced students an opportunity

More information

Statistics 2014 Scoring Guidelines

Statistics 2014 Scoring Guidelines AP Statistics 2014 Scoring Guidelines College Board, Advanced Placement Program, AP, AP Central, and the acorn logo are registered trademarks of the College Board. AP Central is the official online home

More information

Hypothesis testing - Steps

Hypothesis testing - Steps Hypothesis testing - Steps Steps to do a two-tailed test of the hypothesis that β 1 0: 1. Set up the hypotheses: H 0 : β 1 = 0 H a : β 1 0. 2. Compute the test statistic: t = b 1 0 Std. error of b 1 =

More information

Biostatistics: DESCRIPTIVE STATISTICS: 2, VARIABILITY

Biostatistics: DESCRIPTIVE STATISTICS: 2, VARIABILITY Biostatistics: DESCRIPTIVE STATISTICS: 2, VARIABILITY 1. Introduction Besides arriving at an appropriate expression of an average or consensus value for observations of a population, it is important to

More information

Multivariate Analysis of Variance (MANOVA)

Multivariate Analysis of Variance (MANOVA) Multivariate Analysis of Variance (MANOVA) Aaron French, Marcelo Macedo, John Poulsen, Tyler Waterson and Angela Yu Keywords: MANCOVA, special cases, assumptions, further reading, computations Introduction

More information

1 Theory: The General Linear Model

1 Theory: The General Linear Model QMIN GLM Theory - 1.1 1 Theory: The General Linear Model 1.1 Introduction Before digital computers, statistics textbooks spoke of three procedures regression, the analysis of variance (ANOVA), and the

More information

Moderation. Moderation

Moderation. Moderation Stats - Moderation Moderation A moderator is a variable that specifies conditions under which a given predictor is related to an outcome. The moderator explains when a DV and IV are related. Moderation

More information

Chapter 9. Two-Sample Tests. Effect Sizes and Power Paired t Test Calculation

Chapter 9. Two-Sample Tests. Effect Sizes and Power Paired t Test Calculation Chapter 9 Two-Sample Tests Paired t Test (Correlated Groups t Test) Effect Sizes and Power Paired t Test Calculation Summary Independent t Test Chapter 9 Homework Power and Two-Sample Tests: Paired Versus

More information

How To Run Statistical Tests in Excel

How To Run Statistical Tests in Excel How To Run Statistical Tests in Excel Microsoft Excel is your best tool for storing and manipulating data, calculating basic descriptive statistics such as means and standard deviations, and conducting

More information

Analysis of Variance. MINITAB User s Guide 2 3-1

Analysis of Variance. MINITAB User s Guide 2 3-1 3 Analysis of Variance Analysis of Variance Overview, 3-2 One-Way Analysis of Variance, 3-5 Two-Way Analysis of Variance, 3-11 Analysis of Means, 3-13 Overview of Balanced ANOVA and GLM, 3-18 Balanced

More information