One-Way Analysis of Variance: A Guide to Testing Differences Between Multiple Groups



Similar documents
One-Way Analysis of Variance (ANOVA) Example Problem

General Regression Formulae ) (N-2) (1 - r 2 YX

1.5 Oneway Analysis of Variance

Regression step-by-step using Microsoft Excel

12: Analysis of Variance. Introduction

Elementary Statistics Sample Exam #3

" Y. Notation and Equations for Regression Lecture 11/4. Notation:

Outline. Topic 4 - Analysis of Variance Approach to Regression. Partitioning Sums of Squares. Total Sum of Squares. Partitioning sums of squares

Univariate Regression

Section 13, Part 1 ANOVA. Analysis Of Variance

An analysis method for a quantitative outcome and two categorical explanatory variables.

One-Way Analysis of Variance

Statistiek II. John Nerbonne. October 1, Dept of Information Science

2. Simple Linear Regression

Recall this chart that showed how most of our course would be organized:

Regression Analysis: A Complete Example

Part 2: Analysis of Relationship Between Two Variables

MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS

CHAPTER 13 SIMPLE LINEAR REGRESSION. Opening Example. Simple Regression. Linear Regression

1 Basic ANOVA concepts

Statistical Models in R

Multiple Linear Regression

Chapter 7. One-way ANOVA

CHAPTER 13. Experimental Design and Analysis of Variance

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression

Testing for Lack of Fit

Statistics Review PSY379

Week TSX Index

MULTIPLE REGRESSION ANALYSIS OF MAIN ECONOMIC INDICATORS IN TOURISM. R, analysis of variance, Student test, multivariate analysis

A Primer on Forecasting Business Performance

How to calculate an ANOVA table

NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( )

POLYNOMIAL AND MULTIPLE REGRESSION. Polynomial regression used to fit nonlinear (e.g. curvilinear) data into a least squares linear regression model.

The Analysis of Variance ANOVA

Study Guide for the Final Exam

Final Exam Practice Problem Answers

CS 147: Computer Systems Performance Analysis

Randomized Block Analysis of Variance

Example: Boats and Manatees

Introduction to Analysis of Variance (ANOVA) Limitations of the t-test

5. Linear Regression

Difference of Means and ANOVA Problems

International Statistical Institute, 56th Session, 2007: Phil Everson

DEPARTMENT OF PSYCHOLOGY UNIVERSITY OF LANCASTER MSC IN PSYCHOLOGICAL RESEARCH METHODS ANALYSING AND INTERPRETING DATA 2 PART 1 WEEK 9

individualdifferences

PSYC 381 Statistics Arlo Clark-Foos, Ph.D.

2. What is the general linear model to be used to model linear trend? (Write out the model) = or

N-Way Analysis of Variance

Simple Linear Regression Inference

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1)

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.

Testing Group Differences using T-tests, ANOVA, and Nonparametric Measures

1 Simple Linear Regression I Least Squares Estimation

Simple Methods and Procedures Used in Forecasting

Introduction to General and Generalized Linear Models

Jinadasa Gamage, Professor of Mathematics, Illinois State University, Normal, IL, e- mail:

Chapter 5 Analysis of variance SPSS Analysis of variance

Chapter 2 Simple Comparative Experiments Solutions

MULTIPLE REGRESSION WITH CATEGORICAL DATA

INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA)

Module 5: Multiple Regression Analysis

August 2012 EXAMINATIONS Solution Part I

SPSS Guide: Regression Analysis

Solutions to Homework 10 Statistics 302 Professor Larget

ANOVA ANOVA. Two-Way ANOVA. One-Way ANOVA. When to use ANOVA ANOVA. Analysis of Variance. Chapter 16. A procedure for comparing more than two groups

Rockefeller College University at Albany

Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means

The F distribution and the basic principle behind ANOVAs. Situating ANOVAs in the world of statistical tests

Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics.

Data Mining and Data Warehousing. Henryk Maciejewski. Data Mining Predictive modelling: regression

Case Study in Data Analysis Does a drug prevent cardiomegaly in heart failure?

Section 1: Simple Linear Regression

UNDERSTANDING ANALYSIS OF COVARIANCE (ANCOVA)

A STUDY OF WHETHER HAVING A PROFESSIONAL STAFF WITH ADVANCED DEGREES INCREASES STUDENT ACHIEVEMENT MEGAN M. MOSSER. Submitted to

STAT 350 Practice Final Exam Solution (Spring 2015)

Guide to Microsoft Excel for calculations, statistics, and plotting data

Notes on Applied Linear Regression

1 Theory: The General Linear Model

Multiple Linear Regression in Data Mining

Causal Forecasting Models

MULTIPLE REGRESSIONS ON SOME SELECTED MACROECONOMIC VARIABLES ON STOCK MARKET RETURNS FROM

MULTIPLE LINEAR REGRESSION ANALYSIS USING MICROSOFT EXCEL. by Michael L. Orlov Chemistry Department, Oregon State University (1996)

Two-sample t-tests. - Independent samples - Pooled standard devation - The equal variance assumption

Chapter 9. Two-Sample Tests. Effect Sizes and Power Paired t Test Calculation

ANOVA. February 12, 2015

Course Text. Required Computing Software. Course Description. Course Objectives. StraighterLine. Business Statistics

GLM I An Introduction to Generalized Linear Models

Experimental Designs (revisited)

The Wondrous World of fmri statistics

Calculating P-Values. Parkland College. Isela Guerra Parkland College. Recommended Citation

Lecture 11: Confidence intervals and model comparison for linear regression; analysis of variance

Multivariate Analysis of Variance (MANOVA): I. Theory

One-Way ANOVA using SPSS SPSS ANOVA procedures found in the Compare Means analyses. Specifically, we demonstrate

Testing Research and Statistical Hypotheses

Unit 31: One-Way ANOVA

The correlation coefficient

Correlation and Simple Linear Regression

Creating a Campus Netflow Model

1. The parameters to be estimated in the simple linear regression model Y=α+βx+ε ε~n(0,σ) are: a) α, β, σ b) α, β, ε c) a, b, s d) ε, 0, σ

Transcription:

One-Way Analysis of Variance: A Guide to Testing Differences Between Multiple Groups In analysis of variance, the main research question is whether the sample means are from different populations. The assumptions upon which the tests and estimation procedures of the analysis of variance are based on are as follows: a) whatever the technique of data collection, the observations within each sampled population are normally distributed. b) The sampled population has a common variance of s2. REQUIREMENTS One way ANOVA tests the equality of group means for a single specified variable. For example, The F ratio tests the statistical significance between means. Mathematical Formulations THE SUM OF SQUARES Let there be k populations, with population means µ1, µ2, µ3. µk, based on independent random samples of n1, n2, n3. nk observations, selected from populations 1,2,3..,k, respectively. Then the Total Sum of Squares is the sum of squares of deviation of all n ( n = n1 +n2 +n3 +.. + nk) x values about their overall mean i.e. k Total SS = SSx = i=1 (xi -x ˉ ) 2 The Total Sum of Squares can be broken down to two components that measure the source of variation. They are: Sum of Squares for Treatment (SST) k Ti Where: SST = i=1 ( 2 )-CM ni Ti = Total of all observations receiving the treatment i (or of the i th population) ni = Number of observations receiving the treatment i (or of the i th population) CM= Correction for the mean = T 2 /n T = Total of all observations = ( T1 + T2 + T3 +. + Tk ) n = Total number of Observations = ( n1 + n2 + n3 +. + nk ) Sum of Squares for Error (SSE) SSE is usually computed in a simplified way from the equation: SS ERROR = SS TOTAL SS TREATMENT ANOVA 1

THE DEGREES OF FREEDOM The degrees of freedom for the Total Sum of Squares is always (n 1); where n = Total number of observations in all samples = ( n1 + n2 + n3 +. + nk ) The degrees of freedom of the Model (Treatment) is always (k 1); where k = Total number of populations being analyzed. The degrees of freedom of the Error is always (n k). The following relationship always holds: D.F. (Treatment) + D.F. (ERROR) = (k-1) + (n-k) = (n-1) = D.F. (TOTAL SS) THE MEAN SQUARE The mean square gives an estimate of the s² based on the variation among the sample means (corresponding to the model) and the variation within the samples (corresponding to the error). These estimates are calculated by dividing the sum of squares by the corresponding degrees of freedom. Thus, The Mean Square for Treatment (Model) = MST = (SST)/(k-1) The Mean Square of the Error = MSE = (SSE)/(n-k) (The MSE is a pooled estimate of s 2 based on the sum of squares of deviations of the x-values about their respective sample means and is also denoted by s 2.) THE F STATISTIC The F statistic is used for comparing the estimate of s 2 (MS (Treatment) ) and the s 2 (MS (Error) ) and is given by F = MS (Treatment) /MS (Error). The Analysis The ANOVA is done with the Ho: μ 1 = μ 2 = μ 3 =..= μ k Next, using the tables, the F-value with degrees of freedom v1 (v1 = D.F. of the numerator i.e. of MS (Treatment) = k-1) and v2 (v2 = D.F. of the denominator i.e. of MS (Error) = n-k), and for the significance level used in the analysis, is obtained. ANOVA 2

This F-value is compared with the F statistic computed. If the F-value obtained is greater than or equal to the F-Statistic Computed; then we say that THERE IS INSUFFICIENT EVIDENCE TO REJECT THE NULL HYPOTHESIS AT THE GIVEN LEVEL OF SIGNIFICANCE. But, if the F-value obtained is less than the F-Statistic Computed; then we say that THERE IS SUFFICIENT EVIDENCE TO REJECT THE NULL HYPOTHESIS AT THE GIVEN LEVEL OF SIGNIFICANCE and that leads to the conclusion that at least one of the population means (μi) is different from the others. The observed significance level is the significance level for which the F-value obtained from the table, corresponding to degrees of freedom v1 and v2, is equal to the F statistic computed. Another way of testing the null hypothesis is by using this observed significance level. If this significance level is less than or equal to the significance level set for the test, then the null hypothesis is rejected. We re Here to Help! Qualtrics.com provides the most advanced online survey building, data collection (via panels or corporate / personal contacts), real-time view of survey results, and advanced dashboard reporting tools. If you are interested in learning more about how the Qualtrics professional services team can help you with a conjoint analysis research project, contact us at research@qualtrics.com. ANOVA 3

ANOVA 4

1. The Degree of Freedom for the Regression Model, also called the explained model, is given by k, where k = number of independent variables in the regression equation. For the Residual, the error unexplained by the regression model, the Degree of Freedom is given by (n-k-1), where n = number of counts of the independent variable in the data set. 2. Mean Square = (Sum of Squares)/(DF) 3. F Ratio = (Mean Square of the Regression)/(Mean Square of the Residual) 4. F-Prob = Level of significance corresponding to the F Value ANOVA 5

ANOVA 6