One Way Analysis of Variance (ANOVA)

Similar documents
Multiple Linear Regression

N-Way Analysis of Variance

Chapter 5 Analysis of variance SPSS Analysis of variance

SPSS Tests for Versions 9 to 13

Multiple Regression in SPSS This example shows you how to perform multiple regression. The basic command is regression : linear.

INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA)

One-Way ANOVA using SPSS SPSS ANOVA procedures found in the Compare Means analyses. Specifically, we demonstrate

Simple Tricks for Using SPSS for Windows

Mixed 2 x 3 ANOVA. Notes

Data Analysis Tools. Tools for Summarizing Data

Bill Burton Albert Einstein College of Medicine April 28, 2014 EERS: Managing the Tension Between Rigor and Resources 1

SPSS Explore procedure

CHAPTER 11 CHI-SQUARE AND F DISTRIBUTIONS

Multiple-Comparison Procedures

ABSORBENCY OF PAPER TOWELS

EXCEL Analysis TookPak [Statistical Analysis] 1. First of all, check to make sure that the Analysis ToolPak is installed. Here is how you do it:

SCHOOL OF HEALTH AND HUMAN SCIENCES DON T FORGET TO RECODE YOUR MISSING VALUES

SPSS/Excel Workshop 3 Summer Semester, 2010

Comparing Means in Two Populations

ANALYSIS OF TREND CHAPTER 5

Section 13, Part 1 ANOVA. Analysis Of Variance

There are six different windows that can be opened when using SPSS. The following will give a description of each of them.

One-Way Analysis of Variance (ANOVA) Example Problem

13: Additional ANOVA Topics. Post hoc Comparisons

KSTAT MINI-MANUAL. Decision Sciences 434 Kellogg Graduate School of Management

ANOVA ANOVA. Two-Way ANOVA. One-Way ANOVA. When to use ANOVA ANOVA. Analysis of Variance. Chapter 16. A procedure for comparing more than two groups

An introduction to IBM SPSS Statistics

Linear Models in STATA and ANOVA

1.5 Oneway Analysis of Variance

Two-way ANOVA and ANCOVA

Scatter Plots with Error Bars

Bowerman, O'Connell, Aitken Schermer, & Adcock, Business Statistics in Practice, Canadian edition

A Basic Guide to Analyzing Individual Scores Data with SPSS

1.1. Simple Regression in Excel (Excel 2010).

5 Analysis of Variance models, complex linear models and Random effects models

Two-Way ANOVA tests. I. Definition and Applications...2. II. Two-Way ANOVA prerequisites...2. III. How to use the Two-Way ANOVA tool?...

Main Effects and Interactions

Directions for Frequency Tables, Histograms, and Frequency Bar Charts

SPSS Introduction. Yi Li

Introduction to Analysis of Variance (ANOVA) Limitations of the t-test

Analysis of Variance ANOVA

ANOVA. February 12, 2015

Regression step-by-step using Microsoft Excel

12: Analysis of Variance. Introduction

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

Directions for using SPSS

Doing Multiple Regression with SPSS. In this case, we are interested in the Analyze options so we choose that menu. If gives us a number of choices:

Analysis of Variance. MINITAB User s Guide 2 3-1

INTRODUCTION TO SPSS FOR WINDOWS Version 19.0

Chapter 7. One-way ANOVA

What is a Mail Merge?

Using R for Linear Regression

Systat: Statistical Visualization Software

SPSS Guide: Regression Analysis

IBM SPSS Statistics 20 Part 4: Chi-Square and ANOVA

January 26, 2009 The Faculty Center for Teaching and Learning

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

Guide to Microsoft Excel for calculations, statistics, and plotting data

How to install and use the File Sharing Outlook Plugin

Correlation and Simple Linear Regression

We extended the additive model in two variables to the interaction model by adding a third term to the equation.

t Tests in Excel The Excel Statistical Master By Mark Harmon Copyright 2011 Mark Harmon

Introduction to Statistical Computing in Microsoft Excel By Hector D. Flores; and Dr. J.A. Dobelman

ABSTRACT INTRODUCTION READING THE DATA SESUG Paper PO-14

Statistical Models in R

Moderation. Moderation

Using SPSS, Chapter 2: Descriptive Statistics

Regression Analysis (Spring, 2000)

8. Comparing Means Using One Way ANOVA

Statistical Analysis Using SPSS for Windows Getting Started (Ver. 2014/11/6) The numbers of figures in the SPSS_screenshot.pptx are shown in red.

Chapter 2 Probability Topics SPSS T tests

Predictor Coef StDev T P Constant X S = R-Sq = 0.0% R-Sq(adj) = 0.

Once saved, if the file was zipped you will need to unzip it. For the files that I will be posting you need to change the preferences.

One-Way Analysis of Variance

Curve Fitting. Before You Begin

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression

EDUCATION AND VOCABULARY MULTIPLE REGRESSION IN ACTION

Minitab Tutorials for Design and Analysis of Experiments. Table of Contents

Data Analysis in SPSS. February 21, If you wish to cite the contents of this document, the APA reference for them would be

Concepts of Experimental Design

Analysis of Data. Organizing Data Files in SPSS. Descriptive Statistics

Task Scheduler. Morgan N. Sandquist Developer: Gary Meyer Reviewer: Lauri Watts

Two Related Samples t Test

Adobe Acrobat X Pro Creating & Working with PDF Documents

In This Issue: Excel Sorting with Text and Numbers

Statistiek II. John Nerbonne. October 1, Dept of Information Science

Examining Differences (Comparing Groups) using SPSS Inferential statistics (Part I) Dwayne Devonish

Introduction to Quantitative Methods

Installing a Browser Security Certificate for PowerChute Business Edition Agent

Simple Linear Regression Inference

Data Analysis. Using Excel. Jeffrey L. Rummel. BBA Seminar. Data in Excel. Excel Calculations of Descriptive Statistics. Single Variable Graphs

SPSS Guide How-to, Tips, Tricks & Statistical Techniques

Using Excel in Research. Hui Bian Office for Faculty Excellence

ANALYSING LIKERT SCALE/TYPE DATA, ORDINAL LOGISTIC REGRESSION EXAMPLE IN R.

COMPARISONS OF CUSTOMER LOYALTY: PUBLIC & PRIVATE INSURANCE COMPANIES.

Simple Predictive Analytics Curtis Seare

Using MS Excel to Analyze Data: A Tutorial

Microsoft Office 2010

How to set the main menu of STATA to default factory settings standards

Transcription:

One Way Analysis of Variance (ANOVA) Example: Researchers wish to see if there is difference in average BMI among three.different populations. Use the following data to test if there is significant difference in average BMI among three different populations, at 5% level of significance. BMI Population 1 Population 2 Population 3 30 29 33 27 27 32 28 24 29 26 26 37 29 25 30 27 26 38 29 27 35 1) Create the data with two variables, BMI and Population. The arrangement of the data should as in the Figure on the right for data collected from 21 subjects from the three independent samples. 2) If the Population variable data was entered as numeric, one should convert it into a Factor variable. The values of the Population variable, 1, 2, and 3 can be relabeled to Population 1, Population 2, and Population 3, in the process of converting to factor. To convert to factor, click on Data in the R Commander menu bar, and then select Manage variable in active data set, and select Convert numeric variables to factors. To convert re-label the Population variable, in the Convert Numeric Variables to Factors dialog box, select the Population variable and checked on Use Numbers option (or click on Supply level names option if one wishes to supply names for relabeling the numbers) and click on OK button, and also click on OK to overwrite variable.

Check normality assumption in One Way ANOVA 3) To check the normality assumption for ANOVA F-test, one can use the following by() function in R Commander Script Window and click Submit button to run the by function for normality test. Be sure that the Population variable is a factor variable. R Script for normality test: by(bmi$bmi, BMI$Population, shapiro.test) Data set name. Output shown in R Commander Output Window: BMI$Population: 1 So, BMI$Population means the Population variable in BMI data set. W = 0.9524, p-value = 0.7518 ------------------------------------------------ BMI$Population: 2 W = 0.9671, p-value = 0.8766 ------------------------------------------------ BMI$Population: 3 W = 0.9534, p-value = 0.7602 The p-values from normality tests for all three samples are all greater 0.05. The normality assumption is acceptable. Check Homogeneity of variances assumption for One Way ANOVA 4) For checking the homogeneity of variances assumption, in R Commander, click on Statistics in menu bar, and select Variances, and then select Bartlett s test option, and then select the Population as the group variable and BMI as the Response variable and click OK. See the following figure:

Output from R Commander on Barlett s Test of Homogeneity of Variances: > tapply(bmi$bmi, BMI$Population, var, na.rm=true) 1 2 3 2.000000 2.571429 11.619048 > bartlett.test(bmi ~ Population, data=bmi) Bartlett test of homogeneity of variances data: BMI by Population Bartlett's K-squared = 5.4029, df = 2, p-value = 0.06711 The p-value of the test is 0.06711 which is greater than 0.05, therefore the homogeneity of variances assumption is acceptable. The difference in variances is not statistically significant. Since both normality and homogeneity of variances assumptions are satisfied, we can run the One Way ANOVA F-test for equality of population means.

5) To run the One Way ANOVA for testing difference in average BMI s, click on Statistics and select Means and choose One Way ANOVA option, and in the One-way ANOVA dialog box, choose Population as the Group variable and BMI as the Response variable. If one wishes to also perform multiple comparisons for means, the Pairwise comparisons of means check box should be checked. The output from R Commander Window: > AnovaModel.1 <- aov(bmi ~ Population, data=bmi) > summary(anovamodel.1) Df Sum Sq Mean Sq F value Pr(>F) Population 2 194.667 97.333 18.035 5.021e-05 *** Residuals 18 97.143 5.397 --- Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1 (The p-value of the F-test is 5.021e-05, i.e. 0.00005021, which is less than 0.05. One can conclude that the difference in means is significant.)

> numsummary(bmi$bmi, groups=bmi$population, + statistics=c("mean", "sd")) mean sd n 1 28.00000 1.414214 7 2 26.28571 1.603567 7 3 33.42857 3.408672 7 >.Pairs <- glht(anovamodel.1, linfct = + mcp(population = "Tukey")) > confint(.pairs) # confidence intervals Simultaneous Confidence Intervals Multiple Comparisons of Means: Tukey Contrasts Fit: aov(formula = BMI ~ Population, data = BMI) Quantile = 2.5522 95% family-wise confidence level Linear Hypotheses: Estimate lwr upr 2-1 == 0-1.7143-4.8847 1.4561 3-1 == 0 5.4286 2.2582 8.5990 3-2 == 0 7.1429 3.9725 10.3132 > cld(.pairs) # compact letter display 2 3 1 "a" "b" "a" (From Tukey s multiple comparisons, the difference between group 2 and group 1 is not significant with a confidence interval estimate for the difference of the two means covers 0. However, the differences between groups 3 and 1, also groups 3 and 2, are both statistically significant with confidence intervals cover a range of positive numbers. See Linear Hypotheses result above.)