Package lmertest. July 16, 2015
|
|
|
- Erik Simon
- 10 years ago
- Views:
Transcription
1 Type Package Title Tests in Linear Mixed Effects Models Version Package lmertest July 16, 2015 Maintainer Alexandra Kuznetsova Depends R (>= 3.0.0), Matrix, stats, methods, lme4 (>= 1.0) Imports plyr, MASS, Hmisc, ggplot2 Suggests pbkrtest Different kinds of tests for linear mixed effects models as implemented in 'lme4' package are provided. The tests comprise types I - III F tests for fixed effects, LR tests for random effects. The package also provides the calculation of population means for fixed factors with confidence intervals and corresponding plots. Finally the backward elimination of non-significant effects is implemented. LazyData TRUE License GPL (>= 2) Repository CRAN NeedsCompilation no Author Alexandra Kuznetsova [aut, cre], Per Bruun Brockhoff [aut, ths], Rune Haubo Bojesen Christensen [aut] Date/Publication :54:21 R topics documented: lmertest-package anova-methods carrots difflsmeans ham lmer lsmeans
2 2 lmertest-package mermodlmertest-class rand step summary-methods TVbo Index 17 lmertest-package The package performs different kinds of tests on lmer objects, such as F tests of types I - III hypotheses for the fixed part, likelihood ratio tests for the random part, least squares means (population means) and differences of least squares means for the factors of the fixed part with corresponding plots. The package also provides with a function step, that preforms backward elimination of non-significant effects, starting from the random effects, and then fixed ones. Details The package provides anova function, that gives data frame similar to what gives lme4 package but with p-values calculated from F statistics of types I - III hypotheses. There are two options for denominator degrees of freedom of F statistics: "Satterthwaite" and "Kenward-Roger". The calculation of anova with Kenward-Roger s approximation is based on function from pbkrtest package, the calculation of Satterthwaite s approximation is based on SAS proc mixed theory (see reference). The package also provides summary function, which gives the same as lme4 package summary function but with p-values and degrees of freedom added for the t-test (based on Satterthwaite approximation for denominator degrees of freedom). The tests on random effects are performed using likelihood ratio tests. Package: lmertest Type: Package Version: 1.0 Date: License: GPL The calculation of statistics for the fixed part was developed according to SAS Proc Mixed Theory (see reference). Author(s) Alexandra Kuznetsova <[email protected]>, Per Bruun Brockhoff, Rune Haubo Bojesen Christensen
3 anova-methods 3 References SAS Technical Report R Tests of Hypotheses in Fixed-Effects Linear Models Copyright (C) (SAS Institute Inc., Cary, NC, USA) Goodnight, J.H General Linear Models Procedure (S.A.S. Institute, Inc.) Schaalje G.B., McBride J.B., Fellingham G.W Adequacy of approximations to distributions of test Statistics in complex mixed linear models #import lmertest package # an object of class mermodlmertest m <- lmer(informed.liking ~ Gender+Information+Product +(1 Consumer), data=ham) # gives summary of lmer object. The same as of class mermod but with # additional p-values calculated based on Satterthwate s approximations summary(m) # anova table the same as of class mermod but with additional F statistics and # and denominator degrees of freedom and # p-values calculated based on Satterthwaite s approximations anova(m) # anova table the same as of class mermod but with additional F statistics and # denominator degrees of freedom and # p-values calculated based on Kenward-Roger s approximations if(requirenamespace("pbkrtest", quietly = TRUE)) anova(m, ddf = "Kenward-Roger") # anova table of class mermod anova(m, ddf="lme4") # backward elimination of non-significant effects of model m st <- step(m) plot(st) anova-methods Methods for function anova in package lmertest Methods for Function anova in Package lmertest
4 4 anova-methods Usage ## S4 method for signature mermodlmertest anova(object,..., ddf="satterthwaite", type=3) Arguments object object of class "mermodlmertest"... object of class "mermodlmertest". Then the model comparison statistisc will be calculated ddf type References By default the Satterthwaite s approximation to degrees of freedom is calculated. If ddf="kenward-roger", then the Kenward-Roger s approximation is calculated using KRmodcomp function from pbkrtest package. If ddf="lme4" then the anova table that comes from lme4 package is returned. type of hypothesis to be tested. Could be type=3 or type=2 or type = 1 (The definition comes from SAS theory) SAS Technical Report R Tests of Hypotheses in Fixed-Effects Linear Models Copyright (C) (SAS Institute Inc., Cary, NC, USA) Goodnight, J.H General Linear Models Procedure (S.A.S. Institute, Inc.) Schaalje G.B., McBride J.B., Fellingham G.W Adequacy of approximations to distributions of test Statistics in complex mixed linear models #import lmertest package m.ham <- lmer(informed.liking ~ Product*Information*Gender + (1 Consumer), data = ham) # type 3 anova table with denominator degrees of freedom # calculated based on Satterthwaite s approximation anova(m.ham) # type 1 anova table with denominator degrees of freedom # calculated based on Satterthwaite s approximation anova(m.ham, type = 1) # type3 anova table with additional F statistics and denominator degrees of freedom # calculated based on Kenward-Roger s approximation if(require(pbkrtest)) anova(m.ham, ddf = "Kenward-Roger")
5 carrots 5 # anova table, that is returned by lme4 package anova(m.ham, ddf = "lme4") carrots Consumer preference mapping of carrots In a consumer study 103 consumers scored their preference of 12 danish carrot types on a scale from 1 to 7. Moreover the consumers scored the degree of sweetness, bitterness and crispiness in the products. The carrots were harvested in autumn 1996 and tested in march In addition to the consumer survey, the carrot products were evaluated by a trained panel of tasters, the sensory panel, with respect to a number of sensory (taste, odour and texture) properties. Since usually a high number of (correlated) properties(variables) are used, in this case 14, it is a common procedure to use a few, often 2, combined variables that contain as much of the information in the sensory variables as possible. This is achieved by extracting the first two principal components in a principal components analysis(pca) on the product-by-property panel average data matrix. In this data set the variables for the first two principal components are named (sens1 and sens2). Usage carrots Format Source Consumer factor with 103 levels: numbering identifying consumers Frequency factor with 5 levels; "How often do you eat carrots?" 1: once a week or more, 2: once every two weeks, 3: once every three weeks, 4: at least once month, 5: less than once a month Gender factor with 2 levels. 1: male, 2:female Age factor with 4 levels. 1: less than 25 years, 2: years, 3: years, 4 more than 61 years Homesize factor with two levels. Number of persons in the household. 1: 1 or 2 persons, 2: 3 or more persons Work factor with 7 levels. different types of employment. 1: unskilled worker(no education), 2: skilled worker(with education), 3: office worker, 4: housewife (or man), 5: independent businessman/ self-employment, 6: student, 7: retired Income factor with 4 levels. 1: <150000, 2: , 3: , 4: > Per Bruun Brockhoff, The Royal Veterinary and Agricultural University, Denmark.
6 6 difflsmeans #import lme4 package and lmertest package m.carrots <- lmer(preference ~ sens2 + Homesize +(1+sens2 Consumer), data=carrots) # only elimination of the random part is required. #approximation of ddf is Satterthwaite step(m.carrots, reduce.random = FALSE) difflsmeans Calculates Differences of Least Squares Means and Confidence Intervals for the factors of a fixed part of mixed effects model of lmer object. Produces a data frame which resembles to what SAS software gives in proc mixed statement. The approximation for degrees of freedom is Satterthwaite s. Usage difflsmeans(model, test.effs=null,...) Arguments model test.effs linear mixed effects model (lmer object). charachter vector specyfying the names of terms to be tested. If NULL all the terms are tested.... other potential arguments. Value Produces Differences of Least Squares Means (population means) table with p-values and Confidence intervals. Author(s) Alexandra Kuznetsova, Per Bruun Brockhoff, Rune Haubo Bojesen Christensen See Also lsmeans, step, rand
7 ham 7 #import lme4 package and lmertest package #specify lmer model m1 <- lmer(informed.liking ~ Gender*Information +(1 Consumer), data=ham) #calculate least squares means for interaction Gender:Information difflsmeans(m1, test.effs="gender:information") #import TVbo data from lmertest package data(tvbo) m <- lmer(coloursaturation ~ TVset*Picture + (1 Assessor), data=tvbo) plot(difflsmeans(m, test.effs="tvset")) ham Conjoint study of dry cured ham One of the purposes of the study was to investigate the effect of information given to the consumers measured in hedonic liking for the hams. Two of the hams were Spanish and two were Norwegian, each origin representing different salt levels and different aging time. The information about origin was given in such way that both true and false information was given. essentially a 4*2 design with 4 samples and 2 information levels. A total of 81 Consumers participated in the study. Usage ham Format Consumer factor with 81 levels: numbering identifying consumers Product factor with four levels Informed.liking numeric: hedonic liking for the products Information factor with two levels Gender factor with two levels (gender) Age numeric: age of Consumer References "Alternative methods for combining design variables and consumer preference with information about attitudes and demographics in conjoint analysis". T. Naes, V.Lengard, S. Bolling Johansen, M. Hersleth
8 8 lmer #import lmertest package m <- lmer(informed.liking ~ Product*Information*Gender + (1 Product:Consumer), data=ham) #anova table with p-values with Satterthwaite s approximation for denominator #degrees of freedom anova(m) #analysis of random and fixed parts and post hoc #analysis of Product and Information effects step(m, reduce.random=false, reduce.fixed=false, test.effs=c("product", "Information")) lmer Fit Linear Mixed-Effects Models Fit a linear mixed model Details This lmer function is an overloaded function of lmer (mermod class from lme4 package). Value An object of class "mermodlmertest" See Also mermodlmertest class ## linear mixed models fm1 <- lmer(reaction ~ Days + (Days Subject), sleepstudy) fm2 <- lmer(reaction ~ Days + (1 Subject) + (0+Days Subject), sleepstudy) # anova table the same as of class mermod but with additional F statistics and # p-values calculated based on Satterthwaite s approximations anova(fm1)
9 lsmeans 9 # anova table the same as of class mermod but with additional F statistics and # p-values calculated based on Kenward-Roger s approximations if(requirenamespace("pbkrtest", quietly = TRUE)) anova(fm1, ddf="kenward-roger") # anova table the same as of class mermod anova(fm1, ddf="lme4") # gives summary of mermodlmertest class. The same as of class mermod but with # additional p-values calculated based on Satterthwate s approximations summary(fm1) ## multiple comparisons statistics. The one from lme4 package anova(fm1, fm2) lsmeans Calculates Least Squares Means and Confidence Intervals for the factors of a fixed part of mixed effects model of lmer object. Usage Produces a data frame which resembles to what SAS software gives in proc mixed statement. The approximation of degrees of freedom is Satterthwate s. lsmeans(model, test.effs = NULL,...) Arguments Value Note model test.effs linear mixed effects model (lmer object). charachter vector specyfying the names of terms to be tested. If NULL all the terms are tested.... other potential arguments. Produces Least Squares Means (population means) table with p-values and Confidence intervals. For construction of the contrast matrix popmatrix function from doby package was used.
10 10 mermodlmertest-class Author(s) Alexandra Kuznetsova, Per Bruun Brockhoff, Rune Haubo Bojesen Christensen References doby package, gplots package See Also step, rand, difflsmeans #import lme4 package and lmertest package #specify lmer model m1 <- lmer(informed.liking ~ Gender*Information +(1 Consumer), data=ham) #calculate least squares means for interaction Gender:Information lsmeans(m1, test.effs="gender:information") #import TVbo data from lmertest package data(tvbo) m <- lmer(coloursaturation ~ TVset*Picture + (1 Assessor), data=tvbo) plot(lsmeans(m)) lsmeans(m, test.effs="tvset") mermodlmertest-class Mixed Model Representations The mermodlmertest contains mermod class of lme4 package and overloads anova and summary functions. Objects from the Class Objects can be created via the lmer functions. See Also lmer()
11 rand 11 (m <- lmer(reaction ~ Days + (1 Subject) + (0+Days Subject), data = sleepstudy)) # type 3 anova table with denominator degrees of freedom # calculated based on Satterthwaite s approximation anova(m) # type 1 anova table with denominator degrees of freedom # calculated based on Satterthwaite s approximation anova(m, type=1) # type3 anova table with additional F statistics and denominator degrees of freedom # calculated based on Kenward-Roger s approximation if(requirenamespace("pbkrtest", quietly = TRUE)) anova(m, ddf="kenward-roger") # anova table, that is returned by lme4 package anova(m, ddf="lme4") # summary of mermodlmertest object. Returns the same as mermod object but with an #additional column of p values for the t test. summary(m) rand Performs likelihood ratio test on random effects of linear mixed effects model. Usage Returns a data frame with values of Chi square statistics and corresponding p-values of likelihood ratio tests. rand(model,...) Arguments model linear mixed effects model (lmer object).... other potential arguments.
12 12 step Details The columns of the data are: Chisq: The value of the chi square statistics Chi Df: The degrees of freedom for the test p.value: The p-value of the likelihood ratio test for the effect Value Produces a data frame with LR tests for the random terms. Author(s) Alexandra Kuznetsova, Per Bruun Brockhoff, Rune Haubo Bojesen Christensen See Also step, lsmeans, difflsmeans #import lme4 package and lmertest package #lmer model with correlation between intercept and slopes #in the random part m <- lmer(preference ~ sens2+homesize+(1+sens2 Consumer), data=carrots) # table with p-values for the random effects rand(m) step Performs backward elimination of non-significant effects of linear mixed effects model: performs automatic backward elimination of all effects of linear mixed effect model. First backward elimination of the random part is performed following by backward elimination of the fixed part. Finally LSMEANS (population means) and differences of LSMEANS for the fixed part of the model are calculated and the final model is provided. The p-values for the fixed effects are calculated from F test based on Sattethwaite s or Kenward-Roger approximation), p-values for the random effects are based on likelihood ratio test. All analysis may be performed on lmer object of lme4 package.
13 step 13 Usage step(model, ddf = "Satterthwaite", type = 3, alpha.random = 0.1, alpha.fixed = 0.05, reduce.fixed = TRUE, reduce.random = TRUE, fixed.calc = TRUE, lsmeans.calc = TRUE, difflsmeans.calc = TRUE, test.effs = NULL, keep.effs = NULL,...) Arguments model ddf type alpha.random alpha.fixed reduce.fixed reduce.random fixed.calc linear mixed effects model (lmer object). approximation for denominator degrees of freedom. By default Satterthwaite s approximation. ddf="kenward-roger"" calculates Kenward-Roger approximation type of hypothesis to be tested (SAS notation). Either type=1 or type=3. significance level for elimination of the random part (for LRT test) significance level for elimination of the fixed part (for F test and t-test for least squares means) logical for whether the reduction of the fixed part is required logical for whether the reduction of the random part is required logical for whether the calculation of the table for fixed effects is needed. If FALSE then only the analysis of random effects is done lsmeans.calc logical for whether the calculation of LSMEANS(population means) is required difflsmeans.calc logical for whether the calculation of differences of LSMEANS is required test.effs keep.effs charachter vector specifying the names of terms to be tested in LSMEANS. If NULL all the terms are tested. If lsmeans.calc==false then LSMEANS are not calculated. charachter vector specifying the names of terms to be kept in the model even if being non-significant... other potential arguments. Details Elimination of all effects is done one at a time. Elimination of the fixed part is done by the principle of marginality that is: the highest order interactions are tested first: if they are significant, the lower order effects are not tested for significance. The step function of lmertest overrides the one from stats package for lm objects. So if the lmertest is attached and one wants to call step fof lm object, then needs to use stats::step Value rand.table anova.table data frame with value of Chi square statistics, p-values for the likelihood ratio test for random effects data frame with tests for whether the model fixed terms are significant (Analysis of Variance)
14 14 step lsmeans.table Least Squares Means data frame with p-values and Confidence intervals diffs.lsmeans.table Differences of Least Squares Means data frame with p-values and Confidence intervals model Final model - object of merlmertest(contains mer class) or gls (after all the required reduction has been performed) Note For the random coefficient models: in the random part if correlation is present between slope and intercept, then the simplified model will contain just an intercept. That is if the random part of the initial model is (1+c f), then this model is compared to (1 f) by using LRT. If there are multiple slopes, then the the slope with the highest p-value (and higher then alpha level) is eliminated. That is if the random part of the initial model has the following form (1+c1+c2 f), then two simplified models are constracted and compared to the initial one: the first one has (1+c1 f) in the random part and the second one has: (1+c2 f). Author(s) Alexandra Kuznetsova, Per Bruun Brockhoff, Rune Haubo Bojesen Christensen See Also rand, lsmeans, difflsmeans #import lme4 package and lmertest package m <- lmer(informed.liking ~ Product*Information*Gender+ (1 Consumer) + (1 Product:Consumer), data=ham) #elimination of non-significant effects s <- step(m) #plot of post-hoc analysis of the final model plot(s) m <- lmer(coloursaturation ~ TVset*Picture+ (1 Assessor)+(1 Assessor:TVset), data=tvbo) step(m, keep.effs = "Assessor")
15 summary-methods 15 summary-methods Methods for Function summary in Package lmertest Methods for function summary in package lmertest Methods signature(object = "mermodlmertest",ddf = "Satterthwaite",...) summary of the results of linear mixed effects model fitting of object. Returns the same output as summary of "mermod" class but with additional columns with the names "df", "t value" and "Pr(>t)" representing degrees of freedom, t-statistics and p-values respectively calculated based on Satterthwaite s or Kenward-Roger s approximations. summary (fm1 <- lmer(reaction ~ Days + (Days Subject), sleepstudy)) ## will give you an additional column with p values for the t test summary(fm1) ##using Kenward-Roger approximations to degrees of freedom if(require(pbkrtest)) summary(fm1, ddf="kenward-roger") #will give the summary of lme4 package summary(fm1, ddf="lme4") TVbo TV dataset Usage The TVbo dataset comes from Bang and Olufsen company. The main purpose was to test products, specified by two attributes Picture and TVset. 15 different response variables (characteristics of the product) were assessed by trained panel list. TVbo
16 16 TVbo Format Source Assessor factor: numbering identifying assessors TVset factor: attribute of the product Picture factor: attribute of the product 15 Characteristics of the product numeric variables: Coloursaturation, Colourbalance, Noise, Depth, Sharpness, Lightlevel, Contrast, Sharpnessofmovement, Flickeringstationary, Flickeringmovement, Distortion, Dimglasseffect, Cutting, Flossyedges, Elasticeffect Bang and Olufsen company #import lme4 package and lmertest package m <- lmer(coloursaturation ~ TVset*Picture+ (1 Assessor)+(1 Assessor:TVset), data=tvbo) step(m, test.effs="tvset", reduce.fixed=false, reduce.random=true)
17 Index Topic classes mermodlmertest-class, 10 Topic datasets carrots, 5 ham, 7 TVbo, 15 Topic methods anova-methods, 3 lmer, 8 summary-methods, 15 Topic models lmer, 8 summary.mermodlmertest (summary-methods), 15 TVbo, 15 anova,any-method (anova-methods), 3 anova,mermodlmertest-method (anova-methods), 3 anova-methods, 3 anova.mermodlmertest (anova-methods), 3 carrots, 5 difflsmeans, 6, 10, 12, 14 ham, 7 lmer, 8, 10 lmertest (lmertest-package), 2 lmertest-package, 2 lsmeans, 6, 9, 12, 14 mermodlmertest, 8 mermodlmertest-class, 10 rand, 6, 10, 11, 14 step, 6, 10, 12, 12 summary, 15 summary,mermodlmertest-method (summary-methods), 15 summary-methods, 15 17
DEPARTMENT OF PSYCHOLOGY UNIVERSITY OF LANCASTER MSC IN PSYCHOLOGICAL RESEARCH METHODS ANALYSING AND INTERPRETING DATA 2 PART 1 WEEK 9
DEPARTMENT OF PSYCHOLOGY UNIVERSITY OF LANCASTER MSC IN PSYCHOLOGICAL RESEARCH METHODS ANALYSING AND INTERPRETING DATA 2 PART 1 WEEK 9 Analysis of covariance and multiple regression So far in this course,
1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96
1 Final Review 2 Review 2.1 CI 1-propZint Scenario 1 A TV manufacturer claims in its warranty brochure that in the past not more than 10 percent of its TV sets needed any repair during the first two years
STATISTICA Formula Guide: Logistic Regression. Table of Contents
: Table of Contents... 1 Overview of Model... 1 Dispersion... 2 Parameterization... 3 Sigma-Restricted Model... 3 Overparameterized Model... 4 Reference Coding... 4 Model Summary (Summary Tab)... 5 Summary
Package dsstatsclient
Maintainer Author Version 4.1.0 License GPL-3 Package dsstatsclient Title DataSHIELD client site stattistical functions August 20, 2015 DataSHIELD client site
Package empiricalfdr.deseq2
Type Package Package empiricalfdr.deseq2 May 27, 2015 Title Simulation-Based False Discovery Rate in RNA-Seq Version 1.0.3 Date 2015-05-26 Author Mikhail V. Matz Maintainer Mikhail V. Matz
COMPARISONS OF CUSTOMER LOYALTY: PUBLIC & PRIVATE INSURANCE COMPANIES.
277 CHAPTER VI COMPARISONS OF CUSTOMER LOYALTY: PUBLIC & PRIVATE INSURANCE COMPANIES. This chapter contains a full discussion of customer loyalty comparisons between private and public insurance companies
Chapter 5 Analysis of variance SPSS Analysis of variance
Chapter 5 Analysis of variance SPSS Analysis of variance Data file used: gss.sav How to get there: Analyze Compare Means One-way ANOVA To test the null hypothesis that several population means are equal,
xtmixed & denominator degrees of freedom: myth or magic
xtmixed & denominator degrees of freedom: myth or magic 2011 Chicago Stata Conference Phil Ender UCLA Statistical Consulting Group July 2011 Phil Ender xtmixed & denominator degrees of freedom: myth or
Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression
Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression Objectives: To perform a hypothesis test concerning the slope of a least squares line To recognize that testing for a
Multiple Linear Regression
Multiple Linear Regression A regression with two or more explanatory variables is called a multiple regression. Rather than modeling the mean response as a straight line, as in simple regression, it is
Regression step-by-step using Microsoft Excel
Step 1: Regression step-by-step using Microsoft Excel Notes prepared by Pamela Peterson Drake, James Madison University Type the data into the spreadsheet The example used throughout this How to is a regression
Package dunn.test. January 6, 2016
Version 1.3.2 Date 2016-01-06 Package dunn.test January 6, 2016 Title Dunn's Test of Multiple Comparisons Using Rank Sums Author Alexis Dinno Maintainer Alexis Dinno
Package retrosheet. April 13, 2015
Type Package Package retrosheet April 13, 2015 Title Import Professional Baseball Data from 'Retrosheet' Version 1.0.2 Date 2015-03-17 Maintainer Richard Scriven A collection of tools
Generalized Linear Models
Generalized Linear Models We have previously worked with regression models where the response variable is quantitative and normally distributed. Now we turn our attention to two types of models where the
Section 13, Part 1 ANOVA. Analysis Of Variance
Section 13, Part 1 ANOVA Analysis Of Variance Course Overview So far in this course we ve covered: Descriptive statistics Summary statistics Tables and Graphs Probability Probability Rules Probability
Simple Linear Regression Inference
Simple Linear Regression Inference 1 Inference requirements The Normality assumption of the stochastic term e is needed for inference even if it is not a OLS requirement. Therefore we have: Interpretation
Statistical Functions in Excel
Statistical Functions in Excel There are many statistical functions in Excel. Moreover, there are other functions that are not specified as statistical functions that are helpful in some statistical analyses.
data visualization and regression
data visualization and regression Sepal.Length 4.5 5.0 5.5 6.0 6.5 7.0 7.5 8.0 4.5 5.0 5.5 6.0 6.5 7.0 7.5 8.0 I. setosa I. versicolor I. virginica I. setosa I. versicolor I. virginica Species Species
Final Exam Practice Problem Answers
Final Exam Practice Problem Answers The following data set consists of data gathered from 77 popular breakfast cereals. The variables in the data set are as follows: Brand: The brand name of the cereal
Calculating P-Values. Parkland College. Isela Guerra Parkland College. Recommended Citation
Parkland College A with Honors Projects Honors Program 2014 Calculating P-Values Isela Guerra Parkland College Recommended Citation Guerra, Isela, "Calculating P-Values" (2014). A with Honors Projects.
Linear Mixed-Effects Modeling in SPSS: An Introduction to the MIXED Procedure
Technical report Linear Mixed-Effects Modeling in SPSS: An Introduction to the MIXED Procedure Table of contents Introduction................................................................ 1 Data preparation
One-Way Analysis of Variance (ANOVA) Example Problem
One-Way Analysis of Variance (ANOVA) Example Problem Introduction Analysis of Variance (ANOVA) is a hypothesis-testing technique used to test the equality of two or more population (or treatment) means
SPSS Resources. 1. See website (readings) for SPSS tutorial & Stats handout
Analyzing Data SPSS Resources 1. See website (readings) for SPSS tutorial & Stats handout Don t have your own copy of SPSS? 1. Use the libraries to analyze your data 2. Download a trial version of SPSS
Chapter 23 Inferences About Means
Chapter 23 Inferences About Means Chapter 23 - Inferences About Means 391 Chapter 23 Solutions to Class Examples 1. See Class Example 1. 2. We want to know if the mean battery lifespan exceeds the 300-minute
Package TSfame. February 15, 2013
Package TSfame February 15, 2013 Version 2012.8-1 Title TSdbi extensions for fame Description TSfame provides a fame interface for TSdbi. Comprehensive examples of all the TS* packages is provided in the
An Introduction to Statistics Course (ECOE 1302) Spring Semester 2011 Chapter 10- TWO-SAMPLE TESTS
The Islamic University of Gaza Faculty of Commerce Department of Economics and Political Sciences An Introduction to Statistics Course (ECOE 130) Spring Semester 011 Chapter 10- TWO-SAMPLE TESTS Practice
Package sendmailr. February 20, 2015
Version 1.2-1 Title send email using R Package sendmailr February 20, 2015 Package contains a simple SMTP client which provides a portable solution for sending email, including attachment, from within
" Y. Notation and Equations for Regression Lecture 11/4. Notation:
Notation: Notation and Equations for Regression Lecture 11/4 m: The number of predictor variables in a regression Xi: One of multiple predictor variables. The subscript i represents any number from 1 through
Independent t- Test (Comparing Two Means)
Independent t- Test (Comparing Two Means) The objectives of this lesson are to learn: the definition/purpose of independent t-test when to use the independent t-test the use of SPSS to complete an independent
2013 MBA Jump Start Program. Statistics Module Part 3
2013 MBA Jump Start Program Module 1: Statistics Thomas Gilbert Part 3 Statistics Module Part 3 Hypothesis Testing (Inference) Regressions 2 1 Making an Investment Decision A researcher in your firm just
An Introduction to Statistical Tests for the SAS Programmer Sara Beck, Fred Hutchinson Cancer Research Center, Seattle, WA
ABSTRACT An Introduction to Statistical Tests for the SAS Programmer Sara Beck, Fred Hutchinson Cancer Research Center, Seattle, WA Often SAS Programmers find themselves in situations where performing
C. The null hypothesis is not rejected when the alternative hypothesis is true. A. population parameters.
Sample Multiple Choice Questions for the material since Midterm 2. Sample questions from Midterms and 2 are also representative of questions that may appear on the final exam.. A randomly selected sample
General Method: Difference of Means. 3. Calculate df: either Welch-Satterthwaite formula or simpler df = min(n 1, n 2 ) 1.
General Method: Difference of Means 1. Calculate x 1, x 2, SE 1, SE 2. 2. Combined SE = SE1 2 + SE2 2. ASSUMES INDEPENDENT SAMPLES. 3. Calculate df: either Welch-Satterthwaite formula or simpler df = min(n
SAS Software to Fit the Generalized Linear Model
SAS Software to Fit the Generalized Linear Model Gordon Johnston, SAS Institute Inc., Cary, NC Abstract In recent years, the class of generalized linear models has gained popularity as a statistical modeling
Study Guide for the Final Exam
Study Guide for the Final Exam When studying, remember that the computational portion of the exam will only involve new material (covered after the second midterm), that material from Exam 1 will make
Logistic (RLOGIST) Example #1
Logistic (RLOGIST) Example #1 SUDAAN Statements and Results Illustrated EFFECTS RFORMAT, RLABEL REFLEVEL EXP option on MODEL statement Hosmer-Lemeshow Test Input Data Set(s): BRFWGT.SAS7bdat Example Using
HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION
HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION HOD 2990 10 November 2010 Lecture Background This is a lightning speed summary of introductory statistical methods for senior undergraduate
Package uptimerobot. October 22, 2015
Type Package Version 1.0.0 Title Access the UptimeRobot Ping API Package uptimerobot October 22, 2015 Provide a set of wrappers to call all the endpoints of UptimeRobot API which includes various kind
MIXED MODEL ANALYSIS USING R
Research Methods Group MIXED MODEL ANALYSIS USING R Using Case Study 4 from the BIOMETRICS & RESEARCH METHODS TEACHING RESOURCE BY Stephen Mbunzi & Sonal Nagda www.ilri.org/rmg www.worldagroforestrycentre.org/rmg
Descriptive Statistics
Descriptive Statistics Primer Descriptive statistics Central tendency Variation Relative position Relationships Calculating descriptive statistics Descriptive Statistics Purpose to describe or summarize
Difference of Means and ANOVA Problems
Difference of Means and Problems Dr. Tom Ilvento FREC 408 Accounting Firm Study An accounting firm specializes in auditing the financial records of large firm It is interested in evaluating its fee structure,particularly
Main Effects and Interactions
Main Effects & Interactions page 1 Main Effects and Interactions So far, we ve talked about studies in which there is just one independent variable, such as violence of television program. You might randomly
Rachel J. Goldberg, Guideline Research/Atlanta, Inc., Duluth, GA
PROC FACTOR: How to Interpret the Output of a Real-World Example Rachel J. Goldberg, Guideline Research/Atlanta, Inc., Duluth, GA ABSTRACT THE METHOD This paper summarizes a real-world example of a factor
Week TSX Index 1 8480 2 8470 3 8475 4 8510 5 8500 6 8480
1) The S & P/TSX Composite Index is based on common stock prices of a group of Canadian stocks. The weekly close level of the TSX for 6 weeks are shown: Week TSX Index 1 8480 2 8470 3 8475 4 8510 5 8500
12: Analysis of Variance. Introduction
1: Analysis of Variance Introduction EDA Hypothesis Test Introduction In Chapter 8 and again in Chapter 11 we compared means from two independent groups. In this chapter we extend the procedure to consider
Part 2: Analysis of Relationship Between Two Variables
Part 2: Analysis of Relationship Between Two Variables Linear Regression Linear correlation Significance Tests Multiple regression Linear Regression Y = a X + b Dependent Variable Independent Variable
Package EstCRM. July 13, 2015
Version 1.4 Date 2015-7-11 Package EstCRM July 13, 2015 Title Calibrating Parameters for the Samejima's Continuous IRT Model Author Cengiz Zopluoglu Maintainer Cengiz Zopluoglu
Factors affecting online sales
Factors affecting online sales Table of contents Summary... 1 Research questions... 1 The dataset... 2 Descriptive statistics: The exploratory stage... 3 Confidence intervals... 4 Hypothesis tests... 4
Statistical Models in R
Statistical Models in R Some Examples Steven Buechler Department of Mathematics 276B Hurley Hall; 1-6233 Fall, 2007 Outline Statistical Models Linear Models in R Regression Regression analysis is the appropriate
Regression Analysis: A Complete Example
Regression Analysis: A Complete Example This section works out an example that includes all the topics we have discussed so far in this chapter. A complete example of regression analysis. PhotoDisc, Inc./Getty
MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS
MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS MSR = Mean Regression Sum of Squares MSE = Mean Squared Error RSS = Regression Sum of Squares SSE = Sum of Squared Errors/Residuals α = Level of Significance
Package changepoint. R topics documented: November 9, 2015. Type Package Title Methods for Changepoint Detection Version 2.
Type Package Title Methods for Changepoint Detection Version 2.2 Date 2015-10-23 Package changepoint November 9, 2015 Maintainer Rebecca Killick Implements various mainstream and
THE FIRST SET OF EXAMPLES USE SUMMARY DATA... EXAMPLE 7.2, PAGE 227 DESCRIBES A PROBLEM AND A HYPOTHESIS TEST IS PERFORMED IN EXAMPLE 7.
THERE ARE TWO WAYS TO DO HYPOTHESIS TESTING WITH STATCRUNCH: WITH SUMMARY DATA (AS IN EXAMPLE 7.17, PAGE 236, IN ROSNER); WITH THE ORIGINAL DATA (AS IN EXAMPLE 8.5, PAGE 301 IN ROSNER THAT USES DATA FROM
Basic Statistical and Modeling Procedures Using SAS
Basic Statistical and Modeling Procedures Using SAS One-Sample Tests The statistical procedures illustrated in this handout use two datasets. The first, Pulse, has information collected in a classroom
INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA)
INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA) As with other parametric statistics, we begin the one-way ANOVA with a test of the underlying assumptions. Our first assumption is the assumption of
Individual Growth Analysis Using PROC MIXED Maribeth Johnson, Medical College of Georgia, Augusta, GA
Paper P-702 Individual Growth Analysis Using PROC MIXED Maribeth Johnson, Medical College of Georgia, Augusta, GA ABSTRACT Individual growth models are designed for exploring longitudinal data on individuals
Premaster Statistics Tutorial 4 Full solutions
Premaster Statistics Tutorial 4 Full solutions Regression analysis Q1 (based on Doane & Seward, 4/E, 12.7) a. Interpret the slope of the fitted regression = 125,000 + 150. b. What is the prediction for
Package ATE. R topics documented: February 19, 2015. Type Package Title Inference for Average Treatment Effects using Covariate. balancing.
Package ATE February 19, 2015 Type Package Title Inference for Average Treatment Effects using Covariate Balancing Version 0.2.0 Date 2015-02-16 Author Asad Haris and Gary Chan
Examining Differences (Comparing Groups) using SPSS Inferential statistics (Part I) Dwayne Devonish
Examining Differences (Comparing Groups) using SPSS Inferential statistics (Part I) Dwayne Devonish Statistics Statistics are quantitative methods of describing, analysing, and drawing inferences (conclusions)
This can dilute the significance of a departure from the null hypothesis. We can focus the test on departures of a particular form.
One-Degree-of-Freedom Tests Test for group occasion interactions has (number of groups 1) number of occasions 1) degrees of freedom. This can dilute the significance of a departure from the null hypothesis.
UNDERSTANDING THE TWO-WAY ANOVA
UNDERSTANDING THE e have seen how the one-way ANOVA can be used to compare two or more sample means in studies involving a single independent variable. This can be extended to two independent variables
Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means
Lesson : Comparison of Population Means Part c: Comparison of Two- Means Welcome to lesson c. This third lesson of lesson will discuss hypothesis testing for two independent means. Steps in Hypothesis
Basic Statistics and Data Analysis for Health Researchers from Foreign Countries
Basic Statistics and Data Analysis for Health Researchers from Foreign Countries Volkert Siersma [email protected] The Research Unit for General Practice in Copenhagen Dias 1 Content Quantifying association
An introduction to IBM SPSS Statistics
An introduction to IBM SPSS Statistics Contents 1 Introduction... 1 2 Entering your data... 2 3 Preparing your data for analysis... 10 4 Exploring your data: univariate analysis... 14 5 Generating descriptive
HLM software has been one of the leading statistical packages for hierarchical
Introductory Guide to HLM With HLM 7 Software 3 G. David Garson HLM software has been one of the leading statistical packages for hierarchical linear modeling due to the pioneering work of Stephen Raudenbush
The Chi-Square Test. STAT E-50 Introduction to Statistics
STAT -50 Introduction to Statistics The Chi-Square Test The Chi-square test is a nonparametric test that is used to compare experimental results with theoretical models. That is, we will be comparing observed
Lecture 11: Confidence intervals and model comparison for linear regression; analysis of variance
Lecture 11: Confidence intervals and model comparison for linear regression; analysis of variance 14 November 2007 1 Confidence intervals and hypothesis testing for linear regression Just as there was
Chapter 7: Simple linear regression Learning Objectives
Chapter 7: Simple linear regression Learning Objectives Reading: Section 7.1 of OpenIntro Statistics Video: Correlation vs. causation, YouTube (2:19) Video: Intro to Linear Regression, YouTube (5:18) -
A Primer on Forecasting Business Performance
A Primer on Forecasting Business Performance There are two common approaches to forecasting: qualitative and quantitative. Qualitative forecasting methods are important when historical data is not available.
Introduction to Hierarchical Linear Modeling with R
Introduction to Hierarchical Linear Modeling with R 5 10 15 20 25 5 10 15 20 25 13 14 15 16 40 30 20 10 0 40 30 20 10 9 10 11 12-10 SCIENCE 0-10 5 6 7 8 40 30 20 10 0-10 40 1 2 3 4 30 20 10 0-10 5 10 15
Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm
Mgt 540 Research Methods Data Analysis 1 Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm http://web.utk.edu/~dap/random/order/start.htm
Chapter 2 Probability Topics SPSS T tests
Chapter 2 Probability Topics SPSS T tests Data file used: gss.sav In the lecture about chapter 2, only the One-Sample T test has been explained. In this handout, we also give the SPSS methods to perform
Two-Sample T-Tests Allowing Unequal Variance (Enter Difference)
Chapter 45 Two-Sample T-Tests Allowing Unequal Variance (Enter Difference) Introduction This procedure provides sample size and power calculations for one- or two-sided two-sample t-tests when no assumption
Logistic (RLOGIST) Example #3
Logistic (RLOGIST) Example #3 SUDAAN Statements and Results Illustrated PREDMARG (predicted marginal proportion) CONDMARG (conditional marginal proportion) PRED_EFF pairwise comparison COND_EFF pairwise
Package hazus. February 20, 2015
Package hazus February 20, 2015 Title Damage functions from FEMA's HAZUS software for use in modeling financial losses from natural disasters Damage Functions (DFs), also known as Vulnerability Functions,
MULTIPLE LINEAR REGRESSION ANALYSIS USING MICROSOFT EXCEL. by Michael L. Orlov Chemistry Department, Oregon State University (1996)
MULTIPLE LINEAR REGRESSION ANALYSIS USING MICROSOFT EXCEL by Michael L. Orlov Chemistry Department, Oregon State University (1996) INTRODUCTION In modern science, regression analysis is a necessary part
CHAPTER 11 CHI-SQUARE AND F DISTRIBUTIONS
CHAPTER 11 CHI-SQUARE AND F DISTRIBUTIONS CHI-SQUARE TESTS OF INDEPENDENCE (SECTION 11.1 OF UNDERSTANDABLE STATISTICS) In chi-square tests of independence we use the hypotheses. H0: The variables are independent
Elementary Statistics Sample Exam #3
Elementary Statistics Sample Exam #3 Instructions. No books or telephones. Only the supplied calculators are allowed. The exam is worth 100 points. 1. A chi square goodness of fit test is considered to
Statistiek II. John Nerbonne. October 1, 2010. Dept of Information Science [email protected]
Dept of Information Science [email protected] October 1, 2010 Course outline 1 One-way ANOVA. 2 Factorial ANOVA. 3 Repeated measures ANOVA. 4 Correlation and regression. 5 Multiple regression. 6 Logistic
SAS Syntax and Output for Data Manipulation:
Psyc 944 Example 5 page 1 Practice with Fixed and Random Effects of Time in Modeling Within-Person Change The models for this example come from Hoffman (in preparation) chapter 5. We will be examining
Package neuralnet. February 20, 2015
Type Package Title Training of neural networks Version 1.32 Date 2012-09-19 Package neuralnet February 20, 2015 Author Stefan Fritsch, Frauke Guenther , following earlier work
t Tests in Excel The Excel Statistical Master By Mark Harmon Copyright 2011 Mark Harmon
t-tests in Excel By Mark Harmon Copyright 2011 Mark Harmon No part of this publication may be reproduced or distributed without the express permission of the author. [email protected] www.excelmasterseries.com
Bivariate Statistics Session 2: Measuring Associations Chi-Square Test
Bivariate Statistics Session 2: Measuring Associations Chi-Square Test Features Of The Chi-Square Statistic The chi-square test is non-parametric. That is, it makes no assumptions about the distribution
One-Way ANOVA using SPSS 11.0. SPSS ANOVA procedures found in the Compare Means analyses. Specifically, we demonstrate
1 One-Way ANOVA using SPSS 11.0 This section covers steps for testing the difference between three or more group means using the SPSS ANOVA procedures found in the Compare Means analyses. Specifically,
ANOVA ANOVA. Two-Way ANOVA. One-Way ANOVA. When to use ANOVA ANOVA. Analysis of Variance. Chapter 16. A procedure for comparing more than two groups
ANOVA ANOVA Analysis of Variance Chapter 6 A procedure for comparing more than two groups independent variable: smoking status non-smoking one pack a day > two packs a day dependent variable: number of
Package bigdata. R topics documented: February 19, 2015
Type Package Title Big Data Analytics Version 0.1 Date 2011-02-12 Author Han Liu, Tuo Zhao Maintainer Han Liu Depends glmnet, Matrix, lattice, Package bigdata February 19, 2015 The
Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1)
Spring 204 Class 9: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.) Big Picture: More than Two Samples In Chapter 7: We looked at quantitative variables and compared the
E(y i ) = x T i β. yield of the refined product as a percentage of crude specific gravity vapour pressure ASTM 10% point ASTM end point in degrees F
Random and Mixed Effects Models (Ch. 10) Random effects models are very useful when the observations are sampled in a highly structured way. The basic idea is that the error associated with any linear,
Section Format Day Begin End Building Rm# Instructor. 001 Lecture Tue 6:45 PM 8:40 PM Silver 401 Ballerini
NEW YORK UNIVERSITY ROBERT F. WAGNER GRADUATE SCHOOL OF PUBLIC SERVICE Course Syllabus Spring 2016 Statistical Methods for Public, Nonprofit, and Health Management Section Format Day Begin End Building
Package ERP. December 14, 2015
Type Package Package ERP December 14, 2015 Title Significance Analysis of Event-Related Potentials Data Version 1.1 Date 2015-12-11 Author David Causeur (Agrocampus, Rennes, France) and Ching-Fan Sheu
Outline. Topic 4 - Analysis of Variance Approach to Regression. Partitioning Sums of Squares. Total Sum of Squares. Partitioning sums of squares
Topic 4 - Analysis of Variance Approach to Regression Outline Partitioning sums of squares Degrees of freedom Expected mean squares General linear test - Fall 2013 R 2 and the coefficient of correlation
Package missforest. February 20, 2015
Type Package Package missforest February 20, 2015 Title Nonparametric Missing Value Imputation using Random Forest Version 1.4 Date 2013-12-31 Author Daniel J. Stekhoven Maintainer
Data Analysis Tools. Tools for Summarizing Data
Data Analysis Tools This section of the notes is meant to introduce you to many of the tools that are provided by Excel under the Tools/Data Analysis menu item. If your computer does not have that tool
POLYNOMIAL AND MULTIPLE REGRESSION. Polynomial regression used to fit nonlinear (e.g. curvilinear) data into a least squares linear regression model.
Polynomial Regression POLYNOMIAL AND MULTIPLE REGRESSION Polynomial regression used to fit nonlinear (e.g. curvilinear) data into a least squares linear regression model. It is a form of linear regression
ISyE 2028 Basic Statistical Methods - Fall 2015 Bonus Project: Big Data Analytics Final Report: Time spent on social media
ISyE 2028 Basic Statistical Methods - Fall 2015 Bonus Project: Big Data Analytics Final Report: Time spent on social media Abstract: The growth of social media is astounding and part of that success was
This chapter discusses some of the basic concepts in inferential statistics.
Research Skills for Psychology Majors: Everything You Need to Know to Get Started Inferential Statistics: Basic Concepts This chapter discusses some of the basic concepts in inferential statistics. Details
Psychology 205: Research Methods in Psychology
Psychology 205: Research Methods in Psychology Using R to analyze the data for study 2 Department of Psychology Northwestern University Evanston, Illinois USA November, 2012 1 / 38 Outline 1 Getting ready
Multivariate Analysis of Variance (MANOVA): I. Theory
Gregory Carey, 1998 MANOVA: I - 1 Multivariate Analysis of Variance (MANOVA): I. Theory Introduction The purpose of a t test is to assess the likelihood that the means for two groups are sampled from the
NCSS Statistical Software
Chapter 06 Introduction This procedure provides several reports for the comparison of two distributions, including confidence intervals for the difference in means, two-sample t-tests, the z-test, the
