Use of deviance statistics for comparing models



Similar documents
HLM software has been one of the leading statistical packages for hierarchical

Introduction to Multilevel Modeling Using HLM 6. By ATS Statistical Consulting Group

This can dilute the significance of a departure from the null hypothesis. We can focus the test on departures of a particular form.

The Latent Variable Growth Model In Practice. Individual Development Over Time

Introduction to General and Generalized Linear Models

Overview Classes Logistic regression (5) 19-3 Building and applying logistic regression (6) 26-3 Generalizations of logistic regression (7)

An introduction to hierarchical linear modeling

SAS Software to Fit the Generalized Linear Model

Indices of Model Fit STRUCTURAL EQUATION MODELING 2013

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression

Survey, Statistics and Psychometrics Core Research Facility University of Nebraska-Lincoln. Log-Rank Test for More Than Two Groups

Specifications for this HLM2 run

I L L I N O I S UNIVERSITY OF ILLINOIS AT URBANA-CHAMPAIGN

Logit Models for Binary Data

STATISTICA Formula Guide: Logistic Regression. Table of Contents

E(y i ) = x T i β. yield of the refined product as a percentage of crude specific gravity vapour pressure ASTM 10% point ASTM end point in degrees F

Regression Analysis: A Complete Example

Poisson Models for Count Data

Simple Linear Regression Inference

Linear Mixed-Effects Modeling in SPSS: An Introduction to the MIXED Procedure

We extended the additive model in two variables to the interaction model by adding a third term to the equation.

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

The Basic Two-Level Regression Model

Introduction to Longitudinal Data Analysis

Introducing the Multilevel Model for Change

Longitudinal Data Analysis

xtmixed & denominator degrees of freedom: myth or magic

Choosing number of stages of multistage model for cancer modeling: SOP for contractor and IRIS analysts

Overview of Methods for Analyzing Cluster-Correlated Data. Garrett M. Fitzmaurice

Qualitative vs Quantitative research & Multilevel methods

Mihaela Ene, Elizabeth A. Leighton, Genine L. Blue, Bethany A. Bell University of South Carolina

Multivariate Normal Distribution

Appendix 1: Estimation of the two-variable saturated model in SPSS, Stata and R using the Netherlands 1973 example data

Chapter 5 Analysis of variance SPSS Analysis of variance

Linear Models for Continuous Data

Introduction to Data Analysis in Hierarchical Linear Models

Analyzing Ranking and Rating Data from Participatory On- Farm Trials

SAS Syntax and Output for Data Manipulation:

Electronic Thesis and Dissertations UCLA

Random effects and nested models with SAS

1.5 Oneway Analysis of Variance

Comparison of Estimation Methods for Complex Survey Data Analysis

Using PROC MIXED in Hierarchical Linear Models: Examples from two- and three-level school-effect analysis, and meta-analysis research

Ordinal Regression. Chapter

Generalized Linear Models

Individual Growth Analysis Using PROC MIXED Maribeth Johnson, Medical College of Georgia, Augusta, GA

Longitudinal Data Analyses Using Linear Mixed Models in SPSS: Concepts, Procedures and Illustrations

Technical report. in SPSS AN INTRODUCTION TO THE MIXED PROCEDURE

Statistical Models in R

Hierarchical Linear Models

AN ILLUSTRATION OF MULTILEVEL MODELS FOR ORDINAL RESPONSE DATA

CHAPTER 12 EXAMPLES: MONTE CARLO SIMULATION STUDIES

Profile analysis is the multivariate equivalent of repeated measures or mixed ANOVA. Profile analysis is most commonly used in two cases:

Analysis of Variance. MINITAB User s Guide 2 3-1

Basic Statistics and Data Analysis for Health Researchers from Foreign Countries

Introduction to Hierarchical Linear Modeling with R

Multivariate Logistic Regression

Factors affecting online sales

Chapter 6: Multivariate Cointegration Analysis

Introduction to Structural Equation Modeling (SEM) Day 4: November 29, 2012

11. Analysis of Case-control Studies Logistic Regression

Calculating P-Values. Parkland College. Isela Guerra Parkland College. Recommended Citation

One-Way Analysis of Variance (ANOVA) Example Problem

Chapter 15. Mixed Models Overview. A flexible approach to correlated data.

ANOVA. February 12, 2015

Chapter 4: Vector Autoregressive Models

Time-Series Regression and Generalized Least Squares in R

Statistical Models in R

GLM I An Introduction to Generalized Linear Models

A Mean-Variance Framework for Tests of Asset Pricing Models

Chapter 13 Introduction to Linear Regression and Correlation Analysis

Power and sample size in multilevel modeling

Module 5: Multiple Regression Analysis

Multilevel Modeling Tutorial. Using SAS, Stata, HLM, R, SPSS, and Mplus

Analyzing Intervention Effects: Multilevel & Other Approaches. Simplest Intervention Design. Better Design: Have Pretest

CHAPTER 9 EXAMPLES: MULTILEVEL MODELING WITH COMPLEX SURVEY DATA

Comparing Multiple Proportions, Test of Independence and Goodness of Fit

Multiple Linear Regression

Binary Logistic Regression

lavaan: an R package for structural equation modeling

An Introduction to Modeling Longitudinal Data

DEPARTMENT OF PSYCHOLOGY UNIVERSITY OF LANCASTER MSC IN PSYCHOLOGICAL RESEARCH METHODS ANALYSING AND INTERPRETING DATA 2 PART 1 WEEK 9

Supplementary PROCESS Documentation

13. Poisson Regression Analysis

Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1)

Εισαγωγή στην πολυεπίπεδη μοντελοποίηση δεδομένων με το HLM. Βασίλης Παυλόπουλος Τμήμα Ψυχολογίας, Πανεπιστήμιο Αθηνών

Some Essential Statistics The Lure of Statistics

The Chi-Square Test. STAT E-50 Introduction to Statistics

CHAPTER 3 EXAMPLES: REGRESSION AND PATH ANALYSIS

Introduction to Regression and Data Analysis

Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means

Lecture 6: Poisson regression

Stat 5303 (Oehlert): Tukey One Degree of Freedom 1

Lecture 18: Logistic Regression Continued

VI. Introduction to Logistic Regression

Transcription:

A likelihood-ratio test can be used under full ML. The use of such a test is a quite general principle for statistical testing. In hierarchical linear models, the deviance test is mostly used for multiparameter tests and for tests about the random part of the model. This approach is based on estimating two models, and. It is assumed that excludes the effects hypothesized to be null, while these effects are included in. For each model, a deviance statistic, equal to -2 ln L for that model, is computed. The deviance can be regarded as a measure of lack of fit between model and data. In general, the larger the deviance, the poorer the fit to the data. Note that the value of a deviance could be negative. For the interpretation of a negative deviance, please see Log likelihood values and negative deviances. The deviance in usually not interpreted directly, but rather compared to deviance(s) from other models fitted to the same data. The difference between the deviances and has a large-sample chi-square distribution with degrees of freedom equal to the difference in the number of parameters estimated. Large values of the chi-square statistic are taken as evidence that the null is implausible. An approximate rule for evaluating the chi-square is given by Snijders & Bosker, p 49: "...a difference in deviance between models should be at least twice as large as the difference in the number of parameters estimated". These authors also give two examples of using deviance statistics (p 89 of the text). Note, however, the implications of using other estimation procedures: REML in the linear twolevel models and PQL in the HGLM models. Example 1: Multivariate tests of variance-covariance components specification Below we compare the variance-covariance components of two Intercept-and-Slope-as-Outcome models. One treats as random and the other does not. The two models are shown below, with the deviance statistics of the model. Model 1: Level-1 Model Y = B0 + B1*(SES) + R Level-2 Model B0 = G00 + G01*(SECTOR) + G02*(MEANSES) + U0 B1 = G10 + G11*(SECTOR) + G12*(MEANSES) Statistics for current covariance components model -------------------------------------------------- Deviance = 46502.952743 Number of estimated parameters = 2 Model 2: Level-1 Model 5

hlm6 Y = B0 + B1*(SES) + R Level-2 Model B0 = G00 + G01*(SECTOR) + G02*(MEANSES) + U0 B1 = G10 + G11*(SECTOR) + G12*(MEANSES) + U1 Statistics for current covariance components model -------------------------------------------------- Deviance = 46501.875643 Number of estimated parameters = 4 The multi-parameter test for the variance-covariance components is implemented by a likelihoodratio test, which compares the deviance statistic of a restricted model with a more general alternative. The user must input the value of the deviance statistic and related degrees of freedom for the alternative specification. To specify a multivariate test of variance-covariance components Enter the deviance and the number of parameters in the Deviance Statistics box and in the Number of Parameters box respectively (the two numbers for our example are 46501.875643 and 4). The HLM2 output associated with this test appears below. Variance-Covariance components test ----------------------------------- Chi-square statistic = 1.07714 Number of degrees of freedom = 2 P-value = >.500 The test result indicates that a model that constrains the residual variance for the SES slopes, B1, to zero appears appropriate. Example 2: Testing of variance-covariance structures Consider the HMLM model from the Sustaining Effects (EG) data, we can assess the variancecovariance structures of the model using the deviance statistics of the following three models: Model 1. Unrestricted model. 6

Model 2. Random effects model with homogeneous level-1 variance. Where Model 3. Random effects model with heterogeneous level-1 variance. Where 7

hlm6 The output for the fitness of the three models is as follows. Summary of Model Fit Model Number of Deviance Parameters 1. Unrestricted 26 15960.50733 2. Homogeneous sigma-squared 9 16326.23111 3. Heterogeneous sigma-squared 14 16140.15892 Model Comparison Chi-square df P-value Model 1 vs Model 2 365.72378 17 0.000 Model 1 vs Model 3 179.65159 12 0.000 Model 2 vs Model 3 186.07219 5 0.000 The first test of the model comparison tests, comparing Model 1 vs Model 2, is testing for the The chi-square test statistic of 365.72 with 17 degree of freedom gives a p-value of 0.000, indicating that the null is implausible, and we can conclude that model 1 is a better fitted model than model 2. The second test of the model comparison tests, comparing Model 1 vs Model 3, is testing for the 8

The chi-square test statistic of 179.65 with 12 degree of freedom has a p-value of 0.000, indicating that the null does not seem tenable, and we can conclude that model 1 rather than model 3 is a better model. The third test of the model comparison tests, comparing Model 2 vs Model 3, is testing for the The chi-square test statistic of 186.07 with 5 degree of freedom gives a p-value of 0.000, indicating again that the null is implausible, and we can conclude that model 3 is significantly different from model 2. Summarizing the three tests above, we can draw a conclusion with respect to the variancecovariance structure of the HMLM model: The model with unrestricted variance-covariance structure for the combined level-1 and level-2 model is the best fitted model for the given data. 9