Introduction to Data Analysis in Hierarchical Linear Models

Similar documents
Εισαγωγή στην πολυεπίπεδη μοντελοποίηση δεδομένων με το HLM. Βασίλης Παυλόπουλος Τμήμα Ψυχολογίας, Πανεπιστήμιο Αθηνών

Introduction to Longitudinal Data Analysis

Introduction to Multilevel Modeling Using HLM 6. By ATS Statistical Consulting Group

Illustration (and the use of HLM)

HLM software has been one of the leading statistical packages for hierarchical

An introduction to hierarchical linear modeling

Analyzing Intervention Effects: Multilevel & Other Approaches. Simplest Intervention Design. Better Design: Have Pretest

Introduction to Hierarchical Linear Modeling with R

Moderation. Moderation

The Latent Variable Growth Model In Practice. Individual Development Over Time

The Basic Two-Level Regression Model

Introducing the Multilevel Model for Change

DEPARTMENT OF PSYCHOLOGY UNIVERSITY OF LANCASTER MSC IN PSYCHOLOGICAL RESEARCH METHODS ANALYSING AND INTERPRETING DATA 2 PART 1 WEEK 9

Longitudinal Meta-analysis

College Readiness LINKING STUDY

1/27/2013. PSY 512: Advanced Statistics for Psychological and Behavioral Research 2

Profile analysis is the multivariate equivalent of repeated measures or mixed ANOVA. Profile analysis is most commonly used in two cases:

Specifications for this HLM2 run

The importance of graphing the data: Anscombe s regression examples

SPSS Guide: Regression Analysis

Course Objective This course is designed to give you a basic understanding of how to run regressions in SPSS.

AN ILLUSTRATION OF MULTILEVEL MODELS FOR ORDINAL RESPONSE DATA

Chapter 13 Introduction to Linear Regression and Correlation Analysis

Hierarchical Linear Models

RMTD 404 Introduction to Linear Models

CHAPTER 9 EXAMPLES: MULTILEVEL MODELING WITH COMPLEX SURVEY DATA

Multiple Regression in SPSS This example shows you how to perform multiple regression. The basic command is regression : linear.

Electronic Thesis and Dissertations UCLA

DISCRIMINANT FUNCTION ANALYSIS (DA)

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

Qualitative vs Quantitative research & Multilevel methods

Introduction to Regression and Data Analysis

Section Format Day Begin End Building Rm# Instructor. 001 Lecture Tue 6:45 PM 8:40 PM Silver 401 Ballerini

UNDERSTANDING ANALYSIS OF COVARIANCE (ANCOVA)

The 3-Level HLM Model

Multivariate Analysis of Variance (MANOVA)

Overview of Methods for Analyzing Cluster-Correlated Data. Garrett M. Fitzmaurice

Univariate Regression

Silvermine House Steenberg Office Park, Tokai 7945 Cape Town, South Africa Telephone:

Simple Linear Regression, Scatterplots, and Bivariate Correlation

Multilevel Models for Social Network Analysis

Longitudinal Data Analyses Using Linear Mixed Models in SPSS: Concepts, Procedures and Illustrations

Multilevel Modeling Tutorial. Using SAS, Stata, HLM, R, SPSS, and Mplus

Power and sample size in multilevel modeling

Statistical Techniques Utilized in Analyzing TIMSS Databases in Science Education from 1996 to 2012: A Methodological Review

The University of North Carolina at Chapel Hill School of Social Work. SOWO 919 Longitudinal and Multilevel Analysis Spring Semester, 2009

Assignments Analysis of Longitudinal data: a multilevel approach

Simple linear regression

Example: Boats and Manatees

Mihaela Ene, Elizabeth A. Leighton, Genine L. Blue, Bethany A. Bell University of South Carolina

Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics.

" Y. Notation and Equations for Regression Lecture 11/4. Notation:

MISSING DATA TECHNIQUES WITH SAS. IDRE Statistical Consulting Group

Multiple Regression: What Is It?

Lecture 11: Chapter 5, Section 3 Relationships between Two Quantitative Variables; Correlation

Module 3: Correlation and Covariance

UNDERSTANDING THE TWO-WAY ANOVA

INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA)

Psychology 205: Research Methods in Psychology

Statistical Models in R

Contrast Coding in Multiple Regression Analysis: Strengths, Weaknesses, and Utility of Popular Coding Structures

Section 14 Simple Linear Regression: Introduction to Least Squares Regression

Linear Mixed-Effects Modeling in SPSS: An Introduction to the MIXED Procedure

Analysis of Data. Organizing Data Files in SPSS. Descriptive Statistics

Fairfield Public Schools

HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION

Premaster Statistics Tutorial 4 Full solutions

Using SAS Proc Mixed for the Analysis of Clustered Longitudinal Data

Family economics data: total family income, expenditures, debt status for 50 families in two cohorts (A and B), annual records from

Chapter 1 Introduction. 1.1 Introduction

Multivariate Analysis. Overview

Linda K. Muthén Bengt Muthén. Copyright 2008 Muthén & Muthén Table Of Contents

10. Analysis of Longitudinal Studies Repeat-measures analysis

Assumptions. Assumptions of linear models. Boxplot. Data exploration. Apply to response variable. Apply to error terms from linear model

Additional sources Compilation of sources:

data visualization and regression

Course Text. Required Computing Software. Course Description. Course Objectives. StraighterLine. Business Statistics

Chapter 23. Inferences for Regression

[This document contains corrections to a few typos that were found on the version available through the journal s web page]

Technical Report. Teach for America Teachers Contribution to Student Achievement in Louisiana in Grades 4-9: to

Overview of Factor Analysis

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression

SAS Syntax and Output for Data Manipulation:

Outline: Demand Forecasting

This chapter will demonstrate how to perform multiple linear regression with IBM SPSS

Introduction to proc glm

Use of deviance statistics for comparing models

I L L I N O I S UNIVERSITY OF ILLINOIS AT URBANA-CHAMPAIGN

Simple Predictive Analytics Curtis Seare

Multiple Regression. Page 24

11. Analysis of Case-control Studies Logistic Regression

4. Simple regression. QBUS6840 Predictive Analytics.

Transcription:

Introduction to Data Analysis in Hierarchical Linear Models April 20, 2007 Noah Shamosh & Frank Farach Social Sciences StatLab Yale University

Scope & Prerequisites Strong applied emphasis Focus on HLM software Has special functionality Other options: SPSS, SAS, MLWin, R Familiarity with regression assumed

Road to HLM Happiness Conceptualize model hierarchically Prepare data Import data into HLM Build statistical models Estimate and interpret models Graph models

What is HLM? Hierarchical Linear Model A multilevel statistical model Software program used for such models Deconstructing the name (in reverse) Model: It s a statistical model Linear: The model must be linear in the parameters Hierarchical: Nested data structures are explicitly modeled

When are data hierarchical? When units are grouped at higher units of analysis Such data may be nested within higher levels (i.e., units) of analysis Nesting can occur between subjects Children nested within classrooms Classrooms nested within schools and/or within subjects Repeated observations on the same individuals over time (observations nested within individuals)

Why not use regular regression on nested data? Increased Type I error Model misspecification Miss opportunity to examine potentially interesting contextual questions These problems increase as observations become less independent

Hierarchical Model Conceptualization What kind of hierarchical relations might be present? What factors could I incorporate in my model to reflect this organization?

HLM Caveats Adding levels of nesting increases the complexity of the model exponentially HLM can handle up to three levels Must have several times more lower level observations than upper level observations Parameter estimation uses maximum likelihood instead of least squares

Road to HLM Happiness Conceptualize model hierarchically Prepare data Import data into HLM Build statistical models Estimate and interpret models Graph models

Prep, prep, prep! This is the most labor intensive part of workflow, and is the source of many problems that come to us at the StatLab Two obstacles HLM doesn t do data manipulation or basic data description HLM requires a special data structure Solutions Plan ahead. Do all data screening, variable transformations, exploratory analyses, and assumption-checking beforehand

Data prep: SPSS example 1 Data set: IQ v & language achievement Two files Level 1: dependent variable (language achievement) and other child characteristics (e.g. IQ v ) Level 2: school characteristics (e.g. SES) Children are nested within schools 1 Extensively adapted from Bryk & Raudenbush (2002) and Bauer (2005)

Road to HLM Happiness Conceptualize model hierarchically Prepare data Import data into HLM Build statistical models Estimate and interpret models Graph models

Creating the Multivariate Data Matrix (MDM) Making an MDM file A caveat The procedure Check your summary statistics before building any models (cross-reference) Main window: are all of your variables there?

Road to HLM Happiness Conceptualize model hierarchically Prepare data Import data into HLM Build statistical models Estimate and interpret models Graph models

Build statistical models Basic model: random-effects ANOVA Test for mean group differences in population Between-group vs. total variance Key assumption check of HLM

Random-effects ANOVA Choose outcome variable Terms Toggle Level 2 error term Level 1 (r) vs. Level 2 (u) error terms The Mixed window

Random effects ANOVA Language achievement M1 M2 M3 GM

Road to HLM Happiness Conceptualize model hierarchically Prepare data Import data into HLM Build statistical models Estimate and interpret models Graph models

Random effects ANOVA Results Fixed effects: the intercept Is the grand mean significantly different from zero? Variance components (random effects) Level 2 (U0): significant variability between groups? Level 1 (R): significant variability within groups?

Random effects ANOVA Intraclass correlation (ICC) Proportion of total variance accounted for by between-group differences Level 2 variance component divided by sum of Level 1 and Level 2 variance components Ours is.23; HLM is warranted

Road to HLM Happiness Conceptualize model hierarchically Prepare data Import data into HLM Build statistical models Estimate and interpret models Graph models

Random effects regression Test for relationship between a Level 1 IV and the DV Test whether an IV explains any between groups variance Terms We are assuming a fixed slope

Random effects regression Language achievement IQ

Road to HLM Happiness Conceptualize model hierarchically Prepare data Import data into HLM Build statistical models Estimate and interpret models Graph models

Random effects regression Results Fixed effects Level 1 intercept: Mean of DV where IV is zero Level 1 slope: Change in DV with one unit of change in IV (just like OLS regression) Random effects Intercept: Between-group variance that is not explained by IV Residual variance: Within-group variance that is not explained by DV

Random effects regression Variance accounted for by IV Level 1: Compare residual variance component to random effects ANOVA model (8.0-6.5) / 8.0 =.19 Level 2: Do the same for the random intercept variance component (19.6-9.6) / 19.6 =.51

Fixed slopes Language achievement IQ

Random slopes Language achievement IQ

Random slopes Goal: test whether the IV - DV relationship varies between groups Add only if supported by theory Toggle Level 2b error term In output, look at slope variance component

Slopes as outcomes Goal: test cross level interactions Does the between-group variability in the IV - DV relation vary by a systematic factor? Add Level 2 predictor Terms

Slopes as outcomes Fixed effects For Level 1 intercept Intercept: predicted score on DV at mean value of L-1 IV Slope: Influence of Level 2 IV on DV For Level 1 slope Intercept: Influence of Level 1 IV on DV Slope: Influence of L-2 IV on L-1 IV - DV relation Random effects (same as before)

Road to HLM Happiness Conceptualize model hierarchically Prepare data Import data into HLM Build statistical models Estimate and interpret models Graph models

Graph: Simple slopes Useful for visualizing cross-level interactions Just like simple slope plots in regression Graph Equations > Model graphs Useful for categorical or continuous data

Graph: Level-1 equations Useful for: Visualizing variability in intercepts and slopes Identifying moderators Graph Equations > Level 1 equation graphing

Recommended Reading Bickel, R. (2007). Multilevel analysis for applied research: It's just regression! New York: Guilford Press. Bryk, A. & Raudenbush, S. (2002). Hierarchical Linear Models: Applications and data analysis methods (2nd ed.). Thousand Oaks, CA: Sage. Luke, D. (2004). Multilevel modeling. Thousand Oaks, CA: Sage. Heck, R. H., & Thomas, S. L. (2000). An introduction to multilevel modeling techniques. Lawrence Erlbaum Associates. Kreft, I. & de Leeuw, J. (1998). Introducing multilevel modeling. Sage. Singer, J. D., & Willett, J. B. (2003). Applied Longitudinal Data Analysis: Modeling Change and Event Occurrence. Oxford Univ. Press. (Longitudinal focus)

HLM Resources on the Web UCLA s HLM portal http://statcomp.ats.ucla.edu/mlm Excellent example of analysis http://www.ats.ucla.edu/stat/hlm/seminars/ hlm_mlm/mlm_hlm_seminar.htm