Module 5: Introduction to Multilevel Modelling SPSS Practicals Chris Charlton 1 Centre for Multilevel Modelling

Similar documents
Module 5: Introduction to Multilevel Modelling. R Practical. Introduction to the Scottish Youth Cohort Trends Dataset. Contents

Module 14: Missing Data Stata Practical

Independent t- Test (Comparing Two Means)

Longitudinal Data Analyses Using Linear Mixed Models in SPSS: Concepts, Procedures and Illustrations

SPSS Guide: Regression Analysis

Chapter 5 Analysis of variance SPSS Analysis of variance

The 3-Level HLM Model

Analyzing Intervention Effects: Multilevel & Other Approaches. Simplest Intervention Design. Better Design: Have Pretest

Chapter 15. Mixed Models Overview. A flexible approach to correlated data.

Family economics data: total family income, expenditures, debt status for 50 families in two cohorts (A and B), annual records from

Binary Logistic Regression

Chapter Seven. Multiple regression An introduction to multiple regression Performing a multiple regression on SPSS

DEPARTMENT OF PSYCHOLOGY UNIVERSITY OF LANCASTER MSC IN PSYCHOLOGICAL RESEARCH METHODS ANALYSING AND INTERPRETING DATA 2 PART 1 WEEK 9

ASSIGNMENT 4 PREDICTIVE MODELING AND GAINS CHARTS

Using SAS Proc Mixed for the Analysis of Clustered Longitudinal Data

5. Multiple regression

Ordinal Regression. Chapter

Technical report. in SPSS AN INTRODUCTION TO THE MIXED PROCEDURE

HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION

Examining Differences (Comparing Groups) using SPSS Inferential statistics (Part I) Dwayne Devonish

One-Way Analysis of Variance

Chapter 2 Probability Topics SPSS T tests

I L L I N O I S UNIVERSITY OF ILLINOIS AT URBANA-CHAMPAIGN

Appendix 1: Estimation of the two-variable saturated model in SPSS, Stata and R using the Netherlands 1973 example data

How to set the main menu of STATA to default factory settings standards

LOGISTIC REGRESSION ANALYSIS

This chapter will demonstrate how to perform multiple linear regression with IBM SPSS

Simple linear regression

Final Exam Practice Problem Answers

Generalized Linear Models

SAS Syntax and Output for Data Manipulation:

Linear Models in STATA and ANOVA

TIMSS 2011 User Guide for the International Database

Handling attrition and non-response in longitudinal data

Multiple Regression in SPSS This example shows you how to perform multiple regression. The basic command is regression : linear.

Chapter 13 Introduction to Linear Regression and Correlation Analysis

ABSORBENCY OF PAPER TOWELS

Basic Statistical and Modeling Procedures Using SAS

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression

Multinomial and Ordinal Logistic Regression

Oklahoma School Grades: Hiding Poor Achievement

Schools Value-added Information System Technical Manual

Regression step-by-step using Microsoft Excel

Multiple Regression. Page 24

STATISTICA Formula Guide: Logistic Regression. Table of Contents

Multilevel Modeling Tutorial. Using SAS, Stata, HLM, R, SPSS, and Mplus

A Basic Guide to Analyzing Individual Scores Data with SPSS

Overview Classes Logistic regression (5) 19-3 Building and applying logistic regression (6) 26-3 Generalizations of logistic regression (7)

II. DISTRIBUTIONS distribution normal distribution. standard scores

Good luck! BUSINESS STATISTICS FINAL EXAM INSTRUCTIONS. Name:

2: Entering Data. Open SPSS and follow along as your read this description.

2. Linear regression with multiple regressors

SCHOOL OF HEALTH AND HUMAN SCIENCES DON T FORGET TO RECODE YOUR MISSING VALUES

SPSS Resources. 1. See website (readings) for SPSS tutorial & Stats handout

Linear Mixed-Effects Modeling in SPSS: An Introduction to the MIXED Procedure

Two Related Samples t Test

Introduction to Longitudinal Data Analysis

Using An Ordered Logistic Regression Model with SAS Vartanian: SW 541

Directions for using SPSS

1996 DfEE study of Value Added for year olds in England

Multivariate Logistic Regression

Answer: C. The strength of a correlation does not change if units change by a linear transformation such as: Fahrenheit = 32 + (5/9) * Centigrade

Module 4 - Multiple Logistic Regression

HLM software has been one of the leading statistical packages for hierarchical

Chapter 7: Simple linear regression Learning Objectives

SPSS Guide How-to, Tips, Tricks & Statistical Techniques

Impact of Enrollment Timing on Performance: The Case of Students Studying the First Course in Accounting

CHAPTER 9 EXAMPLES: MULTILEVEL MODELING WITH COMPLEX SURVEY DATA

Introduction to Analysis of Variance (ANOVA) Limitations of the t-test

The Chi-Square Test. STAT E-50 Introduction to Statistics

College Readiness LINKING STUDY

Part 2: Analysis of Relationship Between Two Variables

A guide to level 3 value added in 2015 school and college performance tables

MULTIPLE REGRESSION WITH CATEGORICAL DATA

Individual Growth Analysis Using PROC MIXED Maribeth Johnson, Medical College of Georgia, Augusta, GA

Using R for Linear Regression

Specifications for this HLM2 run

Introduction to Multilevel Modeling Using HLM 6. By ATS Statistical Consulting Group

Doing Multiple Regression with SPSS. In this case, we are interested in the Analyze options so we choose that menu. If gives us a number of choices:

Chapter 23. Inferences for Regression

SPSS Notes (SPSS version 15.0)

Basic Statistics and Data Analysis for Health Researchers from Foreign Countries

Introduction to Data Analysis in Hierarchical Linear Models

Simple Linear Regression, Scatterplots, and Bivariate Correlation

Simple Linear Regression Inference

Premaster Statistics Tutorial 4 Full solutions

2013 MBA Jump Start Program. Statistics Module Part 3

January 26, 2009 The Faculty Center for Teaching and Learning

Logistic Regression.

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

" Y. Notation and Equations for Regression Lecture 11/4. Notation:

Directions for Frequency Tables, Histograms, and Frequency Bar Charts

Using Excel for Statistics Tips and Warnings

Introduction to Hierarchical Linear Modeling with R

Multiple Linear Regression

Electronic Thesis and Dissertations UCLA

S P S S Statistical Package for the Social Sciences

Online Appendix The Earnings Returns to Graduating with Honors - Evidence from Law Graduates

5. Correlation. Open HeightWeight.sav. Take a moment to review the data file.

Transcription:

Module 5: Introduction to Multilevel Modelling SPSS Practicals Chris Charlton 1 Centre for Multilevel Modelling Pre-requisites Modules 1-4 Contents P5.1 Comparing Groups using Multilevel Modelling... 4 P5.1.1 A multilevel model of attainment with school effects... 4 P5.1.2 Examining school effects (residuals)... 8 P5.2 Adding Student-level Explanatory Variables: Random Intercept Models... 14 P5.3 Allowing for Different Slopes across Schools: Random Slope Models... 19 P5.3.1 Testing for random slopes... 20 P5.3.2 Interpretation of random cohort effects across schools... 21 P5.3.3 Examining intercept and slope residuals for schools... 21 P5.3.4 Between-school variance as a function of cohort... 25 P5.3.5 Adding a random coefficient for gender (dichotomous x )... 28 P5.3.6 Adding a random coefficient for social class (categorical x)... 30 P5.4 Adding Level 2 Explanatory Variables... 38 P5.4.1 Contextual effects... 42 P5.4.2 Cross-level interactions... 47 P5.5 Complex Level 1 Variation... 51 P5.5.1 Within-school variance as a function of cohort (continuous X)... 51 P5.5.2 Within-school variance as a function of gender (dichotomous X)... 51 P5.5.3 Within-school variance as a function of cohort and gender... 55 P5.6 Appendices... 56 P5.6.1 Appendix 1... 56 P5.6.2 Appendix 2... 63 1 This SPSS practical is adapted from the corresponding MLwiN practical: Steele, F. (2008) Module 5: Introduction to Multilevel Modelling. LEMMA VLE, Centre for Multilevel Modelling. Accessed at http://www.cmm.bris.ac.uk/lemma/course/view.php?id=13. Centre for Multilevel Modelling 2014 1

Some of the sections within this module have online quizzes for you to test your understanding. To find the quizzes: EXAMPLE From within the LEMMA learning environment Go down to the section for Module 5: Introduction to Multilevel Modelling Click "5.1 Comparing Groups Using Multilevel Modelling" to open Lesson 5.1 Click Q 1 to open the first question Introduction to the Scottish Youth Cohort Trends Dataset You will be analysing data from the Scottish School Leavers Survey (SSLS), a nationally representative survey of young people. We use data from seven cohorts of young people collected in the first sweep of the study, carried out at the end of the final year of compulsory schooling (aged 16-17) when most sample members had taken Standard grades 2. In the practical for Module 3 on multiple regression, we considered the predictors of attainment in Standard grades (subject-based examinations, typically taken in up to eight subjects). In this practical, we extend the (previously single-level) multiple regression analysis to allow for dependency of exam scores within schools and to examine the extent of between-school variation in attainment. We also consider the effects on attainment of several school-level predictors. The dependent variable is a total attainment score. Each subject is graded on a scale from 1 (highest) to 7 (lowest) and, after recoding so that a high numeric value denotes a high grade, the total is taken across subjects. The analysis dataset contains the student-level variables considered in Module 3 together with a school identifier and three school-level variables: Variable name CASEID SCHOOLID SCORE COHORT90 Description and codes Anonymised student identifier Anonymised school identifier Point score calculated from awards in Standard grades taken at age 16. Scores range from 0 to 75, with a higher score indicating a higher attainment The sample includes the following cohorts: 1984, 1986, 1988, 1990, 1996 and 1998. The 2 We are grateful to Linda Croxford (Centre for Educational Sociology, University of Edinburgh) for providing us with these data. The dataset was constructed as part of an ESRC-funded project on Education and Youth Transitions in England, Wales and Scotland 1984-2002. Further analyses of the data can be found in Croxford, L. and Raffe, D. (2006) Education Markets and Social Class Inequality: A Comparison of Trends in England, Scotland and Wales. In R. Teese (Ed.) Inequality Revisited. Berlin: Springer. Centre for Multilevel Modelling 2014 2

FEMALE SCLASS SCHTYPE SCHURBAN SCHDENOM COHORT90 variable is calculated by subtracting 1990 from each value. Thus values range from - 6 (corresponding to 1984) to 8 (1998), with 1990 coded as zero Sex of student (1=female, 0=male) Social class, defined as the higher class of mother or father (1=managerial and professional, 2=intermediate, 3=working, 4=unclassified). School type, distinguishing independent schools from state-funded schools (1=independent, 0=state-funded) Urban-rural classification of school (1=urban, 0=town or rural) School denomination (1=Roman Catholic, 0=non-denominational) There are 33988 students in 508 schools. Open the worksheet to From within the LEMMA Learning Environment Go to Module 5: Introduction to Multilevel Modelling, and scroll down to SPSS Datafiles Click 5.1.sav You will see the Data Editor Window, after switching to Variable View: Centre for Multilevel Modelling 2014 3

P5.1 Comparing Groups using Multilevel Modelling P5.1.1 A multilevel model of attainment with school effects We will start with the simplest multilevel model which allows for school effects on attainment, but without explanatory variables. This null model may be written (5.1) where is the attainment of student i in school j, β 0 is the overall mean across schools, is the effect of school j on attainment, and is a student level residual. The school effects, which we will also refer to as school (or level 2) residuals, are assumed to follow a normal distribution with mean zero and variance. To run this model in SPSS we will use the MIXED command. Immediately after MIXED there is the response variable. The /FIXED option specifies the variables to include in the fixed part, in this case this is empty as the intercept is automatically included, and there are no other predictors in the fixed part. The /METHOD option allows selection of the estimation method, in this case maximum likelihood. /PRINT=SOLUTION requests that the parameter estimates are displayed after fitting. The /RANDOM option specifies which variables are included in the random part, as well as specifying the variable that defines the grouping (here, schools). Finally the /SAVE option specifies that we want to save the fixed-part prediction (FIXPRED), the prediction including random effects (PRED) and the standard errors of the prediction (SEPRED) back into the data set. Note that unlike software such as MLwiN the data does not have to be sorted a specific way in order to fit the model. MIXED score /FIXED= /METHOD=ML /PRINT=SOLUTION /RANDOM=INTERCEPT SUBJECT(schoolid) /SAVE=FIXPRED PRED SEPRED. Alternatively: Analyze>Mixed Models>Linear Add schoolid to Subjects Click Continue Add score to Dependent Variable Click Random Tick Include intercept Add schoolid to Combinations Click Continue Click Estimation Tick Maximum Likelihood (ML) Click Continue Centre for Multilevel Modelling 2014 4

Click Statistics Tick Parameter estimates Click Continue Click Save Tick Predicted values under Fixed Predicted Values Tick Predicted values under Predicted Values & Residuals Tick Standard errors under Predicted Values & Residuals Click Continue Click OK We will see the following tables of results (the Model Dimension and Type III Tests of Fixed Effects tables are not of interest for this analysis, so we will omit them from subsequent results): Model Dimension a Number of Covariance Number of Subject Levels Structure Parameters Variables Fixed Effects Intercept 1 1 Random Effects Intercept 1 Variance 1 schoolid Components Residual 1 Total 2 3 Information Criteria a -2 Log Likelihood 286539.064 Akaike's Information 286545.064 Criterion (AIC) Hurvich and Tsai's Criterion 286545.065 (AICC) Bozdogan's Criterion (CAIC) 286573.365 Schwarz's Bayesian 286570.365 Criterion (BIC) The information criteria are displayed in smaller-is-better forms. Type III Tests of Fixed Effects a Source Numerator df Denominator df F Sig. Centre for Multilevel Modelling 2014 5

Intercept 1 451.533 6861.108.000 Estimates of Fixed Effects a 95% Confidence Interval Parameter Estimate Std. Error df t Sig. Lower Bound Upper Bound Intercept 30.600595.369430 451.533 82.832.000 29.874579 31.326612 Estimates of Covariance Parameters a Parameter Estimate Std. Error Residual 258.357255 1.997715 Intercept [subject = Variance 61.024127 4.475315 schoolid] The overall mean attainment (across schools) is estimated as 30.60. The mean for school j is estimated as 30.60 +, where is the school residual which we will estimate in a moment. A school with > 0 has a mean that is higher than average, while < 0 for a below-average school. (We will obtain confidence intervals for residuals to determine whether differences from the overall mean can be considered real or due to chance.) Partitioning variance The between-school (level 2) variance in attainment is estimated as = 61.02, and the within-school between-student (level 1) variance (labelled Residual in the output) is estimated as =258.36. Thus the total variance is 61.02+258.36=319.38. The variance partition coefficient (VPC) is 61.02/319.38 = 0.19, which indicates that 19% of the variance in attainment can be attributed to differences between schools. Note, however, that we have not accounted for intake ability (measured by exams taken on entry to secondary school) so the school effects are not valueadded. Previous studies have found that between-school variance in progress, i.e. after accounting for intake attainment, is typically close to 10%. Testing for school effects Centre for Multilevel Modelling 2014 6

This document is only the first few pages of the full version. To see the complete document please go to learning materials and register: http://www.cmm.bris.ac.uk/lemma The course is completely free. We ask for a few details about yourself for our research purposes only. We will not give any details to any other organisation unless it is with your express permission.