What is discrete longitudinal data analysis?




Ivonne Solis-Trapala, NCRM Lancaster-Warwick Node
ESRC Research Festival 2008

Discrete longitudinal data analysis: the title explained

Discrete variables: variables taking only integer values, for example:
- number of heart attacks (0, 1, 2, ...)
- failure (0) or success (1) on a psychological test item

Longitudinal data: a variable is measured on each of a number of subjects at a number of different time points.

Longitudinal data is not:
- time series data: a single long series of measurements
- multivariate data: a single outcome of two or more different kinds of measurements on each subject

but rather a large number of short time series.

Example of time series data: cardiovascular events during the FIFA World Cup in 2006

Source: Wilbert-Lampen et al. (2008) N. Engl. J. Med., 358(5), 475-483

Example of multivariate data: fair admission process to a university

A = student is admitted (yes/no)
G = student's gender (female/male)
D = department (Mathematics, Medicine, Engineering, Biology)

[Figure: two diagrams relating A, D and G, one for the fair case and one for the unfair case]

Fair: female admission rates are similar to male admission rates at each department.
Unfair: otherwise.

Example of longitudinal data Improving communication skills of oncologists

Example of longitudinal data (cont.): a randomised controlled trial

Time 1: 160 clinicians took part in baseline filming. They were randomised to one of four groups:
- Group A: written feedback and course
- Group B: course only
- Group C: written feedback only
- Group D: control

Course arrangements shown in the trial diagram: 3-day course 1 month after feedback; 3-day course, feedback given during course.

Time 2: filming and assessments 3 months after the course. Doctors in groups C and D were then invited to attend the course.

Time 3: filming and assessments 15 months after the course.

Reference: Fallowfield et al. (2002) The Lancet, 359, 650-656

Example of longitudinal data (cont.): the MIPS data

MIPS = Medical Interaction Process
Data: counts of primary outcomes, i.e. leading questions, expressions of empathy, focused questions
Participants: 160 doctors
Two consultations were filmed for each doctor at Times 1, 2 and 3.

Example of longitudinal data (cont.) Longitudinal performance

Statistical aspects: modelling independent discrete data

Continuous response: linear regression model, e.g. ANOVA
i) Independent observations
ii) Mean of the response is a linear function of some explanatory variables
iii) The variance of the response is constant

Discrete response: generalised linear model, e.g. logistic regression, Poisson regression
i) Independent observations
ii) Mean of the response is associated with a linear function of some explanatory variables through a link function
iii) The variance of the response is a function of the mean
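The contrast between the two sets of assumptions comes down to the link and the variance function. The following is a minimal Python sketch, assuming the usual canonical links (identity, logit, log) and a unit scale for the linear model; the names MODELS and mean_and_variance are illustrative and do not come from the slides.

```python
import math

# Each entry pairs an inverse link (linear predictor -> mean) with a
# variance function (mean -> variance). The linear model's variance is
# constant; it is set to 1.0 here as an assumed unit scale.
MODELS = {
    "linear": (lambda eta: eta, lambda mu: 1.0),
    "logistic": (lambda eta: 1.0 / (1.0 + math.exp(-eta)), lambda mu: mu * (1.0 - mu)),
    "poisson": (lambda eta: math.exp(eta), lambda mu: mu),
}


def mean_and_variance(model, intercept, slope, x):
    """Return (mean, variance) of the response at covariate value x."""
    inverse_link, variance = MODELS[model]
    eta = intercept + slope * x  # linear function of the explanatory variable
    mu = inverse_link(eta)       # mean obtained through the link function
    return mu, variance(mu)      # variance as a function of the mean


for name in MODELS:
    print(name, mean_and_variance(name, 0.0, 0.5, 2.0))
```

For the linear model the variance stays the same whatever x is, whereas for the logistic and Poisson models it changes with the mean.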

Why the model assumptions are important

Consider the following four data sets:

x1    y1      x2    y2      x3    y3      x4    y4
10    8.04    10    9.14    10    7.46    8     6.58
8     6.95    8     8.14    8     6.77    8     5.76
13    7.58    13    8.74    13    12.74   8     7.71
9     8.81    9     8.77    9     7.11    8     8.84
11    8.33    11    9.26    11    7.81    8     8.47
14    9.96    14    8.10    14    8.84    8     7.04
6     7.24    6     6.13    6     6.08    8     5.25
4     4.26    4     3.10    4     5.39    19    12.50
12    10.84   12    9.13    12    8.15    8     5.56
7     4.82    7     7.26    7     6.42    8     7.91
5     5.68    5     4.74    5     5.73    8     6.89

Source: Anscombe, F.J. (1973) The American Statistician, 27, 17-21.

Fitting a linear regression model: same for the four data sets

Dependent variable: y

Coefficient    Estimate   Std. Error   t value   p-value
(Intercept)    3.0001     1.1247       2.67      0.0257
x              0.5001     0.1179       4.24      0.0022

Multiple R-squared: 0.6665
F-statistic: 17.99 on 1 and 9 DF, p-value: 0.002170
Equation of regression line: y = 3 + 0.5x
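The claim that all four data sets give essentially the same fitted line can be checked directly. The sketch below uses plain Python and the closed-form least-squares formulas; the data are Anscombe's values as listed on the previous slide, while the names DATASETS, fit_line and FITS are illustrative.

```python
# Anscombe's four data sets; the first three share the same x values.
X123 = [10, 8, 13, 9, 11, 14, 6, 4, 12, 7, 5]
DATASETS = {
    1: (X123, [8.04, 6.95, 7.58, 8.81, 8.33, 9.96, 7.24, 4.26, 10.84, 4.82, 5.68]),
    2: (X123, [9.14, 8.14, 8.74, 8.77, 9.26, 8.10, 6.13, 3.10, 9.13, 7.26, 4.74]),
    3: (X123, [7.46, 6.77, 12.74, 7.11, 7.81, 8.84, 6.08, 5.39, 8.15, 6.42, 5.73]),
    4: ([8, 8, 8, 8, 8, 8, 8, 19, 8, 8, 8],
        [6.58, 5.76, 7.71, 8.84, 8.47, 7.04, 5.25, 12.50, 5.56, 7.91, 6.89]),
}


def fit_line(x, y):
    """Simple least-squares fit; returns (intercept, slope, R-squared)."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    r_squared = sxy ** 2 / (sxx * syy)
    return intercept, slope, r_squared


FITS = {k: fit_line(x, y) for k, (x, y) in DATASETS.items()}
for k, (b0, b1, r2) in FITS.items():
    print(f"data set {k}: y = {b0:.2f} + {b1:.2f}x, R^2 = {r2:.2f}")
```

The summary statistics agree to two decimal places, which is why the numerical output alone cannot reveal whether the linearity assumption holds.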

Assess the linearity assumption!

[Figure: scatterplots of y1 against x1, y2 against x2, y3 against x3 and y4 against x4; R² = 0.67 in every panel]

Some statistical approaches based on GLM for analysing longitudinal discrete data

Target of inference: mean response vs. mean response of an individual

- Marginal models
- Random effects models
- Transition models

[Figure: P(y=1), from 0 to 1, plotted against x, from -10 to 10, with separate curves for the mean response and for the mean response of an individual]
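The difference between the two targets of inference can be illustrated numerically. The sketch below assumes a random-intercept logistic model, logit P(y=1 | b) = b + x with b drawn from a normal distribution with standard deviation sigma; this model, the value sigma = 3 and the function names are assumptions made for illustration, not details taken from the slides. The population-average curve is obtained by averaging the individual curve over the random intercept.

```python
import math


def expit(eta):
    """Inverse logit."""
    return 1.0 / (1.0 + math.exp(-eta))


def individual_probability(x, b=0.0):
    """P(y=1) for an individual with random intercept b (assumed slope of 1)."""
    return expit(b + x)


def marginal_probability(x, sigma=3.0, n_points=2001):
    """Population-average P(y=1): the individual curve averaged over
    b ~ N(0, sigma^2), using a normalised grid over +/- 6 standard deviations."""
    total = 0.0
    weight_sum = 0.0
    for k in range(n_points):
        z = -6.0 + 12.0 * k / (n_points - 1)
        w = math.exp(-0.5 * z * z)  # unnormalised standard normal density
        total += w * expit(sigma * z + x)
        weight_sum += w
    return total / weight_sum


for x in (-4, -2, 0, 2, 4):
    print(x, round(individual_probability(x), 3), round(marginal_probability(x), 3))
```

Under these assumptions the population-average curve is flatter than the curve for a typical individual (b = 0), so a regression coefficient means something different depending on which target is chosen.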

Example of marginal models: generalised estimating equations (GEE)

- GEE describe the relationship between the response variable and the explanatory variables with a population-average regression model.
- The approach provides consistent regression coefficient estimates even if the correlation structure is mis-specified.

How is GEE implemented?

1. Specify the mean regression
2. Make a plausible guess of the covariance matrix
3. Fit the model
4. Use the residuals to adjust standard errors
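The four steps can be sketched in plain Python for the simplest case. This is an illustration under stated assumptions, not the implementation used in the talk: a Poisson log-linear mean with one covariate, an independence working covariance as the "plausible guess", and a sandwich (robust) variance built from per-subject residual sums. The counts in TOY are made up, and all function names are hypothetical.

```python
import math


def inv2(m):
    """Inverse of a 2x2 matrix [[a, b], [c, d]]."""
    (a, b), (c, d) = m
    det = a * d - b * c
    return [[d / det, -b / det], [-c / det, a / det]]


def matmul2(p, q):
    """Product of two 2x2 matrices."""
    return [[sum(p[i][k] * q[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def gee_poisson_independence(clusters, n_iter=50):
    """clusters: one list of (x, y) pairs per subject.
    Returns (beta, naive_se, robust_se)."""
    # Step 1: mean regression, log E[y] = beta0 + beta1 * x.
    # Step 2: working covariance, independent observations with Var(y) = mean.
    ys = [y for obs in clusters for _, y in obs]
    beta = [math.log(sum(ys) / len(ys)), 0.0]

    def pieces(beta):
        score = [0.0, 0.0]
        bread = [[0.0, 0.0], [0.0, 0.0]]
        meat = [[0.0, 0.0], [0.0, 0.0]]
        for obs in clusters:
            u = [0.0, 0.0]  # this subject's residual-based score contribution
            for x, y in obs:
                mu = math.exp(beta[0] + beta[1] * x)
                d = (1.0, float(x))
                for a in range(2):
                    u[a] += d[a] * (y - mu)
                    for b in range(2):
                        bread[a][b] += d[a] * d[b] * mu
            for a in range(2):
                score[a] += u[a]
                for b in range(2):
                    meat[a][b] += u[a] * u[b]
        return score, bread, meat

    # Step 3: fit the model by Fisher scoring.
    for _ in range(n_iter):
        score, bread, _ = pieces(beta)
        inv = inv2(bread)
        beta = [beta[a] + inv[a][0] * score[0] + inv[a][1] * score[1]
                for a in range(2)]

    # Step 4: use the residuals to adjust the standard errors (sandwich form).
    _, bread, meat = pieces(beta)
    naive = inv2(bread)
    robust = matmul2(matmul2(naive, meat), naive)
    naive_se = [math.sqrt(naive[a][a]) for a in range(2)]
    robust_se = [math.sqrt(robust[a][a]) for a in range(2)]
    return beta, naive_se, robust_se


# Made-up counts for five subjects, each observed at x = 0 and x = 1.
TOY = [[(0, 3), (1, 5)], [(0, 8), (1, 12)], [(0, 1), (1, 1)],
       [(0, 6), (1, 9)], [(0, 2), (1, 4)]]
BETA, NAIVE_SE, ROBUST_SE = gee_poisson_independence(TOY)
print(BETA, NAIVE_SE, ROBUST_SE)
```

The coefficient estimates are the same as those of an ordinary Poisson regression; only the standard errors change, because the robust version groups the residuals by subject rather than treating every observation as independent.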

Can you raed tihs? I cdn uolt blveiee taht I cluod aulaclty uesdnatnrd waht I was rdanieg: the phaonmneal pweor of the hmuan mnid. Aoccdrnig to a rsceearch taem at Cmabrigde Uinervtisy, it deosn t mttaer in waht oredr the ltteers in a wrod are, the olny iprmoatnt tihng is taht the frist and lsat ltteer be in the rghit pclae. The rset can be a taotl mses and you can sitll raed it wouthit a porbelm. Tihs is bcuseae the huamn mnid deos not raed ervey lteter by istlef, but the wrod as a wlohe. Such a cdonition is arppoiatrely cllaed Typoglycemia

Finally...

An example of misleading inferences when standard errors of regression parameter estimates are not adjusted: MIPS data revisited

Robust conditional Poisson regression models comparing T2 (3 months post-course) to T1 (baseline) assessment

Behaviour                     Estimate (β̂c)   Naive SE   Robust SE
Leading questions             -0.30            0.13       0.18
Focused questions             0.23             0.077      0.13
Focused and open questions    0.16             0.067      0.10
Expressions of empathy        0.41             0.14       0.25
Summarising of information    0.054            0.11       0.24
Interruptions                 -0.15            0.30       0.41
Checking understanding        -0.18            0.15       0.22

Reference: Solis-Trapala & Farewell (2005) Biometrical Journal, 47, 1-14
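One way to see how the unadjusted standard errors mislead is to compare test results under each. The sketch below takes the estimates and standard errors from the table and applies a two-sided Wald test at the 5% level; the choice of a Wald test with a normal reference distribution is an assumption made for illustration, and the names ROWS, two_sided_p and significance are hypothetical.

```python
import math

# (behaviour, estimate, naive SE, robust SE), copied from the table above.
ROWS = [
    ("Leading questions", -0.30, 0.13, 0.18),
    ("Focused questions", 0.23, 0.077, 0.13),
    ("Focused and open questions", 0.16, 0.067, 0.10),
    ("Expressions of empathy", 0.41, 0.14, 0.25),
    ("Summarising of information", 0.054, 0.11, 0.24),
    ("Interruptions", -0.15, 0.30, 0.41),
    ("Checking understanding", -0.18, 0.15, 0.22),
]


def two_sided_p(z):
    """Two-sided p-value for a standard normal test statistic."""
    return math.erfc(abs(z) / math.sqrt(2.0))


def significance(rows, alpha=0.05):
    """For each behaviour, report whether the estimate is significant
    using the naive and the robust standard error."""
    out = []
    for name, estimate, naive_se, robust_se in rows:
        p_naive = two_sided_p(estimate / naive_se)
        p_robust = two_sided_p(estimate / robust_se)
        out.append((name, p_naive < alpha, p_robust < alpha))
    return out


RESULTS = significance(ROWS)
for name, naive_sig, robust_sig in RESULTS:
    print(f"{name}: naive significant = {naive_sig}, robust significant = {robust_sig}")
```

Under this assumed test, several behaviours that look like significant changes with the naive standard errors no longer do once the robust standard errors are used.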