Biostatistics Short Course Introduction to Longitudinal Studies



Similar documents
Chapter 7: Simple linear regression Learning Objectives

Assignments Analysis of Longitudinal data: a multilevel approach

This can dilute the significance of a departure from the null hypothesis. We can focus the test on departures of a particular form.

Chapter 1. Longitudinal Data Analysis. 1.1 Introduction

DEPARTMENT OF PSYCHOLOGY UNIVERSITY OF LANCASTER MSC IN PSYCHOLOGICAL RESEARCH METHODS ANALYSING AND INTERPRETING DATA 2 PART 1 WEEK 9

Factors affecting online sales

MTH 140 Statistics Videos

STT 200 LECTURE 1, SECTION 2,4 RECITATION 7 (10/16/2012)

Statistical Models in R

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

Two-sample hypothesis testing, II /16/2004

Chapter 5 Analysis of variance SPSS Analysis of variance

Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means

Analyzing Intervention Effects: Multilevel & Other Approaches. Simplest Intervention Design. Better Design: Have Pretest

Curriculum - Doctor of Philosophy

The importance of graphing the data: Anscombe s regression examples

BASIC COURSE IN STATISTICS FOR MEDICAL DOCTORS, SCIENTISTS & OTHER RESEARCH INVESTIGATORS INFORMATION BROCHURE

Stat 412/512 CASE INFLUENCE STATISTICS. Charlotte Wickham. stat512.cwick.co.nz. Feb

10. Analysis of Longitudinal Studies Repeat-measures analysis

Chapter 13 Introduction to Linear Regression and Correlation Analysis

Study Design and Statistical Analysis

Using SAS Proc Mixed for the Analysis of Clustered Longitudinal Data

Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics

Introduction to Analysis of Variance (ANOVA) Limitations of the t-test

I L L I N O I S UNIVERSITY OF ILLINOIS AT URBANA-CHAMPAIGN

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression

17. SIMPLE LINEAR REGRESSION II

Data Mining Introduction

Geostatistics Exploratory Analysis

Introduction to Longitudinal Data Analysis

Introduction. Hypothesis Testing. Hypothesis Testing. Significance Testing

Linear Regression. Chapter 5. Prediction via Regression Line Number of new birds and Percent returning. Least Squares

Chapter Eight: Quantitative Methods

2013 MBA Jump Start Program. Statistics Module Part 3

Longitudinal Data Analysis

Module 5: Statistical Analysis

Introduction to Fixed Effects Methods

4.1 Exploratory Analysis: Once the data is collected and entered, the first question is: "What do the data look like?"

Service courses for graduate students in degree programs other than the MS or PhD programs in Biostatistics.

RMTD 404 Introduction to Linear Models

Analysis of Correlated Data. Patrick J. Heagerty PhD Department of Biostatistics University of Washington

Master of Public Health Program Competencies. Implemented Fall 2015

Electronic Thesis and Dissertations UCLA

Overview Classes Logistic regression (5) 19-3 Building and applying logistic regression (6) 26-3 Generalizations of logistic regression (7)

Data Analysis, Research Study Design and the IRB

Statistical Rules of Thumb

Predictor Coef StDev T P Constant X S = R-Sq = 0.0% R-Sq(adj) = 0.

WHAT IS A JOURNAL CLUB?

Basic Statistics and Data Analysis for Health Researchers from Foreign Countries

STATISTICS 8, FINAL EXAM. Last six digits of Student ID#: Circle your Discussion Section:

Introduction to Data Analysis in Hierarchical Linear Models

HLM software has been one of the leading statistical packages for hierarchical

Section 14 Simple Linear Regression: Introduction to Least Squares Regression

Biostatistics: Types of Data Analysis

Module 223 Major A: Concepts, methods and design in Epidemiology

An Introduction to Modeling Longitudinal Data

CHAPTER 13 SIMPLE LINEAR REGRESSION. Opening Example. Simple Regression. Linear Regression

Elements of statistics (MATH0487-1)

Linda K. Muthén Bengt Muthén. Copyright 2008 Muthén & Muthén Table Of Contents

An analysis method for a quantitative outcome and two categorical explanatory variables.

An Introduction to Statistical Tests for the SAS Programmer Sara Beck, Fred Hutchinson Cancer Research Center, Seattle, WA

TRINITY COLLEGE. Faculty of Engineering, Mathematics and Science. School of Computer Science & Statistics

C. The null hypothesis is not rejected when the alternative hypothesis is true. A. population parameters.

Simple Predictive Analytics Curtis Seare

COUPLE OUTCOMES IN STEPFAMILIES

What is discrete longitudinal data analysis?

Using R for Linear Regression

3. There are three senior citizens in a room, ages 68, 70, and 72. If a seventy-year-old person enters the room, the

Organizing Your Approach to a Data Analysis

Longitudinal Data Analyses Using Linear Mixed Models in SPSS: Concepts, Procedures and Illustrations

2. Simple Linear Regression

Bill Burton Albert Einstein College of Medicine April 28, 2014 EERS: Managing the Tension Between Rigor and Resources 1

Linear Models in STATA and ANOVA

PRACTICE PROBLEMS FOR BIOSTATISTICS

Recall this chart that showed how most of our course would be organized:

Statistics 305: Introduction to Biostatistical Methods for Health Sciences

Statistiek II. John Nerbonne. October 1, Dept of Information Science

Tips for surviving the analysis of survival data. Philip Twumasi-Ankrah, PhD

Missing Data: Part 1 What to Do? Carol B. Thompson Johns Hopkins Biostatistics Center SON Brown Bag 3/20/13

LAGUARDIA COMMUNITY COLLEGE CITY UNIVERSITY OF NEW YORK DEPARTMENT OF MATHEMATICS, ENGINEERING, AND COMPUTER SCIENCE

Data Mining Techniques Chapter 5: The Lure of Statistics: Data Mining Using Familiar Tools

Correlation key concepts:

How To Run Statistical Tests in Excel

Outline. Topic 4 - Analysis of Variance Approach to Regression. Partitioning Sums of Squares. Total Sum of Squares. Partitioning sums of squares

South Carolina College- and Career-Ready (SCCCR) Probability and Statistics

Lecture 6: Poisson regression

Examining Differences (Comparing Groups) using SPSS Inferential statistics (Part I) Dwayne Devonish

Econometrics and Data Analysis I

Qualitative vs Quantitative research & Multilevel methods

Basic Probability and Statistics Review. Six Sigma Black Belt Primer

socscimajor yes no TOTAL female male TOTAL

Data analysis process

Principles of Hypothesis Testing for Public Health

Elementary Statistics Sample Exam #3

II. DISTRIBUTIONS distribution normal distribution. standard scores

Cancer Biostatistics Workshop Science of Doing Science - Biostatistics

Adequacy of Biomath. Models. Empirical Modeling Tools. Bayesian Modeling. Model Uncertainty / Selection

Do Direct Stock Market Investments Outperform Mutual Funds? A Study of Finnish Retail Investors and Mutual Funds 1

Transcription:

Biostatistics Short Course Introduction to Longitudinal Studies Zhangsheng Yu Division of Biostatistics Department of Medicine Indiana University School of Medicine Zhangsheng Yu (Indiana University) Longitudinal Study Short Course for Physicians 1 / 27

Outline 1 Introduction 2 Example 3 Approaches to Data Analysis 4 Summary Zhangsheng Yu (Indiana University) Longitudinal Study Short Course for Physicians 2 / 27

Introduction Course objectives Familiarize with basics of longitudinal studies. Know the differences between longitudinal studies and cross-sectional studies. Learn how to perform simple analysis. Learn the basics of random effects models for longitudinal data. Zhangsheng Yu (Indiana University) Longitudinal Study Short Course for Physicians 3 / 27

Introduction What are longitudinal studies? Measures collected repeatedly on the same individuals over time. Examples heights or weights in growth studies; numbers of tumors in cancer studies during the follow up period; depression scores in a mental health treatment study during the follow up period. In epidemiology, cohort study is a longitudinal study. A cohort is a set of individuals sharing a common characteristic (e.g. same baseline age) or experience in a particular time period. Zhangsheng Yu (Indiana University) Longitudinal Study Short Course for Physicians 4 / 27

Introduction Why longitudinal studies? In longitudinal data analysis: - Interest is on the behavior of a response variable over time. - Require special statistical methods to address intra-individual correlation and inter-individual variability. They can tease out changes over time within individuals (age effects) from differences among groups of people bonded by time or common life experience (cohort effects). They will increase the sample size". Zhangsheng Yu (Indiana University) Longitudinal Study Short Course for Physicians 5 / 27

Introduction Cross-sectional studies They aim to describe the relationship between diseases and potential risk factors in a specified population at a particular time. They must be done on representative samples of the population for valid generalization. Zhangsheng Yu (Indiana University) Longitudinal Study Short Course for Physicians 6 / 27

Introduction Longitudinal studies vs. cross-sectional studies Longitudinal Advantage - Age vs. cohort effects - Efficiency improvement - Time to event analysis Disadvantage - Expensive - Analysis more complicated - Missing values Cross-sectional Advantage - Less expensive - Simple data structure - Many analysis tools Disadvantage - Unable to detect age effects - more difficult to generalize Zhangsheng Yu (Indiana University) Longitudinal Study Short Course for Physicians 7 / 27

Introduction Reading ability example: cohort vs. age effects Reading Ability 1.5 2.0 2.5 3.0 3.5 4.0 4.5 2 3 4 5 6 Age (years) Zhangsheng Yu (Indiana University) Longitudinal Study Short Course for Physicians 8 / 27

Introduction Reading ability example: cohort vs. age effects Reading Ability 1.5 2.0 2.5 3.0 3.5 4.0 4.5 Reading Ability 1.5 2.0 2.5 3.0 3.5 4.0 4.5 2 3 4 5 6 Age (years) 2 3 4 5 6 Age (years) Zhangsheng Yu (Indiana University) Longitudinal Study Short Course for Physicians 8 / 27

Introduction Cohort vs. age effects Age effects: changes over time within subject. Cohort effects: differences between subjects at baseline. Cross-sectional data can NOT separate age from cohort effects. Zhangsheng Yu (Indiana University) Longitudinal Study Short Course for Physicians 9 / 27

Example Pothoff and Roy dental study (1964) 27 children, 16 boys, 11 girls; On each child, distance (mm) from the center of the pituitary to the pteryomaxillary fissure measured on each child at ages 8, 10, 12, and 14 years of age; A continuous measure of growth. Questions of interest: Does distance change over time? What is the pattern of change? Are the patterns different for boys and girls? How? Zhangsheng Yu (Indiana University) Longitudinal Study Short Course for Physicians 10 / 27

Example Distance (mm) 15 20 25 30 Pothoff and Roy (1964) dental study Boys Girls 8 9 10 11 12 13 14 Age (years) Zhangsheng Yu (Indiana University) Longitudinal Study Short Course for Physicians 11 / 27

Example Distance (mm) 15 20 25 30 Sample average dental distance (Pothoff and Roy, 1964) Boys Girls 8 9 10 11 12 13 14 Age (years) Zhangsheng Yu (Indiana University) Longitudinal Study Short Course for Physicians 12 / 27

Example Some observations All children have all 4 measurements at the same time points (balanced). Children who start high or low tend to stay high or low. The individual pattern for most children follows a rough straight line increase (with some jitter). Average distance (across boys and across girls) follows an approximate straight line pattern. Zhangsheng Yu (Indiana University) Longitudinal Study Short Course for Physicians 13 / 27

Example Statistical issues Scientific objectives can be formulated as regression problems: Dependence of a response variable on explanatory variables. Repeated observations on each experimental unit: observations from one unit to the next are independent; multiple observations on the same unit are dependent. Multiple observations from each subject makes longitudinal data powerful. It also makes it a challenge to analyze. Zhangsheng Yu (Indiana University) Longitudinal Study Short Course for Physicians 14 / 27

Approaches to Data Analysis Analysis of longitudinal data Exploratory analysis comprises techniques to visualize patterns of data. Confirmatory analysis is judicial work, weighing evidence in data for or against hypotheses. Zhangsheng Yu (Indiana University) Longitudinal Study Short Course for Physicians 15 / 27

Approaches to Data Analysis Exploratory analysis to longitudinal data Highlight aggregate patterns of potential scientific interest; Identify both cross-sectional and longitudinal patterns; Make easy the identification of unusual individuals or unusual observations. Zhangsheng Yu (Indiana University) Longitudinal Study Short Course for Physicians 16 / 27

Approaches to Data Analysis Confirmatory analysis to longitudinal data ad hoc analysis: Reduce the repeated values into one or two summary variables; Treat repeatedly observed outcomes from the same subjects as if they are independent and perform regular regression. formal statistical modeling: Random effects models; Marginal models; Markov transition models. Zhangsheng Yu (Indiana University) Longitudinal Study Short Course for Physicians 17 / 27

Approaches to Data Analysis Pothoff and Roy dental study (1964) GROUP=Boys Variable N Mean Std Dev minimum Maximum AGE8 16 22.875 2.452 17.0 27.5 AGE10 16 23.812 2.136 20.5 28.0 AGE12 16 25.718 2.651 22.5 31.0 AGE14 16 27.468 2.085 25.0 31.5 GROUP=Girls Variable N Mean Std Dev minimum Maximum AGE8 11 21.182 2.124 16.5 24.5 AGE10 11 22.227 1.902 19.0 25.0 AGE12 11 23.091 2.364 19.0 28.0 AGE14 11 24.091 2.437 19.5 28.0 Zhangsheng Yu (Indiana University) Longitudinal Study Short Course for Physicians 18 / 27

Approaches to Data Analysis ad hoc analysis of the dental study Do things change over time? For each gender, compare the mean at age 8 to the means at subsequent ages (i.e. age 10, 12, and 14) using paired t-tests; P-values for boys: 0.15, 0.0003, 0.0001; Conclusion? Multiple comparisons? How to characterize change"? Zhangsheng Yu (Indiana University) Longitudinal Study Short Course for Physicians 19 / 27

Approaches to Data Analysis ad hoc approach Are the patterns different for boys and girls? Compare mean distances between boys and girls at each ages 8, 10, 12, 14 using two-sample t-tests; P-values: 0.08, 0.06, 0.01, 0.001; Conclusion? Multiple comparisons? How to put this together " to say something about the differences in patterns and how they differ? What are the patterns, anyway? Zhangsheng Yu (Indiana University) Longitudinal Study Short Course for Physicians 20 / 27

Approaches to Data Analysis Random effects model: a formal statistical approach The model describes patterns of change as trends; e.g., a straight line. The model allows starting value (intercept) and trend vary among subject. The model acknowledges associations among observations on the same subject by formally modeling the correlation. Zhangsheng Yu (Indiana University) Longitudinal Study Short Course for Physicians 21 / 27

Approaches to Data Analysis Graphical explication of the random effects model Y(t) = 16 + 0.8t + e(t) Y(t) = (16+noise) + 0.8t + e(t) Distance(mm) 10 15 20 25 Distance(mm) 10 15 20 25 0 1 2 3 4 5 6 Time (years) 0 1 2 3 4 5 6 Time (years) Y(t) = 16 + (0.8+noise)t + e(t) Y(t) = (16+nois) + (0.8+nois)t + e(t) Distance(mm) 10 15 20 25 Distance(mm) 10 15 20 25 0 1 2 3 4 5 6 Time (years) 0 1 2 3 4 5 6 Time (years) Zhangsheng Yu (Indiana University) Longitudinal Study Short Course for Physicians 22 / 27

Approaches to Data Analysis Illustration from the dental study Distance measurements for subject i at occasion j is denoted y ij. Note that t ij = 8, 10, 12, 14. Errors in measuring distance are likely committed at each time point. The term e ij represents the error" in measurement. Each child has its own growth trajectory in the form of a straight line with intercept β 0i and slope β 1i : y ij = β 0i +β 1i t ij + e ij. In a linear regression model with gender, age, and gender age interaction as covariates: Boys : β 0i = β 0B,β 1i = β 1B Girls : β 0i = β 0G,β 1i = β 1G Zhangsheng Yu (Indiana University) Longitudinal Study Short Course for Physicians 23 / 27

Approaches to Data Analysis Illustration from the dental study Random effect model: y ij = β 0i +β 1i t ij + e ij. Boys : β 0i = β 0B + b 0i,β 1i = β 1B + b 1i Girls : β 0i = β 0G + b 0i,β 1i = β 1G + b 1i There are population average intercept and slope across boys and across girls. Individual intercepts and slopes deviate from these. So if β 0i is high" relative to β 0B, then i s trajectory will be high" relative to other boys. β 1i may be steeper or shallower than the average increase (β 1B ). y ij for different j will depend β 0i and β 1i, so they are correlated. Zhangsheng Yu (Indiana University) Longitudinal Study Short Course for Physicians 24 / 27

Approaches to Data Analysis Analysis results of the dental study Solution for Fixed Effects Effect gender Estimate S.E. DF t Value Pr > t Intcpt 16.341 1.019 25 16.04 <.0001 gender F 1.032 1.596 54 0.65 0.5205 gender M 0.... age 0.784 0.086 25 9.12 <.0001 age*gender F -0.305 0.135 54-2.26 0.0277 age*gender M 0.... Tests of Fixed Effects Effect Num DF Den DF F Value Pr > F gender 1 54 0.42 0.5205 age 1 25 88.00 <.0001 age*gender 1 54 5.12 0.0277 Zhangsheng Yu (Indiana University) Longitudinal Study Short Course for Physicians 25 / 27

Approaches to Data Analysis Analysis results of the dental study (continued) Is there a change in distance over time for boys and girls? β 1B = 0? β 1G = 0? - estimates: ˆβ 1B = 0.784, ˆβ 1G = 0.784 0.305 = 0.479; - p-values: < 0.0001. Is the pattern of change the same? β 1B = β 1G? (p-value = 0.027). Zhangsheng Yu (Indiana University) Longitudinal Study Short Course for Physicians 26 / 27

Summary Summary Information in longitudinal data is often not fully exploited because ad hoc methods are used. By conceptualizing longitudinal data, we gain a framework to make the most". Focused here on continuous measurements, but similar methods exist for binary (yes/no) and categorical outcomes. Zhangsheng Yu (Indiana University) Longitudinal Study Short Course for Physicians 27 / 27

Acknowledgement Summary Slides courtesy of Dr. Menggang Yu. Zhangsheng Yu (Indiana University) Longitudinal Study Short Course for Physicians 28 / 27