SPPH 501 Analysis of Longitudinal & Correlated Data September, 2012



Similar documents
EDMS 769L: Statistical Analysis of Longitudinal Data 1809 PAC, Th 4:15-7:00pm 2009 Spring Semester

MATH5885 LONGITUDINAL DATA ANALYSIS

USC Columbia, Lancaster, Salkehatchie, Sumter & Union campuses

Department of Epidemiology and Public Health Miller School of Medicine University of Miami

BIO 226: APPLIED LONGITUDINAL ANALYSIS COURSE SYLLABUS. Spring 2015

College of Public Health & Health Professions

ADL - Longitudinal Data Analysis

PLS 801: Quantitative Techniques in Political Science

BIOM611 Biological Data Analysis

CBE 9190B ADVANCED STATISTICAL PROCESS ANALYSIS COURSE OUTLINE

RR765 Applied Multivariate Analysis


POS 6933 Section 016D Spring 2015 Multilevel Models Department of Political Science, University of Florida

CS&SS / STAT 566 CAUSAL MODELING

Service courses for graduate students in degree programs other than the MS or PhD programs in Biostatistics.

CS 425 Software Engineering

Module 223 Major A: Concepts, methods and design in Epidemiology

STAT3016 Introduction to Bayesian Data Analysis

Highlights the connections between different class of widely used models in psychological and biomedical studies. Multiple Regression

NORTHWESTERN UNIVERSITY Department of Statistics. Fall 2012 Statistics 210 Professor Savage INTRODUCTORY STATISTICS FOR THE SOCIAL SCIENCES

Executive Master of Public Administration. QUANTITATIVE TECHNIQUES I For Policy Making and Administration U6310, Sec. 03

Grading. The grading components are as follows: Midterm Exam 25% Final Exam 35% Problem Set 10% Project Assignment 20% Class Participation 10%

ECON 523 Applied Econometrics I /Masters Level American University, Spring Description of the course

The University of North Carolina at Greensboro CRS 605: Research Methodology in Consumer, Apparel, and Retail Studies (3 Credits) Spring 2014

Register for CONNECT using the code with your book and this course access information:

Economic Statistics (ECON2006), Statistics and Research Design in Psychology (PSYC2010), Survey Design and Analysis (SOCI2007)

CS 425 Software Engineering. Course Syllabus

CS Data Science and Visualization Spring 2016

Section Format Day Begin End Building Rm# Instructor. 001 Lecture Tue 6:45 PM 8:40 PM Silver 401 Ballerini

Linear Mixed-Effects Modeling in SPSS: An Introduction to the MIXED Procedure

PSYC 6190 LONGITUDINAL DATA ANALYSIS. Course Outline Fall 2015/2016

Assessing the Proposed 2014 Statistics Curriculum 9/22/2013 V0A. 1

LOGOM 3300: Business Statistics Fall 2015

Analysis of Correlated Data. Patrick J. Heagerty PhD Department of Biostatistics University of Washington

PSYCH 6811 Statistical Methods in Psychology II Spring Course Description

Syllabus for MATH 191 MATH 191 Topics in Data Science: Algorithms and Mathematical Foundations Department of Mathematics, UCLA Fall Quarter 2015

Canisius College Computer Science Department Computer Programming for Science CSC107 & CSC107L Fall 2014

LAGUARDIA COMMUNITY COLLEGE CITY UNIVERSITY OF NEW YORK DEPARTMENT OF MATHEMATICS, ENGINEERING, AND COMPUTER SCIENCE

MYRA School of Business Mysore, India. 3. Case: Rodeo Jeans: Promotional and Pricing Strategy (2012)

Biology BSC 6932 Applied Regression for Scientists Fall 2014

BUAD 310 Applied Business Statistics. Syllabus Fall 2013

Course Syllabus. Purposes of Course:

Geography 676 Web Spatial Database Development and Programming

Applied Clinical Research (POPM*6230) Fall 2015

Econometrics and Data Analysis I

Financial Risk and Management

Commerce 4KF3 Project Management Fall 2014 Course Outline- Tentative. Information Systems Area DeGroote School of Business McMaster University

Overview of Methods for Analyzing Cluster-Correlated Data. Garrett M. Fitzmaurice

Advanced GIS M W F 12:00 1:00 GRG 316

Study Design Sample Size Calculation & Power Analysis. RCMAR/CHIME April 21, 2014 Honghu Liu, PhD Professor University of California Los Angeles

College of Health and Human Services. Fall Syllabus

Truman College-Mathematics Department Math 125-CD: Introductory Statistics Course Syllabus Fall 2012

Brown University Department of Economics Spring 2015 ECON 1620-S01 Introduction to Econometrics Course Syllabus

Review of the Methods for Handling Missing Data in. Longitudinal Data Analysis

RMTD 404 Introduction to Linear Models

Metrics for Managers: Big Data and Better Answers. Fall Course Syllabus DRAFT. Faculty: Professor Joseph Doyle E

Customer Relationship Management (BUSML 4212)

FINC 4531 B Intermediate Corporate Finance Tuesdays and Thursdays from 5:30-6:45, Adamson 227 Expanded Course Outline Fall 2010

POL 451: Statistical Methods in Political Science

AMIS 7640 Data Mining for Business Intelligence

Introducing the Multilevel Model for Change

CS 425 Software Engineering. Course Syllabus

Quantitative Analysis in International Affairs American University School of International Service SIS Fall 2008

CSci 538 Articial Intelligence (Machine Learning and Data Analysis)

Office: LSK 5045 Begin subject: [ISOM3360]...

MBAD/DSBA 6278 (U90): Innovation Analytics (IA)

QMB Business Analytics CRN Fall 2015 W 6:30-9:15 PM -- Lutgert Hall 2209

COLLIN COLLEGE COURSE SYLLABUS

QMB Business Analytics CRN Fall 2015 T & R 9.30 to AM -- Lutgert Hall 2209

Mathematics for Business Analysis I Fall 2007

INFS5873 Business Analytics. Course Outline Semester 2, 2014

Introduction to mixed model and missing data issues in longitudinal studies

COMP-557: Fundamentals of Computer Graphics McGill University, Fall 2010

Audit Analytics. --An innovative course at Rutgers. Qi Liu. Roman Chinchila

Techniques of Statistical Analysis II second group

Faculty of Management Marketing Research MGT 3220 Y Fall 2015 Tuesdays, 6:00pm 8:50pm Room: S4027 Lab: N637

PTE505: Inverse Modeling for Subsurface Flow Data Integration (3 Units)

COLUMBIA UNIVERSITY IN THE CITY OF NEW YORK DEPARTMENT OF INDUSTRIAL ENGINEERING AND OPERATIONS RESEARCH

SI649 Information Visualization. Learning Objectives. Tentative Schedule. Preliminary Syllabus, Fall 2012

Department/Academic Unit: Public Health Sciences Degree Program: Biostatistics Collaborative Program

Biology 360 Genetics Lecture Syllabus and Schedule, Fall 2012 Tentative

: Introduction to Machine Learning Dr. Rita Osadchy

Prepared by Seungwon Shawn Lee 1

Statistics 3202 Introduction to Statistical Inference for Data Analytics 4-semester-hour course

Differential Equations Department of Mathematics College of Southern Nevada MATH 285 Section 2001 Fall 2015 TR 9:30 AM-10:50 AM Cheyenne S 134

BBA 380 Management for Environmental Sustainability and Durable Competitive Advantage THE BBA PROGRAM

School of Business ACCT2105/BUSI0027 (Subclasses A, B, C) Introduction to Management Accounting/ Management Accounting I Course Syllabus

THE UNIVERSITY OF HONG KONG FACULTY OF BUSINESS AND ECONOMICS

ECON 424/CFRM 462 Introduction to Computational Finance and Financial Econometrics

Workflow Design and Analysis

FIVS 316 BIOTECHNOLOGY & FORENSICS Syllabus - Lecture followed by Laboratory

Tel: Tuesdays 12:00-2:45

Fall Lecture: MWRF: 12 noon - 12:50 p.m. at Moulton 214. Lab at Moulton 217: Sec. 2: M 9 a.m. - 11:50 a.m.; Sec. 3: M 2 p.m. - 4:50 p.m.

USP 531 GIS for Planners WINTER 2013

STAT 360 Probability and Statistics. Fall 2012

MBA K731 (E) Project Management Winter 2014 Course Outline. Information Systems Area DeGroote School of Business McMaster University

Dr. Stanny EXP 3082L Fall 2003 EXPERIMENTAL PSYCHOLOGY LABORATORY. Office Hours For Dr. Stanny: 9:00 AM - 11:30 AM Tuesday, Wednesday, & Thursday

Digital Communication Southwest College

CSCD18: Computer Graphics

Transcription:

SPPH 501 Analysis of Longitudinal & Correlated Data September, 2012 TIME & PLACE: Term 1, Tuesday, 1:30-4:30 P.M. LOCATION: INSTRUCTOR: OFFICE: SPPH, Room B104 Dr. Ying MacNab SPPH, Room 134B TELEPHONE: (604) 822-5593 EMAIL: Ying.macnab@ubc.ca OFFICE HOURS: By appointment TEACHING ASSISTANT: The T.A. for this course is Ms. Isabel Zhitong Zhu. She will provide instruction on computing and mark the assignments. COMPUTER LABS: Students in 501 receive priority to use the School computing lab on Tuesdays, Noon. - 1:30 P.M. Students are welcome to use their own rather than the lab computers. COURSE PHILOSOPHY AND OBJECTIVES: This course will introduce students to concepts and methods in the analysis of correlated data, with special emphasis on longitudinal and hierarchical data. By the end of the course students are expected to: 1. Recognize the types of study/sampling designs that give rise to correlated data. 2. Appreciate the benefits and limitations of longitudinal study designs for assessing the association between health outcomes and their determinants. 3. Translate conceptual models relating health outcomes and their determinants into statistical models. 4. Understand the distinction between fixed effects and random effects and the conditions under which it is appropriate to treat a parameter as a random effect. 5. Identify different analytic approaches to longitudinal/correlated data analysis and explain the advantages and disadvantages of each approach. PREREQUISITES: SPPH 400, 500 and 502 or their equivalents.

COURSE REFERENCES: The primary reference used in this course will be 1. Gelman A, and Hill J. Data Analysis Using Regression and Multilevel/Hierarchical Models. 2007. Cambridge. though some material will also be taken from: 1. Fitzmaurice GM, Laird NM, and Ware JH. 2011. Applied Longitudinal Analysis (2nd Edition). Wiley. 2. Diggle PJ, Heagerty P, Liang KY, and Zeger SL. 2002. Analysis of Longitudinal Data (2nd Edition). Oxford University Press. Computing: 1. The Gelman text provides some examples in R. Additional materials will be distributed during the course. 2. Singer JD. 1998. Using SAS PROC MIXED to Fit Multilevel Models, Hierarchical Models, and Individual Growth Models. Journal of Educational and Behavioral Statistics 24(4): 323-355. (For those intending to use SAS -- The examples in the Fitzmaurice text are all in SAS, but the Singer paper provides a more gentle introduction.) Additional: 1. Goldstein H. 2003. Multilevel Statistical Models (3rd Edition). Arnold. (A pdf of an older version, which suffices for this course, is available for download from http://www.ats.ucla.edu/stat/examples/msm_goldstein/default.htm.). 2. Pinheiro JC, and Bates DM. 2000. Mixed Effects Models in S and S-PLUS. Springer. (As indicated by the title, this text uses S-PLUS as the computing environment. While many of the basic functions and commands in R and S-PLUS are identical, this is not the case for mixed effects models so you should not purchase this text for learning the computing. I list it because although it is a text on computing, the examples used also illustrate many of the important concepts and theory important underlying mixed effects models.) COURSE NOTES: Course slides will be available on selected topics. Note that these are summary slides, not detailed content. Students are expected to have read the slides and related material prior to the class. Inclass time will usually alternate between short presentations of basic ideas by the instructor and class discussion of these ideas. STATISTICAL COMPUTING: Statistical analysis in this course will be illustrated using R. R is a free software environment for statistical computing and can be downloaded through the R Project homepage: www.r-project.org. Students are welcome to use other software packages (SAS, Stata, etc) instead of R to complete the assignments (limited support from the instructor and TA). COURSE EVALUATION: The course is graded on a Pass/Fail basis (68% required for a Pass). The components that go into the final grade are:

Assignments: 30% Final Project: 50% (30% report, 20% class presentation) Class Participation: 20% ASSIGNMENTS: Assignments will be distributed periodically. The main aim of the assignments is to help students become comfortable with translating conceptual models into mathematical models. Late assignments are not accepted. Assignments should be typed or neatly written. Mathematical notation and language should be used very carefully. Marks will be deducted for improper use of notation/jargon, particularly in cases where it renders the answer vague. Students working with the software are encouraged to consult with each other about how to get things done but all written work must be completed individually. Present only those the parts of the computer output that are relevant to your answer and highlight or underline the specific items of interest. Alternatively, transcribe those items to another page if you prefer. FINAL PROJECT: The due date for the write-up of the final project is December 4. There is considerable flexibility in the scope of the final project and students are encouraged to propose and develop a project in consultation with the instructor. Examples of projects include: 1. Critiquing of longitudinal analyses used in published studies: Identify the type of design and the models used. How appropriate were the models to the questions of interest? What were the limitations of the design and analysis? What might have been done or could be done in the future to better address the questions? 2. Design of a longitudinal study: Identify a question which requires longitudinal data to answer. Discuss possible ways (and the advantages/disadvantages of each) of collecting data to answer the question. Propose specific models for analyzing the data with discussion of how each model is related to and accounts for various features of the data. 3. Analysis of a dataset: Starting with a given dataset and various questions of interest, perform the necessary analyses (graphical, modeling, diagnostics, interpretation) to address those questions. 4. Special topics: Develop a "mini-lecture" that enhances material on a topic (e.g. missing data, transition models, causal models, etc) that is not discussed in-depth in class. 5. Other software packages: Develop a "tutorial" to show how the analyses illustrated in class using R would be done using another software package. Compare and contrast the capabilities/limitations of the two packages. EXAMS: There are no exams for this course. COURSE OUTLINE:

The following schedule is tentative. Changes may be made to accommodate the needs and interests of the students. Class #1 (Tuesday, Sep 4): Review of distributions, probability, conditional probability, expected value, correlation. Review of sampling designs and data structures. Sources of correlated data (hierarchical, longitudinal, spatial). Implications for statistical inference of various types (association, causality, prediction). Principles of regression modeling. Examples. Readings: Gelman, Ch 1 4, 9 Class #2 (Tuesday, Sep 11): In-depth discussion of applications of ideas from Class #1. Class #3 (Tuesday, Sep 18): Assignment #1 due. Grouped data. Stratification and clustering. Levels of analysis. Fixed and random effects. Analysis of studies with observations at two time-points. Exploring and summarizing grouped data. Readings: Gelman, Ch 11. Fitzmaurice, Ch 1 & 2. Diggle, Ch 1. Class #4 (Tuesday, Sep 25): Matrices. Correlation structures. Design and sample size considerations. Analytic approaches (derived variables, marginal models, mixed effects models). Derived variable analysis. Readings: Gelman, Ch 20. Fitzmaurice, Sec 7.1-7.7, Ch 3. Diggle, Ch 2 & 3. Class #5 (Tuesday, Oct 2): Assignment #2 due. Linear mixed effects models. Modeling of mean and variance structures. ML and REML estimation. Matrix representation. Readings: Gelman, Ch 12 & 13. Fitzmaurice, Ch 8. Diggle, Ch 4 & 5. Class #6 (Tuesday, Oct 9): Outline of proposed term project due. Review of generalized linear models (GLMs). Generalized linear mixed effects models (GLMMs). MLE-EM, PQL and MCMC estimation. Readings: Gelman, Ch 5, 6, 14 & 15. Fitzmaurice, Ch 10 & 12. Diggle, Ch 9. Class #7 (Tuesday, Oct 16): Assignment #3 due. Marginal models and GEE methods. Readings: Fitzmaurice, Ch 11. Diggle, Ch 8. Class #8 (Tuesday, Oct 23): Midterm tutorial/q&a led by the T.A. (The instructor will be absent.) Class #9 (Tuesday, Oct 30): Comparison of mixed effects models and marginal models. Readings: Fitzmaurice, Ch 13. Diggle, Ch 7. Class #10 (Tuesday, Nov 6): Draft of projects due for distribution to class for background/review send directly to class email list. Additional topics as time allows: missing data, transition models, time dependent confounding,

spatial models, Bayesian methods. Readings: Gelman, Ch 16 & 25. Fitzmaurice, Ch 14. Diggle, Ch 10, 12 & 13. Class #11 (Tuesday, Nov 13): Assignment #4 due. Additional topics (continued) as time allows. Student presentations. Class #12 (Tuesday, Nov 20): Student presentations. Class #13 (Tuesday, Nov 27): Student presentations.