Psychology 209. Longitudinal Data Analysis and Bayesian Extensions Fall 2012



Similar documents
Service courses for graduate students in degree programs other than the MS or PhD programs in Biostatistics.

Department of Epidemiology and Public Health Miller School of Medicine University of Miami

EDMS 769L: Statistical Analysis of Longitudinal Data 1809 PAC, Th 4:15-7:00pm 2009 Spring Semester

Longitudinal Data Analysis. Wiley Series in Probability and Statistics

Instructor/TA Info. Course Information. Instructor Information. Description. Prerequisites. Materials. Learning Outcomes

RR765 Applied Multivariate Analysis

CALIFORNIA STATE UNIVERSITY CHANNEL ISLANDS PSY494 POSITIVE PSYCHOLOGY RESEARCH FALL 2015 SYLLABUS DR. CHRISTY TERANISHI MARTINEZ

Statistics Graduate Courses

Regression and Multivariate Data Analysis

Handling attrition and non-response in longitudinal data

CHAPTER 9 EXAMPLES: MULTILEVEL MODELING WITH COMPLEX SURVEY DATA

Intro to Graduate Education and Technology ED 500 Spring, 2012 ONLINE Course Outline

An Introduction to Latent Class Growth Analysis and Growth Mixture Modeling

RMTD 404 Introduction to Linear Models

STAT3016 Introduction to Bayesian Data Analysis

EMR 6410 Fundamentals of Measurement in the Behavioral Sciences. Course Pack Topics

DAVOOD TOFIGHI POSITION present Assistant Professor, School of Psychology, Georgia Institute of Technology EDUCATION

Florida State College at Jacksonville MAC 1105: College Algebra Summer Term 2011 Reference: MW 12:00 PM 1:45 PM, South Campus Rm: G-314

Advanced Statistics & Data Analysis

STA 4273H: Statistical Machine Learning

Math 830- Elementary Algebra

Faculty of Science School of Mathematics and Statistics

UNIVERSITY OF BRIDGEPORT School of Engineering COURSE SYLLABUS. TCMG-555-6T1 Project Management

BIO 226: APPLIED LONGITUDINAL ANALYSIS COURSE SYLLABUS. Spring 2015

Categorical Data Analysis

MASTER COURSE SYLLABUS-PROTOTYPE PSYCHOLOGY 2317 STATISTICAL METHODS FOR THE BEHAVIORAL SCIENCES

2015 TUHH Online Summer School: Overview of Statistical and Path Modeling Analyses

PLS 801: Quantitative Techniques in Political Science

Introduction to Data Analysis in Hierarchical Linear Models

Statistics in Applications III. Distribution Theory and Inference

Program description for the Master s Degree Program in Mathematics and Finance

Section Format Day Begin End Building Rm# Instructor. 001 Lecture Tue 6:45 PM 8:40 PM Silver 401 Ballerini

CHAPTER 8 EXAMPLES: MIXTURE MODELING WITH LONGITUDINAL DATA

Comparison of Estimation Methods for Complex Survey Data Analysis

Online College Algebra

Executive Master of Public Administration. QUANTITATIVE TECHNIQUES I For Policy Making and Administration U6310, Sec. 03

EVANGEL UNIVERSITY Behavioral Sciences Department

Dr. Stanny EXP 3082L Fall 2003 EXPERIMENTAL PSYCHOLOGY LABORATORY. Office Hours For Dr. Stanny: 9:00 AM - 11:30 AM Tuesday, Wednesday, & Thursday

MAT Elements of Modern Mathematics Syllabus for Spring 2011 Section 100, TTh 9:30-10:50 AM; Section 200, TTh 8:00-9:20 AM

Syllabus for MTH 311 Numerical Analysis

WILLIAM PATERSON UNIVERSITY CHRISTOS M. COTSAKOS COLLEGE OF BUSINESS Course Syllabus

POL 204b: Research and Methodology

Sociology 328A: Social Statistics (3.0 Credits) Department of Sociology, University of British Columbia Winter Term I, Sep 03 Nov 28, 2013

EDAD 695 Research Methodology 01W Course Syllabus Fall Ed Leadership Office: Education North 105, P.O. Box 3011 Commerce, Texas 75429

PTE505: Inverse Modeling for Subsurface Flow Data Integration (3 Units)

Faculty of Management Marketing Research MGT 3220 Y Fall 2015 Tuesdays, 6:00pm 8:50pm Room: S4027 Lab: N637

Course Title: Minorities and the Criminal Justice System Course Prefix: CRJS Course No.: 3933 Section No.: PO1

Deciding on the Number of Classes in Latent Class Analysis and Growth Mixture Modeling: A Monte Carlo Simulation Study

Introduction to Statistics (Online) Syllabus/Course Information

PCO 6317: Psychological Assessment 2 (Personality) Spring 2013 * Wednesdays 10:40 am - 1:40 pm * CWC 231

Linda K. Muthén Bengt Muthén. Copyright 2008 Muthén & Muthén Table Of Contents

PART 1: INSTRUCTOR INFORMATION, COURSE DESCRIPTION AND TEACHING METHODS

California University of Pennsylvania Guidelines for New Course Proposals University Course Syllabus Approved: 2/4/13. Department of Psychology

MATH5885 LONGITUDINAL DATA ANALYSIS

COLUMBIA UNIVERSITY IN THE CITY OF NEW YORK DEPARTMENT OF INDUSTRIAL ENGINEERING AND OPERATIONS RESEARCH

Course Descriptions. Seminar in Organizational Behavior II

Systems and Internet Marketing Syllabus Fall 2012 Department of Management, Marketing and International Business

CASPER COLLEGE COURSE SYLLABUS

REHB 461. Fall Introduction to Alcoholism and Drug Abuse. Mondays, 6:00-9:00 pm

Database Management Systems ISM 4212 (3 Credit Hours) Fall 2011

Psychological Testing (PSYCH 149) Syllabus

Introduction to Physics I (PHYS ) Fall Semester 2012

Auxiliary Variables in Mixture Modeling: 3-Step Approaches Using Mplus

EXPECTED LEARNING OUTCOMES:

How To Pass Onliner College Algebra 1314 Online Online Online Online Online

Multicultural Psychological Assessment II: Advanced Clinical and Personality Assessment PSY F687, 3credits Summer 2014

Business Management ACC414 Intermediate Accounting 1 Fall 2009 Section 011

SPSS TRAINING SESSION 3 ADVANCED TOPICS (PASW STATISTICS 17.0) Sun Li Centre for Academic Computing lsun@smu.edu.sg

List of Ph.D. Courses

SYLLABUS Honors College Algebra MAC 1105H / 3 credit hours Fall 2014

1 Goals & Prerequisites

MATHEMATICAL TOOLS FOR ECONOMICS ECON SPRING 2012

Executive Master of Public Administration. QUANTITATIVE TECHNIQUES I For Policy Making and Administration U6311, Sec. 003

Prairie View A & M University

Mathematics within the Psychology Curriculum

APPLIED MISSING DATA ANALYSIS

HKUST Course Syllabus Outline (experimental only)

Ordinary Differential Equations

Semester: Fall 2015 Course Title: College Algebra Course Prefix: Math Course No.: 1113 Section No.:P36 CRN: 11830

Public Human Resources Management PAD/NAL 630, PAD 518 Fall Christine L. Rush Office Hours: Tuesdays 4:00 5:00

PSY 350 ABNORMAL PSYCHOLOGY SPRING 2011

ADVANCED CORPORATE FINANCE I 2009 SPRING SEMESTER

ECON 523 Applied Econometrics I /Masters Level American University, Spring Description of the course

SOUTHWEST COLLEGE Department of Mathematics

11. Time series and dynamic linear models

Governors State University College of Business and Public Administration. Course: STAT Statistics for Management I (Online Course)

School of Health Professions Department of Health Services Administration Healthcare Management Program

Psychology 420 (Sections 101 and 102) Experimental Psychology: Social Psychology Laboratory

Math 3E - Linear Algebra (3 units)

MATHEMATICAL TOOLS FOR ECONOMICS ECON FALL 2011

Review of the Methods for Handling Missing Data in. Longitudinal Data Analysis

Decision Sciences Data Analysis for Managers

PH 7525 Introduction to Data & Statistical Packages Course Reference #: Spring 2011

Transcription:

Instructor: Psychology 209 Longitudinal Data Analysis and Bayesian Extensions Fall 2012 Sarah Depaoli (sdepaoli@ucmerced.edu) Office Location: SSM 312A Office Phone: (209) 228-4549 (although email will result in a faster response) Office Hours: Wednesdays 4:00-6:00pm, or by appointment Course Website: http://faculty1.ucmerced.edu/sdepaoli Course Time/Location: MW 6:00-7:15pm, COB 279 Course Description: Psychology 209 focuses on the analysis of longitudinal data. This course is broken into several main sections of content all dealing with different aspects of longitudinal data analysis. First, a general introduction will be provided, which will include discussion on advantages and challenges of longitudinal studies. The second section of the course will specifically focus on analysis of variance approaches to longitudinal data analysis (e.g., repeated measures and MANOVA approaches). The third section will introduce several methods using mixed-effects regression models for outcomes that are continuous, binary, ordinal, nominal, and for count data. The fourth section of this course will introduce methods for analyzing longitudinal data that are part of the general latent variable framework. We will cover models that estimate continuous growth (e.g., growth curve modeling and growth mixture modeling) and categorical growth over time (e.g., latent transition analysis). The last section of this course will cover the Bayesian estimation framework as a tool for estimating various longitudinal models. Advantages and reasons for using this framework will be discussed, as well as the basic features of Bayesian estimation (e.g., estimation algorithms and samplers). We will cover various prior distributions needed to estimate longitudinal models. The issue of where priors come from (and various ways to specify them) will also be covered. Finally, we will discuss several advanced-bayesian topics in relation to longitudinal data analysis, including model specification (and mis-specification), model comparison, and some special topics for longitudinal mixture models (e.g., label switching and problems with priors). We will be working with several different computer programs throughout this course: Mplus, R, and WinBUGS (OpenBUGS). You need not have prior exposure to Mplus or WinBUGS (OpenBUGS), but it is assumed that you have had some prior exposure to R. If you are not familiar with the R programming environment, please see the instructor to discuss supplementary material that you might benefit from before the semester begins; 1

note that exposure through PSY 202a and 202b is sufficient. Required Text: Hedeker, D., & Gibbons, R. D. (2006). Longitudinal Data Analysis. Hoboken, NJ: John Wiley & Sons. Other Required Readings (Instructor will Distribute): ***Please do not be intimidated by this additional reading list. In some cases, we will just be reading a portion of the assigned piece. This list has been incorporated into the course to help you grasp a deeper understanding of the topics while becoming more familiar with methodological literature. Collins, L. M., & Lanza, S. T. (2010). Latent Class and Latent Transition Analysis. Hoboken, NJ: John Wiley & Sons. Congdon, P. (2006a). Bayesian Models for Categorical Data. Hoboken, NJ: John Wiley & Sons. Congdon, P. (2006b). Bayesian Statistical Modelling. Hoboken, NJ: John Wiley & Sons. Depaoli, S. (in preparation). The effect of data-driven priors on parameter estimate bias for continuous growth models. Depaoli, S. (under review). Mixture class recovery in GMM under varying degrees of class separation: Frequentist versus Bayesian estimation. Depaoli, S. (2012). The ability for posterior predictive checking to identify model misspecification in Bayesian growth mixture modeling. Structural Equation Modeling. Grimm, K. A., & Ram, N. (2009). Nonlinear growth models in Mplus and SAS. Structural Equation Modeling, 16, 676 701. Jasra, A., Holmes, C. C., & Stephens, D. A. (2005). Markov chain Monte Carlo methods and the label switching problem in Bayesian mixture modeling. Statistical Science, 20, 50 67. Kaplan, D., & Depaoli, S. (2012). Bayesian statistical methods. In T. Little (Ed.), Handbook of quantitative methods (pp. TBD). Oxford: Oxford University Press. Kaplan, D. & Walpole, S. (2005). A stage sequential model of reading transitions: Evidence from the early childhood longitudinal study. Journal of Educational Psychology, 97, 551 563. Muthén, B. O. (2001). Second-generation structural equation modeling with a combination of categorical and continuous latent variables: New opportunities for latent class/latent growth modeling. In L. M. Collins & A. G. Sayer (Eds.), New methods for the analysis of change (pp. 291 322). Washington DC: APA. 2

Muthén, B. O. (2004). Latent variable analysis: Growth mixture modeling and related techniques for longitudinal data. In D. Kaplan (Ed.), Handbook of quantitative methodology for the social sciences (pp. 345 368). Newbury Park, CA: Sage Publications. Muthén, B. O., & Asparouhov, T. (2008). Growth mixture modeling: Analysis with non- Gaussian random effects. In G. Fitzmaurice, M. Davidian, G. Verbeke, & G. Molenberghs (Eds.), Longitudinal data analysis (pp. 143 165). Boca Raton: Chapman & Hall/CRC Press. Nylund, K. (2007). Latent transition analysis: Modeling extensions and an application to peer victimization. Doctoral dissertation, University of California, Los Angeles. Nylund, K., Asparouhov, T. & Muthén, B. (2007). Deciding on the number of classes in latent class analysis and growth mixture modeling: A Monte Carlo simulation study. Structural Equation Modeling, 14, 535 569. Sinharay, S. (2004). Experiences with Markov chain Monte Carlo convergence assessment in two psychometric examples. Journal of Educational and Behavioral Statistics, 29, 461 488. Tofighi, D., & Enders, C. K. (2008). Identifying the correct number of classes in growth mixture models. In G. R. Hancock & K. M. Samuelson (Eds.), Advances in latent variable mixture models (pp. 317 341). Charlotte, NC: Information age Publishing. Prerequisites: Successful completion of Psychology 202a/202b (or equivalent) or instructor consent. Understanding of ANOVA and regression analysis. Basic understanding of matrix algebra operations (material covered in 202b or akin is sufficient). Introductory knowledge of structural equation modeling (material covered in 202b or akin is sufficient). Basic familiarity with the R programming environment. Course Goals: Build a strong foundation in analyzing longitudinal data. Learn how to address various problems that commonly arise with longitudinal data (e.g., missing data and estimation problems). Understand the different longitudinal models that can be used with different types of data. Gain comfort in reading methodological literature (even applied researchers should be comfortable with this by the end of this course). 3

Recognize connections between some common longitudinal models. Gain exposure to the Bayesian estimation framework. Learn about different problematic features of Bayesian estimation of longitudinal models (e.g., label switching). Learn about various model testing and model comparison procedures under the frequentist and Bayesian modeling frameworks. Learn basic programming in Mplus, R, and WinBUGS (OpenBUGS). Course Outcomes: You should be able to demonstrate proficient knowledge of longitudinal data analysis and estimation through the homework assignments and a final paper. Specifically, through these assignments you should: Be able to identify the most appropriate analysis procedure for a particular longitudinal question. Understand the differences/similarities between ANOVA, regression, and SEM longitudinal data analysis approaches. Understand assumptions that are made within each modeling framework we discuss. Understand some of the drawbacks to longitudinal procedures and how to address or circumvent common problems. Be able to understand and manipulate prior distributions on model parameters within the Bayesian framework. Be able to run convergence diagnostics for Bayesian estimation algorithms. Utilize model constraints to aid in the prevention of label switching within longitudinal mixture models. Be able to write code and estimate a variety of models within the Mplus, R, and WinBUGS (OpenBUGS) programs. Be able to produce appropriate plots using all programs covered in this course. Understand methodological literature written about frequentist and Bayesian longitudinal data analysis methods. Build comfort with the mathematical notation used to represent longitudinal models, the algorithms used to estimate those models, and the statistics/criteria used to assess fit and compare models. 4

Course Grades: Grading is based on: seven homework assignments (49%), a take-home final project (51%). Homework assignments will be graded based on 10 points. The final project will be graded based on 100 points. A weighted linear combination will be taken at the end of the semester to determine the final course grade using the following criteria: Percentage Grade 101+ A+ 95-100 A 92-94 A- 89-91 B+ 85-88 B 82-84 B- 79-81 C+ 75-78 C 72-74 C- 69-71 D+ 65-68 D 62-64 D- 0-61 F If a student is on the cusp between two letter grades, classroom participation will determine whether the student receives the higher or lower letter grade. Assignments: Assignments will be handed out in class and will incorporate a combination of hand calculations, data analysis, and conceptual problems. All datasets for these assignments will be provided by the instructor. You will typically have one week to complete these assignments. Assignments are due at the beginning of class on the date specified. Final Paper: You will be expected to write a paper in APA style using some of the techniques that you have learned this semester. I encourage you to use your own data if you have access to longitudinal data. If you do not have access, I will provide you with some resources where you can obtain free longitudinal data sets. The paper will be comprised of an introduction where you substantively justify your variable(s) and model(s). You will need to write a complete methods section which describes your variable(s) and design this needs to also include a (brief) description of the frequentist and/or Bayesian methods that you will employ. This methods section will be followed by a results section where you will describe any relevant diagnostics and results based on frequentist model estimation and/or results based on Bayesian model estimation; you should include any relevant figures and tables as well. Finally, you will need to write a discussion section tying your findings and your introduction together. The final product that you hand in should be a manuscript of (near) publishable quality. I recognize that it is perhaps unreasonable to request a manuscript 5

ready for submission at the end of a one-semester course, however this paper should be of the quality that only minor tweaking and/or expanding of thoughts are needed before submission. Grading will be based on several different components, including substantive development of the model, handling of model diagnostics/fit, correct interpretation of model results, description of estimation processes, and APA style. I will also grade on the professionalism and technical aspects of your writing. This assignment will be due on the day of our scheduled final. I will not pre-read any assignments (i.e., I am not grading your paper multiple times throughout the course), but I am more than happy to discuss your papers with you (e.g., direction of analysis, aspects of the introduction, etc.) at any point during the semester. More details on this assignment will be provided as the semester progresses. Specifically, I will hand out a more detailed assignment sheet which has due dates for certain milestones for the assignment (e.g., a date you need to hand in a short summary of your data source and topic, etc). (Note: If you have missing data in your final project data, please see me early in the semester since we will not directly cover this topic in class until the end of the term.) Academic Integrity: Students should be familiar with University policies on academic honesty. A general code of conduct for the University of California can be found at: http://www.ucop.edu/ucophome/coordrev/ucpolicies/aos/uc100.html. Basically, do not cheat or plagiarize. This will earn you a very uncomfortable meeting with me and a zero on the assignment. Students are encouraged to work together on computational aspects of the homework assignments using the various software/programming languages we will work with. However, it is expected that you work independently on hand-calculation, discussion, and interpretation portions of the assignments. The words you submit in your written assignments should be entirely your own. Likewise, I expect that the take-home project will be done independent of your classmates. If you have questions/issues arise, please come see me. Late Assignments: Late assignments will not be accepted and will result in a grade of zero unless a reasonable justification can be made to me. Acceptable reasons for late work might be that you could not turn in the assignment due to illness (which may require a doctor note) or due to conference or other professional travel. I will never accept the excuses of I was too busy or I forgot. We are all busy and deadlines are important to respect. If you cannot turn in an assignment on time, please come and talk to me about it as soon as possible. Course Outline: **Note that I have never taught this course before so nothing on this schedule is set in stone. Week 2: 8/27: Introduction, advantages, challenges, the simplest longitudinal models Reading: H&G Chapter 1 8/29: Analysis of variance approaches 6

Reading: H&G Chapter 2 Week 3: 9/3: Labor Day, NO CLASS 9/5: Analysis of variance approaches cont. Assignments: HW 1 handed out, Term paper instructions handed out Week 4: 9/10: Analysis of variance approaches cont. 9/12: Multivariate analysis of variance approaches Reading: H&G Chapter 3 Week 5: 9/17: Intro to mixed-effects regression models Reading: H&G Chapter 4 Assignments: HW 1 due; HW 2 handed out 9/19: Mixed-effects regression models for continuous outcomes Week 6: 9/24: MRMs cont., Introduction to the EM algorithm Assignments: HW 2 due 9/26: EM algorithm cont. Week 7: 10/1: Finish EM algorithm, Mixed-effects polynomial regression models Reading: H&G Chapter 5 Assignments: Term paper abstract due, HW 3 handed out 10/3: Mixed effects polynomial regression models cont., Binary outcome MRMs Reading: H&G Chapters 5, 9 Week 8: 10/8: Binary and Ordinal outcome MRMs cont. Reading: H&G Chapters 9-10 10/10: Some specific autoregressive models Reading: H&G Chapter 7; Nylund (2007) [select pages to be assigned] Assignments: HW 3 due Week 9: 10/15: Growth curve modeling Reading: Muthen (2001); Muthen (2004); Muthen and Asparouhov (2008) 10/17: Intro to finite mixture modeling, growth mixture modeling Assignments: HW 4 handed out 7

Week 10: 10/22: The correct number of mixtures, confirmatory versus exploratory models Reading: Nylund, Asparouhov, & Muthen (2007); Tofighi & Enders (2008) 10/24: Non-linear latent growth models Reading: Grimm & Ram (2009) Week 11: 10/29: Categorical LVMs, intro to LCA (not longitudinal, but necessary) Reading: Collins and Lanza (2010) [chapter 7]; Nylund (2007) [select pages to be assigned] Assignments: HW 4 due 10/31: Latent transition analysis and other important Markov models Reading: Kaplan & Wapole (2005) Assignments: HW 5 handed out (11/1) Week 12: 11/5: Introduction to Bayesian estimation (estimation algorithm, samplers, priors) Reading: Kaplan & Depaoli (in press) 11/7: Introduction to Bayesian estimation cont, convergence diagnostics Reading: Sinharay (2004) Assignments: HW 5 due (11/9) Week 13: 11/12: Veteran s Day, NO CLASS 11/14: Finish convergence, WinBUGS: Someone save us all!! Assignments: HW 6 handed out Week 14: 11/19: Mommy, where do priors come from?? Reading: Depaoli (under review); Depaoli (in preparation) 11/21: Chopping Carrots Day: NO CLASS Week 15: 11/26: Bayesian longitudinal analysis Assignments: HW 6 due, HW 7 handed out 11/28: Bayesian model comparison and fit: All models are wrong...and so are the model assessment procedures! Reading: Depaoli (2012), Kaplan and Depaoli (2012) Week 16: 12/3: Bayesian model comparison and fit, Label Switching intro Reading: Jasra and Stephens (2005) 12/5: Problems: within- and between-chain label switching (mixture and non) Assignments: HW 7 due FINAL EXAM WEEK 12/13: Scheduled Final Exam Day Assignments: Final paper due by 6:00pm!! 8