MATH5885 LONGITUDINAL DATA ANALYSIS



Similar documents
Faculty of Science School of Mathematics and Statistics

SPPH 501 Analysis of Longitudinal & Correlated Data September, 2012

USC Columbia, Lancaster, Salkehatchie, Sumter & Union campuses

School of the Arts and Media

Department of Epidemiology and Public Health Miller School of Medicine University of Miami

ELEC4445 Entrepreneurial Engineering

BIOINFORMATICS METHODS AND APPLICATIONS

EDMS 769L: Statistical Analysis of Longitudinal Data 1809 PAC, Th 4:15-7:00pm 2009 Spring Semester

School of Public Health and Health Services Department of Epidemiology and Biostatistics

ACCT5910 BUSINESS ANALYSIS AND VALUATION

College of Public Health & Health Professions

MNGT5232 Data Analysis and Decision Making Under Uncertainty

Australian School of Business School of Information Systems, Management and Technology

ECON2103 Business and Government. Course Outline Semester 2, Part A: Course-Specific Information

RARITAN VALLEY COMMUNITY COLLEGE ACADEMIC COURSE OUTLINE MATH 111H STATISTICS II HONORS

INFS2848 INFORMATION SYSTEMS PROJECT MANAGEMENT COMP3711 SOFTWARE PROJECT MANAGEMENT

INFS5991 BUSINESS INTELLIGENCE METHODS. Course Outline Semester 1, 2015

Quantitative Analysis for Business BSNS102. Course Outline Semester One 2014

Australian School of Business School of Banking and Finance FINS 5533 REAL ESTATE FINANCE AND INVESTMENT

BIO 226: APPLIED LONGITUDINAL ANALYSIS COURSE SYLLABUS. Spring 2015

Australian School of Business School of Banking and Finance FINS3634 CREDIT ANALYSIS AND LENDING

Course outline. Code: BUS501 Title: Business Analytics and Statistics

Name of the module: Multivariate biostatistics and SPSS Number of module:

Australian School of Business School of Economics ECON 5111 ECONOMICS OF STRATEGY

GENS9004 /PSYC1022 Psychology of Addiction

Australian School of Business School of Banking and Finance FINS 5542 APPLIED FUNDS MANAGEMENT

School of Psychology PSYC 332: Behaviour Analysis 2013-Trimester 1. Lecturers: N Buist M Hunt A Macaskill

ECON828 International Investment and Risk. Semester 1, Department of Economics

INFS2608 ENTERPRISE DATABASE MANAGEMENT

School of Education. EDST5101: Interventionist and Experimental Design and Analysis. Semester 1

Australian School of Business School of Information Systems Technology and Management INFS5978 ACCOUNTING INFORMATION SYSTEMS

Mathematics Department Course outline Statistics for Social Science DW

MTH 110: Elementary Statistics (Online Course) Course Syllabus Fall 2012 Chatham University

INFS 5848 PROJECT, PORTFOLIO and PROGRAM MANAGEMENT

ACCT5949 Managing Agile Organisations

INFS 2605 BUSINESS APPLICATION PROGRAMMING

INFS5873 Business Analytics. Course Outline Semester 2, 2014

INFS5621 ENTERPRISE RESOURCE PLANNING (ERP) SYSTEMS

Mathematics Department Laboratory Technology - Analytical Chemistry Program Calculus I 201-NYA-05

FINS 3635 OPTIONS, FUTURES AND RISK MANAGEMENT TECHNIQUES

Master of Science (MS) in Biostatistics Program Director and Academic Advisor:


Australian School of Business. School of Banking and Finance FINS 1613

ACCT5910 BUSINESS ANALYSIS AND VALUATION. Course Outline Summer School, 2014

MART466 Digital Marketing COURSE OUTLINE

INFS 2603 BUSINESS SYSTEMS ANALYSIS

THE UNIVERSITY OF NEW SOUTH WALES

STAT 121 Hybrid SUMMER 2014 Introduction to Statistics for the Social Sciences Session I: May 27 th July 3 rd

PHILOSOPHY: THINKING ABOUT REASONING

INFS5978 ACCOUNTING INFORMATION SYSTEMS. Course Outline Semester 2, 2013

City University of Hong Kong. Information on a Course offered by Department of Information Systems with effect from Semester A in 2008 / 2009

ROCHESTER INSTITUTE OF TECHNOLOGY COURSE OUTLINE FORM COLLEGE OF SCIENCE. School of Mathematical Sciences

Australian School of Business School of Marketing MARK5811 APPLIED MARKETING RESEARCH

Current requirements for a major (page 83 of current catalog)

Service courses for graduate students in degree programs other than the MS or PhD programs in Biostatistics.

Australian School of Business School of Accounting ACCT5930 FINANCIAL ACCOUNTING

BUAD 310 Applied Business Statistics. Syllabus Fall 2013

Truman College-Mathematics Department Math 125-CD: Introductory Statistics Course Syllabus Fall 2012

UNIVERSITY OF SOUTHERN CALIFORNIA SCHOOL OF SOCIAL WORK MULTIPLE REGRESSION FOR SOCIAL WORK RESEARCH SOWK 761L, SPRING 2012

College of Health and Human Services. Fall Syllabus

MMPA 513 ACCOUNTING SYSTEMS

Decision Sciences Data Analysis for Managers

Section Format Day Begin End Building Rm# Instructor. 001 Lecture Tue 6:45 PM 8:40 PM Silver 401 Ballerini

Introductory Chemistry (Allied Health Emphasis)- Chem 1406 Course Syllabus: Summer 2015

PSYC 3018 Abnormal Psychology

QUAN 102 STATISTICS FOR BUSINESS

Statistics with Aviation Applications Math 211 Mode of Delivery Lecture Blended Course Syllabus

Data Mining and Business Intelligence CIT-6-DMB. Faculty of Business 2011/2012. Level 6

ELEC2141 DIGITAL CIRCUIT DESIGN

Economic Statistics (ECON2006), Statistics and Research Design in Psychology (PSYC2010), Survey Design and Analysis (SOCI2007)

Australian School of Business School of Actuarial Studies. ACTL3002 / ACTL5105 Life Insurance and Superannuation Models

BASIC COURSE IN STATISTICS FOR MEDICAL DOCTORS, SCIENTISTS & OTHER RESEARCH INVESTIGATORS INFORMATION BROCHURE

Australian School of Business School of Accounting ACCT 5917 VALUE CREATION FROM THE OFFICE OF THE CFO

Unit Outline: KXA352 Software Engineering Project B

GGR462/JPG1914: GIS RESEARCH PROJECT. Course Outline

School of Public Health and Health Services Department of Prevention and Community Health

Statistics in Applications III. Distribution Theory and Inference

CBE 9190B ADVANCED STATISTICAL PROCESS ANALYSIS COURSE OUTLINE

Course outline. Code: PSY204 Title: Social Psychology

LAGUARDIA COMMUNITY COLLEGE CITY UNIVERSITY OF NEW YORK DEPARTMENT OF MATHEMATICS, ENGINEERING, AND COMPUTER SCIENCE

Math 1302 (College Algebra) Syllabus Fall 2015 (Online)

Australian School of Business School of Information Systems, Technology and Management INFS4806 / INFS5906 INFORMATION SYSTEMS FORENSICS

Faculty of Business School of Information Systems Technology and Management. INFS1602 Computer Information Systems

Ph.D. Biostatistics Note: All curriculum revisions will be updated immediately on the website

Information Technology Management Project INFS5740. Session 1, 2006

Use advanced techniques for summary and visualization of complex data for exploratory analysis and presentation.

Unit Outline: KXT312 Advanced Algorithmic Problem Solving & Programming

Australian School of Business School of Accounting ACCT 5907 CORPORATE FINANCIAL ANALYSIS

INFS3603 BUSINESS INTELLIGENCE (BI)

Course outline. Code: BUS706 Title: International Business Law and Ethics

STAT3016 Introduction to Bayesian Data Analysis

INFS1603 BUSINESS DATABASES

MATH 241: DISCRETE MATHEMATICS FOR COMPUTER SCIENCE, Winter CLASSROOM: Alumni Hall 112 Tuesdays and Thursdays, 6:00-8:15 pm

ATV - Lifetime Data Analysis

Department/Academic Unit: Public Health Sciences Degree Program: Biostatistics Collaborative Program

School of Humanities and Languages. ARTS2480, INTERMEDIATE FRENCH A Semester 1, 2015

School of Computing and Information Sciences

ENGG*4040 Medical Imaging Modalities Fall 2015

Transcription:

MATH5885 LONGITUDINAL DATA ANALYSIS Semester 1, 2013 CRICOS Provider No: 00098G 2013, School of Mathematics and Statistics, UNSW

MATH5885 Course Outline Information about the course Course Authority: William Dunsmuir Lecturer: William Dunsmuir Room RC-2057 Phone 9385 7035 email W.Dunsmuir@unsw.edu.au. Consultation: Monday 4-5pm Office 2-057. Credit, Prerequisites, Exclusions: This course counts for 6 Units of Credit (6UOC). There are no prerequisites for this course. There are no exclusions for this course. Teaching Format: There will be one 3.0 hour teaching session per week: Monday 5:00-8:00pm. Classes will start with formal lectures in the assigned lecture room (for approximately 2 hours depending on the material to be covered). After a short break classes will resume in the Computer Lab RC-M020 (for the balance of time until 8pm). Tutorials: There are no separate tutorials. My Blackboard: Almost all course material (such as lecture notes, assignments, project descriptions, lab instructions for R and SAS work, data and some other readings) will be provided via Blackboard. Material for the following week will be posted, at the latest, by Thursday of the current week. Course aims Longitudinal data arise when multiple measurements of a response are collected over time for each individual in the study. The aim of this course is to introduce the main statistical concepts, methods and models used in the analysis of longitudinal data. Relation to other mathematics and statistics courses This course is an elective course for the Master of Statistics program and a core course for the Master of Biostatistics program. Whilst there are no formal prerequisites for the course, it does make use of techniques based on statistical theory such as maximum likelihood estimation and hypothesis testing, and some familiar- 2

ity with these ideas is assumed. In this sense the course is related to MATH5905, Statistical Inference, which covers the theory behind some of the techniques we will apply. The course is also related to MATH5806, Applied Regression Analysis, which deals with modelling of relationships between variables, and MATH5855, Multivariate Analysis, which covers techniques for joint modelling of multiple responses and to MATH5845 Time Series Analysis which covers techniques for modelling data collected through time on one or a few series. The focus in MATH5885 is on modelling response profiles over time, whilst accounting for the correlation that will typically exist between measurements taken on the same individual. Longitudinal data are frequently collected as part of a clinical trial or an observational study, so the techniques learned will have applications in the types of studies covered in MATH5906, Design and Analysis of Clinical Trials, and MATH5826, Statistical Methods in Epidemiology. Student Learning Outcomes By the end of this course, you should be able to: Explain the key features of longitudinal data and what distinguishes them from cross-sectional data Use exploratory data analysis techniques to visualize patterns in longitudinal data using R Explain the different types of correlation that can occur in longitudinal data, and choose appropriate models for correlation structures in given datasets Derive the likelihood function for correlated data and use it to calculate maximum likelihood estimates Explain the concept of residual maximum likelihood estimation (REML) and use it to calculate REML estimates Show how linear models incorporating correlated errors can be used to analyse longitudinal data Explain the theory of linear mixed effects models for analysing continuous longitudinal data, and show how this extends to generalized linear mixed effects models for analysing non-normal longitudinal data Explain how marginal models can be used to analyse non-normal longitudinal data via generalized estimating equations Explain the distinction between a subject-specific and population-average interpretation of parameters 3

Classify the different types of missing data that occur in longitudinal studies and explain the implications for analysis Use R and SAS to fit the models studied to longitudinal datasets, and interpret the output Carry out estimation and inference for the models studied Check the validity of a model for longitudinal data Solve theoretical problems related to longitudinal data analysis Relation to graduate attributes Computing skills developed in this course will improve information literacy (Science Graduate Attribute 6). Assignments, problems, and lab exercises will develop research, inquiry and analytical thinking abilities (Science Graduate Attribute 1). Teaching strategies underpinning the course To support the learning outcomes, the course will use the following teaching strategies: Lectures, explaining the necessary statistical concepts and theory applicable to longitudinal studies Computer labs, providing essential practice in applying the techniques explained in lectures Independent study of the course notes and readings, to reflect more deeply on ideas introduced in lectures Problems and assignments, giving you an opportunity to independently solve theoretical problems, analyse datasets using statistical software, reflect on aspects of the course, and evaluate your understanding Assessment in this course will use problem-solving tasks of a similar form to those in practice problems and computer labs, to encourage the development of the core analytical and computing skills underpinning this course. 4

Rationale for learning and teaching strategies We believe that effective learning is best supported by a climate of inquiry, in which students are actively engaged in the learning process. Hence this course is structured with a strong emphasis on problem-solving tasks in computer labs and in assessment tasks, and students are expected to devote the majority of their class and study time to the solving of such tasks. Assessment Assessment in this course will consist of two assignments (10% each), a substantial project (20%), and a final examination (60%). Knowledge and abilities assessed: All assessment tasks will assess the learning outcomes outlined above, specifically, the ability to explain the concepts and theory underlying statistical techniques for longitudinal data, to apply the techniques in analysing real datasets and critically interpret the results of analyses, and to solve theoretical problems related to the statistical aspects of longitudinal studies. Assessment criteria: The main criteria for marking assessment tasks involving explanation of theory and solution of theoretical problems will be clear and logical presentation of correct solutions. In the case of assessment tasks involving the application of techniques to the analysis of real datasets, the main criteria will be selection and justification of appropriate analysis methods; clear, logical, and well-documented computer code; well-organised output giving evidence of successful implementation; correct interpretation of results; and clear, complete, and fully justified conclusions. Assignments Rationale: Assignments will give students an opportunity to practice solving theoretical problems related to longitudinal data analysis, and to apply statistical methods in analysing real datasets and interpreting the results of those analyses. Assignments and projects must be YOUR OWN WORK, or severe penalties will be incurred. You should consult the University web page on plagiarism www.lc.unsw.edu.au/plagiarism Schedule and weighting: Task Date Avail. Date Due Form of Submission Weighting Asst 1 Week 3 5:00pm Mon Week 5 Written 10% Asst 2 Week 7 5:00pm Mon Week 9 Written 10% 5

Assignments must be submitted by 5:00pm (at the start of lecture class). In general, late assignments will NOT be accepted without good documented reasons (such as illness or misadventure). Project Rationale: The project will develop student abilities in modelling a real and substantive set of longitudinal data and will ensure that students are fluent in applying the modelling and software skills learnt in the course so that they can reach appropriate conclusions and write a clear summary of their findings. Duration:The final project is due at the start of lectures in Week 11 so that I can provide feedback and a final mark by Week 12 lecture. A brief verbal progress presentation to groups of class members will be required in week 8. Further details of the project requirements and the allocation of marks for the progress and final reports will be available in week 4 of the course. Weighting: 20% of your final mark. Examination Duration: 2 hours. Rationale: The final examination will assess student mastery of the material covered in the lectures. Weighting: 60% of your final mark. Further details about the final examination will be available in class closer to the time. Additional resources and support Lecture notes Lecture notes (including computer lab exercises and practice problems) will be distributed via Blackboard by the end of the week preceding the lecture. Textbooks There are no set textbooks. 6

References Students in the past have found the following references to be helpful and the course material is strongly linked to these, particularly the first listed: 1. Fitzmaurice GM, Laird NM, and Ware JH. (2004). Applied Longitudinal Analysis. John Wiley and Sons. 2. Diggle PJ, Heagerty P, Liang K-Y, and Zeger SL. (2002). Analysis of Longitudinal Data. Other references will be suggested in class. Blackboard All course materials will be available on Blackboard. You should check regularly for new materials. Computer laboratories Computer laboratories (RC-M020 and RC-G012) are open 9-5 Monday-Friday on teaching days. RC-M020 has extended teaching hours (usually 8:30-9pm Monday- Friday, and 9-5 Monday-Friday on non-teaching weeks). UNSW Library website: http://info.library.unsw.edu.au/web/services/services.html Course Evaluation and Development The School of Mathematics and Statistics evaluates each course each time it is run. We carefully consider the student responses and their implications for course development. In response to the 2009 student evaluations of the course the midsession test has been replaced by a substantial project and some elements of course content and structure have been modified. Administrative matters Important Information for Postgraduate Course Work Students See http://www.maths.unsw.edu.au/honpg/current/coursework/pginfos109.pdf 7

School Rules and Regulations Fuller details of the general rules regarding attendance, release of marks, special consideration etc are available via the School of Mathematics and Statistics Web page at http://www.maths.unsw.edu.au/students/current/policies/studentpolicy.html. UNSW Occupational Health and Safety policies and expectations: https://my.unsw.edu.au/student/atoz/occupationalhealth.html Student equity and diversity issues should be directed to the Student Equity Officers (Disability) in the Student Equity and Diversity Unit (9385-4734). Students with Disabilities Further information for students with disabilities is available at https://my.unsw.edu.au/student/atoz/ Plagiarism and academic honesty Plagiarism is the presentation of the thoughts or work of another as one s own. Issues you must be aware of regarding plagiarism and the university s policies on academic honesty and plagiarism can be found at http://www.lc.unsw.edu.au/plagiarism and https://my.unsw.edu.au/student/academiclife/assessment/studentmisconduct.html. 8

Course schedule An approximate schedule for the course is given below. Date of class Topic Computer lab Assessment Week 1 Characteristics Lab 1 4 March of longitudinal data Week 2 Exploratory Lab 2 11 March data analysis Week 3 Summary statistics and Lab 3 Assignment 1 18 March other simple approaches available Week 4 Correlation models Lab 4 Project 25 March Details Mid-session break Week 5 Linear models Lab 5 Assignment 1 8 April for correlated data due 5:00pm Week 6 Maximum likelihood Lab 6 15 April estimation and REML Week 7 Linear mixed models Lab 7 Assignment 2 22 April available Week 8 Linear mixed models Lab 8 Project 29 April Presentation Week 9 Generalized linear Lab 9 Assignment 2 6 May mixed models due 5:00pm Week 10 Generalized linear Lab 10 13 May mixed models Week 11 Generalized LMMs Lab 11 Project 20 May Missing Data Final Report Week 12 Missing Data, Review Lab 12 27 May 9