Tips for surviving the analysis of survival data. Philip Twumasi-Ankrah, PhD

Size: px
Start display at page:

Download "Tips for surviving the analysis of survival data. Philip Twumasi-Ankrah, PhD"

Transcription

1 Tips for surviving the analysis of survival data Philip Twumasi-Ankrah, PhD

2 Big picture In medical research and many other areas of research, we often confront continuous, ordinal or dichotomous outcomes For these outcomes, we have a very well structured set of methods/tools of analysis

3 Types of Analysis Based on Variable Characteristics

4 Types of Analysis Based on Measurement Scale

5 Survival Analysis The Idea One other common outcome is time to event (survival time)

6 Time-to-event outcome The Idea

7 What is Survival Analysis? Survival Analysis is referred to statistical methods for analyzing survival data

8 Survival Analysis The Idea Survival Analysis is also known as Reliability theory or reliability analysis in engineering, Duration analysis or duration modeling in economics or Event history analysis in sociology.

9 What is Survival Analysis? Survival Analysis is referred to statistical methods for analyzing survival data Survival data could be derived from laboratory studies of animals or from clinical and epidemiologic studies Survival data could relate to outcomes for studying acute or chronic diseases

10 Survival Analysis The Idea Survival analysis attempts to answer questions such as: What is the fraction of a population which will survive past a certain time? Of those that survive, at what rate will they die or fail? Can multiple causes of death or failure be taken into account? How do particular circumstances or characteristics increase or decrease the odds of survival?

11 Important Areas of Application Clinical Trials and Sources of Survival data Example: Recovery Time after heart surgery Longitudinal or Cohort Studies Example: Time to observing the event of interest Life Insurance Example: Time to file a claim Quality Control Example: The amount of force needed to damage a part such that it is not useable

12 Unique Features of Survival Event involved Analysis Progression on a dimension (usually time) until the event happens Length of progression may vary among subjects Event might not happen for some subjects

13 Terminology of Survival Analysis Time-to-event: The time from entry into a study until a subject has a particular outcome Censoring: Subjects are said to be censored if they are lost to follow up or drop out of the study, or if the study ends before they die or have an outcome of interest. They are counted as alive or disease-free for the time they were enrolled in the study.

14 Examples of Events Examples of events: Death, infection, MI, hospitalization Recurrence of cancer after treatment Marriage, soccer goal Light bulb fails, computer crashes Balloon filling with air bursts 14

15 Structure of Survival Data Two-variable outcome : Time variable: t i = time at last diseasefree observation or time at event Censoring variable: c i =1 if had the event; c i =0 no event by time t i

16 Censoring Incomplete observations Right Incomplete follow-up Common and Easy to deal with Left Event has occurred before observation started (T 0 ), but exact time is unknown Not easy to deal with

17 Right Censoring May be due to: Event had not occurred at termination of the study Event occurred due to a cause that is not the cause of interest Loss to follow-up or drop-out of study. In this situation, we know that subject survived at least to time t.

18 Left Censoring Examples: Age smoking starts Data from interviews of adults Adult subject reports regular smoking Does not remember when he started smoking regularly Study of incidence of CMV infection in children Two subjects already infected at enrollment

19 Key Assumption with Censoring Censoring is independent of intervention and event of interest. Those still at risk at time t in the study are a random sample of the population at risk at time t, for all t This assumption means that the risk of the event occuring can be estimated in a fair/unbiased/valid way

20 Censoring with Covariate Effect Censoring must be independent within group Censoring must be independent given X Censoring can depend on X Among those with the same values of X, censored subjects must be at similar risk of subsequent events as subjects with continued follow-up Censoring can be different across groups

21 Other Concepts Truncation is about entering the study Right: Event has occurred (e.g. cancer registry) Left: staggered entry Remember: Censoring is about leaving the study Right: Incomplete follow-up (common) Left: Observed time > survival time

22 Left Truncation More in epidemiology than in medical studies Key Assumption Those who enter the study at time t are a random sample of those in the population still at risk at t. Example: Observational study of seizures in young children What is the relation between vaccine immunization and risk of first seizure? Time axis = age Some children observed from birth Others move in to the area at a later time but were Included at the time of entry into the cohort

23 Time Notation Denote observation time by t t defines the time axis (scale) t = 0 is the time origin or beginning of observation tmax = end of observation T: random outcome variable time at which event occurs Example: (T = 3) denotes a determination of event occurrence (s) at time 3 units.

24 Example I Recurrence of herpes lesions after treatment for a primary episode Event = recurrence Time origin = end of primary episode Time scale = months from end of primary episode T = time from end of primary episode to first recurrence

25 Example II Occupational exposure at nickel refinery Event = death from lung cancer Origin = first exposure Employment at refinery Scale = years since first exposure T = time: first employed to death from LC

26 Population Mortality Event = death Time origin = date of birth Time scale = age (years) T = age at death

27 Analysis of Time-To-Event Data

28 Remember: Features of Survival Event involved Analysis Progression on a dimension (usually time) until the event happens Length of progression may vary among subjects Event might not happen for some subjects

29 Analysis of Time-To-Event Data There are certain aspects of survival analysis data, such as Censoring and Non-normality, That generate great difficulty when trying to analyze the data using traditional statistical models such as multiple linear regression. The non-normality aspect of the data violates the normality assumption of most commonly used statistical model such as regression or ANOVA, etc.

30 Analysis of Time-To-Event Data Why not compare mean time-to-event between your groups using a t-test or linear regression? ignores censoring Why not compare proportion of events in your groups using risk/odds ratios or logistic regression? ignores time

31 Analysis of Time-To-Event Data The Right Tool for the Right Job

32 What is survival analysis? Model time to failure or time to event Unlike linear regression, survival analysis has a dichotomous (binary) outcome Unlike logistic regression, survival analysis analyzes the time to an event Able to account for censoring

33 Objectives of Survival Analysis Estimate time-to-event for a group of individuals, such as time until second heartattack for a group of MI patients. To compare time-to-event between two or more groups, such as treated vs. placebo MI patients in a randomized controlled trial. To assess the relationship of co-variables to time-to-event, such as: does weight, insulin resistance, or cholesterol influence survival time of MI patients?

34 Concepts in Survival Analysis Survival Function - A function describing the proportion of individuals surviving to or beyond a given time. Notation: T survival time of a randomly selected individual t a specific point in time. S(t) = P(T > t) Survival Function λ(t) instantaneous failure rate at time t aka hazard function

35 Tips for the Analysis of Survival Data In any data analysis it is always a great idea to do some univariate analysis before proceeding to more complicated models. In survival analysis it is highly recommended to look at the Kaplan-Meier curves for all the categorical predictors. This will provide insight into the shape of the survival function for each group and give an idea of whether or not the groups are proportional (i.e. the survival functions are approximately parallel).

36 Tips for the Analysis of Survival Data We also consider the tests of equality across strata to explore differences in survival probability between levels of the predictor. It is not feasible to calculate a Kaplan-Meier curve for the continuous predictors since there would be a curve for each level of the predictor and a continuous predictor simply has too many different levels. Instead we consider the Cox proportional hazard model with a single continuous predictor.

37 Estimation of The Survival Function Steps Identify the observed failure times: t (1) < <t (k) Number of individuals at risk before t (i) n i Number of individuals with failure time t (i) d i Estimated hazard function at t (i)

38 Estimation of The Survival Function There are two ways to estimate the survival function The Life-Table Method Product-Moment Method or Kaplan-Meier Method

39 Example

40 Life-Table D = death; C = censored; N = number of individuals who are alive (at risk) at beginning of the interval N = N (C/2) = number of individuals who are at risk during the interval S(t) = cumulative survival

41 Kaplan-Meier Estimate The beginning of each interval is determined by death Each interval contains one death (or more if there are ties) N(t) includes individuals with censored data at t

42

43 Assumptions for KM method Survival probabilities are the same for patients entering into the study early or late Actual event time is known Patients who are censored have the same survival probabilities as those who continue to be followed

44 Comparison Of Two Survival Curves Let S (t) and S (t) be the survival 1 2 functions of the two groups. The null hypothesis is H : S (t) =S (t), for all t > The alternative hypothesis is: H : S (t) S (t), for some t >

45 Log-Rank Test to Compare 2 Survival Functions H 0 : Two Survival Functions are Identical H A : Two Survival Functions Differ T. S.: T P val MH R. R.: T MH = O 1 z V 1 α / 2 E : 2P( Z T 1 MH )

46 Limitations of Kaplan-Meier Mainly descriptive Doesn t control for covariates Requires categorical predictors Can t accommodate time-dependent variables

47 Cox Proportional Hazards Model Goal: Compare two or more groups (treatments), adjusting for other risk factors on survival times (like Multiple regression) p Explanatory variables (including dummy variables) Models Relative Risk of the event as function of time and covariates:

48 Example in SPSS

49 Life-Table

50

51 Output

52 Kaplan-Meier

53 Kaplan-Meier Two Groups

54 Adding Plots

55 Adding Plots

56 Cox Regression

57 Output

58 Questions?

Data Analysis, Research Study Design and the IRB

Data Analysis, Research Study Design and the IRB Minding the p-values p and Quartiles: Data Analysis, Research Study Design and the IRB Don Allensworth-Davies, MSc Research Manager, Data Coordinating Center Boston University School of Public Health IRB

More information

Study Design and Statistical Analysis

Study Design and Statistical Analysis Study Design and Statistical Analysis Anny H Xiang, PhD Department of Preventive Medicine University of Southern California Outline Designing Clinical Research Studies Statistical Data Analysis Designing

More information

Survival Analysis of Dental Implants. Abstracts

Survival Analysis of Dental Implants. Abstracts Survival Analysis of Dental Implants Andrew Kai-Ming Kwan 1,4, Dr. Fu Lee Wang 2, and Dr. Tak-Kun Chow 3 1 Census and Statistics Department, Hong Kong, China 2 Caritas Institute of Higher Education, Hong

More information

Life Tables. Marie Diener-West, PhD Sukon Kanchanaraksa, PhD

Life Tables. Marie Diener-West, PhD Sukon Kanchanaraksa, PhD This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this

More information

Study Design. Date: March 11, 2003 Reviewer: Jawahar Tiwari, Ph.D. Ellis Unger, M.D. Ghanshyam Gupta, Ph.D. Chief, Therapeutics Evaluation Branch

Study Design. Date: March 11, 2003 Reviewer: Jawahar Tiwari, Ph.D. Ellis Unger, M.D. Ghanshyam Gupta, Ph.D. Chief, Therapeutics Evaluation Branch BLA: STN 103471 Betaseron (Interferon β-1b) for the treatment of secondary progressive multiple sclerosis. Submission dated June 29, 1998. Chiron Corp. Date: March 11, 2003 Reviewer: Jawahar Tiwari, Ph.D.

More information

Lecture 15 Introduction to Survival Analysis

Lecture 15 Introduction to Survival Analysis Lecture 15 Introduction to Survival Analysis BIOST 515 February 26, 2004 BIOST 515, Lecture 15 Background In logistic regression, we were interested in studying how risk factors were associated with presence

More information

Kaplan-Meier Survival Analysis 1

Kaplan-Meier Survival Analysis 1 Version 4.0 Step-by-Step Examples Kaplan-Meier Survival Analysis 1 With some experiments, the outcome is a survival time, and you want to compare the survival of two or more groups. Survival curves show,

More information

Guide to Biostatistics

Guide to Biostatistics MedPage Tools Guide to Biostatistics Study Designs Here is a compilation of important epidemiologic and common biostatistical terms used in medical research. You can use it as a reference guide when reading

More information

Introduction. Survival Analysis. Censoring. Plan of Talk

Introduction. Survival Analysis. Censoring. Plan of Talk Survival Analysis Mark Lunt Arthritis Research UK Centre for Excellence in Epidemiology University of Manchester 01/12/2015 Survival Analysis is concerned with the length of time before an event occurs.

More information

Vignette for survrm2 package: Comparing two survival curves using the restricted mean survival time

Vignette for survrm2 package: Comparing two survival curves using the restricted mean survival time Vignette for survrm2 package: Comparing two survival curves using the restricted mean survival time Hajime Uno Dana-Farber Cancer Institute March 16, 2015 1 Introduction In a comparative, longitudinal

More information

Competency 1 Describe the role of epidemiology in public health

Competency 1 Describe the role of epidemiology in public health The Northwest Center for Public Health Practice (NWCPHP) has developed competency-based epidemiology training materials for public health professionals in practice. Epidemiology is broadly accepted as

More information

Evaluation of Treatment Pathways in Oncology: Modeling Approaches. Feng Pan, PhD United BioSource Corporation Bethesda, MD

Evaluation of Treatment Pathways in Oncology: Modeling Approaches. Feng Pan, PhD United BioSource Corporation Bethesda, MD Evaluation of Treatment Pathways in Oncology: Modeling Approaches Feng Pan, PhD United BioSource Corporation Bethesda, MD 1 Objectives Rationale for modeling treatment pathways Treatment pathway simulation

More information

Early mortality rate (EMR) in Acute Myeloid Leukemia (AML)

Early mortality rate (EMR) in Acute Myeloid Leukemia (AML) Early mortality rate (EMR) in Acute Myeloid Leukemia (AML) George Yaghmour, MD Hematology Oncology Fellow PGY5 UTHSC/West cancer Center, Memphis, TN May,1st,2015 Off-Label Use Disclosure(s) I do not intend

More information

13. Poisson Regression Analysis

13. Poisson Regression Analysis 136 Poisson Regression Analysis 13. Poisson Regression Analysis We have so far considered situations where the outcome variable is numeric and Normally distributed, or binary. In clinical work one often

More information

SPSS TRAINING SESSION 3 ADVANCED TOPICS (PASW STATISTICS 17.0) Sun Li Centre for Academic Computing lsun@smu.edu.sg

SPSS TRAINING SESSION 3 ADVANCED TOPICS (PASW STATISTICS 17.0) Sun Li Centre for Academic Computing lsun@smu.edu.sg SPSS TRAINING SESSION 3 ADVANCED TOPICS (PASW STATISTICS 17.0) Sun Li Centre for Academic Computing lsun@smu.edu.sg IN SPSS SESSION 2, WE HAVE LEARNT: Elementary Data Analysis Group Comparison & One-way

More information

The American Cancer Society Cancer Prevention Study I: 12-Year Followup

The American Cancer Society Cancer Prevention Study I: 12-Year Followup Chapter 3 The American Cancer Society Cancer Prevention Study I: 12-Year Followup of 1 Million Men and Women David M. Burns, Thomas G. Shanks, Won Choi, Michael J. Thun, Clark W. Heath, Jr., and Lawrence

More information

Regression Modeling Strategies

Regression Modeling Strategies Frank E. Harrell, Jr. Regression Modeling Strategies With Applications to Linear Models, Logistic Regression, and Survival Analysis With 141 Figures Springer Contents Preface Typographical Conventions

More information

Measures of Prognosis. Sukon Kanchanaraksa, PhD Johns Hopkins University

Measures of Prognosis. Sukon Kanchanaraksa, PhD Johns Hopkins University This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this

More information

Biostatistics: Types of Data Analysis

Biostatistics: Types of Data Analysis Biostatistics: Types of Data Analysis Theresa A Scott, MS Vanderbilt University Department of Biostatistics theresa.scott@vanderbilt.edu http://biostat.mc.vanderbilt.edu/theresascott Theresa A Scott, MS

More information

Design and Analysis of Phase III Clinical Trials

Design and Analysis of Phase III Clinical Trials Cancer Biostatistics Center, Biostatistics Shared Resource, Vanderbilt University School of Medicine June 19, 2008 Outline 1 Phases of Clinical Trials 2 3 4 5 6 Phase I Trials: Safety, Dosage Range, and

More information

Appendix G STATISTICAL METHODS INFECTIOUS METHODS STATISTICAL ROADMAP. Prepared in Support of: CDC/NCEH Cross Sectional Assessment Study.

Appendix G STATISTICAL METHODS INFECTIOUS METHODS STATISTICAL ROADMAP. Prepared in Support of: CDC/NCEH Cross Sectional Assessment Study. Appendix G STATISTICAL METHODS INFECTIOUS METHODS STATISTICAL ROADMAP Prepared in Support of: CDC/NCEH Cross Sectional Assessment Study Prepared by: Centers for Disease Control and Prevention National

More information

Gordon S. Linoff Founder Data Miners, Inc. gordon@data-miners.com

Gordon S. Linoff Founder Data Miners, Inc. gordon@data-miners.com Survival Data Mining Gordon S. Linoff Founder Data Miners, Inc. gordon@data-miners.com What to Expect from this Talk Background on survival analysis from a data miner s perspective Introduction to key

More information

Advanced Quantitative Methods for Health Care Professionals PUBH 742 Spring 2015

Advanced Quantitative Methods for Health Care Professionals PUBH 742 Spring 2015 1 Advanced Quantitative Methods for Health Care Professionals PUBH 742 Spring 2015 Instructor: Joanne M. Garrett, PhD e-mail: joanne_garrett@med.unc.edu Class Notes: Copies of the class lecture slides

More information

Survival Analysis of Left Truncated Income Protection Insurance Data. [March 29, 2012]

Survival Analysis of Left Truncated Income Protection Insurance Data. [March 29, 2012] Survival Analysis of Left Truncated Income Protection Insurance Data [March 29, 2012] 1 Qing Liu 2 David Pitt 3 Yan Wang 4 Xueyuan Wu Abstract One of the main characteristics of Income Protection Insurance

More information

Missing data and net survival analysis Bernard Rachet

Missing data and net survival analysis Bernard Rachet Workshop on Flexible Models for Longitudinal and Survival Data with Applications in Biostatistics Warwick, 27-29 July 2015 Missing data and net survival analysis Bernard Rachet General context Population-based,

More information

Statistics for Biology and Health

Statistics for Biology and Health Statistics for Biology and Health Series Editors M. Gail, K. Krickeberg, J.M. Samet, A. Tsiatis, W. Wong For further volumes: http://www.springer.com/series/2848 David G. Kleinbaum Mitchel Klein Survival

More information

An Application of the G-formula to Asbestos and Lung Cancer. Stephen R. Cole. Epidemiology, UNC Chapel Hill. Slides: www.unc.

An Application of the G-formula to Asbestos and Lung Cancer. Stephen R. Cole. Epidemiology, UNC Chapel Hill. Slides: www.unc. An Application of the G-formula to Asbestos and Lung Cancer Stephen R. Cole Epidemiology, UNC Chapel Hill Slides: www.unc.edu/~colesr/ 1 Acknowledgements Collaboration with David B. Richardson, Haitao

More information

An Application of Weibull Analysis to Determine Failure Rates in Automotive Components

An Application of Weibull Analysis to Determine Failure Rates in Automotive Components An Application of Weibull Analysis to Determine Failure Rates in Automotive Components Jingshu Wu, PhD, PE, Stephen McHenry, Jeffrey Quandt National Highway Traffic Safety Administration (NHTSA) U.S. Department

More information

Statistics Graduate Courses

Statistics Graduate Courses Statistics Graduate Courses STAT 7002--Topics in Statistics-Biological/Physical/Mathematics (cr.arr.).organized study of selected topics. Subjects and earnable credit may vary from semester to semester.

More information

200609 - ATV - Lifetime Data Analysis

200609 - ATV - Lifetime Data Analysis Coordinating unit: Teaching unit: Academic year: Degree: ECTS credits: 2015 200 - FME - School of Mathematics and Statistics 715 - EIO - Department of Statistics and Operations Research 1004 - UB - (ENG)Universitat

More information

Introduction to Event History Analysis DUSTIN BROWN POPULATION RESEARCH CENTER

Introduction to Event History Analysis DUSTIN BROWN POPULATION RESEARCH CENTER Introduction to Event History Analysis DUSTIN BROWN POPULATION RESEARCH CENTER Objectives Introduce event history analysis Describe some common survival (hazard) distributions Introduce some useful Stata

More information

ANNEX 2: Assessment of the 7 points agreed by WATCH as meriting attention (cover paper, paragraph 9, bullet points) by Andy Darnton, HSE

ANNEX 2: Assessment of the 7 points agreed by WATCH as meriting attention (cover paper, paragraph 9, bullet points) by Andy Darnton, HSE ANNEX 2: Assessment of the 7 points agreed by WATCH as meriting attention (cover paper, paragraph 9, bullet points) by Andy Darnton, HSE The 7 issues to be addressed outlined in paragraph 9 of the cover

More information

If several different trials are mentioned in one publication, the data of each should be extracted in a separate data extraction form.

If several different trials are mentioned in one publication, the data of each should be extracted in a separate data extraction form. General Remarks This template of a data extraction form is intended to help you to start developing your own data extraction form, it certainly has to be adapted to your specific question. Delete unnecessary

More information

Quantifying Life expectancy in people with Type 2 diabetes

Quantifying Life expectancy in people with Type 2 diabetes School of Public Health University of Sydney Quantifying Life expectancy in people with Type 2 diabetes Alison Hayes School of Public Health University of Sydney The evidence Life expectancy reduced by

More information

Basic Study Designs in Analytical Epidemiology For Observational Studies

Basic Study Designs in Analytical Epidemiology For Observational Studies Basic Study Designs in Analytical Epidemiology For Observational Studies Cohort Case Control Hybrid design (case-cohort, nested case control) Cross-Sectional Ecologic OBSERVATIONAL STUDIES (Non-Experimental)

More information

Cancer research in the Midland Region the prostate and bowel cancer projects

Cancer research in the Midland Region the prostate and bowel cancer projects Cancer research in the Midland Region the prostate and bowel cancer projects Ross Lawrenson Waikato Clinical School University of Auckland MoH/HRC Cancer Research agenda Lung cancer Palliative care Prostate

More information

Dealing with Missing Data

Dealing with Missing Data Dealing with Missing Data Roch Giorgi email: roch.giorgi@univ-amu.fr UMR 912 SESSTIM, Aix Marseille Université / INSERM / IRD, Marseille, France BioSTIC, APHM, Hôpital Timone, Marseille, France January

More information

Silvermine House Steenberg Office Park, Tokai 7945 Cape Town, South Africa Telephone: +27 21 702 4666 www.spss-sa.com

Silvermine House Steenberg Office Park, Tokai 7945 Cape Town, South Africa Telephone: +27 21 702 4666 www.spss-sa.com SPSS-SA Silvermine House Steenberg Office Park, Tokai 7945 Cape Town, South Africa Telephone: +27 21 702 4666 www.spss-sa.com SPSS-SA Training Brochure 2009 TABLE OF CONTENTS 1 SPSS TRAINING COURSES FOCUSING

More information

Nominal and ordinal logistic regression

Nominal and ordinal logistic regression Nominal and ordinal logistic regression April 26 Nominal and ordinal logistic regression Our goal for today is to briefly go over ways to extend the logistic regression model to the case where the outcome

More information

Chapter 5 Analysis of variance SPSS Analysis of variance

Chapter 5 Analysis of variance SPSS Analysis of variance Chapter 5 Analysis of variance SPSS Analysis of variance Data file used: gss.sav How to get there: Analyze Compare Means One-way ANOVA To test the null hypothesis that several population means are equal,

More information

SECOND M.B. AND SECOND VETERINARY M.B. EXAMINATIONS INTRODUCTION TO THE SCIENTIFIC BASIS OF MEDICINE EXAMINATION. Friday 14 March 2008 9.00-9.

SECOND M.B. AND SECOND VETERINARY M.B. EXAMINATIONS INTRODUCTION TO THE SCIENTIFIC BASIS OF MEDICINE EXAMINATION. Friday 14 March 2008 9.00-9. SECOND M.B. AND SECOND VETERINARY M.B. EXAMINATIONS INTRODUCTION TO THE SCIENTIFIC BASIS OF MEDICINE EXAMINATION Friday 14 March 2008 9.00-9.45 am Attempt all ten questions. For each question, choose the

More information

Journal of Statistical Software

Journal of Statistical Software JSS Journal of Statistical Software January 2011, Volume 38, Issue 5. http://www.jstatsoft.org/ Lexis: An R Class for Epidemiological Studies with Long-Term Follow-Up Martyn Plummer International Agency

More information

CHILDHOOD CANCER SURVIVOR STUDY Analysis Concept Proposal

CHILDHOOD CANCER SURVIVOR STUDY Analysis Concept Proposal CHILDHOOD CANCER SURVIVOR STUDY Analysis Concept Proposal 1. STUDY TITLE: Longitudinal Assessment of Chronic Health Conditions: The Aging of Childhood Cancer Survivors 2. WORKING GROUP AND INVESTIGATORS:

More information

Introduction to Longitudinal Data Analysis

Introduction to Longitudinal Data Analysis Introduction to Longitudinal Data Analysis Longitudinal Data Analysis Workshop Section 1 University of Georgia: Institute for Interdisciplinary Research in Education and Human Development Section 1: Introduction

More information

Chi Squared and Fisher's Exact Tests. Observed vs Expected Distributions

Chi Squared and Fisher's Exact Tests. Observed vs Expected Distributions BMS 617 Statistical Techniques for the Biomedical Sciences Lecture 11: Chi-Squared and Fisher's Exact Tests Chi Squared and Fisher's Exact Tests This lecture presents two similarly structured tests, Chi-squared

More information

2 Precision-based sample size calculations

2 Precision-based sample size calculations Statistics: An introduction to sample size calculations Rosie Cornish. 2006. 1 Introduction One crucial aspect of study design is deciding how big your sample should be. If you increase your sample size

More information

Likelihood of Cancer

Likelihood of Cancer Suggested Grade Levels: 9 and up Likelihood of Cancer Possible Subject Area(s): Social Studies, Health, and Science Math Skills: reading and interpreting pie charts; calculating and understanding percentages

More information

Organizing Your Approach to a Data Analysis

Organizing Your Approach to a Data Analysis Biost/Stat 578 B: Data Analysis Emerson, September 29, 2003 Handout #1 Organizing Your Approach to a Data Analysis The general theme should be to maximize thinking about the data analysis and to minimize

More information

Service courses for graduate students in degree programs other than the MS or PhD programs in Biostatistics.

Service courses for graduate students in degree programs other than the MS or PhD programs in Biostatistics. Course Catalog In order to be assured that all prerequisites are met, students must acquire a permission number from the education coordinator prior to enrolling in any Biostatistics course. Courses are

More information

7.1 The Hazard and Survival Functions

7.1 The Hazard and Survival Functions Chapter 7 Survival Models Our final chapter concerns models for the analysis of data which have three main characteristics: (1) the dependent variable or response is the waiting time until the occurrence

More information

Mortality Assessment Technology: A New Tool for Life Insurance Underwriting

Mortality Assessment Technology: A New Tool for Life Insurance Underwriting Mortality Assessment Technology: A New Tool for Life Insurance Underwriting Guizhou Hu, MD, PhD BioSignia, Inc, Durham, North Carolina Abstract The ability to more accurately predict chronic disease morbidity

More information

Sample Size Planning, Calculation, and Justification

Sample Size Planning, Calculation, and Justification Sample Size Planning, Calculation, and Justification Theresa A Scott, MS Vanderbilt University Department of Biostatistics theresa.scott@vanderbilt.edu http://biostat.mc.vanderbilt.edu/theresascott Theresa

More information

Efficacy analysis and graphical representation in Oncology trials - A case study

Efficacy analysis and graphical representation in Oncology trials - A case study Efficacy analysis and graphical representation in Oncology trials - A case study Anindita Bhattacharjee Vijayalakshmi Indana Cytel, Pune The views expressed in this presentation are our own and do not

More information

SOLUTIONS TO BIOSTATISTICS PRACTICE PROBLEMS

SOLUTIONS TO BIOSTATISTICS PRACTICE PROBLEMS SOLUTIONS TO BIOSTATISTICS PRACTICE PROBLEMS BIOSTATISTICS DESCRIBING DATA, THE NORMAL DISTRIBUTION SOLUTIONS 1. a. To calculate the mean, we just add up all 7 values, and divide by 7. In Xi i= 1 fancy

More information

Komorbide brystkræftpatienter kan de tåle behandling? Et registerstudie baseret på Danish Breast Cancer Cooperative Group

Komorbide brystkræftpatienter kan de tåle behandling? Et registerstudie baseret på Danish Breast Cancer Cooperative Group Komorbide brystkræftpatienter kan de tåle behandling? Et registerstudie baseret på Danish Breast Cancer Cooperative Group Lotte Holm Land MD, ph.d. Onkologisk Afd. R. OUH Kræft og komorbiditet - alle skal

More information

How to get accurate sample size and power with nquery Advisor R

How to get accurate sample size and power with nquery Advisor R How to get accurate sample size and power with nquery Advisor R Brian Sullivan Statistical Solutions Ltd. ASA Meeting, Chicago, March 2007 Sample Size Two group t-test χ 2 -test Survival Analysis 2 2 Crossover

More information

Program Attendance in 41 Youth Smoking Cessation Programs in the U.S.

Program Attendance in 41 Youth Smoking Cessation Programs in the U.S. Program Attendance in 41 Youth Smoking Cessation Programs in the U.S. Zhiqun Tang, Robert Orwin, PhD, Kristie Taylor, PhD, Charles Carusi, PhD, Susan J. Curry, PhD, Sherry L. Emery, PhD, Amy K. Sporer,

More information

LOGISTIC REGRESSION ANALYSIS

LOGISTIC REGRESSION ANALYSIS LOGISTIC REGRESSION ANALYSIS C. Mitchell Dayton Department of Measurement, Statistics & Evaluation Room 1230D Benjamin Building University of Maryland September 1992 1. Introduction and Model Logistic

More information

Chapter 1. Longitudinal Data Analysis. 1.1 Introduction

Chapter 1. Longitudinal Data Analysis. 1.1 Introduction Chapter 1 Longitudinal Data Analysis 1.1 Introduction One of the most common medical research designs is a pre-post study in which a single baseline health status measurement is obtained, an intervention

More information

Department/Academic Unit: Public Health Sciences Degree Program: Biostatistics Collaborative Program

Department/Academic Unit: Public Health Sciences Degree Program: Biostatistics Collaborative Program Department/Academic Unit: Public Health Sciences Degree Program: Biostatistics Collaborative Program Department of Mathematics and Statistics Degree Level Expectations, Learning Outcomes, Indicators of

More information

Predicting Customer Churn in the Telecommunications Industry An Application of Survival Analysis Modeling Using SAS

Predicting Customer Churn in the Telecommunications Industry An Application of Survival Analysis Modeling Using SAS Paper 114-27 Predicting Customer in the Telecommunications Industry An Application of Survival Analysis Modeling Using SAS Junxiang Lu, Ph.D. Sprint Communications Company Overland Park, Kansas ABSTRACT

More information

Exercise Answers. Exercise 3.1 1. B 2. C 3. A 4. B 5. A

Exercise Answers. Exercise 3.1 1. B 2. C 3. A 4. B 5. A Exercise Answers Exercise 3.1 1. B 2. C 3. A 4. B 5. A Exercise 3.2 1. A; denominator is size of population at start of study, numerator is number of deaths among that population. 2. B; denominator is

More information

Social inequalities in all cause and cause specific mortality in a country of the African region

Social inequalities in all cause and cause specific mortality in a country of the African region Social inequalities in all cause and cause specific mortality in a country of the African region Silvia STRINGHINI 1, Valentin Rousson 1, Bharathi Viswanathan 2, Jude Gedeon 2, Fred Paccaud 1, Pascal Bovet

More information

Personalized Predictive Medicine and Genomic Clinical Trials

Personalized Predictive Medicine and Genomic Clinical Trials Personalized Predictive Medicine and Genomic Clinical Trials Richard Simon, D.Sc. Chief, Biometric Research Branch National Cancer Institute http://brb.nci.nih.gov brb.nci.nih.gov Powerpoint presentations

More information

A LONGITUDINAL AND SURVIVAL MODEL WITH HEALTH CARE USAGE FOR INSURED ELDERLY. Workshop

A LONGITUDINAL AND SURVIVAL MODEL WITH HEALTH CARE USAGE FOR INSURED ELDERLY. Workshop A LONGITUDINAL AND SURVIVAL MODEL WITH HEALTH CARE USAGE FOR INSURED ELDERLY Ramon Alemany Montserrat Guillén Xavier Piulachs Lozada Riskcenter - IREA Universitat de Barcelona http://www.ub.edu/riskcenter

More information

Introduction to Survival Analysis

Introduction to Survival Analysis John Fox Lecture Notes Introduction to Survival Analysis Copyright 2014 by John Fox Introduction to Survival Analysis 1 1. Introduction I Survival analysis encompasses a wide variety of methods for analyzing

More information

Hormones and cardiovascular disease, what the Danish Nurse Cohort learned us

Hormones and cardiovascular disease, what the Danish Nurse Cohort learned us Hormones and cardiovascular disease, what the Danish Nurse Cohort learned us Ellen Løkkegaard, Clinical Associate Professor, Ph.d. Dept. Obstetrics and Gynecology. Hillerød Hospital, University of Copenhagen

More information

Statistical Models in R

Statistical Models in R Statistical Models in R Some Examples Steven Buechler Department of Mathematics 276B Hurley Hall; 1-6233 Fall, 2007 Outline Statistical Models Structure of models in R Model Assessment (Part IA) Anova

More information

How To Model The Fate Of An Animal

How To Model The Fate Of An Animal Models Where the Fate of Every Individual is Known This class of models is important because they provide a theory for estimation of survival probability and other parameters from radio-tagged animals.

More information

(1) Comparison of studies with different follow-up periods

(1) Comparison of studies with different follow-up periods (1) Comparison of studies with different follow-up periods Is the absolute potency of amphiboles and relative potency of chrysotile underestimated because of studies with substantially incomplete follow-up?

More information

11. Analysis of Case-control Studies Logistic Regression

11. Analysis of Case-control Studies Logistic Regression Research methods II 113 11. Analysis of Case-control Studies Logistic Regression This chapter builds upon and further develops the concepts and strategies described in Ch.6 of Mother and Child Health:

More information

Glossary of Statistical Terms

Glossary of Statistical Terms Department of Biostatistics Vanderbilt University School of Medicine biostat.mc.vanderbilt.edu/clinstat May 26, 2015 Glossary of Statistical Terms adjusting or controlling for a variable: Assessing the

More information

Statistics in Retail Finance. Chapter 6: Behavioural models

Statistics in Retail Finance. Chapter 6: Behavioural models Statistics in Retail Finance 1 Overview > So far we have focussed mainly on application scorecards. In this chapter we shall look at behavioural models. We shall cover the following topics:- Behavioural

More information

Overview of study designs

Overview of study designs Overview of study designs In epidemiology, measuring the occurrence of disease or other healthrelated events in a population is only a beginning. Epidemiologists are also interested in assessing whether

More information

Kaplan-Meier Plot. Time to Event Analysis Diagnostic Plots. Outline. Simulating time to event. The Kaplan-Meier Plot. Visual predictive checks

Kaplan-Meier Plot. Time to Event Analysis Diagnostic Plots. Outline. Simulating time to event. The Kaplan-Meier Plot. Visual predictive checks 1 Time to Event Analysis Diagnostic Plots Nick Holford Dept Pharmacology & Clinical Pharmacology University of Auckland, New Zealand 2 Outline The Kaplan-Meier Plot Simulating time to event Visual predictive

More information

SUMAN DUVVURU STAT 567 PROJECT REPORT

SUMAN DUVVURU STAT 567 PROJECT REPORT SUMAN DUVVURU STAT 567 PROJECT REPORT SURVIVAL ANALYSIS OF HEROIN ADDICTS Background and introduction: Current illicit drug use among teens is continuing to increase in many countries around the world.

More information

Competing-risks regression

Competing-risks regression Competing-risks regression Roberto G. Gutierrez Director of Statistics StataCorp LP Stata Conference Boston 2010 R. Gutierrez (StataCorp) Competing-risks regression July 15-16, 2010 1 / 26 Outline 1. Overview

More information

Methods for Meta-analysis in Medical Research

Methods for Meta-analysis in Medical Research Methods for Meta-analysis in Medical Research Alex J. Sutton University of Leicester, UK Keith R. Abrams University of Leicester, UK David R. Jones University of Leicester, UK Trevor A. Sheldon University

More information

Tests for Two Survival Curves Using Cox s Proportional Hazards Model

Tests for Two Survival Curves Using Cox s Proportional Hazards Model Chapter 730 Tests for Two Survival Curves Using Cox s Proportional Hazards Model Introduction A clinical trial is often employed to test the equality of survival distributions of two treatment groups.

More information

School of Public Health and Health Services Department of Epidemiology and Biostatistics

School of Public Health and Health Services Department of Epidemiology and Biostatistics School of Public Health and Health Services Department of Epidemiology and Biostatistics Master of Public Health and Graduate Certificate Biostatistics 0-04 Note: All curriculum revisions will be updated

More information

Linda Staub & Alexandros Gekenidis

Linda Staub & Alexandros Gekenidis Seminar in Statistics: Survival Analysis Chapter 2 Kaplan-Meier Survival Curves and the Log- Rank Test Linda Staub & Alexandros Gekenidis March 7th, 2011 1 Review Outcome variable of interest: time until

More information

Introduction to Statistics and Quantitative Research Methods

Introduction to Statistics and Quantitative Research Methods Introduction to Statistics and Quantitative Research Methods Purpose of Presentation To aid in the understanding of basic statistics, including terminology, common terms, and common statistical methods.

More information

Basic research methods. Basic research methods. Question: BRM.2. Question: BRM.1

Basic research methods. Basic research methods. Question: BRM.2. Question: BRM.1 BRM.1 The proportion of individuals with a particular disease who die from that condition is called... BRM.2 This study design examines factors that may contribute to a condition by comparing subjects

More information

L Lang-Lazdunski, A Bille, S Marshall, R Lal, D Landau, J Spicer

L Lang-Lazdunski, A Bille, S Marshall, R Lal, D Landau, J Spicer Pleurectomy/decortication, hyperthermic pleural lavage with povidone-iodine and systemic chemotherapy in malignant pleural mesothelioma. A 10-year experience. L Lang-Lazdunski, A Bille, S Marshall, R Lal,

More information

Recall this chart that showed how most of our course would be organized:

Recall this chart that showed how most of our course would be organized: Chapter 4 One-Way ANOVA Recall this chart that showed how most of our course would be organized: Explanatory Variable(s) Response Variable Methods Categorical Categorical Contingency Tables Categorical

More information

Survey, Statistics and Psychometrics Core Research Facility University of Nebraska-Lincoln. Log-Rank Test for More Than Two Groups

Survey, Statistics and Psychometrics Core Research Facility University of Nebraska-Lincoln. Log-Rank Test for More Than Two Groups Survey, Statistics and Psychometrics Core Research Facility University of Nebraska-Lincoln Log-Rank Test for More Than Two Groups Prepared by Harlan Sayles (SRAM) Revised by Julia Soulakova (Statistics)

More information

Ordinal Regression. Chapter

Ordinal Regression. Chapter Ordinal Regression Chapter 4 Many variables of interest are ordinal. That is, you can rank the values, but the real distance between categories is unknown. Diseases are graded on scales from least severe

More information

Logistic regression modeling the probability of success

Logistic regression modeling the probability of success Logistic regression modeling the probability of success Regression models are usually thought of as only being appropriate for target variables that are continuous Is there any situation where we might

More information

List of Examples. Examples 319

List of Examples. Examples 319 Examples 319 List of Examples DiMaggio and Mantle. 6 Weed seeds. 6, 23, 37, 38 Vole reproduction. 7, 24, 37 Wooly bear caterpillar cocoons. 7 Homophone confusion and Alzheimer s disease. 8 Gear tooth strength.

More information

Modeling the Claim Duration of Income Protection Insurance Policyholders Using Parametric Mixture Models

Modeling the Claim Duration of Income Protection Insurance Policyholders Using Parametric Mixture Models Modeling the Claim Duration of Income Protection Insurance Policyholders Using Parametric Mixture Models Abstract This paper considers the modeling of claim durations for existing claimants under income

More information

Master of Public Health Program Competencies. Implemented Fall 2015

Master of Public Health Program Competencies. Implemented Fall 2015 Master of Public Program Competencies Implemented Fall 2015 Master of Public Core Competencies SPH Q501 Biostatistics 1. Describe the roles biostatistics serve in the discipline of public health. 2. Apply

More information

Linda K. Muthén Bengt Muthén. Copyright 2008 Muthén & Muthén www.statmodel.com. Table Of Contents

Linda K. Muthén Bengt Muthén. Copyright 2008 Muthén & Muthén www.statmodel.com. Table Of Contents Mplus Short Courses Topic 2 Regression Analysis, Eploratory Factor Analysis, Confirmatory Factor Analysis, And Structural Equation Modeling For Categorical, Censored, And Count Outcomes Linda K. Muthén

More information

HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION

HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION HOD 2990 10 November 2010 Lecture Background This is a lightning speed summary of introductory statistical methods for senior undergraduate

More information

PRECOMBAT Trial. Seung-Whan Lee, MD, PhD On behalf of the PRECOMBAT Investigators

PRECOMBAT Trial. Seung-Whan Lee, MD, PhD On behalf of the PRECOMBAT Investigators Premier of Randomized Comparison of Bypass Surgery versus Angioplasty Using Sirolimus-Eluting Stent in Patients with Left Main Coronary Artery Disease PRECOMBAT Trial Seung-Whan Lee, MD, PhD On behalf

More information

Generalized Linear Models

Generalized Linear Models Generalized Linear Models We have previously worked with regression models where the response variable is quantitative and normally distributed. Now we turn our attention to two types of models where the

More information

Big Data Health Big Health Improvements? Dr Kerry Bailey MBBS BSc MSc MRCGP FFPH Dr Kelly Nock MPhys PhD

Big Data Health Big Health Improvements? Dr Kerry Bailey MBBS BSc MSc MRCGP FFPH Dr Kelly Nock MPhys PhD Big Data Health Big Health Improvements? Dr Kerry Bailey MBBS BSc MSc MRCGP FFPH Dr Kelly Nock MPhys PhD Epidemiology Infection 2006 Dec;134(6):1167-73. Epub 2006 Apr 20. Risk factors for hospital-acquired

More information

Analysis of Survey Data Using the SAS SURVEY Procedures: A Primer

Analysis of Survey Data Using the SAS SURVEY Procedures: A Primer Analysis of Survey Data Using the SAS SURVEY Procedures: A Primer Patricia A. Berglund, Institute for Social Research - University of Michigan Wisconsin and Illinois SAS User s Group June 25, 2014 1 Overview

More information

Modeling Customer Lifetime Value Using Survival Analysis An Application in the Telecommunications Industry

Modeling Customer Lifetime Value Using Survival Analysis An Application in the Telecommunications Industry Paper 12028 Modeling Customer Lifetime Value Using Survival Analysis An Application in the Telecommunications Industry Junxiang Lu, Ph.D. Overland Park, Kansas ABSTRACT Increasingly, companies are viewing

More information

2 Right Censoring and Kaplan-Meier Estimator

2 Right Censoring and Kaplan-Meier Estimator 2 Right Censoring and Kaplan-Meier Estimator In biomedical applications, especially in clinical trials, two important issues arise when studying time to event data (we will assume the event to be death.

More information

Summary Measures (Ratio, Proportion, Rate) Marie Diener-West, PhD Johns Hopkins University

Summary Measures (Ratio, Proportion, Rate) Marie Diener-West, PhD Johns Hopkins University This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this

More information