Applied Statistics: Lent Term

Size: px
Start display at page:

Download "Applied Statistics: Lent Term"

Transcription

1 Applied Statistics: Lent Term Example: Survival data analysis Right Censored Survival Data The data are from a clinical trial investigating the effect of the drug 6-mercaptopurine (6-MP) on the remission times (in weeks) of leukaemia patients. There are two groups of patients in this study: those who were randomised to receive 6-MP and those who were randomised to receive the placebo. These data can be found in the file Leukaem, which is a comma separated value file. Also recorded in this data-set is information on the logarithm (base 10) of the white blood cell count (log(wbc)) and the gender of the 42 patients. In addition, the data-set contains a censoring indicator which identifies those patients who came out of remission (CENSOR=1) and those who were censored (CENSOR=0). Read into R the data-set and call it Leukaem. Examine the data-set. Do you observe any initial differences between the two therapy groups? Kaplan-Meier curves Attach the survival package: library(survival) To create a survival object that describes the right censored survival data to R, we issue the following command: Surv(time=Leukaem$RMISTIME,event=Leukaem$CENSOR) where the two arguments correspond to the observed survival time (time) and the event status indicator (event). Type?Surv to find out more about Surv. The overall survival curve with confidence band can be obtained as follows: plot(survfit(surv(rmistime,censor)~1,data=leukaem),conf.int=t,col=2) The tabular form of this curve can also be obtained: summary(survfit(surv(rmistime,censor)~1,data=leukaem)) The survival curves for the two therapy groups can be displayed as follows:

2 plot(survfit(surv(rmistime,censor)~drug,data=leukaem),col=2:3,lty=1:2,xlab="remission times in weeks",ylab="proportion in remission") legend(25,0.8,c("placebo","6-mp"), lty=1:2,col=2:3) title("kaplan-meier curves for the Leukaemia Patients") Plot now the survival curves by sex and then by log(wbc). Note that for the log(wbc) variable you need to decide how to categorise it. Comparing survival curves From looking at the survival curves for the two drug groups there appears to be a difference in the remission experiences of the 6-MP and Placebo groups, with patients in the 6-MP group remaining, overall, in remission longer than patients in the Placebo group. To confirm this we perform the log-rank test. survdiff(surv(rmistime,censor)~drug,data=leukaem) Perform now the log-rank test on the gender and the categorised log(wbc) variables. What are the results? Was it appropriate to use the log-rank test for these variables? Cox proportional hazards model Below are the commands for fitting a particular Cox proportional hazards model. Use these commands to find the "best" fitting model. leuk.ph <- coxph(surv(rmistime,censor)~drug,data=leukaem) summary(leuk.ph) Do you understand the output? Is there a drug*log(wbc) interaction? Interpret the results. Can you figure out how to plot the predicted survival curve for a person who is on 6-MP and has a log(wbc) of 4.00? (Hint:?survfit) Assessing the validity of the proportional hazards assumption We now assess whether the proportional hazards assumption is valid for the drug variable. First graphically, plot(survfit(surv(rmistime,censor)~drug,data=leukaem),col=2:3,lty=1:2,fun="cloglog", xlim=c(1,40), ylim= c(-3,3), xlab="time of remission (weeks)",ylab="log(-log(s(t)))") Then analytically, leuk.ph <- coxph(surv(rmistime,censor)~drug,data=leukaem) cox.zph(leuk.ph) # using the default K-M transformation of time Test whether the proportional hazards assumption is being satisfied for the sex and log(wbc) variables (and any other variable you create or use). If the PH assumption is not satisfied, re-fit the Cox model but now stratify by the variable not satisfying the

3 PH assumption. This is done by using the function strata("variable name") as an argument on the right hand side of the formula in coxph(.). That is, coxph(surv(rmistime,censor) ~ strata(.) +...,data=leukaem). Now find the most appropriate model and interpret your results. Other R commands which may be useful are step and basehaz. Below is the Leukaem data used for the above practical. RMISTIME DRUG CENSOR LOGWBC SEX

4 Male =1; Female =0; 6-MP =1 Placebo =0 and CENSOR=1 indicates not in remission anymore. Remission is a disease symptom-free state.

5 Right Censored and Left Truncated Survival Data The data-set used in this part of the practical is taken from Klein and Moeschberger (2003). It consists of the ages (in months) of 462 elderly individuals (97 males and 365 females) who were resident at the retirement centre, Channing House, during the period January 1964 to July 1975 when they died or left the centre, as well as the ages (in months) at which the individuals entered the retirement community. Interest lies in the life times of the individuals. However, in addition to right censoring, the life times in this data-set are subject to left truncation as an individual must survive to a sufficient age to enter the retirement community. Individuals who die at an early age are excluded from the study. Also, included in the data-set is the gender of the individuals (coded: Male=1 and Female=2). The data channing can be found in library(kmsurv). To get access to the data once the KMsurv package is loaded type data(channing) Familiarise yourself with the data-set. Do you notice anything peculiar? Modified Kaplan-Meier curves Before estimating the Kaplan-Meier estimator we need to tell R that this is a survival data-set (i.e. a survival object) with right censoring and left truncation. We do this through use of the function Surv(), once again. srvobj <- Surv(time = channing$ageentry, time2 = channing$age, event = channing$death, type="counting")) Can you tell why this does not work? To overcome this problem we create a new data-set called channing1. channing1 <- channing[(channing$time!=0),] Now repeat the earlier step using channing1 instead of channing to create the survival object. Below we have used the function with srvobj <- with(channing1, Surv(time = ageentry, time2 = age, event = death, type="counting")) srvobj The overall estimated K-M curve can now be calculated. summary(survfit(srvobj~1)) plot(survfit(srvobj~1), xlab ="Age at death (in months)", ylab = "Estimated Conditional Proportion Surviving") How do you interpret this estimator?

6 The estimated survival curves by gender can also be calculated. plot(survfit(srvobj~gender,data=channing1), lty=1:2, col=1:2, xlab = "Age at death (in months)", ylab = "Estimated Conditional Proportion Surviving") legend(200,0.9,c("male","female"),lty=1:2,col=1:2) Do you notice anything peculiar? Let us now look at the males in a bit more detail. summary(survfit(srvobj~1,data=channing1,subset=(gender==1))) Do you understand what has happened here? Does this estimator have any real meaning? Let us now estimate the conditional survival curves for males and females separately, given that they have survived up to age 68 years old (i.e. 816 months). channing2 <- channing1[(channing1$ageentry>=816),] srvobj2 <- Surv(time = channing2$ageentry, time2 = channing2$age, event = channing2$death, type="counting") srvobj2 plot(survfit(srvobj2~gender,data=channing2), lty=1:2, col=1:2, xlim = c(800,1250), xlab = "Age at death (in months)", ylab = "Estimated Conditional Proportion Surviving") legend(1100,0.9,c("male","female"),lty=1:2,col=1:2) Is there any evidence (by eye-balling) of a difference between the survival experiences of males and females? Can you figure out how to estimate the conditional survival probabilities for males and females separately, given that they have survived up to age 75 years old? Cox proportional hazards model Let us now investigate formally whether there is an effect of gender on the survival experiences of the individuals in Channing House. channing.cox <- coxph(srvobj~factor(gender),data=channing1) summary(channing.cox) Can you interpret the results obtained? What is the hazards ratio for males versus females? The R function relevel is a useful command when one would like to change the reference level of a factor variable. See if you can figure out how to apply it above. Let us end by checking the proportional hazards assumption analytically. cox.zph(channing.cox) Is there any evidence for non-proportionality of hazards?

Linda Staub & Alexandros Gekenidis

Linda Staub & Alexandros Gekenidis Seminar in Statistics: Survival Analysis Chapter 2 Kaplan-Meier Survival Curves and the Log- Rank Test Linda Staub & Alexandros Gekenidis March 7th, 2011 1 Review Outcome variable of interest: time until

More information

Package smoothhr. November 9, 2015

Package smoothhr. November 9, 2015 Encoding UTF-8 Type Package Depends R (>= 2.12.0),survival,splines Package smoothhr November 9, 2015 Title Smooth Hazard Ratio Curves Taking a Reference Value Version 1.0.2 Date 2015-10-29 Author Artur

More information

The Kaplan-Meier Plot. Olaf M. Glück

The Kaplan-Meier Plot. Olaf M. Glück The Kaplan-Meier Plot 1 Introduction 2 The Kaplan-Meier-Estimator (product limit estimator) 3 The Kaplan-Meier Curve 4 From planning to the Kaplan-Meier Curve. An Example 5 Sources & References 1 Introduction

More information

Vignette for survrm2 package: Comparing two survival curves using the restricted mean survival time

Vignette for survrm2 package: Comparing two survival curves using the restricted mean survival time Vignette for survrm2 package: Comparing two survival curves using the restricted mean survival time Hajime Uno Dana-Farber Cancer Institute March 16, 2015 1 Introduction In a comparative, longitudinal

More information

Survey, Statistics and Psychometrics Core Research Facility University of Nebraska-Lincoln. Log-Rank Test for More Than Two Groups

Survey, Statistics and Psychometrics Core Research Facility University of Nebraska-Lincoln. Log-Rank Test for More Than Two Groups Survey, Statistics and Psychometrics Core Research Facility University of Nebraska-Lincoln Log-Rank Test for More Than Two Groups Prepared by Harlan Sayles (SRAM) Revised by Julia Soulakova (Statistics)

More information

Competing-risks regression

Competing-risks regression Competing-risks regression Roberto G. Gutierrez Director of Statistics StataCorp LP Stata Conference Boston 2010 R. Gutierrez (StataCorp) Competing-risks regression July 15-16, 2010 1 / 26 Outline 1. Overview

More information

Lecture 2 ESTIMATING THE SURVIVAL FUNCTION. One-sample nonparametric methods

Lecture 2 ESTIMATING THE SURVIVAL FUNCTION. One-sample nonparametric methods Lecture 2 ESTIMATING THE SURVIVAL FUNCTION One-sample nonparametric methods There are commonly three methods for estimating a survivorship function S(t) = P (T > t) without resorting to parametric models:

More information

A Handbook of Statistical Analyses Using R. Brian S. Everitt and Torsten Hothorn

A Handbook of Statistical Analyses Using R. Brian S. Everitt and Torsten Hothorn A Handbook of Statistical Analyses Using R Brian S. Everitt and Torsten Hothorn CHAPTER 6 Logistic Regression and Generalised Linear Models: Blood Screening, Women s Role in Society, and Colonic Polyps

More information

Measures of Prognosis. Sukon Kanchanaraksa, PhD Johns Hopkins University

Measures of Prognosis. Sukon Kanchanaraksa, PhD Johns Hopkins University This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this

More information

Kaplan-Meier Survival Analysis 1

Kaplan-Meier Survival Analysis 1 Version 4.0 Step-by-Step Examples Kaplan-Meier Survival Analysis 1 With some experiments, the outcome is a survival time, and you want to compare the survival of two or more groups. Survival curves show,

More information

SAS and R calculations for cause specific hazard ratios in a competing risks analysis with time dependent covariates

SAS and R calculations for cause specific hazard ratios in a competing risks analysis with time dependent covariates SAS and R calculations for cause specific hazard ratios in a competing risks analysis with time dependent covariates Martin Wolkewitz, Ralf Peter Vonberg, Hajo Grundmann, Jan Beyersmann, Petra Gastmeier,

More information

Tips for surviving the analysis of survival data. Philip Twumasi-Ankrah, PhD

Tips for surviving the analysis of survival data. Philip Twumasi-Ankrah, PhD Tips for surviving the analysis of survival data Philip Twumasi-Ankrah, PhD Big picture In medical research and many other areas of research, we often confront continuous, ordinal or dichotomous outcomes

More information

A short course in Longitudinal Data Analysis ESRC Research Methods and Short Course Material for Practicals with the joiner package.

A short course in Longitudinal Data Analysis ESRC Research Methods and Short Course Material for Practicals with the joiner package. A short course in Longitudinal Data Analysis ESRC Research Methods and Short Course Material for Practicals with the joiner package. Lab 2 - June, 2008 1 jointdata objects To analyse longitudinal data

More information

Comparison of Survival Curves

Comparison of Survival Curves Comparison of Survival Curves We spent the last class looking at some nonparametric approaches for estimating the survival function, Ŝ(t), over time for a single sample of individuals. Now we want to compare

More information

5 Correlation and Data Exploration

5 Correlation and Data Exploration 5 Correlation and Data Exploration Correlation In Unit 3, we did some correlation analyses of data from studies related to the acquisition order and acquisition difficulty of English morphemes by both

More information

Survival Analysis of the Patients Diagnosed with Non-Small Cell Lung Cancer Using SAS Enterprise Miner 13.1

Survival Analysis of the Patients Diagnosed with Non-Small Cell Lung Cancer Using SAS Enterprise Miner 13.1 Paper 11682-2016 Survival Analysis of the Patients Diagnosed with Non-Small Cell Lung Cancer Using SAS Enterprise Miner 13.1 Raja Rajeswari Veggalam, Akansha Gupta; SAS and OSU Data Mining Certificate

More information

Life Tables. Marie Diener-West, PhD Sukon Kanchanaraksa, PhD

Life Tables. Marie Diener-West, PhD Sukon Kanchanaraksa, PhD This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this

More information

Introduction. Survival Analysis. Censoring. Plan of Talk

Introduction. Survival Analysis. Censoring. Plan of Talk Survival Analysis Mark Lunt Arthritis Research UK Centre for Excellence in Epidemiology University of Manchester 01/12/2015 Survival Analysis is concerned with the length of time before an event occurs.

More information

Randomized trials versus observational studies

Randomized trials versus observational studies Randomized trials versus observational studies The case of postmenopausal hormone therapy and heart disease Miguel Hernán Harvard School of Public Health www.hsph.harvard.edu/causal Joint work with James

More information

Efficacy analysis and graphical representation in Oncology trials - A case study

Efficacy analysis and graphical representation in Oncology trials - A case study Efficacy analysis and graphical representation in Oncology trials - A case study Anindita Bhattacharjee Vijayalakshmi Indana Cytel, Pune The views expressed in this presentation are our own and do not

More information

Technical Information

Technical Information Technical Information Trials The questions for Progress Test in English (PTE) were developed by English subject experts at the National Foundation for Educational Research. For each test level of the paper

More information

Abstract. Introduction. System Requirement. GUI Design. Paper AD17-2011

Abstract. Introduction. System Requirement. GUI Design. Paper AD17-2011 Paper AD17-2011 Application for Survival Analysis through Microsoft Access GUI Zhong Yan, i3, Indianapolis, IN Jie Li, i3, Austin, Texas Jiazheng (Steven) He, i3, San Francisco, California Abstract This

More information

Graphics in R. Biostatistics 615/815

Graphics in R. Biostatistics 615/815 Graphics in R Biostatistics 615/815 Last Lecture Introduction to R Programming Controlling Loops Defining your own functions Today Introduction to Graphics in R Examples of commonly used graphics functions

More information

LOGISTIC REGRESSION ANALYSIS

LOGISTIC REGRESSION ANALYSIS LOGISTIC REGRESSION ANALYSIS C. Mitchell Dayton Department of Measurement, Statistics & Evaluation Room 1230D Benjamin Building University of Maryland September 1992 1. Introduction and Model Logistic

More information

Unit 12 Logistic Regression Supplementary Chapter 14 in IPS On CD (Chap 16, 5th ed.)

Unit 12 Logistic Regression Supplementary Chapter 14 in IPS On CD (Chap 16, 5th ed.) Unit 12 Logistic Regression Supplementary Chapter 14 in IPS On CD (Chap 16, 5th ed.) Logistic regression generalizes methods for 2-way tables Adds capability studying several predictors, but Limited to

More information

SUMAN DUVVURU STAT 567 PROJECT REPORT

SUMAN DUVVURU STAT 567 PROJECT REPORT SUMAN DUVVURU STAT 567 PROJECT REPORT SURVIVAL ANALYSIS OF HEROIN ADDICTS Background and introduction: Current illicit drug use among teens is continuing to increase in many countries around the world.

More information

BIOM611 Biological Data Analysis

BIOM611 Biological Data Analysis BIOM611 Biological Data Analysis Spring, 2015 Tentative Syllabus Introduction BIOMED611 is a ½ unit course required for all 1 st year BGS students (except GCB students). It will provide an introduction

More information

Tests for Two Survival Curves Using Cox s Proportional Hazards Model

Tests for Two Survival Curves Using Cox s Proportional Hazards Model Chapter 730 Tests for Two Survival Curves Using Cox s Proportional Hazards Model Introduction A clinical trial is often employed to test the equality of survival distributions of two treatment groups.

More information

Journal of Statistical Software

Journal of Statistical Software JSS Journal of Statistical Software January 2011, Volume 38, Issue 5. http://www.jstatsoft.org/ Lexis: An R Class for Epidemiological Studies with Long-Term Follow-Up Martyn Plummer International Agency

More information

Parametric and non-parametric statistical methods for the life sciences - Session I

Parametric and non-parametric statistical methods for the life sciences - Session I Why nonparametric methods What test to use? Rank Tests Parametric and non-parametric statistical methods for the life sciences - Session I Liesbeth Bruckers Geert Molenberghs Interuniversity Institute

More information

2 Right Censoring and Kaplan-Meier Estimator

2 Right Censoring and Kaplan-Meier Estimator 2 Right Censoring and Kaplan-Meier Estimator In biomedical applications, especially in clinical trials, two important issues arise when studying time to event data (we will assume the event to be death.

More information

Lung Cancer. Public Outcomes Report. Submitted by Omar A. Majid, MD

Lung Cancer. Public Outcomes Report. Submitted by Omar A. Majid, MD Public Outcomes Report Lung Cancer Submitted by Omar A. Majid, MD Lung cancer is the most common cancer-related cause of death among men and women. It has been estimated that there will be 226,1 new cases

More information

200609 - ATV - Lifetime Data Analysis

200609 - ATV - Lifetime Data Analysis Coordinating unit: Teaching unit: Academic year: Degree: ECTS credits: 2015 200 - FME - School of Mathematics and Statistics 715 - EIO - Department of Statistics and Operations Research 1004 - UB - (ENG)Universitat

More information

If several different trials are mentioned in one publication, the data of each should be extracted in a separate data extraction form.

If several different trials are mentioned in one publication, the data of each should be extracted in a separate data extraction form. General Remarks This template of a data extraction form is intended to help you to start developing your own data extraction form, it certainly has to be adapted to your specific question. Delete unnecessary

More information

Package SurvCorr. February 26, 2015

Package SurvCorr. February 26, 2015 Type Package Title Correlation of Bivariate Survival Times Version 1.0 Date 2015-02-25 Package SurvCorr February 26, 2015 Author Meinhard Ploner, Alexandra Kaider and Georg Heinze Maintainer Georg Heinze

More information

Stage IV Renal Cell Carcinoma. Changing Management in A Comprehensive Community Cancer Center. Susquehanna Health Cancer Center

Stage IV Renal Cell Carcinoma. Changing Management in A Comprehensive Community Cancer Center. Susquehanna Health Cancer Center Stage IV Renal Cell Carcinoma Changing Management in A Comprehensive Community Cancer Center Susquehanna Health Cancer Center 2000 2009 Warren L. Robinson, MD, FACP January 27, 2014 Introduction 65,150

More information

Paper PO06. Randomization in Clinical Trial Studies

Paper PO06. Randomization in Clinical Trial Studies Paper PO06 Randomization in Clinical Trial Studies David Shen, WCI, Inc. Zaizai Lu, AstraZeneca Pharmaceuticals ABSTRACT Randomization is of central importance in clinical trials. It prevents selection

More information

Basic Statistics and Data Analysis for Health Researchers from Foreign Countries

Basic Statistics and Data Analysis for Health Researchers from Foreign Countries Basic Statistics and Data Analysis for Health Researchers from Foreign Countries Volkert Siersma siersma@sund.ku.dk The Research Unit for General Practice in Copenhagen Dias 1 Content Quantifying association

More information

ADVANCE: a factorial randomised trial of blood pressure lowering and intensive glucose control in 11,140 patients with type 2 diabetes

ADVANCE: a factorial randomised trial of blood pressure lowering and intensive glucose control in 11,140 patients with type 2 diabetes ADVANCE: a factorial randomised trial of blood pressure lowering and intensive glucose control in 11,140 patients with type 2 diabetes Effects of a fixed combination of the ACE inhibitor, perindopril,

More information

Each function call carries out a single task associated with drawing the graph.

Each function call carries out a single task associated with drawing the graph. Chapter 3 Graphics with R 3.1 Low-Level Graphics R has extensive facilities for producing graphs. There are both low- and high-level graphics facilities. The low-level graphics facilities provide basic

More information

Supplementary appendix

Supplementary appendix Supplementary appendix This appendix formed part of the original submission and has been peer reviewed. We post it as supplied by the authors. Supplement to: Gold R, Giovannoni G, Selmaj K, et al, for

More information

SPONTANEOUS MESOTHELIOMA DATA: AN INTERPRETATION. Robin Howie, Robin Howie Associates, Edinburgh.

SPONTANEOUS MESOTHELIOMA DATA: AN INTERPRETATION. Robin Howie, Robin Howie Associates, Edinburgh. SPONTANEOUS MESOTHELIOMA DATA: AN INTERPRETATION Robin Howie, Robin Howie Associates, Edinburgh. SPONTANEOUS MESOTHELIOMA MESOTHELIOMA DEATHS HSE (2003a) estimated there are about 26 spontaneous deaths

More information

II. DISTRIBUTIONS distribution normal distribution. standard scores

II. DISTRIBUTIONS distribution normal distribution. standard scores Appendix D Basic Measurement And Statistics The following information was developed by Steven Rothke, PhD, Department of Psychology, Rehabilitation Institute of Chicago (RIC) and expanded by Mary F. Schmidt,

More information

Solutions to Homework 10 Statistics 302 Professor Larget

Solutions to Homework 10 Statistics 302 Professor Larget s to Homework 10 Statistics 302 Professor Larget Textbook Exercises 7.14 Rock-Paper-Scissors (Graded for Accurateness) In Data 6.1 on page 367 we see a table, reproduced in the table below that shows the

More information

Product Development News

Product Development News Article from: Product Development News May 2006 Issue 65 Features Comfort Food for an Actuary: Cognitive Testing in Underwriting the Elderly 1 by Eric D. Golus, Laura Vecchione and Thomas Ashley Eric D.

More information

PRACTICE PROBLEMS FOR BIOSTATISTICS

PRACTICE PROBLEMS FOR BIOSTATISTICS PRACTICE PROBLEMS FOR BIOSTATISTICS BIOSTATISTICS DESCRIBING DATA, THE NORMAL DISTRIBUTION 1. The duration of time from first exposure to HIV infection to AIDS diagnosis is called the incubation period.

More information

Simple Predictive Analytics Curtis Seare

Simple Predictive Analytics Curtis Seare Using Excel to Solve Business Problems: Simple Predictive Analytics Curtis Seare Copyright: Vault Analytics July 2010 Contents Section I: Background Information Why use Predictive Analytics? How to use

More information

Summary of R software commands used to generate bootstrap and permutation test output and figures in Chapter 16

Summary of R software commands used to generate bootstrap and permutation test output and figures in Chapter 16 Summary of R software commands used to generate bootstrap and permutation test output and figures in Chapter 16 Since R is command line driven and the primary software of Chapter 16, this document details

More information

Topic 3 - Survival Analysis

Topic 3 - Survival Analysis Topic 3 - Survival Analysis 1. Topics...2 2. Learning objectives...3 3. Grouped survival data - leukemia example...4 3.1 Cohort survival data schematic...5 3.2 Tabulation of events and time at risk...6

More information

Normal Distribution. Definition A continuous random variable has a normal distribution if its probability density. f ( y ) = 1.

Normal Distribution. Definition A continuous random variable has a normal distribution if its probability density. f ( y ) = 1. Normal Distribution Definition A continuous random variable has a normal distribution if its probability density e -(y -µ Y ) 2 2 / 2 σ function can be written as for < y < as Y f ( y ) = 1 σ Y 2 π Notation:

More information

# load in the files containing the methyaltion data and the source # code containing the SSRPMM functions

# load in the files containing the methyaltion data and the source # code containing the SSRPMM functions ################ EXAMPLE ANALYSES TO ILLUSTRATE SS-RPMM ######################## # load in the files containing the methyaltion data and the source # code containing the SSRPMM functions # Note, the SSRPMM

More information

Pharmacogenomic markers in EGFR-targeted therapy of lung cancer

Pharmacogenomic markers in EGFR-targeted therapy of lung cancer Pharmacogenomic markers in EGFR-targeted therapy of lung cancer Rafal Dziadziuszko, MD, PhD University of Colorado Cancer Center, Aurora, CO, USA Medical University of Gdansk, Poland EMEA Workshop on Biomarkers,

More information

Students' Opinion about Universities: The Faculty of Economics and Political Science (Case Study)

Students' Opinion about Universities: The Faculty of Economics and Political Science (Case Study) Cairo University Faculty of Economics and Political Science Statistics Department English Section Students' Opinion about Universities: The Faculty of Economics and Political Science (Case Study) Prepared

More information

Evaluation of Treatment Pathways in Oncology: Modeling Approaches. Feng Pan, PhD United BioSource Corporation Bethesda, MD

Evaluation of Treatment Pathways in Oncology: Modeling Approaches. Feng Pan, PhD United BioSource Corporation Bethesda, MD Evaluation of Treatment Pathways in Oncology: Modeling Approaches Feng Pan, PhD United BioSource Corporation Bethesda, MD 1 Objectives Rationale for modeling treatment pathways Treatment pathway simulation

More information

2. Background This was the fourth submission for everolimus requesting listing for clear cell renal carcinoma.

2. Background This was the fourth submission for everolimus requesting listing for clear cell renal carcinoma. PUBLIC SUMMARY DOCUMENT Product: Everolimus, tablets, 5 mg and 10 mg, Afinitor Sponsor: Novartis Pharmaceuticals Australia Pty Ltd Date of PBAC Consideration: November 2011 1. Purpose of Application To

More information

Regression Modeling Strategies

Regression Modeling Strategies Frank E. Harrell, Jr. Regression Modeling Strategies With Applications to Linear Models, Logistic Regression, and Survival Analysis With 141 Figures Springer Contents Preface Typographical Conventions

More information

Missing Data in Survival Analysis and Results from the MESS Trial

Missing Data in Survival Analysis and Results from the MESS Trial Missing Data in Survival Analysis and Results from the MESS Trial J. K. Rogers J. L. Hutton K. Hemming Department of Statistics University of Warwick Research Students Conference, 2008 Outline Background

More information

Analysis and Interpretation of Clinical Trials. How to conclude?

Analysis and Interpretation of Clinical Trials. How to conclude? www.eurordis.org Analysis and Interpretation of Clinical Trials How to conclude? Statistical Issues Dr Ferran Torres Unitat de Suport en Estadística i Metodología - USEM Statistics and Methodology Support

More information

Getting started with qplot

Getting started with qplot Chapter 2 Getting started with qplot 2.1 Introduction In this chapter, you will learn to make a wide variety of plots with your first ggplot2 function, qplot(), short for quick plot. qplot makes it easy

More information

The Cox Proportional Hazards Model

The Cox Proportional Hazards Model The Cox Proportional Hazards Model Mario Chen, PhD Advanced Biostatistics and RCT Workshop Office of AIDS Research, NIH ICSSC, FHI Goa, India, September 2009 1 The Model h i (t)=h 0 (t)exp(z i ), Z i =

More information

Q&A on methodology on HIV estimates

Q&A on methodology on HIV estimates Q&A on methodology on HIV estimates 09 Understanding the latest estimates of the 2008 Report on the global AIDS epidemic Part one: The data 1. What data do UNAIDS and WHO base their HIV prevalence estimates

More information

CHILDHOOD CANCER SURVIVOR STUDY Analysis Concept Proposal

CHILDHOOD CANCER SURVIVOR STUDY Analysis Concept Proposal CHILDHOOD CANCER SURVIVOR STUDY Analysis Concept Proposal 1. STUDY TITLE: Longitudinal Assessment of Chronic Health Conditions: The Aging of Childhood Cancer Survivors 2. WORKING GROUP AND INVESTIGATORS:

More information

Advanced Quantitative Methods for Health Care Professionals PUBH 742 Spring 2015

Advanced Quantitative Methods for Health Care Professionals PUBH 742 Spring 2015 1 Advanced Quantitative Methods for Health Care Professionals PUBH 742 Spring 2015 Instructor: Joanne M. Garrett, PhD e-mail: joanne_garrett@med.unc.edu Class Notes: Copies of the class lecture slides

More information

Is it statistically significant? The chi-square test

Is it statistically significant? The chi-square test UAS Conference Series 2013/14 Is it statistically significant? The chi-square test Dr Gosia Turner Student Data Management and Analysis 14 September 2010 Page 1 Why chi-square? Tests whether two categorical

More information

Seton Medical Center Hepatocellular Carcinoma Patterns of Care Study Rate of Treatment with Chemoembolization 2007 2012 N = 50

Seton Medical Center Hepatocellular Carcinoma Patterns of Care Study Rate of Treatment with Chemoembolization 2007 2012 N = 50 General Data Seton Medical Center Hepatocellular Carcinoma Patterns of Care Study Rate of Treatment with Chemoembolization 2007 2012 N = 50 The vast majority of the patients in this study were diagnosed

More information

Didacticiel Études de cas

Didacticiel Études de cas 1 Theme Data Mining with R The rattle package. R (http://www.r project.org/) is one of the most exciting free data mining software projects of these last years. Its popularity is completely justified (see

More information

Expectancy Effects of Performance Enhancing Supplements on Motivation to Exercise. Chris Dawson and Alfonso Ribero.

Expectancy Effects of Performance Enhancing Supplements on Motivation to Exercise. Chris Dawson and Alfonso Ribero. Running Head: EXPECTANCY EFFECTS ON MOTIVATION Expectancy Effects of Performance Enhancing Supplements on Motivation to Exercise Chris Dawson and Alfonso Ribero Hanover College Expectancy Effects 2 Abstract

More information

Some Statistical Applications In The Financial Services Industry

Some Statistical Applications In The Financial Services Industry Some Statistical Applications In The Financial Services Industry Wenqing Lu May 30, 2008 1 Introduction Examples of consumer financial services credit card services mortgage loan services auto finance

More information

Prognostic impact of uric acid in patients with stable coronary artery disease

Prognostic impact of uric acid in patients with stable coronary artery disease Prognostic impact of uric acid in patients with stable coronary artery disease Gjin Ndrepepa, Siegmund Braun, Martin Hadamitzky, Massimiliano Fusaro, Hans-Ullrich Haase, Kathrin A. Birkmeier, Albert Schomig,

More information

How to get accurate sample size and power with nquery Advisor R

How to get accurate sample size and power with nquery Advisor R How to get accurate sample size and power with nquery Advisor R Brian Sullivan Statistical Solutions Ltd. ASA Meeting, Chicago, March 2007 Sample Size Two group t-test χ 2 -test Survival Analysis 2 2 Crossover

More information

sample median Sample quartiles sample deciles sample quantiles sample percentiles Exercise 1 five number summary # Create and view a sorted

sample median Sample quartiles sample deciles sample quantiles sample percentiles Exercise 1 five number summary # Create and view a sorted Sample uartiles We have seen that the sample median of a data set {x 1, x, x,, x n }, sorted in increasing order, is a value that divides it in such a way, that exactly half (i.e., 50%) of the sample observations

More information

Variability in Life-Table Estimates Based on the National Health Interview Survey Linked Mortality Files

Variability in Life-Table Estimates Based on the National Health Interview Survey Linked Mortality Files Variability in Life-Table Estimates Based on the National Health Interview Survey Linked Mortality Files Van Parsons 1, Nathaniel Schenker, Kimberly Lochner, Gloria Wheatcroft and Elsie Pamuk 1 National

More information

Life Insurance Modelling: Notes for teachers. Overview

Life Insurance Modelling: Notes for teachers. Overview Life Insurance Modelling: Notes for teachers In this mathematics activity, students model the following situation: An investment fund is set up Investors pay a set amount at the beginning of 20 years.

More information

What Cancer Patients Need To Know

What Cancer Patients Need To Know Taking Part in Clinical Trials What Cancer Patients Need To Know NATIONAL INSTITUTES OF HEALTH National Cancer Institute Generous support for this publication was provided by Novartis Oncology. Taking

More information

Thrombin generation and recurrent venous thromboembolism

Thrombin generation and recurrent venous thromboembolism Thrombin generation and recurrent venous thromboembolism T. Zhu, W. Ye, F. Dali Ali, L. Carcaillon, V. Remones, M. Alhenc Gelas, P. Gaussem, P.Y. Scarabin, J. Emmerich INSERM U765, University Paris Descartes,

More information

Development of the nomolog program and its evolution

Development of the nomolog program and its evolution Development of the nomolog program and its evolution Towards the implementation of a nomogram generator for the Cox regression Alexander Zlotnik, Telecom.Eng. Víctor Abraira Santos, PhD Ramón y Cajal University

More information

Survival Analysis of Left Truncated Income Protection Insurance Data. [March 29, 2012]

Survival Analysis of Left Truncated Income Protection Insurance Data. [March 29, 2012] Survival Analysis of Left Truncated Income Protection Insurance Data [March 29, 2012] 1 Qing Liu 2 David Pitt 3 Yan Wang 4 Xueyuan Wu Abstract One of the main characteristics of Income Protection Insurance

More information

The Procedures of Monte Carlo Simulation (and Resampling)

The Procedures of Monte Carlo Simulation (and Resampling) 154 Resampling: The New Statistics CHAPTER 10 The Procedures of Monte Carlo Simulation (and Resampling) A Definition and General Procedure for Monte Carlo Simulation Summary Until now, the steps to follow

More information

Methodology Understanding the HIV estimates

Methodology Understanding the HIV estimates UNAIDS July 2014 Methodology Understanding the HIV estimates Produced by the Strategic Information and Monitoring Division Notes on UNAIDS methodology Unless otherwise stated, findings in this report are

More information

An Application of Weibull Analysis to Determine Failure Rates in Automotive Components

An Application of Weibull Analysis to Determine Failure Rates in Automotive Components An Application of Weibull Analysis to Determine Failure Rates in Automotive Components Jingshu Wu, PhD, PE, Stephen McHenry, Jeffrey Quandt National Highway Traffic Safety Administration (NHTSA) U.S. Department

More information

Big data size isn t enough! Irene Petersen, PhD Primary Care & Population Health

Big data size isn t enough! Irene Petersen, PhD Primary Care & Population Health Big data size isn t enough! Irene Petersen, PhD Primary Care & Population Health Introduction Reader (Statistics and Epidemiology) Research team epidemiologists/statisticians/phd students Primary care

More information

Tutorial 3: Graphics and Exploratory Data Analysis in R Jason Pienaar and Tom Miller

Tutorial 3: Graphics and Exploratory Data Analysis in R Jason Pienaar and Tom Miller Tutorial 3: Graphics and Exploratory Data Analysis in R Jason Pienaar and Tom Miller Getting to know the data An important first step before performing any kind of statistical analysis is to familiarize

More information

Life Settlements and Viaticals

Life Settlements and Viaticals Life Settlements and Viaticals Paul Wilmott, Wilmott Associates, paul@wilmottcom This is taken from the Second Edition of Paul Wilmott On Quantitative Finance, published by John Wiley & Sons 26 In this

More information

Beyond Age and Sex: Enhancing Annuity Pricing

Beyond Age and Sex: Enhancing Annuity Pricing Beyond Age and Sex: Enhancing Annuity Pricing Joelle HY. Fong The Wharton School (IRM Dept), and CEPAR CEPAR Demography & Longevity Workshop University of New South Wales 26 July 2011 Motivation Current

More information

Statistical estimation using confidence intervals

Statistical estimation using confidence intervals 0894PP_ch06 15/3/02 11:02 am Page 135 6 Statistical estimation using confidence intervals In Chapter 2, the concept of the central nature and variability of data and the methods by which these two phenomena

More information

Advanced Statistical Analysis of Mortality. Rhodes, Thomas E. and Freitas, Stephen A. MIB, Inc. 160 University Avenue. Westwood, MA 02090

Advanced Statistical Analysis of Mortality. Rhodes, Thomas E. and Freitas, Stephen A. MIB, Inc. 160 University Avenue. Westwood, MA 02090 Advanced Statistical Analysis of Mortality Rhodes, Thomas E. and Freitas, Stephen A. MIB, Inc 160 University Avenue Westwood, MA 02090 001-(781)-751-6356 fax 001-(781)-329-3379 trhodes@mib.com Abstract

More information

Binary Logistic Regression

Binary Logistic Regression Binary Logistic Regression Main Effects Model Logistic regression will accept quantitative, binary or categorical predictors and will code the latter two in various ways. Here s a simple model including

More information

DECISION AND SUMMARY OF RATIONALE

DECISION AND SUMMARY OF RATIONALE DECISION AND SUMMARY OF RATIONALE Indication under consideration Clinical evidence Clofarabine in the treatment of relapsed acute myeloid leukaemia (AML) The application was for clofarabine to remain in

More information

Modelling spousal mortality dependence: evidence of heterogeneities and implications

Modelling spousal mortality dependence: evidence of heterogeneities and implications 1/23 Modelling spousal mortality dependence: evidence of heterogeneities and implications Yang Lu Scor and Aix-Marseille School of Economics Lyon, September 2015 2/23 INTRODUCTION 3/23 Motivation It has

More information

Modeling the Claim Duration of Income Protection Insurance Policyholders Using Parametric Mixture Models

Modeling the Claim Duration of Income Protection Insurance Policyholders Using Parametric Mixture Models Modeling the Claim Duration of Income Protection Insurance Policyholders Using Parametric Mixture Models Abstract This paper considers the modeling of claim durations for existing claimants under income

More information

Package dsmodellingclient

Package dsmodellingclient Package dsmodellingclient Maintainer Author Version 4.1.0 License GPL-3 August 20, 2015 Title DataSHIELD client site functions for statistical modelling DataSHIELD

More information

Health Disparities in Multiple Myeloma. Kenneth R. Bridges, M.D. Senior Medical Director Onyx Pharmaceuticals, Inc.

Health Disparities in Multiple Myeloma. Kenneth R. Bridges, M.D. Senior Medical Director Onyx Pharmaceuticals, Inc. Health Disparities in Multiple Myeloma Kenneth R. Bridges, M.D. Senior Medical Director Onyx Pharmaceuticals, Inc. Multiple Myeloma Overview Multiple myeloma (MM) is a type of blood cancer that develops

More information

Time Series Analysis with R - Part I. Walter Zucchini, Oleg Nenadić

Time Series Analysis with R - Part I. Walter Zucchini, Oleg Nenadić Time Series Analysis with R - Part I Walter Zucchini, Oleg Nenadić Contents 1 Getting started 2 1.1 Downloading and Installing R.................... 2 1.2 Data Preparation and Import in R.................

More information

Supplementary Online Content

Supplementary Online Content Supplementary Online Content Arterburn DE, Olsen MK, Smith VA, Livingston EH, Van Scoyoc L, Yancy WS, Jr. Association Between Bariatric Surgery and Long-Term Survival. JAMA. doi:10.1001/jama.2014.16968.

More information

A LONGITUDINAL AND SURVIVAL MODEL WITH HEALTH CARE USAGE FOR INSURED ELDERLY. Workshop

A LONGITUDINAL AND SURVIVAL MODEL WITH HEALTH CARE USAGE FOR INSURED ELDERLY. Workshop A LONGITUDINAL AND SURVIVAL MODEL WITH HEALTH CARE USAGE FOR INSURED ELDERLY Ramon Alemany Montserrat Guillén Xavier Piulachs Lozada Riskcenter - IREA Universitat de Barcelona http://www.ub.edu/riskcenter

More information

Center Activity (07/01/2009-06/30/2010) Center Region United States Tables for More Information

Center Activity (07/01/2009-06/30/2010) Center Region United States Tables for More Information Program Summary Center Activity (07/01/2009-06/30/2010) Center Region United States Tables for More Information Deceased donor transplants (n=number) 0 169 0 07C,08C,09C On waitlist at start (n) 0 0 0

More information

Cancer Survival - How Long Do People Survive?

Cancer Survival - How Long Do People Survive? A research briefing paper by Macmillan Cancer Support Introduction Key findings 3 People with cancer are surviving longer 4 Median survival time has seen dramatic improvement for some cancers 5 Median

More information

SCHOOL OF HEALTH AND HUMAN SCIENCES DON T FORGET TO RECODE YOUR MISSING VALUES

SCHOOL OF HEALTH AND HUMAN SCIENCES DON T FORGET TO RECODE YOUR MISSING VALUES SCHOOL OF HEALTH AND HUMAN SCIENCES Using SPSS Topics addressed today: 1. Differences between groups 2. Graphing Use the s4data.sav file for the first part of this session. DON T FORGET TO RECODE YOUR

More information

SAS Enterprise Guide in Pharmaceutical Applications: Automated Analysis and Reporting Alex Dmitrienko, Ph.D., Eli Lilly and Company, Indianapolis, IN

SAS Enterprise Guide in Pharmaceutical Applications: Automated Analysis and Reporting Alex Dmitrienko, Ph.D., Eli Lilly and Company, Indianapolis, IN Paper PH200 SAS Enterprise Guide in Pharmaceutical Applications: Automated Analysis and Reporting Alex Dmitrienko, Ph.D., Eli Lilly and Company, Indianapolis, IN ABSTRACT SAS Enterprise Guide is a member

More information