Package SurvCorr. February 26, 2015



Similar documents
Package survpresmooth

Package smoothhr. November 9, 2015

Basic Statistics and Data Analysis for Health Researchers from Foreign Countries

Package plan. R topics documented: February 20, 2015

More details on the inputs, functionality, and output can be found below.

Package ATE. R topics documented: February 19, Type Package Title Inference for Average Treatment Effects using Covariate. balancing.

Package empiricalfdr.deseq2

Package neuralnet. February 20, 2015

Gamma Distribution Fitting

5 Correlation and Data Exploration

Each function call carries out a single task associated with drawing the graph.

Package retrosheet. April 13, 2015

Graphics in R. Biostatistics 615/815

CHAPTER 3 EXAMPLES: REGRESSION AND PATH ANALYSIS

Package zic. September 4, 2015

Package dsmodellingclient

Package CoImp. February 19, 2015

Data exploration with Microsoft Excel: analysing more than one variable

Two Related Samples t Test

Example: Credit card default, we may be more interested in predicting the probabilty of a default than classifying individuals as default or not.

Package MDM. February 19, 2015

An introduction to IBM SPSS Statistics

Package RIGHT. March 30, 2015

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

Directions for using SPSS

Using Excel for inferential statistics

Package ERP. December 14, 2015

Multivariate Logistic Regression

Package EstCRM. July 13, 2015

Tests for Two Survival Curves Using Cox s Proportional Hazards Model

R 2 -type Curves for Dynamic Predictions from Joint Longitudinal-Survival Models

sample median Sample quartiles sample deciles sample quantiles sample percentiles Exercise 1 five number summary # Create and view a sorted

Point Biserial Correlation Tests

Pearson's Correlation Tests

II. DISTRIBUTIONS distribution normal distribution. standard scores

Package cgdsr. August 27, 2015

Confidence Intervals for Spearman s Rank Correlation

APPLIED MISSING DATA ANALYSIS

Survey, Statistics and Psychometrics Core Research Facility University of Nebraska-Lincoln. Log-Rank Test for More Than Two Groups

Package dsstatsclient

Chapter 2 Probability Topics SPSS T tests

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4)

Package polynom. R topics documented: June 24, Version 1.3-8

DATA ANALYSIS. QEM Network HBCU-UP Fundamentals of Education Research Workshop Gerunda B. Hughes, Ph.D. Howard University

Additional sources Compilation of sources:

Package CIFsmry. July 10, Index 6

Package mcmcse. March 25, 2016

Optimization of sampling strata with the SamplingStrata package

Getting started with qplot

Package GSA. R topics documented: February 19, 2015

Scatter Plots with Error Bars

Lab 13: Logistic Regression

Outline. Topic 4 - Analysis of Variance Approach to Regression. Partitioning Sums of Squares. Total Sum of Squares. Partitioning sums of squares

Descriptive Statistics

Generating Random Numbers Variance Reduction Quasi-Monte Carlo. Simulation Methods. Leonid Kogan. MIT, Sloan , Fall 2010

A Handbook of Statistical Analyses Using R. Brian S. Everitt and Torsten Hothorn

containing Kendall correlations; and the OUTH = option will create a data set containing Hoeffding statistics.

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

Interpretation of Somers D under four simple models

Simple Linear Regression Inference

Association Between Variables

Simple Linear Regression, Scatterplots, and Bivariate Correlation

Binary Diagnostic Tests Two Independent Samples

HYPOTHESIS TESTING: POWER OF THE TEST

Package uptimerobot. October 22, 2015

Package tagcloud. R topics documented: July 3, 2015

Package bigdata. R topics documented: February 19, 2015

Descriptive Analysis

Package TRADER. February 10, 2016

Package sendmailr. February 20, 2015

Analysing Questionnaires using Minitab (for SPSS queries contact -)

Correlation Coefficient The correlation coefficient is a summary statistic that describes the linear relationship between two numerical variables 2

Practical Differential Gene Expression. Introduction

Package copa. R topics documented: August 9, 2016

Least Squares Estimation

INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA)

Package DCG. R topics documented: June 8, Type Package

Imputing Values to Missing Data

Chapter 5 Analysis of variance SPSS Analysis of variance

Paper DV KEYWORDS: SAS, R, Statistics, Data visualization, Monte Carlo simulation, Pseudo- random numbers

Chapter 7: Simple linear regression Learning Objectives

Visualizing Data. Contents. 1 Visualizing Data. Anthony Tanbakuchi Department of Mathematics Pima Community College. Introductory Statistics Lectures

Viewing Ecological data using R graphics

Curriculum Map Statistics and Probability Honors (348) Saugus High School Saugus Public Schools

Homework 11. Part 1. Name: Score: / null

Bill Burton Albert Einstein College of Medicine April 28, 2014 EERS: Managing the Tension Between Rigor and Resources 1

Study Guide for the Final Exam

X X X a) perfect linear correlation b) no correlation c) positive correlation (r = 1) (r = 0) (0 < r < 1)

Portfolio Distribution Modelling and Computation. Harry Zheng Department of Mathematics Imperial College

Package cpm. July 28, 2015

Journal of Statistical Software

Geostatistics Exploratory Analysis

Package SHELF. February 5, 2016

SPSS Tests for Versions 9 to 13

Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics

NCSS Statistical Software

Transcription:

Type Package Title Correlation of Bivariate Survival Times Version 1.0 Date 2015-02-25 Package SurvCorr February 26, 2015 Author Meinhard Ploner, Alexandra Kaider and Georg Heinze Maintainer Georg Heinze <georg.heinze@meduniwien.ac.at> Depends R (>= 3.0.2) Imports survival, fields Estimates correlation coefficients with associated confidence limits for bivariate, partially censored survival times. Uses the iterative multiple imputation approach proposed by Schemper, Kaider, Wakounig and Heinze, Statistics in Medicine 2013. Provides a scatterplot function to visualize the bivariate distribution, either on the original time scale or as copula. License GPL NeedsCompilation no Repository CRAN Date/Publication 2015-02-26 02:18:35 R topics documented: SurvCorr-package...................................... 2 diabetes........................................... 3 kidney............................................ 3 plot.survcorr......................................... 4 print.survcorr........................................ 5 summary.survcorr...................................... 6 survcorr........................................... 7 Index 10 1

2 SurvCorr-package SurvCorr-package Correlation Analysis of Survival Times by Iterative Multiple Imputation Details This R-package implements the iterative multiple imputation algorithm as proposed by Schemper, Kaider, Wakounig and Heinze (2013) for estimation of a correlation coefficient for bivariate possibly censored time-to-event data. Package: SurvCorr Type: Package Version: 1.0 Date: 2015-02-25 License: GPL The analysis of correlations within pairs of survival times is of interest to many research topics in medicine, such as the correlation of survival-type endpoints of twins, the correlation of times till failure in paired organs, or the correlation of survival time with a surrogate endpoint. The dependence of such times is assumed monotonic and thus quantification by rank correlation coefficients appropriate. The typical censoring of such times requires more involved methods of estimation and inference as have been developed in recent years. As an alternative to the maximum likelihood methodology for the normal copula approach (NCE) this package implements an iterative multiple imputation (IMI) method which requires only about 0.05% of the computing time of NCE, without sacrificing statistical performance. For IMI, survival probabilities at death or censoring times are first transformed to normal deviates. Then, those deviates which relate to censored times are iteratively augmented, by conditional multiple imputation, until convergence is obtained for the normal scores rank correlation, which is similar to Spearman s rank correlation. Schemper, Kaider, Wakounig and Heinze (2013) compared statistical properties of NCE and IMI by means of a Monte Carlo study and by means of three real data sets; two of them are available in this package. The package s main function is survcorr, accompanied by appropriate print and summary methods. A plot method can be used to visualize the bivariate distribution either as a copula, or as distribution of survival times. In the former case, one can plot original (uncensored) values along with univariately or bivariately imputed values. Author(s) Meinhard Ploner, Alexandra Kaider and Georg Heinze (e-mail: georg.heinze@meduniwien.ac.at) Schemper,M., Kaider,A., Wakounig,S. & Heinze,G. (2013): "Estimating the correlation of bivariate failure times under censoring", Statistics in Medicine, 32, 4781-4790 http://dx.doi.org/10.

diabetes 3 1002/sim.5874. diabetes Diabetes Data Format Source Diabetic retinopathy: how strongly are times to blindness of a treated and an untreated eye correlated in patients suffering from diabetic retinopathy? The analysis is based on a sample of n=197 paired failure times (censoring 73% and 49% for the treated and untreated eyes, respectively) described by Huster, Brookmeyer, and Self (1989). Both eyes of an individual are observed for the same time, and therefore dots on the diagonal generally indicate pairs of censored times. diabetes A data.frame containing 197 rows. http://www.mayo.edu/research/documents/diabeteshtml/doc-10027460 Huster WJ, Brookmeyer R, Self SG. Modelling paired survival data with covariates. Biometrics 1989; 45:145-156. kidney Kidney Data Infections under dialysis: does time until infection of the first application of a portable dialysis machine correlate with the time until recurrence of infection during the second application? The infection occurs at the point of insertion of the catheter, and, when it occurs, the catheter is removed. After successful treatment of the infection, the catheter is reinserted again. In the data set of 38 patients, the times until infection in both periods are reported as well as corresponding censoring indicators. Censoring of the times till infection occurs if the catheter is removed because of other reasons than infection. kidney

4 plot.survcorr Format Source A data.frame containing 38 rows and 5 variables (ID, time1, status1, time2, status2). McGilchrist and Aisbett (1991) McGilchrist CA, Aisbett CW.Regression with frailty in survival analysis. Biometrics 1991; 47:461-466. plot.survcorr Plot Correlated Bivariate Survival Times Produces a scatterplot of bivariate survival times, either on the original times scale or as copula (uniform marginal distributions). Censored observations are inserted either by their imputed values (copula plot) or marked by arrows (survival times plot). The first time variable will be plotted on the y-axis, the second on the x-axis. ## S3 method for class survcorr plot(x, what = "uniform", imputation = 1, xlab = switch(what, copula= expression(hat(f)(t[2])), uniform = expression(hat(f)(t[2])), times = expression(t[2])), ylab = switch(what, copula = expression(hat(f)(t[1])), uniform = expression(hat(f)(t[1])), times = expression(t[1])), xlim, ylim, main = switch(what, copula = "Bivariate Copula",uniform = "Bivariate Copula", times = "Bivariate Survival Times"), legend = TRUE, cex.legend = switch(what, copula = 0.8, uniform = 0.8, times = 0.7), pch = "*", colevent = "black", colimput = "gray",...) Arguments x what imputation xlab an object of class survcorr what should be plotted: "uniform" or "copula" to plot the bivariate copula, "times" to plot the survival times. The default is to plot the copula. If the copula is plotted, then the index of the imputated data set to be used to replace censored observation can be given (e.g., imputation=1:5. Default: imputation=1) An optional x-axis label.

print.survcorr 5 ylab An optional y-axis label. xlim Optional limits for x-axis. ylim Optional limits for y-axis. main Optional title. legend Optional legend. cex.legend Optional font size of legend. pch Optional plot character. colevent Color of symbols representing uncensored times (default="black"). colimput Color of symbols representing imputations for censored times (default="gray").... Further options to be passed to the plot function. Value no return value; function is called for its side effects Author(s) Meinhard Ploner, Alexandra Kaider, Georg Heinze Schemper,M., Kaider,A., Wakounig,S. & Heinze,G. (2013): "Estimating the correlation of bivariate failure times under censoring", Statistics in Medicine, 32, 4781-4790 http://dx.doi.org/10. 1002/sim.5874. Examples ## Example 2 data(diabetes) obj <- survcorr(formula1=surv(time1, STATUS1) ~ 1, formula2=surv(time2, STATUS2) ~ 1, data=diabetes, M=100, MCMCSteps=10, alpha=0.05, epsilon=0.001) plot(obj, "times") plot(obj, "copula", imputation=1) plot(obj, "copula", imputation=7) print.survcorr Print Method for survcorr Objects Print method for survcorr objects (correlation of bivariate survival times). Summarizes most important results: estimated correlation coefficient and confidence interval.

6 summary.survcorr ## S3 method for class survcorr print(x,...) Arguments x a survcorr object... additional options passed to print Details print method for objects of class survcorr Value the estimated correlation coefficient and lower and upper (1-alpha) confidence limits Author(s) Meinhard Ploner, Alexandra Kaider, Georg Heinze Schemper,M., Kaider,A., Wakounig,S. & Heinze,G. (2013): "Estimating the correlation of bivariate failure times under censoring", Statistics in Medicine, 32, 4781-4790 http://dx.doi.org/10. 1002/sim.5874. summary.survcorr Summary Method for survcorr Objects Summarizes results of a correlation analysis of bivariate survival times ## S3 method for class survcorr summary(object,...) Arguments object a survcorr object... further arguments passed to summary

survcorr 7 Details Value Summarizes the results of a correlation analysis of bivariate survival times. Beside of the calculated correlation coefficient and its confidence interval, a contingency table of the bivariate event status, some of the most important input parameters, as well as posterior mean and variance of the transformed correlation coefficients are printed. NULL Author(s) Meinhard Ploner, Alexandra Kaider, Georg Heinze Schemper,M., Kaider,A., Wakounig,S. & Heinze,G. (2013): "Estimating the correlation of bivariate failure times under censoring", Statistics in Medicine, 32, 4781-4790 http://dx.doi.org/10. 1002/sim.5874. survcorr Correlation Analysis of Survival Times by Iterative Multiple Imputation This R-package implements the iterative multiple imputation algorithm as proposed by Schemper, Kaider, Wakounig and Heinze (2013) for estimation of a correlation coefficient for bivariate possibly censored time-to-event data. survcorr(formula1, formula2, data, methods = "imi", alpha = 0.05, intra = FALSE, M = 10, MCMCSteps = 10, epsilon = 0.001, maxiter = 100) Arguments formula1 formula2 data methods alpha intra Survival object for first time-to-event variable, e.g. Surv(time1, status1)~1 Survival object for second time-to-event variable, e.g. Surv(time2, status2)~1 Data set to look up variables Correlation method(s). Currently, only "imi" (iterative multiple imputation) is implemented. One minus confidence level (for confidence interval computation) If TRUE, an intraclass correlation coefficient will be computed, assuming that the two time-to-event variables are interchangeable in each observation.

8 survcorr M MCMCSteps epsilon maxiter Number of imputations (for IMI) Number of MCMCSteps (for IMI) Accuracy of numerical estimation of correlation coefficients Maximum number of iterations in IMI Details The analysis of correlations within pairs of survival times is of interest to many research topics in medicine, such as the correlation of survival-type endpoints of twins, the correlation of times till failure in paired organs, or the correlation of survival time with a surrogate endpoint. The dependence of such times is assumed monotonic and thus quantification by rank correlation coefficients appropriate. The typical censoring of such times requires more involved methods of estimation and inference as have been developed in recent years. As an alternative to the maximum likelihood methodology for the normal copula approach (NCE) this package implements an iterative multiple imputation (IMI) method which requires only about 0.05% of the computing time of NCE, without sacrificing statistical performance. For IMI, survival probabilities at death or censoring times are first transformed to normal deviates. Then, those deviates which relate to censored times are iteratively augmented, by conditional multiple imputation, until convergence is obtained for the normal scores rank correlation, which is similar to Spearman s rank correlation. Schemper, Kaider, Wakounig and Heinze (2013) compared statistical properties of NCE and IMI by means of a Monte Carlo study and by means of three real data sets; two of them are available in this package. Value rho ci.lower ci.upper simdata M MCMCSteps rj.trans rj.t.mean var df alpha call estimated correlation coefficient lower limit of confidence interval for rho upper limit of confidence interval for rho imputed data sets for each iteration, with components M (number of imputations), z1m, z2m (imputed normal deviates), delta1, delta2 (censoring indicators), t1, t2 (imputed non-censored survival times) number of imputations number of MCMC steps in iterative imputation the M atanh-transformed correlation coefficients from M imputed data sets the posterior mean of the atanh-transofmred correlation coefficients over the M imputations the variance of atanh(rho), with components within, between and total the number of degrees of freedom (important for confidence interval computation) 1-confidence level the function call (useful for making use of update(obj)) Author(s) Meinhard Ploner, Alexandra Kaider, Georg Heinze

survcorr 9 Schemper,M., Kaider,A., Wakounig,S. & Heinze,G. (2013): "Estimating the correlation of bivariate failure times under censoring", Statistics in Medicine, 32, 4781-4790 http://dx.doi.org/10. 1002/sim.5874. Examples ## Example 1 data(kidney) obj = survcorr(formula1=surv(time1, STATUS1) ~ 1, formula2=surv(time2, STATUS2) ~ 1, data=kidney, M=1000, MCMCSteps=10, alpha=0.05, epsilon=0.001) ## Example 2 data(diabetes) obj = survcorr(formula1=surv(time1, STATUS1) ~ 1, formula2=surv(time2, STATUS2) ~ 1, data=diabetes, M=100, MCMCSteps=10, alpha=0.05, epsilon=0.001) plot(obj, "times")

Index Topic IMI plot.survcorr, 4 print.survcorr, 5 summary.survcorr, 6 survcorr, 7 SurvCorr-package, 2 Topic correlation plot.survcorr, 4 print.survcorr, 5 summary.survcorr, 6 survcorr, 7 SurvCorr-package, 2 Topic datasets diabetes, 3 kidney, 3 Topic survival plot.survcorr, 4 print.survcorr, 5 summary.survcorr, 6 survcorr, 7 SurvCorr-package, 2 diabetes, 3 kidney, 3 plot (plot.survcorr), 4 plot.survcorr, 4 print (print.survcorr), 5 print.survcorr, 5 summary (summary.survcorr), 6 summary.survcorr, 6 Surv (survcorr), 7 SurvCorr (SurvCorr-package), 2 survcorr, 7 SurvCorr-package, 2 10