More details on the inputs, functionality, and output can be found below.



Similar documents
Reliability estimators for the components of series and parallel systems: The Weibull model

Lecture 2 ESTIMATING THE SURVIVAL FUNCTION. One-sample nonparametric methods

Gamma Distribution Fitting

Personalized Predictive Medicine and Genomic Clinical Trials

Tests for Two Survival Curves Using Cox s Proportional Hazards Model

Applications of R Software in Bayesian Data Analysis

Bayesian Machine Learning (ML): Modeling And Inference in Big Data. Zhuhua Cai Google, Rice University

Nonparametric adaptive age replacement with a one-cycle criterion

Statistics in Retail Finance. Chapter 6: Behavioural models

Chapter 3 RANDOM VARIATE GENERATION

Generalized Linear Models

Journal of Statistical Software

A Latent Variable Approach to Validate Credit Rating Systems using R

STA 4273H: Statistical Machine Learning

Survival Analysis of Left Truncated Income Protection Insurance Data. [March 29, 2012]

SPSS TRAINING SESSION 3 ADVANCED TOPICS (PASW STATISTICS 17.0) Sun Li Centre for Academic Computing lsun@smu.edu.sg

Survey, Statistics and Psychometrics Core Research Facility University of Nebraska-Lincoln. Log-Rank Test for More Than Two Groups

Bayesian Phylogeny and Measures of Branch Support

STATISTICA Formula Guide: Logistic Regression. Table of Contents

SUMAN DUVVURU STAT 567 PROJECT REPORT

Endpoint Selection in Phase II Oncology trials

CHAPTER 3 EXAMPLES: REGRESSION AND PATH ANALYSIS

Probabilistic Methods for Time-Series Analysis

Modeling and Analysis of Call Center Arrival Data: A Bayesian Approach

Efficacy analysis and graphical representation in Oncology trials - A case study

Likelihood Approaches for Trial Designs in Early Phase Oncology

Bayesian Statistics: Indian Buffet Process

Imputing Values to Missing Data

Quantitative Inventory Uncertainty

Bayesian Analysis for the Social Sciences

Package dsmodellingclient

Validation of Software for Bayesian Models using Posterior Quantiles. Samantha R. Cook Andrew Gelman Donald B. Rubin DRAFT

A Bayesian hierarchical surrogate outcome model for multiple sclerosis

Tips for surviving the analysis of survival data. Philip Twumasi-Ankrah, PhD

Methods for Meta-analysis in Medical Research

Scatter Plots with Error Bars

An Introduction to Using WinBUGS for Cost-Effectiveness Analyses in Health Economics

If several different trials are mentioned in one publication, the data of each should be extracted in a separate data extraction form.

Lecture 15 Introduction to Survival Analysis

Web-based Supplementary Materials for Bayesian Effect Estimation. Accounting for Adjustment Uncertainty by Chi Wang, Giovanni

Supplement to Call Centers with Delay Information: Models and Insights

Characteristics of Global Calling in VoIP services: A logistic regression analysis

Problem of Missing Data

A Predictive Probability Design Software for Phase II Cancer Clinical Trials Version 1.0.0

Vignette for survrm2 package: Comparing two survival curves using the restricted mean survival time

> plot(exp.btgpllm, main = "treed GP LLM,", proj = c(1)) > plot(exp.btgpllm, main = "treed GP LLM,", proj = c(2)) quantile diff (error)

R2MLwiN Using the multilevel modelling software package MLwiN from R

Why Don t Lenders Renegotiate More Home Mortgages? Redefaults, Self-Cures and Securitization ONLINE APPENDIX

Sampling for Bayesian computation with large datasets

Evaluation of Treatment Pathways in Oncology: Modeling Approaches. Feng Pan, PhD United BioSource Corporation Bethesda, MD

Statistics Graduate Courses

Predicting Customer Default Times using Survival Analysis Methods in SAS

Two Related Samples t Test

Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means

Bayesian inference for population prediction of individuals without health insurance in Florida

Statistical Rules of Thumb

Parallelization Strategies for Multicore Data Analysis

How To Check For Differences In The One Way Anova

LCMM: a R package for the estimation of latent class mixed models for Gaussian, ordinal, curvilinear longitudinal data and/or time-to-event data

Confidence Intervals for One Standard Deviation Using Standard Deviation

An Introduction to Point Pattern Analysis using CrimeStat

Introduction. Survival Analysis. Censoring. Plan of Talk

Important Probability Distributions OPRE 6301

Fixed-Effect Versus Random-Effects Models

Bayesian Estimation Supersedes the t-test

SAS Syntax and Output for Data Manipulation:

Introduction to Bayesian Analysis Using SAS R Software

Confidence Intervals for the Difference Between Two Means

Adverse Impact Ratio for Females (0/ 1) = 0 (5/ 17) = Adverse impact as defined by the 4/5ths rule was not found in the above data.

SOCIETY OF ACTUARIES/CASUALTY ACTUARIAL SOCIETY EXAM C CONSTRUCTION AND EVALUATION OF ACTUARIAL MODELS EXAM C SAMPLE QUESTIONS

Using SAS PROC MCMC to Estimate and Evaluate Item Response Theory Models

Statistics in Medicine Research Lecture Series CSMC Fall 2014

Imputing Missing Data using SAS

A Bootstrap Metropolis-Hastings Algorithm for Bayesian Analysis of Big Data

Kaplan-Meier Survival Analysis 1

Bayesian Model Averaging CRM in Phase I Clinical Trials

2. Making example missing-value datasets: MCAR, MAR, and MNAR

Dirichlet Processes A gentle tutorial

THE RAPID ENROLLMENT DESIGN FOR PHASE I CLINICIAL TRIALS

Bayesian Statistics in One Hour. Patrick Lam

Integrating Financial Statement Modeling and Sales Forecasting

Time Series Analysis

Bayesian Statistical Analysis in Medical Research

Regression Modeling Strategies

Credibility and Pooling Applications to Group Life and Group Disability Insurance

BayesX - Software for Bayesian Inference in Structured Additive Regression

Using simulation to calculate the NPV of a project

STT315 Chapter 4 Random Variables & Probability Distributions KM. Chapter 4.5, 6, 8 Probability Distributions for Continuous Random Variables

Errata and updates for ASM Exam C/Exam 4 Manual (Sixteenth Edition) sorted by page

Learning outcomes. Knowledge and understanding. Competence and skills

The Kaplan-Meier Plot. Olaf M. Glück

Handling attrition and non-response in longitudinal data

CHAPTER 12 EXAMPLES: MONTE CARLO SIMULATION STUDIES

Transcription:

Overview: The SMEEACT (Software for More Efficient, Ethical, and Affordable Clinical Trials) web interface (http://research.mdacc.tmc.edu/smeeactweb) implements a single analysis of a two-armed trial comparing a novel therapy to a previously-studied control on which reliable historical information is available. The software s hierarchical Bayesian statistical approach uses either a simple exponential (see Section 4.1 of reference [1] below) or piecewise exponential (reference [2]) likelihood for time-to event data, and is formulated to borrow strength dynamically from the historical control data. The procedure facilitates more partial pooling of the control information in the absence of evidence for heterogeneity. The hierarchical model employs a sparsity-inducing spike-and-slab hyperprior distribution on the hierarchical precision parameter whose two components can be specified to vary between a uniform distribution to a two-point mixture distribution between roughly no borrowing and near exchangeability. The web interface enables the user to analyze either a sample dataset provided with the software, or the user s own data uploaded to the compute server. To determine the appropriate number of pieces in the piecewise exponential likelihood [2], the software uses the AIC-optimal time-axis partition to conduct joint posterior inference on the historical and concurrent data via implementation of an AIC search (using the current data alone) among equidistant quantiles of width determined by the parameter Min.Event. Additionally, an AIC score is computed for the simple exponential model used in [1] having constant baseline hazard over the entire follow-up period. If the AIC-optimal partition in the first step contains at least two intervals, the piecewise exponential model [2] is used; otherwise, the simple exponential model is used. Program output includes information about the treatment effect parameter ξ, which is essentially a log-acceleration factor used in a proportional hazards adjustment of the baseline hazard function to capture the effect of the novel treatment. Posterior summaries are provided for this treatment effect ξ, corresponding hazard rate ratio (HR), and median survival for treatment and control. In addition, we provide the effective historical sample size (EHSS, the effective number of controls being borrowed from the historical data) and the novel treatment randomization probability one should use for the next patient given the results of the current interim analysis, a function of EHSS and the number of future patients remaining to enroll into the study, R. More details on the inputs, functionality, and output can be found below. Inputs: The software involves 7 inputs. Analysis of the program-supplied example data uses the Default data option, while analysis of user-supplied external data instead uses the Upload data option. Finally, the Quick demo option outputs results for the default data obtained from analysis using the default design and hyperparameters, with no user input permitted.

With the Upload data option, the user uploads to the compute server using Browse to select the proper local directory path. The data must consist of separate.csv files containing the historical (dat.h) and concurrent (dat.c) datasets (see examples). Additionally, the user must specify the planned number of future patients that remain to enroll into the trial (R). Default values are specified for the other 4 inputs with the option for the user to modify. Inputs are described in detail as follows: - Historical Data must contain o a single column labeled T containing time-to-event duration for each patient (assumed unique by row) o a single column labeled EVENT corresponding indicator that the event was observed (1=event, 0=right-censored) o example data file historical.csv - Current Data must contain o a single column labeled T containing time-to-event duration for each patient (assumed unique by row) o a single column labeled EVENT containing indicators that the event was observed (1=event, 0=right-censored) o a single column labeled TREATMENT containing indicators that the patient received the novel treatment (1=treatment, 0=control) o example data file current.csv - R = Additional number of future patients remaining to be randomized o Used to calculate the adaptive randomization probability for assigning the next patient to enroll to novel treatment. (acceptable values: R>0). o Set by default to 100 in the example analysis. - Min.Event denotes the minimum number of events required within each interval of the AIC-optimal time-axis partition used in the piecewise exponential model likelihood. It is set by default to 30 with the option for the user to modify (acceptable values: 1 Min.Event N). - S_u is a hyperparameter in Bayesian hierarchical model characterizing the upper bound of the slab specified in the hyperprior for τ. It is set by default to 2 (the value specified in reference [2]) with the option for the user to modify (acceptable values: 0.1 S_u 5000). Setting S_u equal to the location of the spike (which is fixed at 5000, again per reference [2]) yields a simple Uniform(0.1, 5000) hyperprior distribution for τ. - p_0 is a hyperparameter in the Bayesian hierarchical model characterizing one minus the prior probability mass assigned to the spike (again, with location fixed at 5000 per [2]). It is set by default to 0.5 (the value specified in [2]) with the option for the user to modify (acceptable values: 0 p_0 1).

- MCMC Iterations = number Markov chain Monte Carlo samples to be used to conduct posterior inference implemented within each of 4 parallel chains after a burn-in period of the same length. Values are restricted to (1000-6000). Larger values extend run time durations required for implementation of the MCMC algorithm and post-processing of the resultant samples. Functionality: The web interface performs the following: 1) Computes Kaplan-Meier curves for both the historical controls and current time to event data, generates corresponding figures. 2) Performs search for AIC-optimal time-axis partition among observed equidistant quantiles of width determined by parameter Min.Event using the concurrent data alone. Also considers AIC in the absence of a partition of the time-axis, corresponding to a simple exponential model having constant baseline hazard over the entire follow-up period. 3) Fits the joint Bayesian hierarchical model with AIC-optimal time-axis partition. If the partition contains at least two intervals then the piecewise exponential model described in [2] used. Otherwise, the exponential model described in Section 4.1 of [1] is used. 4) Fits the corresponding reference models to the concurrent data alone. 5) Computes the effective historical sample size (EHSS) (equation (9) in reference [2]), and the treatment randomization probability appropriate for the next patient to enroll after an interim analysis (equation (11) in reference [2]), a function of the allocation of patients to treatment and control arms, EHSS, and the number of additional future enrollments R. 6) Computes posterior summaries for the novel treatment effect (ξ), corresponding hazard rate ratio (HR), and median survival for treatment and control from the MCMC samples. 7) Generates a figure depicting posterior survival curves corresponding to control and treatment cohorts. References: 1. Hobbs, B.P., Sargent, D.J., and Carlin, B.P. (2012). Commensurate priors for incorporating historical information in clinical trials using general and generalized linear models. Bayesian Analysis, 7: 639 674. 2. Hobbs, B.P., Carlin, B.P., and Sargent, D.J. (2013). Adaptive adjustment of the randomization ratio using historical control data. Clinical Trials, 10: 430-440.

3. Hobbs, B.P., Carlin, B.P., Mandrekar, S., and Sargent, D.J. (2011). Hierarchical commensurate and power prior models for adaptive incorporation of historical information in clinical trials. Biometrics, 67: 1047--1056. Outputs: Example table and figures generated using the default, example data (designed to resemble the colon cancer data analyzed in [2]) are as follows: Table 1: Posterior summary for novel treatment effect ξ, median survival for control and treatment; effective historical sample size (EHSS), and the probability used to randomized to treatment the next patient to enroll (AR probability) Parameter Mean SD Mean HR (95% HPD) Pr( HR<1 Data ) ξ -0.372 0.098 0.72(0.57-0.87) 0.999 Median Mean (95% HPD) control 305(279-333) treatment 373(330-420) # Current treated 200 # Current controls 200 # Historical controls 200 EHSS 60 AR probability 0.798 Calculation Time (sec.) 5 Note. SD = posterior standard deviation; HR = hazard rate ratio; HPD = highest posterior density interval; Pr( HR<1 Data ) = posterior tail probability < 1 for the hazard rate ratio indicating strength of evidence for an improvement associated with treatment arm Note that in this example, the treatment effect is significant, with 95% highest posterior density (HPD) credible interval for the hazard rate ratio (HR) excluding the null value (1.0). Also, the evidence for commensurability is very strong, leading to a high EHSS and correspondingly large adaptive randomization (AR) probabilities (shown assuming that R=100 future patients remain to be enroll into the trial), the result of the procedure having decided to use the historical controls and thus leading to a desire for more future patients to be assigned to novel treatment in order to achieve balance at the study s completion. Additional information on the adaptive randomization probability function can be found in Appendix.

Figure 1: Kaplan Meier curve derived from the historical control data, with 95% logtransformed pointwise confidence intervals. Right-censored observations are marked by +. Figure 2: Kaplan Meier curves derived from the current control and treatment data, with 95% log-transformed pointwise confidence intervals. Right-censored observations are marked by +.

Figure 3: Posterior mean survival curves with 95% pointwise highest posterior density credible intervals derived from Bayesian analysis of the historical and current data using the joint commensurate prior model.

Appendix: Adaptive randomization probability Figure 4 below illustrates the adaptive randomization probability (11 in reference [2]) as a function of R (the number of future patients remaining to enroll into the trial) for our example analysis summarized in Table 1. Let and denote the numbers of patients assigned to control and treatment arms in the current trial. The adaptive randomization probability for the next patient will be 1 if EHSS R. In our example an equal numbers of patient have been assigned to the control and treatment arms ( ) at the time of the example interim analysis. In this scenario, if EHSS is larger than the number of patients remaining to enroll into the trial then maximal possible balance is achieved by assigning all remaining patients to the novel treatment. Thus, the AR probability is equal to 1 for R EHSS. For R > EHSS, the AR probability decreases from 1 in proportion to the number of future patients that could be assigned to control while still achieving balance after the trial s final enrollment (relative to the current value of EHSS). The AR probability of 0.798 reported in Table 1 corresponds to the case where R=100 additional patients remain to enroll into the trial. Figure 4: Adaptive randomization probability (11 in reference [2]) as a function of the number of patients remaining to enroll into the trial derived from analysis using the example data.