Cancer Biostatistics Workshop Science of Doing Science - Biostatistics

Similar documents
False Discovery Rates

Statistical issues in the analysis of microarray data

Likelihood Approaches for Trial Designs in Early Phase Oncology

Study Design and Statistical Analysis

Statistical Analysis Strategies for Shotgun Proteomics Data

Service courses for graduate students in degree programs other than the MS or PhD programs in Biostatistics.

Redwood Building, Room T204, Stanford University School of Medicine, Stanford, CA

Endpoint Selection in Phase II Oncology trials

Personalized Predictive Medicine and Genomic Clinical Trials

Statistical Analysis. NBAF-B Metabolomics Masterclass. Mark Viant

Introduction. Hypothesis Testing. Hypothesis Testing. Significance Testing

Department/Academic Unit: Public Health Sciences Degree Program: Biostatistics Collaborative Program

PubH 7470: STATISTICS FOR TRANSLATIONAL & CLINICAL RESEARCH

Tutorial for proteome data analysis using the Perseus software platform

Analysis Issues II. Mary Foulkes, PhD Johns Hopkins University

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

Statistics Graduate Courses

Bayesian Phase I/II clinical trials in Oncology

Data, Measurements, Features

Descriptive Statistics

Fixed-Effect Versus Random-Effects Models

II. DISTRIBUTIONS distribution normal distribution. standard scores

Applying Statistics Recommended by Regulatory Documents

Regression Analysis: A Complete Example

Statistics in Medicine Research Lecture Series CSMC Fall 2014

2013 MBA Jump Start Program. Statistics Module Part 3

Introduction to Regression and Data Analysis

Introduction to Longitudinal Data Analysis

From Reads to Differentially Expressed Genes. The statistics of differential gene expression analysis using RNA-seq data

Comparative genomic hybridization Because arrays are more than just a tool for expression analysis

Gene Expression Analysis

Functional Data Analysis of MALDI TOF Protein Spectra

Validation and Calibration. Definitions and Terminology

Simple Linear Regression Inference

BIOINF 525 Winter 2016 Foundations of Bioinformatics and Systems Biology

Principles of Hypothesis Testing for Public Health

Case Study in Data Analysis Does a drug prevent cardiomegaly in heart failure?

CLINICAL TRIALS: Part 2 of 2

Adequacy of Biomath. Models. Empirical Modeling Tools. Bayesian Modeling. Model Uncertainty / Selection

Guide to Biostatistics

Statistical Considerations for Phase II Trials

Introduction to Statistics and Quantitative Research Methods

Use advanced techniques for summary and visualization of complex data for exploratory analysis and presentation.

Chapter 5 Analysis of variance SPSS Analysis of variance

Section Format Day Begin End Building Rm# Instructor. 001 Lecture Tue 6:45 PM 8:40 PM Silver 401 Ballerini

Course Course Name # Summer Courses DCS Clinical Research 5103 Questions & Methods CORE. Credit Hours. Course Description

Likelihood: Frequentist vs Bayesian Reasoning

NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( )

Section 13, Part 1 ANOVA. Analysis Of Variance

Statistical Rules of Thumb

Package ERP. December 14, 2015

Validation of measurement procedures

Why Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization. Learning Goals. GENOME 560, Spring 2012

HYPOTHESIS TESTING WITH SPSS:

Biostatistics Short Course Introduction to Longitudinal Studies

Study Design. Date: March 11, 2003 Reviewer: Jawahar Tiwari, Ph.D. Ellis Unger, M.D. Ghanshyam Gupta, Ph.D. Chief, Therapeutics Evaluation Branch

School of Public Health and Health Services Department of Epidemiology and Biostatistics

Example: Credit card default, we may be more interested in predicting the probabilty of a default than classifying individuals as default or not.

Molecular markers and clinical trial design parallels between oncology and rare diseases?

CONTENTS OF DAY 2. II. Why Random Sampling is Important 9 A myth, an urban legend, and the real reason NOTES FOR SUMMER STATISTICS INSTITUTE COURSE

Power Analysis: Intermediate Course in the UCLA Statistical Consulting Series on Power

Sample Size Planning, Calculation, and Justification

Accelerating Development and Approval of Targeted Cancer Therapies

Analytical Test Method Validation Report Template

Biomarker Discovery and Data Visualization Tool for Ovarian Cancer Screening

Introduction to Data Mining

Multivariate Normal Distribution

Testing Multiple Secondary Endpoints in Confirmatory Comparative Studies -- A Regulatory Perspective

BIOS 6660: Analysis of Biomedical Big Data Using R and Bioconductor, Fall 2015 Computer Lab: Education 2 North Room 2201DE (TTh 10:30 to 11:50 am)

AVOIDING BIAS AND RANDOM ERROR IN DATA ANALYSIS

STATISTICA Formula Guide: Logistic Regression. Table of Contents

Planning sample size for randomized evaluations

One-Way Analysis of Variance (ANOVA) Example Problem

10:20 11:30 Roundtable discussion: the role of biostatistics in advancing medicine and improving health; career as a biostatistician Monday

MTH 140 Statistics Videos

Two-Way ANOVA tests. I. Definition and Applications...2. II. Two-Way ANOVA prerequisites...2. III. How to use the Two-Way ANOVA tool?...

Data Mining and Machine Learning in Bioinformatics

Consider a study in which. How many subjects? The importance of sample size calculations. An insignificant effect: two possibilities.

BASIC COURSE IN STATISTICS FOR MEDICAL DOCTORS, SCIENTISTS & OTHER RESEARCH INVESTIGATORS INFORMATION BROCHURE

Protein Protein Interaction Networks

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression

AC - Clinical Trials

Fairfield Public Schools

Logistic Regression (1/24/13)

How To Check For Differences In The One Way Anova

COMPARISONS OF CUSTOMER LOYALTY: PUBLIC & PRIVATE INSURANCE COMPANIES.

Analysis and Interpretation of Clinical Trials. How to conclude?

Gene expression analysis. Ulf Leser and Karin Zimmermann

Measurement Systems Correlation MSC for Suppliers

Data Mining Techniques Chapter 5: The Lure of Statistics: Data Mining Using Familiar Tools

August 2012 EXAMINATIONS Solution Part I

Study Guide for the Final Exam

A POPULATION MEAN, CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

Statistics I for QBIC. Contents and Objectives. Chapters 1 7. Revised: August 2013

Chapter 9. Two-Sample Tests. Effect Sizes and Power Paired t Test Calculation

6: Introduction to Hypothesis Testing

The Product Review Life Cycle A Brief Overview

Comparison of Non-linear Dimensionality Reduction Techniques for Classification with Gene Expression Microarray Data

Data Integration. Lectures 16 & 17. ECS289A, WQ03, Filkov

Elements of statistics (MATH0487-1)

Transcription:

Cancer Biostatistics Workshop Science of Doing Science - Biostatistics Yu Shyr, PhD Jan. 18, 2008 Cancer Biostatistics Center Vanderbilt-Ingram Cancer Center Yu.Shyr@vanderbilt.edu

Aims Cancer Biostatistics Workshop Introduction. Issues of the experiment design of biomedical research adaptive, Bayesian. P-value, type I error, FDR and methods of multiple comparison. Topics of the high-dimensional data experiment including the assessment of the quality control. Tools of the visualization.

Series One: What You Probably Don t Know About Biostatistics January 18 th - Yu Shyr, PhD The Science of Doing Science Biostatistics February 15 th Dan Ayers, MS Skittles, An Ounce of Measurement, and Intentional Science March 21 st Ayumi Shintani, MPH, PhD Introduction to Power and Sample Size Estimation

Series Two: Essentials of Efficient Experimental Design April 18 th - Tatsuki Koyama, PhD Phase II Clinical Trials May 16 th - Leena Choi, PhD Adaptive Designs in Dose-Finding (Phase I) Studies: A Bayesian Approach June 20 th - Fei Ye, PhD Phase III Clinical Trials

Series Three: High-Dimensional Data August 15 th - Chun Li, PhD Genome-wide Association Analysis for the Shanghai Breast Cancer Study September 19 th - Irene Feurer, PhD An Introduction to Principal Component Analysis and Data Reduction Methodologies October 17 th - Ming Li, PhD Statistical Analysis Strategies for MALDI-TOF and Shotgun Proteomics Data

Special Topics: Observational Data and Epidemiology November 21 st - Ayumi Shintani, MPH, PhD Introduction to Observational Study Design ~~---~-----------~---~~ Please call 6-2502 or e-mail shaun.m.haskins@vanderbilt.edu if you have any questions or feedback.

Type of the Biomedical Studies in vitro cell lines in vivo animal models Clinical Trials Phase I, II, III, IV Cohort Study Prospective, Retrospective Case-Control Study Nested Case-Control

Experiment Design Primary objective of the study Primary hypothesis of the study Primary endpoint of the study How many groups? How many time points? Preliminary data

Adaptive Experiment Design Adaptive trial design refers to a clinical trial methodology that allows trial design modifications to be made after patients have been enrolled in a study, without compromising the scientific method. In order to maintain the integrity of the trial, these modifications should be clearly defined in the protocol. When designed well, an adaptive trial empowers sponsors to respond to data collected during the trial. This is achieved by re-focusing the trial in a way that maximizes the impact of each subject s contribution. Examples of adaptive trial designs include dropping a treatment arm, modifying the sample size, balancing treatment assignments using adaptive randomization or simply stopping a study early for success or failure.

Two Stage Design One of the primary objectives of a phase II clinical trial of a new drug or regimen is to determine whether it has sufficient biological activity against the disease under study to warrant more extensive development. Several two-stage Phase II designs will be discussed here Gehan, Fleming, Simon s Optimal, Simons s Minimax, Balanced Design

Gehan Design Gehan s (1961) design is the oldest design. It is a twostage design for estimating the response rate but providing for early termination if the drug shows insufficient antitumor activity. The design is most commonly used with a first stage of 14 patients. If no responses (completed response or partial response) are observed, the trial is terminated. The rationale for stopping is that if the true response probability were at least 20%, at least one response would very likely have been observed in the first 14 patients. If no responses are seen, it is unlikely that the true response probability is at least 20%.

If at least one response is observed in the first 14 patients, then a second stage of accrual is carried out to obtain an estimate of the response rate. The number of patients to accrue in the second stage depends upon the number of responses observed in the first stage and upon the precision desired for the final estimate of response rate. If the first stage consists of 14 patients, the second stage will consist of between one and 11 patients if a standard error of 10% is desired. The second stage will consist of between 45 and 86 patients if a standard error of 5% is desired.

Simon s Optimal Design The optimal two-stage designs (Simon, 1989) are optimal in the sense that the expected sample size is minimized if the regimen has low activity subject to constraints upon the size of the type I ( ) and type II ( ) errors. Example of Simon s Optimal Design: Clinical uninteresting level = 5% response rate, Clinical interesting level = 20% response rate, type I error ( ) = 0.05 type II error ( ) = 0.20 or Power = 0.80 (Power = 1 type II error). Stage I Reject drug if response rate 0/10 Stage II Reject drug if response rate 3/29

Frequentist versus Bayesians Frequentists: α and β errors Bayesians: Quantify designs with other properties General philosophy Start with prior information ( prior distribution ) Observe data ( likelihood function ) Combine prior and data to get posterior distribution Make inferences based on posterior

Bayesian inference No p-values and confidence intervals From the posterior distribution: Posterior probabilities Prediction intervals Credible intervals Bayesian designs Can look at data as often as you like (!) Use information as it accumulates Make what if? calculations Helps decide whether to stop now or not

Bayesian Designs Requires prior Reflects uncertainty about the response rate Can be vague, uninformative Can be controversial: inference may change

Bayesian Designs

Posterior Probabilities

Other priors What if we had used a different prior? Assume informative orange prior

Experiment Design Variability of In Situ Proteomic Profiling and Implications for Study Design in Colorectal Tumors International Journal of Oncology (2007)

Overview The comprehension of intrinsic tumor heterogeneity is vital for understanding of tumor progression mechanisms as well as for providing efficient treatments. In situ proteomic profiling of tumors is a powerful technology with potential to enhance our understanding of tumor biology, but sources of variability due to patient and tumor heterogeneity are poorly understood and are the topic of this investigation.

Questions Is proteomic profiling in adenomas more homogenous across patients than in normal mucosa specimens? Does primary carcinoma exhibit greater heterogeneity than normal mucosa and adenomas? Is Inter- and intra-case variability approximately equal for protein peaks? What is the optimal number of sub-samples per case that will reduce the total number of cases required? How to characterize intra- and inter-case variability of highthroughput protein expression in colorectal tumors?

Table IV Power = 80% Type I error = 5% Intra-Case Variance 0.2 0.5 1.0 Subsample Inter-Case Variance Inter-Case Variance Inter-Case Variance Number (m) 0.2 0.5 1.0 0.2 0.5 1.0 0.2 0.5 1.0 1 6 11 19 11 16 24 19 24 32 5 4 9 17 5 10 17 7 11 19 20 4 8 16 4 9 16 4 9 17

P-value, Type I error, FDR, and multiple comparison Type I error: For testing a null hypothesis, H0, against an alternative hypothesis, the type I error of the test is the rejection of the null hypothesis when it is true. The p value of a statistical test is the probability of observing a sample outcome as extreme or more extreme as the one observed. This probability is computed under the assumption that the null hypothesis is true.

y Linear Regression 15 group=1 group=2 10 5 0 15 group=3 group=4 10 5 0 5 10 15 20 x 5 10 15 20 Graphs by group

Linear Regression Regression Results: Slope 0.5 (p=0.0022) Parameter Standard T for H0: Variable DF Estimate Error Parameter=0 Prob > T INTERCEP 1 3.000091 1.12474681 2.667 0.0257 X 1 0.500091 0.11790550 4.241 0.0022 Which graph on the previous slide corresponds to this analysis?

Solutions for the Multiple Comparison Problem A MCP Solution Must Control False Positives How to measure multiple false positives? Familywise Error Rate (FWER) Chance of any false positives Controlled by Bonferroni & Random Field Methods False Discovery Rate (FDR) Proportion of false positives among rejected tests

False Discovery Rate Accept Reject Null True (no diff) V 0A V 0R m 0 Null False V 1A V 1R m 1 N A N R V Observed FDR Obs FDR = V 0R /(V 1R +V 0R ) = V 0R /N R If N R = 0, obsfdr = 0 Only know N R, not how many are true or false Control is on the expected FDR FDR = E(obsFDR)

p-value 0 1 Benjamini & Hochberg Procedure Select desired limit q on FDR Order p-values, p (1) p (2)... p (V) Let r be largest i such that JRSS-B (1995) 57:289-300 p (i) i / v q / c(v) p (i) Reject all hypotheses corresponding to p (1),..., p (r). i / v i/v q/c(v) 0 1

Benjamini & Hochberg: Key Properties FDR is controlled E(obsFDR) q m 0 /V Conservative, if large fraction of nulls false Adaptive Threshold depends on amount of feature More feature, More small p-values, More p (i) less than i/v q/c(v)

Benjamini & Hochberg Procedure c(v) = 1 Positive Regression Dependency on Subsets P(X 1 c 1, X 2 c 2,..., X k c k X i =x i ) is non-decreasing in x i Only required of test statistics for which null true Special cases include Independence Multivariate Normal with all positive correlations Same, but studentized with common std. err. c(v) = i=1,...,v 1/i log(v)+0.5772 Arbitrary covariance structure Benjamini & Yekutieli (2001). Ann. Stat. 29:1165-1188

High Dimensional Data The major challenge in high throughput experiments, e.g., microarray data, MALDI-TOF data, Array CGH data, Shotgun Proteomic data, or SNP data, is that the data is often high dimensional. When the number of dimensions reaches thousands or more, the computational time for the pattern recognition algorithms can become unreasonable. This can be a problem, especially when some of the features are not discriminatory.

Statistical Determinants of Analytic Uncertainty Contributions to uncertainty can come from a variety of resources: Analytic variation Sampling methods Sample preparation Biological Heterogeneity in Population Specimen Collection/Handling Effects - Tumor: surgical related effects - Cell Line: culture condition Biological Heterogeneity in Specimen

Statistical Determinants of Analytic Uncertainty ISO 17025 5.4.6.3 When estimating the uncertainty of measurement, all uncertainty components which are of importance in the given situation shall be taken into account using appropriate methods of analysis. repeatability standard deviation (intra-lab) reproducibility standard deviation (inter-lab) intermediate precision (more factors in the system)

Statistical Determinants of Analytic Uncertainty The main objectives in performing an R&R study are to identify and quantify the absolute and relative contribution of each source of variation, to decide if the measurement process is adequate or not and, if not, to correct the errors by recalibrating the instrument, training the operators, other mathematical corrections, etc. In quality control and quality assurance, the goal is to reduce the variability in the system and, consequently, the variability in the biomarker measurement.

Quality Control Assessment Reproducibility Correlation of Variation (CV) SD/Mean Intra-class Correlation Coefficient (ICC) Intra / Intra + Inter Variance Component Analysis Mixed/Random Effect Model. The model: investigators, day, spot, machine, lab, etc. Goal Make sure the data is reproducible!!

Accuracy WFCCM Class Prediction Model 1 0.9 0.8 0.7 0.6 0.5 0.4 0.3 0.2 0.1 Training data set - 93 vs 91 Testing data set - 52 vs 58 0 0 50 100 150 200 250 300 Features

Receiver Operating Characteristic (ROC) Curve Training Test

WFCCM Class Prediction Model 1 9 0.9 0.8 0.7 0.6 0.5 0.4 0.3 0.2 0.1 0 0 50 100 150 200 250 300 % GP1 % GP2 % correct B % Gp1 B % Gp2 B %

Receiver Operating Characteristic (ROC) Curve Training Test

Hierarchical clustering of gene expression data 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21

Multidimensional scaling (MDS)

Multidimensional Scaling

Agulnik, M. et al. J Clin Oncol; 25:2184-2190 2007 Multidimensional Scaling

Dear Dr. Shyr, Statistical Issues in Clinical Research - Design, Analysis and Interpretation I am a fellow in the Department of...we put together a paper recently... I had performed the statistical analysis incorrectly... A reviewer pointed out that with so many t tests some might give a significant result by chance and suggested we present ANOVA F statistics instead...we will pay the full cost of any such work you may perform... Thanks and Regards Dr...

Statistical Issues in Clinical Research - Design, Analysis and Interpretation When Do You Need a Biostatistician? Study Design Implementation Statistical Data Analysis Abstract Writing Paper Writing Letter to the paper reviewer

END