UNTIED WORST-RANK SCORE ANALYSIS



Similar documents
2. Making example missing-value datasets: MCAR, MAR, and MNAR

Sensitivity Analysis in Multiple Imputation for Missing Data

Permutation Tests for Comparing Two Populations

MEASURES OF LOCATION AND SPREAD

Part 3. Comparing Groups. Chapter 7 Comparing Paired Groups 189. Chapter 8 Comparing Two Independent Groups 217

3.4 Statistical inference for 2 populations based on two samples

Basic Statistical and Modeling Procedures Using SAS

HYPOTHESIS TESTING: POWER OF THE TEST

BA 275 Review Problems - Week 6 (10/30/06-11/3/06) CD Lessons: 53, 54, 55, 56 Textbook: pp , ,

Outline. Topic 4 - Analysis of Variance Approach to Regression. Partitioning Sums of Squares. Total Sum of Squares. Partitioning sums of squares

Analysis Issues II. Mary Foulkes, PhD Johns Hopkins University

Non-Parametric Tests (I)

CHAPTER 14 NONPARAMETRIC TESTS

Two-Sample T-Tests Allowing Unequal Variance (Enter Difference)

Dongfeng Li. Autumn 2010

Two-sample t-tests. - Independent samples - Pooled standard devation - The equal variance assumption

Non-Inferiority Tests for Two Means using Differences

Introduction to mixed model and missing data issues in longitudinal studies

Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means

In the general population of 0 to 4-year-olds, the annual incidence of asthma is 1.4%

NONPARAMETRIC STATISTICS 1. depend on assumptions about the underlying distribution of the data (or on the Central Limit Theorem)

Practice problems for Homework 12 - confidence intervals and hypothesis testing. Open the Homework Assignment 12 and solve the problems.

Permuted-block randomization with varying block sizes using SAS Proc Plan Lei Li, RTI International, RTP, North Carolina

5. Linear Regression

Two-Sample T-Tests Assuming Equal Variance (Enter Means)

Math 151. Rumbos Spring Solutions to Assignment #22

Chapter 4. Probability Distributions

Lloyd Spencer Lincoln Re

An Introduction to Statistics Course (ECOE 1302) Spring Semester 2011 Chapter 10- TWO-SAMPLE TESTS

Nominal and ordinal logistic regression

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

Parametric and non-parametric statistical methods for the life sciences - Session I

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression

THE FIRST SET OF EXAMPLES USE SUMMARY DATA... EXAMPLE 7.2, PAGE 227 DESCRIBES A PROBLEM AND A HYPOTHESIS TEST IS PERFORMED IN EXAMPLE 7.

EPS 625 INTERMEDIATE STATISTICS FRIEDMAN TEST

The Wilcoxon Rank-Sum Test

Introduction. Hypothesis Testing. Hypothesis Testing. Significance Testing

Interpretation of Somers D under four simple models

NCSS Statistical Software

Least Squares Estimation

Syntax Menu Description Options Remarks and examples Stored results Methods and formulas References Also see. level(#) , options2

R 2 -type Curves for Dynamic Predictions from Joint Longitudinal-Survival Models

Sample Size Calculation for Longitudinal Studies

JANUARY 2016 EXAMINATIONS. Life Insurance I

An Introduction to Statistical Tests for the SAS Programmer Sara Beck, Fred Hutchinson Cancer Research Center, Seattle, WA

Data Mining: An Overview of Methods and Technologies for Increasing Profits in Direct Marketing. C. Olivia Rud, VP, Fleet Bank

1-3 id id no. of respondents respon 1 responsible for maintenance? 1 = no, 2 = yes, 9 = blank

Lecture Notes Module 1

C. The null hypothesis is not rejected when the alternative hypothesis is true. A. population parameters.

A MULTIVARIATE TEST FOR SIMILARITY OF TWO DISSOLUTION PROFILES

NCSS Statistical Software. One-Sample T-Test

Mind on Statistics. Chapter 13

Study Design. Date: March 11, 2003 Reviewer: Jawahar Tiwari, Ph.D. Ellis Unger, M.D. Ghanshyam Gupta, Ph.D. Chief, Therapeutics Evaluation Branch

Chapter 4 Statistical Inference in Quality Control and Improvement. Statistical Quality Control (D. C. Montgomery)

A LONGITUDINAL AND SURVIVAL MODEL WITH HEALTH CARE USAGE FOR INSURED ELDERLY. Workshop

Imputation of missing data under missing not at random assumption & sensitivity analysis

Data Mining and Data Warehousing. Henryk Maciejewski. Data Mining Predictive modelling: regression

Interim Analysis in Clinical Trials

Supplementary appendix

Calculating P-Values. Parkland College. Isela Guerra Parkland College. Recommended Citation

Chapter 4: Statistical Hypothesis Testing

Statistics in Medicine Research Lecture Series CSMC Fall 2014

The power of a test is the of. by using a particular and a. value of the that is an to the value

Two-sample hypothesis testing, II /16/2004

Likelihood Approaches for Trial Designs in Early Phase Oncology

Introduction to Hypothesis Testing. Hypothesis Testing. Step 1: State the Hypotheses

Paper PO06. Randomization in Clinical Trial Studies

Handling missing data in Stata a whirlwind tour

Personalized Predictive Medicine and Genomic Clinical Trials

Solution. Let us write s for the policy year. Then the mortality rate during year s is q 30+s 1. q 30+s 1

Testing Group Differences using T-tests, ANOVA, and Nonparametric Measures

Tests for Two Proportions

Experimental Design for Influential Factors of Rates on Massive Open Online Courses

DATA INTERPRETATION AND STATISTICS

Modeling Lifetime Value in the Insurance Industry

Permutation & Non-Parametric Tests

Komorbide brystkræftpatienter kan de tåle behandling? Et registerstudie baseret på Danish Breast Cancer Cooperative Group

Basics of Statistical Machine Learning

Comparing Two Groups. Standard Error of ȳ 1 ȳ 2. Setting. Two Independent Samples

Premium Calculation - continued

The Relationship Between Rodent Offspring Blood Lead Levels and Maternal Diet

Survey, Statistics and Psychometrics Core Research Facility University of Nebraska-Lincoln. Log-Rank Test for More Than Two Groups

Nonparametric Statistics

Missing Data Sensitivity Analysis of a Continuous Endpoint An Example from a Recent Submission

Statistics. One-two sided test, Parametric and non-parametric test statistics: one group, two groups, and more than two groups samples

Section 13, Part 1 ANOVA. Analysis Of Variance

SOCIETY OF ACTUARIES/CASUALTY ACTUARIAL SOCIETY EXAM C CONSTRUCTION AND EVALUATION OF ACTUARIAL MODELS EXAM C SAMPLE QUESTIONS

INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA)

Survey Analysis: Options for Missing Data

Recall this chart that showed how most of our course would be organized:

2 Precision-based sample size calculations

Mitral Valve Repair versus Replacement for Severe Ischemic Mitral Regurgitation. Michael Acker, MD For the CTSN Investigators AHA November 2013

Confidence Intervals for the Difference Between Two Means

Financial Risk Forecasting Chapter 8 Backtesting and stresstesting

Premium calculation. summer semester 2013/2014. Technical University of Ostrava Faculty of Economics department of Finance

1 Nonparametric Statistics

6.2 Permutations continued

Transcription:

Paper PO16 UNTIED WORST-RANK SCORE ANALYSIS William F. McCarthy, Maryland Medical Research Institute, Baltime, MD Nan Guo, Maryland Medical Research Institute, Baltime, MD ABSTRACT When the non-fatal outcome measures are missing completely at random, then the analysis of the non-missing data is unbiased (Rubin, 1976; Little, 1976; and Little and Rubin, 1987). This implies that a subset of measurements actually observed provides an unbiased description of the treatment effect in the entire population. When the nonfatal outcome measures are infmatively missing and an analysis is based only on the subset of measurements actually observed, the description of the treatment effect may be biased (Lachin, 1999). This paper describes a method f an unbiased description of treatment effect when the non-fatal outcome measures are infmatively missing. In addition, we present a SAS Program to calculate the power of the untied wst-rank sce analysis and a SAS Program to perfm the untied wst-rank sce analysis. KEY WORD missing data; non-fatal outcomes; unbiased description of efficacy; SAS Program INTRODUCTION In some randomized clinical trials, patients are scheduled to undergo some assessment of a non-fatal outcome at a fixed time ( times) after the initiation of treatment. Often, these follow-up assessments may be missing f some patients because a disease-related event occurred pri to the time of the follow-up assessment. In such a situation, these follow-up assessments are infmatively missing because the disease-related event and the non-fatal outcome both indicate progression of the underlying disease. F example, a study of congestive heart failure may schedule patients to undergo exercise testing at 12 weeks, but this measurement may be missing f those patients who died of heart disease pri to the exercise testing. When the non-fatal outcome measures are missing completely at random, then the analysis of the non-missing data is unbiased (Rubin, 1976; Little, 1976; and Little and Rubin, 1987). This implies that a subset of measurements actually observed provides an unbiased description of the treatment effect in the entire population. When the non-fatal outcome measures are infmatively missing and an analysis is based only on the subset of measurements actually observed, the description of the treatment effect may be biased (Lachin, 1999). PROCEDURE FOR THE UNTIED WORST-RANK SCORE ANALYSIS Let i denote the study group (1=placebo, 2= drug). Let j denote a patient (1,2,,n i ), where n i is the number of patients in group i. Let * ij denote an indicat variable, which indicates whether the infmative event (death) occurs in the ijth patient pri to the end of the study (1= Yes, 0 = No). * ij = I(t ij T); T is the fixed time of the follow-up measurement; and t ij denotes the survival time of the ijth patient. All patients will receive a value f t ij (t ij T), regardless of their value f * ij. Let x ij denote the observed primary outcome measure f the ijth patient. If a higher value of x indicates a better outcome, then use the approach outlined below (When a higher value of x indicates a wse outcome, use the approach outlined in Appendix A). Patients who die will be ranked on their time to death (t ij ), and will receive a rank sce cresponding to a value that is wse (lower) than any actually observed in the surviving patient population. These imputed ranks will reflect the relative dering of the event times, with the shtest time to death ranked wst (lowest). Surviving patients will be

ranked by their x ij with those having the highest value of x ij receiving the best (highest) ranking. The data structure will look as follows: t min,, t max, x min,, x max. F the ijth patient who has died pri to the follow-up measurement, use the value -9999 f x ij. -9999 is used f several reasons: 1) to identify those patients who have no follow-up measurement, therefe no valid x ij and 2) to allow f the calculation of Q ij ; if the standard missing value of. is used, Q ij cannot be calculated. -9999 is a value far removed from any valid value expected f x ij. Let Q ij denote the value to use in the rank analysis f the ijth patient. Q ij = (1 - * ij )x ij + * ij (0 + t ij ) [1] where 0 is a negative constant such that (0 + T) < >. > is equal to the wst possible valid value f x ij ; (0 + T) < > allows one to distinguish surviving patients from those who have died; thus set 0 to - 9999. A rank analysis based on the {Q ij } provides an unbiased test of the joint null hypothesis [2] against the restricted alternative hypothesis [3]; Lachin, 1999. H 0 : [G 1 (x) = G 2 (x) and K 1 (t) = K 2 (t)] (0 < t T) [2] H 1 : [G 1 (x) G 2 (x) and K 1 (t) K 2 (t)] [3] [G 1 (x) G 2 (x) and K 1 (t) = K 2 (t)] [G 1 (x) = G 2 (x) and K 1 (t) K 2 (t)] G i (x) is the cumulative probability distribution of the observable values of x f all event-free members of the ith group observed at time T; i.e., G i (x) = Pr(x ij x t > T). K i (t) is the cumulative distribution of the infmative event times t in the ith group. The notation G 1 (x) G 2 (x) indicates that G 1 (x) is shifted to the left of G 2 (x); i.e., the observable values in group 1 (placebo) tend to be less than those of group 2 (drug). This indicates that there is a difference in fav of group 2 since higher values of x are better. The notation K 1 (x) K 2 (x) indicates that K 1 (x) is shifted to the left of K 2 (x); i.e., the infmative event times in group 1 (placebo) tend to be less than those of group 2 (drug). This indicates that there is a difference in fav of group 2 since higher values of t are better. Test the joint null hypothesis [2] that the two study groups do not differ with respect to the survival times and the distributions of the observable measurements. Use the restricted alternative hypothesis [3], which indicates that the drug group tends to have higher values of x and/ t while not having lower values f either. APPENDIX A When higher values of x are wse, patients who are infmatively censed early thus have higher rank sces than those infmatively censed late. The data structure will look as follows: x min,, x max, t max,, t min. F the ijth patient who has died pri to the follow-up measurement, use the value 9999 f x ij. 9999 is used f several reasons: 1) to identify those patients who have no follow-up measurement, therefe no valid x ij and 2) to allow f the calculation of Q ij ; if the standard missing value of. is used, Q ij cannot be calculated. 9999 is a value far removed from any valid value expected f x ij.

Let Q ij denote the value to use in the rank analysis f the ijth patient. Q ij = (1 - * ij )x ij + * ij (0 - t ij ) [1 ] where 0 is a positive constant such that (0 - T) > >. > is equal to the wst possible valid value f x ij ; (0 - T) > > allows one to distinguish surviving patients from those who have died; thus set 0 to 9999. A rank analysis based on the {Q ij } provides an unbiased test of the joint null hypothesis [2 ] against the restricted alternative hypothesis [3 ]; Lachin, 1999. H 0 : [G 1 (x) = G 2 (x) and K 1 (t) = K 2 (t)] (0 < t T) [2 ] H 1 : [G 1 (x) G 2 (x) and K 1 (t) K 2 (t)] [3 ] [G 1 (x) G 2 (x) and K 1 (t) = K 2 (t)] [G 1 (x) = G 2 (x) and K 1 (t) K 2 (t)] The notation G 1 (x) G 2 (x) indicates that G 1 (x) is shifted to the right of G 2 (x); i.e., the observable values in group 1 (placebo) tend to be me than those of group 2 (drug). This indicates that there is a difference in fav of group 2 since lower values of x are better. The notation K 1 (x) K 2 (x) indicates that K 1 (x) is shifted to the left of K 2 (x); i.e., the infmative event times in group 1 (placebo) tend to be less than those of group 2 (drug). This indicates that there is a difference in fav of group 2 since higher values of t are better. REFERENCES Rubin D (1976). Inference and missing data. Biometrika, 63: 581-592. Little RJA (1976). Comments on inference and missing data by Rubin DB. Biometrika, 63: 590-591. Little RJA and Rubin DB (1987). Statistical analysis with missing data. Wiley, New Yk, NY. Lachin JM (1999). Wst-rank sce analysis with infmatively missing observations in clinical trials. Controlled Clinical Trials 20:408-422. McMahon RP (1989). Analysis of nonfatal outcomes in clinical trials in which mtality is present. The Institute of Statistics, University of Nth Carolina at Chapel Hill, Mimeo series, No. 1867T. McMahon RP and Harrell FE (2000). Power calculation f clinical trials when the outcome is a composite ranking of survival and a nonfatal outcome. Controlled Clinical Trials 21:305-312. POWER CALCULATION FOR UNTIED WORST-RANK SCORE ANALYSIS /* PROGRAM TO COMPUTE THE POWER OF AN UNTIED WORST-RANK SCORE ANALYSIS OUTCOME IS A COMPOSITE RANKING OF SURVIVAL AND A NONFATAL OUTCOME [Lachin JM (1999). Wst-rank sce analysis with infmatively missing observations in clinical trials. Controlled Clinical trials 20: 408-422] USES THE CONSERVATIVE ASSUMPTION THAT THE TREATMENT HAS A BENEFICIAL EFFECT ON THE NONFATAL OUTCOME BUT NO EFFECT ON MORTALITY */ ASSUMES A NORMALLY DISTRIBUTED NONFATAL OUTCOME **** Power Calculation based on: McMahon and Harrell (2000). Power calculation f clinical trials when the outcome is a composite ranking of survival and a nonfatal outcome. Controlled Clinical Trials 21:305-312; **** WRITTEN BY WF MCCARTHY 10/14/03;

*$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$; *$ calpha: $; *$ critical values f 2-sided alphas $; *$ alpha=0.05 calpha=1.96 $; *$ alpha=0.01 calpha=2.58 $; *$ alpha=0.001 calpha=3.29 $; *$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$; *$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$; *$ calpha: $; *$ critical values f 1-sided alphas $; *$ alpha=0.05 calpha=1.65 $; *$ alpha=0.01 calpha=2.33 $; *$ alpha=0.001 calpha=3.09 $; *$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$; *# YOU INPUT THE FOLLOWING: #; *# arm_n: the number of patients in each study arm (requires an equal sample size in each study arm) #; *# alpha: type I err probability. E.g., 5 % = 0.05 #; *# calpha: select appropriate critical value from above tables. E.g., if alpha=0.05 (2-sided) select calpha=1.96 #; *# survival: the survival probability of the patients in each study arm at the end of the study, #; *# the survival probability is required to be the same in both study arms. E.g., 90 % =0.90 #; *# effect: the absolute treatment effect. E.g., 6 % =0.06 #; *# sigma: the common standard deviation. E.g., 10 % = 0.10 #; *# #; *# NOTE: x = the value of the nonfatal outcome and t = the value of the survival time #; *# THE PROGRAM COMPUTES THE FOLLOWING: #; *# ste: the standardized treatment effect #; *# py12: Pr(x1i > x2j t1i and t2j >= T), the conditional probability that patient i #; *# (given treatment 1)has a "better" nonfatal outcome than patient j (given treatment 2), #; *# given both patients have survived to time T #; *# py112: Pr(x1i > x2j and x1i' > x2j all of t1i, t1i', and t2j >= T), i' not equal to i #; *# py122 : Pr(x1i > x2j and x1i > x2j' all of t1i, and t2j, t2j' >= T), j' not equal to j #; *# pr12: Pr(r1i > r2j), the probability that patient i (given treatment 1) has a #; *# "better" rank than patient j (given treatment 2) #; *# vpr12: the variance of pr12 #; data temp; input arm_n alpha calpha survival effect sigma; cards; 75 0.05 1.96 0.90 0.06 0.10 ; data temp; set temp; ste=effect/(sigma*sqrt(2)); py12=0.50805+0.35359*ste; py112=-0.27134+1.19088*py12;

py122=py112; pr12=((0.5)*(1-(survival)**2))+((py12)*(survival)**2); vpr12=((2/3)*(1-(survival)**3)+(py112+py122)*(survival)**3-2*((0.5)*(1-(survival)**2)+(py12)*(survival)**2)**2)/(arm_n); power= 1-probnm((calpha)+((0.5-pr12)/sqrt(vpr12))); title "Power of the Untied Wst-Rank Sce Analysis"; proc print data=temp noobs; var arm_n alpha calpha survival effect sigma ste py12 py112 vpr12 power; Power of the Untied Wst-Rank Sce Analysis arm_n alpha calpha survival effect sigma ste py12 py112 vpr12 power 75 0.05 1.96 0.9 0.06 0.1 0.42426 0.65807 0.51234.001850707 0.84522 **** Example SAS Program f the Implementation of the Procedure f the Untied Wst-Rank Sce Analysis; **** group=1 (placebo), group=2 (Drug A); **** x is delta EF (%) -9999; **** t is survival time (months); **** d is indicat variable f death (1=yes 0=no); **** z is the value to use in the rank analysis; data test; input patient group x t d; cards; 1 1-18 6.0 0 2 1 25 6.0 0 3 1 35 6.0 0 4 1 43 6.0 0 5 1 55 6.0 0 6 1-9999 2.5 1 7 1-9999 4.2 1 8 1-9999 5.0 1 9 1-9999 5.3 1 1 2-15 6.0 0 2 2 55 6.0 0 3 2 58 6.0 0 4 2 62 6.0 0 5 2 63 6.0 0 6 2 67 6.0 0 7 2 70 6.0 0 8 2-9999 4.7 1 9 2-9999 5.8 1 ;

data test; set test; z = (1-d)*x + d*(-9999 + t); proc st; by group z; proc npar1way wilcoxon; class group; var z; **** Ranking of z in each Group; Obs patient group x t d z 1 6 1-9999 2.5 1-9996.5 2 7 1-9999 4.2 1-9994.8 3 8 1-9999 5.0 1-9994.0 4 9 1-9999 5.3 1-9993.7 5 1 1-18 6.0 0-18.0 6 2 1 25 6.0 0 25.0 7 3 1 35 6.0 0 35.0 8 4 1 43 6.0 0 43.0 9 5 1 55 6.0 0 55.0 10 8 2-9999 4.7 1-9994.3 11 9 2-9999 5.8 1-9993.2 12 1 2-15 6.0 0-15.0 13 2 2 55 6.0 0 55.0 14 3 2 58 6.0 0 58.0 15 4 2 62 6.0 0 62.0 16 5 2 63 6.0 0 63.0 17 6 2 67 6.0 0 67.0 18 7 2 70 6.0 0 70.0

The NPAR1WAY Procedure Wilcoxon Sces (Rank Sums) f Variable z Classified by Variable group Sum of Expected Std Dev Mean group N Sces Under H0 Under H0 Sce ƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒ 1 9 61.50 85.50 11.318907 6.833333 2 9 109.50 85.50 11.318907 12.166667 Average sces were used f ties. Wilcoxon Two-Sample Test Statistic 61.5000 Nmal Approximation Z -2.0762 One-Sided Pr < Z 0.0189 Two-Sided Pr > Z 0.0379 t Approximation One-Sided Pr < Z 0.0267 Two-Sided Pr > Z 0.0534 Z includes a continuity crection of 0.5. We will use the z value and two-sided nmal approximation p-value generated by the NPAR1WAY Procedure f the moniting of the primary study outcome. CONTACT INFORMATION William F. McCarthy Principal Statistician Direct of Clinical Trial Statistics and SAS Programming Maryland Medical Research Institute 600 Wyndhurst Avenue Baltime, Maryland 21210-2425 (410) 435-4200 www.mmri.g wmccarthy@mmri.g Nan Guo Seni Lead SAS Programmer SAS Certified Advanced Programmer f SAS9 Maryland Medical Research Institute 600 Wyndhurst Avenue Baltime, Maryland 21210-2425 (410) 435-4200 www.mmri.g nguo@mmri.g SAS and all other SAS Institute Inc. product service names are registered trademarks trademarks of SAS Institute Inc. in the USA and other countries. indicates USA registration. Other brand and product names are trademarks of their respective companies.