Influence of missing data on analysis and power calculation in bioequivalence studies

Similar documents
Bioavailability / Bioequivalence

How to get accurate sample size and power with nquery Advisor R

DAIC - Advanced Experimental Design in Clinical Research

Statistics and Pharmacokinetics in Clinical Pharmacology Studies

THE LOG TRANSFORMATION IS SPECIAL

BIOAVAILABILITY & BIOEQUIVALENCE TRIALS

Statistics 104: Section 6!

A MULTIVARIATE TEST FOR SIMILARITY OF TWO DISSOLUTION PROFILES

A LOGNORMAL MODEL FOR INSURANCE CLAIMS DATA

General Method: Difference of Means. 3. Calculate df: either Welch-Satterthwaite formula or simpler df = min(n 1, n 2 ) 1.

Two-Sample T-Tests Allowing Unequal Variance (Enter Difference)

Dongfeng Li. Autumn 2010

Randomization Based Confidence Intervals For Cross Over and Replicate Designs and for the Analysis of Covariance

A Multivariate Test For Similarity of Two Dissolution Profiles

Introduction to General and Generalized Linear Models

Generating Randomization Schedules Using SAS Programming Chunqin Deng and Julia Graz, PPD, Inc., Research Triangle Park, North Carolina

Two-Sample T-Tests Assuming Equal Variance (Enter Means)

Simple Linear Regression Inference

Lecture 8. Confidence intervals and the central limit theorem

Joint Exam 1/P Sample Exam 1

Clinical Study Synopsis for Public Disclosure

How To Check For Differences In The One Way Anova

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4)

The Variability of P-Values. Summary

Least Squares Estimation

Final Exam Practice Problem Answers

Math 151. Rumbos Spring Solutions to Assignment #22

SAS Software to Fit the Generalized Linear Model

Chicago Booth BUSINESS STATISTICS Final Exam Fall 2011

COMPLETE MARKETS DO NOT ALLOW FREE CASH FLOW STREAMS

START Selected Topics in Assurance

3.4 Statistical inference for 2 populations based on two samples

Module 2 Probability and Statistics

Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics

Overview of Dissolution for BA/BE

GLMs: Gompertz s Law. GLMs in R. Gompertz s famous graduation formula is. or log µ x is linear in age, x,

HYPOTHESIS TESTING: POWER OF THE TEST

UNDERSTANDING THE INDEPENDENT-SAMPLES t TEST

Chapter 7 Section 1 Homework Set A

PROPERTIES OF THE SAMPLE CORRELATION OF THE BIVARIATE LOGNORMAL DISTRIBUTION

Chapter 7 Notes - Inference for Single Samples. You know already for a large sample, you can invoke the CLT so:

Generating Random Numbers Variance Reduction Quasi-Monte Carlo. Simulation Methods. Leonid Kogan. MIT, Sloan , Fall 2010

I L L I N O I S UNIVERSITY OF ILLINOIS AT URBANA-CHAMPAIGN

Geostatistics Exploratory Analysis

Predict the Popularity of YouTube Videos Using Early View Data

Why Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization. Learning Goals. GENOME 560, Spring 2012

A Model of Optimum Tariff in Vehicle Fleet Insurance

1. Then f has a relative maximum at x = c if f(c) f(x) for all values of x in some

IEOR 6711: Stochastic Models I Fall 2012, Professor Whitt, Tuesday, September 11 Normal Approximations and the Central Limit Theorem

2 Sample t-test (unequal sample sizes and unequal variances)

Handling missing data in Stata a whirlwind tour

Content Distribution Scheme for Efficient and Interactive Video Streaming Using Cloud

A Primer on Mathematical Statistics and Univariate Distributions; The Normal Distribution; The GLM with the Normal Distribution

Applied Statistics. J. Blanchet and J. Wadsworth. Institute of Mathematics, Analysis, and Applications EPF Lausanne

Sensitivity Analysis in Multiple Imputation for Missing Data

東 海 大 學 資 訊 工 程 研 究 所 碩 士 論 文

Introduction to proc glm

Clinical Study Synopsis

Using PROC MIXED in Hierarchical Linear Models: Examples from two- and three-level school-effect analysis, and meta-analysis research

Overview of Monte Carlo Simulation, Probability Review and Introduction to Matlab

Hypothesis Testing for Beginners

4. Continuous Random Variables, the Pareto and Normal Distributions

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

Analysis of Variance. MINITAB User s Guide 2 3-1

The Relationship Between Rodent Offspring Blood Lead Levels and Maternal Diet

E(y i ) = x T i β. yield of the refined product as a percentage of crude specific gravity vapour pressure ASTM 10% point ASTM end point in degrees F

University of Chicago Graduate School of Business. Business 41000: Business Statistics Solution Key

Statistical Data analysis With Excel For HSMG.632 students

Elements of statistics (MATH0487-1)

Sample Size Calculation for Longitudinal Studies

Normal Distribution. Definition A continuous random variable has a normal distribution if its probability density. f ( y ) = 1.

2013 MBA Jump Start Program. Statistics Module Part 3

Unit 26 Estimation with Confidence Intervals

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...

The sample space for a pair of die rolls is the set. The sample space for a random number between 0 and 1 is the interval [0, 1].

Example: Credit card default, we may be more interested in predicting the probabilty of a default than classifying individuals as default or not.

UNDERSTANDING THE DEPENDENT-SAMPLES t TEST

xtmixed & denominator degrees of freedom: myth or magic

THE NUMBER OF GRAPHS AND A RANDOM GRAPH WITH A GIVEN DEGREE SEQUENCE. Alexander Barvinok

Crossover - A search algorithm and GUI for cross-over designs. Kornelius Rohmeyer

Practice Problems for Homework #6. Normal distribution and Central Limit Theorem.

Notes on Applied Linear Regression

Prediction for Multilevel Models

Hypothesis testing - Steps

Challenges of Cloud Scale Natural Language Processing

HANDOUT #2 - TYPES OF STATISTICAL STUDIES

The Probit Link Function in Generalized Linear Models for Data Mining Applications

Non-Inferiority Tests for Two Means using Differences

SOCIETY OF ACTUARIES/CASUALTY ACTUARIAL SOCIETY EXAM C CONSTRUCTION AND EVALUATION OF ACTUARIAL MODELS EXAM C SAMPLE QUESTIONS

Exploratory Data Analysis

PATTERN MIXTURE MODELS FOR MISSING DATA. Mike Kenward. London School of Hygiene and Tropical Medicine. Talk at the University of Turku,

MISSING DATA TECHNIQUES WITH SAS. IDRE Statistical Consulting Group

SAS Syntax and Output for Data Manipulation:

Lecture 11: Confidence intervals and model comparison for linear regression; analysis of variance


Credit Risk Models: An Overview

F nest. Monte Carlo and Bootstrap using Stata. Financial Intermediation Network of European Studies

Multiple Linear Regression

9.07 Introduction to Statistical Methods Homework 4. Name:

Transcription:

Influence of missing data on analysis and power calculation in bioequivalence studies Henrike Häbel May 4 th,2011 1/33 Influence of missing data on analysis and power calculation in bioequivalence studies Henrike Häbel Ulm University in cooperation with Boehringer Ingelheim Pharma GmbH & Co. KG May 4 th, 2011

Influence of missing data on analysis and power calculation in bioequivalence studies Henrike Häbel May 4 th,2011 2/33 Outline 1 Motivation 2 Model 3 Simulation 4 SAS Procedures and evaluation 5 Conclusion

Influence of missing data on analysis and power calculation in bioequivalence studies Henrike Häbel May 4 th,2011 3/33 Outline 1 Motivation 2 Model 3 Simulation 4 SAS Procedures and evaluation 5 Conclusion

Influence of missing data on analysis and power calculation in bioequivalence studies Henrike Häbel May 4 th,2011 4/33 Bioequivalence studies (BE studies) Test if test- and reference formulations differ marginally with respect to pharmacokinetic (PK) characteristics Endpoints: PK-parameters Primary: AUC and C max Problem concerning analysis and power calculation: At how many missing values should further subjects be recruited?

Influence of missing data on analysis and power calculation in bioequivalence studies Henrike Häbel May 4 th,2011 5/33 Typical design of BE studies: 2x2-Crossover Test- (T) and reference (R) formulation are given in two periods N subjetcs are allocated randomly evenly to one of the following sequences: RT and TR Treatment according to sequence and period

Influence of missing data on analysis and power calculation in bioequivalence studies Henrike Häbel May 4 th,2011 6/33 Outline 1 Motivation 2 Model 3 Simulation 4 SAS Procedures and evaluation 5 Conclusion

Model (on log-scale) Y ijk = S ik + P j + F jk + C k + e ijk Y ijk S ik P j F jk C k e ijk logarithm of response (PK parameter) measured on subject i in sequence k in period j, i = 1,...,N, j=1, 2, k = 1, 2 random i-th subject effect in sequence k, N(0,τ 2 )i.i.d. fixed effect in period j fixed treatment effect in the k-th sequence in period j fixed effect in the k-th sequence residual (random) error associated with the i-th subject in sequence k in period j, N(0,σ 2 )i.i.d. Influence of missing data on analysis and power calculation in bioequivalence studies Henrike Häbel May 4 th,2011 7/33

Influence of missing data on analysis and power calculation in bioequivalence studies Henrike Häbel May 4 th,2011 8/33 Why random subject effect? EMA: All effects should be considered fixed Subjects with one missing value excluded from analysis If applicable, further subjects have to be recruited! FDA: All avaible data should be included in the analysis Properties of REML can be used (e.g. in SAS with PROC MIXED) When should further subjects be recruited?

Influence of missing data on analysis and power calculation in bioequivalence studies Henrike Häbel May 4 th,2011 9/33 Model assumptions Number of subjects: N = 24 Y ijk = S ik + P j + F jk + C k + e ijk S ik : subject S ik N(0,τ 2 )withτ 2 = 0.25 P j : period P 1 =0, P 2 = 0.2 F jk : treatment F 11 = F 22 =0, F 12 = F 21 := F 2 = 0.1 C k : sequence C 1 = C 2 = 0 e ijk : residual error e ijk N(0,σ 2 )withσ 2 = 0.06 Modeling of response per sequence and period Sequence Period 1 Period 2 RT Y i11 = S i1 + e i11 Y i21 = S i1 + 0.2 + 0.1 + e i21 TR Y i12 = S i2 + 0.1 + e i12 Y i22 = S i2 + 0.2 + e i22

Influence of missing data on analysis and power calculation in bioequivalence studies Henrike Häbel May 4 th,2011 10/33 Outline 1 Motivation 2 Model 3 Simulation 4 SAS Procedures and evaluation 5 Conclusion

Influence of missing data on analysis and power calculation in bioequivalence studies Henrike Häbel May 4 th,2011 11/33 Randomly missing values With N = 24 subjects: Number of missing values: 1, 2, 3, 4, 8, 12, 16, 20 Three alternatives to distribute missing values: - for both treatments and in both periods (case I) - only in period 2 (case II) - only for treatment R (case III)

Influence of missing data on analysis and power calculation in bioequivalence studies Henrike Häbel May 4 th,2011 12/33 Pseudo code DO 1 TO 1000 END 1. Simulate complete dataset for 24 subjects 2. Analyse complete dataset with PROC MIXED 3. FOR i = 1, 2, 3, 4, 8, 12, 16, 20 DO END PROC SURVEYSELECT to simulate incomplete datasets with i missing values for case I - III 4. Analyse incomplete dataset with PROC MIXED

Influence of missing data on analysis and power calculation in bioequivalence studies Henrike Häbel May 4 th,2011 13/33 One missing value for both treatments (case I)

Influence of missing data on analysis and power calculation in bioequivalence studies Henrike Häbel May 4 th,2011 14/33 Two missing values for both treatments (case I)

Influence of missing data on analysis and power calculation in bioequivalence studies Henrike Häbel May 4 th,2011 15/33 Three or more missing values for both treatments (case I)

Influence of missing data on analysis and power calculation in bioequivalence studies Henrike Häbel May 4 th,2011 16/33 Missing values in period 2 (case II) or for treatment R (case III)

Influence of missing data on analysis and power calculation in bioequivalence studies Henrike Häbel May 4 th,2011 17/33 PROC SURVEYSELECT

Influence of missing data on analysis and power calculation in bioequivalence studies Henrike Häbel May 4 th,2011 18/33 Outline 1 Motivation 2 Model 3 Simulation 4 SAS Procedures and evaluation 5 Conclusion

Influence of missing data on analysis and power calculation in bioequivalence studies Henrike Häbel May 4 th,2011 19/33 Is there a difference between the formulations? Test if: the 90% confidence interval, which covers the treatment quotient µ T µ R, lies within the equivalence domain [0.80, 1.25] the 90% confidence interval, which covers the treatment difference µ T µ R, lies within the equivalence domain [ 0.223, 0.223] (log-scale)

Influence of missing data on analysis and power calculation in bioequivalence studies Henrike Häbel May 4 th,2011 20/33 PROC MIXED Theory: Mixed model Y = X β + Zγ + fix random

Influence of missing data on analysis and power calculation in bioequivalence studies Henrike Häbel May 4 th,2011 21/33 Value of interest: gcv Among others considered: gcv := 100 exp(mse) 1 derived from CV of a log-normally distributed random variable X CV := Var(X ) E(X )

Influence of missing data on analysis and power calculation in bioequivalence studies Henrike Häbel May 4 th,2011 22/33 Evaluation with PROC BOXPLOT σ 2 =0.06 gcv = 25, missing values for treatment R (case III)

Influence of missing data on analysis and power calculation in bioequivalence studies Henrike Häbel May 4 th,2011 23/33 Summary statistics with PROC MEANS σ 2 =0.06 gcv = 25, missing values for treatment R (case III) Missing values Number of simulations Min P25 Mean P75 Max 10 1000 13.765 22.001 24.569 26.828 37.540 11 1000 13.887 22.001 24.594 26.917 40.143 12 1000 13.887 21.820 24.536 27.045 39.052 13 1000 14.258 21.662 24.561 27.138 39.950 14 1000 15.301 19.552 24.317 28.520 48.217 18 1000 17.589 19.137 24.417 29.182 53.245 12 1000 14.040 16.892 24.018 29.659 68.274 16 1000 10.599 13.904 23.498 30.804 68.017 20 1000 10.0001 10.011 20.338 36.371 87.275

Influence of missing data on analysis and power calculation in bioequivalence studies Henrike Häbel May 4 th,2011 24/33 Summary statistics graphic σ 2 =0.06 gcv = 25, missing values for treatment R (case III)

Influence of missing data on analysis and power calculation in bioequivalence studies Henrike Häbel May 4 th,2011 25/33 Power Goal: The probability to reject the hypothesis that the formulations are different when there is no relevent difference should lie e.g. between 80% and90% This probability is called Power gcv is a decisive factor here

Influence of missing data on analysis and power calculation in bioequivalence studies Henrike Häbel May 4 th,2011 26/33 PROC POWER

Influence of missing data on analysis and power calculation in bioequivalence studies Henrike Häbel May 4 th,2011 27/33 Power graphic: Mean of simulations σ 2 =0.06 gcv = 25, missing values for treatment R (case III)

Influence of missing data on analysis and power calculation in bioequivalence studies Henrike Häbel May 4 th,2011 28/33 Power graphic: P75 of simulations σ 2 =0.06 gcv = 25, missing values for treatment R (case III)

Influence of missing data on analysis and power calculation in bioequivalence studies Henrike Häbel May 4 th,2011 29/33 Outline 1 Motivation 2 Model 3 Simulation 4 SAS Procedures and evaluation 5 Conclusion

Influence of missing data on analysis and power calculation in bioequivalence studies Henrike Häbel May 4 th,2011 30/33 When should further subjects be recruited? For N = 24 subjects and 1000 simulations each (per number of missing values and case): In general, no further subjects have to be recruited Based on the observations in the 75% percentile: - at 10 to 12 missing values further subjects should be recruited - especially if the values are missing mainly for both treatments and in both periods

Influence of missing data on analysis and power calculation in bioequivalence studies Henrike Häbel May 4 th,2011 31/33 Outlook: Further questions cohorts and other covariates Variability of the rest variance σ 2

Influence of missing data on analysis and power calculation in bioequivalence studies Henrike Häbel May 4 th,2011 32/33 References [1] SAS/Stat 9.2 User s Guide: The Mixed Procedure (Book Excerpt). SAS Publishing, 2008 [2] S.C. Chow, J.P. Liu. Design and Analysis of Bioavailability and Bioequivalence Studies. Marcel Dekker, Inc., New York, 1992 [3] S. Patterson, B. Jones. Bioequivalence and Statistics in Clinical Pharmacology. Chapman & Hall / CRC, Boca Raton, 2006 [4] G.A. Milliken, D.E. Johnson. Analysis of Messy Data Volume I: Designed Experiments. Wadsworth, Inc., Belmont, 1984

Influence of missing data on analysis and power calculation in bioequivalence studies Henrike Häbel May 4 th,2011 33/33 Thank you for your attention! Special thanks to Julia Habeck and Michaela Mattheus from Boehringer Ingelheim Pharma GmbH & Co. KG