How to get accurate sample size and power with nquery Advisor R



Similar documents
Tests for Two Survival Curves Using Cox s Proportional Hazards Model

Two-Sample T-Tests Assuming Equal Variance (Enter Means)

Two-Sample T-Tests Allowing Unequal Variance (Enter Difference)

Bioavailability / Bioequivalence

Sample Size Planning, Calculation, and Justification

Non-Inferiority Tests for Two Means using Differences

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

BA 275 Review Problems - Week 5 (10/23/06-10/27/06) CD Lessons: 48, 49, 50, 51, 52 Textbook: pp

BA 275 Review Problems - Week 6 (10/30/06-11/3/06) CD Lessons: 53, 54, 55, 56 Textbook: pp , ,

Tests for Two Proportions

Confidence Intervals for One Standard Deviation Using Standard Deviation

Introduction. Hypothesis Testing. Hypothesis Testing. Significance Testing

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...

C. The null hypothesis is not rejected when the alternative hypothesis is true. A. population parameters.

3.4 Statistical inference for 2 populations based on two samples

HYPOTHESIS TESTING: POWER OF THE TEST

Confidence Intervals for Cp

Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1)

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...

Design and Analysis of Phase III Clinical Trials

Data Analysis, Research Study Design and the IRB

Parametric and non-parametric statistical methods for the life sciences - Session I

Non-Inferiority Tests for One Mean

Point Biserial Correlation Tests

DAIC - Advanced Experimental Design in Clinical Research

Additional sources Compilation of sources:

Tips for surviving the analysis of survival data. Philip Twumasi-Ankrah, PhD

Two Correlated Proportions (McNemar Test)

Independent t- Test (Comparing Two Means)

Chapter 2 Probability Topics SPSS T tests

Influence of missing data on analysis and power calculation in bioequivalence studies

2 Precision-based sample size calculations

Non-Inferiority Tests for Two Proportions

II. DISTRIBUTIONS distribution normal distribution. standard scores

Biostatistics: Types of Data Analysis

Confidence Intervals for Cpk

Chapter 5 Analysis of variance SPSS Analysis of variance

Methods of Sample Size Calculation for Clinical Trials. Michael Tracy

An Introduction to Statistics Course (ECOE 1302) Spring Semester 2011 Chapter 10- TWO-SAMPLE TESTS

individualdifferences

Pearson's Correlation Tests

If several different trials are mentioned in one publication, the data of each should be extracted in a separate data extraction form.

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

11. Analysis of Case-control Studies Logistic Regression

Chapter 8 Hypothesis Testing Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing

Laboratory 3 Type I, II Error, Sample Size, Statistical Power

Tests for One Proportion

Introduction to Analysis of Variance (ANOVA) Limitations of the t-test

Chi-square test Fisher s Exact test

How To Check For Differences In The One Way Anova

UNDERSTANDING THE DEPENDENT-SAMPLES t TEST

Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means

UNDERSTANDING THE TWO-WAY ANOVA

Nonparametric Statistics

Two-sample t-tests. - Independent samples - Pooled standard devation - The equal variance assumption

Confidence Intervals for the Difference Between Two Means

Section 13, Part 1 ANOVA. Analysis Of Variance

Practice problems for Homework 12 - confidence intervals and hypothesis testing. Open the Homework Assignment 12 and solve the problems.

Study Design Sample Size Calculation & Power Analysis. RCMAR/CHIME April 21, 2014 Honghu Liu, PhD Professor University of California Los Angeles

A MULTIVARIATE TEST FOR SIMILARITY OF TWO DISSOLUTION PROFILES

General Method: Difference of Means. 3. Calculate df: either Welch-Satterthwaite formula or simpler df = min(n 1, n 2 ) 1.

Section 12 Part 2. Chi-square test

CHAPTER 11 CHI-SQUARE AND F DISTRIBUTIONS

Ordinal Regression. Chapter

Simple Linear Regression Inference

Lesson Outline Outline

Chapter 7 Notes - Inference for Single Samples. You know already for a large sample, you can invoke the CLT so:

Principles of Hypothesis Testing for Public Health

Statistics and Pharmacokinetics in Clinical Pharmacology Studies

NCSS Statistical Software

Study Design and Statistical Analysis

Descriptive Statistics

Two Related Samples t Test

A Multivariate Test For Similarity of Two Dissolution Profiles

Randomized Block Analysis of Variance

THE LOG TRANSFORMATION IS SPECIAL

Online 12 - Sections 9.1 and 9.2-Doug Ensley

Likelihood Approaches for Trial Designs in Early Phase Oncology

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression

Solutions to Homework 10 Statistics 302 Professor Larget

BIOAVAILABILITY & BIOEQUIVALENCE TRIALS

Statistics. One-two sided test, Parametric and non-parametric test statistics: one group, two groups, and more than two groups samples

STAT 350 Practice Final Exam Solution (Spring 2015)

Bowerman, O'Connell, Aitken Schermer, & Adcock, Business Statistics in Practice, Canadian edition

Data Analysis Tools. Tools for Summarizing Data

PRACTICE PROBLEMS FOR BIOSTATISTICS

Confidence Intervals for Exponential Reliability

Two-sample hypothesis testing, II /16/2004

Research Methods & Experimental Design

Guide to Biostatistics

THE FIRST SET OF EXAMPLES USE SUMMARY DATA... EXAMPLE 7.2, PAGE 227 DESCRIBES A PROBLEM AND A HYPOTHESIS TEST IS PERFORMED IN EXAMPLE 7.

Statistics in Medicine Research Lecture Series CSMC Fall 2014

Mind on Statistics. Chapter 12

Statistics Review PSY379

Logit Models for Binary Data

SPSS Tests for Versions 9 to 13

The Dummy s Guide to Data Analysis Using SPSS

Part 2: Analysis of Relationship Between Two Variables

Interpretation of Somers D under four simple models

Transcription:

How to get accurate sample size and power with nquery Advisor R Brian Sullivan Statistical Solutions Ltd. ASA Meeting, Chicago, March 2007

Sample Size Two group t-test χ 2 -test Survival Analysis 2 2 Crossover Odds Ratio

What sample size do I need? Formulate the study Specify analysis parameters Specify effect size for test Compute sample size or power Sensitivity analysis Choose sample size

What sample size do I need? Formulate the study Specify analysis parameters Specify effect size for test Compute sample size or power Sensitivity analysis Choose sample size

What sample size do I need? Formulate the study Specify analysis parameters Specify effect size for test Compute sample size or power Sensitivity analysis Choose sample size

What sample size do I need? Formulate the study Specify analysis parameters Specify effect size for test Compute sample size or power Sensitivity analysis Choose sample size

What sample size do I need? Formulate the study Specify analysis parameters Specify effect size for test Compute sample size or power Sensitivity analysis Choose sample size

What sample size do I need? Formulate the study Specify analysis parameters Specify effect size for test Compute sample size or power Sensitivity analysis Choose sample size

Formulate the study Type of analysis being undertaken Clinical trial Laboratory study Survey research What is being measured? How is it being analyzed?

Formulate the study Type of analysis being undertaken Clinical trial Laboratory study Survey research What is being measured? How is it being analyzed?

Formulate the study Type of analysis being undertaken Clinical trial Laboratory study Survey research What is being measured? How is it being analyzed?

Specify analysis parameters What parameters are involved? Significance level Test type: one or two sided

Specify analysis parameters What parameters are involved? Significance level Test type: one or two sided

Specify effect size Use information from previous/similar studies What effect size would be important to detect?

Compute sample size or power Specify a value for sample size or power and perform the calculation solving for the other

Recalculate sample size for different scenarios about effect size Recalculate sample size for different levels of power Compare different sample sizes Plot sample size vs power

Choose sample size Based on the trade off between power attainable for the desired effect size and sample size feasibility, choose a sample size for the study

Two group t-test Study Design Does a new drug reduce anemia in elderly women after hip fracture? Two group, randomized, parallel, double-blind study Drug for two weeks, three times per week New drug vs placebo Equal n s

Two group t-test Study Design Does a new drug reduce anemia in elderly women after hip fracture? Two group, randomized, parallel, double-blind study Drug for two weeks, three times per week New drug vs placebo Equal n s

Outcome and Analysis Primary outcome: Mean change in hematocrit level (percentage of red blood cells in the blood) from pre-treatment to post-treatment Compare mean change in hematocrit level between two groups using two-group t-test Null Hypothesis: Mean change is the same in both groups

Outcome and Analysis Primary outcome: Mean change in hematocrit level (percentage of red blood cells in the blood) from pre-treatment to post-treatment Compare mean change in hematocrit level between two groups using two-group t-test Null Hypothesis: Mean change is the same in both groups

Outcome and Analysis Primary outcome: Mean change in hematocrit level (percentage of red blood cells in the blood) from pre-treatment to post-treatment Compare mean change in hematocrit level between two groups using two-group t-test Null Hypothesis: Mean change is the same in both groups

Calculation Parameters Significance level, α Test type: one or two sided, s Expected difference, d = µ 1 µ 2 Standard deviation, σ Effect size, δ = d/σ Power, 1 β Sample size, n

Previous Analysis There have been four previous studies from which data can be drawn. Study n Population Mean Hct 1 6 Elderly women with fracture 32.3% 2 32 Elderly women no fracture 33.5% 3,4 Change in placebo 0.0 3,4 Change in drug groups 2.7-4.7 Expected change: 0% in placebo group 2-2.4% in drug group

Previous Analysis There have been four previous studies from which data can be drawn. Study n Population Mean Hct 1 6 Elderly women with fracture 32.3% 2 32 Elderly women no fracture 33.5% 3,4 Change in placebo 0.0 3,4 Change in drug groups 2.7-4.7 Expected change: 0% in placebo group 2-2.4% in drug group

Previous Analysis From previous studies the standard deviation of hematocrit levels were between 3% and 6%. Here we need the standard deviation for the change in hematocrit level and there is less information on this. From previous studies it can be estimated to be between 1.5% to 2.5%. A reasonable estimate would be 2%.

Example Using the following details we can proceed with the calculation: Parameter Value Significance level.05 Test type 2 sided Expected difference 2-2.4 Standard deviation 2 Illustrated in nquery Advisor file: hematocrit.nqa.

Sensitivity Analysis Figure: Plot of Sample Size vs Power

Choose Sample Size A sample size of 20 in each group will have 92% power to detect a difference in means of 2.200 assuming that the common standard deviation is 2.000 using a two group t-test with a 0.050 two-sided significance level.

χ 2 -test Study Design Does a certain diet drug combination increase the likelihood of requiring valve surgery? Case-Control study Cases: patients who had valve surgery in defined two year period Controls: matched using age, gender and body-mass index

χ 2 -test Study Design Does a certain diet drug combination increase the likelihood of requiring valve surgery? Case-Control study Cases: patients who had valve surgery in defined two year period Controls: matched using age, gender and body-mass index

Outcome and Analysis Primary outcome: Proportion of cases and controls that have taken the diet drugs Compare proportions of cases and controls that have taken diet drugs using McNemar s χ 2 test for paired responses

Outcome and Analysis Primary outcome: Proportion of cases and controls that have taken the diet drugs Compare proportions of cases and controls that have taken diet drugs using McNemar s χ 2 test for paired responses

Calculation Parameters Significance level, α Test type: one or two sided, s Expected difference in proportions, δ = π 1 π 2 Proportion of discordant pairs, η = π 10 + π 01 Power, 1 β Sample size, n

Previous Analysis It is expected that about 5% of controls would have taken the diet drugs. The investigators are interested in detecting an increase to 10% among the cases with 90% power.

Previous Analysis A two by two table of proportions of cases and controls using the diet drugs is postulated as: Case yes Case no Row proportion Control yes 0.03 0.02 0.05 Control no 0.07 0.88 0.95 Column proportion 0.10 0.90 1.00

Example Using the following details we can proceed with the calculation: Parameter Value Significance level 0.05 Test type 2 sided Difference in proportions 0.05 Proportion of discordant pairs 0.09 Illustrated in nquery Advisor file: surgery.nqa.

Sensitivity Analysis Figure: Plot of Difference in Proportions vs Sample Size

Choose Sample Size A sample size of 342 pairs will have 90% power to detect a difference in proportions of 0.050 when the proportion of discordant pairs is expected to be 0.090 and the method of analysis is a McNemar s test of equality of paired proportions with a 0.050 two-sided significance level.

Survival Analysis Study Design Does TIPS surgery for bleeding esophageal varices prolong life over shunt surgery? Survival analysis log-rank test Randomization into two parallel groups All patients must be followed for a minimum of one year

Survival Analysis Study Design Does TIPS surgery for bleeding esophageal varices prolong life over shunt surgery? Survival analysis log-rank test Randomization into two parallel groups All patients must be followed for a minimum of one year

Outcome and Analysis Primary outcome: Time to death due to any cause Groups compared using survival analysis log-rank test Null Hypothesis: Time to death is the same in both groups

Outcome and Analysis Primary outcome: Time to death due to any cause Groups compared using survival analysis log-rank test Null Hypothesis: Time to death is the same in both groups

Outcome and Analysis Primary outcome: Time to death due to any cause Groups compared using survival analysis log-rank test Null Hypothesis: Time to death is the same in both groups

Calculation Parameters Significance level, α Test type: one or two sided, s Group proportions at time t, π 1, π 2 Hazard ratio, h = ln(π 1) ln(π 2 ) Power, 1 β Sample size, n

Previous Analysis Results from previous studies give an indication of the expected proportions. It is expected that 65% of patients will survive one year after shunt surgery. TIPS surgery would be considered to be markedly worse if only 45% of patients survived one year.

Example Using the following details we can proceed with the calculation: Parameter Value Significance level 0.05 Test type 2 sided Group 1 proportion 0.65 Group 2 proportion 0.45 Illustrated in nquery Advisor file: surgery2.nqa.

Sensitivity Analysis Figure: Plot of Hazard vs Sample Size

Choose Sample Size When the sample size in each group is 131, with a Total number of events required, E, of 110, a 0.050 level two-sided log-rank test for equality of survival curves will have 90% power to detect the difference between a Group 1 proportion π 1 at time t of 0.650 and a Group 2 proportion π 2 at time t of 0.450 (a constant hazard ratio of 0.539); this assumes no dropouts before time t.

2 2 Crossover Study Design Does a new drug have similar steady state blood levels to that of a standard drug? Two-way crossover design Two period, two treatment AB, BA New drug vs standard drug

2 2 Crossover Study Design Does a new drug have similar steady state blood levels to that of a standard drug? Two-way crossover design Two period, two treatment AB, BA New drug vs standard drug

Outcome and Analysis Primary outcome: Mean steady state blood level Interested to know will the mean steady state blood level for the new drug be within 15% of that of the standard Two group t-test (TOST) for equivalence in means Null Hypotheses: Difference upper level and Difference lower level

Outcome and Analysis Primary outcome: Mean steady state blood level Interested to know will the mean steady state blood level for the new drug be within 15% of that of the standard Two group t-test (TOST) for equivalence in means Null Hypotheses: Difference upper level and Difference lower level

Outcome and Analysis Primary outcome: Mean steady state blood level Interested to know will the mean steady state blood level for the new drug be within 15% of that of the standard Two group t-test (TOST) for equivalence in means Null Hypotheses: Difference upper level and Difference lower level

Outcome and Analysis Primary outcome: Mean steady state blood level Interested to know will the mean steady state blood level for the new drug be within 15% of that of the standard Two group t-test (TOST) for equivalence in means Null Hypotheses: Difference upper level and Difference lower level

Calculation Parameters Significance level (one sided), α Lower equivalence limit for µ T µ S, L Upper equivalence limit for µ T µ S, U Expected difference in means, d = µ T µ S Crossover ANOVA, MSE Standard deviation of differences, σ d Power, 1 β Sample size, n

Previous Analysis Previous study on standard drug indicated a mean steady state blood level of 16 Pilot study of two different doses for new drug showed standard deviation of about 2.8 for each The correlation between blood levels for the two periods was 0.55

Example Using the following details we can proceed with the calculation: Parameter Value Significance level 0.05 Lower limit -2.4 Upper limit 2.4 Expected difference 0.0 Standard deviation 2.8 Correlation 0.55 Illustrated in nquery Advisor file: blood.nqa.

Sensitivity Analysis Figure: Plot of Sample Size vs Power

Choose Sample Size When the sample size in each sequence group is 7, (a total sample size of 14) a crossover design will have 80% power to reject both the null hypothesis that the test mean minus the standard mean is below -2.400 and the null hypothesis that the test mean minus the standard mean is above 2.400 i.e., that the test and standard are not equivalent, in favor of the alternative hypothesis that the means of the two treatments are equivalent, assuming that the expected difference in means is 0.000, the Crossover ANOVA MSE is 1.878 (the Standard deviation of differences, σ d, is 2.656) and that each test is made at the 5.0% level.

Odds Ratio Study Design Does an educational program for expectant mothers reduce the risk of preterm birth? two group study, odds ratio cases vs controls cases receive education equal n s

Odds Ratio Study Design Does an educational program for expectant mothers reduce the risk of preterm birth? two group study, odds ratio cases vs controls cases receive education equal n s

Outcome and Analysis Primary outcome: Proportion of preterm births The educational program would be of interest if it reduced the rate of preterm births to produce and odds ratio of 0.5 How wide is the 95% confidence interval for the odds ratio likely to be if 500 women are assigned to each group?

Outcome and Analysis Primary outcome: Proportion of preterm births The educational program would be of interest if it reduced the rate of preterm births to produce and odds ratio of 0.5 How wide is the 95% confidence interval for the odds ratio likely to be if 500 women are assigned to each group?

Outcome and Analysis Primary outcome: Proportion of preterm births The educational program would be of interest if it reduced the rate of preterm births to produce and odds ratio of 0.5 How wide is the 95% confidence interval for the odds ratio likely to be if 500 women are assigned to each group?

Calculation Parameters Significance level, α Test type: one or two sided, s Group proportions, π 1, π 2 Odds ratio, ψ = π 2(1 π 1 ) π 1 (1 π 2 ) Log odds ratio, B = ln(ψ) Distance from B to limit, ω Sample size, n

Previous Analysis Previous studies of similar populations have found preterm birth rates between 7% and 9%. A reasonable estimate would be 8%.

Example Using the following details we can proceed with the calculation: Parameter Value Significance level 0.05 Test type 2 sided Control proportion 0.08 Odds ratio 0.5 n per group 500 Illustrated in nquery Advisor file: education.nqa.

Sensitivity Analysis Figure: Plot of Sample Size vs Distance from B to limit

Choose Sample Size When the sample size in each group is 500, the Control proportion, π 1, is 0.080, and the Education proportion, π 2, is 0.042, a two-sided 95.0% confidence interval for a ln(odds ratio) expected to be -0.693 will extend 0.545 from the observed ln(odds ratio) (corresponding to confidence limits of 0.290 and/or 0.862 for an odds ratio of 0.500).

Bibliography Elashoff J.D. nquery Advisor Version 6.01 User s Guide. (2005) Los Angeles, CA. Dixon, W.J., Massey, F.J. Introduction to Statistical Analysis. 4th Edition McGraw-Hill (1983) O Brien, R.G., Muller, K.E. Applied Analysis of Variance in Behavioral Science Marcel Dekker, New York (1983) pp. 297-344 Miettinen, O.S. "On the matched-pairs design in the case of all-or-none responses" Biometrics 24(1986) pp. 339-352 Freedman, L.S. "Tables of the number of patients required in clinical trials using the logrank test" Statistics in Medicine 1(1982) pp. 121-129 Collett, D. Modelling Survival Data in Medical Research Chapman and Hall (1994) Section 9.2 Chow, S.C, Liu, J.P. Design and Analysis of Bioavailability and Bioequivalence Studies Marcel Dekker, Inc. (1992) Schuirmann, D.J. "A comparison of the two one-sided tests procedure and the power approach for assessing the equivalence of average bioavailability" Journal of Pharmacokinetics and Biopharmaceutics 15(1987) pp. 657-680 Phillips, K.E. "Power of the two one-sided tests procedure in bioequivalence" Journal of Pharmacokinetics and Biopharmaceutics 18(1990) pp. 137-143 Owen, D.B. "A special case of a bivariate non-central t-distribution" Biometrika 52(1965) pp. 437-446 O Neill, R.T. "Sample size for estimation of the odds ratio in unmatched case-control studies" American Journal of Epidemiology 120(1984) pp. 145-153