Power Analysis & Sample Size Calculation:

Similar documents
Descriptive Statistics

CONTENTS OF DAY 2. II. Why Random Sampling is Important 9 A myth, an urban legend, and the real reason NOTES FOR SUMMER STATISTICS INSTITUTE COURSE

Statistics in Medicine Research Lecture Series CSMC Fall 2014

Study Guide for the Final Exam

Sample Size and Power in Clinical Trials

WISE Power Tutorial All Exercises

Introduction to Hypothesis Testing. Hypothesis Testing. Step 1: State the Hypotheses

Correlational Research

Basic research methods. Basic research methods. Question: BRM.2. Question: BRM.1

Chapter 8 Hypothesis Testing Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing

Statistics I for QBIC. Contents and Objectives. Chapters 1 7. Revised: August 2013

Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means

Survey Research: Choice of Instrument, Sample. Lynda Burton, ScD Johns Hopkins University

Sample Size Planning, Calculation, and Justification

SECOND M.B. AND SECOND VETERINARY M.B. EXAMINATIONS INTRODUCTION TO THE SCIENTIFIC BASIS OF MEDICINE EXAMINATION. Friday 14 March

Fairfield Public Schools

22. HYPOTHESIS TESTING

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression

Guide to Biostatistics

Lecture Notes Module 1

A POPULATION MEAN, CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

Projects Involving Statistics (& SPSS)

II. DISTRIBUTIONS distribution normal distribution. standard scores

UNDERSTANDING ANALYSIS OF COVARIANCE (ANCOVA)

Research Methods & Experimental Design

General Method: Difference of Means. 3. Calculate df: either Welch-Satterthwaite formula or simpler df = min(n 1, n 2 ) 1.

University of Maryland School of Medicine Master of Public Health Program. Evaluation of Public Health Competencies

Hypothesis testing - Steps

BA 275 Review Problems - Week 5 (10/23/06-10/27/06) CD Lessons: 48, 49, 50, 51, 52 Textbook: pp

Organizing Your Approach to a Data Analysis

Study Design and Statistical Analysis

Section 13, Part 1 ANOVA. Analysis Of Variance

ANOVA ANOVA. Two-Way ANOVA. One-Way ANOVA. When to use ANOVA ANOVA. Analysis of Variance. Chapter 16. A procedure for comparing more than two groups

Two-Sample T-Tests Allowing Unequal Variance (Enter Difference)

Statistics in Retail Finance. Chapter 2: Statistical models of default

2 Precision-based sample size calculations

Testing Hypotheses About Proportions

MONT 107N Understanding Randomness Solutions For Final Examination May 11, 2010

Introduction to Hypothesis Testing

HYPOTHESIS TESTING WITH SPSS:

Non-Parametric Tests (I)

Introduction to Statistics and Quantitative Research Methods

p ˆ (sample mean and sample

WHAT IS A JOURNAL CLUB?

Programme du parcours Clinical Epidemiology UMR 1. Methods in therapeutic evaluation A Dechartres/A Flahault

Unit 26 Estimation with Confidence Intervals

Understand the role that hypothesis testing plays in an improvement project. Know how to perform a two sample hypothesis test.

Data Mining Techniques Chapter 5: The Lure of Statistics: Data Mining Using Familiar Tools

IS 30 THE MAGIC NUMBER? ISSUES IN SAMPLE SIZE ESTIMATION

Introduction to. Hypothesis Testing CHAPTER LEARNING OBJECTIVES. 1 Identify the four steps of hypothesis testing.

Additional sources Compilation of sources:

research/scientific includes the following: statistical hypotheses: you have a null and alternative you accept one and reject the other

An Introduction to Statistics Course (ECOE 1302) Spring Semester 2011 Chapter 10- TWO-SAMPLE TESTS

Two-Sample T-Tests Assuming Equal Variance (Enter Means)

" Y. Notation and Equations for Regression Lecture 11/4. Notation:

INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA)

C. The null hypothesis is not rejected when the alternative hypothesis is true. A. population parameters.

Inclusion and Exclusion Criteria

R&M Solutons

Section 7.1. Introduction to Hypothesis Testing. Schrodinger s cat quantum mechanics thought experiment (1935)

Chapter 3. Sampling. Sampling Methods

Experimental Design. Power and Sample Size Determination. Proportions. Proportions. Confidence Interval for p. The Binomial Test

Statistics 2014 Scoring Guidelines

Analysis and Interpretation of Clinical Trials. How to conclude?

12: Analysis of Variance. Introduction

Department/Academic Unit: Public Health Sciences Degree Program: Biostatistics Collaborative Program

CHAPTER 13. Experimental Design and Analysis of Variance

HYPOTHESIS TESTING: POWER OF THE TEST

Tips for surviving the analysis of survival data. Philip Twumasi-Ankrah, PhD

BA 275 Review Problems - Week 6 (10/30/06-11/3/06) CD Lessons: 53, 54, 55, 56 Textbook: pp , ,

Chi Squared and Fisher's Exact Tests. Observed vs Expected Distributions

CORRELATIONAL ANALYSIS: PEARSON S r Purpose of correlational analysis The purpose of performing a correlational analysis: To discover whether there

MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS

Chapter Four. Data Analyses and Presentation of the Findings

Introduction to Statistics Used in Nursing Research

Normality Testing in Excel

SAMPLING & INFERENTIAL STATISTICS. Sampling is necessary to make inferences about a population.

Simple Regression Theory II 2010 Samuel L. Baker

Consider a study in which. How many subjects? The importance of sample size calculations. An insignificant effect: two possibilities.

Randomized trials versus observational studies

"Statistical methods are objective methods by which group trends are abstracted from observations on many separate individuals." 1

Non Parametric Inference

Principles of Hypothesis Testing for Public Health

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...

Imputation and Analysis. Peter Fayers

Hypothesis testing. c 2014, Jeffrey S. Simonoff 1


Elements of statistics (MATH0487-1)

QUANTITATIVE METHODS BIOLOGY FINAL HONOUR SCHOOL NON-PARAMETRIC TESTS

Chapter 7: Simple linear regression Learning Objectives

DATA COLLECTION AND ANALYSIS

H. The Study Design. William S. Cash and Abigail J. Moss National Center for Health Statistics

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Basic of Epidemiology in Ophthalmology Rajiv Khandekar. Presented in the 2nd Series of the MEACO Live Scientific Lectures 11 August 2014 Riyadh, KSA

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

Introduction. Hypothesis Testing. Hypothesis Testing. Significance Testing

How to choose an analysis to handle missing data in longitudinal observational studies

UNDERSTANDING THE DEPENDENT-SAMPLES t TEST

Measurement in ediscovery

Transcription:

Power Analysis & Sample Size Calculation: Why is it important? F. Farrokhyar, MPhil, PhD Department of Surgery Department of Clinical Epidemiology & Biostatistics

Casual relationship Any association shown in a study: Errors and biases Confounding Chance Causation/association

Errors and biases Error is inaccuracy which is similar between different groups of study. non-differential error Bias is inaccuracy which is different in size and direction between different groups of study; such as selection bias, recall bias, observation bias, measurement bias.

Confounding Confounding bias a distortion of an exposure outcome relationship brought about by the association of another factor with both outcome and exposure. Low exercise Overall OR=4.2 True OR=3.0 MI Positive association True OR=2.0 Obesity

Chance variation The third non-causal explanation for an association is due to chance variation. - is the difference in outcome between the two groups larger than we would except to occur purely by chance? There are two general approaches to assess the role of chance in a clinical observation: Confidence intervals concept of precision Hypothesis testing P value

Stating Hypotheses Clinical hypothesis Does laparoscopic surgery reduces post-op pain in patients with hernia repair? Null Hypothesis Mean post-op pain with laparoscopic surgery is no different from mean post-op pain with open surgery. H 0 : μ L = μ O Alternate Hypothesis - Mean post-op pain with laparoscopic surgery is different from mean post-op pain with open surgery. H a : μ L μ O or H a : μ L < μ O

Assumptions in hypotheses testing The reason for having a null hypothesis is to be able to calculate errors in decision making. Most common assumptions about hypothesis testing are; Errors are drawn from normal distribution except in non-parametric tests. Errors are independent

Relations among truth, decision, and errors Decision Truth H 0 true H 0 true Correct decision True negative (Probability 1-α) H 0 false Type I error False positive (probability α) H 0 false Type II error False negative (probability β) Correct decision True positive (probability 1- β)

Errors probabilities Type I (α) error the null hypothesis may be rejected when it should have been accepted. It s probability of occurring by chance is denoted as α. α = prob[rejecting H 0 H 0 true] Type II (β) error the null hypothesis may be accepted when it should have been rejected. It s probability of occurring by chance is denoted β. β = prob[accepting H 0 H 0 false] 1- β = power of the test.

Why do we need to calculate sample size? To make sure our study sample is large enough to detect the minimum clinically important differences under the fixed chances of type I and II errors. Larger samples provide better estimates, narrower confidence intervals and smaller test errors. Note: The calculated sample size is the minimum number required.

Sample size calculation or Power Analysis Key concepts: Type I error or α error Type II error or β error Outcome of interest; whether the outcome of interest is a categorical (proportions), or a continuous (means) variable. Effect size: the magnitude of the difference to be detected. One-sided or two-sided test

o o o o Steps in sample size calculation To calculate sample size: we fix type I error at α=5% try to make type II error (β) small- usually 20% (power=1- β=80%). we find the information on the proportion or mean and standard deviation using the data from our study or from literature. we decide on the minimum clinically important difference we are willing to detect (d).

Sample Size Formula: Continuous Variable ONE MEAN: n= (z 1-α/2 + z 1-β ) 2 σ 2 (m - μ) 2 d 2 σ= population standard deviation, μ= population mean, m= sample mean Z 1- α/2 = 1.96 when α=5% for two-sided test Z 1-α = 1.645 when α=5% for one-sided test Z 1- β = 0.842 when β = 20% (80% power)

Example for one mean An emergency physician wants to test the effectiveness of a GI cocktail to treat emergency dyspeptic symptoms on a scale of 1-10 pain score. Standard deviation of pain score without treatment is σ =1.73. He considers a reduction in pain score of d=1.5 points as clinically relevant. He believes that the treatment cannot increase pain. How many patients will he need with α=5% and power=80%?

Example for one mean An emergency physician wants to test the effectiveness of a GI cocktail to treat emergency dyspeptic symptoms on a scale of 1-10 pain score. σ of pain score without treatment is 1.73. He considers a reduction in pain scale of d=1.5 points as clinically relevant. He believes that the treatment cannot increase pain. How many patients will he need using α=5% and power=80%? z 1-α =1.645, for 1- sided test when α=5% n= n= 8.21 (1.645+ 0.842) 2 (1.73) 2 (1.5) 2

Sample Size Formula: Continuous variable TWO MEANS: n 1 = n 2 = (z 1-α/2 + z 1-β ) 2 (σ 1 2 + σ 22 ) (m 1 m 2 ) 2 m 1, m 2 = mean value of each group σ 1, σ 2 = standard deviation for each group

Example for two means From literature, mean LV mass indexed regression in patients who had a mechanical valve replacement is 115 + 20 g/m 2 and those who had a tissue valve is 131 + 35 g/m 2. We want to detect a 15 g/m 2 difference in mean LV mass indexed regression between two valve types with alpha of 0.05 and 90% power. How many patient do we need?

Example for two means From literature, mean LV mass indexed regression in patients who had a mechanical valve replacement is 115+ 20 g/m 2 and those who had a tissue valve is 131+ 35 g/m 2. We want to detect 15 g/m 2 difference in mean LV mass indexed regression between two valve types with alpha of 0.05 and 90% power. How many patient do we need? Z 1-β =1.282, is the statistic for 1-β = 10% n 1 = n 2 = n 1 = n 2 =76 (1.96+1.282) 2 (35 2 + 20 2 ) (15) 2

Sample Size Formula: Categorical variable ONE PROPORTION: n = [(z 1-α/2 π(1-π) + z 1-β p(1-p)] 2 (p - π) 2 π = population proportion, p = sample proportion Z 1- α/2 = 1.96 when α=0.05 for two-sided test Z 1-α = 1.645 when α=0.05 for one-sided test Z 1- β = 0.842 when β = 0.2 (80% power) Z 1- β =1.282 when β = 0.1 (90% power)

Example for one proportion The prevalence of IBS in general population is 20%. We think the prevalence might be higher in IBD patients. How many patients with IBD do we need to detect a difference of 10% with 80% power and 5% significance level.

Example for one proportion The prevalence of IBS in general population is 20%. We think the prevalence might be higher in IBD patients. How many patients with IBD do we need to detect a difference of 10% with 80% power and 5% significance level. n = [(1.96 0.2(1-0.2) + 0.84 0.3(1-0.3)] 2 (0.3-0.2) 2 π = 0.2, p = 0.3, z1-α/2 =1.96, z 1-β = 0.842 N= 117

Sample Size Formula: Categorical variable TWO PROPORTIONS: when p m is not close to 0. n 1 = n 2 = [(z 1-α/2 2p m (1-p m ) + z 1-β p 1 (1-p 1 )+ p 2 (1-p 2 )] 2 (p 1 p 2 ) 2 p 1, p 2 = sample proportions from our data P m = p 1 + p 2 / 2

Example for two proportions From literature the prevalence of wound infection in patients after colon resection is 18%. How many patients do we need to detect a difference of 6% wound infection between new antibiotic and standard treatment when α=5% and β = 20%.

Example for two proportions From literature the prevalence of wound infection in patients after colon resection is 18%. How many patients do we need to detect a difference of 6% wound infection between new antibiotic and standard treatment when α=5% and β = 20%. [(1.96 2*0.15*0.85) + 0.84 0.12*0.88+0.18*0.82)] 2 n 1 = n 2 = (0.12-0.18) 2 p 1 =0.18, p 2 =0.12, P m = 0.15, n 1 = n 2 = 555

Sample Size Formula: Categorical variable TWO PROPORTIONS: when p m is close to 0. n 1 = n 2 = [(z 1-α/2 + z 1-β ) p 1 + p 2 ] 2 (p 1 p 2 ) 2 p 1, p 2 = sample proportions from our data

Example for two proportions From our pilot data, the wound infection from open hernia repair is 6% and laparoscopic repair is 2%. How many patients do we need to detect a difference of 4% in wound infection between laparoscopic and open technique when α=5% and β = 20%.

Example for two proportions From our pilot data, the wound infection from open hernia repair is 6% and laparoscopic repair is 2%. How many patients do we need to detect a difference of 4% in wound infection between laparoscopic and open technique when α=5% and β = 20%. [(1.96+ 0.842) 0.06+0.02] 2 n 1 = n 2 = (0.06-0.02) 2 p 1 =0.06, p 2 =0.02, n 1 = n 2 = 393

Sample size calculated vs. sample size needed The calculated sample size should be increased by a safety factor to account for the possible sources of error in the calculation of sample size that could affect study power such as wrong data source, randomness, lost to follow up, mortality, low compliance, allocation ratio and so on.