Sample Size and Power

Size: px
Start display at page:

Download "Sample Size and Power"

Transcription

1 Sample Size and Power Scott Evans, Ph.D. 1 Sample Size Considerations A pharmaceutical company calls and says, We believe we have found a cure for the common cold. How many patients do I need to study to get our product approved by the FDA? Scott Evans, Ph.D. 2 1

2 Where to begin? N = (Total Budget / Cost per patient)? Hopefully not! Scott Evans, Ph.D. 3 Where to begin? Understand the research question Learn about the application and the problem. Learn about the disease and the medicine. Crystal Ball Visualize the final analysis and the statistical methods to be used. Scott Evans, Ph.D. 4 2

3 Where to begin? Analysis determines sample size. Sample size calculations are based upon the planned method of analysis. If you don t know how the data will be analyzed (e.g., 2-sample t-test), then you cannot accurately estimate the sample size. Scott Evans, Ph.D. 5 Sample Size Calculation Formulate a PRIMARY research question. Identify: 1. A hypothesis to test (write down H 0 and H A ), or 2. A quantity to estimate (e.g., using confidence intervals) Scott Evans, Ph.D. 6 3

4 Sample Size Calculation Determine the endpoint or outcome measure associated with the hypothesis test or quantity to be estimated. How do we measure or quantify the responses? Is the measure continuous, binary, or a timeto-event? Scott Evans, Ph.D. 7 Sample Size Calculation Based upon the PRIMARY outcome Other analyses (i.e., secondary outcomes) may be planned, but the study may not be powered to detect effects for these outcomes. Scott Evans, Ph.D. 8 4

5 Sample Size Calculation Two strategies Hypothesis Testing Estimation with Precision Scott Evans, Ph.D. 9 Sample Size Calculation Using Hypothesis Testing By far, the most common approach. The idea is to choose a sample size such that both of the following conditions simultaneously hold: If the null hypothesis is true, then the probability of incorrectly rejecting is (no more than) α If the alternative hypothesis is true, then the probability of correctly rejecting is (at least) 1-β = power. Scott Evans, Ph.D. 10 5

6 Reality H o True H o False Test Result Reject H o Do not reject H o Type I error (α) 1-α Power (1-β) Type II error (β) Scott Evans, Ph.D. 11 α Determinants of Sample Size: Hypothesis Testing Approach β An effect size to detect Estimates of variability Scott Evans, Ph.D. 12 6

7 What is Needed to Determine the Sample-Size? α Up to the investigator or FDA regulation (often = 0.05) How much type I (false positive) error can you afford? Scott Evans, Ph.D. 13 What is Needed to Determine the Sample-Size? 1-β (power) Up to the investigator (often 80%-90%) How much type II (false negative) error can you afford? Not regulated by FDA Scott Evans, Ph.D. 14 7

8 Choosing α and β Weigh the cost of a Type I error versus a Type II error. In early phase clinical trials, we often do not want to miss a significant result and thus often consider designing a study for higher power (perhaps 90%) and may consider relaxing the α error (perhaps 0.10). In order to approve a new drug, the FDA requires significance in two Phase III trials strictly designed with α error no greater than 0.05 (Power = 1-β is often set to 80%). Scott Evans, Ph.D. 15 Effect Size The minimum difference (between groups) that is clinically relevant or meaningful. Not readily apparent Requires clinical input Often difficult to agree upon Note for noninferiority studies, we identify the maximum irrelevant or non-meaningful difference. Scott Evans, Ph.D. 16 8

9 Estimates of Variability Often obtained from prior studies Explore the literature and data from ongoing studies for estimates needed in calculations Consider conducting a pilot study to estimate this May need to validate this estimate later Scott Evans, Ph.D. 17 Other Considerations 1-sample vs. 2-sample Independent samples or paired 1-sided vs. 2-sided Scott Evans, Ph.D. 18 9

10 Example: Cluster Headaches A experimental drug is being compared with placebo for the treatment of cluster headaches. The design of the study is to randomize an equal number of participants to the new drug and placebo. The participants will be administered the drug or matching placebo. One hour later, the participants will score their pain using the visual analog scale (VAS) for pain. A continuous measure ranging from 0 (no pain) to 10 (severe pain). Scott Evans, Ph.D. 19 Example: Cluster Headaches The planned analysis is a 2-sample t- test (independent groups) comparing the mean VAS score between groups, one hour after drug (or placebo) initiation H 0 : μ 1 =μ 2 vs. H A : μ 1 μ 2 Scott Evans, Ph.D

11 Example: Cluster Headaches It is desirable to detect differences as small as 2 units (on the VAS scale). Using α=0.05 and β=0.80, and an assumed standard deviation (SD) of responses of 4 (in both groups), 63 participants per group (126 total) are required. STATA Command: sampsi 0 2, sd(4) a(0.05) p(.80) Note: you just need a difference of 2 in the first two numbers Scott Evans, Ph.D. 21 Example: Part 2 Let s say that instead of measuring pain on a continuous scale using the VAS, we simply measured response (i.e., the headache is gone) vs. non-response. Scott Evans, Ph.D

12 Example: Part 2 The planned analysis is a 2-sample test (independent groups) comparing the proportion of responders, one hour after drug (or placebo) initiation H 0 : p 1 =p 2 vs. H A : p 1 p 2 Scott Evans, Ph.D. 23 Example: Part 2 It is desirable to detect a difference in response rates of 25% and 50%. Using α=0.05 and β=0.80, STATA Command: sampsi , a(0.05) p(.80) 66 per group (132 total) w/ continuity correction 58 per group (116 total) without continuity correction Scott Evans, Ph.D

13 Notes for Testing Proportions One does not need to specify a variability since it is determined from the proportion. The required sample size for detecting a difference between 0.25 and 0.50 is different from the required sample size for detecting a difference between 0.70 and 0.95 (even though both are 0.25 differences) because the variability is different. This is not the case for means. Scott Evans, Ph.D. 25 Caution for Testing Proportions Some software computes the sample size for testing the null hypothesis of the equality of two proportions using a continuity correction while others calculate sample size without this correction. Answers will differ slightly, although either method is acceptable. STATA uses a continuity correction The website does not Scott Evans, Ph.D

14 Sample Size Calculation Using Estimation with Precision Not nearly as common, but equally as valid. The idea is to estimate a parameter with enough precision to be meaningful. E.g., the width of a confidence interval is narrow enough Scott Evans, Ph.D. 27 α Determinants of Sample Size: Estimation Approach Estimates of variability Precision E.g., The (maximum) desired width of a confidence interval Scott Evans, Ph.D

15 Example: Evaluating a Diagnostic Examination It is desirable to estimate the sensitivity of an examination by trained site nurses relative to an oral medicine specialist for the diagnosis of Oral Candidiasis (OC) in HIV-infected people. Precision: It is desirable to estimate the sensitivity such that the width of a 95% confidence interval is 15%. Scott Evans, Ph.D. 29 Example: Evaluating a Diagnostic Examination Note: sensitivity is a proportion The (large sample) CI for a proportion is: ˆ(1 ˆ) ˆ(1 pˆ), p ˆ z p p, pˆ + z p a / 2 a/ n 2 n Scott Evans, Ph.D

16 Example: Evaluating a Diagnostic Examination We wish the width of the CI to be <0.15 Using an estimated proportion of 0.25 and α=0.05, we can calculate n=129. Since sensitivity is a conditional probability, we need 129 that are OC+ as diagnosed by the oral health specialist. If the prevalence of OC is ~20%, then we would need to enroll or screen ~129/(0.20)=645. Scott Evans, Ph.D. 31 Sensitivity Analyses Sample size calculations require assumptions and estimates. It is prudent to investigate how sensitive the sample size estimates are to changes in these assumptions (as they may be inaccurate). Thus, provide numbers for a range of scenarios and various combinations of parameters (e.g., for various values combinations of α, β, estimates of variance, effect sizes, etc.) Scott Evans, Ph.D

17 Example: Sample Size Sensitivity Analyses for the Study of Cluster Headaches μ 1 μ 2 SD Power=80% Power=90% Scott Evans, Ph.D. 33 Effects of Determinants In general, the following increases the required sample size (with all else being equal): Lower α Lower β Higher variability Smaller effect size to detect More precision required Scott Evans, Ph.D

18 Caution In general, higher sample size implies higher power. Does this mean that a higher sample size is always better? Not necessarily. Studies can be very costly. It is wasteful to power studies to detect between-group differences that are clinically irrelevant. Scott Evans, Ph.D. 35 Sample Size Adjustments Complications (e.g., loss-to-follow-up, poor adherence, etc.) during clinical trials can impact study power. This may be less of a factor in lab experiments. Expect these complications and plan for them BEFORE the study begins. Adjust the sample size estimates to account for these complications. Scott Evans, Ph.D

19 Complications that Decrease Power Missing data Poor Adherence Multiple tests Unequal group sizes Use of nonparametric testing (vs. parametric) Noninferiority or equivalence trials (vs. superiority trials) Inadvertent enrollment of ineligible subjects or subjects that cannot respond Scott Evans, Ph.D. 37 Adjustment for Lost-to-Follow-up Loss-to-Follow-Up (LFU) refers to when a participants endpoint status is not available (missing data). If one assumes that the LFU is non-informative or ignorable (i.e., random and not related to treatment), then a simple sample size adjustment can be made. This is a very strong assumption as LFU is often associated with treatment. The assumption is further difficult to validate. Researchers need to consider the potential bias of examining only subjects with non-missing data. Scott Evans, Ph.D

20 Adjustment for Lost-to-Follow-up Calculate the sample size N. Let x=proportion expected to be lost-to-followup. N adj =N/(1-x) Note: no LFU adjustment is necessary if you plan to impute missing values. However, if you use imputation, an adjustment for a dilution effect may be warranted. Scott Evans, Ph.D. 39 Adjustment for Poor Adherence Adjustment for the dilution effect due to poor adherence or the inclusion (perhaps inadvertently) of subjects that cannot respond: Calculate the sample size N. Let x=proportion expected to be non-adherent. N adj =N/(1-x) 2 Scott Evans, Ph.D

21 Inflation Factor for Non-adherence Proportion non- Adherent Inflation Factor Scott Evans, Ph.D. 41 Adjustment for Unequal Allocation When comparing groups, power is maximized when groups sizes are equal (with all else being equal) There may be other reasons however, to have some group sizes larger than others E.g., having more people on an experimental therapy (rather than placebo) to obtain more safety information of the product Scott Evans, Ph.D

22 Adjustment for Unequal Allocation Adjustment for unequal allocation in two groups: Let Q E and Q C be the sample fractions such that Q E +Q C =1. Note power is optimized when Q E =Q C =0.5 Calculate sample size N bal for equal sample sizes (i.e., Q E =Q C =0.5) N unbal =N bal ((Q E -1 +Q C -1 )/4) Scott Evans, Ph.D. 43 Adjustment for Nonparametric Testing Most sample-size calculations are performed expecting use of parametric methods (e.g., t- test). This is often done because formulas (and software) for these methods are readily available However, parametric assumptions (e.g., normality) do not always hold. Thus nonparametric methods may be required. Scott Evans, Ph.D

23 Adjustment for Nonparametric Testing Pitman Efficiency Applicable for 1 and 2 sample t-tests Method Calculate sample size N par. N nonpar = N par /(0.864) Scott Evans, Ph.D. 45 Example: Cluster Headaches Recall the cluster headache example in which the required sample size was 126 (total) for detecting a 2 unit (VAS scale) difference in means. If we expect 10% of the participants to be non-adherent then an appropriate inflation is needed 126/(1-0.1) 2 =156 If we further expect that we will have to perform a nonparametric test (instead of a t-test) due to nonnormality, then further inflation is required: 156/(0.864)=181 Round to 182 to have an equal number (81) in each group Scott Evans, Ph.D

24 Adjustment: Noninferiority/Equivalence Studies Calculate sample size for standard superiority trial but reverse the roles of α and β. Works for large sample binary and continuous data. Does not work for time-to-event data. Scott Evans, Ph.D. 47 More Adjustments? Adjustments are needed if: You plan interim analyses Group sequential designs You have more than one primary test to be conducted Multiple comparison adjustments E.g., Bonferroni (if 2 tests or comparisons are to be made, then power each at α/2. Additional adjustments may be needed for stratification, blocking, or matching. Scott Evans, Ph.D

25 Sample Size Re-estimation Hot Topic in clinical trials Re-estimating sample size based on interim data Complicated Must be done carefully to maintain scientific integrity and blinding. Scott Evans, Ph.D

Sample Size and Power in Clinical Trials

Sample Size and Power in Clinical Trials Sample Size and Power in Clinical Trials Version 1.0 May 011 1. Power of a Test. Factors affecting Power 3. Required Sample Size RELATED ISSUES 1. Effect Size. Test Statistics 3. Variation 4. Significance

More information

Two-Sample T-Tests Allowing Unequal Variance (Enter Difference)

Two-Sample T-Tests Allowing Unequal Variance (Enter Difference) Chapter 45 Two-Sample T-Tests Allowing Unequal Variance (Enter Difference) Introduction This procedure provides sample size and power calculations for one- or two-sided two-sample t-tests when no assumption

More information

Two-Sample T-Tests Assuming Equal Variance (Enter Means)

Two-Sample T-Tests Assuming Equal Variance (Enter Means) Chapter 4 Two-Sample T-Tests Assuming Equal Variance (Enter Means) Introduction This procedure provides sample size and power calculations for one- or two-sided two-sample t-tests when the variances of

More information

Sample Size Planning, Calculation, and Justification

Sample Size Planning, Calculation, and Justification Sample Size Planning, Calculation, and Justification Theresa A Scott, MS Vanderbilt University Department of Biostatistics theresa.scott@vanderbilt.edu http://biostat.mc.vanderbilt.edu/theresascott Theresa

More information

Descriptive Statistics

Descriptive Statistics Descriptive Statistics Primer Descriptive statistics Central tendency Variation Relative position Relationships Calculating descriptive statistics Descriptive Statistics Purpose to describe or summarize

More information

Biostat Methods STAT 5820/6910 Handout #6: Intro. to Clinical Trials (Matthews text)

Biostat Methods STAT 5820/6910 Handout #6: Intro. to Clinical Trials (Matthews text) Biostat Methods STAT 5820/6910 Handout #6: Intro. to Clinical Trials (Matthews text) Key features of RCT (randomized controlled trial) One group (treatment) receives its treatment at the same time another

More information

Analysis and Interpretation of Clinical Trials. How to conclude?

Analysis and Interpretation of Clinical Trials. How to conclude? www.eurordis.org Analysis and Interpretation of Clinical Trials How to conclude? Statistical Issues Dr Ferran Torres Unitat de Suport en Estadística i Metodología - USEM Statistics and Methodology Support

More information

Introduction. Hypothesis Testing. Hypothesis Testing. Significance Testing

Introduction. Hypothesis Testing. Hypothesis Testing. Significance Testing Introduction Hypothesis Testing Mark Lunt Arthritis Research UK Centre for Ecellence in Epidemiology University of Manchester 13/10/2015 We saw last week that we can never know the population parameters

More information

2 Precision-based sample size calculations

2 Precision-based sample size calculations Statistics: An introduction to sample size calculations Rosie Cornish. 2006. 1 Introduction One crucial aspect of study design is deciding how big your sample should be. If you increase your sample size

More information

Confidence Intervals for One Standard Deviation Using Standard Deviation

Confidence Intervals for One Standard Deviation Using Standard Deviation Chapter 640 Confidence Intervals for One Standard Deviation Using Standard Deviation Introduction This routine calculates the sample size necessary to achieve a specified interval width or distance from

More information

Standard Deviation Estimator

Standard Deviation Estimator CSS.com Chapter 905 Standard Deviation Estimator Introduction Even though it is not of primary interest, an estimate of the standard deviation (SD) is needed when calculating the power or sample size of

More information

Inclusion and Exclusion Criteria

Inclusion and Exclusion Criteria Inclusion and Exclusion Criteria Inclusion criteria = attributes of subjects that are essential for their selection to participate. Inclusion criteria function remove the influence of specific confounding

More information

Comparing Two Groups. Standard Error of ȳ 1 ȳ 2. Setting. Two Independent Samples

Comparing Two Groups. Standard Error of ȳ 1 ȳ 2. Setting. Two Independent Samples Comparing Two Groups Chapter 7 describes two ways to compare two populations on the basis of independent samples: a confidence interval for the difference in population means and a hypothesis test. The

More information

HYPOTHESIS TESTING: POWER OF THE TEST

HYPOTHESIS TESTING: POWER OF THE TEST HYPOTHESIS TESTING: POWER OF THE TEST The first 6 steps of the 9-step test of hypothesis are called "the test". These steps are not dependent on the observed data values. When planning a research project,

More information

X X X a) perfect linear correlation b) no correlation c) positive correlation (r = 1) (r = 0) (0 < r < 1)

X X X a) perfect linear correlation b) no correlation c) positive correlation (r = 1) (r = 0) (0 < r < 1) CORRELATION AND REGRESSION / 47 CHAPTER EIGHT CORRELATION AND REGRESSION Correlation and regression are statistical methods that are commonly used in the medical literature to compare two or more variables.

More information

MISSING DATA: THE POINT OF VIEW OF ETHICAL COMMITTEES

MISSING DATA: THE POINT OF VIEW OF ETHICAL COMMITTEES I CONGRESSO NAZIONALE BIAS 2009 29/30 APRILE 2009 ELI LILLY SESTO FIORENTINO (FI) MISSING DATA: THE POINT OF VIEW OF ETHICAL COMMITTEES Anna Chiara Frigo Department of Environmental Medicine and Public

More information

Lecture Notes Module 1

Lecture Notes Module 1 Lecture Notes Module 1 Study Populations A study population is a clearly defined collection of people, animals, plants, or objects. In psychological research, a study population usually consists of a specific

More information

Study Design and Statistical Analysis

Study Design and Statistical Analysis Study Design and Statistical Analysis Anny H Xiang, PhD Department of Preventive Medicine University of Southern California Outline Designing Clinical Research Studies Statistical Data Analysis Designing

More information

II. DISTRIBUTIONS distribution normal distribution. standard scores

II. DISTRIBUTIONS distribution normal distribution. standard scores Appendix D Basic Measurement And Statistics The following information was developed by Steven Rothke, PhD, Department of Psychology, Rehabilitation Institute of Chicago (RIC) and expanded by Mary F. Schmidt,

More information

Guidelines for AJO-DO submissions: Randomized Clinical Trials June 2015

Guidelines for AJO-DO submissions: Randomized Clinical Trials June 2015 Guidelines for AJO-DO submissions: Randomized Clinical Trials June 2015 Complete and transparent reporting allows for accurate assessment of the quality of trial and correct interpretation of the trial

More information

Variables Control Charts

Variables Control Charts MINITAB ASSISTANT WHITE PAPER This paper explains the research conducted by Minitab statisticians to develop the methods and data checks used in the Assistant in Minitab 17 Statistical Software. Variables

More information

Biostatistics: Types of Data Analysis

Biostatistics: Types of Data Analysis Biostatistics: Types of Data Analysis Theresa A Scott, MS Vanderbilt University Department of Biostatistics theresa.scott@vanderbilt.edu http://biostat.mc.vanderbilt.edu/theresascott Theresa A Scott, MS

More information

TUTORIAL on ICH E9 and Other Statistical Regulatory Guidance. Session 1: ICH E9 and E10. PSI Conference, May 2011

TUTORIAL on ICH E9 and Other Statistical Regulatory Guidance. Session 1: ICH E9 and E10. PSI Conference, May 2011 TUTORIAL on ICH E9 and Other Statistical Regulatory Guidance Session 1: PSI Conference, May 2011 Kerry Gordon, Quintiles 1 E9, and how to locate it 2 ICH E9 Statistical Principles for Clinical Trials (Issued

More information

Chapter 7 Notes - Inference for Single Samples. You know already for a large sample, you can invoke the CLT so:

Chapter 7 Notes - Inference for Single Samples. You know already for a large sample, you can invoke the CLT so: Chapter 7 Notes - Inference for Single Samples You know already for a large sample, you can invoke the CLT so: X N(µ, ). Also for a large sample, you can replace an unknown σ by s. You know how to do a

More information

"Statistical methods are objective methods by which group trends are abstracted from observations on many separate individuals." 1

Statistical methods are objective methods by which group trends are abstracted from observations on many separate individuals. 1 BASIC STATISTICAL THEORY / 3 CHAPTER ONE BASIC STATISTICAL THEORY "Statistical methods are objective methods by which group trends are abstracted from observations on many separate individuals." 1 Medicine

More information

Non-inferiority studies: non-sense or sense?

Non-inferiority studies: non-sense or sense? Non-inferiority studies: non-sense or sense? Prof. Emmanuel Lesaffre Department of Biostatistics, Erasmus MC, Rotterdam, the Netherlands L-Biostat, K.U.Leuven, Leuven, Belgium 1 2 3 1. Review of study

More information

Non-Inferiority Tests for Two Proportions

Non-Inferiority Tests for Two Proportions Chapter 0 Non-Inferiority Tests for Two Proportions Introduction This module provides power analysis and sample size calculation for non-inferiority and superiority tests in twosample designs in which

More information

Chapter 2. Hypothesis testing in one population

Chapter 2. Hypothesis testing in one population Chapter 2. Hypothesis testing in one population Contents Introduction, the null and alternative hypotheses Hypothesis testing process Type I and Type II errors, power Test statistic, level of significance

More information

AVOIDING BIAS AND RANDOM ERROR IN DATA ANALYSIS

AVOIDING BIAS AND RANDOM ERROR IN DATA ANALYSIS AVOIDING BIAS AND RANDOM ERROR IN DATA ANALYSIS Susan Ellenberg, Ph.D. Perelman School of Medicine University of Pennsylvania School of Medicine FDA Clinical Investigator Course White Oak, MD November

More information

Imputation and Analysis. Peter Fayers

Imputation and Analysis. Peter Fayers Missing Data in Palliative Care Research Imputation and Analysis Peter Fayers Department of Public Health University of Aberdeen NTNU Det medisinske fakultet Missing data Missing data is a major problem

More information

StatCrunch and Nonparametric Statistics

StatCrunch and Nonparametric Statistics StatCrunch and Nonparametric Statistics You can use StatCrunch to calculate the values of nonparametric statistics. It may not be obvious how to enter the data in StatCrunch for various data sets that

More information

Online 12 - Sections 9.1 and 9.2-Doug Ensley

Online 12 - Sections 9.1 and 9.2-Doug Ensley Student: Date: Instructor: Doug Ensley Course: MAT117 01 Applied Statistics - Ensley Assignment: Online 12 - Sections 9.1 and 9.2 1. Does a P-value of 0.001 give strong evidence or not especially strong

More information

Principles of Hypothesis Testing for Public Health

Principles of Hypothesis Testing for Public Health Principles of Hypothesis Testing for Public Health Laura Lee Johnson, Ph.D. Statistician National Center for Complementary and Alternative Medicine johnslau@mail.nih.gov Fall 2011 Answers to Questions

More information

Comparing Means in Two Populations

Comparing Means in Two Populations Comparing Means in Two Populations Overview The previous section discussed hypothesis testing when sampling from a single population (either a single mean or two means from the same population). Now we

More information

Tutorial 5: Hypothesis Testing

Tutorial 5: Hypothesis Testing Tutorial 5: Hypothesis Testing Rob Nicholls nicholls@mrc-lmb.cam.ac.uk MRC LMB Statistics Course 2014 Contents 1 Introduction................................ 1 2 Testing distributional assumptions....................

More information

Non-Inferiority Tests for Two Means using Differences

Non-Inferiority Tests for Two Means using Differences Chapter 450 on-inferiority Tests for Two Means using Differences Introduction This procedure computes power and sample size for non-inferiority tests in two-sample designs in which the outcome is a continuous

More information

Non-Inferiority Tests for One Mean

Non-Inferiority Tests for One Mean Chapter 45 Non-Inferiority ests for One Mean Introduction his module computes power and sample size for non-inferiority tests in one-sample designs in which the outcome is distributed as a normal random

More information

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING In this lab you will explore the concept of a confidence interval and hypothesis testing through a simulation problem in engineering setting.

More information

research/scientific includes the following: statistical hypotheses: you have a null and alternative you accept one and reject the other

research/scientific includes the following: statistical hypotheses: you have a null and alternative you accept one and reject the other 1 Hypothesis Testing Richard S. Balkin, Ph.D., LPC-S, NCC 2 Overview When we have questions about the effect of a treatment or intervention or wish to compare groups, we use hypothesis testing Parametric

More information

NONPARAMETRIC STATISTICS 1. depend on assumptions about the underlying distribution of the data (or on the Central Limit Theorem)

NONPARAMETRIC STATISTICS 1. depend on assumptions about the underlying distribution of the data (or on the Central Limit Theorem) NONPARAMETRIC STATISTICS 1 PREVIOUSLY parametric statistics in estimation and hypothesis testing... construction of confidence intervals computing of p-values classical significance testing depend on assumptions

More information

Transferability of Economic Evaluations in Clinical Trials

Transferability of Economic Evaluations in Clinical Trials Transferability of Economic Evaluations in Clinical Trials Henry Glick Institutt for helseledelse og helseøkonomi November 25, 2008 The Problem Multicenter and multinational studies are the norm for the

More information

Organizing Your Approach to a Data Analysis

Organizing Your Approach to a Data Analysis Biost/Stat 578 B: Data Analysis Emerson, September 29, 2003 Handout #1 Organizing Your Approach to a Data Analysis The general theme should be to maximize thinking about the data analysis and to minimize

More information

Introduction to Analysis of Variance (ANOVA) Limitations of the t-test

Introduction to Analysis of Variance (ANOVA) Limitations of the t-test Introduction to Analysis of Variance (ANOVA) The Structural Model, The Summary Table, and the One- Way ANOVA Limitations of the t-test Although the t-test is commonly used, it has limitations Can only

More information

UNDERSTANDING THE TWO-WAY ANOVA

UNDERSTANDING THE TWO-WAY ANOVA UNDERSTANDING THE e have seen how the one-way ANOVA can be used to compare two or more sample means in studies involving a single independent variable. This can be extended to two independent variables

More information

Chapter Eight: Quantitative Methods

Chapter Eight: Quantitative Methods Chapter Eight: Quantitative Methods RESEARCH DESIGN Qualitative, Quantitative, and Mixed Methods Approaches Third Edition John W. Creswell Chapter Outline Defining Surveys and Experiments Components of

More information

Two-sample inference: Continuous data

Two-sample inference: Continuous data Two-sample inference: Continuous data Patrick Breheny April 5 Patrick Breheny STA 580: Biostatistics I 1/32 Introduction Our next two lectures will deal with two-sample inference for continuous data As

More information

How To Check For Differences In The One Way Anova

How To Check For Differences In The One Way Anova MINITAB ASSISTANT WHITE PAPER This paper explains the research conducted by Minitab statisticians to develop the methods and data checks used in the Assistant in Minitab 17 Statistical Software. One-Way

More information

Good luck! BUSINESS STATISTICS FINAL EXAM INSTRUCTIONS. Name:

Good luck! BUSINESS STATISTICS FINAL EXAM INSTRUCTIONS. Name: Glo bal Leadership M BA BUSINESS STATISTICS FINAL EXAM Name: INSTRUCTIONS 1. Do not open this exam until instructed to do so. 2. Be sure to fill in your name before starting the exam. 3. You have two hours

More information

Permutation Tests for Comparing Two Populations

Permutation Tests for Comparing Two Populations Permutation Tests for Comparing Two Populations Ferry Butar Butar, Ph.D. Jae-Wan Park Abstract Permutation tests for comparing two populations could be widely used in practice because of flexibility of

More information

Recall this chart that showed how most of our course would be organized:

Recall this chart that showed how most of our course would be organized: Chapter 4 One-Way ANOVA Recall this chart that showed how most of our course would be organized: Explanatory Variable(s) Response Variable Methods Categorical Categorical Contingency Tables Categorical

More information

Fixed-Effect Versus Random-Effects Models

Fixed-Effect Versus Random-Effects Models CHAPTER 13 Fixed-Effect Versus Random-Effects Models Introduction Definition of a summary effect Estimating the summary effect Extreme effect size in a large study or a small study Confidence interval

More information

Research Methods & Experimental Design

Research Methods & Experimental Design Research Methods & Experimental Design 16.422 Human Supervisory Control April 2004 Research Methods Qualitative vs. quantitative Understanding the relationship between objectives (research question) and

More information

Hypothesis testing. c 2014, Jeffrey S. Simonoff 1

Hypothesis testing. c 2014, Jeffrey S. Simonoff 1 Hypothesis testing So far, we ve talked about inference from the point of estimation. We ve tried to answer questions like What is a good estimate for a typical value? or How much variability is there

More information

SCHOOL OF HEALTH AND HUMAN SCIENCES DON T FORGET TO RECODE YOUR MISSING VALUES

SCHOOL OF HEALTH AND HUMAN SCIENCES DON T FORGET TO RECODE YOUR MISSING VALUES SCHOOL OF HEALTH AND HUMAN SCIENCES Using SPSS Topics addressed today: 1. Differences between groups 2. Graphing Use the s4data.sav file for the first part of this session. DON T FORGET TO RECODE YOUR

More information

How to evaluate medications in Multiple Sclerosis when placebo controlled RCTs are not feasible

How to evaluate medications in Multiple Sclerosis when placebo controlled RCTs are not feasible University of Florence Dept. of Neurosciences Careggi University Hospital Dept of Neurosciences How to evaluate medications in Multiple Sclerosis when placebo controlled RCTs are not feasible Luca Massacesi,

More information

Study Design Sample Size Calculation & Power Analysis. RCMAR/CHIME April 21, 2014 Honghu Liu, PhD Professor University of California Los Angeles

Study Design Sample Size Calculation & Power Analysis. RCMAR/CHIME April 21, 2014 Honghu Liu, PhD Professor University of California Los Angeles Study Design Sample Size Calculation & Power Analysis RCMAR/CHIME April 21, 2014 Honghu Liu, PhD Professor University of California Los Angeles Contents 1. Background 2. Common Designs 3. Examples 4. Computer

More information

WISE Power Tutorial All Exercises

WISE Power Tutorial All Exercises ame Date Class WISE Power Tutorial All Exercises Power: The B.E.A.. Mnemonic Four interrelated features of power can be summarized using BEA B Beta Error (Power = 1 Beta Error): Beta error (or Type II

More information

Section 13, Part 1 ANOVA. Analysis Of Variance

Section 13, Part 1 ANOVA. Analysis Of Variance Section 13, Part 1 ANOVA Analysis Of Variance Course Overview So far in this course we ve covered: Descriptive statistics Summary statistics Tables and Graphs Probability Probability Rules Probability

More information

Analysis of Variance ANOVA

Analysis of Variance ANOVA Analysis of Variance ANOVA Overview We ve used the t -test to compare the means from two independent groups. Now we ve come to the final topic of the course: how to compare means from more than two populations.

More information

Stat 411/511 THE RANDOMIZATION TEST. Charlotte Wickham. stat511.cwick.co.nz. Oct 16 2015

Stat 411/511 THE RANDOMIZATION TEST. Charlotte Wickham. stat511.cwick.co.nz. Oct 16 2015 Stat 411/511 THE RANDOMIZATION TEST Oct 16 2015 Charlotte Wickham stat511.cwick.co.nz Today Review randomization model Conduct randomization test What about CIs? Using a t-distribution as an approximation

More information

Critical Appraisal of Article on Therapy

Critical Appraisal of Article on Therapy Critical Appraisal of Article on Therapy What question did the study ask? Guide Are the results Valid 1. Was the assignment of patients to treatments randomized? And was the randomization list concealed?

More information

Guideline on missing data in confirmatory clinical trials

Guideline on missing data in confirmatory clinical trials 2 July 2010 EMA/CPMP/EWP/1776/99 Rev. 1 Committee for Medicinal Products for Human Use (CHMP) Guideline on missing data in confirmatory clinical trials Discussion in the Efficacy Working Party June 1999/

More information

Fairfield Public Schools

Fairfield Public Schools Mathematics Fairfield Public Schools AP Statistics AP Statistics BOE Approved 04/08/2014 1 AP STATISTICS Critical Areas of Focus AP Statistics is a rigorous course that offers advanced students an opportunity

More information

6. MEASURING EFFECTS OVERVIEW CHOOSE APPROPRIATE METRICS

6. MEASURING EFFECTS OVERVIEW CHOOSE APPROPRIATE METRICS 45 6. MEASURING EFFECTS OVERVIEW In Section 4, we provided an overview of how to select metrics for monitoring implementation progress. This section provides additional detail on metric selection and offers

More information

Introduction to Hypothesis Testing OPRE 6301

Introduction to Hypothesis Testing OPRE 6301 Introduction to Hypothesis Testing OPRE 6301 Motivation... The purpose of hypothesis testing is to determine whether there is enough statistical evidence in favor of a certain belief, or hypothesis, about

More information

This clinical study synopsis is provided in line with Boehringer Ingelheim s Policy on Transparency and Publication of Clinical Study Data.

This clinical study synopsis is provided in line with Boehringer Ingelheim s Policy on Transparency and Publication of Clinical Study Data. abcd Clinical Study for Public Disclosure This clinical study synopsis is provided in line with s Policy on Transparency and Publication of Clinical Study Data. The synopsis which is part of the clinical

More information

SENSITIVITY ANALYSIS AND INFERENCE. Lecture 12

SENSITIVITY ANALYSIS AND INFERENCE. Lecture 12 This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this

More information

Skewed Data and Non-parametric Methods

Skewed Data and Non-parametric Methods 0 2 4 6 8 10 12 14 Skewed Data and Non-parametric Methods Comparing two groups: t-test assumes data are: 1. Normally distributed, and 2. both samples have the same SD (i.e. one sample is simply shifted

More information

11. Analysis of Case-control Studies Logistic Regression

11. Analysis of Case-control Studies Logistic Regression Research methods II 113 11. Analysis of Case-control Studies Logistic Regression This chapter builds upon and further develops the concepts and strategies described in Ch.6 of Mother and Child Health:

More information

Study Guide for the Final Exam

Study Guide for the Final Exam Study Guide for the Final Exam When studying, remember that the computational portion of the exam will only involve new material (covered after the second midterm), that material from Exam 1 will make

More information

Evidenced Based Therapy Practice within the BOP. Presented by: CDR Eric Payne, DPT, OCS

Evidenced Based Therapy Practice within the BOP. Presented by: CDR Eric Payne, DPT, OCS Evidenced Based Therapy Practice within the BOP Presented by: CDR Eric Payne, DPT, OCS JASPA*(Journal associated score of personal angst) J: Are you ambivalent about renewing your JOURNAL subscriptions?

More information

Outline. Definitions Descriptive vs. Inferential Statistics The t-test - One-sample t-test

Outline. Definitions Descriptive vs. Inferential Statistics The t-test - One-sample t-test The t-test Outline Definitions Descriptive vs. Inferential Statistics The t-test - One-sample t-test - Dependent (related) groups t-test - Independent (unrelated) groups t-test Comparing means Correlation

More information

Descriptive Methods Ch. 6 and 7

Descriptive Methods Ch. 6 and 7 Descriptive Methods Ch. 6 and 7 Purpose of Descriptive Research Purely descriptive research describes the characteristics or behaviors of a given population in a systematic and accurate fashion. Correlational

More information

Power Analysis: Intermediate Course in the UCLA Statistical Consulting Series on Power

Power Analysis: Intermediate Course in the UCLA Statistical Consulting Series on Power Power Analysis: Intermediate Course in the UCLA Statistical Consulting Series on Power By Jason C. Cole, PhD QualityMetric, Inc. Senior Consulting Scientist jcole@qualitymetric.com 310-539-2024 Consulting

More information

Linear Models in STATA and ANOVA

Linear Models in STATA and ANOVA Session 4 Linear Models in STATA and ANOVA Page Strengths of Linear Relationships 4-2 A Note on Non-Linear Relationships 4-4 Multiple Linear Regression 4-5 Removal of Variables 4-8 Independent Samples

More information

Basic research methods. Basic research methods. Question: BRM.2. Question: BRM.1

Basic research methods. Basic research methods. Question: BRM.2. Question: BRM.1 BRM.1 The proportion of individuals with a particular disease who die from that condition is called... BRM.2 This study design examines factors that may contribute to a condition by comparing subjects

More information

Statistics 2014 Scoring Guidelines

Statistics 2014 Scoring Guidelines AP Statistics 2014 Scoring Guidelines College Board, Advanced Placement Program, AP, AP Central, and the acorn logo are registered trademarks of the College Board. AP Central is the official online home

More information

Sample size estimation is an important concern

Sample size estimation is an important concern Sample size and power calculations made simple Evie McCrum-Gardner Background: Sample size estimation is an important concern for researchers as guidelines must be adhered to for ethics committees, grant

More information

Consider a study in which. How many subjects? The importance of sample size calculations. An insignificant effect: two possibilities.

Consider a study in which. How many subjects? The importance of sample size calculations. An insignificant effect: two possibilities. Consider a study in which How many subjects? The importance of sample size calculations Office of Research Protections Brown Bag Series KB Boomer, Ph.D. Director, boomer@stat.psu.edu A researcher conducts

More information

SPSS Explore procedure

SPSS Explore procedure SPSS Explore procedure One useful function in SPSS is the Explore procedure, which will produce histograms, boxplots, stem-and-leaf plots and extensive descriptive statistics. To run the Explore procedure,

More information

What are confidence intervals and p-values?

What are confidence intervals and p-values? What is...? series Second edition Statistics Supported by sanofi-aventis What are confidence intervals and p-values? Huw TO Davies PhD Professor of Health Care Policy and Management, University of St Andrews

More information

SAMPLING & INFERENTIAL STATISTICS. Sampling is necessary to make inferences about a population.

SAMPLING & INFERENTIAL STATISTICS. Sampling is necessary to make inferences about a population. SAMPLING & INFERENTIAL STATISTICS Sampling is necessary to make inferences about a population. SAMPLING The group that you observe or collect data from is the sample. The group that you make generalizations

More information

MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS

MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS MSR = Mean Regression Sum of Squares MSE = Mean Squared Error RSS = Regression Sum of Squares SSE = Sum of Squared Errors/Residuals α = Level of Significance

More information

Regression Modeling Strategies

Regression Modeling Strategies Frank E. Harrell, Jr. Regression Modeling Strategies With Applications to Linear Models, Logistic Regression, and Survival Analysis With 141 Figures Springer Contents Preface Typographical Conventions

More information

Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm

Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm Mgt 540 Research Methods Data Analysis 1 Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm http://web.utk.edu/~dap/random/order/start.htm

More information

CALCULATIONS & STATISTICS

CALCULATIONS & STATISTICS CALCULATIONS & STATISTICS CALCULATION OF SCORES Conversion of 1-5 scale to 0-100 scores When you look at your report, you will notice that the scores are reported on a 0-100 scale, even though respondents

More information

Interpretation of Somers D under four simple models

Interpretation of Somers D under four simple models Interpretation of Somers D under four simple models Roger B. Newson 03 September, 04 Introduction Somers D is an ordinal measure of association introduced by Somers (96)[9]. It can be defined in terms

More information

Missing Data: Part 1 What to Do? Carol B. Thompson Johns Hopkins Biostatistics Center SON Brown Bag 3/20/13

Missing Data: Part 1 What to Do? Carol B. Thompson Johns Hopkins Biostatistics Center SON Brown Bag 3/20/13 Missing Data: Part 1 What to Do? Carol B. Thompson Johns Hopkins Biostatistics Center SON Brown Bag 3/20/13 Overview Missingness and impact on statistical analysis Missing data assumptions/mechanisms Conventional

More information

Lean Six Sigma Black Belt Body of Knowledge

Lean Six Sigma Black Belt Body of Knowledge General Lean Six Sigma Defined UN Describe Nature and purpose of Lean Six Sigma Integration of Lean and Six Sigma UN Compare and contrast focus and approaches (Process Velocity and Quality) Y=f(X) Input

More information

Form B-1. Inclusion form for the effectiveness of different methods of toilet training for bowel and bladder control

Form B-1. Inclusion form for the effectiveness of different methods of toilet training for bowel and bladder control Form B-1. Inclusion form for the effectiveness of different methods of toilet training for bowel and bladder control Form B-2. Assessment of methodology for non-randomized controlled trials for the effectiveness

More information

Planning sample size for randomized evaluations

Planning sample size for randomized evaluations TRANSLATING RESEARCH INTO ACTION Planning sample size for randomized evaluations Simone Schaner Dartmouth College povertyactionlab.org 1 Course Overview Why evaluate? What is evaluation? Outcomes, indicators

More information

Chapter 9. Two-Sample Tests. Effect Sizes and Power Paired t Test Calculation

Chapter 9. Two-Sample Tests. Effect Sizes and Power Paired t Test Calculation Chapter 9 Two-Sample Tests Paired t Test (Correlated Groups t Test) Effect Sizes and Power Paired t Test Calculation Summary Independent t Test Chapter 9 Homework Power and Two-Sample Tests: Paired Versus

More information

Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means

Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means Lesson : Comparison of Population Means Part c: Comparison of Two- Means Welcome to lesson c. This third lesson of lesson will discuss hypothesis testing for two independent means. Steps in Hypothesis

More information

How To Test For Significance On A Data Set

How To Test For Significance On A Data Set Non-Parametric Univariate Tests: 1 Sample Sign Test 1 1 SAMPLE SIGN TEST A non-parametric equivalent of the 1 SAMPLE T-TEST. ASSUMPTIONS: Data is non-normally distributed, even after log transforming.

More information

Self-Check and Review Chapter 1 Sections 1.1-1.2

Self-Check and Review Chapter 1 Sections 1.1-1.2 Self-Check and Review Chapter 1 Sections 1.1-1.2 Practice True/False 1. The entire collection of individuals or objects about which information is desired is called a sample. 2. A study is an observational

More information

Chapter 8 Hypothesis Testing Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing

Chapter 8 Hypothesis Testing Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing Chapter 8 Hypothesis Testing 1 Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing 8-3 Testing a Claim About a Proportion 8-5 Testing a Claim About a Mean: s Not Known 8-6 Testing

More information

IS 30 THE MAGIC NUMBER? ISSUES IN SAMPLE SIZE ESTIMATION

IS 30 THE MAGIC NUMBER? ISSUES IN SAMPLE SIZE ESTIMATION Current Topic IS 30 THE MAGIC NUMBER? ISSUES IN SAMPLE SIZE ESTIMATION Sitanshu Sekhar Kar 1, Archana Ramalingam 2 1Assistant Professor; 2 Post- graduate, Department of Preventive and Social Medicine,

More information

What is a P-value? Ronald A. Thisted, PhD Departments of Statistics and Health Studies The University of Chicago

What is a P-value? Ronald A. Thisted, PhD Departments of Statistics and Health Studies The University of Chicago What is a P-value? Ronald A. Thisted, PhD Departments of Statistics and Health Studies The University of Chicago 8 June 1998, Corrections 14 February 2010 Abstract Results favoring one treatment over another

More information