Statistics for the intensivist (2) ICU Fellowship Training Radboudumc

Similar documents
Statistics for the intensivist (3) ICU Fellowship Training Radboudumc

Chapter 8 Hypothesis Testing Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing

CONTENTS OF DAY 2. II. Why Random Sampling is Important 9 A myth, an urban legend, and the real reason NOTES FOR SUMMER STATISTICS INSTITUTE COURSE

"Statistical methods are objective methods by which group trends are abstracted from observations on many separate individuals." 1

Online 12 - Sections 9.1 and 9.2-Doug Ensley

Introduction to Hypothesis Testing. Hypothesis Testing. Step 1: State the Hypotheses

BA 275 Review Problems - Week 5 (10/23/06-10/27/06) CD Lessons: 48, 49, 50, 51, 52 Textbook: pp

Section 7.1. Introduction to Hypothesis Testing. Schrodinger s cat quantum mechanics thought experiment (1935)

2 Precision-based sample size calculations

1 Hypothesis Testing. H 0 : population parameter = hypothesized value:

Hypothesis Testing: Two Means, Paired Data, Two Proportions

What are confidence intervals and p-values?

HYPOTHESIS TESTING: POWER OF THE TEST

Comparing Two Groups. Standard Error of ȳ 1 ȳ 2. Setting. Two Independent Samples

Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

Testing Hypotheses About Proportions

p ˆ (sample mean and sample

Chapter 3. Sampling. Sampling Methods

Statistics 2014 Scoring Guidelines

BA 275 Review Problems - Week 6 (10/30/06-11/3/06) CD Lessons: 53, 54, 55, 56 Textbook: pp , ,

Introduction. Hypothesis Testing. Hypothesis Testing. Significance Testing

CHAPTER THREE COMMON DESCRIPTIVE STATISTICS COMMON DESCRIPTIVE STATISTICS / 13

WISE Power Tutorial All Exercises

General Method: Difference of Means. 3. Calculate df: either Welch-Satterthwaite formula or simpler df = min(n 1, n 2 ) 1.

R&M Solutons

1.5 Oneway Analysis of Variance

Opgaven Onderzoeksmethoden, Onderdeel Statistiek

Hypothesis testing - Steps

II. DISTRIBUTIONS distribution normal distribution. standard scores

Data Mining Techniques Chapter 5: The Lure of Statistics: Data Mining Using Familiar Tools

X X X a) perfect linear correlation b) no correlation c) positive correlation (r = 1) (r = 0) (0 < r < 1)

Inclusion and Exclusion Criteria

Non-inferiority studies: non-sense or sense?

Test Positive True Positive False Positive. Test Negative False Negative True Negative. Figure 5-1: 2 x 2 Contingency Table

Fixed-Effect Versus Random-Effects Models

Testing Research and Statistical Hypotheses

Fairfield Public Schools

Statistics in Medicine Research Lecture Series CSMC Fall 2014

Study Guide for the Final Exam

Multiple samples: Pairwise comparisons and categorical outcomes

Name: (b) Find the minimum sample size you should use in order for your estimate to be within 0.03 of p when the confidence level is 95%.

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...

UNDERSTANDING THE TWO-WAY ANOVA

Unit 26 Estimation with Confidence Intervals

Lesson 9 Hypothesis Testing

Chapter 3 RANDOM VARIATE GENERATION

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...

abc Mark Scheme Statistics 6380 General Certificate of Education 2006 examination - January series SS02 Statistics 2

Lecture Notes Module 1

Chi Squared and Fisher's Exact Tests. Observed vs Expected Distributions

Descriptive Statistics

Statistics I for QBIC. Contents and Objectives. Chapters 1 7. Revised: August 2013

One-Way Analysis of Variance (ANOVA) Example Problem

Rapid Critical Appraisal of Controlled Trials

Sample size estimation and power analysis for clinical research studies

AP Stats- Mrs. Daniel Chapter 4 MC Practice

SIMPLE LINEAR CORRELATION. r can range from -1 to 1, and is independent of units of measurement. Correlation can be done on two dependent variables.

A) B) C) D)

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Section 13, Part 1 ANOVA. Analysis Of Variance

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression

Sample Size and Power in Clinical Trials

The power of a test is the of. by using a particular and a. value of the that is an to the value

INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA)

Solutions to Homework 10 Statistics 302 Professor Larget

THE FIRST SET OF EXAMPLES USE SUMMARY DATA... EXAMPLE 7.2, PAGE 227 DESCRIBES A PROBLEM AND A HYPOTHESIS TEST IS PERFORMED IN EXAMPLE 7.

Two-sample inference: Continuous data

Hypothesis testing. c 2014, Jeffrey S. Simonoff 1

A POPULATION MEAN, CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

Chapter 7: Simple linear regression Learning Objectives

Scientific Methods II: Correlational Research

One-Way Analysis of Variance

NQF-ENDORSED VOLUNTARY CONSENSUS STANDARDS FOR HOSPITAL CARE

SOLUTIONS TO BIOSTATISTICS PRACTICE PROBLEMS

Sample size estimation is an important concern

Guide to Biostatistics

Introduction to. Hypothesis Testing CHAPTER LEARNING OBJECTIVES. 1 Identify the four steps of hypothesis testing.

" Y. Notation and Equations for Regression Lecture 11/4. Notation:

Technical Information

SCHOOL OF HEALTH AND HUMAN SCIENCES DON T FORGET TO RECODE YOUR MISSING VALUES

C. The null hypothesis is not rejected when the alternative hypothesis is true. A. population parameters.

Tutorial 5: Hypothesis Testing


AN ANALYSIS OF INSURANCE COMPLAINT RATIOS

MONT 107N Understanding Randomness Solutions For Final Examination May 11, 2010

Introduction Hypothesis Methods Results Conclusions Figure 11-1: Format for scientific abstract preparation

Tips for surviving the analysis of survival data. Philip Twumasi-Ankrah, PhD

Chapter 2 Probability Topics SPSS T tests

Social Studies 201 Notes for November 19, 2003

Using Excel for inferential statistics

4/8/13. Pre-test Audience Response. Prostate Cancer Screening and Treatment of Prostate Cancer: The 2013 Perspective

SECOND M.B. AND SECOND VETERINARY M.B. EXAMINATIONS INTRODUCTION TO THE SCIENTIFIC BASIS OF MEDICINE EXAMINATION. Friday 14 March

research/scientific includes the following: statistical hypotheses: you have a null and alternative you accept one and reject the other

Chapter 8 Paired observations

MULTIPLE REGRESSION WITH CATEGORICAL DATA

QUANTITATIVE METHODS BIOLOGY FINAL HONOUR SCHOOL NON-PARAMETRIC TESTS

MATH 140 HYBRID INTRODUCTORY STATISTICS COURSE SYLLABUS

Transcription:

Statistics for the intensivist (2) ICU Fellowship Training Radboudumc

Hypothesis testing The null hypothesis Therapeutic hypothermia (33 0 C) vs normothermic hypothermia (36.8 0 C) N = 260, children age between 2 days - < 18 years, comatose after OHCA % 20 15 10 20 P = 0.14 12 5 0 1-year survival with VABS-II 70

Null hypothesis Due to sampling variation, it is extremely unlikely that the same proportion of patients in each group will die Observed differences may be due to treatment or may be due to chance Hypothesis testing tries to establish which of these explanations is most likely (but can never prove the truth)

Null hypothesis Null hypothesis states that there is no difference between the two treatments Hypothesis testing explores how likely it is that the observed difference would be seen by chance alone if the null hypothesis were true - how likely is it that we find 20% survival in the hypothermia group and 12% in the normothermia group while in fact there is no difference

P-value P = probability Measures how likely it is that the observed difference between groups (20 vs 12%) is due to chance and thus measures the strength of evidence against the null hypothesis In this example there is a 14% chance that the observed difference is solely based on chance

Interpreting P values Patients Tidal volume Mortality Low Tv Control Low Tv Control Low Tv Control P-value 33% Amato 29 24 6.1 ± 0.2 11.9 ± 0.5 38 71 < 0.001 Stewart 60 60 7.2 ± 0.8 10.6 ± 0.2 50 3% 47 0.72 Brochard 58 58 7.2 ± 0.2 10.4 ± 0.2 47 9% 38 0.38 Brower 26 26 7.3 ± 0.1 10.2 ± 0.1 50 4% 46 0.60 ARDSnet 432 429 6.3 ± 0.1 11.7 ± 0.1 31 9% 40 0.007 Villar 50 45 7.3 ± 0.9 10.2 ± 1.2 30 24% 54 0.017

Interpretation ARDSnet trial suggests that there is less than 0.7% chance that the 9% mortality difference between low- and high tidal volume ventilation is based on coincidence if in fact there is no difference Brochard trial suggests that there is less than 38% chance that the 9% mortality difference between low- and high tidal volume ventilation is based on coincidence if in fact there is no difference

Interpretation P-value is significantly influenced by sample size (compare Brochard ARDSnet) P-value changes with the size of the effect (compare Brochard Villar) Never use an arbitrary cut-off P-value to say that something is true or not true Study of Villar does not prove that LTVV decreases mortality because in 1.7% of samples this difference will be found by chance while in fact there is no difference (Type I error) Study of Brochard does not prove that LTVV is not harmful despite the 38% chance that the observed difference of 9% is due to chance (Type II error - Power issue) Statistical significancy does not mean clinically relevant

P-values and confidence intervals 50 Controle P = 0.07 Optimization 30-D mortality or moderate/major complication (%) 40 30 20 10 0 43,4 36,6 N = 734 - major GI surgery Optimise trial Pearse RM. JAMA 2014;311:2181-2190

17-29% minder kans op ernstige complicatie bij tune-up Pearse RM. JAMA 2014;311:2181-2190

Sample size calculation Power = probability of correctly identifying a difference between two groups in the study sample when one genuinely exists in the populations from which the samples were drawn Higher power is usually obtained by increasing the sample size Always calculate the power when a clinically relevant difference between groups is not statistically significant

Hypothermia after OHCA in children MC trial (N = 38) in comatose children (2 D - 18 y) within 6 hrs after OHCA Therapeutic hypothermia (33.0 o C) vs normothermia (36.8 o C) Primary outcome 12 M survival with VABS-II score 70 Moler FW. N Engl J Med 2015;372:1898-1908

Hypothermia after OHCA in children Hypothermia Normothermia 20 20 P = 0.14 Power calculation Absolute effect size of 15-20% % 15 10 12 Primary outcome rate in normothermia group 15-35% 5 Needed SS 275 for power of 85% 0 1-Y survival N = 260 Totally unrealistic

Factors affecting sample size calculation Factor Magnitude Impact on identification Required sample size P-value Small Difficult to achieve significance Large Large Significance easy to obtain Small Power Low Identification unlikely Small High Identification more probable Large Effect Small Difficult to identify Large Large Easy to identify Small Very often p-value set at 0.05 and power at 80-95%

Effect size Take a clinically relevant value (a 0.1% mortality reduction in ARDS will not change practice)

Assume.. You have a new inotropic agent (Levosimendan) You would like to compare this to placebo and see if the CO increases You think that a clinically relevant increase in CO is 1 liter/min and that the CO of the placebo group will be 3 ± 1 L/min Calculate the Standardized difference = target difference (1 liter) / standard deviation (1 liter) Standardized difference = 1

SS calculation Standardized difference Altman nomogram Or a formula N = 2/d 2 C p,power N = number patients/group, d = standardized difference, C p,power = constant P-value Power 80% Power 90% Power 95% 0.05 7.9 10.5 13.0 0.01 11.7 14.9 17.8 For p-value 0.05 and Power 0.80 = 32 patients (2 16) = 2/1 2 7.9 = 16 per group For p-value 0.01 and Power 0.80 = 46 patients (2 23)

SS calculation for proportions Hypothermia Normothermia Power calculation P = 0.14 Absolute effect size of 15-20% 20 15 20 Primary outcome rate in normothermia group 15-35% Needed SS 275 for power of 85% % 10 5 0 1-Y survival N = 260 12 Standardized difference = (p1 - p2)/ [pmean(1-pmean)] = 0.40-0.20/ 0.3(1-0.3)] = 0.436

SS calculation for proportions Or a formula N = number patients/group, p1 = proportion 1, p2 = proportion 2, C p,power = constant P-value Power 80% Power 90% Power 95% 0.05 7.9 10.5 13.0 0.01 11.7 14.9 17.8 For p-value 0.01 and Power 0.85 = 270 patients (2 135) = 130 per group

Calculating power afterwards Hypothermia Normothermia 1100 patients needed 20 15 20 % 10 12 5 Power < 25% to detect significant difference 0 1-Y survival Standardized difference = (0.20-0.12)/ [0.16(1-0.16)] = 0.08/ [0.16(0.84)] = 0.218