Lesson 14 14 Outline Outline



Similar documents
Chi-square test Fisher s Exact test

Use of the Chi-Square Statistic. Marie Diener-West, PhD Johns Hopkins University

Odds ratio, Odds ratio test for independence, chi-squared statistic.

Case-control studies. Alfredo Morabia

Nominal and Real U.S. GDP

Guide to Biostatistics

Unit 12 Logistic Regression Supplementary Chapter 14 in IPS On CD (Chap 16, 5th ed.)

PRIMARY CONTENT MODULE Algebra I -Linear Equations & Inequalities T-71. Applications. F = mc + b.

Principles of Hypothesis Testing for Public Health

AP STATISTICS (Warm-Up Exercises)

Two Correlated Proportions (McNemar Test)

Bayes Theorem & Diagnostic Tests Screening Tests

Categorical Data Analysis

Nominal and ordinal logistic regression

Binary Diagnostic Tests Two Independent Samples

Sample size estimation for correlations with pre-specified confidence interval

Section 12 Part 2. Chi-square test

FINAL EXAM SECTIONS AND OBJECTIVES FOR COLLEGE ALGEBRA

CALCULATIONS & STATISTICS

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

P (B) In statistics, the Bayes theorem is often used in the following way: P (Data Unknown)P (Unknown) P (Data)

III. INTRODUCTION TO LOGISTIC REGRESSION. a) Example: APACHE II Score and Mortality in Sepsis

Summary Measures (Ratio, Proportion, Rate) Marie Diener-West, PhD Johns Hopkins University

BASIC COURSE IN STATISTICS FOR MEDICAL DOCTORS, SCIENTISTS & OTHER RESEARCH INVESTIGATORS INFORMATION BROCHURE

How to get accurate sample size and power with nquery Advisor R

Statistics 305: Introduction to Biostatistical Methods for Health Sciences

Correlation Coefficient The correlation coefficient is a summary statistic that describes the linear relationship between two numerical variables 2

Week 3&4: Z tables and the Sampling Distribution of X

CHAPTER 14 ORDINAL MEASURES OF CORRELATION: SPEARMAN'S RHO AND GAMMA

Data Analysis, Research Study Design and the IRB

Point and Interval Estimates

Laboratory 3 Type I, II Error, Sample Size, Statistical Power

A.2. Exponents and Radicals. Integer Exponents. What you should learn. Exponential Notation. Why you should learn it. Properties of Exponents

Introduction. Hypothesis Testing. Hypothesis Testing. Significance Testing

I L L I N O I S UNIVERSITY OF ILLINOIS AT URBANA-CHAMPAIGN

5.1 Identifying the Target Parameter

Chi Squared and Fisher's Exact Tests. Observed vs Expected Distributions

Tests for Two Proportions

Hypothesis testing - Steps

Critical Appraisal of Article on Therapy

The Method of Partial Fractions Math 121 Calculus II Spring 2015

Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means

Lecture 1: Introduction to Epidemiology

Review of Fundamental Mathematics

Lecture 25. December 19, Department of Biostatistics Johns Hopkins Bloomberg School of Public Health Johns Hopkins University.

6.4 Normal Distribution

SAMPLE SIZE TABLES FOR LOGISTIC REGRESSION

Name: (b) Find the minimum sample size you should use in order for your estimate to be within 0.03 of p when the confidence level is 95%.

Descriptive Statistics

Solving DEs by Separation of Variables.

BA 275 Review Problems - Week 5 (10/23/06-10/27/06) CD Lessons: 48, 49, 50, 51, 52 Textbook: pp

Sample Size Planning, Calculation, and Justification

Equations. #1-10 Solve for the variable. Inequalities. 1. Solve the inequality: Solve the inequality: 4 0

Statistics I for QBIC. Contents and Objectives. Chapters 1 7. Revised: August 2013

Review. March 21, S7.1 2_3 Estimating a Population Proportion. Chapter 7 Estimates and Sample Sizes. Test 2 (Chapters 4, 5, & 6) Results

Confidence Intervals for Spearman s Rank Correlation

Notes from February 11

Lecture Notes Module 1

C. The null hypothesis is not rejected when the alternative hypothesis is true. A. population parameters.

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

11. Analysis of Case-control Studies Logistic Regression

DynaMed. Any time, An y w here

Confounding in Epidemiology

Lean Six Sigma Analyze Phase Introduction. TECH QUALITY and PRODUCTIVITY in INDUSTRY and TECHNOLOGY

SIMPLE LINEAR CORRELATION. r can range from -1 to 1, and is independent of units of measurement. Correlation can be done on two dependent variables.

Two Related Samples t Test

X X X a) perfect linear correlation b) no correlation c) positive correlation (r = 1) (r = 0) (0 < r < 1)

ADVANCE: a factorial randomised trial of blood pressure lowering and intensive glucose control in 11,140 patients with type 2 diabetes

HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION

A Simple Method for Estimating Relative Risk using Logistic Regression. Fredi Alexander Diaz-Quijano

Simple Linear Regression Inference

Introduction. Survival Analysis. Censoring. Plan of Talk

BA 275 Review Problems - Week 6 (10/30/06-11/3/06) CD Lessons: 53, 54, 55, 56 Textbook: pp , ,

Chapter 7 Review. Confidence Intervals. MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Errata and updates for ASM Exam C/Exam 4 Manual (Sixteenth Edition) sorted by page

Multinomial and Ordinal Logistic Regression

Section 13, Part 1 ANOVA. Analysis Of Variance

3.4 Statistical inference for 2 populations based on two samples

Poisson Regression or Regression of Counts (& Rates)

Chapter 7 Notes - Inference for Single Samples. You know already for a large sample, you can invoke the CLT so:

Confidence Intervals for the Difference Between Two Means

4. Continuous Random Variables, the Pareto and Normal Distributions

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4)

Lecture 19: Conditional Logistic Regression

Section 1. Logarithms

9231 FURTHER MATHEMATICS

Simple Linear Regression

Nonlinear Regression:

5.1 Radical Notation and Rational Exponents

LOGISTIC REGRESSION ANALYSIS

STAT 2300: BUSINESS STATISTICS Section 002, Summer Semester 2009

Simple linear regression

Solvency II Pillar III Quantitative Reporting Templates (QRTs) Sinead Clarke, Eoin King 11 th December 2012

Having a coin come up heads or tails is a variable on a nominal scale. Heads is a different category from tails.

Partial Fractions Examples

CHAPTER FIVE. Solutions for Section 5.1. Skill Refresher. Exercises

Transcription:

Lesson 14 Confidence Intervals of Odds Ratio and Relative Risk

Lesson 14 Outline Lesson 14 covers Confidence Interval of an Odds Ratio Review of Odds Ratio Sampling distribution of OR on natural log scale 95% Confidence Interval of OR example Confidence Interval of Relative Risk Review of Relative Risk Sampling distribution of RR on natural log scale 95% Confidence Interval of RR example 2

Review: Odds Ratio and Relative Risk Both Odds Ratio and Relative Risk are measures of association (or relationship) between two nominal variables. The Odds Ratio is typically estimated from data collected in a Case-Control Control study. The Relative Risk can be estimated from data collected in a prospective study. 3

Associations between 2 nominal variable If there is no association between the two variables, the OR or RR = 1 An OR or RR > 1.0 or < 1.0 indicates a possible statistical ti ti relationship (or association) between the two variables. Hypothesis tests for the OR and RR are not used to determine statistical significance of the association Instead, Confidence intervals of OR or RR are constructed and used to determine whether or not the association is statistically significant. 4

Confidence Interval of Odds Ratio

Review: Odds Ratio Odds of an event = probability that the event occurs probability that the event does not occur Odds Ratio odds of event for group 1 odds of event for group 2 6

Review: Disease OR for Exposed compared dt to Not te Exposed Disease No Disease Total Exposed Group a b a+b Not Exposed Group c d c+d Odds of Disease for Exposed = a/(a+b) b/(a+b) = a/b Odds of Disease for Not Exposed = c/(c+d) d/(c+d) = c/d OR = a/b c/d = ad / bc 7

Confidence Interval for an Odds Ratio The confidence interval for an Odds Ratio has the same general formula as the Confidence Interval for a population p mean or population proportion Point Estimate Confidence Coefficient *Standard Error The difference is that the confidence interval for the Odds Ratio is calculated on the natural log (LN) scale and then converted back to the original i scale. 8

Sampling Distribution of the Odds Ratio The sampling distribution of the Odds Ratio is positively skewed. However, It is approximately normally distributed on the natural log scale. The confidence interval is calculated l on the natural log scale (LN) After finding the limits on the LN scale, use the EXP function to find the limits on the original scale. 9

95% Confidence Interval of OR: the steps 1. Calculate the Odds Ratio from the data 2. Find the Natural log of the OR using the LN function in Excel 3. The confidence coefficient is from the standard normal distribution: 1.96 for a 95% confidence interval 4. Calculate the SE of LN(OR) see next slide 5. The lower and upper limits on the log scale = LN(OR) ± 1.96* SE LN(OR) = (LL, UL) 6. Use the EXP function to find the CI limits on the original scale: EXP(LL), EXP(UL) 10

SE of LN(OR) The SE of LN(OR) is calculated from the cell counts in the 2 X 2 table: Cases Controls Exposed a b Not exposed c d SE of LN(OR) = 1 a 1 b 1 c 1 d 11

95% Confidence Interval of Odds Ratio First calculate the upper and lower limits on the LN scale 1 1 1 1 ln( OR) 1.96 * a b c d After calculating the upper and lower limits on the LN scale, use the exponential function (EXP) to find the lower and upper limits on the original scale If the 95% Confidence Interval does not contain the value 1.0, the association is statistically significant at alpha = 0.05 05 12

Example: Preterm delivery and Mother s SES level Study background: A case-control control study of the epidemiology i of preterm delivery was undertaken at Yale-New Haven Hospital in Connecticut during 1977. The table on the following slide contains data for Mother s socioeconomic status (SES) for those with (cases) and without (controls) preterm delivery. 13

Socio- Economic Status Preterm Delivery Cases Controls Upper 11 40 Upper Middle Middle Lower Middle Lower 14 33 59 53 45 64 91 58 Unknown 5 5 Construct a 95% Confidence Interval for the odds ratio for Preterm delivery comparing Lower to Upper SES 14

Data table for Confidence Interval of Odds Ratio of preterm delivery comparing lower to upper SES Preterm Delivery SES Yes: Cases No: Controls Lower 53 58 Upper 11 11 40 15

1. Calculate the OR for Preterm Delivery: Lower SES compared to Upper SES Preterm Delivery SES Yes: Cases No: Controls Lower 53 58 Upper 11 40 Odds Ratio = (53/58) / (11/40) = 3.32 16

2. Calculate the LN (OR) and 3. Find the confidence coefficient i 2. Calculate the LN(OR) LN(3.32) = 1.20 This is the point estimate t for the confidence interval 3. The confidence coefficient for the LN(OR) is from the standard normal distribution. For a 95% confidence interval, the coefficient = 1.96 17

4. Calculate the SE of LN(OR) SES Preterm Delivery Yes: Cases No: Controls Lower 53 58 Upper 11 11 40 SE ln(or) 1 53 1 58 1 11 1 40 0.39 18

5. & 6. Find the Lower and Upper limits on the LN scale and the original scale 1.20 1.96*0.39 (0.44,1.97) 5. The upper and lower limits of the confidence interval on the LN scale are (0.44, 1.97) 6. Use the EXP function in Excel to find the interval limits of the 95% CI of the OR: (1.55, 7.14) 19

Interpretation of the CI for Preterm Delivery and SES State the conclusion in the original scale The odds ratio for pre-term delivery for Lower SES compared to the Upper SES is 3.32 indicating increased odds of pre-term delivery for mothers in Lower SES The 95% Confidence Interval of the Odds Ratio (1.55, 7.14) indicates that odds of pre-term delivery are significantly ifi higher h for the Lower SES group compared to the Upper SES group (at 0.05 significance level) because the CI does not contain 1. 20

Interpretation etat of 95% Confidence Interval of OR An Odds Ratio = 1 indicates no association between the exposure and the disease. If the 95% confidence interval for the OR does not contain 1.0 we can conclude that there is a statistically significant* association between een the exposure and the disease. * at the 0.05 05 significance level If the 95% confidence interval for the OR contains 1.0, the association is not significant at the 0.05 level. 21

Confidence Interval of Relative Risk

Review: Relative Risk Risk of an event = probability that the event occurs Relative Risk probability that event occurs for group 1 probability that event occurs for group 2 23

Review: RR Calculated from a table Disease No Disease Total Exposed Group a b a+b Not Exposed Group c d c+d Risk of Disease for Exposed = a/(a+b) Risk of Disease for Not exposed = c/(c+d) RR = a/(a+b) c/(c+d) 24

Confidence Interval for Relative Risk Like the Odds Ratio, the sampling distribution of the Relative Risk is positively skewed but is approximately normally distributed on the natural log scale. Constructing a Confidence Interval for the Relative Risk is similar to constructing a CI for the Odds Ratio except that there is a different formula for the SE. 25

95% Confidence Interval of RR 1. Estimate the RR from the data 2. Find the natural log of RR: LN(RR) 3. The confidence coefficient is from the standard normal distribution: 1.96 for a 95% confidence interval 4. Calculate l the SE of LN(RR) see next slide 5. Calculate the lower and upper limits on the LN scale: LN(RR) ± 1.96 196* SE LN(RR) 6. Use the EXP function to find the limits on the original scale: EXP(LL), EXP (UL) 26

SE of LN(RR) The SE of LN(RR) is calculated from the cell counts in the 2X2t 2 table: Disease No Disease Total Exposed Group a b a+b Not Exposed c d c+d Not Exposed Group SE ln(rr) b a(a b) d c(c d) This formula is algebraically equivalent to the formula in text on pg. 201 27

95% Confidence Interval of Relative Risk First calculate the upper and lower limits on the LN scale l( ln( RR ) 1.96* b a( a b) d c( c d) After calculating the upper and lower limits on the LN scale, use the exponential function to find the limits on the original scale If the 95% Confidence Interval does not contain the value 1.0, the association is statistically significant at alpha = 0.05 05 28

Example: 95% CI for RR of Myocardial Infarction (MI) Background: Physicians enrolled in the Physician s Health Study were randomly assigned to take a daily aspirin or placebo. The table provides the number with MI in each group. Myocardial Infarction (MI) Group Yes No Aspirin 139 10898 11037 Placebo 239 10795 11034 378 21693 22071 Calculate the RR and 95% CI for the RR for MI (aspirin group compared to placebo group) 29

1. Calculate the RR for MI: aspirin group compared to placebo Myocardial Infarction (MI) Group Yes No Aspirin 139 10898 11037 Placebo 239 10795 11034 378 21693 22071 RR = (139/11037) / (239/11034) = 0.581 30

2. Calculate the LN (RR) and 3. Find the confidence coefficient i 2. Calculate the LN(RR) LN(0.581) = -0.543 This is the point estimate t for the confidence interval 3. The confidence coefficient for the LN(RR) is from the standard normal distribution. For a 95% confidence interval, the coefficient = 1.96 31

4. Calculate the SE LN(RR) Myocardial Infarction (MI) Group Yes No Aspirin 139 10898 11037 Placebo 239 10795 11034 378 21693 22071 SE ln(rr) 10898 139*11037 10795 239*11034 0.106 32

5. & 6. Find the Lower and Upper limits on the LN scale and the original scale 0.543 1.96*0.106 ( 0.75, 0.336) 5. The upper and lower limits of the confidence interval on the LN scale are (-0.750, -0.336) 6. Use the EXP function in Excel to find the interval limits of the 95% CI of the RR: (0.472, 0.715) 33

Interpretation of 95% CI of RR for MI The Relative Risk estimate = 0.58 which indicates that physicians in the aspirin group had a lower risk of MI than physicians in the placebo group. The 95% confidence interval indicates that t the decreased risk related to daily aspirin use is significant (at alpha = 0.05) 05) since the interval (0.472, 0.715) does not contain 1. 34

General e Interpretation etat of 95% Confidence Interval of RR A Relative Risk = 1.0 indicates no association between the exposure and the disease. If the 95% confidence interval for the RR does not contain 1.0 we can conclude that there is a statistically significant* association between een the exposure and the disease. * at the 0.05 05 significance level If the 95% confidence interval for the RR contains 1.0, the association is not significant at the 0.05 level. 35

NotesontheORandRRCI: on the RR The RR estimate will not be exactly in the middle of the confidence interval on the original scale The LN(RR) is exactly in the middle of the interval on the LN scale The OR estimate will not be exactly in the middle of the confidence interval on the original scale The LN(OR) is exactly in the middle of the interval on the LN scale Check that t the RR or OR estimate t is contained in the final interval it s easy to forget the EXP step 36

Readings and Assignments Reading: Chapter 8 pgs 200-201 201 Work through the Lesson 14 practice exercises Excel Module 14 works through the examples in this lesson for the confidence intervals of OR and RR Excel functions: LN and EXP There is no CI function for the RR and OR you just work through the steps Complete Homework 10 37