SENSITIVITY ANALYSIS AND INFERENCE. Lecture 12



Similar documents
Cost-Benefit and Cost-Effectiveness Analysis. Kevin Frick, PhD Johns Hopkins University

Week 4: Standard Error and Confidence Intervals

Session 7 Bivariate Data and Analysis

II. DISTRIBUTIONS distribution normal distribution. standard scores

Simple Regression Theory II 2010 Samuel L. Baker

4. Continuous Random Variables, the Pareto and Normal Distributions

CALCULATIONS & STATISTICS

Chapter 3 RANDOM VARIATE GENERATION

Statistical handling of data in economic analysis

" Y. Notation and Equations for Regression Lecture 11/4. Notation:

Sample Size and Power in Clinical Trials

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression

Lecture 2: Descriptive Statistics and Exploratory Data Analysis

Covariance and Correlation

Descriptive Statistics

99.37, 99.38, 99.38, 99.39, 99.39, 99.39, 99.39, 99.40, 99.41, cm

Zeros of a Polynomial Function

Monte Carlo analysis used for Contingency estimating.

Simulation Exercises to Reinforce the Foundations of Statistical Thinking in Online Classes

An introduction to Value-at-Risk Learning Curve September 2003

Standard Deviation Estimator

Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model

Study Guide for the Final Exam

Common sense, and the model that we have used, suggest that an increase in p means a decrease in demand, but this is not the only possibility.

The Effects of Start Prices on the Performance of the Certainty Equivalent Pricing Policy

6.4 Normal Distribution

5.1 Identifying the Target Parameter

Risk Analysis and Quantification

Chapter 4. Probability and Probability Distributions

Fairfield Public Schools

A Primer on Mathematical Statistics and Univariate Distributions; The Normal Distribution; The GLM with the Normal Distribution

Why Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization. Learning Goals. GENOME 560, Spring 2012

Introduction to nonparametric regression: Least squares vs. Nearest neighbors

Quantitative Methods for Finance

Current Standard: Mathematical Concepts and Applications Shape, Space, and Measurement- Primary

Algebra 2 Chapter 1 Vocabulary. identity - A statement that equates two equivalent expressions.

Algebra Academic Content Standards Grade Eight and Grade Nine Ohio. Grade Eight. Number, Number Sense and Operations Standard

HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION

Measurement with Ratios

Problem of the Month: Fair Games

MEASURES OF VARIATION

GRADES 7, 8, AND 9 BIG IDEAS

Association Between Variables

DESCRIPTIVE STATISTICS. The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses.

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4)

BNG 202 Biomechanics Lab. Descriptive statistics and probability distributions I

Module 3: Correlation and Covariance

The Fourth International DERIVE-TI92/89 Conference Liverpool, U.K., July Derive 5: The Easiest... Just Got Better!

Algebra 1 Course Information

Survey Research: Choice of Instrument, Sample. Lynda Burton, ScD Johns Hopkins University

Algebra 1 Course Title

Graphical Integration Exercises Part Four: Reverse Graphical Integration

Descriptive statistics Statistical inference statistical inference, statistical induction and inferential statistics

Year 9 set 1 Mathematics notes, to accompany the 9H book.

PUBLIC HEALTH OPTOMETRY ECONOMICS. Kevin D. Frick, PhD

Math 1. Month Essential Questions Concepts/Skills/Standards Content Assessment Areas of Interaction

Two-sample inference: Continuous data

7.6 Approximation Errors and Simpson's Rule

Lecture Notes Module 1

CORRELATED TO THE SOUTH CAROLINA COLLEGE AND CAREER-READY FOUNDATIONS IN ALGEBRA

This unit will lay the groundwork for later units where the students will extend this knowledge to quadratic and exponential functions.

Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means

The aerodynamic center

Definition 8.1 Two inequalities are equivalent if they have the same solution set. Add or Subtract the same value on both sides of the inequality.

Statistics Graduate Courses

Introduction to Quantitative Methods

SIMPLE LINEAR CORRELATION. r can range from -1 to 1, and is independent of units of measurement. Correlation can be done on two dependent variables.

In mathematics, there are four attainment targets: using and applying mathematics; number and algebra; shape, space and measures, and handling data.

POLYNOMIAL FUNCTIONS

Multivariate Normal Distribution

MATH BOOK OF PROBLEMS SERIES. New from Pearson Custom Publishing!

Physics Lab Report Guidelines

Biostatistics: DESCRIPTIVE STATISTICS: 2, VARIABILITY

AMS 5 CHANCE VARIABILITY

AP Physics 1 and 2 Lab Investigations

Chapter 11 Number Theory

From the help desk: Bootstrapped standard errors

SIMULATION STUDIES IN STATISTICS WHAT IS A SIMULATION STUDY, AND WHY DO ONE? What is a (Monte Carlo) simulation study, and why do one?

Derive 5: The Easiest... Just Got Better!

How To Check For Differences In The One Way Anova

Chapter 10. Key Ideas Correlation, Correlation Coefficient (r),

Common Core Unit Summary Grades 6 to 8

Means, standard deviations and. and standard errors

Statistical Rules of Thumb

Notes on Orthogonal and Symmetric Matrices MENU, Winter 2013

The correlation coefficient

QUANTITATIVE METHODS BIOLOGY FINAL HONOUR SCHOOL NON-PARAMETRIC TESTS

Simple Random Sampling

Descriptive Statistics and Measurement Scales

The Assumption(s) of Normality

Economics and Economic Evaluation

HYPOTHESIS TESTING WITH SPSS:

Introduction to Path Analysis

Transcription:

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this site. Copyright 2007, The Johns Hopkins University and Kevin Frick. All rights reserved. Use of these materials permitted only in accordance with license rights granted. Materials provided AS IS ; no representations or warranties provided. User assumes all responsibility for use, and all liability related thereto, and must independently review all materials for accuracy and efficacy. May contain materials owned by others. User is responsible for obtaining permissions for use from third parties as needed.

SENSITIVITY ANALYSIS AND INFERENCE Lecture 12

Points of Discussion 1. Univariate Sensitivity Analyses 2. Tornado Diagrams (many univariate) 3. Bivariate/multivariate sensitivity analyses 4. How to think about confidence intervals 5. How to calculate/interpret confidence intervals with primary data (bootstrapping) 6. How to calculate/interpret confidence intervals with simulations (probabilistic SA) 7. How to combine 5 and 6 8. Cost-effectiveness acceptability curves 9. How to transform back to something that looks more like a net benefit measure

Sensitivity analysis Simple definition Vary a critical component of the calculation by a meaningful amount Objective To determine whether results are insensitive to substantial but plausible variation in a parameter To judge the robustness of conclusions To indicate where more detailed studies to obtain better data to pin down the ICER are needed

What are critical components? Prices and quantities Sensitivity, specificity, and prevalence Functional relationships Are risks always the same? Multiple exposures Different personal characteristics Health related quality of life measures Discount rate Measures of costs and effectiveness may be differentially sensitive to changes

Univariate sensitivity analysis Vary one parameter at a time over a range Best estimate, low value, high value 95% confidence interval Extreme values in plausible range What to do if it is difficult to find a value Perform threshold analyses Assess (usually reasoning-based) whether value is likely to be above or below threshold of interest Can suggest a need for further research to determine range of parameter

Threshold analysis Solve equation for what value of a given parameter would make the incremental cost effectiveness ratio equal to a chosen value $50,000/QALY $100,000/QALY The same ICER as was found for another intervention in a previous analysis Can be relatively simple algebra in some cases May be more intricate if looking at one parameter in a complex decision tree Search over multiple values

Threshold Sensitivity Analysis Reproduced with permission from Frick KD, et al. Modeled cost-effectiveness of the experience corps Baltimore based on a pilot randomized trial. Journal of Urban Health 2004;81:106-117. All rights reserved.

Pros and cons of univariate sensitivity analyses Pros Can indicate whether anything alters the conclusion about the ICER s relationship with the acceptable range Cons In the real world more than one thing varies at a time No sense of the likelihood of being at different places in the range Ignoring correlation between variation in multiple parameters can overstate uncertainty

What can we do with univariate sensitivity analyses? Single univariate sensitivity analysis Not much Multiple univariate sensitivity analyses Tornado diagram Provides a graphical way to show the varying effects of different parameters

Tornado Diagram Methods 1. Calculate the point estimate of the ICER 2. Calculate the most favorable ICERs (lowest $/QALY) when varying each parameter of interest 3. Calculate the least favorable ICERs (highest $/QALY) when varying each parameter of interest 4. Calculate the range for each parameter 5. Order parameters from widest range to most narrow range 6. Draw graph centered at point estimate

Tornado Diagram Example Prevalece Sensitivity Specificity Cost of Diagnosis 0 50 100 150 200 $1,000s/QALY

Reporting on a Tornado Diagram Give ranges for parameters Give ranges for ICERs Tell which parameters were tested Sometimes do not show all on tornado diagram Interpret the variable for which the uncertainty about the value has the biggest effect on the ICER Suggests need for more information

Why might we need more than univariate sensitivity analysis? The ICER depends on multiple parameters Ratio of two numbers Numbers are functions of other parameters Parameters may be be correlated Interaction of certain factors may imply that total effect is larger or smaller than the sum of the parts

What to do with multiple sources of uncertainty? Bivariate or more generally multivariate sensitivity analysis How to choose which variables and how to choose how to have them vary together? Combination of extreme values Only useful if demonstrates insensitivity to changes Unlikely that all extreme parameters will occur together Look for set of values that provides best and worst case All most of least favorable rather than all highest or lowest Simulation Return to this one later

Confidence intervals for ICERs Confidence interval of ratio is not simple because Even if absolute and incremental are normally distributed Even if absolute and incremental effectiveness are normally distributed The ratio of two normally distributed variables is not normally distributed

Complication #1 Zero Denominator If the 95% effectiveness confidence interval includes zero, cannot calculate a 95% confidence interval for the ICER ICER goes to infinity as effect goes to zero If the CI for the effect includes zero, the CI for the ICER is discontinuous Sign of ICER changes when cross zero effectiveness Both negative and positive infinity

Early Efforts at CIs Confidence Box Concept Calculate separate confidence intervals for the numerator and denominator of the incremental cost-effectiveness ratio Draw the box formed by the minimum and maximum values in the confidence intervals of each measure Use the ratios implied by the northwest and southeast corners

Confidence Box - Graphically $ Works best if both the NE and SW corners of the box are in the first quadrant 95% CI Effect

Confidence Box - Implications Assumes costs and effects are independent Each confidence interval is calculated with no consideration of the confidence interval for the other measure Potential uncertainty about Zero effectiveness in the denominator Dominance

Confidence Box What is the Real Interval? Recall that CIs reflect Type I error Proportion of times that the true parameter lies outside the estimated interval in repeated sampling Chance that the true parameter is within the interval is (1-α) Based on probability, the chance of two events simultaneously is (1- α) 2 The confidence interval formed by the confidence box is approximately a 90% confidence interval (0.95*0.95) So, what (if anything) do we know about the wedge formed by the rays from the origin through the corners?

More Confidence Box Problems - Graphically $ a A a b B b A & B are areas with simultaneous extreme values but that is not all that is left out of confidence interval Effect From Briggs & Fenn. Health Economics. 7:723.

Confidence Ellipse - Concept Consider the fact that the costs and effects may not be independent More effective treatment may cost more Individuals who live longer have an opportunity to accrue greater costs over time We can use an ellipse to capture the confidence interval rather than a box

Confidence Ellipse - Implication If costs and effectiveness are positively correlated the CI will be smaller In contrast, the CI will be wider if costs and effectiveness are negatively correlated Thus, correlation can substantially influence the interpretation of the results

Confidence Ellipse Independent Costs and $ Effects Note that the circle contains a greater proportion of the probability density function than the square, but that the relationship between the wedges is ambiguous Effect

Confidence Ellipse - Positively Correlated Costs and Effects $ Effect

Confidence Ellipse Negatively Correlated Costs $ and Effects Effect

Confidence Ellipse Comparison with Confidence Boxes To calculate ellipse use the joint probability density Relationship between ellipses and boxes The ellipse contains more of the probability density function than the box But, the wedges formed by the rays tangential to the box and ellipse contains more of the probability density than either shape alone

Confidence Interval Objective Find a way to determine the wedge that contains 95% of the probability density function Issue remains that the confidence interval may still include zero effectiveness Alternatives Mathematical expression Bootstrapping Cost effectiveness acceptability curves

Fieller - Concept Distribution of ratio Assume that the costs and effects follow a joint normal distribution R 2 var R E C ( E) + var( C) 2R cov( E, C) ( 0,1) Calculate exact CI such that the wedge in a graph on the CE plane will be exactly 95% Results are conditional on correctness of parametric assumptions ~ N

Fieller - Graphical $ Box Fieller Box Fieller Relationship between Fieller wedge and box wedge is ambiguous. Primarily know that Fieller wedge is exact CI. Effect

Fieller -Mathematically Find roots of quadratic equation R 2 [ ( )] 2 2 2 E z var 2 cov(, ) α / 2 E R E C zα / 2 E C [ var( )] 2 2 + C z C = 0 α / 2 [ ] Roots = [-b ± sqrt(b 2 4ac)] / 2a

Bootstrapping - Concept Empirical sample distribution Multiple draws from a data set Calculate statistics based on each draw Draw conclusions based on those results

Bootstrapping Basic Method Draw sample of same size as original sample with replacement from data Obtain confidence limits by taking observations at 2.5 percentile and 97.5 percentile Distinct from probabilistic sensitivity analysis because drawing observations from data set rather than random points in a distribution

Bootstrapping-type Example Reproduced with permission from Frick KD, et al. Modeled cost-effectiveness of the experience corps Baltimore based on a pilot randomized trial. Journal of Urban Health 2004;81:106-117. All rights reserved.

Bootstrapping Interpretation Note that cannot necessarily interpret CI directly May be better to ask questions like: What proportion of draws lead to ICERs below $50,000 per QALY? What proportion of draws lead to ICERs below $100,000 per QALY What proportion of draws indicate that the intervention is dominated?

Probabilistic sensitivity analyses through simulation Perhaps sensitivity analyses are not useful Not based on the likelihood of values occurring Probabilistic sensitivity analyses Use information on priors about each parameter A somewhat Bayesian approach How likely is it that the ICER is in an acceptable range Can indicate usefulness of additional

What does a probabilistic sensitivity analysis really do? Define a distribution of a parameter Draw a random value from the distribution Calculate the ICER (usually just two alternatives) with the value drawn Repeat the process a number of times Report the mean & standard deviation of the distribution, or percentiles of the distribution, or what proportion of the results are below a threshold value for the ICER

Cost-Effectiveness Acceptability Curves Crossover between CEA and CBA CEAC s can be used for two purposes Summarize (for every possible value of health effect) the evidence in favor of the intervention leading to a positive net benefit Construct a confidence interval

CEAC - Approaches Parametric Use joint normality assumption from Fieller s theorem and looks at the proportion of the joint density that is costeffective Non-parametric Use bootstrapping or probabilistic sensitivity analysis and determine what proportion of ICER estimates lie below any value of health effect

CEAC Graphical Example (1) + + + + + + + + + + Q + + + + + + + $

CEAC Graphical Example (2) Probability that net benefit is greater than zero 0.95 0.05 Dollar value of a QALY

CEAC Graphical Example (2) Probability that net benefit is greater than zero 0.95 0.05 Dollar value of a QALY

Lack of upper end of CEAC Frick et al. Journal of Urban Health example from several slides back is perfect example The distribution is so unfavorable for the intervention in question that there is never a 97.5% chance that the intervention has a positive net benefit Alternatively, may have so many cost saving alternatives that never have only 2.5% with a positive net benefit

Compare & Contrast Techniques Generally If parametric and non-parametric results are similar, use parametric results Replicability Generally more powerful The acceptability curve tends toward a Bayesian approach, i.e. the probability that the intervention is cost-effective Some authors have compared theoretical implications of techniques Some authors have compared empirical results with techniques

Back to Net Benefits While some had problems with CBA because of the need to place a dollar value on life, it is easier to work with confidence intervals NMB = λ E C λ = dollar value of a QALY var(nmb) = λ 2 var( E) + var( C) 2λ cov( E, C)

Briggs AH, et al. Thinking outside the box: recent advances in the analysis and presentation of uncertainty in costeffectiveness studies. Annu Rev Public Health. 2002;23:377-401.. Reprinted, with permission, from the Annual Review of Public Health, Volume 23 2002 by Annual Reviews www.annualreviews.org

Polsky et al. s Monte Carlo Experiment Repeated samples from data Box method using a different assumption about how wide to make the individual cost/effect confidence intervals Taylor series method An alternative mathematical approximation Fieller Non-parametric bootstrap

Polsky et al. s objectives Determine which method came closest to the target miscoverage level of 5% Determine whether or not the miscoverage was symmetric 2.5% of time entire CI is greater than true value and 2.5% of time entire CI is less than true value

Polsky et al. Results Fieller was most consistently near 5% Bootstrap was also fairly consistent Box and Taylor methods were less consistent as the correlation between costs and effectiveness increased in the positive range Taylor series method had asymmetric missed coverage