Critical Appraisal of Article on Therapy



Similar documents
Types of Studies. Systematic Reviews and Meta-Analyses

Systematic Reviews and Meta-analyses

Critical appraisal of systematic reviews

Introduction to study design

Clinical Study Design and Methods Terminology

Case-control studies. Alfredo Morabia

What is critical appraisal?

Komorbide brystkræftpatienter kan de tåle behandling? Et registerstudie baseret på Danish Breast Cancer Cooperative Group

Guide to Biostatistics

Study Designs. Simon Day, PhD Johns Hopkins University

Interpreting Diagnostic Tests (tutorial by Thomas Tape at U of Nebraska)

Rapid Critical Appraisal of Controlled Trials

Glossary of Methodologic Terms

Biostat Methods STAT 5820/6910 Handout #6: Intro. to Clinical Trials (Matthews text)

Diagnostic Studies Dr. Annette Plüddemann

Critical Appraisal of a Research Paper

Chapter 7 Sampling (Reminder: Don t forget to utilize the concept maps and study questions as you study this and the other chapters.

Paper PO06. Randomization in Clinical Trial Studies

Bayes Theorem & Diagnostic Tests Screening Tests

Observing and describing the behavior of a subject without influencing it in any way.

Principles of Systematic Review: Focus on Alcoholism Treatment

Inclusion and Exclusion Criteria

ASKING CLINICAL QUESTIONS

If several different trials are mentioned in one publication, the data of each should be extracted in a separate data extraction form.

Secondary Uses of Data for Comparative Effectiveness Research

Normal Range. Reference Values. Types of Reference Ranges. Reference Range Study. Reference Values

CHAPTER 4 QUALITY ASSURANCE AND TEST VALIDATION

Measures of diagnostic accuracy: basic definitions

THE INTERNET STROKE CENTER PRESENTATIONS AND DISCUSSIONS ON STROKE MANAGEMENT

A guide for the patient

Cancer in Primary Care: Prostate Cancer Screening. How and How often? Should we and in which patients?

Guidance for Industry Diabetes Mellitus Evaluating Cardiovascular Risk in New Antidiabetic Therapies to Treat Type 2 Diabetes

MODULE 1.3 WHAT IS MENTAL HEALTH?

Oncology Nursing Society Annual Progress Report: 2008 Formula Grant

Critical appraisal of a journal article

GENERAL INFORMATION. Adverse Event (AE) Definition (ICH GUIDELINES E6 FOR GCP 1.2):

Why Sample? Why not study everyone? Debate about Census vs. sampling

How To Understand The Evidence From A Clinical Trial

Consultation Response Medical profiling and online medicine: the ethics of 'personalised' healthcare in a consumer age Nuffield Council on Bioethics

Incidence and Prevalence

How to Prepare a Research Protocol for WHO? Shyam Thapa, PhD Scientist

Simple Regression Theory II 2010 Samuel L. Baker

Assessing study quality: critical appraisal. MRC/CSO Social and Public Health Sciences Unit

Methods for Meta-analysis in Medical Research

Chapter 8: Quantitative Sampling

This clinical study synopsis is provided in line with Boehringer Ingelheim s Policy on Transparency and Publication of Clinical Study Data.

Data Extraction and Quality Assessment

The largest clinical study of Bayer's Xarelto (rivaroxaban) Wednesday, 14 November :38

Journal Club: Niacin in Patients with Low HDL Cholesterol Levels Receiving Intensive Statin Therapy by the AIM-HIGH Investigators

CALCULATIONS & STATISTICS

Current reporting in published research

IPDET Module 6: Descriptive, Normative, and Impact Evaluation Designs

Evidence-based Medicine: Answering Questions of Diagnosis

APPRAISAL POLICY 1. BACKGROUND

Course Notes Frequency and Effect Measures

Can I have FAITH in this Review?

DETECTION AND NONOPERATIVE MANAGEMENT OF PEDIATRIC DEVELOPMENTAL DYSPLASIA OF THE HIP IN INFANTS UP TO SIX MONTHS OF AGE SUMMARY

Chapter 4. Probability and Probability Distributions

Lesson Outline Outline

What are confidence intervals and p-values?

Hindrik Vondeling Department of Health Economics University of Southern Denmark Odense, Denmark

AVOIDING BIAS AND RANDOM ERROR IN DATA ANALYSIS

Non-random/non-probability sampling designs in quantitative research

To achieve this aim the specific objectives of this PhD will include:

What is a P-value? Ronald A. Thisted, PhD Departments of Statistics and Health Studies The University of Chicago

Service delivery interventions

Telemedicine Resuscitation & Arrest Trials (TreAT)

Challenges in Longitudinal Data Analysis: Baseline Adjustment, Missing Data, and Drop-out

Web appendix: Supplementary material. Appendix 1 (on-line): Medline search strategy

Behavioral and Physical Treatments for Tension-type and Cervicogenic Headache

Cohort Studies. Sukon Kanchanaraksa, PhD Johns Hopkins University

Sample Size and Power in Clinical Trials

Randomization in Clinical Trials

Preservation and Incorporation of Valuable Endoscopic Innovations (PIVI)

Renovascular Hypertension

CHILDHOOD CANCER SURVIVOR STUDY Analysis Concept Proposal

X X X a) perfect linear correlation b) no correlation c) positive correlation (r = 1) (r = 0) (0 < r < 1)

Research Methods & Experimental Design

Mary B Codd. MD, MPH, PhD, FFPHMI UCD School of Public Health, Physiotherapy & Pop. Sciences

This series of articles is designed to

11. Analysis of Case-control Studies Logistic Regression

Article Four Different Types of Evidence / Literature Reviews

MRC Autism Research Forum Interventions in Autism

GUIDELINES FOR REVIEWING QUANTITATIVE DESCRIPTIVE STUDIES

1.0 Abstract. Title: Real Life Evaluation of Rheumatoid Arthritis in Canadians taking HUMIRA. Keywords. Rationale and Background:

Alcohol-use disorders overview

SIGN CRITICAL APPRAISAL COURSE: EXERCISE 2

The Mozart effect Methods of Scientific Research

PEER REVIEW HISTORY ARTICLE DETAILS VERSION 1 - REVIEW. Avinesh Pillai Department of Statistics University of Auckland New Zealand 16-Jul-2015

Critical appraisal. Gary Collins. EQUATOR Network, Centre for Statistics in Medicine NDORMS, University of Oxford

Stuart B Black MD, FAAN Chief of Neurology Co-Medical Director: Neuroscience Center Baylor University Medical Center at Dallas

Treatment of Low Risk MDS. Overview. Myelodysplastic Syndromes (MDS)

A Guide To Producing an Evidence-based Report

Transcription:

Critical Appraisal of Article on Therapy What question did the study ask? Guide Are the results Valid 1. Was the assignment of patients to treatments randomized? And was the randomization list concealed? 2. Was follow-up of patients sufficiently long and complete? 3. Were patients analyzed in the groups which they were randomized? 4. Were patients and clinicians kept blind to treatment? 5. Were the groups treated equally, apart from the experimental treatment? Comment 6. Were the groups similar at the start of the trial? Are the valid results of this randomized study important? 1. What is the magnitude of the treatment effect? 2. How precise is this estimate of the treatment effect? Are these valid, important results applicable to our patient? 1. Is our patient so different from those in the study that its results cannot apply? 2. Is the treatment feasible in our setting? 3. What are our patient s potential benefits and harms form the therapy? 4. What are our patient s values and expectations for both the outcome we are trying to prevent and the treatment we are offering?

What question did the study ask? How to Review an Article on Therapy Guide Comment Are the results Valid 1. Was the assignment of patients to treatments randomized? 2. And was the randomization list concealed? 2. Was follow-up of patients sufficiently long and complete? 3. Were patients analyzed in the groups which they were randomized? 4. Were patients and clinicians kept blind to treatment? 5. Were the groups treated equally, apart from the experimental treatment? Randomization is important in balancing prognostic factors (both known & unknown) between treatment & control groups. This methodology is the gold standard for clinical trials. It is important that physicians who enter patients into a Randomized Clinical Trials (RCT) cannot influence which group their patient enters (control or experimental). The randomization should be concealed in some fashion sealed envelopes or calling to remote site for assignment are two examples. Was the follow-up of patients sufficiently long to see a clinically important effect? Example several weeks is fine for streptococcal pharyngitis, while years may be appropriate for chronic diseases, ie cancer, cardiac disease Was there an acceptable loss of follow-up of patients? Greater than 20% is usually considered unacceptable. To preserve the value of randomization intention to treat analysis should be performed. The subject is analyzed in the group which they were randomized. Blinding both the clinician and patient to the treatment (or lack thereof) is ideal. Sometimes patients and clinicians can t be blinded, such as in surgical trials. The most important blinding is that of the assessment of the outcomes of the study. The experimental and control group should be treated equally apart from the experimental treatment. They should have the same testing, the same number of follow-up visits, the same educational interventions other than experimental treatment.

6. Were the groups similar at the start of the trial? While randomization should make the groups similar, they may not be exactly equal. Groups should be similar in all prognostically important ways. If they are not similar, there should be some adjustment for potentially important prognostic factors carried out in the analysis phase. This can include stratification or multiple regression analysis. Are the valid results of this randomized study important? 1. What is the magnitude of the treatment effect? 2. How precise is this estimate of the treatment effect? CER control event rate EER experimental event rate Relative risk reduction (RRR) = (CER EER)/CER Absolute risk reduction (ARR) = CER EER Number Needed to Treat (NNT) = 1/ARR *Note there are formulas to calculate confidence intervals for each of the above measures. They are not included as you are not expected to be able to calculate them. Look for 95% confidence intervals. Confidence intervals are the measure of precision The wider the confidence intervals, the less precise the measurement. This is relative. Are these valid, important results applicable to our patient? 1. Is our patient so different from those in the study that its results cannot apply? Our patient does not have to fit all the inclusion criteria of this study. Consider whether our patient s sociodemographic features or pathobiology are so different from those in the study that its results are useless to us and our patient. 2. Is the treatment feasible in our setting? Is the treatment economically feasible and available in our geographic region? 3. What are our patient s potential benefits and harms form the therapy? There is always constant weighing of the treatment s potential benefits and harms. 4. What are our patient s values and expectations for both the outcome we are trying to prevent and the treatment we are offering? We must elicit our patient s preferences for both the outcome we are trying to prevent and the treatment we are offering.

HOW TO REVIEW AN ARTICLE ON DIAGNOSIS What question did the study ask? Guides Comments Are the results of this diagnostic study valid? 1. Was there an independent blind comparison with a reference (gold) standard of diagnosis? 2. Was the diagnostic test evaluated in an appropriate spectrum of patients (like those in whom it would be used in practice)? 3. Was the reference standard applied regardless of the diagnostic test result? 4. Was the test (or cluster of tests) validated in a second, independent group of patients? Are the valid results of this diagnostic study important? 2x2 Table Target Disorder Total Diagnostic Present Absent Test Positive a b a + b Result Negative c d c + d Total a + c b + d a + b + c+ d 1. Sensitivity = a/(a+c) 2. Specificity = d/(bsensitivity = a/(a+c) 3. Specificity = d/(b+d)

4. Likelihood Ratio for a positive test result = LR+=sens/(1-spec) 5. Likelihood Ratio for a negative test result=lr-=(1-sens)/spec 6. Positive Predictive Value = a/(a+b) 7. Negative Predictive Value = d/(c+d) 8. Pre-test Probability (prevalence) = (a+c)/(a+b+c+d) 9. Pre-test-odds = prevalence/(1-prevalence) 10. Post-test odds = Pre-test odds x Likelihood Ratio 11. Post-test Probability = Post-test odds/(post-test odds + 1) +d) 12. Likelihood Ratio for a positive test result = LR+=sens/(1-spec) 13. Likelihood Ratio for a negative test result=lr-=(1-sens)/spec 14. Positive Predictive Value = a/(a+b) 15. Negative Predictive Value = d/(c+d) 16. Pre-test Probability (prevalence) = (a+c)/(a+b+c+d) 17. Pre-test-odds = prevalence/(1-prevalence) 18. Post-test odds = Pre-test odds x Likelihood Ratio 19. Post-test Probability = Post-test odds/(post-test odds + 1) Can we apply this valid, important evidence about a diagnostic test in caring for our patient? 1. Is the diagnostic test available, affordable, accurate, and precise in our setting? 2. Can we generate a clinically sensible estimate of our patient s pre-test probability (from personal experience, prevalence statistics, or primary studies)? Are the study patients similar to our own? Is it unlikely that the disease possibilities or probabilities have changed since this evidence was gathered? 3. Will the resulting post-test probabilities affect our management and help our patient? Could it move us across a test- treatment threshold? Would our patient be a willing partner in carrying it out?

HOW TO REVIEW AN ARTICLE ON DIAGNOSIS What question did the study ask? Guides Are the results of this diagnostic study valid? Comments 1. Was there an independent blind comparison with a reference (gold) standard of diagnosis? 2. Was the diagnostic test evaluated in an appropriate spectrum of patients (like those in whom it would be used in practice)? 3. Was the reference standard applied regardless of the diagnostic test result? The gold standard is used to define the true state of the patient The gold standard must be reasonable and doable, but does not have to be the perfect test. Example coronary angiography is considered the gold standard for the identification of coronary artery disease. One could argue that autopsy is the gold standard, but difficult to perform in live patients You want to see that patients with a wide spectrum of the target diagnosis (mild, moderate & severe) and patients with commonly confused diagnoses (e.g., diagnostic value of rales include patients with congestive heart failure, chronic lung disease, pneumonia) are included. Spectrum bias refers to the bias introduced by having too narrow a spectrum of patients (severity or other commonly confused diagnoses) Every patient in the study should get both the reference standard and the diagnostic test being studied. Beware of studies that only compare the reference standard and diagnostic test in patients who are positive for a screening test (can be either reference standard or

diagnostic test. For example, a study examining the value of d-dimer in the diagnosis of deep vein thrombosis, only patients who had a positive venous ultrasound (reference standard) got a d- dimer. 4. Was the test (or cluster of tests) validated in a second, independent group of patients? Ideally (especially for cluster of tests or clinical prediction rules), the diagnostic test should be validated in a second independent sample. This does not often occur. Are the valid results of this diagnostic study important? See worksheet on test characteristics for formulas. In addition for likelihood ratios SpPin LR+ > 10 SnNout LR- < 0.1 Can we apply this valid, important evidence about a diagnostic test in caring for our patient? In order to be useful, a test needs to be affordable and available. It doesn t do 1. Is the diagnostic test available, affordable, your patient any good if they cannot afford accurate, and precise in our setting? the test or have to go to the North Pole to get it. 2. Can we generate a clinically sensible estimate of our patient s pre-test probability (from personal experience, prevalence statistics, or primary studies)? Are the study patients similar to our own? Is it unlikely that the disease possibilities or probabilities have changed since this evidence was gathered? 3. Will the resulting post-test probabilities affect our management and help our patient? Could it move us across a test- treatment threshold? Would our patient be a willing partner in carrying it out?

What question did the study ask? HOW TO REVIEW AN SYSTEMATIC REVIEW Guide Comments Deciding whether the results are valid 1. Is this a systematic review of randomized trials? 2. Does it include a methods section that describes: (a) Finding and including all the relevant trials? (b) Assessing their individual validity 3. Were the results consistent from study to study? 4. Were individual patient data used in the analysis or aggregate data? (may important in meta-analysis) Are the valid results of the systematic review important? 1. What is the magnitude of the treatment effect? 2. How precise is the treatment effect? Will the results help me in caring for my patients? 1. Is our patient so different from those in the study that its results cannot apply? 2. Is the treatment feasible in our setting? 3. What are our patient s potential benefits and harms form the therapy? 4. What are our patient s values and expectations for both the outcome we are trying to prevent and the treatment we are offering?

What question did the study ask? HOW TO REVIEW A SYSTEMATIC REVIEW Guide Comments Deciding whether the results are valid 1. Is this a systematic review of randomized trials? 2. Does it include a methods section that describes: (a) finding and including all the relevant trials? (b) assessing their individual validity? Systematic reviews (SRs) may go by other names including overview and meta-analysis. A meta-analysis is really a subset of SR where the results of many studies are statistically combined. A SRs is like any other research it has a focused question. Its data are the results of other studies instead of generating new data. While most SRs will only include randomized trials, some will include high quality studies, such as cohort studies. If the SR includes studies other than randomized trials, they should be analyzed separate. In other words a meta-analysis should not combine data from both randomized and non-randomized studies. A good SR will examine many sources for evidence including standard bibliographic databases, hand-searching journals, conference proceedings, theses, databanks of pharmaceutical firms and contacting experts or authors of published articles. Really good SRs will also search the European literature and/or foreign language literature. Each study that meets the inclusion criteria (or doesn t fail exclusion criteria) should be independently assessed by at least two reviewers. If there is a disagreement between reviewers, there should be some mechanism to resolve the differences. There are published criteria for reviewing and numerically rating studies. 3. Were the results consistent from study to study? While different studies may show differing quantitative effects, the studies should be consistent (homogeneous) with respect to overall benefit or harm. Beware of the SR that has some studies that show benefit, some no benefit and some clear-cut harm. These results would be very heterogeneous.

If the results are heterogeneous, the author should explain why such as differing study populations (by race, gender, and age), doses of medication, or duration of therapy. 4. Were individual patient data used in the analysis or aggregate data? (may important in meta-analysis) Combining individual patient data is preferable. It allows you to examine subgroups. Combining summary data does not offer the same flexibility. To use individual patient data you would need to go back to the original source of the data and obtain permission to use it. Are the valid results of the systematic review important? 1. What is the magnitude of the treatment effect? 2. How precise is the treatment effect? Many SRs are beginning to report their results with NNT. However, the reporting of Odds Ratio (OR) or Relative Risk (RR) is still very common. Both OR and RRR can be converted into NNT. Look for 95% confidence intervals. Confidence intervals are the measure of precision The wider the confidence intervals, the less precise the measurement. This is relative Will the results help me in caring for my patients? 1. Is our patient so different from those in the study that its results cannot apply? Our patient does not have to fit all the inclusion criteria of this study. Consider whether our patient s sociodemographic features or pathobiology are so different from those in the study that its results are useless to us and our patient. 2. Is the treatment feasible in our setting? Is the treatment economically feasible and available in our geographic region? 3. What are our patient s potential benefits and harms form the therapy? There is always constant weighing of the treatment s potential benefits and harms. 4. What are our patient s values and expectations for both the outcome we are trying to prevent and the treatment we are offering? We must elicit our patient s preferences for both the outcome we are trying to prevent and the treatment we are offering.