PEER REVIEW HISTORY ARTICLE DETAILS TITLE (PROVISIONAL)



Similar documents
PEER REVIEW HISTORY ARTICLE DETAILS VERSION 1 - REVIEW. Elizabeth Comino Centre fo Primary Health Care and Equity 12-Aug-2015

PEER REVIEW HISTORY ARTICLE DETAILS VERSION 1 - REVIEW. Jeffrey Sparks Brigham and Women's Hospital Harvard Medical School USA 25-Nov-2015

PEER REVIEW HISTORY ARTICLE DETAILS TITLE (PROVISIONAL)

PEER REVIEW HISTORY ARTICLE DETAILS TITLE (PROVISIONAL)

PEER REVIEW HISTORY ARTICLE DETAILS TITLE (PROVISIONAL)

PEER REVIEW HISTORY ARTICLE DETAILS TITLE (PROVISIONAL)

PEER REVIEW HISTORY ARTICLE DETAILS VERSION 1 - REVIEW

PEER REVIEW HISTORY ARTICLE DETAILS VERSION 1 - REVIEW. Richard Franklin James Cook University, Australia 15-Jun-2015.

PEER REVIEW HISTORY ARTICLE DETAILS VERSION 1 - REVIEW. Fogelholm, Mikael University of Helsinki 19-Jul-2015

The Initial and 24 h (After the Patient Rehabilitation) Deficit of Arterial Blood Gases as Predictors of Patients Outcome

PEER REVIEW HISTORY ARTICLE DETAILS VERSION 1 - REVIEW. Saket Girotra University of Iowa, Iowa City, IA United States 04-Aug-2015

PEER REVIEW HISTORY ARTICLE DETAILS TITLE (PROVISIONAL)

PEER REVIEW HISTORY ARTICLE DETAILS VERSION 1 - REVIEW. Avinesh Pillai Department of Statistics University of Auckland New Zealand 16-Jul-2015

Carrieri, Vicenzo University of Salerno 19-Jun-2013

Organizing Your Approach to a Data Analysis

PEER REVIEW HISTORY ARTICLE DETAILS TITLE (PROVISIONAL)

PEER REVIEW HISTORY ARTICLE DETAILS TITLE (PROVISIONAL)

PEER REVIEW HISTORY ARTICLE DETAILS VERSION 1 - REVIEW. Tatyana A Shamliyan. I do not have COI. 30-May-2012

Analyzing Research Articles: A Guide for Readers and Writers 1. Sam Mathews, Ph.D. Department of Psychology The University of West Florida

PEER REVIEW HISTORY ARTICLE DETAILS VERSION 1 - REVIEW

PEER REVIEW HISTORY ARTICLE DETAILS TITLE (PROVISIONAL)

PEER REVIEW HISTORY ARTICLE DETAILS TITLE (PROVISIONAL)

PEER REVIEW HISTORY ARTICLE DETAILS TITLE (PROVISIONAL)

PEER REVIEW HISTORY ARTICLE DETAILS VERSION 1 - REVIEW. Anna Sinaiko Harvard School of Public Health United States of America 20-Dec-2013

PEER REVIEW HISTORY ARTICLE DETAILS VERSION 1 - REVIEW. Joel Lexchin York University Canada 14-Jul-2014

Project Database quality of nursing (Quali-NURS)

Regression Modeling Strategies

Summary and general discussion

Guidance for Peer Reviewers. The Journal of the American Osteopathic Association (JAOA)

PEER REVIEW HISTORY ARTICLE DETAILS

Protective Effects of Rheumatoid Arthritis in Septic ICU Patients

A Comparison of Costs Between Medical and Surgical Patients in an Academic Pediatric Intensive Care Unit

Title: First year growth response to growth hormone treatment predicts the longterm prepubertal growth response in children

PEER REVIEW HISTORY ARTICLE DETAILS TITLE (PROVISIONAL)

PEER REVIEW HISTORY ARTICLE DETAILS VERSION 1 - REVIEW

PEER REVIEW HISTORY ARTICLE DETAILS TITLE (PROVISIONAL)

PEER REVIEW HISTORY ARTICLE DETAILS VERSION 1 - REVIEW. Dingcheng Li Mayo Clinic, USA 20-Dec-2015

Running head: SAMPLE FOR STUDENTS 1. Sample APA Paper for Students Interested in Learning APA Style 6th Edition. Jeffrey H. Kahn

PEER REVIEW HISTORY ARTICLE DETAILS VERSION 1 - REVIEW. Lepelletier Nantes University Hospital, Nantes, France 20-Jul-2015

PEER REVIEW HISTORY ARTICLE DETAILS TITLE (PROVISIONAL)

PEER REVIEW HISTORY ARTICLE DETAILS TITLE (PROVISIONAL)

Quality Improvement Primers. Measurement for Quality Improvement

PEER REVIEW HISTORY ARTICLE DETAILS TITLE (PROVISIONAL)

Electronic Medical Record Mining. Prafulla Dawadi School of Electrical Engineering and Computer Science

FERRIS STATE UNIVERSITY SCHOOL OF NURSING PORTFOLIO GUIDELINES AND EVALUATION PROCESS PRELICENSURE BSN PROGRAM. Introduction:

PEER REVIEW HISTORY ARTICLE DETAILS TITLE (PROVISIONAL)

PEER REVIEW HISTORY ARTICLE DETAILS VERSION 1 - REVIEW

How To Understand And Understand The Relationship Between Patient And Physician Mistrust In China

Introduction. Survival Analysis. Censoring. Plan of Talk

The Relationship between Speed and Car Driver Injury Severity

HIST 499: Senior Seminar in History. Sample Syllabus

Accountable Care: Implications for Managing Health Information. Quality Healthcare Through Quality Information

PREDICTIVE ANALYTICS FOR THE HEALTHCARE INDUSTRY

Deriving Value from ORSA. Board Perspective

PEER REVIEW HISTORY ARTICLE DETAILS TITLE (PROVISIONAL)

Florida s Certificate of Need

Critical Appraisal of the Medical Literature

WHITE PAPER Insurance Rates and Your Credit: What Every Consumer Should Know. October 21,

Leadership Success Factors

Author's response to reviews

Critical Appraisal of a Research Paper

PEER REVIEW HISTORY ARTICLE DETAILS TITLE (PROVISIONAL)

BUS 373 ORGANIZATIONAL BEHAVIOR Syllabus Fall 2015

Discussion Paper On the validation and review of Credit Rating Agencies methodologies

PEER REVIEW HISTORY ARTICLE DETAILS TITLE (PROVISIONAL)

The ACGME Resident Survey Aggregate Reports: An Analysis and Assessment of Overall Program Compliance

THE RISK DISTRIBUTION CURVE AND ITS DERIVATIVES. Ralph Stern Cardiovascular Medicine University of Michigan Ann Arbor, Michigan.

Impact of Part D Coverage Gap on Adherence to Diabetes Medications Feng Zeng, Ph.D.

Predictive Analytics in Workplace Safety:

6. MATERIAL RESOURCE AND SUPPORT NEEDS OF TEACHERS

Impact of Critical Care Nursing on 30-day Mortality of Mechanically Ventilated Older Adults

IBA Business Analytics Data Challenge

Example: Credit card default, we may be more interested in predicting the probabilty of a default than classifying individuals as default or not.

Value-Based Purchasing Program Overview. Maida Soghikian, MD Grand Rounds Scripps Green Hospital November 28, 2012

CHILDHOOD CANCER SURVIVOR STUDY Analysis Concept Proposal

Advanced Quantitative Methods for Health Care Professionals PUBH 742 Spring 2015

SURVEILLANCE VALIDATION GUIDE. for healthcare associated Staphylococcus aureus Bloodstream Infection

multi-media response study

TECHNICAL REPORT WRITING GUIDELINES

Predicting Successful Completion of the Nursing Program: An Analysis of Prerequisites and Demographic Variables

PEER REVIEW HISTORY ARTICLE DETAILS VERSION 1 - REVIEW. Nahyuha Chomi, Eunice United Kingdom 03-Jul-2015

Mortality Assessment Technology: A New Tool for Life Insurance Underwriting

Department of Behavioral Sciences and Health Education

Link Prediction in Social Networks

PEER REVIEW HISTORY ARTICLE DETAILS TITLE (PROVISIONAL)

PEER REVIEW HISTORY ARTICLE DETAILS VERSION 1 - REVIEW. Dr Andreas Xyrichis King's College London, UK 14-Jun-2015

Addressing Student Retention and Persistence Issue in Online Classes

Improving Employee Satisfaction in Healthcare through Effective Employee Performance Management

PEER REVIEW HISTORY ARTICLE DETAILS TITLE (PROVISIONAL)

How To: Analyse & Present Data

Strategies for Identifying Students at Risk for USMLE Step 1 Failure

ERC University & Degree Requirements for Freshmen. This video will provide an overview of your University & Degree requirements.

Title:Online training course on critical appraisal for nurses: adaptation and assessment

CISM ITEM DEVELOPMENT GUIDE

Aspects of Risk Adjustment in Healthcare

MHI3000 Big Data Analytics for Health Care Final Project Report

DATA CONSISTENCY, COMPLETENESS AND CLEANING. By B.K. Tyagi and P.Philip Samuel CRME, Madurai

of the Dossier Volume: Page:

Methods for Interaction Detection in Predictive Modeling Using SAS Doug Thompson, PhD, Blue Cross Blue Shield of IL, NM, OK & TX, Chicago, IL

Newborn outcomes after cesarean section for fetal distress in BC

Transcription:

PEER REVIEW HISTORY BMJ Open publishes all reviews undertaken for accepted manuscripts. Reviewers are asked to complete a checklist review form (http://bmjopen.bmj.com/site/about/resources/checklist.pdf) and are provided with free text boxes to elaborate on their assessment. These free text comments are reproduced below. ARTICLE DETAILS TITLE (PROVISIONAL) AUTHORS The Deyo-Charlson and Elixhauser-van Walraven Comorbidity Indices as predictors of mortality in critically ill patients Ladha, Karim; Zhao, Kevin; Quraishi, Sadeq; Kurth, Tobias; Eikermann, Matthias; Kaafarani, Haytham; Klein, Eric; Seethala, Raghu; Lee, Jarone VERSION 1 - REVIEW REVIEWER REVIEW RETURNED Tainter, Christopher University of California, San Diego, Emergency Medicine 30-Jun-2015 GENERAL COMMENTS This is a well-described and useful analysis of an extraordinary data set. I think it is ready for publication, but offer a few discretionary revisions: Page 11, paragraph 3, line 44: This appears to be a statistically significant finding, not necessarily a clinically relevant one. I would strongly encourage inclusion of a graphic (table) describing the calculation of both comorbidity indices. Was an analysis performed using hospital and/or ICU length of stay as outcome measures? This would be another very useful outcome, and should be considered for inclusion. I suggest moving the citation (superscript numbers) after the punctuation. REVIEWER Zampieri, Fernando G. Hospital das Clínicas, Intensive Care Unit REVIEW RETURNED 01-Jul-2015 GENERAL COMMENTS Ladha and coworkers performed a nice study aiming to compare the predictive capability of two commonly used comorbidity indexes (Charlson and Elixhauser) in critically ill patients. This a timely topic and the large sample size available represents an excellent opportunity to increase knowledge in the field. Nevertheless, I have some major concerns: 1. Introduction: Fine, no major comments. 2. Methods/Results: - The main question posed by the authors is whether DCCI is superior or not to EVCI. The authors have chosen to address this issue by comparison of ROC AUC using DeLong tests at predefined time windows. I do not believe that multiple ROC curve testing at

specific time frames is the most appropriate way to answer the question you are asking. I am specially concern by increase in type 1 error due to multiple testing. In fact, as discussed by the authors, many positive findings are a consequence of the large sample size, but multiple testing may also be involved. Assuming you may have data for one-year survival, why not building a Cox model and time dependent ROC curves? A suggested approach would be as following: a. Assessing proportionality of the effect of each comorbidity index in 1-year outcome (since you are using R, you could use the cox.zph function in the survival package). b. Construct a time dependent ROC curve for each comorbidity index (timeroc package is very handful for this); weighting can be done using Cox model. c. Compare both time dependent ROC curves (also may be done in timeroc). The theoretical advantages of such approach is correcting for multiple comparisons and providing a nice graphical display of the evolution of AUC over time. The reason for this last comment will be clear in my next comments. - I see that the starting point for data collection was ICU admission. While this is frequently the chosen starting point, it may not be the most adequate depending on the objective of your manuscript (for a discussion on this, please see Ranzani OT et al, Crit Care 2013). You are pooling ICU mortality together with in-hospital and afterhospital mortality. Depending on your hypothesis, perhaps splittings groups according to ICU and/or hospital survival may make more sense than only evaluating fixed time frames. - The finding that the AUC for mortality for each score increases as time spam increases is very interesting (in my opinion, the most interesting finding of the manuscript). Reasons for this may be obvious, but are not properly discussed. Comorbidities and age may be more important prognosis determinants only after a few months following ICU discharge (see Lone et al, Blue Journal 2013 and Ranzani et al, Crit Care, 2015). As the effects of critical illness vanishes, baseline patient features may become prominent. As one moves away from critical illness, the impact of comorbidities in prognosis may be more relevant and, therefore, the discriminative capability of any score that measures comorbidities will be higher. This ought to be explored. - While ROC curves may be appropriate for accuracy comparison they do not provide any information regarding calibration of a measurement. The authors tried to overcome this by performing a reclassification analysis (by the way, did you use any specific R package for reclassification analysis? The PredictABEL package is frequently used for this. If so, please quote the R package accordingly) but I still miss a more robust calibration assessment (calibration plot maybe?). - Why correcting for age, gender and race? Was there any clue that these variables would be relevant? Or maybe correlated to comorbidities? If so, how was this assessed? - I really like Table S1. I think it should be in the body of the manuscript. 3. Discussion: Should address points mentioned above. 4. Tone of the conclusions and key points should be tempered.

VERSION 1 AUTHOR RESPONSE Reviewer: 1 Reviewer Name Christopher R Tainter Institution and Country University of California, San Diego USA Please state any competing interests or state None declared : None declared Please leave your comments for the authors below This is a well-described and useful analysis of an extraordinary data set. I think it is ready for publication, but offer a few discretionary revisions: Page 11, paragraph 3, line 44: This appears to be a statistically significant finding, not necessarily a clinically relevant one. The wording has been changed and the term clinically relevant has been removed. I would strongly encourage inclusion of a graphic (table) describing the calculation of both comorbidity indices. We have now included a table describing the frequency of each comorbidity used in the calculation of the indices within the manuscript as table 2. The methods used to calculate the indices have been previously described and these papers are cited in the manuscript. Was an analysis performed using hospital and/or ICU length of stay as outcome measures? This would be another very useful outcome, and should be considered for inclusion. This is an excellent idea and we have now included this analysis in the manuscript. I suggest moving the citation (superscript numbers) after the punctuation. This has been changed. Reviewer: 2 Reviewer Name Fernando G Zampieri Institution and Country 1 - Hospital das Clínicas, Faculty of Medicine, University of São Paulo, São Paulo, Brazil 2 - Intensive Care Unit, Hospital Alemão Oswaldo Cruz, São Paulo, Brazil Please state any competing interests or state None declared : None declared Please leave your comments for the authors below Thank you for the opportunity to review your work. Ladha and coworkers performed a nice study aiming to compare the predictive capability of two commonly used comorbidity indexes (Charlson and Elixhauser) in critically ill patients. This a timely topic and the large sample size available represents an excellent opportunity to increase knowledge in the field. Nevertheless, I have some major concerns: 1. Introduction: Fine, no major comments.

2. Methods/Results: - The main question posed by the authors is whether DCCI is superior or not to EVCI. The authors have chosen to address this issue by comparison of ROC AUC using DeLong tests at predefined time windows. I do not believe that multiple ROC curve testing at specific time frames is the most appropriate way to answer the question you are asking. I am specially concern by increase in type 1 error due to multiple testing. In fact, as discussed by the authors, many positive findings are a consequence of the large sample size, but multiple testing may also be involved. Assuming you may have data for one-year survival, why not building a Cox model and time dependent ROC curves? A suggested approach would be as following: a. Assessing proportionality of the effect of each comorbidity index in 1-year outcome (since you are using R, you could use the cox.zph function in the survival package). b. Construct a time dependent ROC curve for each comorbidity index (timeroc package is very handful for this); weighting can be done using Cox model. c. Compare both time dependent ROC curves (also may be done in timeroc). The theoretical advantages of such approach is correcting for multiple comparisons and providing a nice graphical display of the evolution of AUC over time. The reason for this last comment will be clear in my next comments. This is an interesting idea that we had not previously considered. While the reviewer s comments regarding multiple testing are well taken, the conclusion of the paper is that even with a statistically significant difference, there is no meaningful or clinically significant difference. Additionally, the method proposed does not account for multiple testing since the recommended procedure does not compare one curve to another. Rather, once the model is created, separate ROC curves can be constructed for various time points and AUCs are compared at each time point specified. However we felt that the analytic approach suggested by the reviewer could be used as an additional analysis and have included a figure in the manuscript demonstrating the consistency of the results. Of note though, we did not use the cox weighting but rather the Kaplan-Meier estimator. This is because using the recommended cox.zph function, the assumption of proportional hazards was not met. - I see that the starting point for data collection was ICU admission. While this is frequently the chosen starting point, it may not be the most adequate depending on the objective of your manuscript (for a discussion on this, please see Ranzani OT et al, Crit Care 2013). You are pooling ICU mortality together with in-hospital and after-hospital mortality. Depending on your hypothesis, perhaps splittings groups according to ICU and/or hospital survival may make more sense than only evaluating fixed time frames. The main focus of the paper was to determine how these indices could be used to predict mortality at different time intervals regardless of where the death occurred. We felt that this was the more useful outcome and there have been other studies examining in-hospital mortality and these indices. - The finding that the AUC for mortality for each score increases as time spam increases is very interesting (in my opinion, the most interesting finding of the manuscript). Reasons for this may be obvious, but are not properly discussed. Comorbidities and age may be more important prognosis determinants only after a few months following ICU discharge (see Lone et al, Blue Journal 2013 and Ranzani et al, Crit Care, 2015). As the effects of critical illness vanishes, baseline patient features may become prominent. As one moves away from critical illness, the impact of comorbidities in prognosis may be more relevant and, therefore, the discriminative capability of any score that

measures comorbidities will be higher. This ought to be explored. Thank you for this point and we agree with the reviewer that this is a very interesting concept. As we did not have organ dysfunction scores, we can only theorize in our discussion that the improved predictability is related to patients previous comorbidities, age, and the new comorbidities that the patient collected during their ICU stay. We have now included the following in the discussion: Another interesting finding was that both indices improved mortality prediction over time. The AUC for the DCCI improved from 0.65 at 30 days to 0.72 at 365 days; and, similarly the AUC for EVCI improved from 0.66 at 30 days to 0.72 at 365 days. This improvement most likely is because as critically-ill patients survive past their ICU stay, the major determinants of survival are related to their baseline comorbidities, age, and new organ dysfunction as a result of their critical illness. Without prospective and organ dysfunction data, this is only a theory but two previous studies are congruent with our findings, where the researchers found that patients with higher organ dysfunction scores during their ICU stay had higher utilization of healthcare resources and mortality up to one year, and possibly 5 years. - While ROC curves may be appropriate for accuracy comparison they do not provide any information regarding calibration of a measurement. The authors tried to overcome this by performing a reclassification analysis (by the way, did you use any specific R package for reclassification analysis? The PredictABEL package is frequently used for this. If so, please quote the R package accordingly) but I still miss a more robust calibration assessment (calibration plot maybe?). The PredictABEL package was used and is now quoted. We used the same package to create calibration plots as requested and they are now included in the manuscript. - Why correcting for age, gender and race? Was there any clue that these variables would be relevant? Or maybe correlated to comorbidities? If so, how was this assessed? These were chosen since they are factors that are readily available in administrative databases and thus can easily be included in a predictive model. This point is now made in the methods section. - I really like Table S1. I think it should be in the body of the manuscript. This table has been moved to the body of the manuscript. 3. Discussion: Should address points mentioned above. As noted above we have modified the discussion section to account for the points previously mentioned by the reviewer. 4. Tone of the conclusions and key points should be tempered. We agree with the reviewer that ending the manuscript and conclusion with the statement that the DCCI or EVCI can possibly replace physiology-based indices should be tempered. We have added the following sentence to the end of the conclusion:

Further studies are needed to determine if the DCCI or EVCI can be used in place of physiologybased risk-indices. VERSION 2 REVIEW REVIEWER Zampieri, Fernando G. Hospital das Clínicas, Intensive Care Unit REVIEW RETURNED 14-Aug-2015 GENERAL COMMENTS The reviewer completed the checklist but made no further comments.