Predicting Chief Complaints at Triage Time in the Emergency Department
|
|
|
- Jack Crawford
- 10 years ago
- Views:
Transcription
1 Predicting Chief Complaints at Triage Time in the Emergency Department Yacine Jernite, Yoni Halpern New York University New York, NY Steven Horng Beth Israel Deaconess Medical Center Boston, MA David Sontag New York University New York, NY Abstract As hospitals increasingly use electronic medical records for research and quality improvement, it is important to provide ways to structure medical data without losing either expressiveness or time. We present a system that helps achieve this goal by building an extended ontology of chief complaints and automatically predicting a patient s chief complaint, based on their vitals and the nurses description of their state at arrival. 1 Introduction While recent years have seen an increase in the adoption of Electronic Medical Records (EMRs) and an interest in using them to improve the quality of care in hospitals, there is still considerable debate as to how best to capture data for both clinical care and secondary uses such as research, quality improvement, and quality measurement. Unstructured data (free text) is preferred by clinicians because it is more expressive and easier to input. Structured data is preferred by researchers and administrators because it can easily be used for secondary analysis. In the emergency department, a patient s chief complaint represents their reason for the visit. It has the potential to be used to subset patients into cohorts, initiate decision support, and perform research. However, it is routinely collected as free text. The need to collect chief complaints as structured data has been advocated for by every Emergency Medicine organization [8]. However, an appropriate chief complaint ontology will consist of over 2,000 terms, making manual input of structured data difficult, if not impossible. In this extended abstract, we present a novel use of natural language processing and machine learning that is able to utilize already collected unstructured clinical data to make collection of structured chief complaint data more efficient and reliable. 1.1 Clinical problem When a patient arrives at the Emergency Department (ED), they are processed at the triage station by a nurse who writes a note summarizing their state (e.g. medical history, symptoms) and a chief complaint used to assign them to the right pathway. We focus on the latter step in this work, building a system that learns to predict the chief complaint automatically from the summary of the patient s state (the triage note), and building an extended ontology to support it. 1
2 Since we want our system to be used in a practical setting, the two following requirements must be met: the user must feel that the software actually saves them time, and that its results can be trusted. The first item leads to a requirement that the program runs instantaneously and that the user interface be well-designed. The second item necessitates that the system be correct most of the time, and that it never give shocking results (we will give an example of such an unwanted behavior in Section 2). An important benefit of the system is that it transforms the chief complaints field in the EMR from free text to a categorical variable in a way that actually saves time (as opposed to simply having the nurse choose the complaint from an extended list), and makes the chief complaints easier to use in other systems at later stages of the patient s stay in the ED. 1.2 Related work The present work is set in a context of growing interest for the applications of medical Natural Language Processing. A variety of software such as ctakes [15], MedLee [6], NegEx [2] and MedConcepts [9] perform numerous NLP tasks, such as dependency parsing, negation detection or concept recognition, specifically on medical text. We focus here on applying NLP and machine learning methods to predicting chief complaints. This will allow us to improve the quality of chief complaints, by enabling use of a large standardized coding system in a practical way. Chief complaints are widely used for a variety of applications. For example, in syndromic surveillance, Chapman et al. [3] used them to monitor the 2002 Winter Olympic Games, and Mandl et al. [12] proposed to take advantage of chief complaints for early detection of fast-spreading diseases. Another application of chief complaints is for improving diagnosis and triage, since they can be used as variables within prediction algorithms and to initiate clinical pathways. For example, Aronsky and Haug [1] use chief complaints in a Bayesian network for diagnosis of community-acquired pneumonia, and Goldman et al. [7] rely on them to predict myocardial infarctions in ED patients. Finally, chief complaints are used to retrospectively analyze clinical data for research purposes, such as to study the prevalence of pain in the ED, as in Cordell et al. [4], or the factors that lead to missing diagnoses of myocardial infarction, as in McCarthy et al. [13]. In contrast to the typical work on using chief complaints, which focuses on natural language processing of chief complaints, we completely change the workflow, providing context-specific algorithms to enable rapid natural language-based entry of coded chief complaints. This is related to the work of Pakhomov et al. [14] on mapping diagnoses to ICD-9 (International Classification of Diseases) codes, and that of Larkey and Croft [11], who assign ICD codes to discharge summaries, although the structures of the ontologies and machine learning techniques used are quite different from ours. 2 Approach We decided to formalize the task of learning to predict the chief complaints for a patient as a multiclass learning problem; indeed, although a patient may come to the ED for more than one reason and thus have multiple labels, more than 4 in 5 actually have a single chief complaint. To that end, we chose to train a linear Support Vector Machine (SVM) on a bag-of-words representation of the triage notes. One useful feature of the linear SVM is that it makes it easy to see which words were most important in the decision, and makes analysis of the results much easier. For each concept in our new ontology we specified an example chief complaint (e.g. SEIZURE ) which resulted in labeled examples to use for training. In order for the SVM to work as intended, we had to deal with the following two issues. First, we realized that some chief complaints appeared too rarely for the SVM to learn to predict them. This was fixed by extending our ontology of chief complaints to have all descriptions of the same concept (such as SEIZURES, S/P SEIZURE, S/P SZ, SZ, SEIZURE ) linked to a single label (SEIZURE). Second, we observed a few errors that we believed would hurt the credibility of the system when used in a practical setting. One example is the following note: pt here with complains severe sudden onset abd pain, nausea and vomiting, blood in emesis, no black or bloody stools 2
3 The chief complaints for this note were [ N/V, ABD PAIN ], but our system predicted the 5 most likely labels to be [ BLOOD IN STOOL/MELENA, ABD PAIN, ST, ABDOMINAL PAIN, H/A ]. Predicting a chief complaint that is explicitly negated in the text seems to be an egregious mistake to a human, but it is a direct consequence of the bag-of-words assumption of the model. Because of this, we had to add a pre-processing step of negation detection, which we describe and report results for in Section 3. 3 Experiments 3.1 Experimental setup We developed our system on a dataset of triage notes with an average 1.2 chief complaints reported per note. We separated this data into a training set of notes and a validation set (to choose the regularization parameter of the SVM) and test set of notes each. To address some of the failings of the bag-of-words assumption, we applied the following preprocessing steps to our data. First, we detected and aggregated significant bi-grams, such as mental status or shortness of breath. We then looked at the performance of three negation detection systems: NegEx [2], a NegEx-like system to which we added a few rules tailored to our data, and a perceptron classifier trained to predict the scope of a negation. The perceptron performed best, as shown in Table 1, and we applied it as a second pre-processing step. In addition to the performance gain, the main advantage of the perceptron is that it can easily be re-trained to adapt to a new hospital without needing an expert to design their own set of rules to complement the NegEx system. NegEx added rules perceptron Precision Recall F Table 1: Performance of the different negation detection algorithms on 200 test sentences. We then used the improved bag-of-words representation of the text, as well as vital signs measured at triage (temperature, blood pressure, etc.), within two learning systems. The first treats the problem as a multiple label prediction task and tries learning a binary SVM classifier [10] for each of the chief complaints, comparing their outputs to sort the labels from most to least likely. The second consisted of a single multiclass SVM [5] which automatically provides such a ranking. Table 2 compares the performance of both systems according to two measures. The Best-n accuracy measures how often the list of n most likely predicted labels actually contained all of the true chief complaints, and DCG stands for the Discounted Cumulative Gain, which measures the quality of the whole ranking. Both measures show that the multiclass SVM performs much better, resulting in our choosing it to build our final system. many-to-one multiclass SVM negation detection none perceptron none perceptron Best Best DCG Table 2: Performance of the linear SVMs on chief complaint prediction. 3.2 Live application Our initial goal was to propose for each patient a set of 10 possible chief complaints for the nurses to select from. However, the user might feel that the software doesn t actually help if they have to go through the list of proposals and still input the right answer manually whenever the system fails. To remedy this situation, we decided to only propose the 5 best guesses of the system, and to additionally set up an intelligent auto-complete for the case when the nurse still wants to input their 3
4 Figure 1: Screenshots of the system now running at BIDMC hospital on note : 69 y/o M patient with severe intermittent RUQ pain. Began soon after eating bucket of ice cream and cupcake. Also is a heavy drinker. Left: the system correctly proposes both RUQ abdominal pain and Allergic reaction as possible chief complaints. Right: If the nurse does not see the label they want, they can start typing and see a list of suggested auto-completes. Again, the four most likely labels describe RUQ abdominal pain and Allergic reaction. answer, based on the ranking of chief complaints output by the SVM. An example of the interface is presented in Figure 1 (the patient name is anonymized). Getting our system to be usable in a practical setting required two further improvements. First, the initial system took about 5 seconds per note, far longer than the triage nurses patience. Using Python s shelve package to store the SVM weights as a persistent dictionary brought this time down to about 200 ms. We also discovered that a small set of patients are taken in without a triage note, but still need to be assigned a chief complaint. We added the absence of text as a feature for the SVM, which allowed for a better ranking of chief complaints for the auto-complete interface. 4 Conclusion In this work, we proposed a system to predict a patient s chief complaints based on a description of their state. Applied in a real-world setting, this provides us with a useful classification of patients which can be used for other tasks, without slowing down the triage process. While our algorithm already provides results that are good enough to be of use in practice, we hope to add some new features in the future. One notable direction that would have benefits similar to those of negation detection is time resolution, and it is an issue we are planning to address next. Finally, recall that while we built the current system on noisily annotated data, where we had to manually transform some of the labels, its use will create a much cleaner dataset, which we plan to use in many downstream applications. References [1] D. Aronsky and P. J. Haug. Diagnosing community-acquired pneumonia with a bayesian network. In Proceedings of the AMIA Symposium, page 632, [2] W. W. Chapman, W. Bridewell, P. Hanbury, G. F. Cooper, and B. G. Buchanan. A simple algorithm for identifying negated findings and diseases in discharge summaries. Journal of biomedical informatics, 34(5): ,
5 [3] W. W. Chapman, L. M. Christensen, M. M. Wagner, P. J. Haug, O. Ivanov, J. N. Dowling, and R. T. Olszewski. Classifying free-text triage chief complaints into syndromic categories with natural language processing. Artificial intelligence in medicine, 33(1):31 40, [4] W. H. Cordell, K. K. Keene, B. K. Giles, J. B. Jones, J. H. Jones, and E. J. Brizendine. The high prevalence of pain in emergency medical care. The American journal of emergency medicine, 20(3): , [5] K. Crammer and Y. Singer. On the algorithmic implementation of multiclass kernel-based vector machines. The Journal of Machine Learning Research, 2: , [6] C. Friedman. Medlee-a medical language extraction and encoding system. Columbia University, and Queens College of CUNY, [7] L. Goldman, E. F. Cook, D. A. Brand, T. H. Lee, G. W. Rouan, M. C. Weisberg, D. Acampora, C. Stasiulewicz, J. Walshon, G. Terranova, et al. A computer protocol to predict myocardial infarction in emergency department patients with chest pain. New England Journal of Medicine, 318(13): , [8] S. W. Haas, D. Travers, J. E. Tintinalli, D. Pollock, A. Waller, E. Barthell, C. Burt, W. Chapman, K. Coonan, D. Kamens, et al. Toward vocabulary control for chief complaint. Academic Emergency Medicine, 15(5): , [9] P. Jindal and D. Roth. Using knowledge and constraints to find the best antecedent. In Proceedings of International Conference on Computational Linguistics (COLING), pages , [10] T. Joachims. Training linear svms in linear time. In Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining, pages ACM, [11] L. S. Larkey and W. B. Croft. Automatic assignment of icd9 codes to discharge summaries. Center for Intelligent Information Retrieval Technical Report, [12] K. D. Mandl, J. M. Overhage, M. M. Wagner, W. B. Lober, P. Sebastiani, F. Mostashari, J. A. Pavlin, P. H. Gesteland, T. Treadwell, E. Koski, et al. Implementing syndromic surveillance: a practical guide informed by the early experience. Journal of the American Medical Informatics Association, 11(2): , [13] B. D. McCarthy, J. R. Beshansky, R. B. D Agostino, and H. P. Selker. Missed diagnoses of acute myocardial infarction in the emergency department: results from a multicenter study. Annals of emergency medicine, 22(3): , [14] S. V. Pakhomov, J. D. Buntrock, and C. G. Chute. Automating the assignment of diagnosis codes to patient encounters using example-based and machine learning techniques. Journal of the American Medical Informatics Association, 13(5): , [15] G. K. Savova, J. J. Masanz, P. V. Ogren, J. Zheng, S. Sohn, K. C. Kipper-Schuler, and C. G. Chute. Mayo clinical text analysis and knowledge extraction system (ctakes): architecture, component evaluation and applications. Journal of the American Medical Informatics Association, 17(5): ,
An Introduction to Data Mining
An Introduction to Intel Beijing [email protected] January 17, 2014 Outline 1 DW Overview What is Notable Application of Conference, Software and Applications Major Process in 2 Major Tasks in Detail
Find the signal in the noise
Find the signal in the noise Electronic Health Records: The challenge The adoption of Electronic Health Records (EHRs) in the USA is rapidly increasing, due to the Health Information Technology and Clinical
Travis Goodwin & Sanda Harabagiu
Automatic Generation of a Qualified Medical Knowledge Graph and its Usage for Retrieving Patient Cohorts from Electronic Medical Records Travis Goodwin & Sanda Harabagiu Human Language Technology Research
Recognizing and Encoding Disorder Concepts in Clinical Text using Machine Learning and Vector Space Model *
Recognizing and Encoding Disorder Concepts in Clinical Text using Machine Learning and Vector Space Model * Buzhou Tang 1,2, Yonghui Wu 1, Min Jiang 1, Joshua C. Denny 3, and Hua Xu 1,* 1 School of Biomedical
Towards SoMEST Combining Social Media Monitoring with Event Extraction and Timeline Analysis
Towards SoMEST Combining Social Media Monitoring with Event Extraction and Timeline Analysis Yue Dai, Ernest Arendarenko, Tuomo Kakkonen, Ding Liao School of Computing University of Eastern Finland {yvedai,
Data Quality Mining: Employing Classifiers for Assuring consistent Datasets
Data Quality Mining: Employing Classifiers for Assuring consistent Datasets Fabian Grüning Carl von Ossietzky Universität Oldenburg, Germany, [email protected] Abstract: Independent
II. RELATED WORK. Sentiment Mining
Sentiment Mining Using Ensemble Classification Models Matthew Whitehead and Larry Yaeger Indiana University School of Informatics 901 E. 10th St. Bloomington, IN 47408 {mewhiteh, larryy}@indiana.edu Abstract
Automated Problem List Generation from Electronic Medical Records in IBM Watson
Proceedings of the Twenty-Seventh Conference on Innovative Applications of Artificial Intelligence Automated Problem List Generation from Electronic Medical Records in IBM Watson Murthy Devarakonda, Ching-Huei
TITLE Dori Whittaker, Director of Solutions Management, M*Modal
TITLE Dori Whittaker, Director of Solutions Management, M*Modal Challenges Impacting Clinical Documentation HITECH Act, Meaningful Use EHR mandate and adoption Need for cost savings Migration to ICD 10
In Proceedings of the Eleventh Conference on Biocybernetics and Biomedical Engineering, pages 842-846, Warsaw, Poland, December 2-4, 1999
In Proceedings of the Eleventh Conference on Biocybernetics and Biomedical Engineering, pages 842-846, Warsaw, Poland, December 2-4, 1999 A Bayesian Network Model for Diagnosis of Liver Disorders Agnieszka
Data Mining On Diabetics
Data Mining On Diabetics Janani Sankari.M 1,Saravana priya.m 2 Assistant Professor 1,2 Department of Information Technology 1,Computer Engineering 2 Jeppiaar Engineering College,Chennai 1, D.Y.Patil College
Problem-Centered Care Delivery
HOW INTERFACE TERMINOLOGY MAKES STANDARDIZED HEALTH INFORMATION POSSIBLE Terminologies ensure that the languages of medicine can be understood by both humans and machines. by June Bronnert, RHIA, CCS,
Challenges of Cloud Scale Natural Language Processing
Challenges of Cloud Scale Natural Language Processing Mark Dredze Johns Hopkins University My Interests? Information Expressed in Human Language Machine Learning Natural Language Processing Intelligent
Healthcare Measurement Analysis Using Data mining Techniques
www.ijecs.in International Journal Of Engineering And Computer Science ISSN:2319-7242 Volume 03 Issue 07 July, 2014 Page No. 7058-7064 Healthcare Measurement Analysis Using Data mining Techniques 1 Dr.A.Shaik
Bagged Ensemble Classifiers for Sentiment Classification of Movie Reviews
www.ijecs.in International Journal Of Engineering And Computer Science ISSN:2319-7242 Volume 3 Issue 2 February, 2014 Page No. 3951-3961 Bagged Ensemble Classifiers for Sentiment Classification of Movie
An Introduction to Machine Learning and Natural Language Processing Tools
An Introduction to Machine Learning and Natural Language Processing Tools Presented by: Mark Sammons, Vivek Srikumar (Many slides courtesy of Nick Rizzolo) 8/24/2010-8/26/2010 Some reasonably reliable
Semi-Supervised Learning for Blog Classification
Proceedings of the Twenty-Third AAAI Conference on Artificial Intelligence (2008) Semi-Supervised Learning for Blog Classification Daisuke Ikeda Department of Computational Intelligence and Systems Science,
Introduction to Data Mining
Introduction to Data Mining Jay Urbain Credits: Nazli Goharian & David Grossman @ IIT Outline Introduction Data Pre-processing Data Mining Algorithms Naïve Bayes Decision Tree Neural Network Association
Divide-n-Discover Discretization based Data Exploration Framework for Healthcare Analytics
for Healthcare Analytics Si-Chi Chin,KiyanaZolfaghar,SenjutiBasuRoy,AnkurTeredesai,andPaulAmoroso Institute of Technology, The University of Washington -Tacoma,900CommerceStreet,Tacoma,WA980-00,U.S.A.
Knowledge Discovery from patents using KMX Text Analytics
Knowledge Discovery from patents using KMX Text Analytics Dr. Anton Heijs [email protected] Treparel Abstract In this white paper we discuss how the KMX technology of Treparel can help searchers
Knowledge Discovery using Text Mining: A Programmable Implementation on Information Extraction and Categorization
Knowledge Discovery using Text Mining: A Programmable Implementation on Information Extraction and Categorization Atika Mustafa, Ali Akbar, and Ahmer Sultan National University of Computer and Emerging
Visual Analytics to Enhance Personalized Healthcare Delivery
Visual Analytics to Enhance Personalized Healthcare Delivery A RENCI WHITE PAPER A data-driven approach to augment clinical decision making CONTACT INFORMATION Ketan Mane, PhD kmane@renci,org 919.445.9703
WHITE PAPER. QualityAnalytics. Bridging Clinical Documentation and Quality of Care
WHITE PAPER QualityAnalytics Bridging Clinical Documentation and Quality of Care 2 EXECUTIVE SUMMARY The US Healthcare system is undergoing a gradual, but steady transformation. At the center of this transformation
Natural Language Processing in the EHR Lifecycle
Insight Driven Health Natural Language Processing in the EHR Lifecycle Cecil O. Lynch, MD, MS [email protected] Health & Public Service Outline Medical Data Landscape Value Proposition of NLP
Transformation of Free-text Electronic Health Records for Efficient Information Retrieval and Support of Knowledge Discovery
Transformation of Free-text Electronic Health Records for Efficient Information Retrieval and Support of Knowledge Discovery Jan Paralic, Peter Smatana Technical University of Kosice, Slovakia Center for
Cleaned Data. Recommendations
Call Center Data Analysis Megaputer Case Study in Text Mining Merete Hvalshagen www.megaputer.com Megaputer Intelligence, Inc. 120 West Seventh Street, Suite 10 Bloomington, IN 47404, USA +1 812-0-0110
An intelligent tool for expediting and automating data mining steps. Ourania Hatzi, Nikolaos Zorbas, Mara Nikolaidou and Dimosthenis Anagnostopoulos
An intelligent tool for expediting and automating data mining steps Ourania Hatzi, Nikolaos Zorbas, Mara Nikolaidou and Dimosthenis Anagnostopoulos Outline Data Mining, current tools An intelligent tool
Sentiment analysis on tweets in a financial domain
Sentiment analysis on tweets in a financial domain Jasmina Smailović 1,2, Miha Grčar 1, Martin Žnidaršič 1 1 Dept of Knowledge Technologies, Jožef Stefan Institute, Ljubljana, Slovenia 2 Jožef Stefan International
Artificial Neural Network Approach for Classification of Heart Disease Dataset
Artificial Neural Network Approach for Classification of Heart Disease Dataset Manjusha B. Wadhonkar 1, Prof. P.A. Tijare 2 and Prof. S.N.Sawalkar 3 1 M.E Computer Engineering (Second Year)., Computer
Active Learning SVM for Blogs recommendation
Active Learning SVM for Blogs recommendation Xin Guan Computer Science, George Mason University Ⅰ.Introduction In the DH Now website, they try to review a big amount of blogs and articles and find the
CENG 734 Advanced Topics in Bioinformatics
CENG 734 Advanced Topics in Bioinformatics Week 9 Text Mining for Bioinformatics: BioCreative II.5 Fall 2010-2011 Quiz #7 1. Draw the decompressed graph for the following graph summary 2. Describe the
A Logistic Regression Approach to Ad Click Prediction
A Logistic Regression Approach to Ad Click Prediction Gouthami Kondakindi [email protected] Satakshi Rana [email protected] Aswin Rajkumar [email protected] Sai Kaushik Ponnekanti [email protected] Vinit Parakh
Machine Learning Final Project Spam Email Filtering
Machine Learning Final Project Spam Email Filtering March 2013 Shahar Yifrah Guy Lev Table of Content 1. OVERVIEW... 3 2. DATASET... 3 2.1 SOURCE... 3 2.2 CREATION OF TRAINING AND TEST SETS... 4 2.3 FEATURE
DATA MINING TECHNIQUES AND APPLICATIONS
DATA MINING TECHNIQUES AND APPLICATIONS Mrs. Bharati M. Ramageri, Lecturer Modern Institute of Information Technology and Research, Department of Computer Application, Yamunanagar, Nigdi Pune, Maharashtra,
A.I. in health informatics lecture 1 introduction & stuff kevin small & byron wallace
A.I. in health informatics lecture 1 introduction & stuff kevin small & byron wallace what is this class about? health informatics managing and making sense of biomedical information but mostly from an
Assisting bug Triage in Large Open Source Projects Using Approximate String Matching
Assisting bug Triage in Large Open Source Projects Using Approximate String Matching Amir H. Moin and Günter Neumann Language Technology (LT) Lab. German Research Center for Artificial Intelligence (DFKI)
Exploring the Challenges and Opportunities of Leveraging EMRs for Data-Driven Clinical Research
White Paper Exploring the Challenges and Opportunities of Leveraging EMRs for Data-Driven Clinical Research Because some knowledge is too important not to share. Exploring the Challenges and Opportunities
Healthcare Data: Secondary Use through Interoperability
Healthcare Data: Secondary Use through Interoperability Floyd Eisenberg MD MPH July 18, 2007 NCVHS Agenda Policies, Enablers, Restrictions Date Re-Use Landscape Sources of Data for Quality Measurement,
Clintegrity 360 QualityAnalytics
WHITE PAPER Clintegrity 360 QualityAnalytics Bridging Clinical Documentation and Quality of Care HEALTHCARE EXECUTIVE SUMMARY The US Healthcare system is undergoing a gradual, but steady transformation.
SUBSTANCE USE DISORDER SOCIAL DETOXIFICATION SERVICES [ASAM LEVEL III.2-D]
SUBSTANCE USE DISORDER SOCIAL DETOXIFICATION SERVICES [ASAM LEVEL III.2-D] I. Definitions: Detoxification is the process of interrupting the momentum of compulsive drug and/or alcohol use in an individual
A Method for Automatic De-identification of Medical Records
A Method for Automatic De-identification of Medical Records Arya Tafvizi MIT CSAIL Cambridge, MA 0239, USA [email protected] Maciej Pacula MIT CSAIL Cambridge, MA 0239, USA [email protected] Abstract
Big Data Text Mining and Visualization. Anton Heijs
Copyright 2007 by Treparel Information Solutions BV. This report nor any part of it may be copied, circulated, quoted without prior written approval from Treparel7 Treparel Information Solutions BV Delftechpark
75-09.1-08-02. Program criteria. A social detoxi cation program must provide:
CHAPTER 75-09.1-08 SOCIAL DETOXIFICATION ASAM LEVEL III.2-D Section 75-09.1-08-01 De nitions 75-09.1-08-02 Program Criteria 75-09.1-08-03 Provider Criteria 75-09.1-08-04 Admission and Continued Stay Criteria
A Bayesian Network Model for Diagnosis of Liver Disorders Agnieszka Onisko, M.S., 1,2 Marek J. Druzdzel, Ph.D., 1 and Hanna Wasyluk, M.D.,Ph.D.
Research Report CBMI-99-27, Center for Biomedical Informatics, University of Pittsburgh, September 1999 A Bayesian Network Model for Diagnosis of Liver Disorders Agnieszka Onisko, M.S., 1,2 Marek J. Druzdzel,
Combining Global and Personal Anti-Spam Filtering
Combining Global and Personal Anti-Spam Filtering Richard Segal IBM Research Hawthorne, NY 10532 Abstract Many of the first successful applications of statistical learning to anti-spam filtering were personalized
International Journal of Computer Science Trends and Technology (IJCST) Volume 2 Issue 3, May-Jun 2014
RESEARCH ARTICLE OPEN ACCESS A Survey of Data Mining: Concepts with Applications and its Future Scope Dr. Zubair Khan 1, Ashish Kumar 2, Sunny Kumar 3 M.Tech Research Scholar 2. Department of Computer
Modeling Temporal Data in Electronic Health Record Systems
International Journal of Information Science and Intelligent System, 3(3): 51-60, 2014 Modeling Temporal Data in Electronic Health Record Systems Chafiqa Radjai 1, Idir Rassoul², Vytautas Čyras 3 1,2 Mouloud
Maschinelles Lernen mit MATLAB
Maschinelles Lernen mit MATLAB Jérémy Huard Applikationsingenieur The MathWorks GmbH 2015 The MathWorks, Inc. 1 Machine Learning is Everywhere Image Recognition Speech Recognition Stock Prediction Medical
life science data mining
life science data mining - '.)'-. < } ti» (>.:>,u» c ~'editors Stephen Wong Harvard Medical School, USA Chung-Sheng Li /BM Thomas J Watson Research Center World Scientific NEW JERSEY LONDON SINGAPORE.
Comparing the Results of Support Vector Machines with Traditional Data Mining Algorithms
Comparing the Results of Support Vector Machines with Traditional Data Mining Algorithms Scott Pion and Lutz Hamel Abstract This paper presents the results of a series of analyses performed on direct mail
Automatic Matching of ICD-10 codes to Diagnoses in Discharge Letters
Automatic Matching of ICD-10 codes to Diagnoses in Discharge Letters Svetla Boytcheva State University of Library Studies and Information Technologies, 119, Tzarigradsko Shosse, 1784 Sofia, Bulgaria [email protected]
Free Text Phrase Encoding and Information Extraction from Medical Notes. Jennifer Shu
Free Text Phrase Encoding and Information Extraction from Medical Notes by Jennifer Shu Submitted to the Department of Electrical Engineering and Computer Science in partial fulfillment of the requirements
Blog Post Extraction Using Title Finding
Blog Post Extraction Using Title Finding Linhai Song 1, 2, Xueqi Cheng 1, Yan Guo 1, Bo Wu 1, 2, Yu Wang 1, 2 1 Institute of Computing Technology, Chinese Academy of Sciences, Beijing 2 Graduate School
An Information Extraction Framework for Cohort Identification Using Electronic Health Records
An Information Extraction Framework for Cohort Identification Using Electronic Health Records Hongfang Liu PhD 1, Suzette J. Bielinski PhD 1, Sunghwan Sohn PhD 1, Sean Murphy 1, Kavishwar B. Wagholikar
How To Use A Fault Docket System For A Fault Fault System
Journal of Engineering and Applied Science Volume 4, December 2012 2012 Cenresin Publications www.cenresinpub.org APPLICATION OF NEURAL NETWORK TECHNIQUE TO TELECOMMUNICATION FAULT DOCKET SYSTEM 1 Okpeki
BOOSTING - A METHOD FOR IMPROVING THE ACCURACY OF PREDICTIVE MODEL
The Fifth International Conference on e-learning (elearning-2014), 22-23 September 2014, Belgrade, Serbia BOOSTING - A METHOD FOR IMPROVING THE ACCURACY OF PREDICTIVE MODEL SNJEŽANA MILINKOVIĆ University
Ensemble Methods. Knowledge Discovery and Data Mining 2 (VU) (707.004) Roman Kern. KTI, TU Graz 2015-03-05
Ensemble Methods Knowledge Discovery and Data Mining 2 (VU) (707004) Roman Kern KTI, TU Graz 2015-03-05 Roman Kern (KTI, TU Graz) Ensemble Methods 2015-03-05 1 / 38 Outline 1 Introduction 2 Classification
Prediction of Heart Disease Using Naïve Bayes Algorithm
Prediction of Heart Disease Using Naïve Bayes Algorithm R.Karthiyayini 1, S.Chithaara 2 Assistant Professor, Department of computer Applications, Anna University, BIT campus, Tiruchirapalli, Tamilnadu,
Biovigilance Component Hemovigilance Module Incident Reporting
Biovigilance Component Hemovigilance Module Incident Reporting National Center for Emerging and Zoonotic Infectious Diseases Division of Healthcare Quality Promotion 1 Objectives Review key terms used
Putting IBM Watson to Work In Healthcare
Martin S. Kohn, MD, MS, FACEP, FACPE Chief Medical Scientist, Care Delivery Systems IBM Research [email protected] Putting IBM Watson to Work In Healthcare 2 SB 1275 Medical data in an electronic or
Applying Machine Learning to Stock Market Trading Bryce Taylor
Applying Machine Learning to Stock Market Trading Bryce Taylor Abstract: In an effort to emulate human investors who read publicly available materials in order to make decisions about their investments,
Intelligent Tools For A Productive Radiologist Workflow: How Machine Learning Enriches Hanging Protocols
GE Healthcare Intelligent Tools For A Productive Radiologist Workflow: How Machine Learning Enriches Hanging Protocols Authors: Tianyi Wang Information Scientist Machine Learning Lab Software Science &
Identify Disorders in Health Records using Conditional Random Fields and Metamap
Identify Disorders in Health Records using Conditional Random Fields and Metamap AEHRC at ShARe/CLEF 2013 ehealth Evaluation Lab Task 1 G. Zuccon 1, A. Holloway 1,2, B. Koopman 1,2, A. Nguyen 1 1 The Australian
Beating the NCAA Football Point Spread
Beating the NCAA Football Point Spread Brian Liu Mathematical & Computational Sciences Stanford University Patrick Lai Computer Science Department Stanford University December 10, 2010 1 Introduction Over
How To Use Neural Networks In Data Mining
International Journal of Electronics and Computer Science Engineering 1449 Available Online at www.ijecse.org ISSN- 2277-1956 Neural Networks in Data Mining Priyanka Gaur Department of Information and
Web-Based Genomic Information Integration with Gene Ontology
Web-Based Genomic Information Integration with Gene Ontology Kai Xu 1 IMAGEN group, National ICT Australia, Sydney, Australia, [email protected] Abstract. Despite the dramatic growth of online genomic
The American Academy of Ophthalmology Adopts SNOMED CT as its Official Clinical Terminology
The American Academy of Ophthalmology Adopts SNOMED CT as its Official Clinical Terminology H. Dunbar Hoskins, Jr., M.D., P. Lloyd Hildebrand, M.D., Flora Lum, M.D. The road towards broad adoption of electronic
Impact of Feature Selection on the Performance of Wireless Intrusion Detection Systems
2009 International Conference on Computer Engineering and Applications IPCSIT vol.2 (2011) (2011) IACSIT Press, Singapore Impact of Feature Selection on the Performance of ireless Intrusion Detection Systems
KNOWLEDGE-BASED IN MEDICAL DECISION SUPPORT SYSTEM BASED ON SUBJECTIVE INTELLIGENCE
JOURNAL OF MEDICAL INFORMATICS & TECHNOLOGIES Vol. 22/2013, ISSN 1642-6037 medical diagnosis, ontology, subjective intelligence, reasoning, fuzzy rules Hamido FUJITA 1 KNOWLEDGE-BASED IN MEDICAL DECISION
6/14/2010. Clinical Decision Support: Applied Decision Aids in the Electronic Medical Record. Addressing high risk practices
Clinical Decision Making in Emergency Medicine Ponte Vedra 2010 Evidence based decision support Clinical Decision Support: Applied Decision Aids in the Electronic Medical Record The ED as a high risk settings
Investigating Clinical Care Pathways Correlated with Outcomes
Investigating Clinical Care Pathways Correlated with Outcomes Geetika T. Lakshmanan, Szabolcs Rozsnyai, Fei Wang IBM T. J. Watson Research Center, NY, USA August 2013 Outline Care Pathways Typical Challenges
Generating Patient Problem Lists from the ShARe Corpus using SNOMED CT/SNOMED CT CORE Problem List
Generating Patient Problem Lists from the ShARe Corpus using SNOMED CT/SNOMED CT CORE Problem List Danielle Mowery Janyce Wiebe University of Pittsburgh Pittsburgh, PA [email protected] [email protected]
The Data Mining Process
Sequence for Determining Necessary Data. Wrong: Catalog everything you have, and decide what data is important. Right: Work backward from the solution, define the problem explicitly, and map out the data
Chapter 6. The stacking ensemble approach
82 This chapter proposes the stacking ensemble approach for combining different data mining classifiers to get better performance. Other combination techniques like voting, bagging etc are also described
Emergency Room (ER) Visits: A Family Caregiver s Guide
Family Caregiver Guide Emergency Room (ER) Visits: A Family Caregiver s Guide Your family member may someday have a medical emergency and need to go to a hospital Emergency Room (ER), which is also called
Ernestina Menasalvas Universidad Politécnica de Madrid
Ernestina Menasalvas Universidad Politécnica de Madrid EECA Cluster networking event RITA 12th november 2014, Baku Sectors/Domains Big Data Value Source Public administration EUR 150 billion to EUR 300
SNOMED CT. The Language of Electronic Health Records
SNOMED CT The Language of Electronic Health Records Contents SNOMED CT: An overview page 02 What is a Clinical Terminology? What is SNOMED CT? The International Health Terminology Standards Development
The Enron Corpus: A New Dataset for Email Classification Research
The Enron Corpus: A New Dataset for Email Classification Research Bryan Klimt and Yiming Yang Language Technologies Institute Carnegie Mellon University Pittsburgh, PA 15213-8213, USA {bklimt,yiming}@cs.cmu.edu
