How To Retrieve Similar Cases From A Medical Record
|
|
- Dwayne Augustine Hart
- 3 years ago
- Views:
Transcription
1 Retrieval of Similar Electronic Health Records Using UMLS Concept Graphs Laura Plaza and Alberto Díaz Universidad Complutense de Madrid C/Profesor José García Santesmases, s/n, Madrid 28040, Spain Abstract. Physicians often use information from previous clinical cases in their decision-making process. However, the large amount of patient records available in hospitals makes an exhaustive search unfeasible. We propose a method for the retrieval of similar clinical cases, based on mapping the text onto UMLS concepts and representing the patient records as semantic graphs. The method also deals with the problems of negation detection and concept identification in clinical free text. To evaluate the approach, an evaluation collection has been developed. The results show that our method correlates well with the expert judgments and outperforms remarkably the traditional term-vector space model. Keywords: Similar Case Retrieval, Graph Theory, Electronic Health Record, UMLS. 1 Introduction In their daily work, physicians often face complex clinical cases and need to refer to similar patient cases encountered before, especially when untypical cases are presented. In fact, as stated in [1], previous knowledge about the problem is proved to be one of the most influencing factors in the decision-making process. However, as shown in [2], the excessive time required to find this knowledge may undermine its convenience if no effective access technologies are provided. When dealing with medical records, the information is mainly stored as free-text. This information is considerably more difficult to analyze than that presented in more formal texts, such as textbooks or scientific papers, since it exhibits unique sublanguage characteristics (e.g. verbless sentences, lack of punctuation and spelling errors). Moreover, negation detection plays an important role when trying to understand the meaning of medical information. The task of retrieving similar medical records may be considered a particular case of Information Retrieval (IR) that may be stated as: Given a reference record, retrieve other records from the clinical database that are similar to the reference one. To decide the criteria to assess the similarity between records is the most important and difficult issue in this task. In this paper, we propose a method for the retrieval of similar Electronic Health Records (EHRs), based on mapping the text onto UMLS concepts and representing the patient records C.J. Hopfe et al. (Eds.): NLDB 2010, LNCS 6177, pp , c Springer-Verlag Berlin Heidelberg 2010
2 Retrieval of Similar Electronic Health Records Using UMLS Concept Graphs 297 as semantic graphs, where the vertices are UMLS concepts and the edges represent is-a relations between them. The method also deals with the problems of negation detection and concept identification in clinical free text. 2 Background 2.1 Related Work Information Retrieval in medicine dates back to the middle 1960s. Early approaches use words to index the documents in the corpus. However, though term-based indexing using a vector space model is simple and powerful, conceptbased indexing using controlled terminologies can improve the performance of IR. A pioneer work on IR in the medical context which makes use of these resources is SAPHIRE [3]. The aim of this project was the development of methods for indexing and retrieving medical documents from bibliographic databases. Most NLP works in the biomedical domain have focused on journal articles rather than clinical free text. A recent initiative for the development of NLP of medical records is the CLEF forum; in particular, the ImageCLEF track 1.The goal is to retrieve the most relevant images to a predefined set of queries, based on their clinical case descriptions. The best results in 2005 and 2006 editions came from Lacoste et al. [4], who applied a semantic indexing to identify key concepts in certain UMLS semantic categories. Other systems that competed in the task made use of other resources, such as MeSH, to expand the queries [5]. Apart from these few approaches, most medical IR systems do not use any external resource, but base their decisions on the terms in the text, and only a few of them use ad hoc lists or manual tagging to determine which of these terms are symptoms, diseases or other relevant features [6]. Ontologies and controlled vocabularies may avoid the need of producing and maintaining these lists, while capturing the semantic in the text and the relations between the terms. 2.2 The Electronic Health Record and the Domain Peculiarities Clinical records present unique attributes that must be taken into account in any NLP system. First, the structure and content vary with the audience needs. It is a mix of highly structured information and other idiosyncratic narrative text. Sometimes, images are included. Finally, medical records differ greatly in size: from a few lines to several pages. In summary, any information that is relevant for the decision process can be part of the medical record, so that it is either impossible or surprisingly difficult to find a predictable retrieval pattern. Second, negation detection is a fundamental issue, since it can invert the sense of a text. Negation in natural language can be extremely subtle, but medical language is much more restricted and negations are expected to be more direct [7]. A variety of approaches have addressed negation in medical texts [8,9]. 1 ImageCLEF medical retrieval task,
3 298 L. Plaza and A. Díaz Third, the peculiarities of the terminology and the writing practices of physicians make concept detection a very ambitious task [10]. The first challenge is the problem of synonyms and homonyms. Another handicap is the presence of neologisms. Finally, elisions and abbreviations complicate the automatic disambiguation of medical text. 2.3 The Use of UMLS for Concept Annotation One of the most popular biomedical terminologies in NLP applications is the Unified Medical Language System (UMLS) [11]. UMLS consists of 3 main components: the Specialist Lexicon, the Metathesaurus and the Semantic Network. The Metathesaurus comprises a collection of biomedical concepts derived from more than 100 vocabulary sources and the relations among them. The Semantic Network consists of a set of categories (semantic types) that provides a categorization of the concepts in the Metathesaurus. Using UMLS for concept annotation presents two main advantages: first, it lists more than entries of ambiguous terms (which attenuate the problems of synonymy and homonymy); second, it contains numerous entries for elisions and abbreviations. In order to map the text onto UMLS concepts, the MetaMap Transfer tool (MMTx) is used. MetaMap [12] allows mapping biomedical free-form text to Metathesaurus concepts with a high level of accuracy [13]. 3 A Method for Automatic Retrieval of Similar EHRs In this section, a concept graph-based method for retrieving similar EHRs is presented. It consists of 4 steps. Each step is discussed in the following subsections. Besides, in order to clarify how the algorithm works, the following radiology report from the CMC-NLP corpus 2 is used as working example: CLINICAL HISTORY: Eleven years old with ALL, bone narrow transplant on Jan.2, now with three day history of cough. IMPRESSION: No focal pneumonia. Likely chronic changes at the left lung base. Mild anterior wedging of the thoracic vertebral bodies. 3.1 Extraction of UMLS Concepts In this step, the text in each EHR from the database is mapped onto UMLS concepts and semantic types using MetaMap. In order to understand what information is relevant for the retrieval of similar clinical cases, a hospital physician has been consulted. The decision apparently depends on multiple criteria (e.g. the physician s specialty and particular concerns). However, as general guidelines, two clinical records can be considered similar if: (1) the same symptom or 2 Computational Medicine Center s 2007 Medical Natural Language Processsing Challenge (CMC-NLP 2007),
4 Retrieval of Similar Electronic Health Records Using UMLS Concept Graphs 299 Table 1. Relevant UMLS semantic types Category UMLS semantic types Sign or Symptom Symptoms and signs Finding Diseases Disease or Syndrome Pathologic Function Procedures Therapeutic or Preventive Procedure Diagnosis Procedure Body parts Body Location or Region Body Part, Organ, or Organ Component Medicaments Pharmacologic substance sign is presented (e.g. fever or 5 kg weight loss), (2) the patients have received the same diagnosis (e.g. bacterial pneumonia), (3) the same test or procedure is reported (e.g. cerebral NMR or endoscopy biopsy) or (4) the same medicament has been administered (e.g. clopidogrel). Therefore, accordingto the domain expert guidelines relevant attributes to the task are: symptoms or signs, diseases, procedures and medicaments. Identifying which concepts in a medical record correspond to each of these categories is not trivial. However, the UMLS Semantic Network includes very useful information to map the concepts in the patient records to the previous categories. Then, only the concepts from a subset of UMLS semantic types are considered. Table 1 shows these semantic types along with their mappings to the categories above. A further category has been added, body parts, since they are often involved in the descriptions of procedures and diseases (e.g. fractured rib). 3.2 Negation Detection The aim of this step is to detect negated concepts in the EHRs since, according to the domain expert, absent symptoms or diseases are not relevant for the task. Since negations within medical records usually appear in a reduced number of forms, a simple lexical scanner from regular expressions is used. According to this, we have come up with 4 negation classes. Table 2 shows the lexical patterns used to detect them, along with some examples of their occurrences within the corpora. Concept stands for the previously identified UMLS concepts. 3.3 Semantic Graph Representation The next step consists of creating a graph-based representation for each EHR in the database. First, the concepts identified in the previous step are retrieved from the UMLS Metathesaurus along with their complete hierarchy of hypernyms. Second, all concept hierarchies for each category are merged, building a unique graph for each category in the EHR, where the edges represent semantic relations, and the vertices represent distinct concepts in the text. Finally, each concept is assigned a weight using equation (1), where α is the set of all the parents of the concept A, including A, and β is the set of all the parents of
5 300 L. Plaza and A. Díaz Table 2. Negation classes Lexical Pattern Examples no without rule out + adj? + concept + (or concept)* Without focal scarring Rule out fever or cough no without + noun + of + concept + (or concept)* No signs of tuberculosis No evidence of hydroureter evaluate for + (noun adj)? + concept + (or concept)* Evaluate for foreign body Evaluate for abnormalities Lack of kyphosis lack absence of + (noun adj)? + concept + (or concept)* Absence of heart murmur concept B, including B. This formula will assign greater weight to an edge as the concepts that it links become more specific. Fig. 1 shows these graphs for the record example. The edges of one of these graphs have been labeled with their weights. It can be observed how the acronym ALL (Acute Lymphocytic Leukemia) is expanded by MetaMap. weight(a, B) = α β α β (1) Fig. 1. An example of EHR semantic representation 3.4 Computing the Similarity between EHRs The purpose of the last step is to compute the similarity between the graphbased representations of two patient records. To this end, a non-democratic vote
6 Retrieval of Similar Electronic Health Records Using UMLS Concept Graphs 301 mechanism is used, similar to that proposed in [14]. Given two graphs, A and B, so that the similarity of A to B has to be measured, each concept of A which is present in B assigns a score equal to the weight of that concept in the graph A, and 0 otherwise. Next, the sum of the scores for all concepts in A is computed. This result is normalized in the interval [0, maximum similarity]. Finally, the comparison between the representations of the patient records is accomplished by computing the similarity between the hierarchies of the graphs for each category and adding these partial results to calculate a global similarity value. 4 Evaluation Methodology To the authors knowledge, no corpus of relevance judgments of medical record similarity is publicly available. For this reason, a collection of radiology reports from the Cincinnati Children s Hospital Medical Center s Department of Radiology is used. This corpus was designed for being used in the CMC-NLP 2007 ICD-9-CM categorization task. To evaluate the method, 50 reports have been obtained from this corpus, using a stratified sampling, so that for each ICD category, a number of reports proportional to its size are selected. To avoid very small categories, only ten categories have been used. We will refer to this set as our test collection. Next, a subset of 20 reports has been separated from the test collection, once again using a stratified sampling. We will refer to this set as our query collection. Two hospital physicians were asked to select, for each record in the query collection, the most similar ones within the test collection. To measure the interjudge agreement, the Kappa statistic [15] has been calculated. An average kappa value equal to is obtained, which indicates that there is a substantial (0.61<= k <= 0.80) agreement. The evaluation is done by comparing the EHRs retrieved by the system which those retrieved by the experts and calculating precision and recall metrics. As the method s output is a ranking of EHRs, two different evaluation approaches are used: first, the number of records to retrieve is set to 3 and 5 respectively; and second, the average precision at all different levels of recall is calculated [16]. Since the relevance judgments differ across judges, two experiments are presented: first, only overlapping judgments are considered; second, the union of the judgments is used. 5 Results and Discussion In order to verify if the method proposed yields better results than the classic term vector space model, we compare the retrieval precision and recall obtained by both methods. We first examine the results of our algorithm when 3 and 5 document cutoffs are used (Table 3: Union-3, Intersection-3, Union-5, Intersection-5). Note that the order of documents in the ranking is irrelevant. Our method significantly outperforms the vector space model in precision and recall, both with the union and the intersection of the relevance judgements. It can be also observed that
7 302 L. Plaza and A. Díaz when the 5 document cutoff is used, the precision in our method considerably decreases. The reason is that we have forced the system to retrieve 5 documents when the experts have retrieved only documents per query on average. On the contrary, precision in the vector space model slightly increases with the number of documents retrieved. The reason is that a good number of the relevant documents retrieved are not ranked in the top 3 positions, but in positions 4 and 5. As expected, recall increases with the number of documents retrieved in both systems. Table 3. Results of the evaluation and comparison with the vector space model Graph-based Term-based Precision Recall F-measure Precision Recall F-measure Union Intersection Union Intersection Union Intersection We next examine the retrieval performance when precision at all levels of recall is considered (Table 3: Union, Intersection), so that the position of the relevant documents in the ranking does matter now. Once again our algorithm obtains a significant improvement in precision over the term-based approach. 6 Conclusion and Future Work In this paper, a novel approach to the automatic retrieval of similar EHRs has been presented. The method represents the medical record as a set of semantic graphs using UMLS concepts and relations. This way it gets a richer representation than the one provided by traditional models based on terms. The method achieves relatively high precision and recall, which are also well balanced, which indicates that even though some relevant records are not ranked in the top positions, most retrieved documents are relevant. However, the intermediate results of the method have shown that the indexing is not sufficiently exhaustive, as UMLS occasionally fails to recover relevant concepts especially when these concepts are expressed in their shortened forms. Another important impairment to concept identification comes from the spelling errors that frequently occur in the clinical records. Future work will test the method on a different evaluation collection which ideally will present longer medical records structured in different sections, so that the position of the concepts in these sections could condition whether or not these concepts are relevant for indexing. Furthermore, a user-oriented evaluation is planned which attempt to assess the system from cost-effectiveness and benefit points of view.
8 Retrieval of Similar Electronic Health Records Using UMLS Concept Graphs 303 Acknowledgements This research has been partially funded by the Spanish Ministerio de Ciencia e Innovación (TIN C03-01) and by the Spanish Ministerio de Industria, Turismo y Comercio (TIS ). References 1. Gorman, P., Helfand, M.: Information seeking in primary care: how physicians choose which clinical question to pursuit and which to leave unanswered. Medical Decision Making 15, (1995) 2. Ely, J., Osheroff, J., Ebell, M., Chambliss, M., Vinson, D., Stevermer, J., Pifer, E.: Obstacles to answering doctors questions about patient care with evidence: qualitative study. British Medical Journal 324, (2002) 3. Hersh, W., Hickam, D.: Information retrieval in medicine: The Saphire experience. Journal of the American Society for Information Science 46, (1995) 4. Lacoste, C., Lim, J., Chevallet, J., Le, D.: Medical-image retrieval based on knowledge-assisted text and image indexing. IEEE Transactions on Circuits and Systems for Video Technology 17, (2007) 5. Navarro, S., Llopis, F., Muñoz, R.: Different Multimodal Approaches using IR-n in ImageCLEFphoto In: On-line Working Notes CLEF (2008) 6. Kwiatkowska, M., Atkins, S.: Case representation and retrieval in the diagnosis and treatment of obstructive sleep apnea: A semiofuzzy approach. In: Proceedings of the 7th ECCBR (2004) 7. Gindl, S., Kaiser, K., Miksch, S.: Syntactical negation detection in clinical practice guidelines (2008) 8. Mutalik, A., Deshpande, A., Nadkarni, P.: Use of general-purpose negation detection to augment concept indexing of medical documents. A quantitative study using the UMLS. JAIMA 8, (2001) 9. Morante, R., Liekens, A., Daelemans, W.: Learning the scope of negation in biomedical texts. In: Proceedings of the EMNLP Conference, pp (2008) 10. Nadkarni, P.: Information retrieval in medicine: overview and applications. Journal of Postgraduate Medicine 46, (2000) 11. Nelson, S., Powell, T., Humphreys, B.: The Unified Medical Language System (UMLS) Project. In: Kent, A., Hall, C.M. (eds.) Encyclopedia of Library and Information Science, Marcel Dekker, Inc., New York (2002) 12. Aronson, A.: Effective mapping of biomedical text to the umls metathesaurus: the metamap program. In: Proceedings of the AMIA Annual Symposium, pp (2001) 13. Pratt, W., Yetisgen-Yildiz, M.: A study of biomedical concept identification: Metamap vs. people. In: Proceedings of the AMIA Annual Symposium, pp (2003) 14. Yoo, I., Hu, X., Song, I.Y.: A coherent graph-based semantic clustering and summarization approach for biomedical literature and a new summarization evaluation method. BMC Bioinformatics 8(9) (2007) 15. Cohen, J.: A coefficient of agreement for nominal scales. Educational and Psychological Measurement 20, (1960) 16. Salton, G.: Introduction to Modern Information Retrieval. McGraw-Hill, New York (1983)
LABERINTO at ImageCLEF 2011 Medical Image Retrieval Task
LABERINTO at ImageCLEF 2011 Medical Image Retrieval Task Jacinto Mata, Mariano Crespo, Manuel J. Maña Dpto. de Tecnologías de la Información. Universidad de Huelva Ctra. Huelva - Palos de la Frontera s/n.
More informationIdentify Disorders in Health Records using Conditional Random Fields and Metamap
Identify Disorders in Health Records using Conditional Random Fields and Metamap AEHRC at ShARe/CLEF 2013 ehealth Evaluation Lab Task 1 G. Zuccon 1, A. Holloway 1,2, B. Koopman 1,2, A. Nguyen 1 1 The Australian
More informationAn Automated System for Conversion of Clinical Notes into SNOMED Clinical Terminology
An Automated System for Conversion of Clinical Notes into SNOMED Clinical Terminology Jon Patrick, Yefeng Wang and Peter Budd School of Information Technologies University of Sydney New South Wales 2006,
More informationBuilding a Spanish MMTx by using Automatic Translation and Biomedical Ontologies
Building a Spanish MMTx by using Automatic Translation and Biomedical Ontologies Francisco Carrero 1, José Carlos Cortizo 1,2, José María Gómez 3 1 Universidad Europea de Madrid, C/Tajo s/n, Villaviciosa
More informationImproving Health Records Search Using Multiple Query Expansion Collections
Improving Health Records Search Using Multiple Query Expansion Collections Dongqing Zhu and Ben Carterette Information Retrieval Lab University of Delaware Outline Introduction Retrieval Task & Data Retrieval
More informationFind the signal in the noise
Find the signal in the noise Electronic Health Records: The challenge The adoption of Electronic Health Records (EHRs) in the USA is rapidly increasing, due to the Health Information Technology and Clinical
More informationDiagnosis Code Assignment Support Using Random Indexing of Patient Records A Qualitative Feasibility Study
Diagnosis Code Assignment Support Using Random Indexing of Patient Records A Qualitative Feasibility Study Aron Henriksson 1, Martin Hassel 1, and Maria Kvist 1,2 1 Department of Computer and System Sciences
More informationOverview of the TREC 2012 Medical Records Track
Overview of the TREC 2012 Medical Records Track Ellen M. Voorhees National Institute of Standards and Technology William Hersh Oregon Health & Science University Abstract The TREC Medical Records track
More informationA Knowledge-Based Approach to Organizing Retrieved Documents
A Knowledge-Based Approach to Organizing Retrieved Documents Wanda Pratt Information & Computer Science University of California, Irvine Irvine, CA 92697-3425 pratt@ics.uci.edu Marti A. Hearst School of
More informationApproaches of Using a Word-Image Ontology and an Annotated Image Corpus as Intermedia for Cross-Language Image Retrieval
Approaches of Using a Word-Image Ontology and an Annotated Image Corpus as Intermedia for Cross-Language Image Retrieval Yih-Chen Chang and Hsin-Hsi Chen Department of Computer Science and Information
More informationKing Mongkut s University of Technology North Bangkok 4 Division of Business Computing, Faculty of Management Science
(IJCSIS) International Journal of Computer Science and Information Security, Ontology-supported processing of clinical text using medical knowledge integration for multi-label classification of diagnosis
More informationTMUNSW: Identification of disorders and normalization to SNOMED-CT terminology in unstructured clinical notes
TMUNSW: Identification of disorders and normalization to SNOMED-CT terminology in unstructured clinical notes Jitendra Jonnagaddala a,b,c Siaw-Teng Liaw *,a Pradeep Ray b Manish Kumar c School of Public
More informationFolksonomies versus Automatic Keyword Extraction: An Empirical Study
Folksonomies versus Automatic Keyword Extraction: An Empirical Study Hend S. Al-Khalifa and Hugh C. Davis Learning Technology Research Group, ECS, University of Southampton, Southampton, SO17 1BJ, UK {hsak04r/hcd}@ecs.soton.ac.uk
More informationMIRACLE at VideoCLEF 2008: Classification of Multilingual Speech Transcripts
MIRACLE at VideoCLEF 2008: Classification of Multilingual Speech Transcripts Julio Villena-Román 1,3, Sara Lana-Serrano 2,3 1 Universidad Carlos III de Madrid 2 Universidad Politécnica de Madrid 3 DAEDALUS
More informationExtraction of Radiology Reports using Text mining
Extraction of Radiology Reports using Text mining A.V.Krishna Prasad 1 Dr.S.Ramakrishna 2 Dr.D.Sravan Kumar 3 Dr.B.Padmaja Rani 4 1 Research Scholar S.V.University, Tirupathi & Associate Professor CS MIPGS,Hyderabad,
More informationToxiCat: Hybrid Named Entity Recognition services to support curation of the Comparative Toxicogenomic Database
ToxiCat: Hybrid Named Entity Recognition services to support curation of the Comparative Toxicogenomic Database Dina Vishnyakova 1,2, 4, *, Julien Gobeill 1,3,4, Emilie Pasche 1,2,3,4 and Patrick Ruch
More informationtemporal eligibility criteria; 3) Demonstration of the need for information fusion across structured and unstructured data in solving certain
How essential are unstructured clinical narratives and information fusion to clinical trial recruitment? Preethi Raghavan, MS, James L. Chen, MD, Eric Fosler-Lussier, PhD, Albert M. Lai, PhD The Ohio State
More informationLinking Specialized Online Medical Discussions to Online Medical Literature
Linking Specialized Online Medical Discussions to Online Medical Literature Sam Stewart 1, Syed Sibte Raza Abidi 1, Allen Finley 2 1 NICHE Research Group, Faculty of Computer Science, Dalhousie University,
More informationComparing Metamap to MGrep as a Tool for Mapping Free Text to Formal Medical Lexicons
Comparing Metamap to MGrep as a Tool for Mapping Free Text to Formal Medical Lexicons Samuel Alan Stewart* 1, Maia Elizabeth von Maltzahn 2, Syed Sibte Raza Abidi 1 1 NICHE Research Group, Dalhousie University,
More informationHow To Cluster On A Search Engine
Volume 2, Issue 2, February 2012 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: A REVIEW ON QUERY CLUSTERING
More informationElectronic Health Record (EHR) Standards Survey
Electronic Health Record (EHR) Standards Survey Compiled by: Simona Cohen, Amnon Shabo Date: August 1st, 2001 This report is a short survey about the main emerging standards that relate to EHR - Electronic
More informationIntelligent Search for Answering Clinical Questions Coronado Group, Ltd. Innovation Initiatives
Intelligent Search for Answering Clinical Questions Coronado Group, Ltd. Innovation Initiatives Search The Way You Think Copyright 2009 Coronado, Ltd. All rights reserved. All other product names and logos
More informationText Mining in Radiology Reports
Text Mining in Radiology Reports Tianxia Gong, Chew Lim Tan, Tze Yun Leong Department of Computer Science, School of Computing, National University of Singapore {gong tianxia, tancl, leongty}@comp.nus.edu.sg
More informationSEMANTICS ENABLED PROACTIVE AND TARGETED DISSEMINATION OF NEW MEDICAL KNOWLEDGE
SEMANTICS ENABLED PROACTIVE AND TARGETED DISSEMINATION OF NEW MEDICAL KNOWLEDGE Lakshmish Ramaswamy & I. Budak Arpinar Dept. of Computer Science, University of Georgia laks@cs.uga.edu, budak@cs.uga.edu
More informationAnnotating Medical Forms using UMLS
Annotating Medical Forms using UMLS Victor Christen 1, Anika Groß 1, Julian Varghese 2, Martin Dugas 2, Erhard Rahm 1 1 Department of Computer Science, Universität Leipzig, Germany 2 Institute of Medical
More informationDublin City University at CLEF 2004: Experiments with the ImageCLEF St Andrew s Collection
Dublin City University at CLEF 2004: Experiments with the ImageCLEF St Andrew s Collection Gareth J. F. Jones, Declan Groves, Anna Khasin, Adenike Lam-Adesina, Bart Mellebeek. Andy Way School of Computing,
More informationTransformation of Free-text Electronic Health Records for Efficient Information Retrieval and Support of Knowledge Discovery
Transformation of Free-text Electronic Health Records for Efficient Information Retrieval and Support of Knowledge Discovery Jan Paralic, Peter Smatana Technical University of Kosice, Slovakia Center for
More informationAutomated assignment of ICD-9-CM codes to radiology reports
Automated assignment of ICD-9-CM codes to radiology reports Richárd Farkas University of Szeged Filip Ginter University of Turku Overview Why clinical coding? Importance, use of automated coding Challenge
More informationExploring Electronic Medical Record (EMR) Using an Information Retrieval Perspective IEEE ICOCI 2006
Exploring Electronic Medical Record (EMR) Using an Information Retrieval Perspective IEEE ICOCI 2006 Nasriah Zakaria Faculty of Electrical and Electronic Engineering University Sains Malaysia Nibong Tebal,
More informationApplication of a Medical Text Indexer to an Online Dermatology Atlas
Application of a Medical Text Indexer to an Online Dermatology Atlas GR Kim, MD 1, AR Aronson, PhD 2, JG Mork, MS 2, BA Cohen, MD 3, CU Lehmann, MD 1 1 Division of Health Sciences Informatics, Johns Hopkins
More informationSearch and Information Retrieval
Search and Information Retrieval Search on the Web 1 is a daily activity for many people throughout the world Search and communication are most popular uses of the computer Applications involving search
More informationExploration and Visualization of Post-Market Data
Exploration and Visualization of Post-Market Data Jianying Hu, PhD Joint work with David Gotz, Shahram Ebadollahi, Jimeng Sun, Fei Wang, Marianthi Markatou Healthcare Analytics Research IBM T.J. Watson
More informationAn Information Retrieval System for Expert and Consumer Users
An Information Retrieval System for Expert and Consumer Users Rena Peraki, Euripides G.M. Petrakis, Angelos Hliaoutakis Department of Electronic and Computer Engineering Technical University of Crete (TUC)
More informationAutomated Problem List Generation from Electronic Medical Records in IBM Watson
Proceedings of the Twenty-Seventh Conference on Innovative Applications of Artificial Intelligence Automated Problem List Generation from Electronic Medical Records in IBM Watson Murthy Devarakonda, Ching-Huei
More information2. SYSTEM ARCHITECTURE CLINICAL DECISION SUPPORT TRACK 1. INTRODUCTION
San Francisco State University at TREC 2014: Clinical Decision Support Track and Microblog Track Aayush Bhandari, James Klinkhammer, and Anagha Kulkarni Department of Computer Science, San Francisco State
More informationA Framework for Personalized Healthcare Service Recommendation
A Framework for Personalized Healthcare Service Recommendation Choon-oh Lee, Minkyu Lee, Dongsoo Han School of Engineering Information and Communications University (ICU) Daejeon, Korea {lcol, niklaus,
More informationA VOICE MEDIATED SYSTEM FOR STRUCTURED ENTRY OF MEDICAL DATA
A VOICE MEDIATED SYSTEM FOR STRUCTURED ENTRY OF MEDICAL DATA IST-2001-38143 ATVN-EU-GP Conference Applications Relating to Health 1-3 December 2005, Pułtusk near Warsaw, Poland 1 AGENDA AND TIME SCHEDULE
More informationElection of Diagnosis Codes: Words as Responsible Citizens
Election of Diagnosis Codes: Words as Responsible Citizens Aron Henriksson and Martin Hassel Department of Computer & System Sciences (DSV), Stockholm University Forum 100, 164 40 Kista, Sweden {aronhen,xmartin}@dsv.su.se
More informationTerm extraction for user profiling: evaluation by the user
Term extraction for user profiling: evaluation by the user Suzan Verberne 1, Maya Sappelli 1,2, Wessel Kraaij 1,2 1 Institute for Computing and Information Sciences, Radboud University Nijmegen 2 TNO,
More informationNatural Language Processing Supporting Clinical Decision Support
Natural Language Processing Supporting Clinical Decision Support Applications for Enhancing Clinical Decision Making NIH Worksop; Bethesda, MD, April 24, 2012 Stephane M. Meystre, MD, PhD Department of
More informationCIRGIRDISCO at RepLab2014 Reputation Dimension Task: Using Wikipedia Graph Structure for Classifying the Reputation Dimension of a Tweet
CIRGIRDISCO at RepLab2014 Reputation Dimension Task: Using Wikipedia Graph Structure for Classifying the Reputation Dimension of a Tweet Muhammad Atif Qureshi 1,2, Arjumand Younus 1,2, Colm O Riordan 1,
More informationAhmed AlBarrak PhD Medical Informatics Associate Professor, Family & Community Med. Chairman, Medical Informatics Department College of Medicine King
Ahmed AlBarrak PhD Medical Informatics Associate Professor, Family & Community Med. Chairman, Medical Informatics Department College of Medicine King Saud University albarrak@ksu.edu.sa What are Medical
More informationFree Text Phrase Encoding and Information Extraction from Medical Notes. Jennifer Shu
Free Text Phrase Encoding and Information Extraction from Medical Notes by Jennifer Shu Submitted to the Department of Electrical Engineering and Computer Science in partial fulfillment of the requirements
More informationExtending Software Quality Models - A Sample In The Domain of Semantic Technologies
Extending Software Quality Models - A Sample In The Domain of Semantic Technologies Filip Radulovic Ontology Engineering Group Departamento de Inteligencia Artificial Facultad de Informática, Universidad
More informationMedical-Miner at TREC 2011 Medical Records Track
Medical-Miner at TREC 2011 Medical Records Track 1 J.M. Córdoba, 1 M.J. Maña, 1 N.P. Cruz, 1 J. Mata, 2 F. Aparicio, 2 M. Buenaga, 3 D. Glez-Peña, 3 F. Fdez-Riverola 1 Universidad de Huelva 2 Universidad
More informationNatural Language to Relational Query by Using Parsing Compiler
Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 4, Issue. 3, March 2015,
More informationTheodoros. N. Arvanitis, RT, DPhil, CEng, MIET, MIEEE, AMIA, FRSM
TRANSFoRm Theodoros. N. Arvanitis, RT, DPhil, CEng, MIET, MIEEE, AMIA, FRSM Biomedical Informatics, Signals & Systems Research Laboratory School of Electronic, Electrical & Computer Engineering College
More informationTITLE Dori Whittaker, Director of Solutions Management, M*Modal
TITLE Dori Whittaker, Director of Solutions Management, M*Modal Challenges Impacting Clinical Documentation HITECH Act, Meaningful Use EHR mandate and adoption Need for cost savings Migration to ICD 10
More informationAutomatic Code Assignment to Medical Text
Automatic Code Assignment to Medical Text Koby Crammer and Mark Dredze and Kuzman Ganchev and Partha Pratim Talukdar Department of Computer and Information Science, University of Pennsylvania, Philadelphia,
More informationBiTeM group report for TREC Medical Records Track 2011
BiTeM group report for TREC Medical Records Track 2011 J. Gobeill a, A. Gaudinat a, E. Pasche b, D. Teodoro b, D. Vishnyakova b, P. Ruch a a BiTeM group, University of Applied Sciences, Information Studies,
More informationTaxonomy learning factoring the structure of a taxonomy into a semantic classification decision
Taxonomy learning factoring the structure of a taxonomy into a semantic classification decision Viktor PEKAR Bashkir State University Ufa, Russia, 450000 vpekar@ufanet.ru Steffen STAAB Institute AIFB,
More informationTravis Goodwin & Sanda Harabagiu
Automatic Generation of a Qualified Medical Knowledge Graph and its Usage for Retrieving Patient Cohorts from Electronic Medical Records Travis Goodwin & Sanda Harabagiu Human Language Technology Research
More informationMedical Information-Retrieval Systems. Dong Peng Medical Informatics Group
Medical Information-Retrieval Systems Dong Peng Medical Informatics Group Outline Evolution of medical Information-Retrieval (IR). The information retrieval process. The trend of medical information retrieval
More informationA Joint Model of Feature Mining and Sentiment Analysis for Product Review Rating
A Joint Model of Feature Mining and Sentiment Analysis for Product Review Rating Jorge Carrillo de Albornoz, Laura Plaza, Pablo Gervás, and Alberto Díaz Universidad Complutense de Madrid, Departamento
More informationAn Overview of a Role of Natural Language Processing in An Intelligent Information Retrieval System
An Overview of a Role of Natural Language Processing in An Intelligent Information Retrieval System Asanee Kawtrakul ABSTRACT In information-age society, advanced retrieval technique and the automatic
More informationApplication of ontologies for the integration of network monitoring platforms
Application of ontologies for the integration of network monitoring platforms Jorge E. López de Vergara, Javier Aracil, Jesús Martínez, Alfredo Salvador, José Alberto Hernández Networking Research Group,
More informationToward a Natural Language Interface for EHR Questions
Toward a Natural Language Interface for EHR Questions Kirk Roberts, PhD, and Dina Demner-Fushman, MD, PhD Lister Hill National Center for Biomedical Communications U.S. National Library of Medicine Abstract
More informationBuilding a Question Classifier for a TREC-Style Question Answering System
Building a Question Classifier for a TREC-Style Question Answering System Richard May & Ari Steinberg Topic: Question Classification We define Question Classification (QC) here to be the task that, given
More informationIdentifying Focus, Techniques and Domain of Scientific Papers
Identifying Focus, Techniques and Domain of Scientific Papers Sonal Gupta Department of Computer Science Stanford University Stanford, CA 94305 sonal@cs.stanford.edu Christopher D. Manning Department of
More informationInteractive Dynamic Information Extraction
Interactive Dynamic Information Extraction Kathrin Eichler, Holmer Hemsen, Markus Löckelt, Günter Neumann, and Norbert Reithinger Deutsches Forschungszentrum für Künstliche Intelligenz - DFKI, 66123 Saarbrücken
More informationPractical Implementation of a Bridge between Legacy EHR System and a Clinical Research Environment
Cross-Border Challenges in Informatics with a Focus on Disease Surveillance and Utilising Big-Data L. Stoicu-Tivadar et al. (Eds.) 2014 The authors. This article is published online with Open Access by
More informationPOSBIOTM-NER: A Machine Learning Approach for. Bio-Named Entity Recognition
POSBIOTM-NER: A Machine Learning Approach for Bio-Named Entity Recognition Yu Song, Eunji Yi, Eunju Kim, Gary Geunbae Lee, Department of CSE, POSTECH, Pohang, Korea 790-784 Soo-Jun Park Bioinformatics
More informationSemantic Relatedness Metric for Wikipedia Concepts Based on Link Analysis and its Application to Word Sense Disambiguation
Semantic Relatedness Metric for Wikipedia Concepts Based on Link Analysis and its Application to Word Sense Disambiguation Denis Turdakov, Pavel Velikhov ISP RAS turdakov@ispras.ru, pvelikhov@yahoo.com
More informationTerminology Extraction from Log Files
Terminology Extraction from Log Files Hassan Saneifar 1,2, Stéphane Bonniol 2, Anne Laurent 1, Pascal Poncelet 1, and Mathieu Roche 1 1 LIRMM - Université Montpellier 2 - CNRS 161 rue Ada, 34392 Montpellier
More informationInteractive Recovery of Requirements Traceability Links Using User Feedback and Configuration Management Logs
Interactive Recovery of Requirements Traceability Links Using User Feedback and Configuration Management Logs Ryosuke Tsuchiya 1, Hironori Washizaki 1, Yoshiaki Fukazawa 1, Keishi Oshima 2, and Ryota Mibe
More informationAtigeo at TREC 2014 Clinical Decision Support Task
Atigeo at TREC 2014 Clinical Decision Support Task Yishul Wei*, ChenChieh Hsu*, Alex Thomas, Joseph F. McCarthy Atigeo, LLC 800 Bellevue Way NE, Suite 600 Bellevue, WA 98004 USA joe.mccarthy@atigeo.com
More informationBig Data Text Mining and Visualization. Anton Heijs
Copyright 2007 by Treparel Information Solutions BV. This report nor any part of it may be copied, circulated, quoted without prior written approval from Treparel7 Treparel Information Solutions BV Delftechpark
More information1 o Semestre 2007/2008
Departamento de Engenharia Informática Instituto Superior Técnico 1 o Semestre 2007/2008 Outline 1 2 3 4 5 Outline 1 2 3 4 5 Exploiting Text How is text exploited? Two main directions Extraction Extraction
More informationIntroduction to IR Systems: Supporting Boolean Text Search. Information Retrieval. IR vs. DBMS. Chapter 27, Part A
Introduction to IR Systems: Supporting Boolean Text Search Chapter 27, Part A Database Management Systems, R. Ramakrishnan 1 Information Retrieval A research field traditionally separate from Databases
More informationThe XLDB Group at CLEF 2004
The XLDB Group at CLEF 2004 Nuno Cardoso, Mário J. Silva, and Miguel Costa Grupo XLDB - Departamento de Informática, Faculdade de Ciências da Universidade de Lisboa {ncardoso, mjs, mcosta} at xldb.di.fc.ul.pt
More informationSupporting Software Development Process Using Evolution Analysis : a Brief Survey
Supporting Software Development Process Using Evolution Analysis : a Brief Survey Samaneh Bayat Department of Computing Science, University of Alberta, Edmonton, Canada samaneh@ualberta.ca Abstract During
More informationExperiments in Web Page Classification for Semantic Web
Experiments in Web Page Classification for Semantic Web Asad Satti, Nick Cercone, Vlado Kešelj Faculty of Computer Science, Dalhousie University E-mail: {rashid,nick,vlado}@cs.dal.ca Abstract We address
More informationHow To Understand And Understand A Negative In Bbg
Some Aspects of Negation Processing in Electronic Health Records Svetla Boytcheva 1, Albena Strupchanska 2, Elena Paskaleva 2 and Dimitar Tcharaktchiev 3 1 Department of Information Technologies, Faculty
More informationSearch Trails using User Feedback to Improve Video Search
Search Trails using User Feedback to Improve Video Search *Frank Hopfgartner * David Vallet *Martin Halvey *Joemon Jose *Department of Computing Science, University of Glasgow, Glasgow, United Kingdom.
More informationCINDOR Conceptual Interlingua Document Retrieval: TREC-8 Evaluation.
CINDOR Conceptual Interlingua Document Retrieval: TREC-8 Evaluation. Miguel Ruiz, Anne Diekema, Páraic Sheridan MNIS-TextWise Labs Dey Centennial Plaza 401 South Salina Street Syracuse, NY 13202 Abstract:
More informationBrunswick, NJ: Ometeca Institute, Inc., 2005. 617 pp. $35.00. (0-9763547-0-5).
Information Retrieval Design. James D. Anderson and Jose Perez-Carballo. East Brunswick, NJ: Ometeca Institute, Inc., 2005. 617 pp. $35.00. (0-9763547-0-5). Information Retrieval Design is a textbook that
More informationProteinQuest user guide
ProteinQuest user guide 1. Introduction... 3 1.1 With ProteinQuest you can... 3 1.2 ProteinQuest basic version 4 1.3 ProteinQuest extended version... 5 2. ProteinQuest dictionaries... 6 3. Directions for
More informationClustering Technique in Data Mining for Text Documents
Clustering Technique in Data Mining for Text Documents Ms.J.Sathya Priya Assistant Professor Dept Of Information Technology. Velammal Engineering College. Chennai. Ms.S.Priyadharshini Assistant Professor
More informationUsing Discharge Summaries to Improve Information Retrieval in Clinical Domain
Using Discharge Summaries to Improve Information Retrieval in Clinical Domain Dongqing Zhu 1, Wu Stephen 2, Masanz James 2, Ben Carterette 1, and Hongfang Liu 2 1 University of Delaware, 101 Smith Hall,
More informationSoftware Engineering of NLP-based Computer-assisted Coding Applications
Software Engineering of NLP-based Computer-assisted Coding Applications 1 Software Engineering of NLP-based Computer-assisted Coding Applications by Mark Morsch, MS; Carol Stoyla, BS, CLA; Ronald Sheffer,
More informationInterface Terminology to Facilitate the Problem List Using SNOMED CT and other Terminology Standards
Interface Terminology to Facilitate the Problem List Using SNOMED CT and other Terminology Standards Kshitij Saxena MD, MHSA Regional Medical Director, Adventist Health System Agenda Introduction Problem
More information3M Health Information Systems
3M Health Information Systems 1 Data Governance Disparate Systems Interoperability Information Exchange Reporting Public Health Quality Metrics Research Data Warehousing Data Standards What is the 3M Healthcare
More informationHELP DESK SYSTEMS. Using CaseBased Reasoning
HELP DESK SYSTEMS Using CaseBased Reasoning Topics Covered Today What is Help-Desk? Components of HelpDesk Systems Types Of HelpDesk Systems Used Need for CBR in HelpDesk Systems GE Helpdesk using ReMind
More informationProtein Protein Interaction Networks
Functional Pattern Mining from Genome Scale Protein Protein Interaction Networks Young-Rae Cho, Ph.D. Assistant Professor Department of Computer Science Baylor University it My Definition of Bioinformatics
More informationSPATIAL DATA CLASSIFICATION AND DATA MINING
, pp.-40-44. Available online at http://www. bioinfo. in/contents. php?id=42 SPATIAL DATA CLASSIFICATION AND DATA MINING RATHI J.B. * AND PATIL A.D. Department of Computer Science & Engineering, Jawaharlal
More informationProblem-Centered Care Delivery
HOW INTERFACE TERMINOLOGY MAKES STANDARDIZED HEALTH INFORMATION POSSIBLE Terminologies ensure that the languages of medicine can be understood by both humans and machines. by June Bronnert, RHIA, CCS,
More informationA Statistical Text Mining Method for Patent Analysis
A Statistical Text Mining Method for Patent Analysis Department of Statistics Cheongju University, shjun@cju.ac.kr Abstract Most text data from diverse document databases are unsuitable for analytical
More informationErnestina Menasalvas Universidad Politécnica de Madrid
Ernestina Menasalvas Universidad Politécnica de Madrid EECA Cluster networking event RITA 12th november 2014, Baku Sectors/Domains Big Data Value Source Public administration EUR 150 billion to EUR 300
More informationModeling Temporal Data in Electronic Health Record Systems
International Journal of Information Science and Intelligent System, 3(3): 51-60, 2014 Modeling Temporal Data in Electronic Health Record Systems Chafiqa Radjai 1, Idir Rassoul², Vytautas Čyras 3 1,2 Mouloud
More informationUnsupervised Extraction of Diagnosis Codes from EMRs Using Knowledge-Based and Extractive Text Summarization Techniques
Unsupervised Extraction of Diagnosis Codes from EMRs Using Knowledge-Based and Extractive Text Summarization Techniques Ramakanth Kavuluru 1,2, Sifei Han 2, and Daniel Harris 2 1 Division of Biomedical
More information11-792 Software Engineering EMR Project Report
11-792 Software Engineering EMR Project Report Team Members Phani Gadde Anika Gupta Ting-Hao (Kenneth) Huang Chetan Thayur Suyoun Kim Vision Our aim is to build an intelligent system which is capable of
More informationMED 2400 MEDICAL INFORMATICS FUNDAMENTALS
MED 2400 MEDICAL INFORMATICS FUNDAMENTALS NEW YORK CITY COLLEGE OF TECHNOLOGY The City University Of New York School of Arts and Sciences Biological Sciences Department Course title: Course code: MED 2400
More informationKNOWLEDGE-BASED IN MEDICAL DECISION SUPPORT SYSTEM BASED ON SUBJECTIVE INTELLIGENCE
JOURNAL OF MEDICAL INFORMATICS & TECHNOLOGIES Vol. 22/2013, ISSN 1642-6037 medical diagnosis, ontology, subjective intelligence, reasoning, fuzzy rules Hamido FUJITA 1 KNOWLEDGE-BASED IN MEDICAL DECISION
More informationPharmaSUG2011 Paper HS03
PharmaSUG2011 Paper HS03 Using SAS Predictive Modeling to Investigate the Asthma s Patient Future Hospitalization Risk Yehia H. Khalil, University of Louisville, Louisville, KY, US ABSTRACT The focus of
More informationClassification and Prioritization of Biomedical Literature for the Comparative Toxicogenomics Database
Classification and Prioritization of Biomedical Literature for the Comparative Toxicogenomics Database Dina VISHNYAKOVA a,b,d,1, Emilie PASCHE a,b,d, Julien GOBEILL a,c,d, Arnaud GAUDINAT a,c,d, Christian
More informationI. The SMART Project - Status Report and Plans. G. Salton. The SMART document retrieval system has been operating on a 709^
1-1 I. The SMART Project - Status Report and Plans G. Salton 1. Introduction The SMART document retrieval system has been operating on a 709^ computer since the end of 1964. The system takes documents
More informationTECHNICAL REPORT WRITING GUIDELINES
TECHNICAL REPORT WRITING GUIDELINES Prepared by LEAH M. AKINS and JEFFERSON H. AKINS for TECHNICAL/ENGINEERING STUDENTS ABSTRACT This document specifies the recommended format to be used when submitting
More informationA Software Tool for Thesauri Management, Browsing and Supporting Advanced Searches
J. Nogueras-Iso, J.A. Bañares, J. Lacasta, J. Zarazaga-Soria 105 A Software Tool for Thesauri Management, Browsing and Supporting Advanced Searches J. Nogueras-Iso, J.A. Bañares, J. Lacasta, J. Zarazaga-Soria
More informationA Method for Automatic De-identification of Medical Records
A Method for Automatic De-identification of Medical Records Arya Tafvizi MIT CSAIL Cambridge, MA 0239, USA tafvizi@csail.mit.edu Maciej Pacula MIT CSAIL Cambridge, MA 0239, USA mpacula@csail.mit.edu Abstract
More information