An Information Extraction Framework for Cohort Identification Using Electronic Health Records
|
|
|
- Tamsyn Bates
- 10 years ago
- Views:
Transcription
1 An Information Extraction Framework for Cohort Identification Using Electronic Health Records Hongfang Liu PhD 1, Suzette J. Bielinski PhD 1, Sunghwan Sohn PhD 1, Sean Murphy 1, Kavishwar B. Wagholikar MBBS PhD 1, Siddhartha R. Jonnalagadda PhD 1, Ravikumar K.E. PhD 1, Stephen T. Wu PhD 1, Iftikhar J. Kullo MD 2, Christopher G Chute MD PhD 1 1 Department of Health Sciences Research, 2 Division of Cardiovascular Diseases, Mayo Clinic, Rochester, MN Abstract Information extraction (IE), a natural language processing (NLP) task that automatically extracts structured or semi-structured information from free text, has become popular in the clinical domain for supporting automated systems at point-of-care and enabling secondary use of electronic health records (EHRs) for clinical and translational research. However, a high performance IE system can be very challenging to construct due to the complexity and dynamic nature of human language. In this paper, we report an IE framework for cohort identification using EHRs that is a knowledge-driven framework developed under the Unstructured Information Management Architecture (UIMA). A system to extract specific information can be developed by subject matter experts through expert knowledge engineering of the externalized knowledge resources used in the framework. Introduction With the rapid adoption of Electronic Health Records (EHRs), it is desirable to harvest the information and knowledge in EHRs to support automated systems at point-of-care and to enable secondary use of EHRs for clinical and translational research. Much of the EHR data is in free text form. Comparing to structured data, free text is a more conventional way in the health care environment to express concepts and events. The free text records are generated by automated or manual transcription of dictation recordings and direct entry by the care providers. However, free text is very challenging for searching, summarization, decision-support, or statistical analysis. To reduce medical errors, improve health care quality, and enable secondary use of EHRs, information extraction (IE), which structures and encodes clinical information stored in free text, is necessary. Approaches to IE are based on either symbolic techniques (e.g., NegEx 1 ) or statistical machine learning. Clinical IE applications using symbolic techniques can be cumbersome to implement and may lack portability. IE applications based on statistical NLP techniques require annotated examples and are easy to apply but may not accurately capture the relationships among words in a document such as negation. As pointed out by Wagholikar et al. 2 and Chapman et al. 3, when a task involves a specific subdomain (as in preventive care decision support of cervical cancer) or a limited number of named entities (as in detection of influenza), sublanguage analysis detecting subdomain semantics combined with contextual information detection and expert knowledge engineering is a viable approach. Through a collaboration with IBM, Mayo Clinic has developed a system called Mayo Clinic Information Extraction system used to process all clinical notes available in the Enterprise Data Trust (EDT) 4. A variation of this system named clinical Text Analysis and Knowledge Extraction System (ctakes) has been released under an Apache open source license through the open health NLP consortium (OHNLP) 5. Additional modules have been implemented within ctakes recently including a smoking status identification module 6, a refined drug information extraction module, and a side effect extraction module 7. Here, we report a new IE framework for extracting named entities and their corresponding contextual information under ctakes that is purely knowledge-driven. We will assume named entities are handcrafted based on expert knowledge with or without sublanguage analysis. We test the framework through implementing a couple of NLP components for cohort identification. In the following, we describe background information of the modules in the IE framework. We then describe the system in detail followed by case studies on two emerge phenotypes to evaluate the IE framework. Background In this section, we describe the background information about section and contextual information detection. We then describe two Electronic Medical Records and Genomics (emerge) phenotypes used in evaluating the IE framework. Section Detection - Clinical notes are often divided into sections, or segments, such as "history of present illness" or "past medical history." These sections may have subsections as well, such as the "cardiovascular exam" section of the "physical exam." One can gain greater understanding of clinical notes by recognition of the section in which a
2 name entity locates. For instance, both "past medical history" and "family medical history" sections can contain a list of diseases, but the context information is very different. Section tagging is an early step in NLP applications for clinical notes. One such system for section detection is SecTag which recognizes section headers through terminology lookup, machine learning, spelling correction, and scoring techniques 8,9. The terminology used by SecTag provides a list of concepts that represent particular section headings by extending Logical Observation Identifiers Names and Codes (LOINC ). Each concept in SecTag has one or more synonyms that may be used to specify a section in an actual note. Contextual Information Contextual information of a condition includes: negation (is the condition negated or not), temporality (historical or current), and experiencer (who has the condition). ConText is a system that determines the values for the above three contextual properties of a clinical condition 10. The contextual property negation specifies the status of the clinical existence of a condition. The default value of this property is affirmed. If a clinical condition occurs within the scope of a trigger term for negation, ConText will change the default value to negative. For example, in the sentence The patient denies any nausea, the value of negation for the condition nausea will be negated. The emerge consortium - The emerge consortium was organized and funded by the National Human Genome Research Institute (NHGRI) and the National Institute of General Medical Sciences (NIGMS) to develop, disseminate and apply approaches for combining DNA biorepositories with the EHR for large-scale, highthroughput genetic research 11. The emerge consortium has demonstrated the applicability and portability of EHR derived phenotype algorithms using different types and modalities of clinical data for algorithm execution including billing and diagnoses codes, NLP, laboratory measurements, patient procedure encounters, and medication data 12,13. We implemented the NLP component for the following two emerge phenotypes where Mayo has been the primary site of developing the algorithms: Peripheral arterial disease (PAD) PAD is a highly prevalent disease affecting about 8 million individuals aged 40 years or older in the US with nearly 20% of the elderly (>70y) patients seen in general medical practice affected by the disease. It is associated with significant mortality and morbidity, underscoring the necessity of a rigorous investigation of factors that influence susceptibility to PAD. An NLP system has already been developed under UIMA for detecting PAD cases from radiology notes in An evaluation on a test data of manually annotated 455 Mayo cases indicated that the accuracy agreement between the 2010 system and the gold standard was However, other emerge sites require a substantial amount of time to deploy the 2010 system. Here, we reimplemented the algorithm by referring to the official PAD algorithm available publicly aiming to demonstrate that a customized NLP engine can be developed efficiently. Heart failure (HF) HF is a complex disease in which the heart is unable to supply sufficient blood flow to the body and is diagnosed based on the presence of clinical symptoms and further characterized by cardiac ejection fraction (i.e. reduced or preserved). In 2010, HF affected 6.6 million Americans at a cost of 34.4 billion 15,16. However, the syndromic nature of HF presents challenges to identify HF cases and controls from EHR data for research given that the diagnosis relies on clinical evaluation. Mayo is in the process of developing the HF phenotyping algorithm. We have conducted sublanguage analysis and derived knowledge-based rules for HF that can be executed on clinical notes to identify HF patients. System Description As mentioned, our IE framework is knowledge-driven and developed under Unstructured Information Management Architecture (UIMA) which has been widely adopted to implement systems for processing unstructured content 17. Different UIMA components can be combined to create a pipeline of modular tools, and all components use the same data structure, the Common Analysis Structure (CAS). In general, a UIMA pipeline consists of three types of components, a Collection Reader for accessing the documents from a source and initializing a CAS object for each document. The analysis of the documents is performed by Analysis Engines that add annotations to the CAS objects. Finally, CAS Consumers are used for final processing, e.g., for exporting the annotated information to a format that can be used for downstream analysis (e.g., building machine learning classifiers). Figure 1 shows an overall architecture of the system as well as an example of knowledge needed. After initializing a document to a CAS object, sentences in the document are detected, tokens and chunks are generated. Section detection is then performed so that we can specify sections to extract information. The default section dictionary is the SecTag terminology supplemented with section synonyms acquired from i2b NLP corpora 18. The information extracted can be of two types: concept mention (CM) or matching (MATCH) where the context information detection (adapted from ConText) is performed only on all CM instances. There are three knowledge
3 Figure 1. System architecture of the IE framework under ctakes. components: regular expression, normalization and match rule, for executing the IE engine where the regular expression component specifies patterns used in the match rule components and normalization target is to specify the target form of a regular expression. For example, severe occlusion and high-grade occlusion in diagnosis or impression sections will be matched instances and they are normalized to pos_occlusion. Those knowledge components are externalized from the IE engine to facilitate customizability and maintenance. Experiment To evaluate the described IE framework, we implemented two emerge phenotyping NLP algorithms where one (PAD) has been defined in emerge I and the other (HF) is in the process of development in emerge II. The following describes the experiments in detail. Peripheral arterial disease (PAD) We downloaded the emerge PAD algorithm 19 and crafted three knowledge components used in the IE framework. We evaluated the performance of the implementation by classifying each of the 455 documents (i.e., the gold standard data set used to evaluate the PAD algorithm deployed back in 2010) into two classes: Class I - positive and probable documents, and Class II - negative and unknown documents. The knowledge components include two groups of concepts: anatomical concepts and disorder/procedure concepts. Each radiology note was processed through the IE Engine and then classified using the following rule: if a document has a sentence containing one positive mention from anatomical terms and one positive mention from disorder/procedure terms, we classify the document as Class I, otherwise, Class II. We reported the error matrix where various performance metrics can be derived. Heart failure (HF) To develop HF algorithms, we started with sublanguage analysis to acquire terms and their corresponding context information for HF patients. The data set used for sublanguage analysis includes 706 HF patients from the Heart Failure in the Community Cohort (HL72435), a gold standard cohort of manually abstracted cases defined according to Framingham Heart Failure Criteria 20. Structured EHR data (e.g., billing and diagnoses codes, echocardiography measurement, and lab values) was combined with analyses of unstructured data (clinical notes) to identify the set of parameters needed to capture all the cases. This preliminary version of the algorithm was executed in 6,307 subjects in the Mayo Genome Consortia (MayoGC 21 ), a large cohort of Mayo Clinic patients with EHR-linked genotype data and 616 emerge participants. Results and Discussion The implementation of the PAD algorithm under the new IE framework took less than one hour. Table 1 shows the contingency matrix for this new PAD implementation. The accuracy of the algorithm is 88% (399= divided by 455) which is acceptable but less than 93% achieved by the 2010 PAD system as reported previously 14. Note that we could not define the classification the same way as previously reported 14 since there are only three classes
4 Gold Standard (positive, probable, and negative) in the current version of the gold standard while there were four classes (positive, probably, negative, and unknown) in the previous version of the gold standard. Utilizing an existing terminology source, the UMLS, over 100 HF terms were picked by domain experts. Sublanguage analyses of the clinical notes identified six common terms present in 90% of cases from the HF Cohort: multi-organ failure, cardiac failure, heart failure, CHF, LVF, ventricular failure, under major problem or chief complaint sections. The NLP algorithm for HF is to identify the presence of those six common terms in major problem and chief complaint sections with contextual information as non-negative and non-probable. Table 2 shows the preliminary result when comparing the NLP algorithm and the usual cohort identification based on primary HF diagnosis codes (428.X). Among 706 patients in an existing HF cohort (i.e., the patients are all HF patients), there are 586 (83%) of them identified by both methods, 72 (10%) of them are identifiable only by ICD codes and 41 (6%) identifiable by only the NLP algorithm. There are 7 cases not identifiable by both methods. Combining the NLP algorithm with ICD9 codes has a recall of 99%. In the MayoGC/eMERGE Cohort, we identified 535 patients agreed by both as HF cases while additional 94 patients are identified by HF diagnosis codes as cases and 684 patients are identified only by the NLP algorithm as cases. The remaining 5,610 patients in MayoGC/eMERGE cohort are identified as non-hf cases. Abstraction of the 94 code-only cases revealed that 79% were due to coding errors with the remaining missing an NLP hit due to the lack of electronic notes (i.e. referral patients, HF predated start of EHR, or uncommon HF terms used). Preliminary abstraction of 50 of the NLP only patients demonstrated that about 40% were true HF cases. Table 1. Statistics of contingency matrix on the implementation of PAD algorithm. System Output Class I Class II (Positive or Probable) (Negative or Unknown) Class I (Positive or Probable) Class II (Negative or UnKnown) Table 2. Statistics of patients identified by the HF algorithm. Cohort N (subjects) Code and NLP Code Only NLP Only Neither HF Case Cohort (83%) 72 (10%) 41 (6%) 7 (1%) MayoGC/eMERGE Cohort (8%) 94 (1%) 684 (10%) 5610 (81%) Note that our downstream classification of PAD documents is based on one simple rule (i.e., the co-occurrence of a positive anatomical mention and a positive disorder/procedure mention) compared to the implementation reported previously. The rationale behind this is that manually crafted complex decision rules generally have poor portability across different institutions and we can always tend to statistical machine learning for automated inference. High throughput phenotyping using EHR data is challenging. Our results on HF phenotyping have shown that reliance on structured data such as ICD9 diagnosis codes is insufficient to accurately identify cases. The preliminary data reported for HF cases suggests that a multi-modal approach to phenotyping can substantially improve our ability to accurately identify HF cases and non-cases. The proposed framework facilitates the construction of IE systems by subject matter experts or study coordinators who may not be familiar with NLP. Further studies are required to test the cross-institution portability of the systems using the IE framework. The reported IE framework will be open source to the research community. Conclusion In this paper, we have presented a knowledge-driven IE framework for cohort identification through externalizing knowledge components needed for section and contextual information extraction as well as the specific IE tasks in hand. Through implementing two emerge phenotyping NLP algorithms, we demonstrate the framework can be used for fast implementation of IE systems for extracting specific clinical information taking sections and contextual information into consideration. Acknowledgement
5 This work was supported by National Institutes of Health: emerge II (HG06379) and Heart Failure in the Community (R01 HL72435). References 1. Chapman WW, Bridewell W, Hanbury P, Cooper GF, Buchanan BG. Evaluation of negation phrases in narrative clinical reports. Proc AMIA Symp. 2001: Wagholikar KB, Maclaughlin KL, Henry MR, et al. Clinical decision support with automated text processing for cervical cancer screening. Journal of the American Medical Informatics Association: JAMIA. Sep ;19(5): Chapman WW, Gundlapalli AV, South BR, Dowling JN. Natural language processing for biosurveillance. Infectious Disease Informatics and Biosurveillance. 2011: Chute CG, Beck SA, Fisk TB, Mohr DN. The Enterprise Data Trust at Mayo Clinic: a semantically integrated warehouse of biomedical data. Journal of American Medical Informatics Association: JAMIA. Mar-Apr 2010;17(2): Savova GK, Masanz JJ, Ogren PV, et al. Mayo clinical Text Analysis and Knowledge Extraction System (ctakes): architecture, component evaluation and applications. Journal of the American Medical Informatics Association : JAMIA. Sep-Oct 2010;17(5): Sohn S, Savova GK. Mayo clinic smoking status classification system: extensions and improvements. Proc AMIA Symp. 2009:: Sohn S, Kocher JP, Chute CG, Savova GK. Drug side effect extraction from clinical narratives of psychiatry and psychology patients. Journal of the American Medical Informatics Association : JAMIA. Dec 2011;18 Suppl 1:i Denny JC, Spickard A, 3rd, Johnson KB, Peterson NB, Peterson JF, Miller RA. Evaluation of a method to identify and categorize section headers in clinical documents. Journal of the American Medical Informatics Association : JAMIA. Nov-Dec 2009;16(6): Denny JC, Miller RA, Johnson KB, Spickard A, 3rd. Development and evaluation of a clinical note section header terminology. ProcAMIA Symp. 2008: Harkema H, Dowling JN, Thornblade T, Chapman WW. ConText: an algorithm for determining negation, experiencer, and temporal status from clinical reports. J Biomed Inform. Oct 2009;42(5): Kho AN, Pacheco JA, Peissig PL, et al. Electronic Medical Records for Genetic Research: Results of the emerge Consortium. Science Translational Medicine. April 20, 2011;3(79):79re Kullo IJ, Ding K, Jouni H, Smith CY, Chute CG. A genome-wide association study of red blood cell traits using the electronic medical record. PLoS ONE. 2010;5(9). 13. Kullo IJ, Fan J, Pathak J, Savova GK, Ali Z, Chute CG. Leveraging informatics for genetic studies: use of the electronic medical record to enable a genome-wide association study of peripheral arterial disease. Journal of the American Medical Informatics Association: JAMIA. Sep-Oct 2010;17(5): Savova GK, Fan J, Ye Z, et al. Discovering peripheral arterial disease cases from radiology notes using natural language processing. Proc AMIA Symp. 2010: Roger VL, Go AS, Lloyd-Jones DM, et al. Heart Disease and Stroke Statistics Update: A Report From the American Heart Association. Circulation. Jan ;125(1):e2-e Heidenreich PA, Trogdon JG, Khavjou OA, et al. Forecasting the future of cardiovascular disease in the United States: a policy statement from the American Heart Association. Circulation. Mar ;123(8): Selkoe DJ. Alzheimer's disease: genes, proteins, and therapy. Physiol. Rev. 2001;81: %L Uzuner O, South BR, Shen S, DuVall SL i2b2/va challenge on concepts, assertions, and relations in clinical text. Journal of the American Medical Informatics Association: JAMIA. Sep-Oct 2011;18(5): emerge Library of Phenotype Algorithms Roger VL, Weston SA, Redfield MM, et al. Trends in heart failure incidence and survival in a community-based population. JAMA. Jul ;292(3): Bielinski SJ, Chai HS, Pathak J, et al. Mayo Genome Consortia: A Genotype-Phenotype Resource for Genome- Wide Association Studies With an Application to the Analysis of Circulating Bilirubin Levels. Mayo Clin Proc Jul; 86(7):
Natural Language Processing in the EHR Lifecycle
Insight Driven Health Natural Language Processing in the EHR Lifecycle Cecil O. Lynch, MD, MS [email protected] Health & Public Service Outline Medical Data Landscape Value Proposition of NLP
IBM Watson and Medical Records Text Analytics HIMSS Presentation
IBM Watson and Medical Records Text Analytics HIMSS Presentation Thomas Giles, IBM Industry Solutions - Healthcare Randall Wilcox, IBM Industry Solutions - Emerging Technology jstart The Next Grand Challenge
Travis Goodwin & Sanda Harabagiu
Automatic Generation of a Qualified Medical Knowledge Graph and its Usage for Retrieving Patient Cohorts from Electronic Medical Records Travis Goodwin & Sanda Harabagiu Human Language Technology Research
TRANSLATIONAL BIOINFORMATICS 101
TRANSLATIONAL BIOINFORMATICS 101 JESSICA D. TENENBAUM Department of Bioinformatics and Biostatistics, Duke University Durham, NC 27715 USA [email protected] SUBHA MADHAVAN Innovation Center for
Find the signal in the noise
Find the signal in the noise Electronic Health Records: The challenge The adoption of Electronic Health Records (EHRs) in the USA is rapidly increasing, due to the Health Information Technology and Clinical
Using EHRs for Heart Failure Therapy Recommendation Using Multidimensional Patient Similarity Analytics
Digital Healthcare Empowering Europeans R. Cornet et al. (Eds.) 2015 European Federation for Medical Informatics (EFMI). This article is published online with Open Access by IOS Press and distributed under
A Multi-locus Genetic Risk Score for Abdominal Aortic Aneurysm
A Multi-locus Genetic Risk Score for Abdominal Aortic Aneurysm Zi Ye, 1 MD, Erin Austin, 1,2 PhD, Daniel J Schaid, 2 PhD, Iftikhar J. Kullo, 1 MD Affiliations: 1 Division of Cardiovascular Diseases and
Developing VA GDx: An Informatics Platform to Capture and Integrate Genetic Diagnostic Testing Data into the VA Electronic Medical Record
Developing VA GDx: An Informatics Platform to Capture and Integrate Genetic Diagnostic Testing Data into the VA Electronic Medical Record Scott L. DuVall Jun 27, 2014 1 Julie Lynch Vickie Venne Dawn Provenzale
Predicting Chief Complaints at Triage Time in the Emergency Department
Predicting Chief Complaints at Triage Time in the Emergency Department Yacine Jernite, Yoni Halpern New York University New York, NY {jernite,halpern}@cs.nyu.edu Steven Horng Beth Israel Deaconess Medical
Integrating Public and Private Medical Texts for Patient De-Identification with Apache ctakes
Integrating Public and Private Medical Texts for Patient De-Identification with Apache ctakes Presented By: Andrew McMurry & Britt Fitch (Apache ctakes committers) Co-authors: Guergana Savova, Ben Reis,
Genomics and Health Data Standards: Lessons from the Past and Present for a Genome-enabled Future
Genomics and Health Data Standards: Lessons from the Past and Present for a Genome-enabled Future Daniel Masys, MD Professor and Chair Department of Biomedical Informatics Professor of Medicine Vanderbilt
Recognizing and Encoding Disorder Concepts in Clinical Text using Machine Learning and Vector Space Model *
Recognizing and Encoding Disorder Concepts in Clinical Text using Machine Learning and Vector Space Model * Buzhou Tang 1,2, Yonghui Wu 1, Min Jiang 1, Joshua C. Denny 3, and Hua Xu 1,* 1 School of Biomedical
Problem-Centered Care Delivery
HOW INTERFACE TERMINOLOGY MAKES STANDARDIZED HEALTH INFORMATION POSSIBLE Terminologies ensure that the languages of medicine can be understood by both humans and machines. by June Bronnert, RHIA, CCS,
TMUNSW: Identification of disorders and normalization to SNOMED-CT terminology in unstructured clinical notes
TMUNSW: Identification of disorders and normalization to SNOMED-CT terminology in unstructured clinical notes Jitendra Jonnagaddala a,b,c Siaw-Teng Liaw *,a Pradeep Ray b Manish Kumar c School of Public
How To Determine Pad
Process Representation #1 : The PAD algorithm as a sequential flow thru all sections An exploded version of the above scoped section flow is shown below. Notes: The flow presupposes existing services (
Enhancing Functionality of EHRs for Genomic Research, Including E- Phenotying, Integrating Genomic Data, Transportable CDS, Privacy Threats
Enhancing Functionality of EHRs for Genomic Research, Including E- Phenotying, Integrating Genomic Data, Transportable CDS, Privacy Threats Genomic Medicine 8 meeting Alexa McCray Christopher G Chute Rex
Software Architecture Document
Software Architecture Document Natural Language Processing Cell Version 1.0 Natural Language Processing Cell Software Architecture Document Version 1.0 1 1. Table of Contents 1. Table of Contents... 2
Healthcare Data: Secondary Use through Interoperability
Healthcare Data: Secondary Use through Interoperability Floyd Eisenberg MD MPH July 18, 2007 NCVHS Agenda Policies, Enablers, Restrictions Date Re-Use Landscape Sources of Data for Quality Measurement,
Research Skills for Non-Researchers: Using Electronic Health Data and Other Existing Data Resources
Research Skills for Non-Researchers: Using Electronic Health Data and Other Existing Data Resources James Floyd, MD, MS Sep 17, 2015 UW Hospital Medicine Faculty Development Program Objectives Become more
Automated Problem List Generation from Electronic Medical Records in IBM Watson
Proceedings of the Twenty-Seventh Conference on Innovative Applications of Artificial Intelligence Automated Problem List Generation from Electronic Medical Records in IBM Watson Murthy Devarakonda, Ching-Huei
Carolina s Journey: Turning Big Data Into Better Care. Michael Dulin, MD, PhD
Carolina s Journey: Turning Big Data Into Better Care Michael Dulin, MD, PhD Current State: Massive investments in EMR systems Rapidly Increase Amount of Data (Velocity, Volume, Veracity) The Data has
MISSING DATA ANALYSIS AMONG PATIENTS IN THE PINNACLE REGISTRY
MISSING DATA ANALYSIS AMONG PATIENTS IN THE PINNACLE REGISTRY In order to improve the efficiency of PINNACLE Registry data analytics, a missing data analysis has been conducted on PINNACLE Registry data
Beacon User Stories Version 1.0
Table of Contents 1. Introduction... 2 2. User Stories... 2 2.1 Update Clinical Data Repository and Disease Registry... 2 2.1.1 Beacon Context... 2 2.1.2 Actors... 2 2.1.3 Preconditions... 3 2.1.4 Story
Empowering Personalized Medicine with Big Data and Semantic Web Technology: Promises, Challenges, and Use Cases
Empowering Personalized Medicine with Big Data and Semantic Web Technology: Promises, Challenges, and Use Cases Maryam Panahiazar, Vahid Taslimitehrani, Ashutosh Jadhav, Jyotishman Pathak Center for Science
Computer-assisted coding and natural language processing
Computer-assisted coding and natural language processing Without changes to current coding technology and processes, ICD-10 adoption will be very difficult for providers to absorb, due to the added complexity
CDA for Common Document Types: Objectives, Status, and Relationship to Computer-assisted Coding
CDA for Common Document Types: Objectives, Status, and Relationship to Computer-assisted Coding CDA for Common Document Types: Objectives, Status, and Relationship to Computer-assisted Coding by Liora
Not all NLP is Created Equal:
Not all NLP is Created Equal: CAC Technology Underpinnings that Drive Accuracy, Experience and Overall Revenue Performance Page 1 Performance Perspectives Health care financial leaders and health information
Congestive Heart Failure Management Program
Congestive Heart Failure Management Program The Congestive Heart Failure Program is the third statewide disease management program developed by CCNC. The clinical directors reviewed prevalence and outcome
HEALTH EVIDENCE REVIEW COMMISSION (HERC) COVERAGE GUIDANCE: DIAGNOSIS OF SLEEP APNEA IN ADULTS DATE: 5/9/2013 HERC COVERAGE GUIDANCE
HEALTH EVIDENCE REVIEW COMMISSION (HERC) COVERAGE GUIDANCE: DIAGNOSIS OF SLEEP APNEA IN ADULTS DATE: 5/9/2013 HERC COVERAGE GUIDANCE The following diagnostic tests for Obstructive Sleep Apnea (OSA) should
Electronic Health Record (EHR) Standards Survey
Electronic Health Record (EHR) Standards Survey Compiled by: Simona Cohen, Amnon Shabo Date: August 1st, 2001 This report is a short survey about the main emerging standards that relate to EHR - Electronic
Clinical and research data integration: the i2b2 FSM experience
Clinical and research data integration: the i2b2 FSM experience Laboratory of Biomedical Informatics for Clinical Research Fondazione Salvatore Maugeri - FSM - Hospital, Pavia, italy Laboratory of Biomedical
Delivering the power of the world s most successful genomics platform
Delivering the power of the world s most successful genomics platform NextCODE Health is bringing the full power of the world s largest and most successful genomics platform to everyday clinical care NextCODE
ADVANCING MEASUREMENT OF PATIENT- CENTERED OUTCOMES AND QUALITY METRICS WITH ELECTRONIC HEALTH RECORDS
ADVANCING MEASUREMENT OF PATIENT- CENTERED OUTCOMES AND QUALITY METRICS WITH ELECTRONIC HEALTH RECORDS Tina Hernandez-Boussard, PhD, MPH, MS Director, Surgical Health Services Research Unit Assistant Professor
The What, When, Where and How of Natural Language Processing
The What, When, Where and How of Natural Language Processing There s a mystique that surrounds natural language processing (NLP) technology, regarding how it works, and what it can and cannot do. Although
What is your level of coding experience?
The TrustHCS Academy is dedicated to growing new coders to enter the field of medical records coding while also increasing the ICD-10 skills of seasoned coding professionals. A unique combination of on-line
Application of Data Mining Methods in Health Care Databases
6 th International Conference on Applied Informatics Eger, Hungary, January 27 31, 2004. Application of Data Mining Methods in Health Care Databases Ágnes Vathy-Fogarassy Department of Mathematics and
The American Academy of Ophthalmology Adopts SNOMED CT as its Official Clinical Terminology
The American Academy of Ophthalmology Adopts SNOMED CT as its Official Clinical Terminology H. Dunbar Hoskins, Jr., M.D., P. Lloyd Hildebrand, M.D., Flora Lum, M.D. The road towards broad adoption of electronic
Chapter 3: Data Mining Driven Learning Apprentice System for Medical Billing Compliance
Chapter 3: Data Mining Driven Learning Apprentice System for Medical Billing Compliance 3.1 Introduction This research has been conducted at back office of a medical billing company situated in a custom
Natural Language Processing for Clinical Informatics and Translational Research Informatics
Natural Language Processing for Clinical Informatics and Translational Research Informatics Imre Solti, M. D., Ph. D. [email protected] K99 Fellow in Biomedical Informatics University of Washington Background
Health Information Management AAS Degree Program Offered at the HNL Online Campus
Health Information Management AAS Degree Program Offered at the HNL Online Campus Objective: Health Information Technology AAS degree program is designed to equip students with the skills and knowledge
Big Data Analytics for Healthcare
Big Data Analytics for Healthcare Jimeng Sun Chandan K. Reddy Healthcare Analytics Department IBM TJ Watson Research Center Department of Computer Science Wayne State University 1 Healthcare Analytics
Genomics and the EHR. Mark Hoffman, Ph.D. Vice President Research Solutions Cerner Corporation
Genomics and the EHR Mark Hoffman, Ph.D. Vice President Research Solutions Cerner Corporation Overview EHR from Commercial Perspective What can be done TODAY? What could be done TOMORROW? What are some
The Big Picture: IDNT in Electronic Records Glossary
TERM DEFINITION CCI Canada Health Infoway Canadian Institute for Health Information EHR EMR EPR H L 7 (HL7) Canadian Classification of Interventions is the Canadian standard for classifying health care
Using EHRs to extract information, query clinicians, and insert reports
Using EHRs to extract information, query clinicians, and insert reports Meghan Baker, MD, ScD NIH HCS Collaboratory EHR working group webinar March 26, 2013 1 E S P V A E R S Electronic Support for Public
Structuring Unstructured Clinical Narratives in OpenMRS with Medical Concept Extraction
Structuring Unstructured Clinical Narratives in OpenMRS with Medical Concept Extraction Ryan M Eshleman, Hui Yang, and Barry Levine [email protected], [email protected], [email protected] Department
TITLE Dori Whittaker, Director of Solutions Management, M*Modal
TITLE Dori Whittaker, Director of Solutions Management, M*Modal Challenges Impacting Clinical Documentation HITECH Act, Meaningful Use EHR mandate and adoption Need for cost savings Migration to ICD 10
Meaningful Use Stage 2 Certification: A Guide for EHR Product Managers
Meaningful Use Stage 2 Certification: A Guide for EHR Product Managers Terminology Management is a foundational element to satisfying the Meaningful Use Stage 2 criteria and due to its complexity, and
Searching biomedical data sets. Hua Xu, PhD The University of Texas Health Science Center at Houston
Searching biomedical data sets Hua Xu, PhD The University of Texas Health Science Center at Houston Motivations for biomedical data re-use Improve reproducibility Minimize duplicated efforts on creating
Advanced Practice Nurse-managed Heart Failure Clinic Benefits Patient s Quality of Life and Limits Readmissions
Nursing and Health 1(3): 47-51, 2013 DOI: 10.13189/nh.2013.010301 http://www.hrpub.org Advanced Practice Nurse-managed Heart Failure Clinic Benefits Patient s Quality of Life and Limits Readmissions Christina
Smarter Healthcare@IBM Research. Joseph M. Jasinski, Ph.D. Distinguished Engineer IBM Research
Smarter Healthcare@IBM Research Joseph M. Jasinski, Ph.D. Distinguished Engineer IBM Research Our researchers work on a wide spectrum of topics Basic Science Industry specific innovation Nanotechnology
Update on Prostate Cancer: Screening, Diagnosis, and Treatment Making Sense of the Noise and Directions Forward
Update on Prostate Cancer: Screening, Diagnosis, and Treatment Making Sense of the Noise and Directions Forward 33 rd Annual Internal Medicine Update December 5, 2015 Ryan C. Hedgepeth, MD, MS Chief of
Bench to Bedside Clinical Decision Support:
Bench to Bedside Clinical Decision Support: The Role of Semantic Web Technologies in Clinical and Translational Medicine Tonya Hongsermeier, MD, MBA Corporate Manager, Clinical Knowledge Management and
Technical Issues in Aggregating and Analyzing Data from Heterogeneous EHR Systems
Technical Issues in Aggregating and Analyzing Data from Heterogeneous EHR Systems Josh Denny, MD, MS [email protected] Vanderbilt University, Nashville, Tennessee, USA 2/12/2015 EHR data are dense
Big Data Analytics and Healthcare
Big Data Analytics and Healthcare Anup Kumar, Professor and Director of MINDS Lab Computer Engineering and Computer Science Department University of Louisville Road Map Introduction Data Sources Structured
High-Throughput Phenotyping from Electronic Health Records for Research
High-Throughput Phenotyping from Electronic Health Records for Research Jyotishman Pathak, PhD Director, Clinical Informatics Program and Services Kern Center for the Science of Health Care Delivery Associate
BIOINF 525 Winter 2016 Foundations of Bioinformatics and Systems Biology http://tinyurl.com/bioinf525-w16
Course Director: Dr. Barry Grant (DCM&B, [email protected]) Description: This is a three module course covering (1) Foundations of Bioinformatics, (2) Statistics in Bioinformatics, and (3) Systems
SOLUTION BRIEF. IMAT Enhances Clinical Trial Cohort Identification. imatsolutions.com
SOLUTION BRIEF IMAT Enhances Clinical Trial Cohort Identification imatsolutions.com Introduction Timely access to data is always a top priority for mature organizations. Identifying and acting on the information
ehr Sharable Data Vicky Fung Senior Health Informatician ehr Information Standards Office
ehr Sharable Data Vicky Fung Senior Health Informatician ehr Information Standards Office ehr - Vision DH HA epr Private Private Hospit als Hospitals EHR Repository Access Portal CMS onramp PPP Clinics
Generating Patient Problem Lists from the ShARe Corpus using SNOMED CT/SNOMED CT CORE Problem List
Generating Patient Problem Lists from the ShARe Corpus using SNOMED CT/SNOMED CT CORE Problem List Danielle Mowery Janyce Wiebe University of Pittsburgh Pittsburgh, PA [email protected] [email protected]
Patient Similarity-guided Decision Support
Patient Similarity-guided Decision Support Tanveer Syeda-Mahmood, PhD IBM Almaden Research Center May 2014 2014 IBM Corporation What is clinical decision support? Rule-based expert systems curated by people,
Health Information Technology Courses
Health Information Technology Courses Course ID Course Title Credits HIT-100 Introduction to Healthcare 3 HIT-110 Medical Terminology I 3 HIT-120 Medical Terminology II 3 HIT-130 Medical Transcription/Editing
11-792 Software Engineering EMR Project Report
11-792 Software Engineering EMR Project Report Team Members Phani Gadde Anika Gupta Ting-Hao (Kenneth) Huang Chetan Thayur Suyoun Kim Vision Our aim is to build an intelligent system which is capable of
> Semantic Web Use Cases and Case Studies
> Semantic Web Use Cases and Case Studies Case Study: Applied Semantic Knowledgebase for Detection of Patients at Risk of Organ Failure through Immune Rejection Robert Stanley 1, Bruce McManus 2, Raymond
The Prolog Interface to the Unstructured Information Management Architecture
The Prolog Interface to the Unstructured Information Management Architecture Paul Fodor 1, Adam Lally 2, David Ferrucci 2 1 Stony Brook University, Stony Brook, NY 11794, USA, [email protected] 2 IBM
Explanation of care coordination payments as described in Section 223.000 of the PCMH provider manual
Explanation of care coordination payments as described in Section 223.000 of the PCMH provider manual Determination of beneficiary risk Per beneficiary amounts Per beneficiary amounts 1 For the first year
Meaningful Use - HL7 Version 2
Meaningful Use - HL7 Version 2 HL7 Version 2 and Surveillance Your Ambassadors Today Anna Orlova, PhD, Executive Director, Public Health Data Standards Consortium and Johns Hopkins University Lori Reed-Fourquet,
EHR Software Feature Comparison
EHR Comparison ELECTRONIC MEDICAL RECORDS Patient demographics Manages the input and maintenance of patient information including demographics, insurance, contacts, referrals, notes and more. Consents
Clinical Decision Support Product Area Rong Chen MD PhD Chief Medical Informatics Officer
Clinical Decision Support Product Area Rong Chen MD PhD Chief Medical Informatics Officer CAMBIO HEALTHCARE SYSTEMS 2015-6-16 WWW.CAMBIOHEALTHCARE.CO.UK 1 Patient CAMBIO HEALTHCARE SYSTEMS Core Assumptions
HIE Ready 2.0 SPECIFICATIONS MATRIX. Product Name: Version Number: Preferred Message and Trigger
HIE Ready 2.0 SPECIFICATIONS MATRIX Entity Name: Street Address: City, State, Zip: Point of Contact Name: E-mail & Phone: Alternate Contact Name: Alternate E-mail & Phone: Product Name: Version Number:
EHR Data Reuse through openehr Archetypes
EHR Data Reuse through openehr Archetypes Rong Chen MD, PhD Chief Medical Informatics Officer 2012.09.19-1- 2012-10-02 Agenda Background introduction (3 min) Experience of extracting EHR data from regional
Extracting value from scientific literature: the power of mining full-text articles for pathway analysis
FOR PHARMA & LIFE SCIENCES WHITE PAPER Harnessing the Power of Content Extracting value from scientific literature: the power of mining full-text articles for pathway analysis Executive Summary Biological
