Core research questions for natural language processing of clinical text. Noémie Elhadad

Size: px
Start display at page:

Download "Core research questions for natural language processing of clinical text. Noémie Elhadad noemie@dbmi.columbia.edu"

Transcription

1 Core research questions for natural language processing of clinical text Noémie Elhadad

2 NLP s promise for medicine and health } Increasingly large amounts of texts } Clinical literature in PubMed } Patients & health consumers talking online } Patient notes in the Electronic Health Record (EHR) } Natural language processing } Leverage the information from natural text (with all of its noise, idiosyncrasies and ambiguities) into a format amenable to computing } Support clinical discovery } Provide tailored access to relevant information and helps in decision making

3 NLP s promise for medicine and health } Clinical discovery } Discover patterns about patient populations, their diseases and their treatments based on their records } Clinical decision making } Extract information in a patient s record as input to decision support systems } Improved workflow for Electronic Health Record users } Intelligent search } Question answering } Summarization of longitudinal records }

4 How far away are we? } NLP in the general domain } Spam filtering, machine translation, sentiment analysis, Siri, Watson, } Linguistically informed, data-driven approaches } Shared corpora for training and testing } Clinical NLP } Some success stories } Still many challenges and open research questions

5 Outline } What is in the EHR (and isn t free text going away anyway)? } Towards natural language understanding } The EHR and the Truth it s complicated } Information redundancy

6 Outline } What is in the EHR (and isn t free text going away anyway)? } Towards natural language understanding } The EHR and the Truth it s complicated } Information redundancy

7

8 Information captured in the EHR } What is captured? } Diagnoses (billing codes) } Laboratory results (time series) } Medications (mix of free text and codes) } Reports (structured documents) } Notes from physicians, nurses, social workers, etc. } Why is it captured? } Clinical purposes } Billing purposes } Legal purposes } Compliance purposes

9 Narrative vs. structured data } Ongoing debate in health IT } Even simple phenotypic information is not trivial to encode in a structured format } Smoking status of a patient } Mismatch between structured data and narratives } Diagnoses } Medications } Labs

10 At NewYork-Presbyterian (Eclipsys data)

11 Outline } What is in the EHR (and isn t free text going away anyway)? } Towards natural language understanding } The EHR and the Truth it s complicated } Information redundancy

12 Towards natural language understanding } Information extraction } Locates and structures specific information in text } Can help answer questions like how many patients in my institution come in with shortness of breath on any given day? 43 yo female with history of GERD woke up w/ SOB and LUE discomfort 1 day PTA. She presented to [**Hospital2 72**] where she was ruled out for MI by enzymes. She underwent stress test the following day at [**Hospital2 72**]. She developed SOB and shoulder pain during the test.

13 Information extraction (1) } Named entity recognition } Conditions and symptoms, Diagnostic procedures and laboratory tests, Therapeutic procedures, } How to determine that GERD is a condition, but stress test is diagnostic procedure? Rely on terminologies and ontologies (UMLS) } How to determine that LUE discomfort is a symptom, even though it is not in the UMLS? Rely on context and syntax 43 yo female with history of GERD woke up w/ SOB and LUE discomfort 1 day PTA. She presented to [**Hospital2 72**] where she was ruled out for MI by enzymes. She underwent stress test the following day at [**Hospital2 72**]. She developed SOB and shoulder pain during the test.

14 Information extraction (2) } Normalize the entities (map to a semantic concept in an ontology) } GERD à C Gastroesophageal reflux disease, stress test à C Exercise Stress Test } How to determine that PTA is not mapped to Post Traumatic Amnesia, Percutaneous Transluminal Angioplasty, or Parent Teacher Association? } How to determine that enzymes maps to Clinical Enzyme Test, but not Enzyme? Rely on the context (e.g., 1 day and MI by ) 43 yo female with history of GERD woke up w/ SOB and LUE discomfort 1 day PTA. She presented to [**Hospital2 72**] where she was ruled out for MI by enzymes. She underwent stress test the following day at [**Hospital2 72**]. She developed SOB and shoulder pain during the test.

15 Information extraction (3) } Identify modifiers associated with the entities } GERD has one temporal modifier history of } SOB has one temporal modifier 1 day PTA } LUE discomfort has one temporal modifier 1 day PTA and one anatomical modifier LUE } MI has one negation modifier ruled out } 43 yo female with history of GERD woke up w/ SOB and LUE discomfort 1 day PTA. She presented to [**Hospital2 72**] where she was ruled out for MI by enzymes. She underwent stress test the following day at [**Hospital2 72**]. She developed SOB and shoulder pain during the test.

16 Information extraction (3) } Challenges and Solutions } How to determine the types of modifiers of interest? Task dependent: some are obvious, some not always needed } How to recognize them in text? Rely on syntax and context 43 yo female with history of GERD woke up w/ SOB and LUE discomfort 1 day PTA. She presented to [**Hospital2 72**] where she was ruled out for MI by enzymes. She underwent stress test the following day at [**Hospital2 72**]. She developed SOB and shoulder pain during the test.

17 Information extraction (4) } Identify relations associated among the entities } enzyme is the test used to rule out MI } SOB occurred during stress test } stress test and test refer to the same entity 43 yo female with history of GERD woke up w/ SOB and LUE discomfort 1 day PTA. She presented to [**Hospital2 72**] where she was ruled out for MI by enzymes. She underwent stress test the following day at [**Hospital2 72**]. She developed SOB and shoulder pain during the test.

18 Information extraction (4) } Challenges and Solutions } How to determine the types of relations? Task dependent } How to recognize them in text? Use context (patterns X during Y ruled out for X by Y ) } How to identify them when they are not explicit in text (e.g., pharmacovigilance)? Use statistical models to find patterns of occurrences 43 yo female with history of GERD woke up w/ SOB and LUE discomfort 1 day PTA. She presented to [**Hospital2 72**] where she was ruled out for MI by enzymes. She underwent stress test the following day at [**Hospital2 72**]. She developed SOB and shoulder pain during the test.

19 Towards robust information extraction } Shared annotated lexical resources needed } Access to shared clinical texts is difficult } Some task-specific annotated corpora } i2b2 challenges, computational medicine challenge, } Task-independent multi-layered annotations (ongoing) } ShARe MIMIC II notes annotated for POS, chunks, disorder mentions and 13 modifiers } SHARPn clinical element models } Temporal relations following ISO TimeML } Overall, 1.5M tokens, several institutions, rich annotation

20 Outline } What is in the EHR (and isn t free text going away anyway)? } Towards natural language understanding } The EHR and the Truth it s complicated } Information redundancy

21 Allegory of the cave EHR

22 Allegory of the cave EHR } Uncontrolled data source } From the patient } From the clinician

23 Allegory of the cave EHR } Erroneous sometimes } Mistakes } Rounding } Not enough information } Important data is not captured by the EHR } Specifically true for the conditions whose phenotype is unknown } Healthy patients are not well represented } Sparse } Captive population } Data missing not at random (healthy patients are not measured often) } Too much information } For a given task, lots of irrelevant information } Information redundancy copy and paste

24 Implications on methods } Learning from large clinical corpora, especially longitudinal records } Detangle the biases of the EHR from clinical patterns } (This is true for non-linguistic data in the EHR as well)

25 Outline } What is in the EHR (and isn t free text going away anyway)? } Towards natural language understanding } The EHR and the Truth it s complicated } Information redundancy

26 Information redundancy in the EHR } Copy and paste } Clinicians copy chunks of text from a previous note into current note } No new information since last note } Effect on documentation quality } Error propagation } Notes become incoherent with time } What is the effect on text mining?

27 Information redundancy in the EHR } Scenario of use: phenotyping / disease model } Look for terms that are representative of a patient cohort (collocation identification) } Discover clusters of terms that co-occur often in patients notes (topic model) } 1,604 patients with CKD stage 3 } 22,564 notes (three note types only), span 1-10 years } 6.1M tokens, 140K vocabulary } 600K UMLS concept tokens, 7K concept vocabulary } High redundancy at the string level

28 Example of topic modeling (LDA) output on corpus of CKD patients # Topic 1 Topic 2 Topic 3 1.# renal htn# Pulm# 2.# ckd# lisinopril# Pulmonary# 3.# cr# hctz# Ct# 4.# kidney# bp# Chest# 5.# appt# lipitor# Copd# 6.# lasix# asa# lung# 7.# disease# date# pfts# 8.# anemia# amlodipine# sob# 9.# pth# ldl# cough# 10.# iv# hpl# pna# 11.# mi# hl# sputum# 12.# chronic# buttock# severe# 13.# rf# trazodone# defect# 14.# gfr# nephrolithiasis# findings# 15.# missed# uncontrolled# prevacid# 16.# refer# adrenal# fev# 17.# pap# erythema# long# 18.# secondary# metoprolol# ace# 19.# pmd# drugs# pack# 20.# itching# ## spiriva#

29 LDA behavior on traditional corpus } Wall Street Journal corpus } 400 documents } 600 documents } 1,300 documents } Train LDA on the three subsets and compare their performances on the same withheld WSJ subset } The higher the log-likelihood, the more successful the model is at modeling the content of a corpus } Given two models on the same data, the one with the lower number of topics has better explanatory power (fewer latent variables are needed to explain the data)

30 LDA behavior on traditional corpus } More data is better (1,300 vs. 400 documents) } Same shape LDA-WSJ Log Likelihood of model filt WSJ-400 WSJ-600 WSJ Number of Topics

31 LDA on non-traditional corpus } What happens when we introduce redundancy in the Wall Street Journal? } WSJx2: 2,600 documents } WSJx3: 3,900 documents } WSJx5: 3,309 documents every document is sampled randomly between 1 and 5 times(most similar to EHR corpus)

32 LDA on non-traditional corpus } Adding redundancy worsens the models (WSJx5 is almost as bad as WSJ-600) LDA-WSJ Log Likelihood of model filt WSJ-600 WSJ-1300 WSJx2 WSJx WSJx Number of Topics

33 Information redundancy in the EHR } Redundancy has a negative impact on topic modeling } Similar experiments with collocation extraction } Redundant EHR and synthetic WSJ corpora have more collocations identified } But their quality is dubious } Mitigation strategies to take the fact that several notes come from the same patients into account } Don t treat each note independently of each other } But don t remove redundancy artificially because redundant information is still meaningful } Fingerprinting, RedLDA

34 LDA, Fingerprinting, RedLDA on EHR Corpus

35 Conclusions } The EHR and the narrative part are a gold mine of information } The infrastructural challenges to opening data to researchers are being addressed (slowly, but surely) } The scientific challenges are concrete ones, beyond the get more data and train better models } Better understanding of the characteristics of EHR } Novel methods that do not violate the assumptions of the EHR

36 Thank you! Sharon Lipsky Gorman, Karthik Natarajan, Rimma Pivovarov, Adler Perotte, David Albers, George Columbia University Raphael Cohen, Iddo Aviram, Michael Ben Gurion University R01 LM (Elhadad) An NLP approach to generating patient record summaries NLM HHS (Elhadad) Causal inference on narrative and structured temporal data to augment discovery and care R01 GM (Chapman, Elhadad, Savova) Annotation, development and evaluation for clinical information extraction

Redundancy in electronic health record corpora: analysis, impact on text mining performance and mitigation strategies

Redundancy in electronic health record corpora: analysis, impact on text mining performance and mitigation strategies Cohen et al. BMC Bioinformatics 2013, 14:10 RESEARCH ARTICLE Open Access Redundancy in electronic health record corpora: analysis, impact on text mining performance and mitigation strategies Raphael Cohen

More information

Integrating Public and Private Medical Texts for Patient De-Identification with Apache ctakes

Integrating Public and Private Medical Texts for Patient De-Identification with Apache ctakes Integrating Public and Private Medical Texts for Patient De-Identification with Apache ctakes Presented By: Andrew McMurry & Britt Fitch (Apache ctakes committers) Co-authors: Guergana Savova, Ben Reis,

More information

Secondary Use of EMR Data View from SHARPn AMIA Health Policy, 12 Dec 2012

Secondary Use of EMR Data View from SHARPn AMIA Health Policy, 12 Dec 2012 Secondary Use of EMR Data View from SHARPn AMIA Health Policy, 12 Dec 2012 Christopher G. Chute, MD DrPH, Professor, Biomedical Informatics, Mayo Clinic Chair, ISO TC215 on Health Informatics Chair, International

More information

Generating Patient Problem Lists from the ShARe Corpus using SNOMED CT/SNOMED CT CORE Problem List

Generating Patient Problem Lists from the ShARe Corpus using SNOMED CT/SNOMED CT CORE Problem List Generating Patient Problem Lists from the ShARe Corpus using SNOMED CT/SNOMED CT CORE Problem List Danielle Mowery Janyce Wiebe University of Pittsburgh Pittsburgh, PA dlm31@pitt.edu wiebe@cs.pitt.edu

More information

Bench to Bedside Clinical Decision Support:

Bench to Bedside Clinical Decision Support: Bench to Bedside Clinical Decision Support: The Role of Semantic Web Technologies in Clinical and Translational Medicine Tonya Hongsermeier, MD, MBA Corporate Manager, Clinical Knowledge Management and

More information

Uncovering Value in Healthcare Data with Cognitive Analytics. Christine Livingston, Perficient Ken Dugan, IBM

Uncovering Value in Healthcare Data with Cognitive Analytics. Christine Livingston, Perficient Ken Dugan, IBM Uncovering Value in Healthcare Data with Cognitive Analytics Christine Livingston, Perficient Ken Dugan, IBM Conflict of Interest Christine Livingston Ken Dugan Has no real or apparent conflicts of interest

More information

Emphysema. Introduction Emphysema is a type of chronic obstructive pulmonary disease, or COPD. COPD affects about 64 million people worldwide.

Emphysema. Introduction Emphysema is a type of chronic obstructive pulmonary disease, or COPD. COPD affects about 64 million people worldwide. Emphysema Introduction Emphysema is a type of chronic obstructive pulmonary disease, or COPD. COPD affects about 64 million people worldwide. Emphysema involves damage to the air sacs in the lungs. This

More information

Patient Similarity-guided Decision Support

Patient Similarity-guided Decision Support Patient Similarity-guided Decision Support Tanveer Syeda-Mahmood, PhD IBM Almaden Research Center May 2014 2014 IBM Corporation What is clinical decision support? Rule-based expert systems curated by people,

More information

Find the signal in the noise

Find the signal in the noise Find the signal in the noise Electronic Health Records: The challenge The adoption of Electronic Health Records (EHRs) in the USA is rapidly increasing, due to the Health Information Technology and Clinical

More information

Query Log Analysis of an Electronic Health Record Search Engine

Query Log Analysis of an Electronic Health Record Search Engine Query Log Analysis of an Electronic Health Record Search Engine Lei Yang 1, Qiaozhu Mei 1,2, Kai Zheng 1,3, David A. Hanauer 4,5 1, School of Information 2. Department of EECS 3. School of Public Health

More information

3M Health Information Systems

3M Health Information Systems 3M Health Information Systems 1 Data Governance Disparate Systems Interoperability Information Exchange Reporting Public Health Quality Metrics Research Data Warehousing Data Standards What is the 3M Healthcare

More information

Identify Disorders in Health Records using Conditional Random Fields and Metamap

Identify Disorders in Health Records using Conditional Random Fields and Metamap Identify Disorders in Health Records using Conditional Random Fields and Metamap AEHRC at ShARe/CLEF 2013 ehealth Evaluation Lab Task 1 G. Zuccon 1, A. Holloway 1,2, B. Koopman 1,2, A. Nguyen 1 1 The Australian

More information

Text Mining - Scope and Applications

Text Mining - Scope and Applications Journal of Computer Science and Applications. ISSN 2231-1270 Volume 5, Number 2 (2013), pp. 51-55 International Research Publication House http://www.irphouse.com Text Mining - Scope and Applications Miss

More information

How to extract transform and load observational data?

How to extract transform and load observational data? How to extract transform and load observational data? Martijn Schuemie Janssen Research & Development Department of Pharmacology & Pharmacy, The University of Hong Kong Outline Observational data & research

More information

SEMANTIC DATA PLATFORM FOR HEALTHCARE. Dr. Philipp Daumke

SEMANTIC DATA PLATFORM FOR HEALTHCARE. Dr. Philipp Daumke SEMANTIC DATA PLATFORM FOR HEALTHCARE Dr. Philipp Daumke ABOUT AVERBIS Founded: 2007 Location: Focus: Languages: Current Sectors: Freiburg, Germany Terminology Management, Text Mining, Search multilingual

More information

Summarizing the Patient Record & Modeling Diseases from EHR Observations. Noémie Elhadad noemie.elhadad@columbia.edu

Summarizing the Patient Record & Modeling Diseases from EHR Observations. Noémie Elhadad noemie.elhadad@columbia.edu Summarizing the Patient Record & Modeling Diseases from EHR Observations Noémie Elhadad noemie.elhadad@columbia.edu http://www.washingtonexaminer.com/doctors-hospitals-rethink-electronic-medical-records-mandated-by-2009-law/article/2554622

More information

Natural Language Processing in the EHR Lifecycle

Natural Language Processing in the EHR Lifecycle Insight Driven Health Natural Language Processing in the EHR Lifecycle Cecil O. Lynch, MD, MS cecil.o.lynch@accenture.com Health & Public Service Outline Medical Data Landscape Value Proposition of NLP

More information

Healthcare Data: Secondary Use through Interoperability

Healthcare Data: Secondary Use through Interoperability Healthcare Data: Secondary Use through Interoperability Floyd Eisenberg MD MPH July 18, 2007 NCVHS Agenda Policies, Enablers, Restrictions Date Re-Use Landscape Sources of Data for Quality Measurement,

More information

EHR Standards and Semantic Interoperability

EHR Standards and Semantic Interoperability EHR Standards and Semantic Interoperability Setting the Frame of Reference Dr. Marco Eichelberg OFFIS - Institute for Information Technology E-Mail: eichelberg@offis.de Page 1 Introduction Semantic Interoperability:

More information

Free Text Phrase Encoding and Information Extraction from Medical Notes. Jennifer Shu

Free Text Phrase Encoding and Information Extraction from Medical Notes. Jennifer Shu Free Text Phrase Encoding and Information Extraction from Medical Notes by Jennifer Shu Submitted to the Department of Electrical Engineering and Computer Science in partial fulfillment of the requirements

More information

Emergency Scenario. Chest Pain

Emergency Scenario. Chest Pain Emergency Scenario Chest Pain This emergency scenario reviews chest pain in a primary care patient, and is set up for roleplay and case review with your staff. 1) The person facilitating scenarios can

More information

Travis Goodwin & Sanda Harabagiu

Travis Goodwin & Sanda Harabagiu Automatic Generation of a Qualified Medical Knowledge Graph and its Usage for Retrieving Patient Cohorts from Electronic Medical Records Travis Goodwin & Sanda Harabagiu Human Language Technology Research

More information

Network Big Data: Facing and Tackling the Complexities Xiaolong Jin

Network Big Data: Facing and Tackling the Complexities Xiaolong Jin Network Big Data: Facing and Tackling the Complexities Xiaolong Jin CAS Key Laboratory of Network Data Science & Technology Institute of Computing Technology Chinese Academy of Sciences (CAS) 2015-08-10

More information

Text Analytics. A business guide

Text Analytics. A business guide Text Analytics A business guide February 2014 Contents 3 The Business Value of Text Analytics 4 What is Text Analytics? 6 Text Analytics Methods 8 Unstructured Meets Structured Data 9 Business Application

More information

Predicting What Matters Using Predictive Analytics to Reduce Suffering, Save Lives, and Optimize the Cost of Care

Predicting What Matters Using Predictive Analytics to Reduce Suffering, Save Lives, and Optimize the Cost of Care Predicting What Matters Using Predictive Analytics to Reduce Suffering, Save Lives, and Optimize the Cost of Care Predictive Analytics for Population Health Management NCHICA Learning Objectives By the

More information

Global Headquarters: 5 Speen Street Framingham, MA 01701 USA P.508.935.4445 F.508.988.7881 www.idc-hi.com

Global Headquarters: 5 Speen Street Framingham, MA 01701 USA P.508.935.4445 F.508.988.7881 www.idc-hi.com Global Headquarters: 5 Speen Street Framingham, MA 01701 USA P.508.935.4445 F.508.988.7881 www.idc-hi.com U n l o cking the Power of Unstructured Data W H I T E P A P E R Sponsored by: IBM Susan Feldman

More information

TITLE Dori Whittaker, Director of Solutions Management, M*Modal

TITLE Dori Whittaker, Director of Solutions Management, M*Modal TITLE Dori Whittaker, Director of Solutions Management, M*Modal Challenges Impacting Clinical Documentation HITECH Act, Meaningful Use EHR mandate and adoption Need for cost savings Migration to ICD 10

More information

PATIENT REPORTED OUTCOMES IN ESOPHAGEAL DISEASE: AN OPEN SOURCE WEB BASED FRAMEWORK

PATIENT REPORTED OUTCOMES IN ESOPHAGEAL DISEASE: AN OPEN SOURCE WEB BASED FRAMEWORK Northwestern University Feinberg School of Medicine PATIENT REPORTED OUTCOMES IN ESOPHAGEAL DISEASE: AN OPEN SOURCE WEB BASED FRAMEWORK Andrew Gawron, MD, PhD Co-authors: Sherri LaVela, Jane Holl, David

More information

Electronic Health Record (EHR) Standards Survey

Electronic Health Record (EHR) Standards Survey Electronic Health Record (EHR) Standards Survey Compiled by: Simona Cohen, Amnon Shabo Date: August 1st, 2001 This report is a short survey about the main emerging standards that relate to EHR - Electronic

More information

Automated Problem List Generation from Electronic Medical Records in IBM Watson

Automated Problem List Generation from Electronic Medical Records in IBM Watson Proceedings of the Twenty-Seventh Conference on Innovative Applications of Artificial Intelligence Automated Problem List Generation from Electronic Medical Records in IBM Watson Murthy Devarakonda, Ching-Huei

More information

Find your future in the history

Find your future in the history Find your future in the history Is your radiology practice ready for the future? Demands are extremely high as radiology practices move from a fee-for-service model to an outcomes-based model centered

More information

Beacon User Stories Version 1.0

Beacon User Stories Version 1.0 Table of Contents 1. Introduction... 2 2. User Stories... 2 2.1 Update Clinical Data Repository and Disease Registry... 2 2.1.1 Beacon Context... 2 2.1.2 Actors... 2 2.1.3 Preconditions... 3 2.1.4 Story

More information

TMUNSW: Identification of disorders and normalization to SNOMED-CT terminology in unstructured clinical notes

TMUNSW: Identification of disorders and normalization to SNOMED-CT terminology in unstructured clinical notes TMUNSW: Identification of disorders and normalization to SNOMED-CT terminology in unstructured clinical notes Jitendra Jonnagaddala a,b,c Siaw-Teng Liaw *,a Pradeep Ray b Manish Kumar c School of Public

More information

FHIM Model Content Overview

FHIM Model Content Overview FHIM Model Content Overview Federal Health Information Model (FHIM) and Associated Terminology Models Goal Produce a logical, health information model that supports semantic interoperability and that is

More information

TRANSFoRm: Vision of a learning healthcare system

TRANSFoRm: Vision of a learning healthcare system TRANSFoRm: Vision of a learning healthcare system Vasa Curcin, Imperial College London Theo Arvanitis, University of Birmingham Derek Corrigan, Royal College of Surgeons Ireland TRANSFoRm is partially

More information

Computer-assisted coding and natural language processing

Computer-assisted coding and natural language processing Computer-assisted coding and natural language processing Without changes to current coding technology and processes, ICD-10 adoption will be very difficult for providers to absorb, due to the added complexity

More information

Exploration and Visualization of Post-Market Data

Exploration and Visualization of Post-Market Data Exploration and Visualization of Post-Market Data Jianying Hu, PhD Joint work with David Gotz, Shahram Ebadollahi, Jimeng Sun, Fei Wang, Marianthi Markatou Healthcare Analytics Research IBM T.J. Watson

More information

Visual Analytics to Enhance Personalized Healthcare Delivery

Visual Analytics to Enhance Personalized Healthcare Delivery Visual Analytics to Enhance Personalized Healthcare Delivery A RENCI WHITE PAPER A data-driven approach to augment clinical decision making CONTACT INFORMATION Ketan Mane, PhD kmane@renci,org 919.445.9703

More information

An Overview of Asthma - Diagnosis and Treatment

An Overview of Asthma - Diagnosis and Treatment An Overview of Asthma - Diagnosis and Treatment Asthma is a common chronic disorder of the airways that is complex and characterized by variable and recurring symptoms, airflow obstruction, bronchial hyperresponsiveness,

More information

Maximize the value of your COPD population health programs with advanced analytics PLAYBOOK

Maximize the value of your COPD population health programs with advanced analytics PLAYBOOK Maximize the value of your COPD population health programs with advanced analytics PLAYBOOK STEP ONE: Analyze your patient population Bend the cost curve: Learning more about your patients can lead to

More information

Interoperability and Analytics February 29, 2016

Interoperability and Analytics February 29, 2016 Interoperability and Analytics February 29, 2016 Matthew Hoffman MD, CMIO Utah Health Information Network Conflict of Interest Matthew Hoffman, MD Has no real or apparent conflicts of interest to report.

More information

Meaningful Use Stage 2 Certification: A Guide for EHR Product Managers

Meaningful Use Stage 2 Certification: A Guide for EHR Product Managers Meaningful Use Stage 2 Certification: A Guide for EHR Product Managers Terminology Management is a foundational element to satisfying the Meaningful Use Stage 2 criteria and due to its complexity, and

More information

What you can accomplish with IBMContent Analytics

What you can accomplish with IBMContent Analytics What you can accomplish with IBMContent Analytics An Enterprise Content Management solution What is IBM Content Analytics? Alex On February 14-16, IBM s Watson computing system made its television debut

More information

An Essential Ingredient for a Successful ACO: The Clinical Knowledge Exchange

An Essential Ingredient for a Successful ACO: The Clinical Knowledge Exchange An Essential Ingredient for a Successful ACO: The Clinical Knowledge Exchange Jonathan Everett Director, Health Information Technology Chinese Community Health Care Association Darren Schulte, MD, MPP

More information

Optum One. The Intelligent Health Platform

Optum One. The Intelligent Health Platform Optum One The Intelligent Health Platform The Optum One intelligent health platform enables healthcare providers to manage patient populations. The platform combines the industry s most advanced integrated

More information

Respiratory Care. A Life and Breath Career for You!

Respiratory Care. A Life and Breath Career for You! Respiratory Care A Life and Breath Career for You! Respiratory Care Makes a Difference At 9:32 am, Lori Moreno brought a newborn baby struggling to breathe back to life What have you accomplished today?

More information

Meaningful Use as a Driver for EHR Adoption

Meaningful Use as a Driver for EHR Adoption white paper Meaningful Use as a Driver for EHR Adoption How Nuance Healthcare Can Help Bridge the Gap to Structured Data Entry healthcare Contents Executive Summary...3 Introduction...3 Emerging Hybrid

More information

Delivering Real World Evidence. Canada Let s Get Real!

Delivering Real World Evidence. Canada Let s Get Real! Delivering Real World Evidence from Electronic Medical Records in Canada Let s Get Real! Neil Corner Director, Real World Evidence, IMS Brogan Alison Dziarmaga Director, Real World Evidence, AstraZeneca

More information

Medical Research from Medical Records Mikel Aickin, PhD Family & Community Medicine University of Arizona, USA The Problem Medical researchers believe that therapeutic knowledge comes from randomized clinical

More information

Health Information Exchange Language - Bostaik

Health Information Exchange Language - Bostaik Bootstrapping Adoption of a Universal Exchange Language for Health Information Exchange Speakers: Tajh L. Taylor, Lowell Vizenor OMG SOA in Healthcare Conference July 15, 2011 Agenda The Health Information

More information

How To Analyze Health Data

How To Analyze Health Data POPULATION HEALTH ANALYTICS ANALYTICALLY-DRIVEN INSIGHTS FOR POPULATION HEALTH LAURIE ROSE, PRINCIPAL CONSULTANT HEALTH CARE GLOBAL PRACTICE DISCUSSION TOPICS Population Health: What & Why Now? Population

More information

Article details. Abstract. Version 1

Article details. Abstract. Version 1 Article details Title Authors Abstract Version 1 Osteoarthritis in Family Physician Practices in Canada: A Report of the Canadian Primary Care Sentinel Surveillance Network (CPCSSN) Morkem, Rachael; Birtwhistle,

More information

Introduction To Evidence-Based Practice. Keith Posley, MD Lauren Maggio, MS (LIS)

Introduction To Evidence-Based Practice. Keith Posley, MD Lauren Maggio, MS (LIS) Introduction To Evidence-Based Practice Keith Posley, MD Lauren Maggio, MS (LIS) Goals & Objectives Define Evidence-Based Medicine (EBM) Articulate the four steps of EBP Learn to formulate answerable clinical

More information

Session 11 PD, Provider Perspectives of Values Based Payment Programs. Moderator: William T. O'Brien, FSA, FCA

Session 11 PD, Provider Perspectives of Values Based Payment Programs. Moderator: William T. O'Brien, FSA, FCA Session 11 PD, Provider Perspectives of Values Based Payment Programs Moderator: William T. O'Brien, FSA, FCA Presenters: Donald Fry, M.D. Lillian Louise Dittrick, FSA, MAAA Colleen Audrey Norris, ASA,

More information

The What, When, Where and How of Natural Language Processing

The What, When, Where and How of Natural Language Processing The What, When, Where and How of Natural Language Processing There s a mystique that surrounds natural language processing (NLP) technology, regarding how it works, and what it can and cannot do. Although

More information

Improving Health Records Search Using Multiple Query Expansion Collections

Improving Health Records Search Using Multiple Query Expansion Collections Improving Health Records Search Using Multiple Query Expansion Collections Dongqing Zhu and Ben Carterette Information Retrieval Lab University of Delaware Outline Introduction Retrieval Task & Data Retrieval

More information

ADVANCING MEASUREMENT OF PATIENT- CENTERED OUTCOMES AND QUALITY METRICS WITH ELECTRONIC HEALTH RECORDS

ADVANCING MEASUREMENT OF PATIENT- CENTERED OUTCOMES AND QUALITY METRICS WITH ELECTRONIC HEALTH RECORDS ADVANCING MEASUREMENT OF PATIENT- CENTERED OUTCOMES AND QUALITY METRICS WITH ELECTRONIC HEALTH RECORDS Tina Hernandez-Boussard, PhD, MPH, MS Director, Surgical Health Services Research Unit Assistant Professor

More information

temporal eligibility criteria; 3) Demonstration of the need for information fusion across structured and unstructured data in solving certain

temporal eligibility criteria; 3) Demonstration of the need for information fusion across structured and unstructured data in solving certain How essential are unstructured clinical narratives and information fusion to clinical trial recruitment? Preethi Raghavan, MS, James L. Chen, MD, Eric Fosler-Lussier, PhD, Albert M. Lai, PhD The Ohio State

More information

DRUG UTILIZATION EVALUATION OF ANTIHYPERTENSIVE DRUGS IN DIABETIC PATIENTS WITH CKD

DRUG UTILIZATION EVALUATION OF ANTIHYPERTENSIVE DRUGS IN DIABETIC PATIENTS WITH CKD WORLD JOURNAL OF PHARMACY AND PHARMACEUTICAL SCIENCES Elhami et al. SJIF Impact Factor 5.210 Volume 4, Issue 11, 1159-1166 Research Article ISSN 2278 4357 DRUG UTILIZATION EVALUATION OF ANTIHYPERTENSIVE

More information

Survey Results: Requirements and Use Cases for Linguistic Linked Data

Survey Results: Requirements and Use Cases for Linguistic Linked Data Survey Results: Requirements and Use Cases for Linguistic Linked Data 1 Introduction This survey was conducted by the FP7 Project LIDER (http://www.lider-project.eu/) as input into the W3C Community Group

More information

Medical Records Analysis

Medical Records Analysis Medical Records Analysis Karen A. Mulroy, Partner Evans & Dixon, L.L.C. The analysis of medical legal issues posed in any case can be complicated, requiring some close reading and detective work to both

More information

As the Director of The Clinical Center of Excellence at Rutgers University, I am speaking on

As the Director of The Clinical Center of Excellence at Rutgers University, I am speaking on Department of Environmental and Occupational Medicine Rutgers, The State University of New Jersey EOHSI 170 Frelinghuysen Road Piscataway, NJ 08540 Email: udasin@eohsi.rutgers.edu Phone: 848-445-6016 Fax:

More information

White Paper Synoptic Reporting and Structured Data Capture

White Paper Synoptic Reporting and Structured Data Capture White Paper Synoptic Reporting and Structured Data Capture Business Case for Synoptic Reporting and Beyond Introduction The amount of clinically significant findings a surgical pathologist is expected

More information

RSNA 2015 RCA22 A Practical Introduction to Structured Reporting Tools and Resources. DICOM Structured Reports (SR)

RSNA 2015 RCA22 A Practical Introduction to Structured Reporting Tools and Resources. DICOM Structured Reports (SR) RSNA 2015 RCA22 A Practical Introduction to Structured Reporting Tools and Resources DICOM Structured Reports (SR) David Clunie (dclunie@dclunie.com) PixelMed Publishing Background & Disclosures Owner,

More information

A leader in the development and application of information technology to prevent and treat disease.

A leader in the development and application of information technology to prevent and treat disease. A leader in the development and application of information technology to prevent and treat disease. About MOLECULAR HEALTH Molecular Health was founded in 2004 with the vision of changing healthcare. Today

More information

Open Domain Information Extraction. Günter Neumann, DFKI, 2012

Open Domain Information Extraction. Günter Neumann, DFKI, 2012 Open Domain Information Extraction Günter Neumann, DFKI, 2012 Improving TextRunner Wu and Weld (2010) Open Information Extraction using Wikipedia, ACL 2010 Fader et al. (2011) Identifying Relations for

More information

Clinical Decision Support Systems An Open Source Perspective

Clinical Decision Support Systems An Open Source Perspective Decision Support Systems An Open Source Perspective John McKim CTO, Knowledge Analytics Incorporated john@knowledgeanalytics.com http://www.knowledgeanaytics.com OSEHRA Open Source Summit 2014 Agenda CDS

More information

Clinical Study Design and Methods Terminology

Clinical Study Design and Methods Terminology Home College of Veterinary Medicine Washington State University WSU Faculty &Staff Page Page 1 of 5 John Gay, DVM PhD DACVPM AAHP FDIU VCS Clinical Epidemiology & Evidence-Based Medicine Glossary: Clinical

More information

Improving EHR Semantic Interoperability Future Vision and Challenges

Improving EHR Semantic Interoperability Future Vision and Challenges Improving EHR Semantic Interoperability Future Vision and Challenges Catalina MARTÍNEZ-COSTA a,1 Dipak KALRA b, Stefan SCHULZ a a IMI,Medical University of Graz, Austria b CHIME, University College London,

More information

Intelligent Search for Answering Clinical Questions Coronado Group, Ltd. Innovation Initiatives

Intelligent Search for Answering Clinical Questions Coronado Group, Ltd. Innovation Initiatives Intelligent Search for Answering Clinical Questions Coronado Group, Ltd. Innovation Initiatives Search The Way You Think Copyright 2009 Coronado, Ltd. All rights reserved. All other product names and logos

More information

A Systemic Artificial Intelligence (AI) Approach to Difficult Text Analytics Tasks

A Systemic Artificial Intelligence (AI) Approach to Difficult Text Analytics Tasks A Systemic Artificial Intelligence (AI) Approach to Difficult Text Analytics Tasks Text Analytics World, Boston, 2013 Lars Hard, CTO Agenda Difficult text analytics tasks Feature extraction Bio-inspired

More information

SEMANTICS ENABLED PROACTIVE AND TARGETED DISSEMINATION OF NEW MEDICAL KNOWLEDGE

SEMANTICS ENABLED PROACTIVE AND TARGETED DISSEMINATION OF NEW MEDICAL KNOWLEDGE SEMANTICS ENABLED PROACTIVE AND TARGETED DISSEMINATION OF NEW MEDICAL KNOWLEDGE Lakshmish Ramaswamy & I. Budak Arpinar Dept. of Computer Science, University of Georgia laks@cs.uga.edu, budak@cs.uga.edu

More information

John Sharp, MSSA, PMP Manager, Research Informatics Quantitative Health Sciences Cleveland Clinic, Cleveland, Ohio

John Sharp, MSSA, PMP Manager, Research Informatics Quantitative Health Sciences Cleveland Clinic, Cleveland, Ohio John Sharp, MSSA, PMP Manager, Research Informatics Quantitative Health Sciences Cleveland Clinic, Cleveland, Ohio Co-Director BiomedicalResearch Informatics Clinical and Translational Science Consortium

More information

Singapore s National Electronic Health Record

Singapore s National Electronic Health Record Singapore s National Electronic Health Record The Roadmap to 2010 Dr Sarah Christine Muttitt Chief Information Officer Information Systems Division 17 th July, 2009 Taking the Next Step (MSM April 2008)

More information

PREDICTIVE ANALYTICS FOR THE HEALTHCARE INDUSTRY

PREDICTIVE ANALYTICS FOR THE HEALTHCARE INDUSTRY PREDICTIVE ANALYTICS FOR THE HEALTHCARE INDUSTRY By Andrew Pearson Qualex Asia Today, healthcare companies are drowning in data. According to IBM, most healthcare organizations have terabytes and terabytes

More information

Promises and Pitfalls of Big-Data-Predictive Analytics: Best Practices and Trends

Promises and Pitfalls of Big-Data-Predictive Analytics: Best Practices and Trends Promises and Pitfalls of Big-Data-Predictive Analytics: Best Practices and Trends Spring 2015 Thomas Hill, Ph.D. VP Analytic Solutions Dell Statistica Overview and Agenda Dell Software overview Dell in

More information

Exploring the Challenges and Opportunities of Leveraging EMRs for Data-Driven Clinical Research

Exploring the Challenges and Opportunities of Leveraging EMRs for Data-Driven Clinical Research White Paper Exploring the Challenges and Opportunities of Leveraging EMRs for Data-Driven Clinical Research Because some knowledge is too important not to share. Exploring the Challenges and Opportunities

More information

Big Data in Healthcare: Current Possibilities and Emerging Opportunities

Big Data in Healthcare: Current Possibilities and Emerging Opportunities Big Data in Healthcare: Current Possibilities and Emerging Opportunities Andrew Bate Senior Director, Epidemiology Group Lead, Analytics 23 th March 2015 The Long Road In Developing a New Medicine Clinical

More information

Renovascular Hypertension

Renovascular Hypertension Renovascular Hypertension Philip Stockwell, MD Assistant Professor of Medicine (Clinical) Warren Alpert School of Medicine Cardiology for the Primary Care Provider September 28, 201 Renovascular Hypertension

More information

Semantic Search in E-Discovery. David Graus & Zhaochun Ren

Semantic Search in E-Discovery. David Graus & Zhaochun Ren Semantic Search in E-Discovery David Graus & Zhaochun Ren This talk Introduction David Graus! Understanding e-mail traffic David Graus! Topic discovery & tracking in social media Zhaochun Ren 2 Intro Semantic

More information

Standardized Representation for Electronic Health Record-Driven Phenotypes

Standardized Representation for Electronic Health Record-Driven Phenotypes Standardized Representation for Electronic Health Record-Driven Phenotypes April 8, 2014 AMIA Joint Summits for Translational Research Rachel L. Richesson, PhD Shelley A. Rusincovitch Michelle M. Smerek

More information

A Service Oriented Approach for Guidelines-based Clinical Decision Support using BPMN

A Service Oriented Approach for Guidelines-based Clinical Decision Support using BPMN e-health For Continuity of Care C. Lovis et al. (Eds.) 2014 European Federation for Medical Informatics and IOS Press. This article is published online with Open Access by IOS Press and distributed under

More information

Overview of the TACITUS Project

Overview of the TACITUS Project Overview of the TACITUS Project Jerry R. Hobbs Artificial Intelligence Center SRI International 1 Aims of the Project The specific aim of the TACITUS project is to develop interpretation processes for

More information

MEDICARE RISK ADJUSTMENT A PROSPECTIVE APPROACH TO RISK ADJUSTMENT AND ACCURATE DOCUMENTATION AND CODING

MEDICARE RISK ADJUSTMENT A PROSPECTIVE APPROACH TO RISK ADJUSTMENT AND ACCURATE DOCUMENTATION AND CODING MEDICARE RISK ADJUSTMENT A PROSPECTIVE APPROACH TO RISK ADJUSTMENT AND ACCURATE DOCUMENTATION AND CODING WHAT IS RISK ADJUSTMENT? Risk Adjustment ensures that accurate payments are made to Medicare Advantage

More information

DRAFT. Medical School Health IT Curriculum Topics. Matthew Swedlund, John Beasley, Erkin Otles, Eneida Mendonca

DRAFT. Medical School Health IT Curriculum Topics. Matthew Swedlund, John Beasley, Erkin Otles, Eneida Mendonca Medical School Health IT Curriculum Topics DRAFT Matthew Swedlund, John Beasley, Erkin Otles, Eneida Mendonca PC3. Use information technology to optimize patient care. 1. Attributes relating to appropriate

More information

Text Mining in. by Mark Greenwood

Text Mining in. by Mark Greenwood Text Mining in Electronic Medical Records by Mark Greenwood Medical Records care-plans diagnoses medical history prescriptions symptoms chronic/acute Uses? Primary Record information Communication between

More information

Evaluation of Negation Phrases in Narrative Clinical Reports

Evaluation of Negation Phrases in Narrative Clinical Reports Evaluation of Negation Phrases in Narrative Clinical Reports Wendy W. Chapman PhD 1, Will Bridewell BS 2, Paul Hanbury BS 1, Gregory F. Cooper MD PhD 1,2, and Bruce G. Buchanan PhD 1,2 1 Center for Biomedical

More information

Terminology Extraction from Log Files

Terminology Extraction from Log Files Terminology Extraction from Log Files Hassan Saneifar 1,2, Stéphane Bonniol 2, Anne Laurent 1, Pascal Poncelet 1, and Mathieu Roche 1 1 LIRMM - Université Montpellier 2 - CNRS 161 rue Ada, 34392 Montpellier

More information

2014/02/13 Sphinx Lunch

2014/02/13 Sphinx Lunch 2014/02/13 Sphinx Lunch Best Student Paper Award @ 2013 IEEE Workshop on Automatic Speech Recognition and Understanding Dec. 9-12, 2013 Unsupervised Induction and Filling of Semantic Slot for Spoken Dialogue

More information

Using Predictive Analytics to Reduce COPD Readmissions

Using Predictive Analytics to Reduce COPD Readmissions Using Predictive Analytics to Reduce COPD Readmissions Agenda Information about PinnacleHealth Today s Environment PinnacleHealth Case Study Questions? PinnacleHealth System Non-profit, community teaching

More information

Chronic Kidney Disease and Diabetes

Chronic Kidney Disease and Diabetes Anyone with diabetes can get kidney disease. Diabetes and high blood pressure are the most common causes of kidney disease, and people often have both. Chronic (long term) kidney disease (CKD) caused by

More information

Patient Safety: Achieving A New Standard for Care. Institute of Medicine Committee on Data Standards for Patient Safety November, 2003

Patient Safety: Achieving A New Standard for Care. Institute of Medicine Committee on Data Standards for Patient Safety November, 2003 Patient Safety: Achieving A New Standard for Care Institute of Medicine Committee on Data Standards for Patient Safety November, 2003 Outline Committee charge and definitions System support of patient

More information

Meaningful use. Meaningful data. Meaningful care. The 3M Healthcare Data Dictionary: Standardizing lab data to LOINC for meaningful use

Meaningful use. Meaningful data. Meaningful care. The 3M Healthcare Data Dictionary: Standardizing lab data to LOINC for meaningful use Meaningful use. Meaningful data. Meaningful care. The 3M Healthcare Data Dictionary: Standardizing lab data to LOINC for meaningful use Executive summary By using standard terminologies to report on core

More information

User Needs and Requirements Analysis for Big Data Healthcare Applications

User Needs and Requirements Analysis for Big Data Healthcare Applications User Needs and Requirements Analysis for Big Data Healthcare Applications Sonja Zillner, Siemens AG In collaboration with: Nelia Lasierra, Werner Faix, and Sabrina Neururer MIE 2014 in Istanbul: 01-09-2014

More information

Data Management. Shanna M. Morgan, MD Department of Laboratory Medicine and Pathology University of Minnesota

Data Management. Shanna M. Morgan, MD Department of Laboratory Medicine and Pathology University of Minnesota Data Management Shanna M. Morgan, MD Department of Laboratory Medicine and Pathology University of Minnesota None Disclosures Objectives History of data management in medicine Review of data management

More information

Accountable Care: Implications for Managing Health Information. Quality Healthcare Through Quality Information

Accountable Care: Implications for Managing Health Information. Quality Healthcare Through Quality Information Accountable Care: Implications for Managing Health Information Quality Healthcare Through Quality Information Introduction Healthcare is currently experiencing a critical shift: away from the current the

More information

PharmaSUG2011 Paper HS03

PharmaSUG2011 Paper HS03 PharmaSUG2011 Paper HS03 Using SAS Predictive Modeling to Investigate the Asthma s Patient Future Hospitalization Risk Yehia H. Khalil, University of Louisville, Louisville, KY, US ABSTRACT The focus of

More information

Terminology Services in Support of Healthcare Interoperability

Terminology Services in Support of Healthcare Interoperability Terminology Services in Support of Healthcare Russell Hamm Informatics Consultant Apelon, Inc. Co-chair HL7 Vocabulary Workgroup Outline Why Terminology Importance of Terminologies Terminologies in Healthcare

More information

Making Sense of Physician Notes: A Big Data Approach. Anupam Joshi UMBC joshi@umbc.edu Joint work with students, UBMC Colleagues, and UMMS Colleagues

Making Sense of Physician Notes: A Big Data Approach. Anupam Joshi UMBC joshi@umbc.edu Joint work with students, UBMC Colleagues, and UMMS Colleagues Making Sense of Physician Notes: A Big Data Approach Anupam Joshi UMBC joshi@umbc.edu Joint work with students, UBMC Colleagues, and UMMS Colleagues Where we are Significant progress in applying NLP and

More information

HIT Standards Committee Transitional Vocabulary Task Force Wednesday October 14th, 2015, 10:30 A.M. 12:00 P.M. Et

HIT Standards Committee Transitional Vocabulary Task Force Wednesday October 14th, 2015, 10:30 A.M. 12:00 P.M. Et HIT Standards Committee Transitional Vocabulary Task Force Wednesday October 14th, 2015, 10:30 A.M. 12:00 P.M. Et Agenda Overview Of The Transitional Vocabularies Task Force Workgroup Charge And Workplan

More information