EHR Search Engine SIBM. SJ. Darmoni, R. Lelong, C. Cabot, B. Dahamna



Similar documents
Find the signal in the noise

The Electronic Healthcare Record for Clinical Research (EHR4CR) information model and terminology

Practical Implementation of a Bridge between Legacy EHR System and a Clinical Research Environment

EHR Standards Landscape

Personalized Medicine: Humanity s Ultimate Big Data Challenge. Rob Fassett, MD Chief Medical Informatics Officer Oracle Health Sciences


SDMX technical standards Data validation and other major enhancements

How To Use Data Analysis To Get More Information From A Computer Or Cell Phone To A Computer

THE EHR4CR PLATFORM AND SERVICES

<is web> Information Systems & Semantic Web University of Koblenz Landau, Germany

Presente e futuro del Web Semantico

Industry 4.0 and Big Data

SemWeB Semantic Web Browser Improving Browsing Experience with Semantic and Personalized Information and Hyperlinks

I n t e r S y S t e m S W h I t e P a P e r F O R H E A L T H C A R E IT E X E C U T I V E S. In accountable care

Big Data and Text Mining

DISCUSSION PAPER ON SEMANTIC AND TECHNICAL INTEROPERABILITY. Proposed by the ehealth Governance Initiative Date: October 22 nd, 2012

Improving EHR Semantic Interoperability Future Vision and Challenges

Use Cases for Argonaut Project. Version 1.1

SOLUTION BRIEF. IMAT Enhances Clinical Trial Cohort Identification. imatsolutions.com

FHIM Model Content Overview

Taming Big Data Variety with Semantic Graph Databases. Evren Sirin CTO Complexible

An EVIDENCE-ENHANCED HEALTHCARE ECOSYSTEM for Cancer: I/T perspectives

Clinical and research data integration: the i2b2 FSM experience

Meaningful use. Meaningful data. Meaningful care. The 3M Healthcare Data Dictionary (HDD): Implemented with a data warehouse

Scalable End-User Access to Big Data HELLENIC REPUBLIC National and Kapodistrian University of Athens

Semantic Search in Portals using Ontologies

UniGR Workshop: Big Data «The challenge of visualizing big data»

A Semantic Model for Multimodal Data Mining in Healthcare Information Systems

UIMA and WebContent: Complementary Frameworks for Building Semantic Web Applications

COURSE SYLLABUS COURSE TITLE:

How To Understand The Difference Between Terminology And Ontology

Patient-Centric Secure-and-Privacy-Preserving Service-Oriented Architecture for Health Information Integration and Exchange

ezdi s semantics-enhanced linguistic, NLP, and ML approach for health informatics

Application of ontologies for the integration of network monitoring platforms

Models Supporting Development of Complex Information Systems in Healthcare. Case study: an Obstetrics-Gynecology Department

Applying Standards in Cross-sector Communication for an Integrated Health Environment

EHR Archetypes in practice: getting feedback from clinicians and the role of EuroRec

HL7 and SOA Based Distributed Electronic Patient Record Architecture Using Open EMR

COMP9321 Web Application Engineering

Overview. DW Source Integration, Tools, and Architecture. End User Applications (EUA) EUA Concepts. DW Front End Tools. Source Integration

Microsoft End to End Business Intelligence Boot Camp

Reasoning / Medical Context / Physicians Area of Competence

The Big Picture: IDNT in Electronic Records Glossary

EHR Systems: an Introduction

Clinical Knowledge Manager. Product Description 2012 MAKING HEALTH COMPUTE

Ontology-based Archetype Interoperability and Management

Module 1: Getting Started with Databases and Transact-SQL in SQL Server 2008

ONTOLOGY-BASED APPROACH TO DEVELOPMENT OF ADJUSTABLE KNOWLEDGE INTERNET PORTAL FOR SUPPORT OF RESEARCH ACTIVITIY

Vendor briefing Business Intelligence and Analytics Platforms Gartner 15 capabilities

M3039 MPEG 97/ January 1998

SNOMED CT Cardiology Reference Set Development, Malaysia. SNOMED CT Conference Amsterdam, The Netherlands October 2014

Open Ontology Repository Initiative

Electronic Health Record (EHR) Standards Survey

Writing Queries Using Microsoft SQL Server 2008 Transact-SQL

A Model-based Software Architecture for XML Data and Metadata Integration in Data Warehouse Systems

Semantic SharePoint. Technical Briefing. Helmut Nagy, Semantic Web Company Andreas Blumauer, Semantic Web Company

Leveraging Big Data Technologies to Support Research in Unstructured Data Analytics

Presenting data: how to convey information most effectively Centre of Research Excellence in Patient Safety 20 Feb 2015

Reinventing Business Intelligence through Big Data

Exploration and Visualization of Post-Market Data

Reference Architecture, Requirements, Gaps, Roles

Eliminating Barriers to Genuine Health Information Exchange. Copyright 2014 Allscripts Healthcare Solutions, Inc. 1

Business Intelligence: Recent Experiences in Canada

Semantically Steered Clinical Decision Support Systems

Advanced Aspects of Hospital Information Systems

Standardised and Flexible Health Data Management with an Archetype Driven EHR System (EHRflex)

Recognition and Privacy Preservation of Paper-based Health Records

A Knowledge Management Framework Using Business Intelligence Solutions

Problem-Centered Care Delivery

Ernestina Menasalvas Universidad Politécnica de Madrid

Transcription:

EHR Search Engine SIBM SJ. Darmoni, R. Lelong, C. Cabot, B. Dahamna RAVEL project, funded by French Research Agency (Technologies for Health Program) 1

Context and projects Establishment of EHR (Electronics Health Record) is becoming democratized. More and more structured and heterogenous patient data. Information Retrieval on EHR Ergonomic visualization of EHR data «Easy input of medical data» Goal : To build Ergonomic user interfaces acceptable to doctors so that the EHR can be indexed with reference terminologies. «Retrieval And Vizualisation of ELectronic health records» Goal : To perform information retrieval on patient data medical decision-support tool and have efficient holistic views of those data suitable for a given issue. 2

Search & Visualisation EHR Search engine Query on metadata and structured data Full-text query Domain ontologies Semantic T/O & index Index Integration of heteregenous data HeTOP T/O server Data warehouse metadata and structured data documents SERVICES Indexation Annotation Déidentification CDA structured data & unstructured data ETL Semantic enrichment during ETL CDA Healthcare production Semantic enrichment into EHR

RIDOPI search engine Object oriented query Language without SQL-like syntax Search Engine Grammar Generic EAV (Entity Attribute Value) data model Database 2 000 patients with heavy medical history (>20 procedures) 4

IR on EHR «Old» documentary information retrieval New information retrieval scope DOCUMENT TITLE : therapy for asthma PUBLICATION_DATE : 2008 PATIENT ESTABLISHEMENT PUBLISHER NAME : HAS DESCRIPTOR LABEL : asthma STAY MEDICAL UNIT OBJECT_X Property Oriented Formalism MEDIACL TESTINGS MEDICAL PROCEDURES ICD10 codes Use of «field codes» discriminating/specifying/constraining returned object via its directs property/attributes/fields (datatype or object property) 1. restricted query scope : max reachable property depth = 1 2. Irrevelante for interdependent constraints Examples : Asthma.mc (Documents indexed with «asthma» keyword (.mc)) has.ed (Documents which publisher (.ed) is HAS) asthme.ti (Documents which title (.ti) contains asthma) xxx.objectx (not managed because object has a depth > 1) in relation to document object) BIOLOGICAL RESULT (LOINC) code CCAM code (French Terminology) Object oriented formalism MEDICAL UNIT 1. Querys constraint scope not limited : depth > 1 2. Multiple objects related to each other handled : Constraint can be defined in relation to one other 5

Object oriented query language Formalism description : Query language based on the use of query unit of pattern : «OBJECT(CONSTRAINTS)» A unit points out the set of «OBJECTS» satisfying the contraints described in «CONSTRAINTS» «CONSTRAINTS» is a boolean query and accepts AND/OR/NOT operators to link contraints together Samples : CONSTRAINT can be simple direct attributes of OBJECTS : patient(datenaissancepatient=1937-01-01 AND sexepatient="m") patient(birthdate=1937-01- 01 AND gender="m") CONSTRAINT can be an entire query unit : patient( acte( dateacte = 2005-01-01 ) ) patient(act(actdate=2005-01-01)) «return patients which acts occured on 2005-01-01» patient(sejour(acte(codeccam()))) patient(stay(act(ccamcode()))) «return patients which stays which acts wich CCAM code satisfyies» 6

Additional functionalities This formalism handle symbolic data as before in Doc CISMeF, but handle also now : Numeric data : 1. Operations «+», «-», «/», «*» 2. comparators «<», «<=», «>», «>=», «=», «!=» analyse(codeexeresultatbiologique(label="phosphore") AND 1.17<valeurNumeriqueAnalyse<=2-0.81) biologicaltest(biologicalresultexecode(label="phosphore") AND 1.17<numericValue<=2-0.81) Chronological data : 1. Operation : «+», «-» 2. Comparators : «<», «<=», «>», «>=», «=», «!=» patient(datenaissancepatient>today-365*50) patient(birthdate>today-365*50) sejour(6<=datesortiesejour-dateentreesejour<7) stay(6<=outdate-indate<7) 7

Query sample Query patient() Means All patients patient(sexepatient="m" ET datenaissancepatient>1950-12-31) patient(gender= M AND birthdate>1950-12-31) patient(sexepatient="f" ET acte(label="prelevement SANGUIN")) patient(gender= F AND act(label= PRELEVEMENT SANGUIN )) sejour(patient(id="dm_pat_21") AND acte(label="prelevement SANGUIN")) stay(patient(id= DM_PAT_21 ) AND act(label= PRELEVEMENT SANGUIN )) unitemedicale(sejour(patient(id="dm_pat_21") AND acte(label="prelevement SANGUIN"))) medicalunit(stay(patient(id="dm_pat_21") AND act(label="prelevement SANGUIN"))) Male patients born after 31-12-1950 Female patients who have been given a blood test Stays of the DM_PAT_21 patient in wich a blood test act occured Medical units in wich stays of the patient DM_PAT_21 and in wich a blood test act occured. acte(codeccam(id="cca_am_eqqm006") AND dateacte="max") act(ccamcode(id="cca_am_eqqm006") AND actdate="max") analyse(patient(id="dm_pat_1078") AND codeexeresultatbiologique(label="plaquettes") AND 10*valeurNumeriqueAnalyse<borneInfAnalyse) biologicaltest(patient(id="dm_pat_1078") AND biologicalresultexecode (label="plaquettes") AND 10*numericValue<lowLimit) Last act of a given CCAM code Platelet test of the patient DM_PAT_1078 which result is more than 10 times lower to the normal threshold. 8

Visualization RIDoPI : «Information Retrieval on EHR» URL : http://cispro.chu-rouen.fr/ridopi/ 2 main contexts Multi-patient context = free querying among patients, acts, stays mono-patient context = visualization of patient data Free querying 9

Mono patient context View per patient Patients data categories (stays, acts, biological tests ) Patients data 10

Stays 11

Biological tests kreatinin Visualization of several test on the same graph 12

Querying in mono patient context Query in monopatient context Mono-patient Result of the query 13

Free querying cholecystectomy Pattern matching Multi-patient query Mono-patient constraint Back to mono-patient view Lelong, R et coll. Rewriting Natural Language Queries using Patterns in an Electronic Health Records System. European Conference on Information Retrieval; Workshop on Multimodal Retrieval in the Medical Domain, 14

Integration of NLP tools into RAVEL search engine

Extension to omics data All Missense variants for patient DM_PAT_125

Expression : genes

Scalability Study with Prof. Musette on every patient who came in dermatology since 1985 60.000 patients (nominative data) Six men-months to perform the correct ETL No local help (computing department) No help from the vendor (expected income 0 K)

Discussion/Conclusion Powerful generic tool (but rather complex) LISP Management of double context (mono & multi-patient) Acute international competition I2B2/Transmart, already implemented in 70 UH Benchmark comparing functionalities of RIDOPI vs. I2B2/Transmart

Questions? Email: Stefan.Darmoni@chu-rouen.fr