OpenFurther: An Infrastructure for Clinical & Translational Research in a Distributed Environment

Similar documents
F OpenFurther. OpenFurther: Federating and generating OMOP datasets. Center for Clinical and Translational Sciences

From Fishing to Attracting Chicks

STRIDE An Integrated Standards-Based Translational Research Informatics Platform

How to extract transform and load observational data?

Theodoros. N. Arvanitis, RT, DPhil, CEng, MIET, MIEEE, AMIA, FRSM

OpenCDS: an Open-Source, Standards-Based, Service-Oriented Framework for Scalable CDS

Ernesto Ongaro BI Consultant February 19, The 5 Levels of Embedded BI

Employing SNOMED CT and LOINC to make EHR data sensible and interoperable for clinical research

Adam Rauch Partner, LabKey Software Extending LabKey Server Part 1: Retrieving and Presenting Data

Public Health and the Learning Health Care System Lessons from Two Distributed Networks for Public Health

Pediatric Research Database

Terminology Services in Support of Healthcare Interoperability

I n t e r S y S t e m S W h I t e P a P e r F O R H E A L T H C A R E IT E X E C U T I V E S. In accountable care

Achilles a platform for exploring and visualizing clinical data summary statistics

A Semantic Foundation for Achieving HIE Interoperability

Meaningful use. Meaningful data. Meaningful care. The 3M Healthcare Data Dictionary: Standardizing lab data to LOINC for meaningful use

Director for Center for Health Data and Informatics. Deputy Director Utah Department of Health Public Health and IT

Find the signal in the noise

IDRT: Platform Architecture And Tools to Support The Re-use of Routine Clinical Data For Research

ehr Sharable Data Vicky Fung Senior Health Informatician ehr Information Standards Office

Environmental Health Science. Brian S. Schwartz, MD, MS

Meaningful use. Meaningful data. Meaningful care. The 3M Healthcare Data Dictionary: Enabling effective health information exchange

Apps to display patient data, making SMART available in the i2b2 platform

CloudCERT (Testbed framework to exercise critical infrastructure protection)

EHR Standards Landscape

Reverse Engineering in Data Integration Software

Medical Informatics: A Fishing Story

3M Health Information Systems

OpenCDS: an Open-Source, Standards-Based, Service-Oriented Framework for Scalable CDS

Property & Casualty Insurance Solutions from CCS Technology Solutions

Meaningful Use Stage 2 Update: Deploying SNOMED CT to provide decision support in the EHR

Briefing on ehr Content Hong Kong Clinical Terminology Table (HKCTT) By Maggie LAU Health Informatician, ehriso

Eligible Professionals please see the document: MEDITECH Prepares You for Stage 2 of Meaningful Use: Eligible Professionals.

Use of the Research Patient Data Registry at Partners Healthcare, Boston

Standardized Terminologies Used in the Learning Health System

Clinical Data Services

Interoperability and Analytics February 29, 2016

Setting the World on FHIR

Utility of Common Data Models for EHR and Registry Data Integration: Use in Automated Surveillance

Clinical Decision Support Consortium Knowledge Management Overview

A Framework to Assess VistA Open-Source SOA-Stacks

Meaningful use. Meaningful data. Meaningful care. The 3M Healthcare Data Dictionary (HDD): Implemented with a data warehouse

Cerner i2b2 User s s Guide and Frequently Asked Questions. v1.3

Informatics Domain Task Force (idtf) CTSA PI Meeting 02/04/2015

IDRT: Integration and Maintenance of Medical Terminologies in i2b2

Graph database approach to management and use of SNOMED CT encoded clinical data W. Scott Campbell, Jay Pedersen, James R. Campbell University of

TRANSFoRm: Vision of a learning healthcare system

Considering Third Generation ediscovery? Two Approaches for Evaluating ediscovery Offerings

Practical Implementation of a Bridge between Legacy EHR System and a Clinical Research Environment

Electronic Health Record (EHR) Standards Survey

Open Source Software Tools for Electronic Clinical Quality Measures

SNOMED CT in the EHR Present and Future. with the patient at the heart

Classifying Adverse Events From Clinical Trials

IBM Watson and Medical Records Text Analytics HIMSS Presentation

Software Architecture Document

Preparing Electronic Health Records for Multi-Site CER Studies

The Healthcare Services Platform Consortium

A Service-oriented Architecture for Business Intelligence

Karl Lum Partner, LabKey Software Evolution of Connectivity in LabKey Server

INFORMATION TECHNOLOGY FOR UNIVERSAL HEALTH COVERAGE (IT4UHC) September 2013 Manila, Philippines

Building the European Biodiversity. Observation Network (EU BON)

Open Source Modular Units for Electronic Patient Records. Hari Kusnanto Faculty of Medicine, Gadjah Mada University

Building patient-level predictive models Martijn J. Schuemie, Marc A. Suchard and Patrick Ryan

Clinical and research data integration: the i2b2 FSM experience

Research Opportunities using the PaTH Network

Standards and their role in Healthcare ICT Strategy. 10th Annual Public Sector IT Conference

Health Information Exchange in Minnesota & North Dakota

A collaborative platform for knowledge management

ConnectVirginia EXCHANGE Onboarding and Certification Guide. Version 1.4

New Jersey Department of Health. Electronic Laboratory Reporting On-Boarding Manual. Version 1.4

FDA's Mini-Sentinel Program to Evaluate the Safety of Marketed Medical Products. Progress and Direction

Natural Language Processing in the EHR Lifecycle

Graph Database Performance: An Oracle Perspective

Transcription:

OpenFurther: An Infrastructure for Clinical & Translational Research in a Distributed Environment AMIA Systems Demonstration November 19, 2013 Center for Clinical and Translational Sciences

Overview Introduction OpenFurther Technical Components of OpenFurther Deployments Demonstration of Federated Queries

Distributed Computing Environments Industry Semantic Complexity Penetration Finance Low High Travel Medium High Retail Low High Engineering High Medium Defense High High Translational & Clinical Sciences and Practice Very High Very Low

Federating Infrastructures Static Federation (cagrid) Dynamic Federation (OpenFurther) CDM CDM CDM CDM CDM to DM Static Translation Layer to Common Data Model DM DM DM DM CDM to DM F CDM to DM Static Translation/Aggregation Layer CDM to DM Dynamic Translation Layer Static Aggregation (i2b2)

Heterogenous Federation FADAPT ADAPT ADAPT ADAPT ADAPT ADAPT ADAPT ADAPT ADAPT

Design Goals Modular design to support component based implementation On-the-fly real-time federation of health information from heterogeneous data sources Data source partners do not need to extract data and/or build a new database Data remains in its native format Is as up-to-date as the data source Join data from multiple sources for research Framework to support granular security control to join targeted data across data sources

Typical Applications Cohort Finding for Prospective Research Comparative Effectiveness Research (CER) Infrastructure Public Health Surveillance Datasets for Observational Studies

OpenFurther : Demo Version Open Source Instance Demonstrative version that can be downloaded, tried, modified and deployed for testing and experimentation purposes. F Public Datasets: OMOP: Secondary data for Comparative Effectiveness Research OpenMRS: Medical Record System Release: AMIA 2013 http://openfurther.org/

OpenFurther Technical Architecture

OpenFurther Utilizes components available from standards organizations and open source initiatives Service Oriented Architecture (SOA) and Enterprise Service Bus Relevant to national projects Architecture is open and sharable. Systematically support centralized and distributed governance models.

Component Overview Query Tool! Federated Query Engine! Data Source Adapters! Admin & Security Components! Virtual Identity Resolution on the GO (VIRGO)! Quality & Analytics Framework! Metadata Repository! Terminology/ Ontology Server Query Tool Security Counts & Data Quality Analysis Terminology Server F Metadata Repository VIRGO ADAPT ADAPT ADAPT ADAPT Security Data Sources

i2b2/openfurther Query Tool Architecture Web Server JBoss/Tomcat - Axis2 Webapp i2b2 Web Client XML Query InterceptingFilter web.xml! configuration F FQE Web! Service QueryToolService

Federated Query Engine FQE Storage In Memory 1 FQE! S T A T U S 3 Status Msgs A A B B R E S U L T 4 Result Msgs 2 Query Topic Subscribe Subscribe Data Source Façade Layer Data Source 1 Adapter ADAPT In Memory Data Source 2 Adapter ADAPT

Data Source Adapters F FQE Layer Subscribed Status Result Datasource Façade 1 Initialize MDR 2 Query&Translation DTS 3 Execution 4 Result Translation Data Source

Security Components Unauthenticated Authentication System LDAP, CAS, SAML Authentication:! Currently support Central Authentication Service (CAS) against LDAP!! Future needs require Federated Authentication using SAML Authenticated to OpenFurther Contextual:!! User properties, roles, and authorization is different depending on the context (namespace) Security Module: OpenFurther Namespace Federated Query Engine Request Topic Authorization:!! Authorization happens at multiple levels: within the FQE and within the data source adapters.!! Authorization within the OpenFurther namespace may be different than within the data source adapter (a different namespace).!! Already Authorized:!! Some data source adapters may not even require explicit authorization. Subscribed Subscribed Security Module: DS1 Namespace Data Source Adapter Data Source Adapter

Virtual Identity Resolution on the GO (VIRGO) On the fly demographics analysis for record linkage! No permanent PHI storage Response composed of OpenFurther IDs Algorithms! Field-specific weights Adaptable to others Technologies used! Java, Groovy, Grails Web-services MySQL (temp storage) ElasticSearch (indexing)

Quality & Analytics Framework A service oriented architecture that can assess the quality of heterogeneous electronic health data sources in a distributed environment. User! Interface Quality! Service Adapters F Data Sources Quality Analysis Repository Terminology! Services Metadata! Services

Metadata Repository (MDR) Built in-house, standards-based Artifacts (Knowledge) Logical Models, Local Models, model mappings Administrative information Descriptive information XQuery Translation Programs Models supported - Open Source Initiative Local: FURTHeR, UUEDW, UPDBL, Intermountain Healthcare Datamart Public: OMOP, i2b2, OpenMRS

Translating Metadata Translating Coded Values Value Data Element Value Code Namespace Value Label OpenFurther.Person.administrativeGender SNOMED 248153007 Female MDR Translate Metadata Translate Value DTS OMOP.Person.genderConceptId OMOP 2940 Female

Query Translation Query: Females age 30 to 40 who have diabetes OpenFurther Query administrativegender = SNOMED:248153007! AND age between 30 and 40! AND observation.observationcode = ICD9:250 Translate Metadata Translate Metadata Translate Metadata Translate Code Translate Code Translate Code OMOP Query genderconceptid = 2940! AND yearofbirth between 1972 and 1982! AND conditionoccurence.conditionconceptid = 64547!

Result Translation 1,000 records require > 12,000 translations (Person data class only) Get Master Person ID OMOP.Patient personid! yearofbirth! genderconceptd! raceconceptid! ethnicityconceptid! providerid! caresiteid!! Calculate Age Translate Metadata Translate Metadata Translate Metadata Translate Code Translate Metadata Translate Code OpenFurther.Person masterpatientid! age! dateofbirth! administrativegender! race! maritalstatus! religion! dateofdeath! pedigreequality! Translate Metadata Translate Code Translate Metadata Translate Code

Terminology/Ontology Server Apelon s Distributed Terminology Server an integrated set of open source components that provides comprehensive terminology services in distributed application environments Provides tools for: Management standard and local terminologies Mapping local to standards Browsing & Searching terminologies Creation subsets Extension of standards terminologies Versioning of terminologies OpenFurther includes a layer of web-services that leverage DTS APIs

DTS Terminology Content Within UU s Installation! ~25 Standard Namespaces (ICD-9,ICD-10, SNOMED, LOINC, RxNorm & More) ~33 Local Namespaces! ~1 Million Concepts! ~2 Million Total Mappings! Subscribe for content updates With Demo Installation! Free content available from Apelon (subsets of ICD9, SNOMED, LOINC)! Local content for two simulated data sources! Schultz Cancer Repository (OMOP Data Source) Schultz Healthcare Systems (OpenMRS Data Source)! Mappings between local data sources and standards! Additional content could obtained from their respective sources or Apelon.

Deployments University of Utah FURTHeR: Cohort Identification & Datasets for Analysis PHIS+: Aggregated database for performing Comparative Effectiveness Research University of North Carolina: Cohort Identification

University of Utah Hospitals & Clinics FURTHeR Huntsman Cancer Institute Clinical Intermountain Healthcare VAMC! Salt Lake City Genotypic Phenotypic Public Health Genealogical Other Partners F Utah Department of Health Vision for OpenFurther at the University of Utah

Comparative Effectiveness Research Infrastructure

PHIS+ PHIS PHIS+ Augment Children s Hospital Association s (CHA) existing electronic database of administrative data - Pediatric Health Information System (PHIS) with clinical data to conduct Comparative Effectiveness Research studies. UU Biomedical Informatics - Informatics Partners Agency for Healthcare Research and Quality (AHRQ) funded project.

PHIS+ Overview 3" Data$Streams$ 4) CER$Studies$ 5$ Years&Data& Laboratory" Pneumonia) 2007$ $2011$ Microbiology" Appendici.s) 2009$ $Development$ Radiology" Osteomyeli.s) 2012.$ Gastroesophageal) Reflux)Disease)

The PHIS+ Process 6 2 5 4 3 1 6 Pediatric Research in Inpatient Setting (PRIS) Sites

Developmental Process Overview Narus et. al, Federating Clinical Data from Six Pediatric Hospitals: Process and Initial Results from the PHIS+ Consortium. AMIA 2011

PHIS+ CER Database 2007-11 Laboratory Site Results LOINC Lab Test Code A 15,011,312 538 B 33,214,540 1,214 C 16,868,383 860 D 25,706,608 1,089 E 38,422,668 1,016 F 14,507,629 2,131 Total 143,731,140 *6,848 (2,992) 1,654,406 Kids Site Radiology Reports CPT Radiology Procedure Code A 445,681 280 B 1,151,383 349 C 635,458 296 D 980,740 482 E 1,098,693 497 F 201,708 477 Total 4,513,663 *2,381 (714) Microbiology Site Culture Results SNOMED Specimen Code SNOMED Culture Procedure Code SNOMED Organism Code RxNorm Antimicrobial Code Susceptibility Results LOINC Susceptibility Test Code A 247,933 114 70 113 57 487,813 97 B 359,780 58 42 56 58 393,594 85 C 231,071 179 46 162 59 340,100 99 D 335,606 110 34 145 57 376,844 75 E 486,315 130 56 160 59 605,000 76 F 176,848 264 71 121 51 283,865 89 Total 1,837,553 *855 (451) *319 (95) *757 (203) *341 (74) 2,487,216 *521 (136) * The first number is the total number of standard codes, the second in parenthesis is the distinct number of standard codes across all sites.

Thank You

Contribute to F OpenFurther Learn more at: openfurther.org To contribute, join and/or post on our mailing lists, submit a pull request on GitHub, or a bug on JIRA.!! Community email Lists: openfurther-user@googlegroups.com for user and implementation discussion. openfurther-dev@googlegroups.com for development discussion. openfurther-security@googlegroups.com for security sensitive issues and discussions. otherfurther-commits@googlegroups.com for developer commits to version control! Source code: http://github.com/openfurther/!! Reference manual:! https://github.com/openfurther/further-open-doc/blob/master/reference-manual.asciidoc! Bugs & feature requests: https://openfurther.atlassian.net

Acknowledgements Current OpenFurther Team Rick Bradshaw, MS Dustin Schultz, MS Ryan Butcher, MS Ram Gouripeddi, MBBS, MS Randy Madsen, BS Phillip Warner, MS Peter Mo, MS Naresh Sundar Ranjan, MS Bernie La Salle, BS Julio Facelli, PhD Collaborations!! UU EDW Team UU PPR/UPDB Team UU CHPC! Collaboration Partners Intermountain Healthcare SLC VAMC UDOH PHIS+ Team members across 6 institutions Children s Hospitals Association Apelon Center for High Performance Computing, UU OpenFurther supported by the NCRR/ NCATS Grants UL1RR025764 and 3UL1RR025764-02S2, National Center for Clinical and Translational Science 1UL1TR001067, along with the University of Utah Research Foundation, and grant 1D1BRH20425 (DHHS). PHIS+ funded by grant R01 HS019862 from AHRQ, (DHHS). VIRGO (MPI) supported by NLM Grant 5RC2LM010798. REDCap is supported by Center for Clinical and Translational Sciences grant support 8UL1TR000105.

References Gouripeddi R, Mitchel JA, Narus SP et al. Federating Clinical Data from Six Pediatric Hospitals: Process and Initial Results for Microbiology from the PHIS+ Consortium AMIA Annu Symp Proc. 2012 (in press).! Narus SP, Srivastava R, Gouripeddi R, et al. Federating Clinical Data from Six Pediatric Hospitals: Process and Initial Results from the PHIS+ Consortium. AMIA Annu Symp Proc. 2011;2011:994 1003. http://www.ncbi.nlm.nih.gov/pubmed/22195159.! Bradshaw RL, Matney S, Livne OE, et al. Architecture of a Federated Query Engine for Heterogeneous Resources. AMIA Annu Symp Proc. 2009;2009:70 74. http://www.ncbi.nlm.nih.gov/pmc/articles/ PMC2815441/! Livne OE, Schultz ND, Narus SP. Federated Querying Architecture with Clinical & Translational Health IT Application. Journal of Medical Systems. 2011;35(5):1211 1224. http://dl.acm.org/citation.cfm? id=1883028! Matney S, Bradshaw RL, Livne OE, et al. Developing a Semantic Framework for Clinical and Translational Research. 2011 AMIA Summit on Translational Bioinformatics. Available at: http:// proceedings.amia.org/16pc81/! Schultz ND, Bradshaw RL, Mitchell JA. Utilizing Previous Result Sets as Criteria for New Queries within FURTHeR. 2012 SHARPn Summit on Secondary Use, Rochester, MN.! He S, Hurdle JF, Botkin JR Narus SP. Integrating A Federated Healthcare Data Query Platform With Electronic IRB Information Systems, AMIA Annu Symp Proc. 2010 Nov 13:2010:291-5. PubMed PMID: 21346987. http://www.ncbi.nlm.nih.gov/pubmed/21346987! Rocha RA, Hurdle JF, Matney S, Narus SP, Meystre S, LaSalle B, Deshmukh V, Hunter C, Mineau GP, Facelli JC, Mitchell JA. Utah s Statewide Informatics Platform for Translational & Clinical Science, AMIA Annu Symp Proc. 2008 Nov 6:1114. PubMed PMID: 18999122 http://www.ncbi.nlm.nih.gov/ pubmed/18999122! Website: http://openfurther.org