IDRT: Platform Architecture And Tools to Support The Re-use of Routine Clinical Data For Research



Similar documents
The Integrated Data Repository Toolkit (IDRT)

IDRT: Integration and Maintenance of Medical Terminologies in i2b2

Windows Installation Guide

Full VM Tutorial. i2b2 Desktop Installation (Windows) Informatics for Integrating Biology and the Bedside

i2b2 Installation Guide

Project Management (PM) Cell

Practical Implementation of a Bridge between Legacy EHR System and a Clinical Research Environment

Software Architecture Document

Bringing Semantics to the i2b2 Framework

Oracle Data Integrator integration with OBIEE

Third-Party Software Support. Converting from SAS Table Server to a SQL Server Database

Implementing a Data Warehouse with Microsoft SQL Server 2012

Enterprise Service Bus

Informatics for Integrating Biology and the Bedside. User Guide. Project Request. Document Version: I2b2 Software Release: 1.

Project (Group) Management Installation Guide (Linux) Version 1.3. Copyright 2007 MGH

Implementing a Data Warehouse with Microsoft SQL Server 2012 (70-463)

StreamServe Persuasion SP4

Quick start. A project with SpagoBI 3.x

East Asia Network Sdn Bhd

cloud4health cloud-based textmining to exploit the value of freetext documentation within electronic health records Dr.

IT Infrastructure Components to support clinical care and translational research projects in a comprehensive cancer center

Oracle BI Applications (BI Apps) is a prebuilt business intelligence solution.

Course Outline. Module 1: Introduction to Data Warehousing

Use of the Research Patient Data Registry at Partners Healthcare, Boston

COURSE 20463C: IMPLEMENTING A DATA WAREHOUSE WITH MICROSOFT SQL SERVER

a retrieval information system for medical research Convergence with I2B2

Implementing a Data Warehouse with Microsoft SQL Server

Informatics for Integrating Biology & the Bedside. i2b2 Workbench Developer s Guide. Document Version: 1.0 i2b2 Software Release: 1.3.

Course Outline: Course: Implementing a Data Warehouse with Microsoft SQL Server 2012 Learning Method: Instructor-led Classroom Learning

Software Architecture Document

KonyOne Server Installer - Linux Release Notes

Implementing a Data Warehouse with Microsoft SQL Server MOC 20463

COURSE OUTLINE MOC 20463: IMPLEMENTING A DATA WAREHOUSE WITH MICROSOFT SQL SERVER

Implementing a Data Warehouse with Microsoft SQL Server 2012 MOC 10777

Implementing a SQL Data Warehouse 2016

Implementing a Data Warehouse with Microsoft SQL Server

Apps to display patient data, making SMART available in the i2b2 platform

Kony MobileFabric. Sync Windows Installation Manual - WebSphere. On-Premises. Release 6.5. Document Relevance and Accuracy

Talend for Data Integration guide

Implementing a Data Warehouse with Microsoft SQL Server 2014

Implement a Data Warehouse with Microsoft SQL Server 20463C; 5 days

Vanilla44 New Features

Data processing goes big

Palo Open Source BI Suite

Define ODBC Database Library using Management Console

OLAP Cube Manual deployment and Error resolution with limited licenses and Config keys

Implementing a Data Warehouse with Microsoft SQL Server

Release Bulletin Sybase ETL Small Business Edition 4.2

Implementing a SAS 9.3 Enterprise BI Server Deployment TS-811. in Microsoft Windows Operating Environments

How To Connect A Java To A Microsoft Database To An Ibm.Com Database On A Microsq Server On A Blackberry (Windows) Computer (Windows 2000) On A Powerpoint (Windows 5) On An Ubio.Com

Microsoft SQL Server Express 2005 Install Guide

Setting up SQL Translation Framework OBE for Database 12cR1

Implementing Microsoft SQL Server 2008 Exercise Guide. Database by Design

Oracle Data Integrator for Big Data. Alex Kotopoulis Senior Principal Product Manager

Embarcadero DB Change Manager 6.0 and DB Change Manager XE2

Microsoft Data Warehouse in Depth

MICROSTRATEGY 9.3 Supplement Files Setup Transaction Services for Dashboard and App Developers

Setting up the Oracle Warehouse Builder Project. Topics. Overview. Purpose

Integrating SAP BusinessObjects with Hadoop. Using a multi-node Hadoop Cluster

AWS Schema Conversion Tool. User Guide Version 1.0

Open Source Modular Units for Electronic Patient Records. Hari Kusnanto Faculty of Medicine, Gadjah Mada University

Implementing and Maintaining Microsoft SQL Server 2008 Integration Services

IBM DB2 XML support. How to Configure the IBM DB2 Support in oxygen

Oracle SQL Developer Migration

UltraQuest Cloud Server. White Paper Version 1.0

Change Manager 5.0 Installation Guide

ITG Software Engineering

Implementing a Data Warehouse with Microsoft SQL Server 2012

Course 10777A: Implementing a Data Warehouse with Microsoft SQL Server 2012

Installing OneStop Reporting Products

Microsoft. Course 20463C: Implementing a Data Warehouse with Microsoft SQL Server

Installing The SysAidTM Server Locally

CONFIGURATION AND APPLICATIONS DEPLOYMENT IN WEBSPHERE 6.1

Implementing a Data Warehouse with Microsoft SQL Server 2012

Implementing a Data Warehouse with Microsoft SQL Server 2012

Integration Service Database. Installation Guide - Oracle. On-Premises

inforouter V8.0 Server & Client Requirements

Instant Data Warehousing with SAP data

HP Application Lifecycle Management

Data modelling methods in clinical trials: Experiences from the CTMND project (ctmnd.org)

SQL Server 2008 R2 Express Edition Installation Guide

Informatics for Integrating Biology and the Bedside. i2b2 User Guide. Import Data View. Document Version: I2b2 Software Release: 1.

Bitrix Site Manager ASP.NET. Installation Guide

Jet Data Manager 2012 User Guide

Compiere ERP & CRM Installation Instructions Linux System - EnterpriseDB

Beta: Implementing a Data Warehouse with Microsoft SQL Server 2012

Cúram Business Intelligence Reporting Developer Guide

10775A Administering Microsoft SQL Server 2012 Databases

Implementing a Data Warehouse with Microsoft SQL Server

2933A: Developing Business Process and Integration Solutions Using Microsoft BizTalk Server 2006

Course 20463:Implementing a Data Warehouse with Microsoft SQL Server

Installation Guide. Version 2.1. on Oracle Java Cloud Service

MS-10775: Administering Microsoft SQL Server 2012 Databases. Course Objectives. Required Exam(s) Price. Duration. Methods of Delivery.

Transcription:

First European i2b2 Academic User Meeting IDRT: Platform Architecture And Tools to Support The Re-use of Routine Clinical Data For Research The IDRT Team (in alphabetical order): Christian Bauer, Benjamin Baum, Jan Christoph, Igor Engel, Thomas Ganslandt, Matthias Löbe, Sebastian Mate (presenter), Daniel Plog, Hans-Ulrich Prokosch, Matthias Quade, Ulrich Sax, Sebastian Stäubert, Lars Voitel, Alfred Winter TMF - Technologie und Methodenplattform für die vernetzte medizinische Forschung e. V.

Introduction Secondary use of routine clinical data provides a big chance for medical research: To support clinical trials accurate determination and exploitation of the number of study patients through feasibility analysis and recruitment support optimize study design with retro perspective analysis re-use of routine data as a part of the study documentation To support projects with third-party funding e.g. Harvard hospitals (2005) 1 : 40% of the running projects use clinical warehouse data, received funding >130 million USD Competitive advances through exploitation of routine care data adequate infrastructure and access to data is necessary 1 Nalichowski R. Calculating the Benefits of a Research Patient Data Repository. Proc AMIA Symp. 2006;1044 1. European i2b2 Academic User Meeting - Results from the German Integrated Data Repository Toolkit Project Page 2

Introduction Short history of i2b2 i2b2 is the open source variant of the RPDR (Patient Research Data Registry) developed since 1999 at the Massachusetts General Hospital (MGH) and Partners Healthcare, Inc., which is in productive use since 2002 2005: already 1073 users used in several hospitals: MGH, BWH, FH, SRH, NWH 2004: request for applications by the NIH as part of their Roadmap for National Centers for Biomedical Computing (NCBC) Birth of i2b2 (Informatics for Integrating Biology and the Bedside): Application of i2b2 by I. Kohane(Harvard Medical School) and J. Glaser (CIO of Partners Healthcare) => received funding in 2005 as one of the NCBCs Since 2008: i2b2 is available for download 2010: second funding phase 1. European i2b2 Academic User Meeting - Results from the German Integrated Data Repository Toolkit Project Page 3

Introduction 1. European i2b2 Academic User Meeting - Results from the German Integrated Data Repository Toolkit Project Page 4

i2b2 ontology time line 1. European i2b2 Academic User Meeting - Results from the German Integrated Data Repository Toolkit Project Page 5

Introduction i2b2 s difficulties from a user s point of view: Installation & configuration complex: although a comprehensive documentation exists, much can go wrong Import of clinical data no integrated support (except for data in i2b2 format) especially not for common standard formats (ODM, CSV, 21) Import of German standard terminologies and non-standard terms i2b2 demo data only contains some US standard terminologies import to i2b2 has to be re-invented always again, although: standard terminologies already exist in all hospitals (supplied by DIMI: ICD, OPS, ) other standard formats (e.g. ODM, CSV) already include their metadata terms => why not develop a standard methodology to import those common terminologies? Bearing German data protection and privacy in mind: Standardized creation of pseudonyms and record linkage based on demographic data 1. European i2b2 Academic User Meeting - Results from the German Integrated Data Repository Toolkit Project Page 6

Goals source data source data? terminologies i2b2 ID management Courtesy of Thomas Ganslandt 1. European i2b2 Academic User Meeting - Results from the German Integrated Data Repository Toolkit Project Page 7

Basic Workplan Arbeitspaket AP1: Anforderungsanalyse AP1.1: Aktualisierung der vorliegenden Anforderungsanalyse 1 AP2: Plattform, Quelldatenauf-bereitung und -integration AP2.1: Bereitstellung integrierte Installation, Administration 3 AP2.2: Modul ETL-Tool 3 AP2.3: Modul Extraktion von ODM-Datenquellen 3 AP2.4: Modul Extraktion von tabellarischen Datenquellen 3 AP2.5: Package Anbindung 21-Datenquellen 3 AP3: Datenschutz AP3.1: Integration PID-Generator 3 AP4: Semantische Integration AP4.1: Bereitstellung Standardterminologien 3 AP5: Dissemination AP5.1: Best Practice-Dokumentation Ontologieaufbau 2 AP5.2: Realisierung Use Case Clinical DWH 2 AP5.3: Abschlussworkshop Installation & Administration Improve i2b2 Wizard Semantic Integration Provide standard terminologies for i2b2 and automatic mapping of data during import AP6: Projektkoordination & Abschlussbericht 1 PM ER PM GÖ PM LE 1 2 3 4 5 6 7 8 9 10 11 12 ETL for Common Data Formats ODM, 21, CSV, SQL Easy IDRT Import Tool Flexible & powerful Talend Jobs Data Protection & Privacy Integration oftmf PID-Generator Meilensteine 12 9 6 M1 M2 1. European i2b2 Academic User Meeting - Results from the German Integrated Data Repository Toolkit Project Page 8

IDRT Architecture Loading Concept Dimension Table Observation Fact Table Patient Dimension Table i2b2 Transformation Target Metadata Mapping Talend Open Studio Oracle Database Modelling Staging Standard Metadata Source Metadata Source Fact Data Oracle Database Extraction Individual extractors for each terminology CSV OWL... Configurable extractors for each type of datasource ODM SQL CSV PIDgen Talend Open Studio Java Datasources ICD LOINC... Standard Terminologies CDMS EHR 21 Operative Systems Source Systems 1. European i2b2 Academic User Meeting - Results from the German Integrated Data Repository Toolkit Project Page 9

IDRT Architecture Loading Concept Dimension Table Observation Fact Table Patient Dimension Table i2b2 Transformation Target Metadata Mapping Talend Open Studio Oracle Database i2b2 Wizard Modelling Staging Standard Metadata Source Metadata Source Fact Data Oracle Database PIDgen Extraction Individual extractors for each terminology CSV OWL... Configurable extractors for each type of datasource ODM SQL CSV Talend Open Studio Java Datasources ICD LOINC... Standard Terminologies CDMS EHR 21 Operative Systems Source Systems Evaluation: Use-case CDW 1. European i2b2 Academic User Meeting - Results from the German Integrated Data Repository Toolkit Project Page 10

i2b2wizard Installation & Administration Semi-graphical program to install and configure i2b2 Full i2b2 installation support Automatic download and extraction of required software packages Automatic installation of Linux software packages Automatic configuration and compilation of the i2b2 source code Automatic handling of all database work (schema creation, etc.) Full support for i2b2 administration Create and remove i2b2 projects and users Load and remove the Boston Demodata Change Hive ID and network interface and database configuration Change various passwords (e. g. from I2B2HIVE and I2B2PM) Recursive resolving of dependencies for setup targets / administration tasks SHRINE installation (currently not up-to-date) 1. European i2b2 Academic User Meeting - Results from the German Integrated Data Repository Toolkit Project Page 11

i2b2wizard Installation & Administration Achieved enhancements in the IDRT project Always updated to latest i2b2 versions JBoss log monitoring: automatic detection of installation errors Automatic JDBC tests to ensure correct database settings Custom schema names for I2B2PM and I2B2HIVE Many minor changes and improvements Current development: modular Version 2.0 (current beta is 1.9.4) Will support different i2b2 versions Will support different operating systems (Linux& Windows/Cygwin) Will support different database systems (Oracle& Microsoft SQLServer) Great backward-compatibility Increase community involvement 1. European i2b2 Academic User Meeting - Results from the German Integrated Data Repository Toolkit Project Page 12

i2b2wizard: Architecture Installation & Administration i2b2wizard-1.9.4/ wizard.sh scripts/ create DB schemas, register users i2b2 Wizard start script Oracle/ SQLServer start.sh stop.sh config/ packages/ Script to manually start JBoss server Script to manually stop JBoss server Stores all setting (projects, users), *-ds.xml files, Downloaded i2b2 source code, ZIP and RPM/DPKG files configures /i2b2src/ Uncompressed & configured i2b2 source code build& deploy /opt/jboss-4.2.2.ga/ JBoss directory (running i2b2 instance) 1. European i2b2 Academic User Meeting - Results from the German Integrated Data Repository Toolkit Project Page 13

i2b2wizard: Architecture Installation & Administration i2b2wizard-1.9.4/ wizard.sh scripts/ start.sh stop.sh config/ packages/ configures create DB schemas, register users init.sh Oracle/ menu_system.sh i2b2 Wizard start script SQLServer shrine.sh wizard_features.sh Script to manually start JBoss server Script to manually database_types/ stop JBoss server MSSQL.sh Stores all setting (projects, users), *-ds.xml files, ORACLE.sh Downloaded i2b2 source code, ZIP and RPM files i2b2_versions/ i2b2_1.6.08.sh /i2b2src/ Uncompressed & configured i2b2source code os_versions/ build& deploy Ubuntu_12.04.2_LTS.sh /opt/jboss-4.2.2.ga/ JBoss directory WindowsCygwin.sh (i2b2 installation) 1. European i2b2 Academic User Meeting - Results from the German Integrated Data Repository Toolkit Project Page 14

21 Importer German 21 KHEntG data set used by InEKGmbH to create next DRGs and payment rates obligatory and therefore used by all German hospitals contains ICD10, OPS (procedures), DRGs, encounter and special hospital data => set of simple CSV files IDRT 21 Importer: one mouse click importer reduces difficulties in getting started with i2b2 and IDRT makes use of IDRT CSV importer and terminology mapping Extraction Individual extractors for each terminology CSV OWL... Configurable extractors for each type of datasource ODM SQL CSV Datasources ICD LOINC... Standard Terminologies CDMS EHR 21 Operative Systems 1. European i2b2 Academic User Meeting - Results from the German Integrated Data Repository Toolkit Project Page 15

Use-case CDW Idea: Show that it is possible (and efficient) to combine the simple importers (e.g. 21 importer) with other data imports. Prototypical setup in Erlangen: 1. Import 21 (IDRT 21 importer), lab values and medications (IDRT SQL Importer) 2. Load the corresponding terminologies (by using the IDRT jobs) 3. Run a transformation step to enrich imported data with the terminologies Transformation 1. European i2b2 Academic User Meeting - Results from the German Integrated Data Repository Toolkit Project Page 16

Discussion& Outlook IDRT adds tremendous value to the standard i2b2 package IDRT delivers an easy-to-use package to install i2b2 and to populate its database with common German data sets Flexible approach: power users can use the Talendjobs (encapsulated into the IDRT tool) directly in Talend Take just what you need: i2b2 Wizard and ETL tools can be used independently The i2b2 software is evolving: IDRT2 Add support for i2b2 ontology modifiers (required to group complex data elements such as laboratory values) IDRT editor to customize i2b2 ontologies Interface to the German MDR project Update SHRINE support IDRT2 1. European i2b2 Academic User Meeting - Results from the German Integrated Data Repository Toolkit Project Page 17

Thanks for your attention! For more information about the IRDT project and downloads please visit: http://idrt.imise.uni-leipzig.de 1. European i2b2 Academic User Meeting - Results from the German Integrated Data Repository Toolkit Project Page 18