Preparing Electronic Health Records for Multi-Site CER Studies



Similar documents
AHRQ S Perspective On Clinical Utility. Gurvaneet Randhawa, MD, MPH Medical Officer, Senior Advisor on Clinical Genomics and Personalized Medicine

Interoperability and Analytics February 29, 2016

Research Into Care: Identifying Barriers and Gaps in Care. AAFP National Research Network Robert Graham Center Wilson D. Pace, MD

Achilles a platform for exploring and visualizing clinical data summary statistics

From Fishing to Attracting Chicks

Big Data Use Case. How Rackspace is using Private Cloud for Big Data. Bryan Thompson. May 8th, 2013

patient-centered SCAlable National Network for Effectiveness Research

Utility of Common Data Models for EHR and Registry Data Integration: Use in Automated Surveillance

Public Health and the Learning Health Care System Lessons from Two Distributed Networks for Public Health

Enterprise Healthcare BI

Environmental Health Science. Brian S. Schwartz, MD, MS

Guidelines for Patient-Centered Medical Home (PCMH) Recognition and Accreditation Programs. February 2011

A Commercial Approach to De-Identification Dan Wasserstrom, Founder and Chairman De-ID Data Corp, LLC

REQUIREMENTS GUIDE: How to Qualify for EHR Stimulus Funds under ARRA

UPDATE ON THE CLINICAL AND TRANSLATIONAL SCIENCE AWARD (CTSA) PROGRAM

Practical Implementation of a Bridge between Legacy EHR System and a Clinical Research Environment

Orchestrated IT in Action.

March 12, Attention: HIT Policy Committee Meaningful Use Comments. CMS-0033-P, Proposed Rules, Electronic Health Record (EHR) Incentive Program

Toward Meaningful Use of HIT

A Grid Architecture for Manufacturing Database System

Design and Implementation of an Automated Geocoding Infrastructure for the Duke Medicine Enterprise Data Warehouse

Secondary Uses of Data for Comparative Effectiveness Research

The Do s and Don ts of Medical Device integration

Understanding Diagnosis Assignment from Billing Systems Relative to Electronic Health Records for Clinical Research Cohort Identification

Core services and the path to the future of the ILHIE

HealthFusion MediTouch EHR Stage 2 Proposed Rule Comments

Version: January 2008 ASTM E-31: EHR and Informatics Standards Education For Health Professional Disciplines. Background

Service Oriented Architecture and the DBA Kathy Komer Aetna Inc. New England DB2 Users Group. Tuesday June 12 1:00-2:15

TennCare/Medicaid EHR Provider Incentive Program

Planning for Health Information Technology and Exchange in Public Health

Big Data and the Data Lake. February 2015

A New Paradigm for Large, Simple Trials Based on Electronic Health Records

Adventures in EHR Computable Phenotypes: Lessons Learned from the Southeastern Diabetes Initiative (SEDI)

AN INTEGRATION APPROACH FOR THE STATISTICAL INFORMATION SYSTEM OF ISTAT USING SDMX STANDARDS

New York ehealth Collaborative. Health Information Exchange and Interoperability April 2012

The CTSA Program: Collaborative Innovation for Translational Research

Protect, Share, and Distribute Healthcare Data with ehealth Managed Services

Analance Data Integration Technical Whitepaper

Setting the World on FHIR

Data Analytics in Health Care

Welcome to the Meaningful Use and Data Analytics PowerPoint presentation in the Data Analytics Toolkit. In this presentation, you will be introduced

Find the signal in the noise

EHR Data Reuse through openehr Archetypes

Research Data Extraction Service. Research and Woodruff Health Sciences IT

Building Regional and National Health Information Systems. Mike LaRocca

Improving Evidence-Based Primary Care for Chronic Kidney Disease. Walter L. Calmbach MD MPH South Texas Ambulatory Research Network (STARNet)

The course will run simultaneously with the MSCI students.

Database Marketing, Business Intelligence and Knowledge Discovery

State Innovation Models Initiative:

Going beyond Meaningful Use with EMR solutions from the Centricity portfolio

Research Opportunities using the PaTH Network

Host Site: Denver Public Health Department, Epidemiology, Informatics, and Preparedness

COMPARATIVE EFFECTIVENESS RESEARCH (CER) AND SOCIAL WORK: STRENGTHENING THE CONNECTION

Meaningful Use - The Basics

Digital Healthcare: Author. A HIPAA compliant cloud strategy. Choosing a Cloud Service Provider. Alex Ginzburg

CCO Incentive Metrics: Requirements for Reporting on EHR- Based Measures in 2015 GUIDANCE DOCUMENTATION

Analance Data Integration Technical Whitepaper

The Big Picture: IDNT in Electronic Records Glossary

Big data in primary care

FDA's Mini-Sentinel Program to Evaluate the Safety of Marketed Medical Products. Progress and Direction

GoodData. Platform Overview

An Easily Accessed Clinical Research Database from your Epic EMR

Transcription:

Preparing Electronic Health Records for Multi-Site CER Studies Michael G. Kahn 1,3,4, Lisa Schilling 2 1 Department of Pediatrics, University of Colorado, Denver 2 Department of Medicine, University of Colorado, Denver 3 Colorado Clinical and Translational Sciences Institute 4 Department of Clinical Informatics, Children s Hospital Colorado AcademyHealth Annual Research Meeting Building a Data Infrastructure for Multi-stakeholder Comparative Effectiveness Research 26 June 2012 Michael.Kahn@ucdenver.edu Funding provided by AHRQ 1R01HS019908 (Scalable Architecture for Federated Translational Inquiries Network)

Setting the context: AHRQ Distributed Research Networks AHRQ ARRA OS: Recovery Act 2009: Scalable Distributed Research Networks for Comparative Effectiveness Research (R01) Goal: enhance the capability and capacity of electronic health networks designed for distributed research to conduct prospective, comparative effectiveness research on outcomes of clinical interventions. Funding provided by AHRQ 1R01HS019908 (Scalable Architecture for Federated Translational Inquiries Network)

AHRQ Distributed Research Networks Funded Projects SAFTINet: Scalable Architecture for Federated Therapeutic Inquiries Network Lisa M. Schilling, University of Colorado Denver (R01 HS19908-01) SCANNER: Scalable National Network for Effectiveness Research Lucila Ohno-Machado, University of California San Diego (R01 HS19913-01) SPAN: Scalable PArtnering Network for CER: Across Lifespan, Conditions, and Settings John F. Steiner, Kaiser Foundation Research Institute (R01 HS19912-01) Funding provided by AHRQ 1R01HS019908 (Scalable Architecture for Federated Translational Inquiries Network)

SAFTINet Partners Clinical partners Colorado Community Managed Care Network and the Colorado Associated Community Health Information Enterprise Colorado Federally Qualified Health Centers Denver Health and Hospital Authority Cherokee Health Systems, Tennessee Technology partners University of Utah, Center for High Performance Computing QED Clinical, Inc., d/b/a CINA Medicaid partners Colorado Health Care Policy & Financing Utah Department of Public Health (partnership in development) TennCare and Tennessee managed care organizations (partnership in development) Leadership University of Colorado Denver American Academy of Family Physicians, National Research Network

Key Differences between EHR and CER data EHR Data CER Data EHR->CER task Fully identified LDS or de-identified Strip identifiers; keep mappings? Local codes and values Standardized codes and values Terminology and value set mapping (manual!) Broad data domains Focused data domains Filtering by patient, encounter, date, facility Variable data quality; high level of missingness Substantial data quality processes applied Data profiling; iterative investigations Lots of free text Fully coded data only NLP or ignore free text Local access only Shared access Distributed or centralized data access Single data source Multiple data sources Record linkage

Key Differences between EHR and CER data EHR Data CER Data EHR->CER task Fully identified LDS or de-identified Strip identifiers; keep mappings? Local codes and values Standardized codes and values Terminology and value set mapping (manual!) Broad data domains Focused data domains Filtering by patient, encounter, date, facility Variable data quality; high level of missingness Substantial data quality processes applied Data profiling; iterative investigations Lots of free text Fully coded data only NLP or ignore free text Local access only Shared access Distributed or centralized data access Single data source Multiple data sources Record linkage

Key Differences between EHR and CER data EHR Data CER Data EHR->CER task Fully identified LDS or de-identified Strip identifiers; keep mappings? Local codes and values Standardized codes and values Terminology and value set mapping (manual!) Broad data domains Focused data domains Filtering by patient, encounter, date, facility Variable data quality; high level of missingness Substantial data quality processes applied Data profiling; iterative investigations Lots of free text Fully coded data only NLP or ignore free text Local access only Shared access Distributed or centralized data access Single data source Multiple data sources Record linkage

Key Differences between EHR and CER data EHR Data CER Data EHR->CER task Fully identified LDS or de-identified Strip identifiers; keep mappings? Local codes and values Standardized codes and values Terminology and value set mapping (manual!) Broad data domains Focused data domains Filtering by patient, encounter, date, facility Variable data quality; high level of missingness Substantial data quality processes applied Data profiling; iterative investigations Lots of free text Fully coded data only NLP or ignore free text Local access only Shared access Distributed or centralized data access Single data source Multiple data sources Record linkage

Key Differences between EHR and CER data EHR Data CER Data EHR->CER task Fully identified LDS or de-identified Strip identifiers; keep mappings? Local codes and values Standardized codes and values Terminology and value set mapping (manual!) Broad data domains Focused data domains Filtering by patient, encounter, date, facility Variable data quality; high level of missingness Substantial data quality processes applied Data profiling; iterative investigations Lots of free text Fully coded data only NLP or ignore free text Local access only Shared access Distributed or centralized data access Single data source Multiple data sources Record linkage

Key Differences between EHR and CER data EHR Data CER Data EHR->CER task Fully identified LDS or de-identified Strip identifiers; keep mappings? Local codes and values Standardized codes and values Terminology and value set mapping (manual!) Broad data domains Focused data domains Filtering by patient, encounter, date, facility Variable data quality; high level of missingness Substantial data quality processes applied Data profiling; iterative investigations Lots of free text Fully coded data only NLP or ignore free text Local access only Shared access Distributed or centralized data access Single data source Multiple data sources Record linkage

Key Differences between EHR and CER data EHR Data CER Data EHR->CER task Fully identified LDS or de-identified Strip identifiers; keep mappings? Local codes and values Standardized codes and values Terminology and value set mapping (manual!) Broad data domains Focused data domains Filtering by patient, encounter, date, facility Variable data quality; high level of missingness Substantial data quality processes applied Data profiling; iterative investigations Lots of free text Fully coded data only NLP or ignore free text Local access only Shared access Distributed or centralized data access Single data source Multiple data sources Record linkage

A common data model is critical! Other EHR Other EHR Other EHR CINA CDR Local Data Warehouse Existing Clinical Registries Limited Data Set Common Data Model Common Terminology Crossing the CER chasm!! Limited Data Set Common Data Model Common Terminology Limited Data Set Common Data Model Common Terminology CER Common Query Interface

ROSITA-GRID-PORTAL

Grid Portal

Why ROSITA? ROSITA: Reusable OMOP and SAFTINet Interface Adaptor ROSITA: The only bilingual Muppet Converts EHR data into research limited data set 1. Replaces local codes with standardized codes 2. Replaces direct identifiers with random identifiers 3. Supports clear-text and encrypted record linkage 4. Provides data quality metrics 5. Pushes data sets to grid node for distributed queries

ROSITA: transforming EHR data for comparative effectiveness research Client CDW Medicaid ETL XML ETL XMK ROSITA JDBC JDBC OMOP CDM V3 Grid Data Service SAFTINet Data Quality Data Service

SAFTINet ETL specifications

SAFTINet ETL Specifications

SAFTINet ETL Specifications

Transforming EHR Data: What does ROSITA do?

What does ROSITA do?

What does ROSITA do?

Why ROSITA? Converts EHR data into research limited data set 1. Replaces local codes with standardized codes 2. Replaces direct identifiers with random identifiers 3. Supports clear-text and encrypted record linkage 4. Provides data quality metrics 5. Pushes data sets to grid node for distributed queries

Do not have Medicaid figured out

ROSITA Security Discussion Framework

ROSITA: Current Status Software development underway In Phase 1: 16 week development clinical data only; no Medicaid Phase 2: Medicaid + record linkage OMOP data model V4 finalized! Clinical & financial extensions All SAFTINet partners have begun ETL activities Two sites have provided full ETL extracts for development and testing Everything is/will be available

Questions? Michael.Kahn@ucdenver.edu Funding provided by AHRQ 1R01HS019908 (Scalable Architecture for Federated Translational Inquiries Network)