Genomics and Health Data Standards: Lessons from the Past and Present for a Genome-enabled Future



Similar documents
Electronic Medical Records and Genomics: Possibilities, Realities, Ethical Issues to Consider

Integration of genomic data into electronic health records

Major US Genomic Medicine Programs: NHGRI s Electronic Medical Records and Genomics (emerge) Network

Outcome Data, Links to Electronic Medical Records. Dan Roden Vanderbilt University

Of Codes, Genomes, and Electronic Health Records: It s Only Sensitive If It Hurts When You Touch It

Anonymization of Administrative Billing Codes with Repeated Diagnoses Through Censoring

Standardized Representation for Electronic Health Record-Driven Phenotypes

Workshop on Establishing a Central Resource of Data from Genome Sequencing Projects

EHRs and large scale comparative effectiveness research

Bench to Bedside Clinical Decision Support:

SAP Healthcare Analytics Solutions Provide physicians and researchers access to patient data from various systems in realtime

NIH s Genomic Data Sharing Policy

Technical Issues in Aggregating and Analyzing Data from Heterogeneous EHR Systems

Potentials for EHR Phenotyping in SJS/TEN

Electronic Phenotyping and Genomic Research

The Use of Patient Records (EHR) for Research

Voluntary Genomic Data Submissions at the U.S. FDA

Genomic Testing: Actionability, Validation, and Standard of Lab Reports

Healthcare Professional. Driving to the Future 11 March 7, 2011

Integration of Genetic and Familial Data into. Electronic Medical Records and Healthcare Processes

An EVIDENCE-ENHANCED HEALTHCARE ECOSYSTEM for Cancer: I/T perspectives

A Tech Entrepreneur Meets Academic Pathology

Overview. Overarching observations

Incorporating Research Into Sight (IRIS) Essentia Rural Health Institute Marshfield Clinic Penn State University

#Aim2Innovate. Share session insights and questions socially. Genomics, Precision Medicine and the EHR 6/17/2015

Enhancing Functionality of EHRs for Genomic Research, Including E- Phenotying, Integrating Genomic Data, Transportable CDS, Privacy Threats

Medical Informatics An Overview Saudi Board For Community Medicine

Big Data in Biobanking towards targeted medicine

TRANSFoRm: Vision of a learning healthcare system

Executive Summary Think Research. Using Electronic Medical Records to Bridge Patient Care and Research

MED 2400 MEDICAL INFORMATICS FUNDAMENTALS

ICD-10-CM The Case for Moving Forward

Big Data for Population Health

Genomic Medicine Centers Meeting VII. Summary of Themes Discussed in Keynote and Panel Discussions October 3, 2014

GENETIC DATA ANALYSIS

Using Big Data to Advance Healthcare Gregory J. Moore MD, PhD February 4, 2014

INTERNATIONAL CONFERENCE ON HARMONISATION OF TECHNICAL REQUIREMENTS FOR REGISTRATION OF PHARMACEUTICALS FOR HUMAN USE E15

The Future of the Electronic Health Record. Gerry Higgins, Ph.D., Johns Hopkins

How Can Institutions Foster OMICS Research While Protecting Patients?

A Comparison of Hemorrhagic and Ischemic Strokes among Blacks and Whites:

Electronic Medical Records for Genetic Research: Results of the emerge Consortium

Personal Genomes and Ethical Issues How to Analyze Your Own Genome

Implementing CDSMP in an integrated health care system

Using EHRs to extract information, query clinicians, and insert reports

Genomics and the EHR. Mark Hoffman, Ph.D. Vice President Research Solutions Cerner Corporation

Integrating Bioinformatics, Medical Sciences and Drug Discovery

HL7 Clinical Genomics and Structured Documents Work Groups

Vision for the Cohort and the Precision Medicine Initiative Francis S. Collins, M.D., Ph.D. Director, National Institutes of Health Precision

Course Syllabus. Upon completion of all six modules, participants will have:

Translational & Interoperable Health Infostructure - The Servant of Three Masters. Abridged Presentation

European Genome-phenome Archive database of human data consented for use in biomedical research at the European Bioinformatics Institute

Electronic Health Record (EHR) Standards Survey

A Multi-locus Genetic Risk Score for Abdominal Aortic Aneurysm

Implementing Proactive Pharmacogenetic Testing as a Standard of Care

Sharing Data from Large-scale Biological Research Projects: A System of Tripartite Responsibility

Practical Development and Implementation of EHR Phenotypes. NIH Collaboratory Grand Rounds Friday, November 15, 2013

A National Clinical Decision Support Infrastructure to Enable Genetically-Guided Population Health Management at Scale

Secondary Use of EMR Data View from SHARPn AMIA Health Policy, 12 Dec 2012

Theodoros. N. Arvanitis, RT, DPhil, CEng, MIET, MIEEE, AMIA, FRSM

Environmental Health Science. Brian S. Schwartz, MD, MS

TRANSLATIONAL BIOINFORMATICS 101

School of Nursing. Presented by Yvette Conley, PhD

The National Institute of Genomic Medicine (INMEGEN) was

Prognosis for Healthcare: The Future of Medicine

EHR Data Preservation

Medical Informatics: A Fishing Story

A Primer of Genome Science THIRD

Continuity of Care Record (CCR) The Concept Paper of the CCR

Reasoning / Medical Context / Physicians Area of Competence

Genetic Alliance BioBank: A Virtual Tour of Registry Solutions to Accelerate Research February 22, 2010

Issues with Tissues. Bertha delanda Celia Molvin/Kevin Murphy Research Compliance Office Stanford University

Electronic Health Records - An Overview - Martin C. Were, MD MS March 24, 2010

Integrated Biomedical and Clinical Research Informatics for Translational Medicine and Therapeutics

Draft Pan-Canadian Primary Health Care Electronic Medical Record Content Standard, Version 2.0 (PHC EMR CS) Frequently Asked Questions

Leading Genomics. Diagnostic. Discove. Collab. harma. Shanghai Cambridge, MA Reykjavik

Human Research Protection Program University of California, San Diego ISSUES ON DNA AND INFORMED CONSENT

Understanding Diagnosis Assignment from Billing Systems Relative to Electronic Health Records for Clinical Research Cohort Identification

A New Paradigm for Large, Simple Trials Based on Electronic Health Records

Predictive Analytics Drives Population Health Management

ITT Advanced Medical Technologies - A Programmer's Overview

escience and Post-Genome Biomedical Research

Disclosure slide. Objectives. The Meaningful Use of EMR. The Meaningful Use of EMR. The Meaningful Use of EMR. Peter Cherouny, M.D.

Worldwide Collaborations in Molecular Profiling

How To Change Medicine

2011 CMIO and Physician IT Survey Report

How To Use Centricity Emr

How To Transition From Icd 9 To Icd 10

SMART-on-FHIR Genomics: Enabling Precision Medicine by Bridging Clinical and Genomic Information

LifeLines Cohort Study. Salome Scholtens, Manager Research Office

Building open source safety-critical medical device platforms and Meaningful Use EHR gateways

Presenting data: how to convey information most effectively Centre of Research Excellence in Patient Safety 20 Feb 2015

Maher Khelifi. A thesis submitted in partial fulfillment of the requirements for the degree of. Master of Science. University of Washington

Introduction to Information and Computer Science: Information Systems

Factors for success in big data science

Records and Clinical Trials

VISUAL INTEGRATION OF RESULTS FROM A LARGE DNA BIOBANK (BIOVU) USING SYNTHESIS-VIEW *

Medical Informatics in Healthcare Organizations: A Survey of Healthcare Information Managers

Health Care Delivery and Information Technology (HIT) Subcommittee : DRAFT Recommendations

How To Use An Electronic Medical Record

Appendix 6.2 Data Source Described in Detail Hospital Data Sets

Transcription:

Genomics and Health Data Standards: Lessons from the Past and Present for a Genome-enabled Future Daniel Masys, MD Professor and Chair Department of Biomedical Informatics Professor of Medicine Vanderbilt University School of Medicine Principal Investigator electronic Medical Records and Genomics (emerge) Coordination Center

Topics The emerge consortium: on the frontier of genomes and phenotypes derived from clinical sources Lessons for future standards learned from the last 40 years of biomedical informatics

RFA HG-07-005: Genome-Wide Studies in Biorepositories with Electronic Medical Record Data 2007 NIH Request for Applications from the National Human Genome Research Institute The purpose of this funding opportunity is to provide support for investigative groups affiliated with existing biorepositories to develop necessary methods and procedures for, and then to perform, if feasible, genome-wide studies in participants with phenotypes and environmental exposures derived from electronic medical records, with the aim of widespread sharing of the resulting individual genotype-phenotype data to accelerate the discovery of genes related to complex diseases.

emerge: an NHGRI-funded consortium for Biobanks linked to EMR data Consortium members Group Health of Puget Sound Marshfield Clinic Mayo Clinic Northwestern University Vanderbilt University Condition of NIH funding: contribute genomic and EMR-derived phenotype data to dbgap database at NCBI

Dementia Cataracts Type II diabetes Peripheral vascular disease Coordinating center QRS duration

Each site includes DNA linked to electronic medical records Each project includes community engagement, genome science, natural language processing capability for EMR data Research plans include identifying a phenotype of interest in 3,000 subjects and conduct of a genomewide association study at each center: Σ=18,000 Supplemental funding provided for cross-network phenotypes

Supplement phenotypes using genotyped samples from primary phenotypes* *The benefit of data from routine clinical testing results recorded in EHR RBC/WBC Diabetic Retinopathy Lipid Levels & Height GFR GHC 3,579 230 3,114 1,713 Marshfield 3,865 213 3,693 3,929 Mayo 3,346 806 3,175 3,340 NU 2,484 139 2,816 1,485 VU 2,650 1,449 1,631 2,679

Informatics Issues in emerge Determination of comparability of patient populations across institutions Data exchange standards for phenotype data Representation of change over time (repeated measures) and clinical uncertainty for EMR-derived observations (definite probable possible for assertion and negation) Re-identification potential of clinical data and associated samples: maximizing scientific value while complying with federal privacy protection policies

Lessons learned from emerge Structured data alone (ICD9 codes, labs, meds) is not enough to accurately define clinical phenotypes in EMR data. Natural language processing of clinician notes improves both sensitivity and specificity of phenotype selection logic Once selection logic has been validated at one site, it can be successfully shared as pseudocode and implemented at other sites with widely varying EMR systems architectures

The problem with ICD codes ICD code sets contain both false negatives and false positives False negatives: Outpatient billing limited to 4 diagnoses/visit Outpatient billing done by physicians (e.g., takes too long to find the unknown ICD9) Inpatient billing done by professional coders: omit codes that don t pay well can only code problems actually explicitly mentioned in documentation False positives Diagnoses evolve over time -- physicians may initially bill for suspected diagnoses that later are determined to be incorrect Billing the wrong code (perhaps it is easier to find for a busier clinician) Physicians may bill for a different condition if it pays for a given treatment Example: Anti-TNF biologics (e.g., infliximab) originally not covered for psoriatic arthritis, so rheumatologists would code the patient as having rheumatoid arthritis

Observations about clinical genomics Genomic data is the current poster child for complexity in healthcare No practitioner can absorb and remember more than a tiny fraction of the knowledge base of human variation Therefore, computerized clinical decision support is the only effective way to insert genomic variation-based guidance into clinical care

Need for Patient-Specific Decision Support Assistance Facts per Decision 1000 100 Proteomics and other effector molecules Functional Genetics: Gene expression profiles Decisions by clinical phenotype i.e., traditional health care 10 1990 2000 2010 2020 Structural Genetics: e.g. SNPs, haplotypes Human Cognitive Capacity

Observations about genomic data in general Predictably, some future uses of genomic data will be so creative that nobody could have predicted them. Raw data e.g., primary DNA sequence, specific SNP values, should be retained when adding clinical interpretations, not discarded in the synthesis and reporting of those interpretations

Lessons learned about standards The most successful and widely used standards are messaging and communications standards e.g., TCP/IP, http, smtp, ftp, HL7. Knowledge representation standards tend to be widely used only if driven by financial processes, government mandate (e.g., ICD-9), or sustained government investment (e.g., MeSH, SNOMED, UMLS Metathesaurus)

Lessons learned about standards, cont d Familiar and usable wins over theoretically correct but too complex e.g., HL7 v.2 vs. v.3 Lindberg s axiom: Systems that get used get better.

URL: www.gwas.net

Commander Willard Decker: V'ger... expects an answer. Captain James T. Kirk: An answer? I don't even know the question. Star Trek, the movie 1979