Integration of Biobanks into Clinical Health Care through Linkage to Medical Records The emerge experience



Similar documents
Integration of Biobanks into Clinical Healthcare through Linkage to Medical Records The emerge experience

Outcome Data, Links to Electronic Medical Records. Dan Roden Vanderbilt University

Major US Genomic Medicine Programs: NHGRI s Electronic Medical Records and Genomics (emerge) Network

Electronic Phenotyping and Genomic Research

Potentials for EHR Phenotyping in SJS/TEN

Technical Issues in Aggregating and Analyzing Data from Heterogeneous EHR Systems

Incorporating Research Into Sight (IRIS) Essentia Rural Health Institute Marshfield Clinic Penn State University

EHRs and large scale comparative effectiveness research

Electronic Medical Records and Genomics: Possibilities, Realities, Ethical Issues to Consider

How Can Institutions Foster OMICS Research While Protecting Patients?

Genomics and Health Data Standards: Lessons from the Past and Present for a Genome-enabled Future

Standardized Representation for Electronic Health Record-Driven Phenotypes

Genomic Testing: Actionability, Validation, and Standard of Lab Reports

How To Build A Personalized Genomic Medicine Program At Mission Health

The Contribution of large Healthcare Systems to Improving Treatment for Patients with Rare Diseases

Environmental Health Science. Brian S. Schwartz, MD, MS

EXPANDING THE EVIDENCE BASE IN OUTCOMES RESEARCH: USING LINKED ELECTRONIC MEDICAL RECORDS (EMR) AND CLAIMS DATA. ISPOR Workshop, May 22, 2013

Reduced Risk & Improved Efficacy through Integrated Pharmacogenetics

EMR Systems and the Conduct of Clinical Research. Daniel E Ford, MD, MPH Vice Dean for Clinical Investigation Johns Hopkins School of Medicine

Introduction to genetic testing and pharmacogenomics

The Future of the Electronic Health Record. Gerry Higgins, Ph.D., Johns Hopkins

Using Big Data to Advance Healthcare Gregory J. Moore MD, PhD February 4, 2014

The M.U.R.D.O.C.K. Study

Clinical Implementation of Psychiatric Pharmacogenomic Testing

Enhancing Functionality of EHRs for Genomic Research, Including E- Phenotying, Integrating Genomic Data, Transportable CDS, Privacy Threats

TRANSLATIONAL BIOINFORMATICS 101

Clinical Implementation of. Pharmacist-Managed Service. Kristine R. Crews, Pharm.D., BCPS St. Jude Children s Research Hospital

Research Skills for Non-Researchers: Using Electronic Health Data and Other Existing Data Resources

Leveraging Social Networks to Conduct Observational Research: A Paradigm Shift in Methodology. Presented by: Elisa Cascade, MediGuard/Quintiles

Integration of Genetic and Familial Data into. Electronic Medical Records and Healthcare Processes

Introduction to Information and Computer Science: Information Systems

Developing VA GDx: An Informatics Platform to Capture and Integrate Genetic Diagnostic Testing Data into the VA Electronic Medical Record

THE ROLE OF HEALTH INFORMATION TECHNOLOGY IN PATIENT-CENTERED CARE COLLABORATION Louisiana HIPAA & EHR Conference Presenter: Chris Williams

What is Big Data? And Why it Matters to You! Shannon McWeeney, PhD 13 th January 2016

If you are signing for a minor child, you refers to your child throughout the consent document.

John Sharp, MSSA, PMP Manager, Research Informatics Quantitative Health Sciences Cleveland Clinic, Cleveland, Ohio

SE Minnesota Beacon Enabling Population Health Research

Trends in Technology based Health Investments (A USA view)

Electronic Health Record Strategies for Pay for Performance. James O Connor MD Director of Clinical Informatics

Mayo Site Report. emerge phase II Steering Committee meeting. New York City, NY June 30, 2015

Implementing Proactive Pharmacogenetic Testing as a Standard of Care

Collect and label sample according to standard protocols. Gently invert tube 8-10 times immediately after draw. DO NOT SHAKE. Do not centrifuge.

Using EHRs for Heart Failure Therapy Recommendation Using Multidimensional Patient Similarity Analytics

Records and Clinical Trials

Clinical Data Services

2010 QARR QUICK REFERENCE GUIDE Adults

Improving Diabetes Care for All New Yorkers

Biobanks: DNA and Research

Criteria: CWQI HCS-123 (This criteria is consistent with CMS guidelines for External Infusion Insulin Pumps)

An update on family history.

Beacon User Stories Version 1.0

1. General Information About The Mitochondrial Disease Biobank

Smarter Research. Joseph M. Jasinski, Ph.D. Distinguished Engineer IBM Research

Clinical and research data integration: the i2b2 FSM experience

Regulatory Issues in Genetic Testing and Targeted Drug Development

Evaluation of Diagnostic and Screening Tests: Validity and Reliability. Sukon Kanchanaraksa, PhD Johns Hopkins University

David Liebovitz, MD CMIO, Northwestern Medical Faculty Foundation Carl Christensen CIO, Northwestern Medical Faculty Foundation

Genetic Alliance BioBank: A Virtual Tour of Registry Solutions to Accelerate Research February 22, 2010

What s Next for Data Sharing: Insight from the NIH Experience

Anonymization of Administrative Billing Codes with Repeated Diagnoses Through Censoring

REQUEST FOR IMAGe SYNDROME TESTING

A Comprehensive Review of the emerge Infobutton Project

Male New Patient Package

TYPE 2 DIABETES IN CHILDREN DIAGNOSIS AND THERAPY. Ines Guttmann- Bauman MD Clinical Associate Professor, Division of Pediatric Endocrinology, OHSU

Electronic Medical Record Etiquette For Alec

The Ethics of Research with Stored Samples and Data

Big Data for Population Health and Personalised Medicine through EMR Linkages

Extraction of Pharmacogenomics Traits from Electronic Health Records

UAB HEALTH SYSTEM AMBULATORY EHR IMPLEMENTATION

Secondary Uses of Data for Comparative Effectiveness Research

Manitoba EMR Data Extract Specifications

The PREDICT program: Implementing prospective pharmacogenetics for inpatient and outpatient clinical care

Carolina s Journey: Turning Big Data Into Better Care. Michael Dulin, MD, PhD

CFAR Network of Integrated Clinical Systems(CNICS): The Use of Real-Time, Patient-Centered, Clinical Metrics

Family History and Diabetes. Practical Genomics for the Public Health Professional

An Essential Ingredient for a Successful ACO: The Clinical Knowledge Exchange

Health Disparities in Multiple Myeloma. Kenneth R. Bridges, M.D. Senior Medical Director Onyx Pharmaceuticals, Inc.

Big Bad Data. FMSRE Results Presentation

Medicare Supplement plan application

Integration of genomic data into electronic health records

A Pilot Study of the Incidence of Exposure to Drugs for which Preemptive Pharmacogenomic Testing Is Available

Baskets of Care Diabetes Subcommittee

Chapter 13: Mining Electronic Health Records in the Genomics Era

Continuity of Care Record (CCR) The Concept Paper of the CCR

CDS Starter Kit: Diabetes f ollow-up care

How To Get A Medical Insurance Plan From A Doctor

Vision for the Cohort and the Precision Medicine Initiative Francis S. Collins, M.D., Ph.D. Director, National Institutes of Health Precision

Distributed Networking

IMPLEMENTING BIG DATA IN TODAY S HEALTH CARE PRAXIS: A CONUNDRUM TO PATIENTS, CAREGIVERS AND OTHER STAKEHOLDERS - WHAT IS THE VALUE AND WHO PAYS

NCI Community Cancer Centers Program Program Overview Ascension Health St. Vincent Indianapolis Hospital

1960 Ogden St. Suite 120, Denver, CO 80218,

A Comparison of Hemorrhagic and Ischemic Strokes among Blacks and Whites:

Expanding Access through Pharmacy-Based Point-of-Care Testing

NIH s Genomic Data Sharing Policy

A leader in the development and application of information technology to prevent and treat disease.

Use of Electronic Health Records in Clinical Research: Core Research Data Element Exchange Detailed Use Case April 23 rd, 2009

Preconception Clinical Care for Women Medical Conditions

Center for. Clinical Informatics. Clinical Informatics. Mission Statement

THE VIRTUAL DATA WAREHOUSE (VDW) AND HOW TO USE IT

Aggregate data available; release of county or case-based data requires approval by the DHMH Institutional Review Board

Transcription:

Integration of Biobanks into Clinical Health Care through Linkage to Medical Records The emerge experience Rex L. Chisholm, PhD Adam and Richard T. Lind Professor of Medical Genetics Vice Dean for Scientific Affairs Feinberg School of Medicine Northwestern University

Personalized Medicine

Critical factors for Defining Genetic Contributions to Disease: A key to Personalized Medicine Methods to measure genetic variation in an individual human genome Large numbers of well phenotyped human genomes Development of standards of care / best practices Methods to deploy genomics based decision support

Northwestern s Biobank: NUgene Launched in fall 2002 IRB approved Voluntary participation via informed consent Collect biological specimens under a broad consent One-time questionnaire at enrollment Longitudinal medical information captured from EMR Secure application and database Enables High-throughput phenotyping Recontact option for additional research Resource facilitating genetic research institution-wide Genotype data returned to NUgene

Informed Consent Document Allows: Blood sample to be drawn and DNA stored for length of study Access to participants EHR data Questionnaire information to be obtained Future use of DNA sample for genetic variation research Access to de-identified DNA and medical information by third parties with approved research studies, including companies Recontact (optional) Most EHR not appropriately consented (special protections for mental health and?! genetics)

Privacy Protections Patient identifiers replaced with a computer generated barcode Single link between the DNA sample and medical information remains in a secure database, called NOTIS Physicians are not informed of patient s participation or results Certificate of Confidentiality Issued by the NIH to protect participant s information involved in certain research projects from subpoena and other legal actions

Data Flow Participant Enrollment Materials Medical Record Coding & Data Parsing Process MRNs SSNs Patient Identifiers Coded Data NOTIS Encryption Decryption NUgene Database Phenotypic Engine Enterprise Data Warehouse

NUgene today Over 10,600 participants are enrolled Male: 42% Female: 58% Median age: 51 Age distribution: 18-85+ Ethnic breakdown similar to census data for 6-county area Over half enrolled through &/or seen at primary care clinics Average participant has 31 distinct diagnoses (ICD9 non V/E codes), & an average of 16 distinct diagnoses assigned at least twice** Average patient followed over 8 years, some patients >20 years Overall participation rate is 28% Uptake rate is ~52% if physician mentions the study 92% of participants agree to be contacted for future research or additional health information **assigned the same ICD-9 code on 2 or more dates

Data Sources Questionnaire (self-report): Completed once, at time of enrollment: Demographic information Environmental exposures Medications Self-reported family and medical history for select conditions Electronic billing record data Retrospective and prospective diagnosis (ICD9) & procedure (CPT/ICD9CM) codes Electronic medical record data Retrospective and prospective: Medical history and diagnoses Lab tests and results Medications and therapies Family and social history Free text physician notes*

EHRs at the Northwestern Medical Center Operates a state of the art EHR system Well integrated systems Mix of commercially and internally developed systems Electronic data capture for virtually all aspects of inpatient and outpatient care, including Over 15 years of clinical data Anthropometric and clinical measures, prescriptions, diagnoses, lab measures, and clinical notes Using high quality, widely used commercial systems Cerner PowerChart inpatient records EpicCare Ambulatory EHR outpatient records PRIMES inpatient registration and billing records IDX outpatient registration and billing records

ICD9 ICD9 Description # Pts** % Pts** 401Essential hypertension 3310 34% 272Disorders of lipoid metabolism 3269 34% Participant Counts Per Selected Top 3-Digit ICD9s* 530Diseases of esophagus 2259 23% 427Cardiac dysrhythmias 1825 19% 451Phlebitis and thrombophlebitis 1688 17% 278Overweight, obesity and other hyperalimentation 1566 16% 715Osteoarthrosis and allied disorders 1411 15% 477Allergic rhinitis 1408 14% 216Benign neoplasm of skin 1380 14% 250Diabetes mellitus 1367 14% 311Depressive disorder, not elsewhere classified 1261 13% 692Contact dermatitis and other eczema 1238 13% 244Acquired hypothyroidism 1102 11% *From billing, encounter, problem list, med Hx. **Participants w/ EMR data as of 9/24/2010. 493Asthma 1043 11% 366Cataract 1010 10%

Top Laboratory Tests within Population Laboratory Tests # Participants* % Participants* # of Tests Glucose 8,509 88 265,963 Creatinine 8,422 87 124,848 Calcium 8,392 86 258,308 Potassium 8,387 86 265,022 Sodium 8,384 86 262,436 Chloride 8,384 86 261,752 Mean Corpuscular Volume (MCV) 8,364 86 247,740 Red Cell Distribution Width (RDW) 8,364 86 247,733 Mean Corp. Hemoglobin Conc. (MCHC) 8,364 86 247,740 Mean Corpuscular Hemoglobin (MCH) 8,364 86 247,740 Blood Urea Nitrogen 8,366 86 245,748 Bicarbonate 8,332 86 243,499 Hemoglobin 8,320 86 227,208 Hematocrit 8,320 86 227,201 Red Cell Count 8,296 85 224,380 * Based on participants with EMR data as of 9/24/2010.

An NHGRI funded consortium Coordinating Center

emerge I Goals Test the ability to leverage EMRs and biobanks for genomic research Evaluate validity & utility of EMR phenotypes for Genomics Develop & validate electronic phenotyping algorithms Conduct association studies of genome wide data with EMR derived phenotypes

Type II Diabetes Case Algorithm * Abnormal lab= Random glucose > 200mg/dl, Fasting glucose > 125 mg/dl, or hemoglobin A1c 6.5%.

Type II Diabetes Control Algorithm

Type II Diabetes Chart Review Blinded clinician review of 100 random charts (50 cases & 50 controls) Case PPV = 98% Control PPV = 98%

Mega-Analysis (adjusted) TCF7L2

Phase I Phenotypes

Merged Genotype Dataset 17,046 emerge samples with GWAS data Majority of samples genotyped using 660W Samples collected for various studies GH Dementia Marshfield Cataracts and HDL-C Mayo PAD NW T2D VU Normal ECGs Can we use existing dataset for another experiment?

Primary Hypothyroidism Most common form is chronic lymphocytic hypothyroidism (Hashimoto s thyroiditis) More common in females (~10x) Other associated factors age race/ethnicity family history of thyroid disease No published* GWAS (as of 07/23/2011)

No thyroid-altering medications (e.g., Phenytoin, Lithium) 2+ non-acute visits ICD 9s for Hypothyroidism Abnormal TSH/FT4 No ICD-9s for Hypothyroidism Normal TSH Thyroid replacement medication No secondary causes (e.g., pregnancy, ablation) No thyroid replace. meds No hx of myasthenia gravis Case Control

Phenotype Algorithm Validation Site EMR-based Cases/Controls Chart Review Cases/Controls Case PPV (%) Control PPV (%) Group Health 397/1,160 50/50 98 100 Marshfield 514/1,187 50/50 91 100 Mayo Clinic 233/1,884 100/100 82 96 Northwestern 92/470 50/50 98 100 Vanderbilt 81/352 50/50 98 100 All sites (weighted) 1,317/5,053 92.4 98.5 Am J Hum Genet. 2011 Oct 7;89(4):529-42

Unmatched Analysis 1,317 cases 5,053 controls Am J Hum Genet. 2011 Oct 7;89(4):529-42

Unmatched Analysis Am J Hum Genet. 2011 Oct 7;89(4):529-42

FOXE1 Forkhead box E1 (thyroid transcription factor 2) ~3.46kb intronless gene Thyroid transcription factor which likely plays a crucial role in thyroid morphogenesis Mutations associated with congenital hypothyroidism and cleft palate with thyroid dysgenesis The map localization of this gene suggests it may also be a candidate gene for squamous cell epithelioma and hereditary sensory neuropathy type I From EntrezGene

PheWAS for rs965513 (FOXE1) Analysis of 866 phenotypes in 13,617 European Americans Adjusted for age and sex

Hypothyroidism Conclusions No genotyping experiment yields new genetic associations Associations overlap with thyroid cancer GWAS Associations replicate in external dataset FOXE1 likely candidate, may be associated with other thyroid diseases

emerge II Goals EHR based Phenotyping Establish new Genotype Phenotype associations Clinical Use: Defining actionability/clinical utility/validity Integration into EHR/Visualization/Clinical Decision support Physician and Patient attitudes/education Consent/Regulatory Privacy/Security/CLIA/CAP

emerge II phenotyping: Lower GI Phenotypes Colon Polyps Diverticulosis *National Polyp Study Work Group 32

Colon Polyp NLP Algorithm 1. ±Colon Polyps: checks EHR for colonoscopy with linked path report w. polyp mention 2. Type + Location: NLP on path reports to extract type and location Sensitivity PPV Specificity NPV ±Colon Polyps 98% 94% 94% 98% Type + Location 96% 98% NA NA

Colon Polyp Findings By Location

Preliminary GWAS GRM8 ARHGAP15 ARHGAP15 PRMT8 GINS2 660 Diverticulosis Diverticulitis M

Pharmacogenomics Deploy VIP platform developed by PGRN investigators Apply to participants enriched for encountering drugs for which there is a CPIC guideline Return appropriate genotype results through appropriate decision support tools Archive novel variants for further study

Develop list of actionable variants (emerge, CPIC, ) Identify target patients Resequence VIP genes; Identify actionable variants Aim 1 Aim 2 Actionable variants Aim 3 EMR deposit Result display Decision support Outcomes Performance metrics Healthcare impact Create repository of variants of unknown significance Initiate studies of function and of genotype phenotype relationships

PGx candidate drug gene pairs Gene Drug Comment CYP2C19 clopidogrel Best evidence in patients with coronary stents CYP2C9 VKORC1 CYP4F2 HLA B*5701 Warfarin Abacavir Algorithms to predict starting dose available. Vary by ancestry HLA B*1502 carbamazepine Higher frequency variant in Asian subjects SLCO1B1 Simvastatin Especially at higher dosages or with interacting drugs TPMT 6 MP, azathioprine CYP2D6 Codeine PM status predicts non response CYP2D6 some SSRIs PM status predicts intolerance of effective dosages.

Results from CLIA certified lab Omics Ancillary EHR Omic Repository Complex Data Textual Reports Reports Analytics Human Interpretation Simple HL7 Data Actionable Attributes Lab Results External CDS EHR CDS Actionable Results Patient Information Human Interpretation

Omics Ancillary Omic Repository BestPractice Advisories Results from CLIA certified lab Complex Data EHR Reports Risk for Hemochromatosis Patient at increased risk of developing hemochromatosis based on genetic test results. Current guidelines Human recommend assessing for signs and symptoms of hemochromatosis and measuring ferritin Interpretation and transferritin. Lab Results Acknowledge Reason: Discussed results w/ patient Open SmartSet: Hemochromatosis screening Laboratory Results Hemochromatosis Genetic Risk Fact Sheet EHR CDS Actionable Results Patient Information Accept Cancel Human Interpretation

Omics Ancillary Omic Repository Results from CLIA certified lab Complex Data EHR Reports Analytics Human Interpretation Lab Results Actionable Attributes EHR CDS Actionable Results Patient Information Human Interpretation

PheKB.org a knowledgebase for discovering phenotypes Public Side of the Website Final phenotypes algorithms and implementation results Private Side of the Website Ability to create phenotype workspaces and collaborate on development, testing, and validation on algorithms Version history Access controls Only viewable by authors Shared with a collaborative groups(s) Public ( Final algorithms only) Comment on Phenotypes shared with you Receive alerts on Phenotypes you are following

Summary Biobanks and EHRs are increasingly playing a critical role in identifying associations between genetic variation, disease risk, drug efficacy and clinical outcomes Longitudinal mining of electronic medical records can be used to provide the most up to date phenotype associated with human biospecimens Research use can be an important driver of EHR quality Networks of EHR-linked biobanks that share samples and data have the potential to increase statistical power to detect genetic associations, population diversity in these studies, and overall research efficiency Methods to store genomic variation in EHR will enable personalized medicine

Acknowledgements NU emerge team: Rex Chisholm Phil Greenland Abel Kho Bill Lowe Wendy Wolf Maureen Smith Geoff Hayes Loren Armstrong Doug Scheftner Laura Rasmussen-Torvik Pedro Avila Amy Lemke May Law Jen A. Pacheco Joel Humowiecki Will Thompson Arun Muthalagu Krishna Dornadula Funding for emerge studies is provided by NHGRI (U01HG004609) NUgene Governance Committee & Community Advisory Committee NUgene team: Rex Chisholm, PhD (PI) Warren Kibbe, PhD (Co-founder) William Lowe, MD (Medical Dir.) Wendy Wolf, PhD Maureen Smith, MS, CGC Jennifer A. Pacheco Tony Miqueli Sharon Aufox, MS, CGC Oana Popescu Nicole Sheehan Noah Goss Maribeth Miceli More information about NUgene: http://www.nugene.org