Instrumenting the Healthcare Enterprise for Big Data Research. Shawn Murphy MD, Ph.D. Leicester, October
|
|
|
- Lorraine Morton
- 9 years ago
- Views:
Transcription
1 Instrumenting the Healthcare Enterprise for Big Data Research Shawn Murphy MD, Ph.D. Leicester, October
2 Informatics for Integrating Biology and the Bedside (i2b2) Software for explicitly integrating person-oriented data to a way that is optimized for clinical genomics research Allows integration of clinical data, trials data, and genotypic data A portable and extensible application framework Software is built in a modular pattern that allows additions without disturbing core parts Available as open source at
3 An i2b2 Environment (the Hive) is built from i2b2 Cells A B C Hive of software services provided by i2b2 cells remote local observation_fact Patient_Num Encounter_Num Concept_CD Observer_CD Start_Date Modifier_CD Instance_Num End_Date ValType_CD TVal_Char NVal_Num ValueFlag_CD Observation_Blob visit_dimension Encounter_Num Start_Date End_Date Active_Status_CD Location_CD* patient_dimension Patient_Num Birth_Date Death_Date Vital_Status_CD Age_Num* Gender_CD* Race_CD* Ethnicity_CD* concept_dimension Concept_Path Concept_CD Name_Char observer_dimension Observer_Path Observer_CD Name_Char 1 1 modifier_dimension Modifier_Path Modifier_CD Name_Char observation_fact Patient_Num Encounter_Num Concept_CD Observer_CD Start_Date Modifier_CD Instance_Num End_Date ValType_CD TVal_Char NVal_Num ValueFlag_CD Observation_Blob visit_dimension Encounter_Num Start_Date End_Date Active_Status_CD Location_CD* patient_dimension Patient_Num Birth_Date Death_Date Vital_Status_CD Age_Num* Gender_CD* Race_CD* Ethnicity_CD* concept_dimension Concept_Path Concept_CD Name_Char observer_dimension Observer_Path Observer_CD Name_Char 1 1 modifier_dimension Modifier_Path Modifier_CD Name_Char observation_fact Patient_Num Encounter_Num Concept_CD Observer_CD Start_Date Modifier_CD Instance_Num End_Date ValType_CD TVal_Char NVal_Num ValueFlag_CD Observation_Blob visit_dimension Encounter_Num Start_Date End_Date Active_Status_CD Location_CD* patient_dimension Patient_Num Birth_Date Death_Date Vital_Status_CD Age_Num* Gender_CD* Race_CD* Ethnicity_CD* concept_dimension Concept_Path Concept_CD Name_Char observer_dimension Observer_Path Observer_CD Name_Char 1 1 modifier_dimension Modifier_Path Modifier_CD Name_Char observation_fact Patient_Num Encounter_Num Concept_CD Observer_CD Start_Date Modifier_CD Instance_Num End_Date ValType_CD TVal_Char NVal_Num ValueFlag_CD Observation_Blob visit_dimension Encounter_Num Start_Date End_Date Active_Status_CD Location_CD* patient_dimension Patient_Num Birth_Date Death_Date Vital_Status_CD Age_Num* Gender_CD* Race_CD* Ethnicity_CD* concept_dimension Concept_Path Concept_CD Name_Char observer_dimension Observer_Path Observer_CD Name_Char 1 1 modifier_dimension Modifier_Path Modifier_CD Name_Char Data Model Data Repository Cell
4 An i2b2 Enterprise is built from i2b2 Hives Enterprise Research Patient Data Registry visit_dimension patient_dimension Encounter_Num 1 Patient_Num observation_fact 1 Start_Date Patient_Num Birth_Date End_Date Encounter_Num Death_Date Concept_CD Active_Status_CD Vital_Status_CD Location_CD* Observer_CD Age_Num* Start_Date Gender_CD* Modifier_CD Race_CD* observer_dimension Instance_Num Ethnicity_CD* Observer_Path End_Date ValType_CD Observer_CD TVal_Char Name_Char NVal_Num concept_dimension ValueFlag_CD Concept_Path Observation_Blob modifier_dimension Modifier_Path Concept_CD Name_Char Modifier_CD Name_Char
5 Research Patient Data Registry (RPDR) at Partners Healthcare to find patient cohorts for clinical research 1) Queries for aggregate patient numbers - Warehouse of in & outpatient clinical data million Partners Healthcare patients billion diagnoses, medications, procedures, laboratories, & physical findings coupled to demographic & visit data - Authorized use by faculty status - Clinicians can construct complex queries - Queries cannot identify individuals, internally can produce identifiers for (2) Query construction in web tool Encrypted identifiers Z731984X Z74902XX Deidentified Data Warehouse 2) Returns detailed patient data - Start with list of specific patients, usually from (1) - Authorized use by IRB Protocol - Returns contact and PCP information, demographics, providers, visits, diagnoses, medications, procedures, laboratories, microbiology, reports (discharge, LMR, operative, radiology, pathology, cardiology, pulmonary, endoscopy), and images into a Microsoft Access database and text files OR Real identifiers
6 2014 s usage of RPDR > 5100 registered users over 12 years, 655 new in teams/year gathering data for research studies 2634 detailed patient data sets returned to these teams in 2014, containing data of 24.7 million patient records. From a survey of 153 teams Importance of the data received from the RPDR was evaluated in relation to the study it was supporting. $ million total research support critically dependent on RPDR from patient data received throughout life of funding. Usefulness of Detailed Data 106 Total Responses Not Useful 15% Useful 42% Critical 43% ~300 data marts were created to support hospital operations, representing about 80 million patient records
7 Theory of Kimball translated to Healthcare Data Concept DIMENSION concept_key concept_text search_hierarchy Encounter DIMENSION encounter_key encounter_date hospital_of_service Star schema Patient-Concept FACTS patient_key concept_key start_date end_date practitioner_key encounter_key value_type numeric_value textual_value abnormal_flag Patient DIMENSION patient_key patient_id (encrypted) sex age birth_date race deceased ZIP Pract. DIMENSION practitioner_key name service Binary Tree start search 2200 million
8 Adapting to Big Data Fact Table Works Well for Heath Data Integration! BUT People need to agree to give you their data When database gets very large Can be difficult to update and not fragment index Expensive to pay for database and high performance server Innovation tends to suffer when need to integrate into a large mission-critical system
9 An i2b2 Enterprise is built from i2b2 Hives Enterprise of different i2b2 Hives observation_fact Patient_Num Encounter_Num Concept_CD Observer_CD Start_Date Modifier_CD Instance_Num End_Date ValType_CD TVal_Char NVal_Num ValueFlag_CD Observation_Blob visit_dimension Encounter_Num Start_Date End_Date Active_Status_CD Location_CD* patient_dimension Patient_Num Birth_Date Death_Date Vital_Status_CD Age_Num* Gender_CD* Race_CD* Ethnicity_CD* concept_dimension Concept_Path Concept_CD Name_Char observer_dimension Observer_Path Observer_CD Name_Char 1 1 modifier_dimension Modifier_Path Modifier_CD Name_Char observation_fact Patient_Num Encounter_Num Concept_CD Observer_CD Start_Date Modifier_CD Instance_Num End_Date ValType_CD TVal_Char NVal_Num ValueFlag_CD Observation_Blob visit_dimension Encounter_Num Start_Date End_Date Active_Status_CD Location_CD* patient_dimension Patient_Num Birth_Date Death_Date Vital_Status_CD Age_Num* Gender_CD* Race_CD* Ethnicity_CD* concept_dimension Concept_Path Concept_CD Name_Char observer_dimension Observer_Path Observer_CD Name_Char 1 1 modifier_dimension Modifier_Path Modifier_CD Name_Char A observation_fact Patient_Num Encounter_Num Concept_CD Observer_CD Start_Date Modifier_CD Instance_Num End_Date ValType_CD TVal_Char NVal_Num ValueFlag_CD Observation_Blob visit_dimension Encounter_Num Start_Date End_Date Active_Status_CD Location_CD* patient_dimension Patient_Num Birth_Date Death_Date Vital_Status_CD Age_Num* Gender_CD* Race_CD* Ethnicity_CD* concept_dimension Concept_Path Concept_CD Name_Char observer_dimension Observer_Path Observer_CD Name_Char 1 1 modifier_dimension Modifier_Path Modifier_CD Name_Char observation_fact Patient_Num Encounter_Num Concept_CD Observer_CD Start_Date Modifier_CD Instance_Num End_Date ValType_CD TVal_Char NVal_Num ValueFlag_CD Observation_Blob visit_dimension Encounter_Num Start_Date End_Date Active_Status_CD Location_CD* patient_dimension Patient_Num Birth_Date Death_Date Vital_Status_CD Age_Num* Gender_CD* Race_CD* Ethnicity_CD* concept_dimension Concept_Path Concept_CD Name_Char observer_dimension Observer_Path Observer_CD Name_Char 1 1 modifier_dimension Modifier_Path Modifier_CD Name_Char
10 Inside a Parent PICI Ontology Patient Resolution P1 H1 H2 H3 P2 H1 H2? Pa Pb QUERY P3 H1? H3 Pc SUBQUERIES Query Distribution Engine
11 Flow of Distributed Query Parent Child Child
12 Creating Enterprise Wide Query and Analysis System for Big Data RPDR Redcap Survey Data Public Health (CMS) i2b2 Navigator Imaging DICOM Partners Biobank Genomics Sets Notes / Text Repository Big Data Commons Integrates disparate islands of patient data (clinical and research data) onto a Common Big Data Platform 12
13 RPDR Search for Diabetes, A1C, and SGLT2
14 RPDR Enhance w/ LMR Note Search
15 RPDR Additional Patients Returned
16 I2b2/RPDR Text-based Queries Transparent to users - add text based searching to query tool Researchers could search for phrases of interest in a variety of clinical notes housed in a separate notes repository Cardiology, Discharge Summaries, Endoscopy, LMR, Operation, Pathology, Pulmonary, and Radiology Notes are updated nightly in a separate notes repository Several steps taken to conceal PHI through query Researchers will not be able to query on numbers of any kind Names are excluded from text index making them unsearchable Search terms are also matched against a PHI database Separate database is securely maintained with latest patient PHI, including names, addresses, numbers, nicknames, etc. Search terms are queried in Notes Repository and PHI database Patients that are returned from both databases are excluded from results Final aggregate counts are obfuscated to hide true totals and limit ability to identify one individual
17 Creating Enterprise Wide Query and Analysis System for Big Data RPDR Redcap Survey Data Public Health (CMS) i2b2 Navigator Imaging DICOM Partners Biobank Genomics Sets Notes / Text Repository Big Data Commons Integrates disparate islands of patient data (clinical and research data)onto a Common Big Data Platform 17
18 The Partners Biobank Samples Consent The Partners Biobank provides samples (plasma, serum, and DNA) collected from consented patients. Data Research Discoveries 25,000 patients have consented to date! Samples are available for distribution to Partners investigators* to help identify novel Personalized Medicine opportunities that reduce cost and provide better care *with required approval from the Partners Institutional Review Board (IRB). Improved Clinical Care for All Patients
19 Biobank Integrative Genomics Strategy Partners BioBank Samples (Whole Blood Extracted DNA/RNA) Genotyping Transcriptome Epigenome Profiling Illumina MEGArray: Multi-Ethnic GWAS/Exome SNP Array Whole Transcriptome Analysis: RNA-seq Methylation Analysis: HumanM450K Array Array Cost: $59/ sample Array Cost: $40-50/sample Array Cost: $150/sample Genome/Transcriptome Analysis: ~$100/sample Genome/Transcriptome/Epigenome Analysis: ~$260/sample
20 Data Integration Phenotype Discovery Center Electronic Medical Record (EMR) Data Coded Data Demographics Diagnoses Lab Results Additional Data Other Research Data Survey Data Samples RPDR Medications Procedures Etc. Biobank Data Text Data (Notes/Reports) Physician Notes Imaging Reports Pathology Reports Etc. Genetic Data (coming in 2015) GWAS Consent Biobank Portal Utilities Validated Phenotypes Type II Diabetes Coronary Artery Disease Congestive Heart Failure Rheumatoid Arthritis Calculated Controls (Charlson Index) Data Visualization Annotation IBD Multiple Sclerosis Bipolar Disorder Data Queries Extract Data Natural Language Processing Data and Sample Requests Links all the data: Sample, consent, EMR, GWAS, survey, phenotypes, controls DNA Serum Plasma Recontact Consent Status Research 2
21 Curating a Disease Algorithm 1. Create a gold standard training set. 3. Develop the classification algorithm. Using the data analysis file and the training set from step 1, assess the frequency of each variable. Remove variables with low prevalence. Apply adaptive LASSO penalized logistic regression to identify highly predictive variables for the algorithm 2. Create a comprehensive list of features (concepts/variables) that describe the phenotype of interest 4. Apply the algorithm to all subjects in the superset and assign each subject a probability of having the phenotype 95% Specificity 21
22 Use Phenotyping Algorithms to define cohorts of treatmentresistant and treatmentresponsive depression Initially: AUC = 0.54 Finally: AUC = 0.87 Depressed at Encounter Well at Encounter Initially: AUC = 0.55 Finally: AUC = 0.86 Clinical Status Model Specificity Sensitivity Precision AUC Depressed Billing Codes (0.03) 0.57 (0.14) 0.54 (0.02) Depressed NLP (0.05) 0.78 (0.02) 0.88 (0.02) Depressed NLP + Billing Codes (0.06) 0.78 (0.02) 0.87 (0.02) Well Billing Codes (0.02) 0.26 (0.27) 0.55 (0.03) Well NLP (0.06) 0.86 (0.02) 0.85 (0.02) Well NLP + Billing Codes (0.07) 0.85 (0.02) 0.86 (0.02)
23 Biobank Portal Curated Diseases Validated Phenotype Count* Predictive Positive Value Bipolar Disease 71 89% Congestive Heart Failure % Coronary Artery Disease 2,420 97% Crohn s Disease % Multiple Sclerosis 94 90% Rheumatoid Arthritis % Type 2 Diabetes Mellitus 1,887 97% Ulcerative Colitis % Healthy Controls based on Charlson Index * Based on 15,880 patients ** Based on 21,300 patients Count** 0 10-year survival probability is >98.3% 2, year survival probability is >95.87% 4, year survival probability is >90.15% 6,545
24 Partners Biobank Portal Login Screen
25 Partners Biobank Portal Splash Screen
26 Biobank Portal Run a Query Step 1: Run a query in the Biobank Portal Two criteria: a. The patient is a smoker (from the Health Information Survey) b. The patient agrees to be re-contacted
27 Biobank Portal Request Samples Based on Prior Query Step 2: Request Samples in the Biobank Portal based on the prior query
28 Partners Biobank Portal Defining a Query
29 Partners Biobank Portal Viewing Query
30 Partners Biobank Portal Running a Query
31 Partners Biobank Portal Download De-Identified Data
32
33 Partners Biobank Portal Specify Data for Download
34 Download table and select patients
35 Partners Biobank Portal Request Genetic Data
36 Navigating NGS Variant Data with Sequence Ontology Combination of concepts and modifiers to identify: An SNV/SNP located on a 3 UTR An SNV/SNP associated with a certain gene An SNV/SNP of specified zygosity
37 DEMO 37
38 38 Log on to Parent
39 39 Parent show all available ontologies
40 40 Formulate Query as usual in Parent
41 Parent combines child patient sets Child #1 has Diagnoses Child #2 has Lab tests Answers is Intersection 41
42 42
43 43 Child # 2 has Lab tests (and meds/demog)
44 44 Query to Child #2 returned 2 patients
45 45
46 Child #1 has Diagnoses (and proc/demog) Query to Child #1 returned 89 patients 46
47 I2b2 Community Software distributed as open source
48 Patient Data Registries Innovative Tools & IT solutions Research Partners Biobank Quality Patient Care
49 Innovation Platform Secure computational platform for Big Data Providing analytic tools to gather and compute upon big data within a secure virtual machine Working with EMC to provide storage, computation, and virtualization Enabling growth using secure cloud services Connecting to clinical care For effective recruitment of patients to clinical trials For providing innovative insights into clinical care using big data resources at Partners
50 Innovation Platform Linking to EMR with I2b2 Apps Published 2011
51 Substitutable Medical Apps and Reusable Technologies SMART REST SMART Connect
52
53 SMART Apps link i2b2 to the EMR Substitutable Medical Application and Reusable Technology Commissioned form the Office of the National Coordinator Allows Big Data from i2b2 to integrate Clinical Apps into the Epic EMR. Paradigm is similar to Mobile Apps with a proposed standard interface using FHIR sponsored by the National Argonaut EMR Project.
54 SMART-i2b2 Creates the Clinical Innovation Platform Connecting Partners Big Data Commons with the practice of Precision Medicine Navigates complex issues with Intellectual Property and integration into Epic Standard, highly regulated Application Program Interface in HL7 FHIR (Fast Healthcare Interoperability Resources) to be provided for clinical applications to dip into Partners Big Data Commons with potential to scale to National level Pre-built plug-in architecture for Apps that are built to SMART specifications will allow providers in EMR to receive Decision Support from Big Data Commons
55 Treating Cancer Patient with Precision Medicine Partners Big Data Commons
56 Enabling Innovation to reach into EMR SMART Apps run in EMR Got Statins? BP Centiles Cancer Risk SMART-Enabled DW - I2b2/RPDR Partners Big Data Commons SMART-Enabled Document/Pivotal SMART-Enabled EMR - Epic Research Applications run directly in Data Lake
57 Tribute to I2b2 Core Team Isaac Kohane Susanne Churchill Shawn Murphy Griffin Weber Paul Avillach Michael Mendis Lori Phillips Janice Donahoe Nich Wattanasin David Wang Christopher Herrick Bill Wang Vivian Gainer Andrew Cagan SMART i2b2 Team Nich Wattanasin Kenneth Mandl Joshua Mandel i2b2 SHRINE Team William Simons Douglas MacFadden I2b2/tranSMART Team Paul Avillach Michael McDuffy Jeremy Easton-Marks mi2b2 Team Randy Gollub Christopher Herrick Bill Wang
58 I2b2, SHRINE, and SMART Information and Software on the Web i2b2 Homepage ( i2b2 Software ( i2b2 Community Site ( SHRINE at Harvard ( SHRINE Software: ( SMART Platforms Homepage (
Opportunities and Limitations of Big Data to Address Diversity. Shawn Murphy MD, Ph.D.
Opportunities and Limitations of Big Data to Address Diversity Shawn Murphy MD, Ph.D. The Research Data Warehouse at Partners Healthcare Partners Enterprise Research Patient Data Registry Multiple Systems
Use of the Research Patient Data Registry at Partners Healthcare, Boston
Use of the Research Patient Data Registry at Partners Healthcare, Boston Advancing Clinical Research with Hospital Clinical Records Shawn Murphy MD, Ph.D. Massachusetts General Hospital Outline of presentation
The i2b2 Hive and the Clinical Research Chart
The i2b2 Hive and the Clinical Research Chart Henry Chueh Shawn Murphy The i2b2 Hive is centered around two concepts. The first concept is the existence of services provided by applications that are wrapped
i2b2 Clinical Research Chart
i2b2 Clinical Research Chart Shawn Murphy MD, Ph.D. Griffin Weber MD, Ph.D. Michael Mendis Vivian Gainer MS Lori Phillips MS Rajesh Kuttan Wensong Pan MS Henry Chueh MD Susanne Churchill Ph.D. John Glaser
i2b2 Clinical Research Chart
i2b2 Clinical Research Chart Shawn Murphy MD, Ph.D. Griffin Weber MD, Ph.D. Michael Mendis Vivian Gainer MS Lori Phillips MS Rajesh Kuttan Wensong Pan MS Henry Chueh MD Susanne Churchill Ph.D. John Glaser
i2b2 Clinical Research Chart
i2b2 Clinical Research Chart Shawn Murphy MD, Ph.D. Griffin Weber MD, Ph.D. Michael Mendis Andrew McMurry Vivian Gainer MS Lori Phillips MS Rajesh Kuttan Wensong Pan MS Henry Chueh MD Susanne Churchill
Informatics for Integrating Biology and the Bedside. i2b2 User Guide. Import Data View. Document Version: 1.5.1 I2b2 Software Release: 1.
Informatics for Integrating Biology and the Bedside i2b2 User Guide Import Data View Document Version: 1.5.1 I2b2 Software Release: 1.5 Table of Contents INTRODUCTION...3 LAYOUT OF THE IMPORT WIZARD TAB...3
@mandl. Bringing Big Data to the Point of Care by Creating the App Store for Health
Bringing Big Data to the Point of Care by Creating the App Store for Health @mandl Kenneth D. Mandl, MD, MPH Professor, Harvard Medical School Director, Boston Children s Hospital Computational Health
Apps to display patient data, making SMART available in the i2b2 platform
Apps to display patient data, making SMART available in the i2b2 platform Nich Wattanasin, MS 1, Alyssa Porter, MS 1, Stella Ubaha 1, Michael Mendis 1, Lori Phillips, MS 1, Joshua Mandel, MD 3, Rachel
Employing SNOMED CT and LOINC to make EHR data sensible and interoperable for clinical research
Employing SNOMED CT and LOINC to make EHR data sensible and interoperable for clinical research James R. Campbell MD W. Scott Campbell PhD Hubert Hickman MS James McClay MD Implementation Showcase October
Personalized Medicine: Humanity s Ultimate Big Data Challenge. Rob Fassett, MD Chief Medical Informatics Officer Oracle Health Sciences
Personalized Medicine: Humanity s Ultimate Big Data Challenge Rob Fassett, MD Chief Medical Informatics Officer Oracle Health Sciences 2012 Oracle Corporation Proprietary and Confidential 2 3 Humanity
Medical Informatics: A Fishing Story
Medical Informatics: A Fishing Story Russ Waitman, PhD Director of Medical Informatics, Associate Professor, Department of Biostatistics Director, Frontiers Biomedical Informatics Assistant Vice Chancellor,
An Easily Accessed Clinical Research Database from your Epic EMR
Loyola University Chicago Health Sciences Division Stritch School of Medicine (SSOM) An Easily Accessed Clinical Research Database from your Epic EMR February 13, 2014 Speakers: Richard H. Kennedy, Ph.D.
Interoperability and Analytics February 29, 2016
Interoperability and Analytics February 29, 2016 Matthew Hoffman MD, CMIO Utah Health Information Network Conflict of Interest Matthew Hoffman, MD Has no real or apparent conflicts of interest to report.
Clinical and research data integration: the i2b2 FSM experience
Clinical and research data integration: the i2b2 FSM experience Laboratory of Biomedical Informatics for Clinical Research Fondazione Salvatore Maugeri - FSM - Hospital, Pavia, italy Laboratory of Biomedical
Data Repository (CRC) Cell
Informatics for Integrating Biology and the Bedside i2b2 Software Architecture Data Repository (CRC) Cell Document Version: 1.7.1 i2b2 Software Version: 1.7.00 Table of Contents DOCUMENT MANAGEMENT...
EHR OpenNESS: Innovation and Uptake
EHR OpenNESS: Innovation and Uptake Kenneth D. Mandl, MD, MPH Director, Intelligent Health Laboratory Children s Hospital Informatics Program Center for Biomedical Informatics Harvard Medical School To
Cerner i2b2 User s s Guide and Frequently Asked Questions. v1.3
User s s Guide and v1.3 Contents General Information... 3 Q: What is i2b2?... 3 Q: How is i2b2 populated?... 3 Q: How often is i2b2 updated?... 3 Q: What data is not in our i2b2?... 3 Q: Can individual
Enabling the Health Continuum with Informatics. Jeroen Tas Healthcare Informatics.Solutions.Services
Enabling the Health Continuum with Informatics Jeroen Tas Healthcare Informatics.Solutions.Services The changing global healthcare landscape requires a fundamentally different informatics approach Enabling
Enhancing Functionality of EHRs for Genomic Research, Including E- Phenotying, Integrating Genomic Data, Transportable CDS, Privacy Threats
Enhancing Functionality of EHRs for Genomic Research, Including E- Phenotying, Integrating Genomic Data, Transportable CDS, Privacy Threats Genomic Medicine 8 meeting Alexa McCray Christopher G Chute Rex
Smarter Healthcare@IBM Research. Joseph M. Jasinski, Ph.D. Distinguished Engineer IBM Research
Smarter Healthcare@IBM Research Joseph M. Jasinski, Ph.D. Distinguished Engineer IBM Research Our researchers work on a wide spectrum of topics Basic Science Industry specific innovation Nanotechnology
Informatics for Integrating Biology and the Bedside. User Guide. Project Request. Document Version: 1.6.1 I2b2 Software Release: 1.
Informatics for Integrating Biology and the Bedside User Guide Project Request Document Version: 1.6.1 I2b2 Software Release: 1.6 Table of Contents Document Management 3 About this Guide 4 1. Prerequisites
Clinical Data Services
Clinical Data Services Data Storage, Data Collection Data Management Human Research Academy October 2014 CTS Research Development Services 706.721.6247 [email protected] Objectives Participants will:
Environmental Health Science. Brian S. Schwartz, MD, MS
Environmental Health Science Data Streams Health Data Brian S. Schwartz, MD, MS January 10, 2013 When is a data stream not a data stream? When it is health data. EHR data = PHI of health system Data stream
From Fishing to Attracting Chicks
The Greater Plains Collaborative: a PCORNet Clinical Data Research Network s Strategies for Creating an Interoperable Architecture From Fishing to Attracting Chicks Russ Waitman, PhD Associate Professor,
A Commercial Approach to De-Identification Dan Wasserstrom, Founder and Chairman De-ID Data Corp, LLC
A Commercial Approach to De-Identification Dan Wasserstrom, Founder and Chairman De-ID Data Corp, LLC De-ID Data Corp, LLC Founded to: ENHANCE DATA ACCESS WHILE PROTECTING PATIENT PRIVACY Founders Problem
IMPLEMENTING BIG DATA IN TODAY S HEALTH CARE PRAXIS: A CONUNDRUM TO PATIENTS, CAREGIVERS AND OTHER STAKEHOLDERS - WHAT IS THE VALUE AND WHO PAYS
IMPLEMENTING BIG DATA IN TODAY S HEALTH CARE PRAXIS: A CONUNDRUM TO PATIENTS, CAREGIVERS AND OTHER STAKEHOLDERS - WHAT IS THE VALUE AND WHO PAYS 29 OCTOBER 2015 DR. DIRK J. EVERS BACKGROUND TreatmentMAP
SAP Healthcare Analytics Solutions Provide physicians and researchers access to patient data from various systems in realtime
SAP Healthcare Analytics Solutions Provide physicians and researchers access to patient data from various systems in realtime Stephan Schindewolf, SAP SE, July 13, 2015 Facts per Decision Need Decision
Find the signal in the noise
Find the signal in the noise Electronic Health Records: The challenge The adoption of Electronic Health Records (EHRs) in the USA is rapidly increasing, due to the Health Information Technology and Clinical
An EVIDENCE-ENHANCED HEALTHCARE ECOSYSTEM for Cancer: I/T perspectives
An EVIDENCE-ENHANCED HEALTHCARE ECOSYSTEM for Cancer: I/T perspectives Chalapathy Neti, Ph.D. Associate Director, Healthcare Transformation, Shahram Ebadollahi, Ph.D. Research Staff Memeber IBM Research,
GENETIC DATA ANALYSIS
GENETIC DATA ANALYSIS 1 Genetic Data: Future of Personalized Healthcare To achieve personalization in Healthcare, there is a need for more advancements in the field of Genomics. The human genome is made
How Can Institutions Foster OMICS Research While Protecting Patients?
IOM Workshop on the Review of Omics-Based Tests for Predicting Patient Outcomes in Clinical Trials How Can Institutions Foster OMICS Research While Protecting Patients? E. Albert Reece, MD, PhD, MBA Vice
Informatics Resources for Clinical Research. Jim Cimino, Geoff Gordon, Matt Wyatt and James Willig UAB Informatics Institute April 6, 2016
Informatics Resources for Clinical Research Jim Cimino, Geoff Gordon, Matt Wyatt and James Willig UAB Informatics Institute April 6, 2016 Today s Take-Home Messages Overview of the clinical informatics
Informatics Domain Task Force (idtf) CTSA PI Meeting 02/04/2015
Informatics Domain Task Force (idtf) CTSA PI Meeting 02/04/2015 Informatics Domain Task Force (idtf) Lead Team Paul Harris, Vanderbilt University Medical Center, (co-chair) Steven Reis, University of Pittsburgh
Big Data Analytics- Innovations at the Edge
Big Data Analytics- Innovations at the Edge Brian Reed Chief Technologist Healthcare Four Dimensions of Big Data 2 The changing Big Data landscape Annual Growth ~100% Machine Data 90% of Information Human
Genomics and Health Data Standards: Lessons from the Past and Present for a Genome-enabled Future
Genomics and Health Data Standards: Lessons from the Past and Present for a Genome-enabled Future Daniel Masys, MD Professor and Chair Department of Biomedical Informatics Professor of Medicine Vanderbilt
NIH s Genomic Data Sharing Policy
NIH s Genomic Data Sharing Policy 2 Benefits of Data Sharing Enables data generated from one study to be used to explore a wide range of additional research questions Increases statistical power and scientific
Shouguo Gao Ph. D Department of Physics and Comprehensive Diabetes Center
Computational Challenges in Storage, Analysis and Interpretation of Next-Generation Sequencing Data Shouguo Gao Ph. D Department of Physics and Comprehensive Diabetes Center Next Generation Sequencing
Practical Implementation of a Bridge between Legacy EHR System and a Clinical Research Environment
Cross-Border Challenges in Informatics with a Focus on Disease Surveillance and Utilising Big-Data L. Stoicu-Tivadar et al. (Eds.) 2014 The authors. This article is published online with Open Access by
Incorporating Research Into Sight (IRIS) Essentia Rural Health Institute Marshfield Clinic Penn State University
Incorporating Research Into Sight (IRIS) Essentia Rural Health Institute Marshfield Clinic Penn State University Aim 1. Develop and validate electronic algorithms for ophthalmic conditions and efficacy
Research Data Extraction Service. Research and Woodruff Health Sciences IT
Research Data Extraction Service Research and Woodruff Health Sciences IT 1 Topics Emory s Clinical Data Warehouse Types of data available Accessing the data Key Points Clinical Data Warehouse initiated
Transla6ng from Clinical Care to Research: Integra6ng i2b2 and OpenClinica
Transla6ng from Clinical Care to : Integra6ng i2b2 and OpenClinica Aaron Abend Managing Director, Recombinant Data Corp May 13, 2011 Copyright 2011 Recombinant Data Corp. All rights reserved. 1 About Recombinant
Big Data for Population Health
Big Data for Population Health Prof Martin Landray Nuffield Department of Population Health Deputy Director, Big Data Institute, Li Ka Shing Centre for Health Information and Discovery University of Oxford
Setting the World on FHIR
Setting the World on FHIR W. Ed Hammond. Ph.D., FACMI, FAIMBE, FIMIA, FHL7 Director, Duke Center for Health Informatics Director, Applied Informatics Research, DHTS Director of Academic Affairs, MMCi Program
Workshop on Establishing a Central Resource of Data from Genome Sequencing Projects
Report on the Workshop on Establishing a Central Resource of Data from Genome Sequencing Projects Background and Goals of the Workshop June 5 6, 2012 The use of genome sequencing in human research is growing
Digital Health: Catapulting Personalised Medicine Forward STRATIFIED MEDICINE
Digital Health: Catapulting Personalised Medicine Forward STRATIFIED MEDICINE CRUK Stratified Medicine Initiative Somatic mutation testing for prediction of treatment response in patients with solid tumours:
A leader in the development and application of information technology to prevent and treat disease.
A leader in the development and application of information technology to prevent and treat disease. About MOLECULAR HEALTH Molecular Health was founded in 2004 with the vision of changing healthcare. Today
NIH Genomic Data Sharing (GDS) Policy Guidance Memo #2 1
MEMORANDUM TO: Principal Investigators and Research Staff DATE: 2/22/15 FROM: Anne Klibanski, MD, Partners Chief Academic Officer (CAO) Paul Anderson, MD, PhD, BWH CAO Harry Orf, PhD, MGH Sr. Vice President-Research
Supporting Clinical and Translational Research with Informatics
Supporting Clinical and Translational Research with Informatics Thomas R. Campion, Jr., Ph.D. Assistant Professor of Healthcare Policy and Research Assistant Professor of Healthcare Policy and Research
Delivering the power of the world s most successful genomics platform
Delivering the power of the world s most successful genomics platform NextCODE Health is bringing the full power of the world s largest and most successful genomics platform to everyday clinical care NextCODE
Integration of Genetic and Familial Data into. Electronic Medical Records and Healthcare Processes
Integration of Genetic and Familial Data into Electronic Medical Records and Healthcare Processes By Thomas Kmiecik and Dale Sanders February 2, 2009 Introduction Although our health is certainly impacted
Ganzheitliches Datenmanagement
Ganzheitliches Datenmanagement für Hadoop Michael Kohs, Senior Sales Consultant @mikchaos The Problem with Big Data Projects in 2016 Relational, Mainframe Documents and Emails Data Modeler Data Scientist
Factors for success in big data science
Factors for success in big data science Damjan Vukcevic Data Science Murdoch Childrens Research Institute 16 October 2014 Big Data Reading Group (Department of Mathematics & Statistics, University of Melbourne)
Public Health and the Learning Health Care System Lessons from Two Distributed Networks for Public Health
Public Health and the Learning Health Care System Lessons from Two Distributed Networks for Public Health Jeffrey Brown, PhD Assistant Professor Department of Population Medicine Harvard Medical School
EMR Systems and the Conduct of Clinical Research. Daniel E Ford, MD, MPH Vice Dean for Clinical Investigation Johns Hopkins School of Medicine
EMR Systems and the Conduct of Clinical Research Daniel E Ford, MD, MPH Vice Dean for Clinical Investigation Johns Hopkins School of Medicine Clinical Research Environment Research protocols are becoming
Biomedical Big Data and Precision Medicine
Biomedical Big Data and Precision Medicine Jie Yang Department of Mathematics, Statistics, and Computer Science University of Illinois at Chicago October 8, 2015 1 Explosion of Biomedical Data 2 Types
Leading Genomics. Diagnostic. Discove. Collab. harma. Shanghai Cambridge, MA Reykjavik
Leading Genomics Diagnostic harma Discove Collab Shanghai Cambridge, MA Reykjavik Global leadership for using the genome to create better medicine WuXi NextCODE provides a uniquely proven and integrated
SMART-on-FHIR Genomics: Enabling Precision Medicine by Bridging Clinical and Genomic Information
SMART-on-FHIR Genomics: Enabling Precision Medicine by Bridging Clinical and Genomic Information Presented by: Gil Alterovitz Assistant Professor, Harvard Medical School and Computational Health Informatics
Balancing Big Data for Security, Collaboration and Performance
Balancing Big Data for Security, Collaboration and Performance Sai Balu Lineberger Cancer Center UNC Chapel Hill Oct 14, 2014 About UNC Oldest Public University -1793 Top 5 Public University. 46th World
National Cancer Institute
National Cancer Institute Information Systems, Technology, and Dissemination in the SEER Program U.S. DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health Information Systems, Technology,
Secondary Uses of Data for Comparative Effectiveness Research
Secondary Uses of Data for Comparative Effectiveness Research Paul Wallace MD Director, Center for Comparative Effectiveness Research The Lewin Group [email protected] Disclosure/Perspectives Training:
Introduction to Information and Computer Science: Information Systems
Introduction to Information and Computer Science: Information Systems Lecture 1 Audio Transcript Slide 1 Welcome to Introduction to Information and Computer Science: Information Systems. The component,
Visual Analytics to Enhance Personalized Healthcare Delivery
Visual Analytics to Enhance Personalized Healthcare Delivery A RENCI WHITE PAPER A data-driven approach to augment clinical decision making CONTACT INFORMATION Ketan Mane, PhD kmane@renci,org 919.445.9703
School of Nursing. Presented by Yvette Conley, PhD
Presented by Yvette Conley, PhD What we will cover during this webcast: Briefly discuss the approaches introduced in the paper: Genome Sequencing Genome Wide Association Studies Epigenomics Gene Expression
Project Management (PM) Cell
Informatics for Integrating Biology and the Bedside i2b2 Design Document Project Management (PM) Cell Document Version: 1.7.1 i2b2 Software Version: 1.7.00 Table of Contents DOCUMENT MANAGEMENT... 4 1.
MyResearch @ UCSF. MyResearch 101. [email protected] 415 476 9830. Presented by. ITS Academic Research Systems. Powered by
MyResearch @ UCSF MyResearch 101 Powered by Presented by [email protected] 415 476 9830 ITS Academic Research Systems Contact ITS Help Desk 514-4100 email [email protected] September 29, 2014
BRISSkit: Biomedical Research Infrastructure Software Service kit. Jonathan Tedds. http://www.le.ac.uk/brisskit #brisskit #umfcloud
BRISSkit: Biomedical Research Infrastructure Software Service kit http://www.le.ac.uk/brisskit #brisskit #umfcloud Jonathan Tedds University of Leicester [email protected] @jtedds JISC University Modernisation
Meaningful Use Stage 2 Certification: A Guide for EHR Product Managers
Meaningful Use Stage 2 Certification: A Guide for EHR Product Managers Terminology Management is a foundational element to satisfying the Meaningful Use Stage 2 criteria and due to its complexity, and
New Clinical Research & Care Opportunities Through Big Data Informatics
New Clinical Research & Care Opportunities Through Big Data Informatics Gregory A. Jones Chief Technology Officer Health Sciences Global Business Unit September 2014 Safe Harbor Statement The following
Predictive Analytics Drives Population Health Management
Predictive Analytics Drives Population Health Management March 1, 2016 David Seo, MD University of Miami Health System AVP IT Clinical Applications and CMIO Chitra Raghu Lockheed Martin Program Manager
Technical Issues in Aggregating and Analyzing Data from Heterogeneous EHR Systems
Technical Issues in Aggregating and Analyzing Data from Heterogeneous EHR Systems Josh Denny, MD, MS [email protected] Vanderbilt University, Nashville, Tennessee, USA 2/12/2015 EHR data are dense
Bench to Bedside Clinical Decision Support:
Bench to Bedside Clinical Decision Support: The Role of Semantic Web Technologies in Clinical and Translational Medicine Tonya Hongsermeier, MD, MBA Corporate Manager, Clinical Knowledge Management and
Carolina s Journey: Turning Big Data Into Better Care. Michael Dulin, MD, PhD
Carolina s Journey: Turning Big Data Into Better Care Michael Dulin, MD, PhD Current State: Massive investments in EMR systems Rapidly Increase Amount of Data (Velocity, Volume, Veracity) The Data has
CardioSoft * Diagnostic System
GE Healthcare CardioSoft * Diagnostic System Connecting Hearts and Minds Clinically connected. Simply smart. CardioSoft Diagnostic System from GE Healthcare is more than a software program it is a data
Outcome Data, Links to Electronic Medical Records. Dan Roden Vanderbilt University
Outcome Data, Links to Electronic Medical Records Dan Roden Vanderbilt University Coordinating Center Type II Diabetes Case Algorithm * Abnormal lab= Random glucose > 200mg/dl, Fasting glucose > 125 mg/dl,
Electronic Medical Records and Genomics: Possibilities, Realities, Ethical Issues to Consider
Electronic Medical Records and Genomics: Possibilities, Realities, Ethical Issues to Consider Daniel Masys, M.D. Affiliate Professor Biomedical and Health Informatics University of Washington, Seattle
