Achilles a platform for exploring and visualizing clinical data summary statistics
|
|
|
- Christina Little
- 10 years ago
- Views:
Transcription
1 Biomedical Informatics discovery and impact Achilles a platform for exploring and visualizing clinical data summary statistics Mark Velez, MA Ning "Sunny" Shang, PhD Department of Biomedical Informatics, Columbia University NIH BD2K biocaddie webinar, August 13 th, 2015
2 Outline OHDSI ACHILLES demo Applications of ACHILLES 2
3 What is OHDSI The Observational Health Data Sciences and Informatics (OHDSI) program is a multistakeholder, interdisciplinary collaborative To bring out the value of observational health data through large-scale analytics and evidence generation Clinical characterization Population-level estimation Patient-level prediction 3
4 What is OHDSI Single observational data source is unlikely to be sufficient for research analysis needs Analyze multiple data sources concurrently Using a common data model and the foundational infrastructure to enable observational research By 2014, 58 databases in CDM > 250 million patients covered 4
5 What is OHDSI Mission To transform medical decision making by creating reliable scientific evidence about disease natural history, healthcare delivery, and the effects of medical interventions through large-scale analysis of observational health database for populationlevel estimation and patient-level predictions 5
6 OHDSI Infrastructure Data Source 1 Data Source 2 Data Source 3 Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM) statistical analysis e.g. Treatment pathway Analytic tools ACHILLES CIRCE Others 6
7 OMOP Common Data Model (CDM) V 5 7
8 Data transform in CDM Extracting, transforming, and loading (ETL) process WhiteRabbit: analyzes the structure and content of a database RabbitInAHat: connects and maps tables and columns from the raw dataset to the CDM dataset ETL-CDMBuilder: transform raw data to CDM 8
9 ACHILLES (Automated Characterization of Health Information at Large-scale Longitudinal Evidence Systems) An open source analytics framework Interactively explore population-level summary statistics for the data stored in CDM Profile your CDM data Explore population-level summaries Review data quality assessment Data in CDM Summary statistics Web visualization of statistics 9
10 ACHILLES implementation ACHILLES R package Oracle / SQL Server / Postgres / Redshift Summary statistics export into Json to prepare data for visualization Visualization by AchillesWeb (HTML5 / JavaScript) create strata tables Data quality queries (Heel) Export to JSON Visualization (AchillesWeb) 10
11 ACHILLES Summary Statistics 1 Summary of data set / clinical database Size of the database First /Continuous observation 11
12 Dashboard Summary of clinical dataset 12
13 ACHILLES Summary Statistics 2 Person demographic information and demographic information over death 13
14 Person 14
15 Death 15
16 ACHILLES Summary Statistics 3 Metadata (e.g. observation periods, data density) Observation periods document time intervals during which health care information captured Data density describes the unit quantity of records and concepts pertains in each database 16
17 Observation Periods 17
18 Data Density 18
19 ACHILLES Summary Statistics 3 Prevalence of condition/condition era/ observation/drug exposure/drug era/procedure/visit Treemap view Table view Drill down view 19
20 Condition Treemap view 20
21 Condition Table view 21
22 Condition Drill down view 1 22
23 Condition Drill down view 2 23
24 ACHILLES Summary Statistics 4 Achilles HEEL Data quality control component 24
25 Achilles Heel Data quality tool 25
26 ACHILLES Heel Error Types Error Type Clinical facts Example Illogical change Monthly change of count of condition is more than 100% Invalid ids Improper value based on norm Improper value based on inter-relationship Terminology Not standard vocabulary Non-mapped concept Wrong mapping concept Person has invalid provider_id Year of birth is less than 1800 Negative payment A condition is recorded after the patient is dead a concept is not a standard OMOP vocabulary concept Data with unmapped concepts Drug is not coded with RxNorm 26
27 Applications of ACHILLES Explore summary statistics about the clinical data Public domain (de-identified information) Integrate with clinical systems Achilles integrating other OHDSI tools Framework for other applications 27
28 ACHILLES collaborating with other OHDSI tools ACHILLES Database profiling CIRCE Cohort definition HERACLES Cohort characterization 28
29 ACHILLES Framework for other applications biocaddie DDI Suitability Framework 29
30 Suitability General definition the quality or state of being especially suitable or fitting [Merriam-Webster] In our project The extent to which a clinical dataset to meet the research needs for observational studies Data suitability is how suitable the data are for a specific research purpose 30
31 Research methods EHR characteristics lit review Suitability conceptual framework Web-based survey Metrics with Columbia EHR Hybrid Approach Categories Measures Implementation by Customizing ACHILLES Observa tional studyderived submeasur e Desider ata studyderived submeasur e 31
32 Can I access? User -- Researcher What s inside? (content) Suitability of Clinical Database for Observational Study Are data usable? Policy and Administration Data policy documentat ion Administrati ve platform Technical accessibility Relevance Healthcare organization description Data organization documentation Research data inventory Available and retrievable temporal information Descriptive metadata and provenance documentation Data provenance Database content synopsis Usability Data representatio n Usefulness Cohort availability Database linkability Quality Data quality control Database data quality Research sample data quality Accessibility Representation Intrinsic Contextual Data (data characteristics)
33 Suitability Survey 33
34 Implementation 34
35 Important websites OHDSI Main GitHub Page: Forum: ACHILLES R Package for Generating Statistics for ACHILLES: Web Application for Viewing ACHILLES Results: Demo ple%20database/dashboard 35
Next-generation Phenotyping Using Interoperable Big Data
Biomedical Informatics discovery and impact Next-generation Phenotyping Using Interoperable Big Data George Hripcsak, Chunhua Weng Columbia University Medical Center Collab with Mount Sinai Medical Center
How to extract transform and load observational data?
How to extract transform and load observational data? Martijn Schuemie Janssen Research & Development Department of Pharmacology & Pharmacy, The University of Hong Kong Outline Observational data & research
Learning from observational databases: Lessons from OMOP and OHDSI
Learning from observational databases: Lessons from OMOP and OHDSI Patrick Ryan Janssen Research and Development David Madigan Columbia University http://www.omop.org http://www.ohdsi.org The sole cause
Sisense. Product Highlights. www.sisense.com
Sisense Product Highlights Introduction Sisense is a business intelligence solution that simplifies analytics for complex data by offering an end-to-end platform that lets users easily prepare and analyze
Open-Source Big Data Analytics in Healthcare
Open-Source Big Data Analytics in Healthcare Jon Duke, George Hripcsak, Patrick Ryan www.ohdsi.org/medinfo-2015-tutorial Introduction Introducing OHDSI The Observational Health Data Sciences and Informatics
Connecting Basic Research and Healthcare Big Data
Elsevier Health Analytics WHS 2015 Big Data in Health Connecting Basic Research and Healthcare Big Data Olaf Lodbrok Managing Director Elsevier Health Analytics [email protected] t +49 89 5383 600
Theodoros. N. Arvanitis, RT, DPhil, CEng, MIET, MIEEE, AMIA, FRSM
TRANSFoRm Theodoros. N. Arvanitis, RT, DPhil, CEng, MIET, MIEEE, AMIA, FRSM Biomedical Informatics, Signals & Systems Research Laboratory School of Electronic, Electrical & Computer Engineering College
Utility of Common Data Models for EHR and Registry Data Integration: Use in Automated Surveillance
Utility of Common Data Models for EHR and Registry Data Integration: Use in Automated Surveillance Frederic S. Resnic, MS, MSc Chairman, Department of Cardiovascular Medicine Co-Director, Comparative Effectiveness
Classifying Adverse Events From Clinical Trials
Classifying Adverse Events From Clinical Trials Bernard LaSalle, Richard Bradshaw University of Utah, Biomedical Informatics, Salt Lake City, UT USA [email protected] Abstract The use of adverse
Lost in Space? Methodology for a Guided Drill-Through Analysis Out of the Wormhole
Paper BB-01 Lost in Space? Methodology for a Guided Drill-Through Analysis Out of the Wormhole ABSTRACT Stephen Overton, Overton Technologies, LLC, Raleigh, NC Business information can be consumed many
Searching biomedical data sets. Hua Xu, PhD The University of Texas Health Science Center at Houston
Searching biomedical data sets Hua Xu, PhD The University of Texas Health Science Center at Houston Motivations for biomedical data re-use Improve reproducibility Minimize duplicated efforts on creating
Summary of Responses to the Request for Information (RFI): Input on Development of a NIH Data Catalog (NOT-HG-13-011)
Summary of Responses to the Request for Information (RFI): Input on Development of a NIH Data Catalog (NOT-HG-13-011) Key Dates Release Date: June 6, 2013 Response Date: June 25, 2013 Purpose This Request
Find the signal in the noise
Find the signal in the noise Electronic Health Records: The challenge The adoption of Electronic Health Records (EHRs) in the USA is rapidly increasing, due to the Health Information Technology and Clinical
An EVIDENCE-ENHANCED HEALTHCARE ECOSYSTEM for Cancer: I/T perspectives
An EVIDENCE-ENHANCED HEALTHCARE ECOSYSTEM for Cancer: I/T perspectives Chalapathy Neti, Ph.D. Associate Director, Healthcare Transformation, Shahram Ebadollahi, Ph.D. Research Staff Memeber IBM Research,
Real-Time Market Monitoring using SAS BI Tools
Paper 1835-2014 Real-Time Market Monitoring using SAS BI Tools Amol Deshmukh, CA ISO Corporation, Folsom Jeff McDonald, CA ISO Corporation, Folsom Abstract The Department of Market Monitoring at California
Business Intelligence & Product Analytics
2010 International Conference Business Intelligence & Product Analytics Rob McAveney www. 300 Brickstone Square Suite 904 Andover, MA 01810 [978] 691 8900 www. Copyright 2010 Aras All Rights Reserved.
Integrated Enterprise Reporting
Integrated Enterprise Reporting July 2013 Don McNatty Abstract Integrated Reporting can be accomplished by using reporting and dashboard tools that support accessing multiple data sources directly or through
Karl Lum Partner, LabKey Software [email protected]. Evolution of Connectivity in LabKey Server
Karl Lum Partner, LabKey Software [email protected] Evolution of Connectivity in LabKey Server Connecting Data to LabKey Server Lowering the barrier to connect scientific data to LabKey Server Increased
Preparing Electronic Health Records for Multi-Site CER Studies
Preparing Electronic Health Records for Multi-Site CER Studies Michael G. Kahn 1,3,4, Lisa Schilling 2 1 Department of Pediatrics, University of Colorado, Denver 2 Department of Medicine, University of
Exploration and Visualization of Post-Market Data
Exploration and Visualization of Post-Market Data Jianying Hu, PhD Joint work with David Gotz, Shahram Ebadollahi, Jimeng Sun, Fei Wang, Marianthi Markatou Healthcare Analytics Research IBM T.J. Watson
From Fishing to Attracting Chicks
The Greater Plains Collaborative: a PCORNet Clinical Data Research Network s Strategies for Creating an Interoperable Architecture From Fishing to Attracting Chicks Russ Waitman, PhD Associate Professor,
In this presentation, you will be introduced to data mining and the relationship with meaningful use.
In this presentation, you will be introduced to data mining and the relationship with meaningful use. Data mining refers to the art and science of intelligent data analysis. It is the application of machine
How To Choose A Business Intelligence Toolkit
Background Current Reporting Challenges: Difficulty extracting various levels of data from AgLearn Limited ability to translate data into presentable formats Complex reporting requires the technical staff
Visual Analytics to Enhance Personalized Healthcare Delivery
Visual Analytics to Enhance Personalized Healthcare Delivery A RENCI WHITE PAPER A data-driven approach to augment clinical decision making CONTACT INFORMATION Ketan Mane, PhD kmane@renci,org 919.445.9703
MED 2400 MEDICAL INFORMATICS FUNDAMENTALS
MED 2400 MEDICAL INFORMATICS FUNDAMENTALS NEW YORK CITY COLLEGE OF TECHNOLOGY The City University Of New York School of Arts and Sciences Biological Sciences Department Course title: Course code: MED 2400
School of Nursing University of Minnesota Informatics Competencies across the Curriculum
Informatics Competencies across the Curriculum Data - basic elements & related standards BSN DNP PhD Resources Course Assignment Describe the value of using standardized terminologies and messaging standards
Oracle Big Data SQL Technical Update
Oracle Big Data SQL Technical Update Jean-Pierre Dijcks Oracle Redwood City, CA, USA Keywords: Big Data, Hadoop, NoSQL Databases, Relational Databases, SQL, Security, Performance Introduction This technical
Big Data R&D Initiative
Big Data R&D Initiative Howard Wactlar CISE Directorate National Science Foundation NIST Big Data Meeting June, 2012 Image Credit: Exploratorium. The Landscape: Smart Sensing, Reasoning and Decision Environment
QAD Business Intelligence Data Warehouse Demonstration Guide. May 2015 BI 3.11
QAD Business Intelligence Data Warehouse Demonstration Guide May 2015 BI 3.11 Overview This demonstration focuses on the foundation of QAD Business Intelligence the Data Warehouse and shows how this functionality
MEDICAL DATA MINING. Timothy Hays, PhD. Health IT Strategy Executive Dynamics Research Corporation (DRC) December 13, 2012
MEDICAL DATA MINING Timothy Hays, PhD Health IT Strategy Executive Dynamics Research Corporation (DRC) December 13, 2012 2 Healthcare in America Is a VERY Large Domain with Enormous Opportunities for Data
Environmental Health Science. Brian S. Schwartz, MD, MS
Environmental Health Science Data Streams Health Data Brian S. Schwartz, MD, MS January 10, 2013 When is a data stream not a data stream? When it is health data. EHR data = PHI of health system Data stream
Adam Rauch Partner, LabKey Software [email protected]. Extending LabKey Server Part 1: Retrieving and Presenting Data
Adam Rauch Partner, LabKey Software [email protected] Extending LabKey Server Part 1: Retrieving and Presenting Data Extending LabKey Server LabKey Server is a large system that combines an extensive set
Meaningful use. Meaningful data. Meaningful care. The 3M Healthcare Data Dictionary (HDD): Implemented with a data warehouse
Meaningful use. Meaningful data. Meaningful care. The 3M Healthcare Data Dictionary (HDD): Implemented with a data warehouse Executive summary A large academic research institution uses the 3M Healthcare
NATIONAL CENTER FOR PUBLIC HEALTH INFORMATICS (CPE)
NATIONAL CENTER FOR PUBLIC HEALTH INFORMATICS (CPE) The National Center for Public Health Informatics (NCPHI) protects and improves the public s health through discovery, innovation, and service in health
Know more Act Better: Launching KPI Reporting & Benchmarking Framework
Know more Act Better: Launching KPI Reporting & Benchmarking Framework JANUARY 2012 Abstract In today s competitive scenario of commoditization of products and services, technology is no longer a differentiator.
Ernesto Ongaro BI Consultant February 19, 2013. The 5 Levels of Embedded BI
Ernesto Ongaro BI Consultant February 19, 2013 The 5 Levels of Embedded BI Saleforce.com CRM 2013 Jaspersoft Corporation. 2 Blogger 2013 Jaspersoft Corporation. 3 Linked In 2013 Jaspersoft Corporation.
Secondary Uses of Data for Comparative Effectiveness Research
Secondary Uses of Data for Comparative Effectiveness Research Paul Wallace MD Director, Center for Comparative Effectiveness Research The Lewin Group [email protected] Disclosure/Perspectives Training:
The course will run simultaneously with the MSCI students.
Introduction to Medical Informatics for Informaticians Fall 2015 BMSC-GA 4455 Course Description: This course will serve as an introduction to Biomedical Informatics, which is the interdisciplinary science
Building patient-level predictive models Martijn J. Schuemie, Marc A. Suchard and Patrick Ryan 2015-11-01
Building patient-level predictive models Martijn J. Schuemie, Marc A. Suchard and Patrick Ryan 2015-11-01 Contents 1 Introduction 1 1.1 Specifying the cohort of interest and outcomes..........................
An interdisciplinary model for analytics education
An interdisciplinary model for analytics education Raffaella Settimi, PhD School of Computing, DePaul University Drew Conway s Data Science Venn Diagram http://drewconway.com/zia/2013/3/26/the-data-science-venn-diagram
Business Intelligence, Analytics & Reporting: Glossary of Terms
Business Intelligence, Analytics & Reporting: Glossary of Terms A B C D E F G H I J K L M N O P Q R S T U V W X Y Z Ad-hoc analytics Ad-hoc analytics is the process by which a user can create a new report
Frequently Asked Questions
Frequently Asked Questions Business Office: 598 Airport Boulevard Suite 1400 Morrisville NC 27560 Contact: [email protected] Phone: 888.750.6941 Fax: 888.650.6795 www.cognitrax.com Diseases of the
3M Health Information Systems
3M Health Information Systems 1 Data Governance Disparate Systems Interoperability Information Exchange Reporting Public Health Quality Metrics Research Data Warehousing Data Standards What is the 3M Healthcare
Geodatabase Programming with SQL
DevSummit DC February 11, 2015 Washington, DC Geodatabase Programming with SQL Craig Gillgrass Assumptions Basic knowledge of SQL and relational databases Basic knowledge of the Geodatabase We ll hold
Use of Electronic Health Records in Clinical Research: Core Research Data Element Exchange Detailed Use Case April 23 rd, 2009
Use of Electronic Health Records in Clinical Research: Core Research Data Element Exchange Detailed Use Case April 23 rd, 2009 Table of Contents 1.0 Preface...4 2.0 Introduction and Scope...6 3.0 Use Case
Big Data Architecture & Analytics A comprehensive approach to harness big data architecture and analytics for growth
MAKING BIG DATA COME ALIVE Big Data Architecture & Analytics A comprehensive approach to harness big data architecture and analytics for growth Steve Gonzales, Principal Manager [email protected]
Comparative Analysis of the Main Business Intelligence Solutions
148 Informatica Economică vol. 17, no. 2/2013 Comparative Analysis of the Main Business Intelligence Solutions Alexandra RUSANEANU Faculty of Cybernetics, Statistics and Economic Informatics Bucharest
How To Use Data Analysis To Get More Information From A Computer Or Cell Phone To A Computer
Applying Big Data approaches to Competitive Intelligence challenges THOMSON REUTERS IP & SCIENCE PHARMA CI EUROPE CONFERENCE & EXHIBITION TIM MILLER 19 FEBRUARY 2014 BIG DATA, NOT JUST ABOUT VOLUMES Patient
Deploy. Friction-free self-service BI solutions for everyone Scalable analytics on a modern architecture
Friction-free self-service BI solutions for everyone Scalable analytics on a modern architecture Apps and data source extensions with APIs Future white label, embed or integrate Power BI Deploy Intelligent
A GENERAL TAXONOMY FOR VISUALIZATION OF PREDICTIVE SOCIAL MEDIA ANALYTICS
A GENERAL TAXONOMY FOR VISUALIZATION OF PREDICTIVE SOCIAL MEDIA ANALYTICS Stacey Franklin Jones, D.Sc. ProTech Global Solutions Annapolis, MD Abstract The use of Social Media as a resource to characterize
OpenAdmin Tool for Informix (OAT) October 2012
OpenAdmin Tool for Informix (OAT) October 2012 What is OpenAdmin Tool for Informix? OAT is a web-based administration tool for the IBM Informix database server A single OAT installation can administer
11. CASE STUDY: HEALTHCARE ANALYTICAL DASHBOARDS USING TABLEAU
11. CASE STUDY: HEALTHCARE ANALYTICAL DASHBOARDS USING TABLEAU 11.1 Problem Definition: The Quality of the healthcare data is enforced as part of Patient Protection and Affordable Care Act and to measure
Disrupting The Market: Predictive Analytics As A Service
Disrupting The Market: Predictive Analytics As A Service 0 Problem 8.7 Billion Connected Devices 1 Growing 25% Annually What Does This Data Tell Us About Sensor Use? 1 Study conducted by Cisco 1 Solution
Big Data to Knowledge (BD2K)
Big Data to Knowledge () potential funding agency synergies Jennie Larkin, PhD Office of the Associate Director of Data Science National Institutes of Health idash-pscanner meeting UCSD September 16, 2014
FDA's Mini-Sentinel Program to Evaluate the Safety of Marketed Medical Products. Progress and Direction
FDA's Mini-Sentinel Program to Evaluate the Safety of Marketed Medical Products Progress and Direction Richard Platt Harvard Pilgrim Health Care Institute Harvard Medical School for the Mini-Sentinel Investigators
Building Open-Source Based Architecture of Enterprise Applications for Business Intelligence
Building Open-Source Based Architecture of Enterprise Applications for Business Intelligence TOMAS CHLOUBA Department of Informatics and Quantitative Methods University of Hradec Kralove Rokitanskeho 62,
Business Intelligence in Healthcare: Trying to Get it Right the First Time!
Business Intelligence in Healthcare: Trying to Get it Right the First Time! David E. Garets, FHIMSS DISCLAIMER: The views and opinions expressed in this presentation are those of the author and do not
Business Intelligence and Healthcare
Business Intelligence and Healthcare SUTHAN SIVAPATHAM SENIOR SHAREPOINT ARCHITECT Agenda Who we are What is BI? Microsoft s BI Stack Case Study (Healthcare) Who we are Point Alliance is an award-winning
Data Management for Large Studies Robert R. Kelley, PhD. Thursday, September 27, 2012
Robert R. Kelley, PhD Thursday, September 27, 2012 Agenda Provide an overview of several tools for data management in large studies Present an extended Case Study in using REDCap to manage study data Offer
PLATFORA INTERACTIVE, IN-MEMORY BUSINESS INTELLIGENCE FOR HADOOP
PLATFORA INTERACTIVE, IN-MEMORY BUSINESS INTELLIGENCE FOR HADOOP Your business is swimming in data, and your business analysts want to use it to answer the questions of today and tomorrow. YOU LOOK TO
JAVASCRIPT CHARTING. Scaling for the Enterprise with Metric Insights. 2013 Copyright Metric insights, Inc.
JAVASCRIPT CHARTING Scaling for the Enterprise with Metric Insights 2013 Copyright Metric insights, Inc. A REVOLUTION IS HAPPENING... 3! Challenges... 3! Borrowing From The Enterprise BI Stack... 4! Visualization
Please contact Cyber and Technology Training at (410)777-1333/[email protected] for registration and pricing information.
Course Name Start Date End Date Start Time End Time Active Directory Services with Windows Server 8/31/2015 9/4/2015 9:00 AM 5:00 PM Active Directory Services with Windows Server 9/28/2015 10/2/2015 9:00
Data Mining, Predictive Analytics with Microsoft Analysis Services and Excel PowerPivot
www.etidaho.com (208) 327-0768 Data Mining, Predictive Analytics with Microsoft Analysis Services and Excel PowerPivot 3 Days About this Course This course is designed for the end users and analysts that
What you can do:...3 Data Entry:...3 Drillhole Sample Data:...5 Cross Sections and Level Plans...8 3D Visualization...11
What you can do:...3 Data Entry:...3 Drillhole Sample Data:...5 Cross Sections and Level Plans...8 3D Visualization...11 W elcome to North Face Software s software. With this software, you can accomplish
CRM Analytics - Techniques for Analysing Business Data
CRM Analytics - Techniques for Analysing Business Data Steve Zangari Partner Director EMEA Agenda» Brief introduction to Zap» CRM Analytics The importance The challenges The value» Leverage existing technology
Open is as Open Does: Lessons from Running a Professional Open Source Company
Open is as Open Does: Lessons from Running a Professional Open Source Company Leon Rozenblit, JD, PhD Founder and CEO at Prometheus Research, LLC email: [email protected] twitter: @leon_rozenblit
User Guide. Analytics Desktop Document Number: 09619414
User Guide Analytics Desktop Document Number: 09619414 CONTENTS Guide Overview Description of this guide... ix What s new in this guide...x 1. Getting Started with Analytics Desktop Introduction... 1
TIBCO Spotfire Metrics Modeler User s Guide. Software Release 6.0 November 2013
TIBCO Spotfire Metrics Modeler User s Guide Software Release 6.0 November 2013 Important Information SOME TIBCO SOFTWARE EMBEDS OR BUNDLES OTHER TIBCO SOFTWARE. USE OF SUCH EMBEDDED OR BUNDLED TIBCO SOFTWARE
The Big Data Bioinformatics System
The Big Data Bioinformatics System Introduction The purpose of this document is to describe a fictitious bioinformatics research system based in part on the design and implementation of a similar system
Associate Professor, Department of CSE, Shri Vishnu Engineering College for Women, Andhra Pradesh, India 2
Volume 6, Issue 3, March 2016 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Special Issue
