The Big Data Bioinformatics System
|
|
|
- Mitchell Randall
- 10 years ago
- Views:
Transcription
1 The Big Data Bioinformatics System Introduction The purpose of this document is to describe a fictitious bioinformatics research system based in part on the design and implementation of a similar system for a local Hospital. The document is divided into five parts. The initial section describes the motivation for the bioinformatics system, called Big Data Bioinformatics System or BD for short. A set of motivating requirements follows. After the set of motivating requirements, a set of proposed architectural challenges is presented. The key interactions for the system components are presented second to last, and the final section describes the intended use of the Apache OODT software system in order to meet the needs and challenges of the proposed BD system. Motivation We are tasked with helping researchers at a local hospital analyze information collected (with consent) from patients in the critical care unit. The various forms of information are shown in the left portion of Figure 1 below. "#$%&'()'*+&',-'./0/'1234)' "
2 Information collected includes: 1. Long-term Historical Patient Records including admit date, and discharge date, disease type, drugs administered and other discrete variables for a set of 10, 000 patients who have visited the hospital in the last 10 years. 2. Real-time, patient Monitor data from various instruments by the patient beside. These instruments include heart monitors, oxygen level monitors, monitors for lung function and brain activity and other instruments. This particular set of information about a patient is only collected short-term, over the past 2 years, because only some of the newer instruments have the ability to digitally record and store this data. This information is relevant for about 2000 patients who have stayed at the hospital. 3. Long-term, Doctor s and Nurse s notes recorded during a patient s visit. This mostly includes free-text notes electronically captured by machines near the patient beside, where the nurses and doctors record their observations about a patient. The dataset is unstructured and may contain overlapping information with the Historical Patient Records and Monitor data information. This information covers around 10,000 patients over 10 years. 4. Short-term, Offline Research databases, containing information collected during clinical trials and destroyed after a period of one year. This information is provided for about patients in the hospital. These rich datasets collected about patients are used by researchers and clinicians within the hospital to perform scientific research with the goal of identifying causalities and observations regarding patient mortality: in other words, this information is used to help identify and observe characteristics about patients who lived and died. These characteristics can feed into further research that hopefully one day can allow doctors to make informed decisions based on observations (about patient s heart rate, about their appearance, etc.) that lead to better overall hospital care. The immediate steps required to bring forth this proposed future more immediately involve the construction of a decision support biomedical informatics system which we have dubbed Big Data Bioinformatics System or BD for short. BD helps hospital researchers collect the information as different types of files (labeled as f in the middle portion of Figure 1), which are then combined and processed and eventually shoved into research databases on the middleright portion of Figure 1. Research databases are focused data warehouses containing combined information from the above four data sources that can easily be used by hospital researchers for a particular scientific study (resulting in some publication), or in generating raw data that can be shared with other hospitals and the broader community, or in pulling into custom analysis environments like IDL, Matlab or R for further analysis. Requirements BD has several over-arching requirements as outlined by the hospital administrators and other stakeholders in the system. 1. The patient information must be kept secure if there were an unbreakable requirement, this would be it. Because of U.S. HIPAA law and despite the patients signing consent forms, all of the information, whether at the data source (left hand side of Figure 1), whether in file form (middle portion of Figure 1), whether in the research databases (middle-right portion of Figure 1), or while in transit to the broader community #
3 or to the analysis tools, the data must be protected using encryption, using secure-bydesign principles, and using best practices. 2. The original data sources cannot be modified, only read information cannot be updated in any one of the four data sources on the left hand side of Figure 1. The information can only be read. The same goes for the Big Data file system portion research data files can only be read, and those files should only be created once, with no append capability after their initial creation. 3. Research databases may be dynamic Research databases are needed only for particular research campaigns and case studies, and may be stood up for short-term (monthly), or longer-term (multi-month, multi-year) purposes. All research databases will eventually be destroyed in order to comply with security requirements. 4. Data must be easily accessible from research databases Research database information is fed into scientific papers, is fed into scientific tools, and is distributed to other hospitals for multi-center collaborations. It must be easily accessible, in standard formats, and secure. Desired tool support includes: R, Matlab, and IDL, as well as common CSV formats. 5. Data must be de-identified In keeping with the security requirement, all original patient identifiers must be de-identified, and obfuscated from traceability back to their patient origin. This is strictly true for all data in research databases, but may not be true in the Big Data file system. 6. A standard data model and metadata model should be used to describe patient information The hospital has been looking at SNOWMED, HL7 and other models and formats, but does not strictly require use of any of those. They are simply provided as examples. 7. Data storage may require large scalability bursts, depending on upcoming research conferences, and depending on hospital priorities there may be on the order of research databases constructed within a month s time depending on the importance of an upcoming research conference, or collaboration with another hospital. 8. Processing scalability will be significantly less than data storage Though the researchers want to run data combination algorithms, and identify things like missing data (as we will elaborate below), this processing will be significantly more long-term and less bursty than the data storage. The important thing to the hospital researchers is the availability of the raw information as re-processing is always an option. Key Architectural Challenges Given the above requirements, there are several key architectural challenges with the design of the proposed BD system. We will elucidate those challenges below: 1. Creation of a read-only large-scale distributed copy of the origin data. The researchers do not have direct access to the Historical Patient Records, the Monitor data or any of the other two data sources available for research and analysis. Each curator of information has expressed her concern to the hospital director of research mainly amounting to growing concern of real-time queries to their data sources. Each one of the data sources has taken years to build up, and the technology that captures the information was never built to respond to the needs of the hospital research community inasmuch as the data sources were constructed to provide improved and immediate benefits to patient care. Because of this, the data providers (on the left side of Figure 1) have agreed to have their information periodically dumped into the Big Data filesystem, under the control of the hospital researchers, where they can experiment and analyze the information to their heart s delight. $
4 2. Keeping the BD Filesystem in Sync and Fresh This isn t exactly straight-forward, and it s really dictated by the types of analyses required, and presence of missing data which may periodically occur when performing the data export from any of the available data sources. Monitor data is sketchier since it s kept in a cache whereas data such as the Historical Patient Records are stored in more reliable database systems like Oracle or MySQL. 3. Each original data source uses a different technology as can be gleaned from the discussion thus far, each of the original data sources uses a different technology some like the Monitor data use a proprietary vendor cache technology, unique to each instrument. Others, on the other hand, use commercial DBMSs like Oracle (Historical Patient Records). Others use MySQL. Yet others are stored in note taking systems, sometimes called EHR or electronic health care records systems. BD needs a unified way of dumping information into easily accessible files. 4. The data models look very different for each type of file The BD filesystem houses all of the aforementioned data sources, and each one of the output files from them looks vastly different. Monitor data records look similar to CSV output, with blood pressure, heart rate, and other variables in comma separated lists. The data from the doctor and nurse s notes system looks like free-text output; the data from the Historical Patient Records table looks like database row output. Each one of these files must be combined into the specialized research databases. 5. Making the data available for scientific, peer-reviewed research and dissemination to other hospitals requires explicit permissions and necessary security in place at each component in the system. None of the components can escape it they all must be secure. 6. The analysis tools accept data in different ways R, Matlab, and IDL all read data differently and specialized output handlers for the research databases must be made in order to export to these tools. 7. Algorithms must be written to combine the data in unique ways, fill in missing data, and to de-identify the data. The algorithms need to be scheduled and run efficiently too, especially during paper writing campaigns of the researchers where resources are in contention. Key Interactions Of course, with the existing description of the system, the complexities of the components and the data flow and the interactions can probably be discerned as significant. To start, the BD file system will need to pull data from the origin 4 data sources, after which it needs to push data to allow the construction of the research databases. Research databases are likely built from some pre-defined workflow involving data de-identification, combination and summarization. After the research databases are created, their information is made available via client pull (from an interested scientist or researcher into e.g., Matlab or IDL or R), or via push to an external hospital on an external disk or via electronic transfer for further analysis. The research database information may also be made available to use as raw data in a (set of) scientific research paper(s). Apache OODT The hospital researchers have selected Apache OODT as the implementation technology to construct the BD system. OODT provides a set of components that allow data integration, information retrieval and extraction, and data dissemination. OODT is known as a grid %
5 technology as it allows the constructions of virtual organizations of users sharing data, computation and resources alike. The BD system should be built using Apache OODT components. The Apache OODT website is located at: A full treatment of Apache OODT is beyond the scope of this description document. Suffice to say, we pointer the interested readers to the following 2 papers: C. Mattmann, D. Crichton, N. Medvidovic and S. Hughes. A Software Architecture-Based Framework for Highly Distributed and Data Intensive Scientific Applications. In Proceedings of the 28th International Conference on Software Engineering (ICSE06), pp , Shanghai, China, May 20th-28th, C. Mattmann, D. Freeborn, D. Crichton, B. Foster, A. Hart, D. Woollard, S. Hardman, P. Ramirez, S. Kelly, A. Y. Chang, C. E. Miller. A Reusable Process Control System Framework for the Orbiting Carbon Observatory and NPP Sounder PEATE missions. In Proceedings of the 3rd IEEE Intl Conference on Space Mission Challenges for Information Technology (SMC-IT 2009), pp , July 19-23, &
Maximising the utility of OpeNDAP datasets through the NetCDF4 API
Maximising the utility of OpeNDAP datasets through the NetCDF4 API Stephen Pascoe ([email protected]) Chris Mattmann ([email protected]) Phil Kershaw ([email protected]) Ag
Cultivating a research agenda for data science
Mattmann Journal of Big Data 2014, 1:6 SHORT REPORT OpenAccess Cultivating a research agenda for data science Chris A Mattmann 1,2 Correspondence: [email protected] 1 Jet Propulsion Laboratory,
The What, When, Where and How of Natural Language Processing
The What, When, Where and How of Natural Language Processing There s a mystique that surrounds natural language processing (NLP) technology, regarding how it works, and what it can and cannot do. Although
EHR Standards Landscape
EHR Standards Landscape Dr Dipak Kalra Centre for Health Informatics and Multiprofessional Education (CHIME) University College London [email protected] A trans-national ehealth Infostructure Wellness
How To Use Zato Health Interoperability Platform Software On A Patient Record
STATE OF CONNECTICUT State Innovation Model Health Information Technology (HIT) Council 2 nd Submission: Zato Follow up Questions 1) Is Zato able to provide a healthcare demonstration of their de-identified
Associate Professor, Department of CSE, Shri Vishnu Engineering College for Women, Andhra Pradesh, India 2
Volume 6, Issue 3, March 2016 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Special Issue
Setting the World on FHIR
Setting the World on FHIR W. Ed Hammond. Ph.D., FACMI, FAIMBE, FIMIA, FHL7 Director, Duke Center for Health Informatics Director, Applied Informatics Research, DHTS Director of Academic Affairs, MMCi Program
Data Virtualization and ETL. Denodo Technologies Architecture Brief
Data Virtualization and ETL Denodo Technologies Architecture Brief Contents Data Virtualization and ETL... 3 Summary... 3 Data Virtualization... 7 What is Data Virtualization good for?... 8 Applications
Big Data With Hadoop
With Saurabh Singh [email protected] The Ohio State University February 11, 2016 Overview 1 2 3 Requirements Ecosystem Resilient Distributed Datasets (RDDs) Example Code vs Mapreduce 4 5 Source: [Tutorials
Big Data. Dr.Douglas Harris DECEMBER 12, 2013
Dr.Douglas Harris DECEMBER 12, 2013 GOWTHAM REDDY Fall,2013 Table of Contents Computing history:... 2 Why Big Data and Why Now?... 3 Information Life-Cycle Management... 4 Goals... 5 Information Management
New York ehealth Collaborative. Health Information Exchange and Interoperability April 2012
New York ehealth Collaborative Health Information Exchange and Interoperability April 2012 1 Introductions Information exchange patient, information, care team How is Health information exchanged Value
Job list - Research China
Job list - Research China Job # Job Title Page 010499 Scientist for Mechanical Engineering P2-3 002590 Senior Scientist for Biomedical Engineering P4-5 022831 Research Scientist for MRI P6-7 010501 Scientist
TeamCompanion Solution Overview. Visual Studio
TeamCompanion Solution Overview Visual Studio Information in this document, including URL and other Internet Web site references, is subject to change without notice. Unless otherwise noted, the example
Understanding Neo4j Scalability
Understanding Neo4j Scalability David Montag January 2013 Understanding Neo4j Scalability Scalability means different things to different people. Common traits associated include: 1. Redundancy in the
Leading Genomics. Diagnostic. Discove. Collab. harma. Shanghai Cambridge, MA Reykjavik
Leading Genomics Diagnostic harma Discove Collab Shanghai Cambridge, MA Reykjavik Global leadership for using the genome to create better medicine WuXi NextCODE provides a uniquely proven and integrated
Putting Apache Kafka to Use!
Putting Apache Kafka to Use! Building a Real-time Data Platform for Event Streams! JAY KREPS, CONFLUENT! A Couple of Themes! Theme 1: Rise of Events! Theme 2: Immutability Everywhere! Level! Example! Immutable
Data Use and the Liquid Grids Model
Data Use Policy Revision 1.1 03/09/2014 Ramos M. Mays, Chief Technology Officer Table of Contents 1. Information Sources... 3 2. Information we receive... 3 3. How we use information... 4 4. How long we
I n t e r S y S t e m S W h I t e P a P e r F O R H E A L T H C A R E IT E X E C U T I V E S. In accountable care
I n t e r S y S t e m S W h I t e P a P e r F O R H E A L T H C A R E IT E X E C U T I V E S The Role of healthcare InfoRmaTIcs In accountable care I n t e r S y S t e m S W h I t e P a P e r F OR H E
An Oracle White Paper November 2010. Leveraging Massively Parallel Processing in an Oracle Environment for Big Data Analytics
An Oracle White Paper November 2010 Leveraging Massively Parallel Processing in an Oracle Environment for Big Data Analytics 1 Introduction New applications such as web searches, recommendation engines,
SOLUTION BRIEF. IMAT Enhances Clinical Trial Cohort Identification. imatsolutions.com
SOLUTION BRIEF IMAT Enhances Clinical Trial Cohort Identification imatsolutions.com Introduction Timely access to data is always a top priority for mature organizations. Identifying and acting on the information
The i2b2 Hive and the Clinical Research Chart
The i2b2 Hive and the Clinical Research Chart Henry Chueh Shawn Murphy The i2b2 Hive is centered around two concepts. The first concept is the existence of services provided by applications that are wrapped
SESSION DEPENDENT DE-IDENTIFICATION OF ELECTRONIC MEDICAL RECORDS
SESSION DEPENDENT DE-IDENTIFICATION OF ELECTRONIC MEDICAL RECORDS A Thesis Presented in Partial Fulfillment of the Requirements for the Degree Bachelor of Science with Honors Research Distinction in Electrical
Data Grids. Lidan Wang April 5, 2007
Data Grids Lidan Wang April 5, 2007 Outline Data-intensive applications Challenges in data access, integration and management in Grid setting Grid services for these data-intensive application Architectural
EMR Systems and the Conduct of Clinical Research. Daniel E Ford, MD, MPH Vice Dean for Clinical Investigation Johns Hopkins School of Medicine
EMR Systems and the Conduct of Clinical Research Daniel E Ford, MD, MPH Vice Dean for Clinical Investigation Johns Hopkins School of Medicine Clinical Research Environment Research protocols are becoming
Active AnAlytics: Driving informed Decisions leading to Better clinical AnD financial outcomes
Active AnAlytics: Driving informed Decisions leading to Better clinical AnD financial outcomes An InterSystems White Paper for Healthcare IT Executives Active AnAlytics: Driving informed Decisions leading
Considerations for Management of Laboratory Data
Considerations for Management of Laboratory Data 2003 Scientific Computing & Instrumentation LIMS Guide, November 2003 Michael H Elliott Drowning in a sea of data? Nervous about 21 CFR Part 11? Worried
Frequently Asked Questions
Frequently Asked Questions About PreManage The Oregon Health Leadership Council has formed a unique coalition of major stakeholders, including hospitals, health plans, Emergency Department (ED) physicians
Business Intelligence & Data Warehouse Consulting
Transforming Raw Data into Business Results In the rapid pace of today's business environment, businesses must be able to adapt to changing customer needs and quickly refocus resources to meet market demand.
Personalized Medicine: Humanity s Ultimate Big Data Challenge. Rob Fassett, MD Chief Medical Informatics Officer Oracle Health Sciences
Personalized Medicine: Humanity s Ultimate Big Data Challenge Rob Fassett, MD Chief Medical Informatics Officer Oracle Health Sciences 2012 Oracle Corporation Proprietary and Confidential 2 3 Humanity
Using Tableau Software with Hortonworks Data Platform
Using Tableau Software with Hortonworks Data Platform September 2013 2013 Hortonworks Inc. http:// Modern businesses need to manage vast amounts of data, and in many cases they have accumulated this data
Medical Informatic Basics for the Cancer Registry
Medical Informatic Basics for the Cancer Registry DEVELOPED BY: THE NCRA EDUCATION FOUNDATION AND THE NCRA CANCER INFORMATICS COMMITTEE Medical Informatics is the intersection of science, computer science
Uncovering Value in Healthcare Data with Cognitive Analytics. Christine Livingston, Perficient Ken Dugan, IBM
Uncovering Value in Healthcare Data with Cognitive Analytics Christine Livingston, Perficient Ken Dugan, IBM Conflict of Interest Christine Livingston Ken Dugan Has no real or apparent conflicts of interest
Hadoop. http://hadoop.apache.org/ Sunday, November 25, 12
Hadoop http://hadoop.apache.org/ What Is Apache Hadoop? The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using
Accelerating Clinical Trials Through Shared Access to Patient Records
INTERSYSTEMS WHITE PAPER Accelerating Clinical Trials Through Shared Access to Patient Records Improved Access to Clinical Data Across Hospitals and Systems Helps Pharmaceutical Companies Reduce Delays
SAP Business Objects Business Intelligence platform Document Version: 4.1 Support Package 7 2015-11-24. Data Federation Administration Tool Guide
SAP Business Objects Business Intelligence platform Document Version: 4.1 Support Package 7 2015-11-24 Data Federation Administration Tool Guide Content 1 What's new in the.... 5 2 Introduction to administration
Databricks. A Primer
Databricks A Primer Who is Databricks? Databricks was founded by the team behind Apache Spark, the most active open source project in the big data ecosystem today. Our mission at Databricks is to dramatically
PROPOSAL To Develop an Enterprise Scale Disease Modeling Web Portal For Ascel Bio Updated March 2015
Enterprise Scale Disease Modeling Web Portal PROPOSAL To Develop an Enterprise Scale Disease Modeling Web Portal For Ascel Bio Updated March 2015 i Last Updated: 5/8/2015 4:13 PM3/5/2015 10:00 AM Enterprise
A&D srl Consulting & Logistic Systems Galleria Spagna, 35-35127 Padova (PD) - Italy - Telefono +39.049.8792400 - Fax +39.049.8792408 Sede Legale:
INTEGRATED DOCUMENT MANAGEMENT GENERAL DIAGRAM 1 GENERAL CONCEPTS The integrated document management of a company is due to two trends: 1. electronic processing (scanning) of documents used within the
Best Practices for Hadoop Data Analysis with Tableau
Best Practices for Hadoop Data Analysis with Tableau September 2013 2013 Hortonworks Inc. http:// Tableau 6.1.4 introduced the ability to visualize large, complex data stored in Apache Hadoop with Hortonworks
Digital libraries of the future and the role of libraries
Digital libraries of the future and the role of libraries Donatella Castelli ISTI-CNR, Pisa, Italy Abstract Purpose: To introduce the digital libraries of the future, their enabling technologies and their
Deploy. Friction-free self-service BI solutions for everyone Scalable analytics on a modern architecture
Friction-free self-service BI solutions for everyone Scalable analytics on a modern architecture Apps and data source extensions with APIs Future white label, embed or integrate Power BI Deploy Intelligent
Sanjeev Kumar. contribute
RESEARCH ISSUES IN DATAA MINING Sanjeev Kumar I.A.S.R.I., Library Avenue, Pusa, New Delhi-110012 [email protected] 1. Introduction The field of data mining and knowledgee discovery is emerging as a
Yale University Open Data Access (YODA) Project Procedures to Guide External Investigator Access to Clinical Trial Data Last Updated August 2015
OVERVIEW Yale University Open Data Access (YODA) Project These procedures support the YODA Project Data Release Policy and more fully describe the process by which clinical trial data held by a third party,
Scalable Web and Mobile Solution for Healthcare Software Provider
Scalable Web and Mobile Solution for Healthcare Software Provider The Client Overview Our client is a leading healthcare software vendor, providing solutions catering to a niche area of radiology diagnostics,
Strategies for De-Identification and Anonymization of Electronic Health Record Data for Use in Multicenter Research Studies
Strategies for De-Identification and Anonymization of Electronic Health Record Data for Use in Multicenter Research Studies Clete A. Kushida, M.D., Ph.D. Professor, Stanford University Medical Center Overview
KNIME Enterprise server usage and global deployment at NIBR
KNIME Enterprise server usage and global deployment at NIBR Gregory Landrum, Ph.D. NIBR Informatics Novartis Institutes for BioMedical Research, Basel 8 th KNIME Users Group Meeting Berlin, 26 February
Databricks. A Primer
Databricks A Primer Who is Databricks? Databricks vision is to empower anyone to easily build and deploy advanced analytics solutions. The company was founded by the team who created Apache Spark, a powerful
A Population Health Management Approach in the Home and Community-based Settings
A Population Health Management Approach in the Home and Community-based Settings Mark Emery Linda Schertzer Kyle Vice Charles Lagor Philips Home Monitoring Philips Healthcare 2 Executive Summary Philips
The deployment of OHMS TM. in private cloud
Healthcare activities from anywhere anytime The deployment of OHMS TM in private cloud 1.0 Overview:.OHMS TM is software as a service (SaaS) platform that enables the multiple users to login from anywhere
Big Data and Healthcare Payers WHITE PAPER
Knowledgent White Paper Series Big Data and Healthcare Payers WHITE PAPER Summary With the implementation of the Affordable Care Act, the transition to a more member-centric relationship model, and other
QLIKVIEW DATA FLOWS TECHNICAL BRIEF
QLIKVIEW DATA FLOWS TECHNICAL BRIEF A QlikView Technical Brief September 2013 qlikview.com Table of Contents Introduction 3 Overview 3 Data Sourcing 5 Loading and Modeling Data 6 Provisioning Data 9 Using
Clinical Mapping (CMAP) Draft for Public Comment
Integrating the Healthcare Enterprise 5 IHE Patient Care Coordination Technical Framework Supplement 10 Clinical Mapping (CMAP) 15 Draft for Public Comment 20 Date: June 1, 2015 Author: PCC Technical Committee
Big Data and Text Mining
Big Data and Text Mining Dr. Ian Lewin Senior NLP Resource Specialist [email protected] www.linguamatics.com About Linguamatics Boston, USA Cambridge, UK Software Consulting Hosted content Agile,
Hadoop & Spark Using Amazon EMR
Hadoop & Spark Using Amazon EMR Michael Hanisch, AWS Solutions Architecture 2015, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Agenda Why did we build Amazon EMR? What is Amazon EMR?
Understanding the Value of In-Memory in the IT Landscape
February 2012 Understing the Value of In-Memory in Sponsored by QlikView Contents The Many Faces of In-Memory 1 The Meaning of In-Memory 2 The Data Analysis Value Chain Your Goals 3 Mapping Vendors to
ISO 18308 INTERNATIONAL STANDARD. Health informatics Requirements for an electronic health record architecture
INTERNATIONAL STANDARD ISO 18308 First edition 2011-04-15 Health informatics Requirements for an electronic health record architecture Informatique de santé Exigences relatives à une architecture de l'enregistrement
Big Data Challenges for Large Radio Arrays
Big Data Challenges for Large Radio Arrays Dayton L. Jones, Kiri Wagstaff, David R. Thompson, Larry D Addario, Robert Navarro, Chris Mattmann, Walid Majid, Joseph Lazio, Robert Preston, and Umaa Rebbapragada
HL7 and Meaningful Use
HL7 and Meaningful Use Grant M. Wood HL7 Ambassador HIMSS14 2012 Health Level Seven International. All Rights Reserved. HL7 and Health Level Seven are registered trademarks of Health Level Seven International.
Moving Large Data at a Blinding Speed for Critical Business Intelligence. A competitive advantage
Moving Large Data at a Blinding Speed for Critical Business Intelligence A competitive advantage Intelligent Data In Real Time How do you detect and stop a Money Laundering transaction just about to take
Integration for your Health Information System
Integration for your Health Information System Achieve comprehensive healthcare IT integration that leverages your existing IT investments and helps you meet the growing demands of Meaningful Use, HIE,
Achilles a platform for exploring and visualizing clinical data summary statistics
Biomedical Informatics discovery and impact Achilles a platform for exploring and visualizing clinical data summary statistics Mark Velez, MA Ning "Sunny" Shang, PhD Department of Biomedical Informatics,
Relationship of HL7 EHR System Draft Standard to X12N
Relationship of HL7 EHR System Draft Standard to X12N EHR Technical Committee Co-Chairs: Gary Dickinson Linda Fischetti Sam Heard Excerpt of EHR-S DSTU Class Overview of Discussion Background Where We
I D C E X E C U T I V E B R I E F
I D C E X E C U T I V E B R I E F E n a b l i n g B e t t e r D e c i s i o n s T h r o u g h U n i f i e d Ac c e s s t o I n f o r m a t i o n November 2008 Global Headquarters: 5 Speen Street Framingham,
The Use of Patient Records (EHR) for Research
The Use of Patient Records (EHR) for Research Mary Devereaux, Ph.D. Director, Biomedical Ethics Seminars Assistant Director, Research Ethics Program & San Diego Research Ethics Consortium Abstract The
DRAT: An Unobtrusive, Scalable Approach to Large Scale Software License Analysis
DRAT: An Unobtrusive, Scalable Approach to Large Scale Software License Analysis Chris A. Mattmann 1,2, Ji-Hyun Oh 1,2, Tyler Palsulich 1*, Lewis John McGibbney 1, Yolanda Gil 2,3, Varun Ratnakar 3 1 Jet
Interoperability: White Paper. Introduction. PointClickCare Interoperability - 2014. January 2014
White Paper PointClickCare Interoperability - 2014 Interoperability: In healthcare, interoperability is where multiple technology platforms and software applications are able to connect, communicate, and
BUILDING A SCALABLE BIG DATA INFRASTRUCTURE FOR DYNAMIC WORKFLOWS
BUILDING A SCALABLE BIG DATA INFRASTRUCTURE FOR DYNAMIC WORKFLOWS ESSENTIALS Executive Summary Big Data is placing new demands on IT infrastructures. The challenge is how to meet growing performance demands
Recognition and Privacy Preservation of Paper-based Health Records
Quality of Life through Quality of Information J. Mantas et al. (Eds.) IOS Press, 2012 2012 European Federation for Medical Informatics and IOS Press. All rights reserved. doi:10.3233/978-1-61499-101-4-751
White Paper. Optimizing the Performance Of MySQL Cluster
White Paper Optimizing the Performance Of MySQL Cluster Table of Contents Introduction and Background Information... 2 Optimal Applications for MySQL Cluster... 3 Identifying the Performance Issues.....
Reference Architecture, Requirements, Gaps, Roles
Reference Architecture, Requirements, Gaps, Roles The contents of this document are an excerpt from the brainstorming document M0014. The purpose is to show how a detailed Big Data Reference Architecture
Big Data Analytics in Health Care
Big Data Analytics in Health Care S. G. Nandhini 1, V. Lavanya 2, K.Vasantha Kokilam 3 1 13mss032, 2 13mss025, III. M.Sc (software systems), SRI KRISHNA ARTS AND SCIENCE COLLEGE, 3 Assistant Professor,
Introduction to Big Data! with Apache Spark" UC#BERKELEY#
Introduction to Big Data! with Apache Spark" UC#BERKELEY# So What is Data Science?" Doing Data Science" Data Preparation" Roles" This Lecture" What is Data Science?" Data Science aims to derive knowledge!
Securing your IT infrastructure with SOC/NOC collaboration
Technical white paper Securing your IT infrastructure with SOC/NOC collaboration Universal log management for IT operations Table of contents Executive summary 2 IT operations: Handle IT incidents and
Building Your Big Data Team
Building Your Big Data Team With all the buzz around Big Data, many companies have decided they need some sort of Big Data initiative in place to stay current with modern data management requirements.
