Big Data and Graph Analytics in a Health Care Setting
|
|
|
- Harry Stone
- 10 years ago
- Views:
Transcription
1 Big Data and Graph Analytics in a Health Care Setting Supercomputing 12 November 15, 2012 Bob Techentin Mayo Clinic SPPDG Archive
2 Archive What is the Mayo Clinic? Mayo Clinic Mission: To inspire hope and contribute to health and well-being by providing the best care to every patient through integrated clinical practice, education and research Primary Value: The needs of the patient come first The best interest of the patient is the only interest to be considered, and in order that the sick may have the benefit of advancing knowledge, union of forces is necessary. -- William J. Mayo, June 15, 1910
3 Archive Mayo Clinic Environment (2011 Data) Not-for-profit foundation for Health Care/Research/Education Total Mayo staff size: 58,300 in three locations (Rochester, MN; Jacksonville, FL; and Scottsdale, AZ) plus multiple (60+) regional sites near Rochester Mayo Rochester: 15 million square feet (about 3.5X Mall of America) 1,113,000 patient registrations, 123,000 Hospital admissions, 588,000 hospital days, 2,400+ hospital beds, 20 million+ yearly laboratory tests Total Mayo research staff size: 4,252 FTEs
4 Archive Clinical Data Analysis Challenges Many clinical analysis problems are virtually intractable by traditional computational means. Mayo Clinic s patient data includes 5,000,000 patient records, exceeding 10 PB of data storage. Medical records are a combination of numeric data (e.g., lab work), clinical observations (e.g., medical history), categorical data (e.g., clinical diagnosis), and longitudinal measurements (e.g., annual checkup). Developing query mechanisms for this complex data is challenging. Relationships between clinical factors is a found in a very small number of records. This is a low signal-to-noise relationship. Access to a large, wellorganized medical record database is critical. Analysis approaches differ for various applications Clinical decision support patient vs. population Research correlation of disparate factors Monitoring detect / predict population trends
5 Archive Clinical Data Analysis Example: Clinical Decision Support Patient vs. Population Clinical Decision Support Rules have been implemented at Mayo Clinic hospitals Rules are laboriously crafted by medical experts, and implemented by information systems staff Rules are back-tested against medical records, manually! Implemented rules fire thousands of times per week, with a high false positive rate Systems engineering approach now being applied to development of new rules Systematically identify failure modes Analyze the whole system (including Big Data) and connections between processes
6 Clinical Data Analysis Example: Research Correlation of Disparate Factors Discovery in 2006 that a huge (7-8%) decrease in incidence of breast cancer in one year (2003) was caused by nationwide cessation of hormone supplementation in menopausal women in 2002! First reported at San Antonio Breast Cancer Symposium, December 13-15, 2006 Discovered through laborious manual records reviews! Clinicians and medical researchers need technology to reduce investigations from years to months or days Many such reviews are simply not done because of limited manual resources SPPDG Archive
7 Archive Characteristics of Mayo Enterprise Data Warehouse Complex Over 1000 departmental systems Many specialty domains Wide variety of data semantics Large Oldest formal medical record Complete conversion to electronic records (medical, surgical, radiology, billing, etc.) over 10 years ago Millions of patients / Terabytes of data Significant challenges Data sets are not well connected Noisy, incomplete, low reliability data / Low signal-to-noise ratio
8 SPPDG Archive
9 Archive Example Medical Record Database: Rochester Epidemiology Project REP data captures a subset of medical record data for a relatively stable population in Southeast Minnesota ~ 500K Individuals, 40 year duration ~ 2 M medical records from many clinics and hospitals Medical record events limited to births, deaths, diagnoses, prescriptions, procedures Relatively small and simple 50 GB Sybase database is two orders of magnitude smaller than Mayo Electronic Medical Record Translation to semantic (RDF) model results in dozens of classes and only 3.5 billion triples
10 SPPDG Archive
11 Archive Early Results From Semantic Analysis of Rochester Epidemiology Project Data Translation from multiple relational database sources into single semantic model is relatively straight forward 50 GB of relational data expands to 350 GB in NTRIPLES format Translation and data management can be slow and cumbersome Initial queries appear promising for research and clinical practice Cray XMT-2 and YarcData urika Query times of seconds, compared to hours for Sybase
12 Archive Expanding Clinical Record Analysis Beyond the Test Cases The Mayo Clinic Data Warehouse is two orders of magnitude larger than the REP databases, and the EMR is much larger yet Many more patients, across the U.S. and international Much more complex / richer dataset Aggregated data sets might exceed 300 B triples New systems supporting clinical practice need to scale in capacity and capability Medical centers might require 10, ,000 queries per day Integration of graph analytics with FLOPS - Logical AND numerical!
13 Archive Semantic Analysis Challenges for Clinical Applications Annotating facts with quality or reliability Reification adds substantial complexity to semantic model Identifying complex co-morbidity conditions Simple queries return N M combinations cluster analysis may be more effective Integration of logical and numerical processes Queries over ranges of values or dates (without FILTER!) Integrated analysis (with more sophistication than COUNT) Approximate matching Find a patient like me where me is 40,000 triples Temporal analysis
14 Archive Summary Both Big Data and Graph Analytics will have a role in future clinical practice and medical research Initial work with semantic analysis of medical record data using XMT-2 and urika appear promising Beginning to recognize limitations of current systems Scale and performance of future systems Expressing and querying data quality and reliability Integration of true graph analytics Tight coupling of logical and numerical analysis
15 A Few Dimensions of Evolution TODAY Applications Discrete Apps Situational Awareness Algorithms Graph Libraries Separate Monolithic Graph experts Logical and Numerical Incremental Coupled Subject-matter experts Audience Graph Primitives Static Graphs Dynamic Graphs Infrastructure/ Runtimes MPI, MapReduce Graph-specific Hardware Mass-market Current Graph-specific Emerging
16 A Few Dimensions of Evolution FUTURE Applications Discrete Apps Situational Awareness Algorithms Graph Libraries Separate Monolithic Graph experts Logical and Numerical Incremental Coupled Subject-matter experts Audience Graph Primitives Static Graphs Dynamic Graphs Infrastructure/ Runtimes MPI, MapReduce Graph-specific Hardware Mass-market Current Graph-specific Emerging
Using Big Data in Healthcare
Speaker First Plenary Session THE USE OF "BIG DATA" - WHERE ARE WE AND WHAT DOES THE FUTURE HOLD? David R. Holmes III, PhD Mayo Clinic College of Medicine Rochester, MN, USA Using Big Data in Healthcare
Find the signal in the noise
Find the signal in the noise Electronic Health Records: The challenge The adoption of Electronic Health Records (EHRs) in the USA is rapidly increasing, due to the Health Information Technology and Clinical
Graph Analytics in Big Data. John Feo Pacific Northwest National Laboratory
Graph Analytics in Big Data John Feo Pacific Northwest National Laboratory 1 A changing World The breadth of problems requiring graph analytics is growing rapidly Large Network Systems Social Networks
Adobe Insight, powered by Omniture
Adobe Insight, powered by Omniture Accelerating government intelligence to the speed of thought 1 Challenges that analysts face 2 Analysis tools and functionality 3 Adobe Insight 4 Summary Never before
A Novel Cloud Based Elastic Framework for Big Data Preprocessing
School of Systems Engineering A Novel Cloud Based Elastic Framework for Big Data Preprocessing Omer Dawelbeit and Rachel McCrindle October 21, 2014 University of Reading 2008 www.reading.ac.uk Overview
Building a real-time, self-service data analytics ecosystem Greg Arnold, Sr. Director Engineering
Building a real-time, self-service data analytics ecosystem Greg Arnold, Sr. Director Engineering Self Service at scale 6 5 4 3 2 1 ? Relational? MPP? Hadoop? Linkedin data 350M Members 25B 3.5M 4.8B 2M
bigdata Managing Scale in Ontological Systems
Managing Scale in Ontological Systems 1 This presentation offers a brief look scale in ontological (semantic) systems, tradeoffs in expressivity and data scale, and both information and systems architectural
Cray: Enabling Real-Time Discovery in Big Data
Cray: Enabling Real-Time Discovery in Big Data Discovery is the process of gaining valuable insights into the world around us by recognizing previously unknown relationships between occurrences, objects
Microsoft Amalga HIS Electronic Medical Record
m Microsoft Amalga HIS Electronic Medical Record The Microsoft Amalga Hospital Information System (HIS) revolves around an electronic medical record (EMR) providing a comprehensive view into a patient
Six Days in the Network Security Trenches at SC14. A Cray Graph Analytics Case Study
Six Days in the Network Security Trenches at SC14 A Cray Graph Analytics Case Study WP-NetworkSecurity-0315 www.cray.com Table of Contents Introduction... 3 Analytics Mission and Source Data... 3 Analytics
Optum One. The Intelligent Health Platform
Optum One The Intelligent Health Platform The Optum One intelligent health platform enables healthcare providers to manage patient populations. The platform combines the industry s most advanced integrated
Introduction to Data Mining
Introduction to Data Mining 1 Why Data Mining? Explosive Growth of Data Data collection and data availability Automated data collection tools, Internet, smartphones, Major sources of abundant data Business:
Big Data, Fast Data, Complex Data. Jans Aasman Franz Inc
Big Data, Fast Data, Complex Data Jans Aasman Franz Inc Private, founded 1984 AI, Semantic Technology, professional services Now in Oakland Franz Inc Who We Are (1 (2 3) (4 5) (6 7) (8 9) (10 11) (12
Using Ultra-Large Data Sets in Healthcare New Questions-New Answers
Using Ultra-Large Data Sets in Healthcare New Questions-New Answers David Hartzband, D.Sc.. Director, Technology Research, RCHN Community Health Foundation & Lecturer, Engineering Systems Division Massachusetts
Practical Considerations for Real-Time Business Intelligence. Donovan Schneider Yahoo! September 11, 2006
Practical Considerations for Real-Time Business Intelligence Donovan Schneider Yahoo! September 11, 2006 Outline Business Intelligence (BI) Background Real-Time Business Intelligence Examples Two Requirements
International Journal of Computer Science Trends and Technology (IJCST) Volume 2 Issue 3, May-Jun 2014
RESEARCH ARTICLE OPEN ACCESS A Survey of Data Mining: Concepts with Applications and its Future Scope Dr. Zubair Khan 1, Ashish Kumar 2, Sunny Kumar 3 M.Tech Research Scholar 2. Department of Computer
Using Data Mining and Machine Learning in Retail
Using Data Mining and Machine Learning in Retail Omeid Seide Senior Manager, Big Data Solutions Sears Holdings Bharat Prasad Big Data Solution Architect Sears Holdings Over a Century of Innovation A Fortune
THE VIRTUAL DATA WAREHOUSE (VDW) AND HOW TO USE IT
THE VIRTUAL DATA WAREHOUSE (VDW) AND HOW TO USE IT Table of Contents Overview o Figure 1. The HCSRN VDW and how it works Data Areas o Figure 2: HCSRN VDW data structures Steps for Using the VDW Multicenter
Integrating Genetic Data into Clinical Workflow with Clinical Decision Support Apps
White Paper Healthcare Integrating Genetic Data into Clinical Workflow with Clinical Decision Support Apps Executive Summary The Transformation Lab at Intermountain Healthcare in Salt Lake City, Utah,
Secondary Uses of Data for Comparative Effectiveness Research
Secondary Uses of Data for Comparative Effectiveness Research Paul Wallace MD Director, Center for Comparative Effectiveness Research The Lewin Group [email protected] Disclosure/Perspectives Training:
Department of Veterans Affairs (VA) Electronic Health Record: Providing Information for Clinical Care and Data for VA Health Research
Department of Veterans Affairs (VA) Electronic Health Record: Providing Information for Clinical Care and Data for VA Health Research National Institute of Health/National Library of Medicine: Workshop
Data and Information Management in Public Health
Data and Information Management in Public Health Adrienne S. Ettinger, Sc.D., M.P.H. Environmental Public Health Tracking Methods Course July 2004 Outline Information Management in Public Health Information
Introduction. A. Bellaachia Page: 1
Introduction 1. Objectives... 3 2. What is Data Mining?... 4 3. Knowledge Discovery Process... 5 4. KD Process Example... 7 5. Typical Data Mining Architecture... 8 6. Database vs. Data Mining... 9 7.
The Fusion of Supercomputing and Big Data: The Role of Global Memory Architectures in Future Large Scale Data Analytics
HPC 2014 High Performance Computing FROM clouds and BIG DATA to EXASCALE AND BEYOND An International Advanced Workshop July 7 11, 2014, Cetraro, Italy Session III Emerging Systems and Solutions The Fusion
Workshop on Hadoop with Big Data
Workshop on Hadoop with Big Data Hadoop? Apache Hadoop is an open source framework for distributed storage and processing of large sets of data on commodity hardware. Hadoop enables businesses to quickly
Supercomputing and Big Data: Where are the Real Boundaries and Opportunities for Synergy?
HPC2012 Workshop Cetraro, Italy Supercomputing and Big Data: Where are the Real Boundaries and Opportunities for Synergy? Bill Blake CTO Cray, Inc. The Big Data Challenge Supercomputing minimizes data
Design of a Multi Dimensional Database for the Archimed DataWarehouse
169 Design of a Multi Dimensional Database for the Archimed DataWarehouse Claudine Bréant, Gérald Thurler, François Borst, Antoine Geissbuhler Service of Medical Informatics University Hospital of Geneva,
CSE 544 Principles of Database Management Systems. Magdalena Balazinska Winter 2009 Lecture 15 - Data Warehousing: Cubes
CSE 544 Principles of Database Management Systems Magdalena Balazinska Winter 2009 Lecture 15 - Data Warehousing: Cubes Final Exam Overview Open books and open notes No laptops and no other mobile devices
Patient Management Systems. Terrence Adam, BS Pharm,, MD, PhD Assistant Professor, PCHS University of Minnesota College of Pharmacy
Patient Management Systems Terrence Adam, BS Pharm,, MD, PhD Assistant Professor, PCHS University of Minnesota College of Pharmacy Background Interests Interest in clinical informatics with training in
Conquering the Astronomical Data Flood through Machine
Conquering the Astronomical Data Flood through Machine Learning and Citizen Science Kirk Borne George Mason University School of Physics, Astronomy, & Computational Sciences http://spacs.gmu.edu/ The Problem:
Big Data :: Big Demand
Big Data :: Big Demand Introductions John Martin Director Information Management Children s Hospital of Philadelphia [email protected] William Nieczpiel Analytics Development Manager Children s
Bringing Big Data Modelling into the Hands of Domain Experts
Bringing Big Data Modelling into the Hands of Domain Experts David Willingham Senior Application Engineer MathWorks [email protected] 2015 The MathWorks, Inc. 1 Data is the sword of the
Data Isn't Everything
June 17, 2015 Innovate Forward Data Isn't Everything The Challenges of Big Data, Advanced Analytics, and Advance Computation Devices for Transportation Agencies. Using Data to Support Mission, Administration,
Where is... How do I get to...
Big Data, Fast Data, Spatial Data Making Sense of Location Data in a Smart City Hans Viehmann Product Manager EMEA ORACLE Corporation August 19, 2015 Copyright 2014, Oracle and/or its affiliates. All rights
Open source Google-style large scale data analysis with Hadoop
Open source Google-style large scale data analysis with Hadoop Ioannis Konstantinou Email: [email protected] Web: http://www.cslab.ntua.gr/~ikons Computing Systems Laboratory School of Electrical
5 Keys to Unlocking the Big Data Analytics Puzzle. Anurag Tandon Director, Product Marketing March 26, 2014
5 Keys to Unlocking the Big Data Analytics Puzzle Anurag Tandon Director, Product Marketing March 26, 2014 1 A Little About Us A global footprint. A proven innovator. A leader in enterprise analytics for
Healthcare Market Overview
Healthcare Market Overview INTRODUCTION For healthcare organizations that require instantaneous availability of accurate information, mtuitive extends the value of legacy investments with capture and delivery
Healthcare Big Data Exploration in Real-Time
Healthcare Big Data Exploration in Real-Time Muaz A Mian A Project Submitted in partial fulfillment of the requirements for degree of Masters of Science in Computer Science and Systems University of Washington
Transformational Data-Driven Solutions for Healthcare
Transformational Data-Driven Solutions for Healthcare Transformational Data-Driven Solutions for Healthcare Today s healthcare providers face increasing pressure to improve operational performance while
Innovative Information Visualization of Electronic Health Record Data: a Systematic Review
Innovative Information Visualization of Electronic Health Record Data: a Systematic Review Vivian West, David Borland, W. Ed Hammond February 5, 2015 Outline Background Objective Methods & Criteria Analysis
National Cancer Institute
National Cancer Institute Information Systems, Technology, and Dissemination in the SEER Program U.S. DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health Information Systems, Technology,
Moving Large Data at a Blinding Speed for Critical Business Intelligence. A competitive advantage
Moving Large Data at a Blinding Speed for Critical Business Intelligence A competitive advantage Intelligent Data In Real Time How do you detect and stop a Money Laundering transaction just about to take
Data Driven Discovery In the Social, Behavioral, and Economic Sciences
Data Driven Discovery In the Social, Behavioral, and Economic Sciences Simon Appleford, Marshall Scott Poole, Kevin Franklin, Peter Bajcsy, Alan B. Craig, Institute for Computing in the Humanities, Arts,
Bigdata : Enabling the Semantic Web at Web Scale
Bigdata : Enabling the Semantic Web at Web Scale Presentation outline What is big data? Bigdata Architecture Bigdata RDF Database Performance Roadmap What is big data? Big data is a new way of thinking
Chapter 3: Data Mining Driven Learning Apprentice System for Medical Billing Compliance
Chapter 3: Data Mining Driven Learning Apprentice System for Medical Billing Compliance 3.1 Introduction This research has been conducted at back office of a medical billing company situated in a custom
Presented by: Aaron Bossert, Cray Inc. Network Security Analytics, HPC Platforms, Hadoop, and Graphs Oh, My
Presented by: Aaron Bossert, Cray Inc. Network Security Analytics, HPC Platforms, Hadoop, and Graphs Oh, My The Proverbial Needle In A Haystack Problem The Nuclear Option Problem Statement and Proposed
SPATIAL DATA CLASSIFICATION AND DATA MINING
, pp.-40-44. Available online at http://www. bioinfo. in/contents. php?id=42 SPATIAL DATA CLASSIFICATION AND DATA MINING RATHI J.B. * AND PATIL A.D. Department of Computer Science & Engineering, Jawaharlal
Data Catalogs for Hadoop Achieving Shared Knowledge and Re-usable Data Prep. Neil Raden Hired Brains Research, LLC
Data Catalogs for Hadoop Achieving Shared Knowledge and Re-usable Data Prep Neil Raden Hired Brains Research, LLC Traditionally, the job of gathering and integrating data for analytics fell on data warehouses.
Documentation and Medical Records Paper-based and Computer-based
Documentation and Medical Records Paper-based and Computer-based Bengt Göransson [email protected] Medical Informatics 1MD012, Fall 2013 Division of Visual Information and Interaction, HCI Brainstorming
Big Data Analytics. Prof. Dr. Lars Schmidt-Thieme
Big Data Analytics Prof. Dr. Lars Schmidt-Thieme Information Systems and Machine Learning Lab (ISMLL) Institute of Computer Science University of Hildesheim, Germany 33. Sitzung des Arbeitskreises Informationstechnologie,
Environmental Health Science. Brian S. Schwartz, MD, MS
Environmental Health Science Data Streams Health Data Brian S. Schwartz, MD, MS January 10, 2013 When is a data stream not a data stream? When it is health data. EHR data = PHI of health system Data stream
CHAPTER-6 DATA WAREHOUSE
CHAPTER-6 DATA WAREHOUSE 1 CHAPTER-6 DATA WAREHOUSE 6.1 INTRODUCTION Data warehousing is gaining in popularity as organizations realize the benefits of being able to perform sophisticated analyses of their
Understanding applications using the BSC performance tools
Understanding applications using the BSC performance tools Judit Gimenez ([email protected]) German Llort([email protected]) Humans are visual creatures Films or books? Two hours vs. days (months) Memorizing
Impact Intelligence. Flexibility. Security. Ease of use. White Paper
Impact Intelligence Health care organizations continue to seek ways to improve the value they deliver to their customers and pinpoint opportunities to enhance performance. Accurately identifying trends
UNIVERSITY OF INFINITE AMBITIONS. MASTER OF SCIENCE COMPUTER SCIENCE DATA SCIENCE AND SMART SERVICES
UNIVERSITY OF INFINITE AMBITIONS. MASTER OF SCIENCE COMPUTER SCIENCE DATA SCIENCE AND SMART SERVICES MASTER S PROGRAMME COMPUTER SCIENCE - DATA SCIENCE AND SMART SERVICES (DS3) This is a specialization
[callout: no organization can afford to deny itself the power of business intelligence ]
Publication: Telephony Author: Douglas Hackney Headline: Applied Business Intelligence [callout: no organization can afford to deny itself the power of business intelligence ] [begin copy] 1 Business Intelligence
Healthcare data analytics. Da-Wei Wang Institute of Information Science [email protected]
Healthcare data analytics Da-Wei Wang Institute of Information Science [email protected] Outline Data Science Enabling technologies Grand goals Issues Google flu trend Privacy Conclusion Analytics
SQL Server 2012 Business Intelligence Boot Camp
SQL Server 2012 Business Intelligence Boot Camp Length: 5 Days Technology: Microsoft SQL Server 2012 Delivery Method: Instructor-led (classroom) About this Course Data warehousing is a solution organizations
HadoopRDF : A Scalable RDF Data Analysis System
HadoopRDF : A Scalable RDF Data Analysis System Yuan Tian 1, Jinhang DU 1, Haofen Wang 1, Yuan Ni 2, and Yong Yu 1 1 Shanghai Jiao Tong University, Shanghai, China {tian,dujh,whfcarter}@apex.sjtu.edu.cn
Associate Prof. Dr. Victor Onomza Waziri
BIG DATA ANALYTICS AND DATA SECURITY IN THE CLOUD VIA FULLY HOMOMORPHIC ENCRYPTION Associate Prof. Dr. Victor Onomza Waziri Department of Cyber Security Science, School of ICT, Federal University of Technology,
ANALYTICS BUILT FOR INTERNET OF THINGS
ANALYTICS BUILT FOR INTERNET OF THINGS Big Data Reporting is Out, Actionable Insights are In In recent years, it has become clear that data in itself has little relevance, it is the analysis of it that
4. Understanding Clinical Data and Workflow Understanding Surveillance Data Exchange Processes Guide and Worksheet
To properly prepare for implementing the pilot of your surveillance program and its subsequent rollout, you must understand the surveillance data exchange processes. These processes can vary depending
Meaningful Use Stage 2 Certification: A Guide for EHR Product Managers
Meaningful Use Stage 2 Certification: A Guide for EHR Product Managers Terminology Management is a foundational element to satisfying the Meaningful Use Stage 2 criteria and due to its complexity, and
EMPI: A BUILDING BLOCK FOR INTEROPERABILITY
WHITE PAPER EMPI: A BUILDING BLOCK FOR INTEROPERABILITY Table of Contents Executive Summary... 2 Extending the Business Impact of EMPI... 2 Does Your EMPI Deliver?... 3 Interoperability to enable communication
Data Warehousing. Overview, Terminology, and Research Issues. Joachim Hammer. Joachim Hammer
Data Warehousing Overview, Terminology, and Research Issues 1 Heterogeneous Database Integration Integration System World Wide Web Digital Libraries Scientific Databases Personal Databases Collects and
GC3 Use cases for the Cloud
GC3: Grid Computing Competence Center GC3 Use cases for the Cloud Some real world examples suited for cloud systems Antonio Messina Trieste, 24.10.2013 Who am I System Architect
Basic Study Designs in Analytical Epidemiology For Observational Studies
Basic Study Designs in Analytical Epidemiology For Observational Studies Cohort Case Control Hybrid design (case-cohort, nested case control) Cross-Sectional Ecologic OBSERVATIONAL STUDIES (Non-Experimental)
GEMS: Graph database Engine for Multithreaded Systems
1 GEMS: Graph database Engine for Multithreaded Systems Alessandro Morari, Vito Giovanni Castellana, Antonino Tumeo, Jesse Weaver, David Haglin, Sutanay Choudhury, John Feo Pacific Northwest National Laboratory
Complexity and Scalability in Semantic Graph Analysis Semantic Days 2013
Complexity and Scalability in Semantic Graph Analysis Semantic Days 2013 James Maltby, Ph.D 1 Outline of Presentation Semantic Graph Analytics Database Architectures In-memory Semantic Database Formulation
Analytics in the Cloud. Peter Sirota, GM Elastic MapReduce
Analytics in the Cloud Peter Sirota, GM Elastic MapReduce Data-Driven Decision Making Data is the new raw material for any business on par with capital, people, and labor. What is Big Data? Terabytes of
BMC ProactiveNet Performance Management: Delivering on the Promise of Predictive Control Across the Total IT Environment SOLUTION WHITE PAPER
BMC ProactiveNet Performance Management: Delivering on the Promise of Predictive Control Across the Total IT Environment SOLUTION WHITE PAPER TABLE OF CONTENTS EXECUTIVE OVERVIEW 1 MANAGING INCREASING
W H I T E P A P E R. Deriving Intelligence from Large Data Using Hadoop and Applying Analytics. Abstract
W H I T E P A P E R Deriving Intelligence from Large Data Using Hadoop and Applying Analytics Abstract This white paper is focused on discussing the challenges facing large scale data processing and the
Spark in Action. Fast Big Data Analytics using Scala. Matei Zaharia. www.spark- project.org. University of California, Berkeley UC BERKELEY
Spark in Action Fast Big Data Analytics using Scala Matei Zaharia University of California, Berkeley www.spark- project.org UC BERKELEY My Background Grad student in the AMP Lab at UC Berkeley» 50- person
Susan J Hyatt President and CEO HYATTDIO, Inc. Lorraine Fernandes, RHIA Global Healthcare Ambassador IBM Information Management
Accurate and Trusted Data- The Foundation for EHR Programs Susan J Hyatt President and CEO HYATTDIO, Inc. Lorraine Fernandes, RHIA Global Healthcare Ambassador IBM Information Management Healthcare priorities
The Development of the Clinical Trial Ontology to standardize dissemination of clinical trial data. Ravi Shankar
The Development of the Clinical Trial Ontology to standardize dissemination of clinical trial data Ravi Shankar Open access to clinical trials data advances open science Broad open access to entire clinical
How To Handle Big Data With A Data Scientist
III Big Data Technologies Today, new technologies make it possible to realize value from Big Data. Big data technologies can replace highly customized, expensive legacy systems with a standard solution
HPC and Big Data. EPCC The University of Edinburgh. Adrian Jackson Technical Architect [email protected]
HPC and Big Data EPCC The University of Edinburgh Adrian Jackson Technical Architect [email protected] EPCC Facilities Technology Transfer European Projects HPC Research Visitor Programmes Training
Scaling Out With Apache Spark. DTL Meeting 17-04-2015 Slides based on https://www.sics.se/~amir/files/download/dic/spark.pdf
Scaling Out With Apache Spark DTL Meeting 17-04-2015 Slides based on https://www.sics.se/~amir/files/download/dic/spark.pdf Your hosts Mathijs Kattenberg Technical consultant Jeroen Schot Technical consultant
Search and Data Mining: Techniques. Applications Anya Yarygina Boris Novikov
Search and Data Mining: Techniques Applications Anya Yarygina Boris Novikov Introduction Data mining applications Data mining system products and research prototypes Additional themes on data mining Social
Large-Scale Data Processing
Large-Scale Data Processing Eiko Yoneki [email protected] http://www.cl.cam.ac.uk/~ey204 Systems Research Group University of Cambridge Computer Laboratory 2010s: Big Data Why Big Data now? Increase
Data Warehouse: Introduction
Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of base and data mining group,
The Cubetree Storage Organization
The Cubetree Storage Organization Nick Roussopoulos & Yannis Kotidis Advanced Communication Technology, Inc. Silver Spring, MD 20905 Tel: 301-384-3759 Fax: 301-384-3679 {nick,kotidis}@act-us.com 1. Introduction
Machine Learning for Data-Driven Discovery
Machine Learning for Data-Driven Discovery Thoughts on the Past, Present and Future Sreenivas Rangan Sukumar, PhD Data Scientist and R&D Staff Computaional Data Analytics Group Computational Sciences and
Achieving Cost-Effective, Vendor-Neutral Archiving For Your Enterprise
Achieving Cost-Effective, Vendor-Neutral Archiving For Your Enterprise How To Merchandise Data for Clinical Use By: Eran Galil, PACS/Archive Product Manager, Carestream Health, BBM JoAnn Linder, RIS/PACS/Archive
Interoperability and Analytics February 29, 2016
Interoperability and Analytics February 29, 2016 Matthew Hoffman MD, CMIO Utah Health Information Network Conflict of Interest Matthew Hoffman, MD Has no real or apparent conflicts of interest to report.
Data Mining in the Swamp
WHITE PAPER Page 1 of 8 Data Mining in the Swamp Taming Unruly Data with Cloud Computing By John Brothers Business Intelligence is all about making better decisions from the data you have. However, all
