Open & Big Data for Life Imaging Technical aspects : existing solutions, main difficulties. Pierre Mouillard MD



Similar documents
Image Area. View Point. Medical Imaging. Advanced Imaging Solutions for Diagnosis, Localization, Treatment Planning and Monitoring.

AW Server 3 for Universal Viewer

Ganzheitliches Datenmanagement

GE Healthcare. Centricity PACS and PACS-IW with Universal Viewer* Where it all comes together

BIG DATA IN THE CLOUD : CHALLENGES AND OPPORTUNITIES MARY- JANE SULE & PROF. MAOZHEN LI BRUNEL UNIVERSITY, LONDON

Hortonworks & SAS. Analytics everywhere. Page 1. Hortonworks Inc All Rights Reserved

Hadoop Evolution In Organizations. Mark Vervuurt Cluster Data Science & Analytics

HDP Hadoop From concept to deployment.

USING GENIE REMOTELY

A new innovation to protect, share, and distribute healthcare data

7i Imaging on Demand PACS Solution FAQ s

Fight fire with fire when protecting sensitive data

Addressing Risk Data Aggregation and Risk Reporting Ben Sharma, CEO. Big Data Everywhere Conference, NYC November 2015

ESS event: Big Data in Official Statistics. Antonino Virgillito, Istat

Information Builders Mission & Value Proposition

Presenting Mongoose A New Approach to Traffic Capture (patent pending) presented by Ron McLeod and Ashraf Abu Sharekh January 2013

Mobile Healthcare Security Whitepaper

BIG DATA ANALYTICS REFERENCE ARCHITECTURES AND CASE STUDIES

Hadoop and Relational Database The Best of Both Worlds for Analytics Greg Battas Hewlett Packard

Harnessing the Power of the Microsoft Cloud for Deep Data Analytics

Big Data Analytics Nokia

SECURE, ENTERPRISE FILE SYNC AND SHARE WITH EMC SYNCPLICITY UTILIZING EMC ISILON, EMC ATMOS, AND EMC VNX

Client Overview. Engagement Situation. Key Requirements

So What s the Big Deal?

GE Healthcare. Centricity * PACS with Universal Viewer. Universal Viewer. Where it all comes together.

Data Refinery with Big Data Aspects

Transforming the Telecoms Business using Big Data and Analytics

Cloud Computing Where ISR Data Will Go for Exploitation

BIG DATA Alignment of Supply & Demand Nuria de Lama Representative of Atos Research &

GE Healthcare. Centricity* PACS and PACS-IW with Universal Viewer. Universal Viewer. Where it all comes together.

VNA Vendor Neutral Archive, The How and Why

While a number of technologies fall under the Big Data label, Hadoop is the Big Data mascot.

Assignment # 1 (Cloud Computing Security)

BIG DATA What it is and how to use?

Changing The World One Hospital At a Time

XpoLog Competitive Comparison Sheet

Golder Cat-Scan & MRI Center Automates Imaging Center Workflows

I/O Considerations in Big Data Analytics

MedBroker A DICOM and HL7 Integration Product. Whitepaper

Mimecast Enterprise Information Archiving

Big Data and Cloud Computing for GHRSST

Introducing MIPAV. In this chapter...

Luncheon Webinar Series May 13, 2013

10231B: Designing a Microsoft SharePoint 2010 Infrastructure

Daymark DPS Enterprise - Agentless Cloud Backup and Recovery Software

Big Data and Analytics: Challenges and Opportunities

Workstation, Server Hosted

GEOG 482/582 : GIS Data Management. Lesson 10: Enterprise GIS Data Management Strategies GEOG 482/582 / My Course / University of Washington

COMP9321 Web Application Engineering

Understanding Object Storage and How to Use It

The Future of Data Management

WHITEPAPER. A Technical Perspective on the Talena Data Availability Management Solution

The deployment of OHMS TM. in private cloud

Manifest for Big Data Pig, Hive & Jaql

Agentless Cloud Backup and Recovery Software for the Enterprise

Embedded Systems in Healthcare. Pierre America Healthcare Systems Architecture Philips Research, Eindhoven, the Netherlands November 12, 2008

PARCA Certified PACS System Analyst (CPSA2014) Requirements

Scalable Architecture on Amazon AWS Cloud

HIGH AVAILABILITY CONFIGURATION FOR HEALTHCARE INTEGRATION PORTFOLIO (HIP) REGISTRY

Augmented Search for Web Applications. New frontier in big log data analysis and application intelligence

Big Data? Definition # 1: Big Data Definition Forrester Research

Standard-Compliant Streaming of Images in Electronic Health Records

Cloud Courses Description

Integrating Hadoop. Into Business Intelligence & Data Warehousing. Philip Russom TDWI Research Director for Data Management, April

Dominik Wagenknecht Accenture

BIG DATA TOOLS. Top 10 open source technologies for Big Data

TRANSFORMING DATA PROTECTION

Analyzing HTTP/HTTPS Traffic Logs

Electronic Health Records and XDS the IHE approach

Comprehensive Agentless Cloud Backup and Recovery Software for the Enterprise

END TO END DATA CENTRE SOLUTIONS COMPANY PROFILE

Mobile Access Software Blade

Indian Journal of Science The International Journal for Science ISSN EISSN Discovery Publication. All Rights Reserved

Big Data Challenges in Bioinformatics

The Rise of Industrial Big Data. Brian Courtney General Manager Industrial Data Intelligence

Cloud computing - Architecting in the cloud

End to End Solution to Accelerate Data Warehouse Optimization. Franco Flore Alliance Sales Director - APJ

Building an efficient and inexpensive PACS system. OsiriX - dcm4chee - JPEG2000

Introduction to Hadoop. New York Oracle User Group Vikas Sawhney

The Design and Implementation of the Zetta Storage Service. October 27, 2009

Big Data Challenges. technology basics for data scientists. Spring Jordi Torres, UPC - BSC

Large Scale/Big Data Federation & Virtualization: A Case Study

AtriaCapture using Asigra

Redefining Microsoft SQL Server Data Management. PAS Specification

A Nemaris Company. Formal Privacy & Security Assessment For Surgimap version and higher

Transcription:

Open & Big Data for Life Imaging Technical aspects : existing solutions, main difficulties Pierre Mouillard MD

What is Big Data? lots of data more than you can process using common database software and standard computers complex data dataflows and time series the origins: big science, demographics, economics the next generation origins: e-commerce

What could we possibly do with all this data? Could we discover new unknown things, just digging and computing these huge datasets? The more data we have, the better: lots of data means better accuracy as a common sense paradigm and mostly a misconception

If you don t mind, I ll grab some big data from you.

Medical imaging prevention, screening diagnostic and decision aid training & planning before treatment real time imaging during therapeutic act reference sets and atlas follow-up of pathology, treatment assessment and: research epidemiology

Big data & medical imaging medical images are big: dynamic 3D CT scan = 1Gb, digitized XR = 100Mb an average hospital generate about 10-300 Tb / year mostly unstructured data (60-80%) medical image archives are increasing by 20-40% / year for one patient, average of 3 imaging modalities per medical act very different from e-commerce! fewer instances, more data per capita

Sharing medical images medical practice becomes increasingly collective multi-modality (CT, MRI, Usound, XR, PET ) means numerous experts involved tele-medecine, tele-diagnostics means remote access to images & medical data nosology and semiotics have to be redefined and expanded because of continuous advances in imaging research is more focused on specific diseases all this means we need to improve and facilitate medical image sharing

Big data: what for? numeric reference images databases patient similarity searching disease progression monitoring, clinical follow-up cases studies, training and learning, expertise sharing nosology and semiotics redefinition new algorithms and image editing tools testing shared archives epidemiology big & open data in medical imaging is a challenge, on a global scale

Big data: 2 concepts STATIC BIG DATA retrospective and prospective studies on finite sets of images statistical analysis at some point and conclusions = knowledge extraction and rules definition DYNAMIC BIG DATA forever ongoing studies, with always expanding image sets auto-adaptative and machine learning systems (neural networks, genetic algorithms ) = always improving and automatic optimization digital expertise and AI is rising!

Imaging data images are just big packs of digits each pixel (voxel) is a measure (each image is a rich dataset) images are just raw data matrix, unstructured data to be used as big data, image processing is needed image has to be normalized, segmented, and expertized ROI definition, measures of volumes & distances, characterization of structures, 3D reconstruction, dynamic analysis automatic processing or manual (assisted) metadata is the key for relevant retrieval and selection and clinical context data is always needed!

And then came the Internet Internet technologies are becoming the de facto standards for data sharing, collaboration, and global access to information Internet is now the strongest incentive for technological innovation in IT

Technological trends data encapsulation distributed storage processing and storage virtualization data centric architectures APIs, front / back dissociation new databases systems security open formats, open source, normalization innovation is coming from high demand Internet application projects: social networks, large e-commerce platforms, data sharing clouds

Imaging data object encapsulation images overlays 3D infos settings measures tags patient ID report clinical data DICOM

Distributed storage data query load balancer nodes storage

Distributed storage outsourced hosting health data agreement

Processing & storage virtualization Hadoop / Hive Map reduce Mahout

APIs - front / back API back app server DB data frontal app user

APIs - front / back store-search-retrieval batch processing distributed computing API server user computer frontal app pre-processing normalization individual analytics interface v v

Flat database systems user app server SGDB data in proprietary format (no direct access) user app server flat data (file system access) NoSQL - Cassandra, MongoDB

Security backups / mirrors redundant servers isolated database systems controlled access firewall encrypted communication SSL tunnelling / VPN API API data access only blind server & DB maintenance data striping crypted data

SaaS data architect + + back end builder front end designer imagery system linked data scalability virtualization cloud cluster hosting outsourcing logs WEB APP (SaaS) CRUD APP +interface data providers open data log / data supervisor controlled access API sandbox other SaaS database backups snapshots ghost server data replication recordsets Queries reports boards stats anonymization

Imaging data architecture non vendors PACS (open PACS) - DCM4CHEE, Osirix server, PACSOne open source visualization software : Osirix, Weasis (webapp) BYOD and scaling down: standard desktop computer, laptop, even tablets as viewing stations accessing the PACS at the patient bed or from remote location linking with in house medical record system and global medical record system research connexion: direct access to dynamic big data integration of new algorithms as add-ons modules (local processing) objective : promote imaging as actionable data, for research and benefit of the other patients

standard PACS CT scan MRI Ultrasound D X-Rays shared storage (NAS) consoles viewing stations

radiology dpt clinical dept image DB e-pacs patients records system remote viewing station remote PACS global patients record system (DMP)

imagery department remote processing backup archives new algorithms heuristics analytics added analytics / overlays normalized atlases API remote / mobile viewing open data remote images repository open data epidemiology publications studies connected research

difficulties pooling images from different sources is difficult: different resolutions, different angles, registration, quality of image image analysis is computing intensive: but radiologists cannot wait too long for results in an everyday use who wants to share? medical conservatism too much privacy regulation? extraction of the important and relevant features in images is a daunting task image formats, clinical records are not always coherent incompatible proprietary systems - we need better open source software what do we do with old data? heterogeneous data? ethics: who owns the data? what about patient consent? security concerns: black hat hackers are everywhere the IT people should not access to patient s health data how to integrate into PACS new algorithms modules? who pays for this?

perspectives new imaging sources: mobile ultrasound, mobile MRI connected objects: calibrated images taken with smartphone dynamic images and time series robotic surgery in vivo image multimodal overlay (guided surgery) PACS app for iphone?

thank you! pierre.mouillard@vigisys.fr