Fighting Cancer with the (help of) Grid. Jarek Nabrzyski, PSNC naber@man.poznan.pl



Similar documents
Developing a European Biomedical GRID for post-genomic research on Cancer

An Ontology Based Method to Solve Query Identifier Heterogeneity in Post- Genomic Clinical Trials

Using the Grid for the interactive workflow management in biomedicine. Andrea Schenone BIOLAB DIST University of Genova

Virtual digital libraries: The DILIGENT Project. Donatella Castelli ISTI-CNR, Italy

i2b2 Clinical Research Chart

Towards a National Health Knowledge Infrastructure (NHKI): The role of Semantics-based Knowledge Management vkashyap1@partners.org

How To Integrate An Ontology Based Database Into A Cancer Database

Applications des grilles aux sciences du vivant

TRANSFoRm: Vision of a learning healthcare system

Clinical Research from EHR data

Cloud Thinking. Simplifying Big Data Processing. Rui L. Aguiar, Diogo Gomes Universidade de Aveiro - Portugal

EDITORIAL MINING FOR GOLD : CAPITALISING ON DATA TO TRANSFORM DRUG DEVELOPMENT. A Changing Industry. What Is Big Data?

EHR Data Preservation

APPENDIX to CAHIIM 2012 Curriculum Requirements Health Informatics Master s Degree

i2b2 Clinical Research Chart

CAMPUS ACTIVITIES 2016

Semantically Steered Clinical Decision Support Systems

Euro-BioImaging European Research Infrastructure for Imaging Technologies in Biological and Biomedical Sciences

Interoperability in Grid Computing

Data Grids. Lidan Wang April 5, 2007

The Data Grid: Towards an Architecture for Distributed Management and Analysis of Large Scientific Datasets

Theodoros. N. Arvanitis, RT, DPhil, CEng, MIET, MIEEE, AMIA, FRSM

Using Ontologies in Proteus for Modeling Data Mining Analysis of Proteomics Experiments

CVIMO Deployment of a Cyber-infrastructure on top of TRENCADIS Architecture to Share and Create DICOM Studies and Structured Reports

ehealth ELSA: The perspective of the VPH research community

An introduction to System Architecture

Clinical Research Informatics: Challenges And Trends

Grid Activities in Poland

EBiSC the first European bank for induced pluripotent stem cells

Software Description Technology

Shared Services and Healthcare Cloud:

What s Next for Data Sharing: Insight from the NIH Experience

Software services competence in research and development activities at PSNC. Cezary Mazurek PSNC, Poland

Web-Based Genomic Information Integration with Gene Ontology

EUFORIA: Grid and High Performance Computing at the Service of Fusion Modelling

Plateforme de Calcul pour les Sciences du Vivant. SRB & glite. V. Breton.

IO Informatics The Sentient Suite

ISO INTERNATIONAL STANDARD. Health informatics Requirements for an electronic health record architecture

MS Health Informatics ( )

Introduction to Grid computing

Translational research facilitating experimental medicine in dementia in the UK

WP 5 Paediatric Oncology tumour board ERN based on EHealth/ICT concepts for sharing and providing expert advice:

How To Preserve Digital Objects

How To Build An International Security System

TRANSFoRm: TRANSFoRming health care research and its implementation

Workprogramme

Gradient An EII Solution From Infosys

Open Source Repository and User Support

Legal perspectives in EU projects. Prof. Dr. Nikolaus Forgó Leibniz University Hannover

The National Coalition for Health Integration NCHI

ECRIN (European Clinical Research Infrastructures Network)

School of Nursing University of Minnesota Informatics Competencies across the Curriculum

Abstract. 1. Introduction. Ohio State University Columbus, OH

Ahmed BEN HAMIDA,Professor

SOA in the pan-canadian EHR

Personalized Medicine and IT

Semantic interoperability of dual-model EHR clinical standards

Text Mining for Health Care and Medicine. Sophia Ananiadou Director National Centre for Text Mining

LABERINTO at ImageCLEF 2011 Medical Image Retrieval Task

BRISSkit: Biomedical Research Infrastructure Software Service kit. Jonathan Tedds. #brisskit #umfcloud

CAMDIT: A Toolkit for Integrating Heterogeneous Medical Data for improved Health Care Service Provisioning

Data Management and Standardisation in Distributed Systems Biology Research

Accelerate > Converged Storage Infrastructure. DDN Case Study. ddn.com DataDirect Networks. All Rights Reserved

Article on Grid Computing Architecture and Benefits

Scientific versus Business Workflows

A Big Picture for Big Data

Secure Exchange of Clinical Information for Clinical Research. Using open source technology

Knowledge based energy management for public buildings through holistic information modeling and 3D visualization. Ing. Antonio Sacchetti TERA SRL

EHR Systems: an Introduction

The next generation EHR

The glite File Transfer Service

DAME Astrophysical DAta Mining Mining & & Exploration Exploration GRID

Grids, e-business and e-utilities. Tony Hey Director of the UK e-science Core Programme EPSRC and DTI

The NIH Roadmap: Re-Engineering the Clinical Research Enterprise

Overview of the Internet of things

Digital libraries of the future and the role of libraries

Health Informatics Research and Development in Europe

Personalized Medicine: Humanity s Ultimate Big Data Challenge. Rob Fassett, MD Chief Medical Informatics Officer Oracle Health Sciences

How e-science may transform healthcare delivery. Professor Sir Michael Brady FRS FREng Hilary Term 2005

Find the signal in the noise

Game Changers for Researchers: Altmetrics, Big Data, Open Access What Might They Change? Kiki Forsythe, M.L.S.

Information Management Metamodel

ULICE WP7 Services for CPT Research. Interim report Steve Harris, University of Oxford

HETEROGENEOUS DATA INTEGRATION FOR CLINICAL DECISION SUPPORT SYSTEM. Aniket Bochare - aniketb1@umbc.edu. CMSC Presentation

The ERS IC-EHR as Local, Regional and National ehealth Infrastructure July 2010

Grids for life sciences: status and perspectives

POLAR IT SERVICES. Business Intelligence Project Methodology

NSW Government Standard Approach to Information Architecture. December 2013 v.1.0

How To Use Networked Ontology In E Health

Meme Media and Knowledge Federation in the Age of Big Data

Modeling Temporal Data in Electronic Health Record Systems

PRACE WP4 Distributed Systems Management. Riccardo Murri, CSCS Swiss National Supercomputing Centre

MediSapiens Ltd. Bio-IT solutions for improving cancer patient care. Because data is not knowledge. 19th of March 2015

Data Management System for grid and portal services

Helix Nebula, the Science Cloud

Dr Alexander Henzing

BioGrid s use of Business Analytics for Collaborative Medical Research. Maureen Turner, CEO, BioGrid Australia

Informatics Domain Task Force (idtf) CTSA PI Meeting 02/04/2015

Virtual research environments: learning gained from a situation and needs analysis for malaria researchers

OCRe: Ontology of Clinical Research

Transcription:

Fighting Cancer with the (help of) Grid Jarek Nabrzyski, PSNC naber@man.poznan.pl

Outline Cancer informatics Cancer Grid projects European ACGT Project (ehealth) Summary Cancer basics are not covered in this talk

The problem Individual cancer research groups and hospitals generate large amounts of data of diverse types Sharing of this data is not common today Value can be added by collecting data together and analysing it further sharing data of one type integrating data of different types But in most areas of research sharing and integrating data is invalid or error prone because of lack of standards and infrastructure

Does data sharing add value? Genbank grew from 606 sequencies in 1982 to >30 million in 2003 Data retrieved in first 6 months of 2004: equivalent to 5x10 13 base pairs Cost of regenerating 0.01% of this: $500 million

Essential standards & infrastructure Data elements Controlled vocabularies and ontologies Data exchange formats Protocol standardisation Implementation architecture & databases Data mining tools Confidentiality & privacy enhancing technologies Knowledge management

New knowledge, products & procedures Experiment Hypothesis generation INFORMATICS PLATFORM Integrated clinical data sources Integrated biological data sources Data & metadata Test against existing data resources Integrated genomic data sources Publications repository

Cancer Grid Projects www.worldcommunitygrid.org CancerGrid (UK) Breast Cancer Grid in US (Pennsylvania) Cancer Biomedical Informatics Grid (cabig) Biomedical Informatics Research Network (BIRN)

Advancing Clinico-Genomic Clinical Trials on Cancer (ACGT) New EU-funded IP project under ehealth unit Starting: February 2006, 4-years, 25 partners Budget: 16.7ME (11.8 requested) To manage and exploit the vast amounts of diverse information currently generated Improve prevention and treatment of cancer by effective use of informatics Increase the impact of European cancer research

ACGT Partner Countries

ACGT Goals To deliver to the cancer research community an integrated Clinico-Genomic ICT environment enabled by Grid infrastructure. Grid: seamless mediation services for sharing data and dataprocessing methods and tools and advanced security Integration: semantic, ontology based integration of clinical and genomic/proteomic data (standard clinical and genomic ontologies and metadata). Knowledge discovery: Delivery of data-mining Grid services in order to support and improve complex knowledge discovery process. ACGT is trying to put in place, metaphorically, a World Wide Web of cancer research, a semantically interoperable World Wide Web of cancer research.

Clinical trials Presence of clear research objectives, to validate the technological platform, Development of mechanisms for the smooth incorporation of the clinical-trials in an integrated, GRID-enabled and enriched with knowledgediscovery capabilities environment, Interpretation of results, as presented by the extracted knowledge, into standardized clinical guidelines and potential protocols

ACGT Pilot Trials Breast cancer Paediatric nephroblastoma or, Wilms tumour (PN) Development and evaluation of in silico tumour growth and tumour/normal tissue response simulation models

Current Approach Thousands case report forms (CRF) are collected to the central repository Data is only partially standardized CRFs CRFs CRFs trial database

Clinical trials today Protocol Paper forms

Integrated trials management

Genomics Meets Medicine Advancing and targeting microarray experiments in Clinical Trials Advancing and targeting polymorphisms identification in Clinical Trials Advancing and targeting systems biology approaches in Clinical Trials.

ACGT and Grid ACGT will deliver a biomedical GRID infrastructure able to manage distributed databases where clinical applications, decision support and knowledge discovery operations would access information, data and computational resources scattered over geographically dispersed databases and coderepositories, Special effort will be devoted for the GRID-enabling of the provisioned data mining components and knowledge discovery operations, Besides the computational empowerment, the provisioned ACGT GRID infrastructure will be also tuned in order to support the seamless aspects in data management and processing, thus meeting the needs of a knowledge-grid. Grid deployment, adding extensions where needed

ACGT Architecture 6-layer architecture Conceptual model mapping applications and requirements to chosen technologies

ACGT Data Layer Seamless and interoperable data access services to the distributed data sources for efficient integration the clinico-genomic data. Include as many as possible institutions that want to share data and goals for cancer research Inclusive environment, VO for cancer research

ACGT Grid Layer Infrastructure services: communication between disparate resources, Resource management services: monitoring, reservation, deployment, configuration, job mgmt, etc. Data services: database access and integration, data movement, replica management, data transformations etc. Context services: describe the resources, usage policies, etc. Grid information services: GPIR Self-management, autonomous services Security services: safe resource sharing, AAA, patient personal data security

Research Challenges Knowledge Mining and Clinico-Genomics: to identify reliable molecular-signatures that correlate patients phenotypical cancer features, i.e., medical history parameters, clinical observations, treatment outcome and disease recurrence, with the respective patients genomic / genotypical profiles. Knowledge Mining Tools: A common and unified suite: using R

ACGT Integrated Environment

ACGT VOs

Benefits Patients More rapid development of prevention & treatment Greater safety of therapy Researchers The informatics platform Clinicians Greater knowledge base Decision support systems Regulators & the NHS Advances in care Improved presentation of applicable research Funders Increased cost-effectiveness and advance in cancer care

Summary Big hopes Will work with EGEE2 and other infrastructure projects (CrossGrid?, GridLab, DEISA)

ERCIM Foundation for Research and Technology Hellas Acknowledgements ACGT partners are: Institut National de Recherche en Informatique et en Automatique University van Amsterdam Philips Electronics Nederland B.V. Association Hospitaliere de Bruxelles Centre Hospitalier Universitaire Bordet Institut Suisse de Bioinformatique Lunds Universitet Universidad de Malaga Universidad Politechnica de Madrid Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung A. Persidis & SIA O.E. University of Crete Unisersitaet Hannover Poznan Supercomputing and Networking Center Custodix Healthgrid Institute of Communications and Computer Systems Universitaet des Saarland S.C. SIVECO ROMANIA SA Facultes Universitaires Notre-Dame de la Paix Universitaet Hamburg The Chancellor, Masters and Scholars of the University of Oxford Hokkaido University Istituto Europeo di Oncologia s.r.l