Grille de calcul pour le biomédical Journée BioSTIC, 10 juillet 2008
|
|
- Ernest Atkinson
- 8 years ago
- Views:
Transcription
1 Grille de calcul pour le biomédical Journée BioSTIC, 10 juillet 2008 Johan Montagnat Diane Lingrand CNRS / UNSA, I3S laboratory, RAINBOW team EGEE-II INFSO-RI
2 EGEE Grid Infrastructure Flagship European grid infrastructure project Now in 3rd phase with more than 100 partners Size of the infrastructure today: > 250 sites in 48 countries > CPU cores ~ 5 PB disk + tape MSS > jobs/day > 9000 registered users EGEE-II INFSO-RI Medical image analysis on EGEE, J. Montagnat, BioGrid, June 2, 2008
3 User requests Middleware components DataSets info resources info Information Service Data Management Author. &Authen. queries queries Workload Management Site X Resources allocation Publication Sites Resources Indexing Computing Resources Storage Resources Dynamic evolution Logging, real time monitoring EGEE-II INFSO-RI DICOM Image Management, J. Montagnat, HealthGrid, June 2,
4 glite middleware Complex middleware stack On top of Globus Toolkit 2, VDT, CONDOR, batch schedulers... Fundational services User authentication and authorization Information Index: registration of Computing Elements (CEs), Storage Elements (SEs), etc. Workload Management System: workload distribution over sites Data Management System: virtual file hierarchy, standard (SRM) interface to storage resources Batch oriented EGEE is a federation of computing centers and File Servers oriented EGEE-II INFSO-RI Medical image analysis on EGEE, J. Montagnat, BioGrid, June 2, 2008
5 Why grids for medical imaging? Enabling Grids for E-sciencE Sharing computing resources and algorithms Research (populations studies, models design, validation, statistics) Complex analysis (compute intensive image processing, time constraints...) Data Procedures Algorithms S e q 1 > d c s c d s s d c s d c d s c bscdsbcbjbfvbfvbvfbvbvbhvb hsvbhdvbhfdbvfd S e q 2 > b v d f v f d v h b d f v b bhvdsvbhvbhdvrefghefgdscg dfgcsdycgdkcsqkc S e q n > b v d f v f d v h b d f v b bhvdsvbhvbhdvrefghefgdscg dfgcsdycgdkcsqkchdsqhfduh dhdhqedezhhezldhezhfehfle zfzejfv Computing power EGEE-II INFSO-RI Medical image analysis on EGEE, J. Montagnat, BioGrid, June 2,
6 Building on Production Grids Applications Applications Applications Applications Applications level Application-level services (high-level middleware) Application-level services (high-level middleware) Production grid infrastructure level Resources (low-level) Middleware Resources Resources Resources Resources Communication layer (GEANT, Internet...) EGEE-II INFSO-RI Medical image analysis on EGEE, J. Montagnat, BioGrid, June 2, 2008
7 The Biomed Virtual Organization Enabling Grids for E-sciencE Fostering scientific communities EGEE is an international, multi-disciplinary research infrastructure Scientific communities are expanding beyond administrative boundaries Virtual Organizations Authentication and Authorization management Share resources (computers, data, algorithms...) inside a VO Application areas identification and support unit The Biomed VO now divided in 3 sectors Medical imaging Bioinformatics Drug discovery EGEE-II INFSO-RI Medical image analysis on EGEE, J. Montagnat, BioGrid, June 2,
8 Biomed Virtual Organization Size of the infrastructure today: > 250 sites in 48 countries > CPU cores ~ 5 PB disk + tape MSS > jobs/day > 9000 registered users Out of which, Biomed VO: > 100 sites in 30 countries (170 CEs, 130 SEs) ~ CPU > 150 registered users EGEE-II INFSO-RI Medical image analysis on EGEE, J. Montagnat, BioGrid, June 2, 2008
9 Irregular use through time Year 2007 statistics collected per month EGEE-II INFSO-RI Medical image analysis on EGEE, J. Montagnat, BioGrid, June 2, 2008
10 Total production in 2007 ~800 CPU years (1000 SpecInt 2000 Normalised CPU time) EGEE-II INFSO-RI Medical image analysis on EGEE, J. Montagnat, BioGrid, June 2, 2008
11 Total production in 2007 ~2,600,000 jobs EGEE-II INFSO-RI Medical image analysis on EGEE, J. Montagnat, BioGrid, June 2, 2008
12 Computing resources provision Enabling Grids for E-sciencE Computing hours per EGEE federation in 2007 EGEE-II INFSO-RI Medical image analysis on EGEE, J. Montagnat, BioGrid, June 2, 2008
13 Biomed VO share Enabling Grids for E-sciencE Life sciences compared to other sciences CPU time Number of jobs Biomed VO EGEE-II INFSO-RI Medical image analysis on EGEE, J. Montagnat, BioGrid, June 2, 2008
14 Building on Production Grids Applications Applications Applications Applications Applications level Application-level services (high-level middleware) Application-level services (high-level middleware) Production grid infrastructure level Resources (low-level) Middleware Resources Resources Resources Resources Communication layer (GEANT, Internet...) EGEE-II INFSO-RI Medical image analysis on EGEE, J. Montagnat, BioGrid, June 2, 2008
15 WISDOM In silico Drug Discovery Enabling Grids for E-sciencE WISDOM: Goal: find new drugs for neglected and emerging diseases Neglected diseases lack R&D Emerging diseases require very rapid response time Need for an optimized environment To achieve production in a limited time To optimize performances Method: grid-enabled virtual docking Cheaper than in vitro tests Faster than in vitro tests EGEE-II INFSO-RI Application to Courtesy life sciences, of V. J. Breton Montagnat, June 18, 2008
16 Millions of chemical compounds available in laboratories High Throughput Virtual Docking Enabling Grids for E-sciencE Chemical compounds : ZINC Molecular docking : FlexX, Autodock Targets structures : PDB Grid infrastructure : EGEE Chemical compounds : Chembridge 500,000 Drug like 500,000 High Throughput Screening 1-10$/compound, nearly impossible ( Autodock Molecular docking (FlexX, ~80 CPU years, 1 TB data Computational data challenge ~6 weeks on ~1000/1600 computers Targets : ( 1lf3 Plasmepsin II (1lee, 1lf2, ( 1ls5 ) Plasmepsin IV Hits screening using assays performed on living cells Leads Clinical testing Drug EGEE-II INFSO-RI Application to life sciences, J. Montagnat, June 18,
17 The new WISDOM architecture Major new features: - improved data management - secure storage - migration to web service Credit: J. Salzeman, V. Bloch INFSO-RI Healthgrid March 8th, 2007 V. Breton 17
18 WISDOM-II: A huge international effort Significant contributions from several International grid infrastructures 1%2% 2% 3% 3% 3% 39% 3% 5% 6% 7% EGEE Germany Switzerland EGEE Asia Pacific EGEE Russia Auvergrid EuChinaGrid EELA EGEE South Western Europe EGEE Central Europe EGEE Northern Europe EGEE Italy EGEE South Eastern Europe EGEE France EGEE UKI 12% 15% Over 420 CPU years in 10 weeks A record throughput of docked compounds per hour INFSO-RI Healthgrid March 8th, 2007 V. Breton 18
19 The grid added value LPC Clermont-Ferrand: Biomedical grid Enabling Grids for E-sciencE The grid provides the centuries of CPU cycles required on demand The grid provides the reliable and secure data management services to store and replicate the biochemical inputs and outputs The grid offers a collaborative environment for the sharing of data in the research community on avian flu and malaria CEA, Acamba project: Biological targets, Chemogenomics SCAI Fraunhofer: Knowledge extraction, Chemoinformatics Univ. Modena: Biological targets, Molecular Dynamics Chonnam nat. univ.: In vitro testing New HealthGrid: Biomedical grid, Dissemination Univ. Los Andes: Biological targets, Malaria biology Univ. Pretoria: Bioinformatics, Malaria biology ITB CNR: Bioinformatics, Molecular modelling Academica Sinica: Grid user interface Biological targets In vitro testing EGEE-II INFSO-RI Application to Courtesy life sciences, of V. J. Breton Montagnat, June 18,
20 Statistics of deployment First Data Challenge: July 1st - August 15th 2005 Target: malaria 80 CPU years 1 TB of data produced 1700 CPUs used in parallel 1st large scale docking deployment world-wide on a e-infrastructure Second Data Challenge: April 15th - June 30th 2006 Target: avian flu 100 CPU years 800 GB of data produced 1700 CPUs used in parallel Collaboration initiated on March 1st: deployment preparation achieved in 45 days Third Data Challenge: October 1st - 15th December 2006 Target: malaria 400 CPU years 1,6 TB of data produced Up to 5000 CPUs used in parallel Very high docking throughput: > compounds per hour EGEE-II INFSO-RI Application to Courtesy life sciences, of V. J. Breton Montagnat, June 18,
21 Scientific objectives Molecular Bioinformatics: protein sequence analysis Analyse data from high-throughput Biology: complete genome projects, EST, complete proteomes, structural biology,. Integration of biological data and tools Method Provide Biologists with an usual Web interface: Web portal online since tools & 12 updated databases + 10,000,000 jobs & 5,000 jobs/day Ease the access to updated databases and algorithms. Protein databases are stored on grid storage as flat files. Legacy bioinformatics applications Wrapping usual binary in grid environment transparent remote access with local filesystem Display results in graphical Web interface. Status: Production Contact: Christophe.Blanchet@ibcp.fr GPSA: Bioinformatics Grid Portal EGEE-II INFSO-RI Application to Courtesy life sciences, of C. Blanchet J. Montagnat, June 18,
22 GPSA Results Enabling Grids for E-sciencE Grid-enabled bioinformatics resources 9 algorithms 3 protein databases Bioinformatics descriptors XML framework, Web portal, Web Services Encryption system: EncFile Security: AES, Key sharing, M-of-N Transparent access to grid files: Parrot EGEE-II INFSO-RI Application to Courtesy life sciences, of C. Blanchet J. Montagnat, June 18,
23 Pharmacokinetics UPV (University Polytechnic of Valencia) GATE LPC (CNRS Clermont Ferrand) SiMRI3D, ThIS, CAVIAR CREATIS (CNRS - Lyon) gptm3d LRI-LAL-LIMSI (CNRS - Orsay) Bronze Standard I3S (CNRS Sophia), INRIA SPM Alzheimer's U. of Genova / DEST EGEE Medical imaging sector LRMN EGEE-II INFSO-RI Medical image analysis on EGEE, J. Montagnat, BioGrid, June 2,
24 Pharmacokinetics: contrast agent diffusion Scientific objectives Study of contrast agent diffusion Computations Image sequences co-registration (independent computations) Parallel contrast agent diffusion modeling Features Application portal for use in clinical environment Enable on-line analysis of exams acquired daily Examination time ~15 minutes Exam analysis time ~2 hours 30 minutes EGEE-II INFSO-RI Medical image analysis on EGEE, J. Montagnat, BioGrid, June 2,
25 Scientific objectives Better understand MR physics. Study MR sequences in-silico. Study MR artefacts. Validate MR Image processing algorithms on synthetic yet realistic images. Method Simulate Bloch's electromagnetism equations. Parallel (MPI) implementation to speedup computations. SiMRI3D: MRI simulator LRMN D image simulation: ~100 CPU years. EGEE-II INFSO-RI Medical image analysis on EGEE, J. Montagnat, BioGrid, June 2,
26 Open GATE collaboration Scientific objectives Geant4 Application for Tomographic Emission Medical physics applications: PET camera simulation, radiotherapy, occular brachytherapy treatment Method GEANT4 base software to model physics of nuclear medicine. Use Monte Carlo simulation to improve accuracy of computations (as compared to the deterministic classical ( approach EGEE-II INFSO-RI Medical image analysis on EGEE, J. Montagnat, BioGrid, June 2,
27 GATE application to radiotherapy Enabling Grids for E-sciencE Accurate dose computation for radiotherapy planning Scanner slices: DICOM or DICOMRT format EGEE-II INFSO-RI Medical image analysis on EGEE, J. Montagnat, BioGrid, June 2,
28 Scientific objectives User-assisted reconstruction of 3D anatomical structures Quantification (e.g. volume measurements) Use models for augmented reality Computations Parallleization of segmentation processes Large number of short jobs Features ( real-time Highly interactive (almost Used in clinical environment Windows interface gptm3d: radiology analysis EGEE-II INFSO-RI Medical image analysis on EGEE, J. Montagnat, BioGrid, June 2,
29 MR Images collection (T1, T2, PD) Preprocessing Nonuniformity correction Intensity normalisation Use case: multiple sclerosis Software technologies for integration of process, data and knowledge in medical imaging Registration and resampling Stereotaxic registration Resampling Segmentation Brain segmentation Classification Segmentation 4 classes: - White matter - Gray matter - CSF - lesions Multimodal MRI Classification Quantification Statistical tests NeuroLOG ANR-06-TLOG-024 Medical image analysis on EGEE, J. Montagnat, BioGrid, June 2, Analysis
30 Large databases 256 3D images (test database) Experiments requirements Software technologies for integration of process, data and knowledge in medical imaging 120 3D images (mono-site clinical database) D images (multi-site clinical trial, TBs of data) Various computing tools 10 to 15 processing stages in the pipeline Computing power > 2 months sequential execution time Pipeline description and execution Workflow description Workflow manager (execution monitoring, restart on error...) NeuroLOG ANR-06-TLOG-024 Medical image analysis on EGEE, J. Montagnat, BioGrid, June 2,
31 Application to rigid registration algorithms evaluation Enabling Grids for E-sciencE T O 1 O 2 Unregistered Registered EGEE-II INFSO-RI Medical image analysis on EGEE, J. Montagnat, BioGrid, June 2, 2008
32 ~100 image pairs ~800 EGEE jobs Enabling Grids for E-sciencE Bronze standard workflow A B CrestLines Params Params Params Service Params Params Params PFMatchICP GetFromEGEE Yasmina Baladin PFRegister Params FormatConv GetFromEGEE GetFromEGEE GetFromEGEE WriteResults FormatConv FormatConv FormatConv WriteResults WriteResults WriteResults MethodToTest MultiTransfoTest EGEE-II INFSO-RI Accuracy Translation Accuracy Rotation Medical image analysis on EGEE, J. Montagnat, BioGrid, June 2, 2008
33 Embarassingly parallel problem Fast turn over of short jobs Data confidentiality Interactivity Fine grain parallelism (MPI) Workflow-based Portals / user interface Application specific requirements EGEE-II INFSO-RI Medical image analysis on EGEE, J. Montagnat, BioGrid, June 2,
34 Building on Production Grids Applications Applications Applications Applications Applications level Application-level services (high-level middleware) Application-level services (high-level middleware) Production grid infrastructure level Resources (low-level) Middleware Resources Resources Resources Resources Communication layer (GEANT, Internet...) EGEE-II INFSO-RI Medical image analysis on EGEE, J. Montagnat, BioGrid, June 2, 2008
35 WAN medical image exchange Cross-enterprise exchange of radiology reports and images Wide-area Health Information Network Radiology Enterprise B PACS B Radiology-to-radiology Physician Office Radiology-to- Physicians PACS A Radiology Enterprise A EGEE-II INFSO-RI Medical image analysis on EGEE, J. Montagnat, BioGrid, June 2,
36 Grid technologies Wide-area Health Information Network WAN medical image exchange Radiology Enterprise B PACS B Cross-enterprise authentication Access control Secured communications Efficient and reliable transfer PACS A Physician Office Radiology Enterprise A EGEE-II INFSO-RI Medical image analysis on EGEE, J. Montagnat, BioGrid, June 2,
37 Medical Data Manager Objectives Expose a standard grid interface (SRM) for medical image servers (DICOM) Use native DICOM storage format Fulfill medical applications security requirements Do not interfere with clinical practice DICOM Interface SRM Worker Nodes DICOM clients DICOM server User Interfaces EGEE-II INFSO-RI Medical image analysis on EGEE, J. Montagnat, BioGrid, June 2,
38 Medical Data Registration 1. Image is acquired 2. Image is stored in DICOM server 4. image metadata are registered AMGA Metadata 3. lcg client GFAL API 3a. Image is registered (a GUID is associated) 3b. Image key is produced and registered Hydra Key store LCG File Catalog EGEE-II INFSO-RI Medical image analysis on EGEE, J. Montagnat, BioGrid, June 2,
39 Medical Data Retrieval 1. get GUID from metadata AMGA Metadata User Interface Worker Node 7. get file key and decrypt file locally 2. lcg client 6. on-the-fly encryption and anonimyzation return encrypted file Key ACL control Hydra Key store GFAL API 5. get file key SRM-DICOM interface LCG File Catalog 3. get SURL from GUID 4. request file Metadata ACL control File ACL control Anonimization & encryption DICOM server EGEE-II INFSO-RI Medical image analysis on EGEE, J. Montagnat, BioGrid, June 2,
40 Grid Workflow Efficient Enactment for Data Intensive Applications Application workflows representation and efficient enactment on grids GWENDIA ANR-06-MDCA-009
41 Grid Workflow Efficient Enactment for Data Intensive Applications MOTEUR, I3S laboratory, CNRS High level interface Hides grid complexity to the user Service-based approach Legacy code service wrapper Scufl language (mygrid / Taverna) Pure data flow approach Grid submission interfaces EGEE (LCG2, glite) Grid5000 (OAR, DIET) Transparent parallelism exploitation Code and data parallelism Workflow management GWENDIA ANR-06-MDCA-009 Medical image analysis on EGEE, J. Montagnat, BioGrid, June 2,
42 Efficient parallel execution Grid Workflow Efficient Enactment for Data Intensive Applications A workflow naturally provides application parallelization MOTEUR transparently exploits 3 kinds of parallelism Workflow parallelism Data parallelism Service parallelism D 0, D 1 D 0, D 1 D 0, D 1 S 1 S 1 S 1 D 1 D 0 D 1 S 2 S 3 S 2 S 3 S 2 DS 30 Jobs grouping strategy in sequential branches in order to reduce grid latency MOTEUR GWENDIA ANR-06-MDCA-009 Medical image analysis on EGEE, J. Montagnat, BioGrid, June 2,
43 Conclusions Using the European grid infrastructure for production Regular usage in medical imaging community Sustained production since 2004 Closer to clinical end-users RSNA'07 demonstration (EGEE-MEDICUS) Molecular docking for drug discovery Higher level services More data protection (Role-based access control) Considering multiple actors involved in clinical procedures (patient, doctor, surgeon, radiologist, nurse, anesthetist..) Data semantics Metadata associated to data, Ontologies, Content-based data identification, indexing, mining... User interfaces Integrated in end-users environments; often specific to applications EGEE-II INFSO-RI Medical image analysis on EGEE, J. Montagnat, BioGrid, June 2,
Applications des grilles aux sciences du vivant
Institut des Grilles du CNRS Applications des grilles aux sciences du vivant V. Breton Credit: A. Da Costa, P. De Vlieger, J. Salzemann Introduction Grid technology provides services to do science differently,
More informationGrids for life sciences: status and perspectives
Grids for life sciences: status and perspectives V. Breton CNRS-IN2P3 Teratec 2009 Credit: A. Da Costa, P. De Vlieger, J. Salzemann http://clrpcsv.in2p3.fr Introduction Grid technology provides services
More informationUsing the Grid for the interactive workflow management in biomedicine. Andrea Schenone BIOLAB DIST University of Genova
Using the Grid for the interactive workflow management in biomedicine Andrea Schenone BIOLAB DIST University of Genova overview background requirements solution case study results background A multilevel
More informationEGEE-2 NA4 Biomed Bioinformatics in CNRS
Enabling Grids for E-sciencE EGEE-2 NA4 Biomed Bioinformatics in CNRS Christophe Blanchet Institute of Biology and Chemistry of Proteins Lyon, April 28, 2006 www.eu-egee.org Enabling Grids for E-sciencE
More informationA Secure Grid Medical Data Manager Interfaced to the glite Middleware
A Secure Grid Medical Data Manager Interfaced to the glite Middleware Johan Montagnat, Akos Frohner, Daniel Jouvenot, Christophe Pera, Peter Kunszt, Birger Koblitz, Nuno Santos, Charles Loomis, Romain
More informationFédération et analyse de données distribuées en imagerie biomédicale
Software technologies for integration of processes and data in neurosciences ConnaissancEs Distribuées en Imagerie BiomédicaLE Fédération et analyse de données distribuées en imagerie biomédicale Johan
More informationBridging clinical information systems and grid middleware: a Medical Data Manager
Bridging clinical information systems and grid middleware: a Medical Data Manager Johan Montagnat 1, Daniel Jouvenot 2, Christophe Pera 3, Ákos Frohner 4, Peter Kunszt 4, Birger Koblitz 4, Nuno Santos
More informationPlateforme de Calcul pour les Sciences du Vivant. SRB & glite. V. Breton. http://clrpcsv.in2p3.fr
SRB & glite V. Breton http://clrpcsv.in2p3.fr Introduction Goal: evaluation of existing technologies for data and tools integration and deployment Data and tools integration should be addressed using web
More informationEsqu Science Experiments For Computer Network
Vangelis Floros, GRNET S.A. 3 rd Open Source Software Conference March 22, 2008 NTUA, Athens Greece Introduction What is a Grid? What is escience? Large Scientific Grids The example of EGEE Building Grid
More informationglibrary: Digital Asset Management System for the Grid
glibrary: Digital Asset Management System for the Grid Antonio Calanducci INFN Catania EGEE User Forum Manchester, 09 th -11 th May 2007 www.eu-egee.org EGEE and glite are registered trademarks Outline
More informationEGEE a worldwide Grid infrastructure
EGEE a worldwide Grid infrastructure Fabrizio Gagliardi Project Director EGEE CERN, Switzerland IFIC, 6 October 2005 www.eu-egee.org Presentation overview Data intensive science and the rationale for Grid
More informationCluster, Grid, Cloud Concepts
Cluster, Grid, Cloud Concepts Kalaiselvan.K Contents Section 1: Cluster Section 2: Grid Section 3: Cloud Cluster An Overview Need for a Cluster Cluster categorizations A computer cluster is a group of
More informationEmbedded Systems in Healthcare. Pierre America Healthcare Systems Architecture Philips Research, Eindhoven, the Netherlands November 12, 2008
Embedded Systems in Healthcare Pierre America Healthcare Systems Architecture Philips Research, Eindhoven, the Netherlands November 12, 2008 About the Speaker Working for Philips Research since 1982 Projects
More informationInteroperability in Grid Computing
Anette Weisbecker, Fraunhofer IAO, Stuttgart 18 th April 2007 Special Interest Session III Outline: Interoperability in Grid Computing Grid Computing for Medicine and Life Science Interoperability Architecture
More informationThe CMS analysis chain in a distributed environment
The CMS analysis chain in a distributed environment on behalf of the CMS collaboration DESY, Zeuthen,, Germany 22 nd 27 th May, 2005 1 The CMS experiment 2 The CMS Computing Model (1) The CMS collaboration
More informationA demonstration of the use of Datagrid testbed and services for the biomedical community
A demonstration of the use of Datagrid testbed and services for the biomedical community Biomedical applications work package V. Breton, Y Legré (CNRS/IN2P3) R. Météry (CS) Credits : C. Blanchet, T. Contamine,
More informationThe glite File Transfer Service
The glite File Transfer Service Peter Kunszt Paolo Badino Ricardo Brito da Rocha James Casey Ákos Frohner Gavin McCance CERN, IT Department 1211 Geneva 23, Switzerland Abstract Transferring data reliably
More informationEvaluating Metadata access
Evaluating Metadata access strategies with the GOME test suite André Gemünd Fraunhofer SCAI www.eu-egee.org EGEE-II INFSO-RI-031688 EGEE and glite are registered trademarks Motivation Testing the test
More informationEDG Project: Database Management Services
EDG Project: Database Management Services Leanne Guy for the EDG Data Management Work Package EDG::WP2 Leanne.Guy@cern.ch http://cern.ch/leanne 17 April 2002 DAI Workshop Presentation 1 Information in
More informationAn approach to grid scheduling by using Condor-G Matchmaking mechanism
An approach to grid scheduling by using Condor-G Matchmaking mechanism E. Imamagic, B. Radic, D. Dobrenic University Computing Centre, University of Zagreb, Croatia {emir.imamagic, branimir.radic, dobrisa.dobrenic}@srce.hr
More informationSilviu Panica, Marian Neagul, Daniela Zaharie and Dana Petcu (Romania)
Silviu Panica, Marian Neagul, Daniela Zaharie and Dana Petcu (Romania) Outline Introduction EO challenges; EO and classical/cloud computing; EO Services The computing platform Cluster -> Grid -> Cloud
More informationTHE CCLRC DATA PORTAL
THE CCLRC DATA PORTAL Glen Drinkwater, Shoaib Sufi CCLRC Daresbury Laboratory, Daresbury, Warrington, Cheshire, WA4 4AD, UK. E-mail: g.j.drinkwater@dl.ac.uk, s.a.sufi@dl.ac.uk Abstract: The project aims
More informationE-mail: guido.negri@cern.ch, shank@bu.edu, dario.barberis@cern.ch, kors.bos@cern.ch, alexei.klimentov@cern.ch, massimo.lamanna@cern.
*a, J. Shank b, D. Barberis c, K. Bos d, A. Klimentov e and M. Lamanna a a CERN Switzerland b Boston University c Università & INFN Genova d NIKHEF Amsterdam e BNL Brookhaven National Laboratories E-mail:
More informationData Grids. Lidan Wang April 5, 2007
Data Grids Lidan Wang April 5, 2007 Outline Data-intensive applications Challenges in data access, integration and management in Grid setting Grid services for these data-intensive application Architectural
More informationDenis Caromel, CEO Ac.veEon. Orchestrate and Accelerate Applica.ons. Open Source Cloud Solu.ons Hybrid Cloud: Private with Burst Capacity
Cloud computing et Virtualisation : applications au domaine de la Finance Denis Caromel, CEO Ac.veEon Orchestrate and Accelerate Applica.ons Open Source Cloud Solu.ons Hybrid Cloud: Private with Burst
More informationBusiness applications:
Consorzio COMETA - Progetto PI2S2 UNIONE EUROPEA Business applications: the COMETA approach Prof. Antonio Puliafito University of Messina Open Grid Forum (OGF25) Catania, 2-6.03.2009 www.consorzio-cometa.it
More informationMicrosoft Research Worldwide Presence
Microsoft Research Worldwide Presence MSR India MSR New England Redmond Redmond, Washington Sept, 1991 San Francisco, California Jun, 1995 Cambridge, United Kingdom July, 1997 Beijing, China Nov, 1998
More informationGrid Computing Perspectives for IBM
Grid Computing Perspectives for IBM Atelier Internet et Grilles de Calcul en Afrique Jean-Pierre Prost IBM France jpprost@fr.ibm.com Agenda Grid Computing Initiatives within IBM World Community Grid Decrypthon
More informationCloud Ready for Bioinformatics?
IDB acknowledges co-funding by the European Community's Seventh Framework Programme (INFSO-RI-261552) and the French National Research Agency's Arpege Programme (ANR-10-SEGI-001) Cloud Ready for Bioinformatics?
More informationProtect, Share, and Distribute Healthcare Data with ehealth Managed Services
Protect, Share, and Distribute Healthcare Data with ehealth Managed Services An innovative offering CARESTREAM ehealth Managed Services include hardware, software, maintenance, remote monitoring, updates
More informationThe Lattice Project: A Multi-Model Grid Computing System. Center for Bioinformatics and Computational Biology University of Maryland
The Lattice Project: A Multi-Model Grid Computing System Center for Bioinformatics and Computational Biology University of Maryland Parallel Computing PARALLEL COMPUTING a form of computation in which
More informationDigital Mammogram National Database
Digital Mammogram National Database Professor Michael Brady FRS FREng Medical Vision Laboratory Oxford University Chairman: Mirada Solutions Ltd PharmaGrid 2/7/03 ediamond aims construct a federated database
More informationEGEE is a project funded by the European Union under contract IST-2003-508833
www.eu-egee.org NA4 Applications F.Harris(Oxford/CERN) NA4/HEP coordinator EGEE is a project funded by the European Union under contract IST-2003-508833 Talk Outline The basic goals of NA4 The organisation
More informationEuro-BioImaging European Research Infrastructure for Imaging Technologies in Biological and Biomedical Sciences
Euro-BioImaging European Research Infrastructure for Imaging Technologies in Biological and Biomedical Sciences WP11 Data Storage and Analysis Task 11.1 Coordination Deliverable 11.2 Community Needs of
More informationA Platform for Collaborative e-science Applications. Marian Bubak ICS / Cyfronet AGH Krakow, PL bubak@agh.edu.pl
A Platform for Collaborative e-science Applications Marian Bubak ICS / Cyfronet AGH Krakow, PL bubak@agh.edu.pl Outline Motivation Idea of an experiment Virtual laboratory Examples of experiments Summary
More informationThe Data Grid: Towards an Architecture for Distributed Management and Analysis of Large Scientific Datasets
The Data Grid: Towards an Architecture for Distributed Management and Analysis of Large Scientific Datasets!! Large data collections appear in many scientific domains like climate studies.!! Users and
More informationBig Data in BioMedical Sciences. Steven Newhouse, Head of Technical Services, EMBL-EBI
Big Data in BioMedical Sciences Steven Newhouse, Head of Technical Services, EMBL-EBI Big Data for BioMedical Sciences EMBL-EBI: What we do and why? Challenges & Opportunities Infrastructure Requirements
More informationDr Alexander Henzing
Horizon 2020 Health, Demographic Change & Wellbeing EU funding, research and collaboration opportunities for 2016/17 Innovate UK funding opportunities in omics, bridging health and life sciences Dr Alexander
More informationHPC Cloud. Focus on your research. Floris Sluiter Project leader SARA
HPC Cloud Focus on your research Floris Sluiter Project leader SARA Why an HPC Cloud? Christophe Blanchet, IDB - Infrastructure Distributing Biology: Big task to port them all to your favorite architecture
More informationBig Data in BioMedical Sciences. Steven Newhouse, Head of Technical Services, EMBL-EBI
Big Data in BioMedical Sciences Steven Newhouse, Head of Technical Services, EMBL-EBI Big Data for BioMedical Sciences EMBL-EBI: What we do and why? Challenges & Opportunities Infrastructure Requirements
More informationMOTEUR: A DATA-INTENSIVE SERVICE-BASED
LBORTOIRE INFORMTIQUE, SIGNUX ET SYSTÈMES DE SOPHI NTIPOLIS UMR 67 MOTEUR: DT-INTENSIVE SERVICE-BSED WORKFLOW MNGER Tristan Glatard, Johan Montagnat, Xavier Pennec, David Emsellem, Diane Lingrand Projet
More informationHOPE, an open platform for medical data management on the grid
HOPE, an open platform for medical data management on the grid M. Diarena, S. Nowa, Jean-Yves Boire, V. Bloch, D. Donnarieix, A. Fessy, B. Grenier, B. Irrthum, Yannick Legre, L. Maigne, et al. To cite
More informationRequirements for Complex Interactive Workflows in Biomedical Research. Jeffrey S. Grethe, BIRN-CC University of California, San Diego
Requirements for Complex Interactive Workflows in Biomedical Research Jeffrey S. Grethe, BIRN-CC University of California, San Diego e-science Workflow Services December 3, 2003 Scientific Workflows Laboratory
More informationInformatics and Knowledge Management at the Novartis Institutes for BioMedical Research (NIBR)
Informatics and Knowledge Management at the Novartis Institutes for BioMedical Research (NIBR) Enable Science in silico & Provide the Right Knowledge to the Right People at the Right Time to enable the
More informationUsing Ontologies in Proteus for Modeling Data Mining Analysis of Proteomics Experiments
Using Ontologies in Proteus for Modeling Data Mining Analysis of Proteomics Experiments Mario Cannataro, Pietro Hiram Guzzi, Tommaso Mazza, and Pierangelo Veltri University Magna Græcia of Catanzaro, 88100
More informationEMBL Identity & Access Management
EMBL Identity & Access Management Rupert Lück EMBL Heidelberg e IRG Workshop Zürich Apr 24th 2008 Outline EMBL Overview Identity & Access Management for EMBL IT Requirements & Strategy Project Goal and
More informationAnalyses on functional capabilities of BizTalk Server, Oracle BPEL Process Manger and WebSphere Process Server for applications in Grid middleware
Analyses on functional capabilities of BizTalk Server, Oracle BPEL Process Manger and WebSphere Process Server for applications in Grid middleware R. Goranova University of Sofia St. Kliment Ohridski,
More informationHISP: a data-driven portal for hadron therapy
HISP: a data-driven portal for hadron therapy Faustin Laurentiu Roman CERN / IFIC Prototype architecture Tools, implementation & services Conclusions (& demo) 1 One slide situation: ereferral and escience
More informationBuilding Platform as a Service for Scientific Applications
Building Platform as a Service for Scientific Applications Moustafa AbdelBaky moustafa@cac.rutgers.edu Rutgers Discovery Informa=cs Ins=tute (RDI 2 ) The NSF Cloud and Autonomic Compu=ng Center Department
More informationCENTRE FOR PARALLEL COMPUTING
CENTRE FOR PARALLEL COMPUTING HIGH PERFORMANCE CLOUD APPLICATION SUPPORT SERVICE (HP-CLASS) TECHNOLOGICAL INNOVATION HIGH PERFORMANCE CLOUD APPLICATION SUPPORT SERVICE SUPPORT SERVICES FOR HIGH PERFORMANCE
More informationVirtual research environments: learning gained from a situation and needs analysis for malaria researchers
Virtual research environments: learning gained from a situation and needs analysis for malaria researchers Martie van Deventer (CSIR), Heila Pienaar (UP), Jane Morris (ACGT) and Zoleka Ngcete (SAMI) African
More informationSustainable Grid User Support
Sustainable Grid User Support Dr. Torsten Antoni torsten.antoni@kit.edu www.eu-egee.org EGEE and glite are registered trademarks User education User support is Simple access to a broad range of information
More informationManaging Complexity in Distributed Data Life Cycles Enhancing Scientific Discovery
Center for Information Services and High Performance Computing (ZIH) Managing Complexity in Distributed Data Life Cycles Enhancing Scientific Discovery Richard Grunzke*, Jens Krüger, Sandra Gesing, Sonja
More informationDigital libraries of the future and the role of libraries
Digital libraries of the future and the role of libraries Donatella Castelli ISTI-CNR, Pisa, Italy Abstract Purpose: To introduce the digital libraries of the future, their enabling technologies and their
More informationAN APPROACH TO DEVELOPING BUSINESS PROCESSES WITH WEB SERVICES IN GRID
AN APPROACH TO DEVELOPING BUSINESS PROCESSES WITH WEB SERVICES IN GRID R. D. Goranova 1, V. T. Dimitrov 2 Faculty of Mathematics and Informatics, University of Sofia S. Kliment Ohridski, 1164, Sofia, Bulgaria
More informationSoftware Process. Oliver Keeble SA3 EGEE-II final EU Review CERN 08-07-2008. www.eu-egee.org. EGEE and glite are registered trademarks
Software Process Quality Metrics Oliver Keeble SA3 EGEE-II final EU Review CERN 08-07-2008 www.eu-egee.org egee EGEE-II INFSO-RI-031688 EGEE and glite are registered trademarks Summary Enabling Grids for
More informationExperiences with the GLUE information schema in the LCG/EGEE production Grid
Experiences with the GLUE information schema in the LCG/EGEE production Grid Stephen Burke, Sergio Andreozzi and Laurence Field CHEP07, Victoria, Canada www.eu-egee.org EGEE and glite are registered trademarks
More informationAccelerating Hadoop MapReduce Using an In-Memory Data Grid
Accelerating Hadoop MapReduce Using an In-Memory Data Grid By David L. Brinker and William L. Bain, ScaleOut Software, Inc. 2013 ScaleOut Software, Inc. 12/27/2012 H adoop has been widely embraced for
More informationExisting concepts and experiences with interoperability initiatives
Existing concepts and experiences with interoperability initiatives Geert Claeys, M. Sc. Co-Chairman Europe Technology Manager, Agfa Healthcare/R&D Topics Interoperability problems in healthcare process
More informationGeant4 simulations using grid computing
Geant4 simulations using grid computing Lydia Maigne, PhD LPC, PCSV team, Clermont-Ferrand maigne@clermont.in2p3.fr http://clrpcsv.in2p3.fr Crédits: V. Breton, Y. Legré, C.O. Thiam, A. Fessy, M. Diarena
More informationWeb Service Based Data Management for Grid Applications
Web Service Based Data Management for Grid Applications T. Boehm Zuse-Institute Berlin (ZIB), Berlin, Germany Abstract Web Services play an important role in providing an interface between end user applications
More informationOpen & Big Data for Life Imaging Technical aspects : existing solutions, main difficulties. Pierre Mouillard MD
Open & Big Data for Life Imaging Technical aspects : existing solutions, main difficulties Pierre Mouillard MD What is Big Data? lots of data more than you can process using common database software and
More informationScientific and Technical Applications as a Service in the Cloud
Scientific and Technical Applications as a Service in the Cloud University of Bern, 28.11.2011 adapted version Wibke Sudholt CloudBroker GmbH Technoparkstrasse 1, CH-8005 Zurich, Switzerland Phone: +41
More informationGEDDM - Commercial Data Mining Using Distributed Resources
GEDDM - Commercial Data Mining Using Distributed Resources Mark Prentice 1 st December 2004 Introduction Industrial partner Overview of GEDDM project Application areas Grid enabled implementation Current
More informationCMS Dashboard of Grid Activity
Enabling Grids for E-sciencE CMS Dashboard of Grid Activity Julia Andreeva, Juha Herrala, CERN LCG ARDA Project, EGEE NA4 EGEE User Forum Geneva, Switzerland March 1-3, 2006 http://arda.cern.ch ARDA and
More informationHandling next generation sequence data
Handling next generation sequence data a pilot to run data analysis on the Dutch Life Sciences Grid Barbera van Schaik Bioinformatics Laboratory - KEBB Academic Medical Center Amsterdam Very short intro
More informationA new innovation to protect, share, and distribute healthcare data
A new innovation to protect, share, and distribute healthcare data ehealth Managed Services ehealth Managed Services from Carestream Health CARESTREAM ehealth Managed Services (ems) is a specialized healthcare
More informationGrids Computing and Collaboration
Grids Computing and Collaboration Arto Teräs CSC, the Finnish IT center for science University of Pune, India, March 12 th 2007 Grids Computing and Collaboration / Arto Teräs 2007-03-12 Slide
More informationFor large geographically dispersed companies, data grids offer an ingenious new model to economically share computing power and storage resources
Data grids for storage http://storagemagazine.techtarget.com/magitem/0,291266,sid35_gci1132545,00.html by: Ray Lucchesi Storage Magazine Issue: Oct 2005 For large geographically dispersed companies, data
More informationA PRACTICAL APPROACH FOR A WORKFLOW MANAGEMENT SYSTEM
A PRACTICAL APPROACH FOR A WORKFLOW MANAGEMENT SYSTEM Simone Pellegrini, Francesco Giacomini, Antonia Ghiselli INFN Cnaf Viale Berti Pichat, 6/2-40127 Bologna, Italy simone.pellegrini@cnaf.infn.it francesco.giacomini@cnaf.infn.it
More informationBioinformatics Grid - Enabled Tools For Biologists.
Bioinformatics Grid - Enabled Tools For Biologists. What is Grid-Enabled Tools (GET)? As number of data from the genomics and proteomics experiment increases. Problems arise for the current sequence analysis
More informationHEP Data-Intensive Distributed Cloud Computing System Requirements Specification Document
HEP Data-Intensive Distributed Cloud Computing System Requirements Specification Document CANARIE NEP-101 Project University of Victoria HEP Computing Group December 18, 2013 Version 1.0 1 Revision History
More informationDisributed Query Processing KGRAM - Search Engine TOP 10
fédération de données et de ConnaissancEs Distribuées en Imagerie BiomédicaLE Data fusion, semantic alignment, distributed queries Johan Montagnat CNRS, I3S lab, Modalis team on behalf of the CrEDIBLE
More informationBob Jones Technical Director bob.jones@cern.ch
Bob Jones Technical Director bob.jones@cern.ch CERN - August 2003 EGEE is proposed as a project to be funded by the European Union under contract IST-2003-508833 EGEE Goal & Strategy Goal: Create a wide
More informationirods and Metadata survey Version 0.1 Date March Abhijeet Kodgire akodgire@indiana.edu 25th
irods and Metadata survey Version 0.1 Date 25th March Purpose Survey of Status Complete Author Abhijeet Kodgire akodgire@indiana.edu Table of Contents 1 Abstract... 3 2 Categories and Subject Descriptors...
More informationSINTERO SERVER. Simplifying interoperability for distributed collaborative health care
SINTERO SERVER Simplifying interoperability for distributed collaborative health care Tim Benson, Ed Conley, Andrew Harrison, Ian Taylor COMSCI, Cardiff University What is Sintero? Sintero Server is a
More informationirods at CC-IN2P3: managing petabytes of data
Centre de Calcul de l Institut National de Physique Nucléaire et de Physique des Particules irods at CC-IN2P3: managing petabytes of data Jean-Yves Nief Pascal Calvat Yonny Cardenas Quentin Le Boulc h
More informationImage Area. View Point. Medical Imaging. Advanced Imaging Solutions for Diagnosis, Localization, Treatment Planning and Monitoring. www.infosys.
Image Area View Point Medical Imaging Advanced Imaging Solutions for Diagnosis, Localization, Treatment Planning and Monitoring www.infosys.com Over the years, medical imaging has become vital in the early
More informationPartitionning medical image databases for content-based queries on a grid
Partitionning medical image databases for content-based queries on a grid Johan Montagnat, Vincent BRETON, Isabelle Magnin To cite this version: Johan Montagnat, Vincent BRETON, Isabelle Magnin. Partitionning
More informationThe ENEA-EGEE site: Access to non-standard platforms
V INFNGrid Workshop Padova, Italy December 18-20 2006 The ENEA-EGEE site: Access to non-standard platforms C. Sciò**, G. Bracco, P. D'Angelo, L. Giammarino*, S.Migliori, A. Quintiliani, F. Simoni, S. Podda
More informationAn objective comparison test of workload management systems
An objective comparison test of workload management systems Igor Sfiligoi 1 and Burt Holzman 1 1 Fermi National Accelerator Laboratory, Batavia, IL 60510, USA E-mail: sfiligoi@fnal.gov Abstract. The Grid
More informationRoberto Barbera. Centralized bookkeeping and monitoring in ALICE
Centralized bookkeeping and monitoring in ALICE CHEP INFN 2000, GRID 10.02.2000 WP6, 24.07.2001 Roberto 1 Barbera ALICE and the GRID Phase I: AliRoot production The GRID Powered by ROOT 2 How did we get
More informationMining Large Datasets: Case of Mining Graph Data in the Cloud
Mining Large Datasets: Case of Mining Graph Data in the Cloud Sabeur Aridhi PhD in Computer Science with Laurent d Orazio, Mondher Maddouri and Engelbert Mephu Nguifo 16/05/2014 Sabeur Aridhi Mining Large
More informationAlternative Deployment Models for Cloud Computing in HPC Applications. Society of HPC Professionals November 9, 2011 Steve Hebert, Nimbix
Alternative Deployment Models for Cloud Computing in HPC Applications Society of HPC Professionals November 9, 2011 Steve Hebert, Nimbix The case for Cloud in HPC Build it in house Assemble in the cloud?
More informationOn Enabling Hydrodynamics Data Analysis of Analytical Ultracentrifugation Experiments
On Enabling Hydrodynamics Data Analysis of Analytical Ultracentrifugation Experiments 18. June 2013 Morris Reidel, Shahbaz Memon, et al. Outline Background Ultrascan Application Ultrascan Software Components
More informationSoaplab - a unified Sesame door to analysis tools
Soaplab - a unified Sesame door to analysis tools Martin Senger, Peter Rice, Tom Oinn European Bioinformatics Institute, Wellcome Trust Genome Campus, Cambridge, UK http://industry.ebi.ac.uk/soaplab Abstract
More informationEfficient and Scalable Climate Metadata Management with the GRelC DAIS
Efficient and Scalable Climate Metadata Management with the GRelC DAIS G. Aloisio, S. Fiore CMCC Scientific Computing and Operations Division University of Salento, Lecce Context : countdown of the Intergovernmental
More informationAuthentication and authorisation prototype on the µgrid for medical data management *
Authentication and authorisation prototype on the µgrid for medical data management * Ludwig Seitz 1, Johan Montagnat 2, Jean-Marc Pierson 1, Didier Oriol 1, Diane Lingrand 2 1 LIRIS FRE CNRS 2672, INSA
More informationClassic Grid Architecture
Peer-to to-peer Grids Classic Grid Architecture Resources Database Database Netsolve Collaboration Composition Content Access Computing Security Middle Tier Brokers Service Providers Middle Tier becomes
More informationAnwendungsintegration und Workflows mit UNICORE 6
Mitglied der Helmholtz-Gemeinschaft Anwendungsintegration und Workflows mit UNICORE 6 Bernd Schuller und UNICORE-Team Jülich Supercomputing Centre, Forschungszentrum Jülich GmbH 26. November 2009 D-Grid
More informationBig Data Mining Services and Knowledge Discovery Applications on Clouds
Big Data Mining Services and Knowledge Discovery Applications on Clouds Domenico Talia DIMES, Università della Calabria & DtoK Lab Italy talia@dimes.unical.it Data Availability or Data Deluge? Some decades
More informationEnabling multi-cloud resources at CERN within the Helix Nebula project. D. Giordano (CERN IT-SDC) HEPiX Spring 2014 Workshop 23 May 2014
Enabling multi-cloud resources at CERN within the Helix Nebula project D. Giordano (CERN IT-) HEPiX Spring 2014 Workshop This document produced by Members of the Helix Nebula consortium is licensed under
More informationWrangler: A New Generation of Data-intensive Supercomputing. Christopher Jordan, Siva Kulasekaran, Niall Gaffney
Wrangler: A New Generation of Data-intensive Supercomputing Christopher Jordan, Siva Kulasekaran, Niall Gaffney Project Partners Academic partners: TACC Primary system design, deployment, and operations
More informationShanoir: So*ware as a Service Environment to Manage Popula9on Imaging Research Repositories
Shanoir: So*ware as a Service Environment to Manage Popula9on Imaging Research Repositories Chris&an Barillot, Elise Bannier, Olivier Commowick, Isabelle Corouge, Jus&ne Guillaumont, Yao Yao, Michael Kain
More informationThe Fusion of Supercomputing and Big Data. Peter Ungaro President & CEO
The Fusion of Supercomputing and Big Data Peter Ungaro President & CEO The Supercomputing Company Supercomputing Big Data Because some great things never change One other thing that hasn t changed. Cray
More informationEvery area of science is now engaged in data-intensive research Researchers need Technology to publish and share data in the cloud Data analytics
Talk Outline The 4 th paradigm of science The genesis of big data analysis in the cloud : searching the web The revolution in machine learning Examples The n-gram and language translation Recognizing images
More informationBig Data in Enterprise challenges & opportunities. Yuanhao Sun 孙 元 浩 yuanhao.sun@intel.com Software and Service Group
Big Data in Enterprise challenges & opportunities Yuanhao Sun 孙 元 浩 yuanhao.sun@intel.com Software and Service Group Big Data Phenomenon 1.8ZB in 2011 2 Days > the dawn of civilization to 2003 750M Photos
More informationA Taxonomy and Survey of Grid Resource Planning and Reservation Systems for Grid Enabled Analysis Environment
A Taxonomy and Survey of Grid Resource Planning and Reservation Systems for Grid Enabled Analysis Environment Arshad Ali 3, Ashiq Anjum 3, Atif Mehmood 3, Richard McClatchey 2, Ian Willers 2, Julian Bunn
More informationPersonalized Medicine: Humanity s Ultimate Big Data Challenge. Rob Fassett, MD Chief Medical Informatics Officer Oracle Health Sciences
Personalized Medicine: Humanity s Ultimate Big Data Challenge Rob Fassett, MD Chief Medical Informatics Officer Oracle Health Sciences 2012 Oracle Corporation Proprietary and Confidential 2 3 Humanity
More informationConcepts and Architecture of the Grid. Summary of Grid 2, Chapter 4
Concepts and Architecture of the Grid Summary of Grid 2, Chapter 4 Concepts of Grid Mantra: Coordinated resource sharing and problem solving in dynamic, multi-institutional virtual organizations Allows
More information