Advances in the LIBI Project, a Virtual Laboratory for Bioinformatics
1 Advances in the LIBI Project, a Virtual Laboratory for Bioinformatics
Maria Mirto, on behalf of Giovanni ALOISIO
- Giovanni ALOISIO (SPACI Consortium & University of Salento, Lecce)
- Giorgio MAGGI (INFN Sezione di Bari)
- Pietro LEO (IBM GBS Innovation Lab, Bari)
- Elda ROSSI (CINECA, Bologna)
- Rita CASADIO (Biocomputing Group & CIRB/CIG-Department of Biology, University of Bologna)
- Graziano PESOLE, Cecilia SACCONE (Project Coordinator) (ITB-CNR, Bari & Dipartimento di Biochimica e Biologia Molecolare E. Quagliariello, Università di Bari & Dipartimento di Scienze Biomolecolari e Biotecnologie, Università di Milano)
4th EGEE User Forum/OGF 25 - Grid Projects and Collaborations, March 5, 2009
2 Outline
- The LIBI Project
- Architecture
- Grid Infrastructure
- Services and basic frameworks
- Applications Porting
- Conclusions and Future Work
3 LIBI Project
FIRB LIBI: International Laboratory of BioInformatics, funded by the MIUR (Italian Ministry for Education, University and Research).
Goal: setting up an advanced Bioinformatics and Computational Biology Laboratory, focusing on the central activities of basic and applied research in modern Biology and Biotechnologies.
Biological activities:
- the construction and maintenance of general genomic, proteomic and transcriptomic databases (e.g. ENSEMBL) as well as specialized databases developed by LIBI partners (e.g. MitoRes, UTRdb, UTRsite, ASPicDB, CSTdb, etc.);
- the design and implementation of new algorithms and software for the analysis of genomes and their expression products and for the prediction of the structure of proteins.
Technological activities:
- building a technological framework (Grid PSE) that, by guaranteeing interoperability between different grid middleware (gLite, Unicore, Globus), provides an environment for composing, executing and monitoring complex experiments in Bioinformatics;
- optimizing Bioinformatics applications and porting them to the Grid.
4 LIBI Partners
Two kinds of actors have equal responsibility in LIBI: technological and bioinformatics partners.
Technological RUs:
- University of Salento, SPACI Consortium, Lecce (Prof. Giovanni Aloisio)
- INFN Sections of Bari, Catania and Padova and CNAF-Bologna (Dr. Mirco Mazzucato)
- IBM Italia S.p.A - IBM Innovation Lab, Bari (Dr. Luigi Di Pace)
- CINECA, Bologna (Dr. Elda Rossi)
Bioinformatics RUs:
- CNR Institute for Biomedical Technologies, Bari (Prof. Cecilia Saccone, project coordinator)
- University of Bologna, Biocomputing Group, Bologna (Prof. Rita Casadio)
- University of Milano (Prof. Graziano Pesole)
- Centro di Biomedicina Molecolare, Trieste (Prof. Claudio Schneider)
Associate RUs:
- University of Milano-Bicocca (Prof. Giancarlo Mauri)
- DIBIT-HSR, Milan (Dr. Giovanni Lavorgna)
- TIGEM, Naples (Dr. Sandro Banfi, Dr. Elia Stupka)
- University of Rome - Tor Vergata (Prof. Manuela Helmer-Citterich)
- University of Rome (Prof. Anna Tramontano)
- CASPUR, Rome (Dr. Tiziana Castrignanò)
- Dipartimento Interateneo di Fisica, Bari (Prof. Giorgio Maggi)
- University of Bari (Dip. Informatica, Prof. Donato Malerba; Dip. Biochimica e Biologia Molecolare, Prof. Marcella Attimonelli)
- Exhicon srl, Unità Bioinformatica, Bari (Dr. Graziano Pappadà)
5 LIBI Partners
[Map of partner sites: Torino (UNITO), Milano (UNIMI), Bologna (CINECA, UNIBO), Trieste (CBMTS), Roma (CASPUR), Bari (INFN, ITB, IBM), Lecce (SPACI). Legend: Scientific Unit, Technological Unit.]
6 The LIBI Architecture
[Diagram: layered architecture, from top to bottom]
- Researchers (team work)
- Web-based Portal
- Bioinformatic Research Applications
- Bioinformatic and Text Analytics Tools and Services; Bioinformatic Workflows Tools and Services
- Virtualization (knowledge & computing warehouse)
- gLite, Unicore, Globus
- Physical Databases & Computing Resources
7 The LIBI e-infrastructure
- Based on IGI (the Italian Grid infrastructure) and on the EGEE European Grid infrastructure
- Italian sites available for LIBI activity: ~10 INFN-Grid sites enabled for LIBI/BIO VOs job submission (~10% of available resources => up to 1500 CPU cores reachable)
- Up to CPU cores reachable using the biomed VO all over the EGEE Grid
- 6 RBs at different sites: Catania, 2 at CNAF, Ferrara, Padova, Bari
- DB services: GRelC, AMGA, OGSA-DAI and GDSE servers installed at INFN-Bari; high-availability servers that can provide up to 1 TB of data
- LFC (Grid file catalogue): installed at CNAF and at INFN-Bari
8 The LIBI Services and basic frameworks
- GRB Grid Portal
- Workflow Management System: Editor, Engine (Meta Scheduler)
- Job Submission Tool (JST)
- Bioinformatic Data Federation Service for managing and accessing the LIBI Federated DBs: LIBI federator server; GRelC DAS plug-in for DB2
- Text Analytics 2.0 Framework
Several bioinformatics applications have been deployed and optimized on the LIBI Grid platform, such as BLAST, PSI-BLAST, PatSearch, DNAFan, FT-COMAR, Antihunter, Gromacs and NAMD, MrBayes, Gene Analogous Finder, CSTMiner/Genominer, WeederWeb, RNAProfile, Exalign, ASPIC, PAML and R-www (submission of R jobs through a web form).
9 GRB Grid Portal It is a Grid Portal with the following services: Grid Configuration Profile Credential VO Management Resource Management Database Management Software Management View Configuration Resource Status Applications Transfer
10 Job Submission & Monitoring
11 Meta Scheduler Scenario
Grid Middleware: Interoperability - the Big Issue
[Diagram: nodes labelled A, B, C and D]
12 Meta Scheduler Features
- We have developed a Web Service component that supports the submission and monitoring of workflows and of batch, MPI and parameter sweep jobs distributed on gLite, Globus and Unicore based grids;
- Several libraries, plugged into the meta scheduler, provide core functions for job submission/monitoring and for data transfer using the GridFTP protocol;
- Jobs are described with the OGF-compliant JSDL specification;
- Automatic converters translate JSDL into the specific grid languages used for submission;
- Support for applications wrapped as Web Services (work in progress).
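The converter idea on this slide can be illustrated with a minimal sketch. It is not the LIBI meta scheduler code: `JobSpec` is a hypothetical, heavily simplified stand-in for a JSDL document, and only a handful of common JDL and RSL attributes are emitted. Unicore AJOs are serialized Java objects and are left out.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class JobSpec:
    """Hypothetical, simplified stand-in for a JSDL job description."""
    executable: str
    arguments: List[str] = field(default_factory=list)
    stdout: str = "std.out"
    stderr: str = "std.err"


def to_jdl(job: JobSpec) -> str:
    """Render the job as a gLite-style JDL description."""
    args = " ".join(job.arguments)
    sandbox = ", ".join(f'"{name}"' for name in (job.stdout, job.stderr))
    lines = [
        "[",
        f'  Executable = "{job.executable}";',
        f'  Arguments = "{args}";',
        f'  StdOutput = "{job.stdout}";',
        f'  StdError = "{job.stderr}";',
        f"  OutputSandbox = {{{sandbox}}};",
        "]",
    ]
    return "\n".join(lines)


def to_rsl(job: JobSpec) -> str:
    """Render the job as a Globus GRAM-style RSL string."""
    parts = [f"(executable={job.executable})"]
    if job.arguments:
        quoted = " ".join(f'"{arg}"' for arg in job.arguments)
        parts.append(f"(arguments={quoted})")
    parts.append(f"(stdout={job.stdout})")
    parts.append(f"(stderr={job.stderr})")
    return "&" + "".join(parts)


# One converter per target middleware; the keys are illustrative names.
CONVERTERS: Dict[str, Callable[[JobSpec], str]] = {
    "glite": to_jdl,
    "globus": to_rsl,
}


def convert(job: JobSpec, middleware: str) -> str:
    """Pick the converter that matches the target grid."""
    return CONVERTERS[middleware](job)


if __name__ == "__main__":
    job = JobSpec("/usr/bin/blastall", ["-p", "blastp"])
    print(convert(job, "glite"))
    print(convert(job, "globus"))
```

Keeping one neutral description and a table of converters means a new middleware only needs one more function, which is the interoperability point the slide makes.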
13 WFMS in action
[Diagram: a workflow of jobs A, B and C is composed in the Workflow Editor of the LIBI Portal and sent as JSDL, with a submission request, to the Metascheduler. The Metascheduler converts the jobs into JDL (job A), RSL (job B) and AJO (job C) and reports each job's status until it is DONE. Components shown: GRB, WMS, LB, WMProxy, NJS (Network Job Supervisor).]
14 The Job Submission Tool
- JST is a tool developed (and initially used) by a few expert operators for interactive job submission in bioinformatics grid challenges.
- Recently, JST has been upgraded to provide grid job submission services to already existing (non-grid) bioportals.
- The portal does not need to implement any gLite submission procedure: it only has to perform an SQL insert into a DB server.
- The JST daemons take care of submitting and controlling jobs, resubmitting failed jobs and collecting the final output.
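The portal/daemon split described on this slide can be sketched as follows. This is an illustration only: the `jobs` table, its columns, the status names and the retry limit are all assumptions, SQLite stands in for the DB server, and the `submit` callback stands in for the real grid submission.

```python
import sqlite3
from typing import Callable

MAX_ATTEMPTS = 3  # assumed retry limit, not taken from the slides

# Hypothetical job table shared by the portal and the daemon.
SCHEMA = """
CREATE TABLE IF NOT EXISTS jobs (
    id INTEGER PRIMARY KEY AUTOINCREMENT,
    command TEXT NOT NULL,
    status TEXT NOT NULL DEFAULT 'QUEUED',
    attempts INTEGER NOT NULL DEFAULT 0
)
"""


def portal_request(conn: sqlite3.Connection, command: str) -> int:
    """All the portal has to do: a single SQL INSERT."""
    cur = conn.execute("INSERT INTO jobs (command) VALUES (?)", (command,))
    conn.commit()
    return cur.lastrowid


def daemon_pass(conn: sqlite3.Connection, submit: Callable[[str], bool]) -> None:
    """One polling pass: run queued jobs and requeue the ones that fail."""
    rows = conn.execute(
        "SELECT id, command, attempts FROM jobs WHERE status = 'QUEUED'"
    ).fetchall()
    for job_id, command, attempts in rows:
        succeeded = submit(command)
        attempts += 1
        if succeeded:
            status = "DONE"
        elif attempts < MAX_ATTEMPTS:
            status = "QUEUED"  # picked up again on the next pass
        else:
            status = "FAILED"
        conn.execute(
            "UPDATE jobs SET status = ?, attempts = ? WHERE id = ?",
            (status, attempts, job_id),
        )
    conn.commit()


if __name__ == "__main__":
    conn = sqlite3.connect(":memory:")
    conn.execute(SCHEMA)
    portal_request(conn, "blastall -p blastp -i query.fasta")
    daemon_pass(conn, lambda command: True)  # pretend the grid job succeeded
    print(conn.execute("SELECT id, status, attempts FROM jobs").fetchall())
```

The database acts as the only interface between the two sides, so an existing portal needs no grid client libraries at all.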
15 PORTALs-JST Interaction
[Diagram: a portal either triggers classical execution on its own limited resources or communicates with JST, which submits the jobs to the Grid.]
16 Data Federation
The LIBI Data Federation Service provides a Federated Schema Assimilation Model that can be accessed through a standard and transparent SQL interface and is exposed to the Grid through GRelC.
It wraps, in real time, a number of local and remote biological databases that store information in their original formats, including relational data, Web Services, flat files, EMBL/FASTA formats, etc.
Wrappers:
- Relational Wrapper: MitoRes, UTRSite, UTREF, HmtDB
- Entrez Wrapper: PubMed, GenBank, OMIM
- EMBL Wrapper
Examples of the benefits provided by the federation layer when accessing EMBL/FASTA formats:
- The ability to view EMBL and FASTA DBs synoptically, federated together with other, heterogeneous DBs
- A fast and efficient retrieval system for entries in EMBL/FASTA format from large DBs (sub-second response times)
- Customizable indexing of both textual and non-textual fields of the EMBL entries (with or without tokenization, etc.), which also enables mining features
- Support for multiplatform and distributed deployment
Example EMBL entry:
ID AB standard; RNA; PRI; 368 BP.
XX
AC AB000263;
XX
DE Homo sapiens mRNA for prepro cortistatin like peptide, complete cds.
XX
SQ Sequence 368 BP;
acaagatgcc attgtccccc ggcctcctgc tgctgctgct ctccggggcc acggccaccg 60
ctgccctgcc cctggagggt ggccccaccg gccgagacag cgagcatatg caggaagcgg 120
agaccttctc ctcctgcaaa taaaacctca cccatgaatg ctcacgcaag tttaattaca 360
gacctgaa 368
//
[Chart: improved response times for UNIPROT, EMBL_CDS and EMBL in EMBL/FASTA formats, with the old index type versus the new index type.]
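To make the idea of indexing EMBL entries concrete, here is a small sketch that parses EMBL flat-file text and builds an in-memory lookup by accession number. It is only an illustration of the format and of keyed retrieval, not the LIBI wrapper: the function names are invented, the sample entry is synthetic, and a real index over large databases would live on disk.

```python
from typing import Dict, Iterator, List


def split_entries(text: str) -> Iterator[List[str]]:
    """Yield each entry as a list of lines; '//' terminates an entry."""
    entry: List[str] = []
    for line in text.splitlines():
        if line.startswith("//"):
            if entry:
                yield entry
            entry = []
        elif line.strip():
            entry.append(line)
    if entry:  # tolerate a missing final terminator
        yield entry


def parse_entry(lines: List[str]) -> Dict[str, str]:
    """Extract accession, description and sequence from one entry."""
    record = {"accession": "", "description": "", "sequence": ""}
    in_sequence = False
    for line in lines:
        code, rest = line[:2], line[2:].strip()
        if in_sequence:
            # Sequence lines hold blocks of bases plus a trailing position
            # counter, so keep the letters only.
            record["sequence"] += "".join(ch for ch in line if ch.isalpha())
        elif code == "AC":
            if not record["accession"]:
                record["accession"] = rest.split(";")[0].strip()
        elif code == "DE":
            record["description"] = (record["description"] + " " + rest).strip()
        elif code == "SQ":
            in_sequence = True
    return record


def build_index(text: str) -> Dict[str, Dict[str, str]]:
    """Map primary accession number to the parsed record."""
    return {rec["accession"]: rec for rec in map(parse_entry, split_entries(text))}


# Synthetic entry for demonstration only (not a real database record).
SAMPLE = (
    "ID   X1; SV 1; linear; mRNA; STD; HUM; 20 BP.\n"
    "XX\n"
    "AC   X00001;\n"
    "XX\n"
    "DE   Example entry,\n"
    "DE   complete cds.\n"
    "XX\n"
    "SQ   Sequence 20 BP;\n"
    "     acaagatgcc attgtccccc        20\n"
    "//\n"
)

if __name__ == "__main__":
    index = build_index(SAMPLE)
    print(index["X00001"]["description"])
    print(index["X00001"]["sequence"])
```

Retrieval by accession then becomes a dictionary lookup instead of a scan of the flat file, which is the kind of gain the response-time comparison on the slide refers to.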
17 Applications porting
- GAF (Gene Analogous Finder): 6000 jobs submitted on the Grid; more than 200 WNs used.
- ASPic (Alternative Splicing Prediction): one complete genome (mouse) analyzed in 3 days.
- CSTminer (conserved tracts identification): 2 months for the full comparison of the human and mouse genomes, with ~900 WNs used; 1 day for one genome comparison using the optimized algorithm, developed using the lessons learned in the first run; 1 week for a full comparison of the Vitis genome.
- BLAST (large-scale genome comparison using the BLAST program): dataset of 599 complete genomes (2,624,555 protein sequences in FASTA format); the complete all-against-all comparison, carried out on the Grid, took about 1 week.
- PAML (maximum likelihood analysis using approximate derivatives): 6690 cases evaluated in 36 hours.
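Runs such as the all-against-all BLAST comparison rely on cutting a large FASTA dataset into many independent grid jobs. The slides do not say how LIBI partitioned the data, so the following is only a sketch of one common approach: read the records and group them into fixed-size batches, one batch per job. The function names and the batch size are assumptions.

```python
from itertools import islice
from typing import Iterable, Iterator, List, Tuple

Record = Tuple[str, str]  # (header without '>', sequence)


def read_fasta(lines: Iterable[str]) -> Iterator[Record]:
    """Parse FASTA lines into (header, sequence) pairs."""
    header = None
    chunks: List[str] = []
    for raw in lines:
        line = raw.strip()
        if not line:
            continue
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        elif header is not None:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def batches(records: Iterable[Record], size: int) -> Iterator[List[Record]]:
    """Group records into lists of at most `size`, one list per grid job."""
    iterator = iter(records)
    while True:
        batch = list(islice(iterator, size))
        if not batch:
            return
        yield batch


if __name__ == "__main__":
    fasta = [">a", "ACDE", "FGHI", ">b", "KLMN", ">c", "PQRS"]
    for job_number, batch in enumerate(batches(read_fasta(fasta), 2)):
        print(job_number, [header for header, _ in batch])
```

Each batch can then be written to its own query file and submitted as one job of a parameter sweep, so a failed job only forces the recomputation of its own batch.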
18 Applications porting
- FT-COMAR (protein tertiary structure prediction): jobs executed in 5 days.
- MrBayes (Bayesian inference of phylogeny): over 7200 different runs; ~5 CPU-years; ~20 days of running on the EGEE infrastructure.
- Emboss vrnalfold (applications for molecular sequence analysis): over 0.5 million sequences analyzed.
- MAFFT (multiple sequence alignment): more than 5000 sequences aligned.
- Solexa Illumina (sequence clustering): 2.5 million sequences clustered.
19 Applications porting
- PatSearch (pattern retrieval in sequences): identifies and annotates regulatory elements (collected in UTRsite) in mRNA untranslated regions (collected in UTRdb); 150,000 jobs executed in 1 day.
- PSI-BLAST (multiple sequence alignment): 70,000 jobs executed in 65 hours using GHz Itanium 2 processors; 96 days required on a single processor.
- Gromacs (protein dynamics simulation): short simulations of a small protein, bovine β-lactoglobulin, in a solvent (water/urea + NaCl) at 300 K and constant pressure (1 bar); run times of 4 and 26 seconds using 12 and 2 CPUs respectively, on Itanium 2 processors.
- Antihunter: identification of expressed sequence tag (EST) antisense transcripts from BLAST output.
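As a rough worked figure for the PSI-BLAST run, and assuming the 96-day single-processor time refers to the same 70,000 jobs, the speedup and the average throughput follow directly from the numbers on the slide:

```latex
S = \frac{T_{\text{single}}}{T_{\text{grid}}}
  = \frac{96 \times 24\ \text{h}}{65\ \text{h}}
  = \frac{2304}{65} \approx 35.4,
\qquad
\text{throughput} = \frac{70\,000\ \text{jobs}}{65\ \text{h}} \approx 1077\ \text{jobs/h}
```

The number of processors used is not legible in the transcript, so the parallel efficiency cannot be derived from these figures.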
20 Conclusions and Future Work
- The LIBI environment is a virtual laboratory for bioinformatics based on a high-performance, distributed infrastructure that supports access to large datasets and the execution of single or complex jobs;
- The LIBI platform involves a large set of resources belonging to three different grid middleware stacks: gLite, Unicore and Globus. These toolkits provide basic services for managing the resources;
- Built on top of these services, a set of enhanced and novel services for resource and data management has been implemented;
- Several case studies with related results have been presented.
Future work:
- test more bioinformatics applications on the testbed;
- make the system fully operational and open it to external users.
21 LIBI Team
- M. Mirto, I. Epicoco, S. Fiore, M. Cafaro, A. Negro, D. Tartarini, M. Passante, O. Marra, A. Ferramosca, V. Zara, G. Aloisio: SPACI & University of Salento, Lecce
- G. Cuscela, G. Donvito, G. La Rocca, S. My, G. Selvaggi, G. Maggi: INFN, Padova and Catania Sections, & Dipartimento Interateneo di Fisica di Bari
- G. Scioscia, P. Leo, L. Di Pace: IBM Italy, Bari
- G. Pappadà, V. Quinto, M. Berardi: Exhicon srl, Bari
- F. Falciano, A. Emerson, G. Lavorgna, A. Vanni, E. Rossi: CINECA, Bologna
- L. Bartoli, P. Di Lena, P. Fariselli, R. Fronza, L. Margara, L. Montanucci, P. L. Martelli, I. Rossi, M. Vassura and R. Casadio: Biocomputing Group, CIRB/CIG-Department of Biology, University of Bologna and Bioinformatics Group-Department of Computer Science
- T. Castrignanò: CASPUR, Roma
- D. D'Elia, G. Grillo, F. Licciulli, S. Liuni, A. Gisel, M. Santamaria, S. Vicario, C. Saccone (Coordinator): ITB-CNR, Bari, Dipartimento di Biochimica e Biologia Molecolare E. Quagliariello, Università di Bari
- A. Anselmo, D. Horner, F. Mignone, G. Pavesi, E. Picardi, V. Piccolo, M. Re, F. Zambelli, G. Pesole: Dipartimento di Chimica Strutturale e Stereochimica Inorganica, Università di Milano & Dipartimento di Scienze Biomolecolari e Biotecnologie, Università di Milano
Reference: The LIBI Grid Platform Developers - M. Mirto et al., The LIBI Grid Platform for Bioinformatics, in Mario Cannataro (Ed.), Handbook of Research on Computational Grid Technologies for Life Sciences, Biomedicine and Healthcare, IGI Global (to appear).
More informationBig Data in BioMedical Sciences. Steven Newhouse, Head of Technical Services, EMBL-EBI
Big Data in BioMedical Sciences Steven Newhouse, Head of Technical Services, EMBL-EBI Big Data for BioMedical Sciences EMBL-EBI: What we do and why? Challenges & Opportunities Infrastructure Requirements
More informationGRIDSEED: A Virtual Training Grid Infrastructure
GRIDSEED: A Virtual Training Grid Infrastructure Iztok Gregori CNR-IOM DEMOCRITOS Trieste, Italy iztok@democritos.it Stefano Cozzini CNR-IOM DEMOCRITOS Trieste, Italy cozzini@democritos.it Tyanko Aleksiev
More informationProf. Elisabetta Cerbai University of Florence, Vice-Chancellor for Research Sapienza University of Rome, NVA Coordinator
Prof. Elisabetta Cerbai University of Florence, Vice-Chancellor for Research Sapienza University of Rome, NVA Coordinator Rome, October 14, 2014 1.Strategic Planning 2.From planning to facts 3.Evaluation:
More informationVirtual digital libraries: The DILIGENT Project. Donatella Castelli ISTI-CNR, Italy
Virtual digital libraries: The DILIGENT Project Donatella Castelli ISTI-CNR, Italy The DLs evolution Dynamic Universal Knowledge Environment 2005 Repository of digital texts + search service 1996 Large
More informationglibrary: Digital Asset Management System for the Grid
glibrary: Digital Asset Management System for the Grid Antonio Calanducci INFN Catania EGEE User Forum Manchester, 09 th -11 th May 2007 www.eu-egee.org EGEE and glite are registered trademarks Outline
More informationRecent advances of MedIGrid PSE in an LCG/gLite environment
430 FINAL WORKSHOP OF GRID PROJECTS, PON RICERCA 2000-2006, AVVISO 1575 Recent advances of MedIGrid PSE in an LCG/gLite environment V. Boccia 1, L. Carracciuolo 2, L. D amore 1, G. Laccetti 1, M. Lapegna
More informationTHE CCLRC DATA PORTAL
THE CCLRC DATA PORTAL Glen Drinkwater, Shoaib Sufi CCLRC Daresbury Laboratory, Daresbury, Warrington, Cheshire, WA4 4AD, UK. E-mail: g.j.drinkwater@dl.ac.uk, s.a.sufi@dl.ac.uk Abstract: The project aims
More informationa Peer-to-Peer Desktop Grid for scientific applications federating small research laboratories
ShareGrid a Peer-to-Peer Desktop Grid for scientific applications federating small research laboratories Guglielmo Girardi, TOP-IX, guglielmo.girardi@topix.it http://dcs.mfn.unipmn.it/sharegrid/ sharegrid.admin@topix.it
More informationScheduling and Resource Management in Computational Mini-Grids
Scheduling and Resource Management in Computational Mini-Grids July 1, 2002 Project Description The concept of grid computing is becoming a more and more important one in the high performance computing
More informationDAME Astrophysical DAta Mining Mining & & Exploration Exploration GRID
DAME Astrophysical DAta Mining & Exploration on GRID M. Brescia S. G. Djorgovski G. Longo & DAME Working Group Istituto Nazionale di Astrofisica Astronomical Observatory of Capodimonte, Napoli Department
More informationGenomeSpace Architecture
GenomeSpace Architecture The primary services, or components, are shown in Figure 1, the high level GenomeSpace architecture. These include (1) an Authorization and Authentication service, (2) an analysis
More informationThe Galaxy workflow. George Magklaras PhD RHCE
The Galaxy workflow George Magklaras PhD RHCE Biotechnology Center of Oslo & The Norwegian Center of Molecular Medicine University of Oslo, Norway http://www.biotek.uio.no http://www.ncmm.uio.no http://www.no.embnet.org
More informationG E N OM I C S S E RV I C ES
GENOMICS SERVICES THE NEW YORK GENOME CENTER NYGC is an independent non-profit implementing advanced genomic research to improve diagnosis and treatment of serious diseases. capabilities. N E X T- G E
More informationEventuale spazio per nome struttura o altro. Pharmacy education. The Italian academic viewpoint
Eventuale spazio per nome struttura o altro Pharmacy education. The Italian academic viewpoint Faculty of Pharmacy Total number of Pharmacy higher education institutes in Italy 1. Faculty of Pharmacy,
More informationCINECA DSpace-CRIS : An Open Source Solution Use Science 2013 w.c ineca.it
CINECA DSpace-CRIS : An Open Source Solution ~ Use Science 2013 Open www.cineca.it Infrastructure to Foster Collaboration between Industry and Academia Topics CINECA: a brief overview Solutions for Higher
More informationBIO 3352: BIOINFORMATICS II HYBRID COURSE SYLLABUS
BIO 3352: BIOINFORMATICS II HYBRID COURSE SYLLABUS NEW YORK CITY COLLEGE OF TECHNOLOGY The City University Of New York School of Arts and Sciences Biological Sciences Department Course title: Bioinformatics
More informationHeterogeneous Database Replication Gianni Pucciani
LCG Database Deployment and Persistency Workshop CERN 17-19 October 2005 Heterogeneous Database Replication Gianni Pucciani A.Domenici andrea.domenici@iet.unipi.it F.Donno flavia.donno@cern.ch L.Iannone
More informationClient/Server Grid applications to manage complex workflows
Client/Server Grid applications to manage complex workflows Filippo Spiga* on behalf of CRAB development team * INFN Milano Bicocca (IT) Outline Science Gateways and Client/Server computing Client/server
More informationThree data delivery cases for EMBL- EBI s Embassy. Guy Cochrane www.ebi.ac.uk
Three data delivery cases for EMBL- EBI s Embassy Guy Cochrane www.ebi.ac.uk EMBL European Bioinformatics Institute Genes, genomes & variation European Nucleotide Archive 1000 Genomes Ensembl Ensembl Genomes
More informationKLAPER: an Intermediate Language for Model-Driven Predictive Analysis of Performance and Reliability
KLAPER: an Intermediate Language for Model-Driven Predictive Analysis of Performance and Reliability Vincenzo Grassi Dipartimento di Informatica, Sistemi e Produzione, Università di Roma Tor Vergata Raffaela
More informationDATA MODEL FOR DESCRIBING GRID RESOURCE BROKER CAPABILITIES
DATA MODEL FOR DESCRIBING GRID RESOURCE BROKER CAPABILITIES Attila Kertész Institute of Informatics, University of Szeged H-6701 Szeged, P.O. Box 652, Hungary MTA SZTAKI Computer and Automation Research
More informationDistributed Computing for CEPC. YAN Tian On Behalf of Distributed Computing Group, CC, IHEP for 4 th CEPC Collaboration Meeting, Sep.
Distributed Computing for CEPC YAN Tian On Behalf of Distributed Computing Group, CC, IHEP for 4 th CEPC Collaboration Meeting, Sep. 12-13, 2014 1 Outline Introduction Experience of BES-DIRAC Distributed
More informationExperiences with the GLUE information schema in the LCG/EGEE production Grid
Experiences with the GLUE information schema in the LCG/EGEE production Grid Stephen Burke, Sergio Andreozzi and Laurence Field CHEP07, Victoria, Canada www.eu-egee.org EGEE and glite are registered trademarks
More informationFile Transfer Software and Service SC3
File Transfer Software and Service SC3 Gavin McCance JRA1 Data Management Cluster Service Challenge Meeting April 26 2005, Taipei www.eu-egee.org Outline Overview of Components Tier-0 / Tier-1 / Tier-2
More informationTier 1 Services - CNAF to T1
CDF Report on Tier 1 Usage Donatella Lucchesi for the CDF Italian Computing Group INFN Padova Outline The CDF Computing Model Tier1 resources usage as today CDF portal for European GRID: lcgcaf People
More informationFast and Easy Delivery of Data Mining Insights to Reporting Systems
Fast and Easy Delivery of Data Mining Insights to Reporting Systems Ruben Pulido, Christoph Sieb rpulido@de.ibm.com, christoph.sieb@de.ibm.com Abstract: During the last decade data mining and predictive
More information