Institut Français de Bioinformatique, Un Cloud pour les Sciences du Vivant
|
|
- Marjorie Norris
- 8 years ago
- Views:
Transcription
1 Institut Français de Bioinformatique, Un Cloud pour les Sciences du Vivant Christophe Blanchet! Institut Français de Bioinformatique - IFB French Institute of Bioinformatics - ELIXIR French Node CNRS UMS Gif-sur-Yvette - FRANCE
2 Sequencing data source: source: Complete genome sequencing become a lab commodity with NGS (cheap and efficient)
3 And other experimental data FR - EU
4 EMBL-EBI data resources growth source: EMBL-EBI Annual Report 2013
5 Plateformes Expérimentales en Biologie Plateformes nationales (GIS IBISA) Nb Imagerie cellulaire 19 Génomique, Transcriptomique 16 Protéomique 13 Biologie structurale, biophysique 11 NGS BI C IMG Biological platform (Genomics, IMaGing, PROteomics...) Bioinformatics center Cloud resources Scientists PRO C NGS BI NGS PRO BI C Localisation des plateformes NGS PRO BI IMG PRO NGS PRO C IMG BI C C Source: omicsmaps.com Des sites intermédiaires permettent de répartir la charge en terme de stockage et de puissance de calcul tout en assurant une meilleure proximité avec les scientifiques
6 Infrastructures in Biology Lot of bioinformatics tools and services to treat and vizualize the biological data
7 IFB mission IFB: national research infrastructure of service in Bioinformatics! Mission: to make available core bioinformatics resources to the national life science research community.! by providing support for biology programs! projects bolstering! users training! by setting up an IT infrastructure devoted to the management and analysis of biological data! material resources : CPUs, disks, etc.! availability of biology data collections! deployment of bioinformatics tools (Cloud)! by acting as a middleman between the life science community needs and the bioinformatics/computer science research community! by being the ELIXIR French node
8 IFB structure IFB consists of :! A network of 6 regional centers (> 20 PFs)! about 110 FTE permanent staff + 70 FTC staff! A national node : IFB-core! plan of 10 FTE permanent staff (currently 3) + a few FTC staff
9 IFB-core: IFB s national hub IFB-core consists of two teams: The "Communication, Training, Exploitation" unit in charge of the web site and training program based on e-learning The "Infrastructure" unit in charge of the IFB national IT infrastructure
10 IFB-core tasks to provide a technical and administrative support to IFB to implement IFB scientific policy and facilitate the dissemination of actions to ensure an effective coordination between the PFs to serve as an interface by providing a unique entry point to IFB partners (supervisory authorities, the life science community, European and national bioinformatics communities) to set up and manage the IFB s national IT infrastructure to facilitate access to this IT infrastructure by deploying an academic Cloud
11 IFB e-infrastructure Support : help members to deploy and use their tools e-infrastructure: hardware, biology data collections, bioinformatics tools Academic cloud for life science a core ressource IFB-core hosted at CNRS IDRIS SC center (Paris) + regional resources 6 regional bioinformatics centers with 2 clouds 11,000 cores - +6 PB but +20 bioinformatics platforms Create a federation of clouds for life sciences Technical organization GRISBI: a national technical working group (all platforms) Participation to ELIXIR task forces RENABI-GO RENABI-SO APLIBIO RENABI-NE IFB-core PRABI RENABI-GS Cloud Ressources Location # Compute Cores # TB Storage # TB RAM Max VM size Technology IFB-core CNRS-IDRIS, Paris c 256GB StratusLab IFB-core 2015 CNRS-IDRIS, Paris 3, c 1TB StratusLab IFB-core 2016 CNRS-IDRIS, Paris 10,000 2,000-96c 2TB StratusLab Genocloud IFB-GO, Rennes ONE
12 Cloud? public community private SaaS Deploy. hybrid PaaS IaaS Ack.: C. Loomis
13 A cloud driven through a web dashboard
14 Ready-to-use bioinformatics cloud appliances Bioinformatics appliances are usual virtual machines small : few GB, easy to convert in most virtualization formats Installed and preconfigured with bioinformatics tools e.g. BLAST, Clustalw, ARIA, MEME, HMMer, TopHat, BWA, Samtools, etc. Referenced in a marketplace devoted to bioinformatics tools BLAST FastA OMSSA ClustalW2 SSearch PeptideShaker ARIA BWA X!tandem HMMer TopHat samtools Galaxy Clustal Muscle fastqc Omega Create new cloud services R Linux system Bioinformatics Marketplace Structures Sequences Virtual Machines Proteomics + Galaxy...
15 Current bioinformatics IFB Scientific apps CLI Virtual desktop Web Galaxy MODAL Proteomics Galaxy Galaxy AVIESAN 2013 RSAT PhyML RSAT mini biocompute Aria Node Utilities biodata BioMaj BlobSeer biodata NFS Cassandra Data mgmt biohadoop CentOS Ubuntu Base OS
16 Run bioinformatics appliances Bioinformatics marketplace both a virtual machines repository Store life science VMs and a catalogue Help users to select the appropriate VM for their analysis Bioinformatics Marketplace BI Structures Sequences Proteomics B A data public data UNIPROT EMBL Genomes PDB PROSITE Move cloud virtual machines Analyze data Galaxy tools VM: BLAST, ClustalW2, etc.... (2) IDB Cloud (3) Select tools Scientists can filter (1) the appliances through a Web interface to identify and launch (2) the appropriate ones. (1) Use tools (3) Scientists have access to their own cloud resources through web portal, remote virtual desktop or SSH. Filter images with metadata related to bioinformatics attribute <bio:tool> in VM manifests scientists can select the appropriate appliance according to the tools required for their analyses e.g. the BLAST tool Deploy on several clouds
17 Storage for biological data CLI (scp, sftp), GUI (Cyberduck, Transmit, Filezilla, ) sftp/http/s3 Upload your data Public Data sources Genomes EMBL PDB UNIPROT PROSITE shared (NFS ro) BLAST, Clustal, etc. PaaS IaaS launch jobs ssh Shared FS Master & Storage VM ARIA Workers VM CNS Identity Mgmt j. doe e. martin you chb virtual disks Portal Bioinformatics Cloud cg User data sftp/http/s3 Get your results CLI (scp, sftp), GUI (Cyberduck, Transmit, Filezilla, )
18 Monitor your usage
19 Moving VMs vs Data NGS IMG PRO NGS Biological platform (Genomics, IMaGing, PROteomics...) BI C Bioinformatics center Cloud resources Scientists C BI NGS data PRO VM BI VM C VMs PRO IFB life sciences marketplace & VMs repository NGS data VM PRO BI C IMG PRO data IMG BI C C
20 Case 1: Standard Bioinformatics node Use your own instance(s) With pre-installed standard appliance Biocompute bioinformatics tools BLAST, FastA, SSearch,HMM,... ClustalW2, Clustal-Omega, Muscle,.. Bowtie(2), BWA, samtools,... MEME, R, etc. Connected to public reference data Uniprot, EMBL, genomes, PDB, etc. Automaticaly shared to the VMs Cluster mode turn several instances in a single virtual cluster shared file system batch scheduling
21 Case 2: Cloud Galaxy portal Web interface portal Galaxy is widely used in the community analyse NGS data (mainly but not only) connected to community knowledge: data and indexes, tools, workflows Preserve workflows and results (cloud virtual disk) Help the integration of monthly updates and new tools Cloud permit different appliances to be built from the same base: generic with common tools for NGS specific for a set of tools, example of MODAL (MOdels for Data Analysis and Learning) for training: create a special appliance with dedicated datasets, tools or workflows (French AVIESAN school 2013) domain specific appliances: RNAseq, CHIPseq, etc. (planned)
22 Galaxy IFB cloud manual installation Linux system Created by interactive installation Main with standard tools Devoted For a specific event: training, demo, For tools: e.g. MODAL Galaxy Galaxy Galaxy AVIESAN 2013 MODAL Version 1.0 Version 2.0 N
23 Case 3: A specialized software suite for the analysis of noncoding sequences motif discovery in promotors of co-expressed genes CHIPseq analysis evolutionary conserved motifs (phylogenetics footprints) Contact: J. van Helden (TGAC) Used for ECCB 14 tutorial T01 RSAT offers a series of tools dedicated to the detection of regulatory signals in noncoding sequences input a list of genes of interest you retrieve the upstream sequences over a desired distance, discover putative regulatory signals, search the matching positions for these signals in your original dataset or in whole genomes, display the results graphically in the form of a feature map.
24 Case 4: Proteomics virtual desktop Motivation Collaboration with a mass spectroscopy platform Running out of space on their local resources Protein identification tools Mass experimental data Reference databases : nr, Swiss-Prot Reference screening tools: OMSSA, X!Tandem User interface Remote Virtual Desktop (NX) Reference GUIs SearchGUI PeptidShaker source: PeptideShaker site
25 IFB - an academic cloud for life sciences IFB s cloud simplify access to biological data and tools integrate tools and pipelines in turnkey cloud appliances is tightly connected to existing bioinformatics resources, e.g. public reference data sources 16 bioinformatics appliances: standard compute nodes, proteomics virtual desktop, Galaxy portal, structural biology +50 users from all IFB regional centers Bioinformatics appliances registry reference images related to life science help users to select the appropriate VM for their analysis BI tools BLAST FastA OMSSA ClustalW2 SSearch PeptideShaker ARIA BWA X!tandem HMMer TopHat samtools Galaxy Clustal Muscle fastqc Omega Create new cloud services Virtual Machines R Linux system Bioinformatics Marketplace Bioinformatics Marketplace Structures Galaxy Sequences Proteomics Proteomics B A data public data UNIPROT EMBL Genomes PDB PROSITE Move cloud virtual machines Analyze data tools VM: BLAST, ClustalW2, etc Galaxy (2) IDB Cloud... (3) Selec Scientists can the appliances Web interface t and launch appropriate ones (1) Use Scientists have access cloud resources throug remote virtual desktop o
26 Perspectives Create more bioinformatics appliances by the experts of the domains publish them in the IFB marketplace to make them available to the scientists! IFB established priorities for 5 scientific domains Microbial Bioinformatics Evolutionary bioinformatics Plant bioinformatics Structural Biology NGS data processing! and technical pilots Interoperability of appliances on different cloud infrastructures Registry of distributed multi-cloud datasets Live remote cloud processing of sequencing data
27 Questions? Acknowledgments! Clément Gauthey (CNRS IDRIS, form. IDB-IBCP) Developers of tools that integrated them as an IFB cloud appliance: Samuel Blanck (Inria Lille), Jacques van Helden (TAGC), You?! StratusLab members! IFB s funding by French program PIA INBS 2012
Sequencing data. And other experimental data. EMBL-EBI data resources growth
Sequencing Institut Français de Bioinformatique, Un loud pour les Sciences du Vivant source: www.genomesonline.org source: www.politigenomics.com/next-generation- hristophe Blanchet Institut Français de
More informationCloud pour la Bioinformatique
Cloud pour la Bioinformatique Christophe Blanchet Institut Français de Bioinformatique - IFB French Institute of Bioinformatics - ELIXIR French Node CNRS UMS3601 - Gif-sur-Yvette - FRANCE Sequencing data
More informationCloud Ready for Bioinformatics?
IDB acknowledges co-funding by the European Community's Seventh Framework Programme (INFSO-RI-261552) and the French National Research Agency's Arpege Programme (ANR-10-SEGI-001) Cloud Ready for Bioinformatics?
More informationBioinformatique sur Cloud Cas d usage avec le portail Galaxy
Bioinformatique sur Cloud Cas d usage avec le portail Galaxy Christophe Blanchet Institute of Biology and Chemistry of Proteins Head of Service Infrastructure for Biology - IDB CNRS-IBCP FR3302 - LYON
More informationUne e-infrastructure nationale en bioinformatique
Une e-infrastructure nationale en bioinformatique Christophe BLANCHET Institut Français de Bioinformatique - IFB French Institute of Bioinformatics - ELIXIR-FR CNRS UMS3601 - Gif-sur-Yvette - FRANCE JDEV
More informationIFB s e-infrastructure
IFB s e-infrastructure Christophe Blanchet Institut Français de Bioinformatique - IFB French Institute of Bioinformatics - ELIXIR-FR CNRS UMS3601 - Gif-sur-Yvette - FRANCE Life Sciences Platforms in France
More informationLe cloud IFB et son instance Galaxy
Le cloud IFB et son instance Galaxy Christophe BLANCHET Institut Français de Bioinformatique - IFB French Institute of Bioinformatics - ELIXIR-FR CNRS UMS3601 - Gif-sur-Yvette - FRANCE Ecole Bioinformatique
More informationLe cloud IFB et son instance Galaxy
Le cloud IFB et son instance Galaxy Christophe BLANCHET Institut Français de Bioinformatique - IFB French Institute of Bioinformatics - ELIXIR-FR CNRS UMS3601 - Gif-sur-Yvette - FRANCE Ecole Bioinformatique
More informationDeployment of BioXSDenabled services on a Cloud. christophe.blanchet@ibcp.fr
Deployment of BioXSDenabled services on a Cloud Outline IBCP, provider of BioXSD-enabled services Cloud Computing RENABI GRISBI, French infrastructure Bioinformatics Integrated s gbio-pbil.ibcp.fr/ws GBIO
More informationIns$tut Français de Bioinforma$que Current situa+on and prospect. IFB General Assembly Gif- sur- Yve=e, January 9 2015
Ins$tut Français de Bioinforma$que Current situa+on and prospect IFB General Assembly Gif- sur- Yve=e, January 9 2015 Background 2010: Na+onal Infrastructures in Biology and Health call from the Investment
More informationE-SCIENCE IN WESTERN FRANCE : THE BEGINNING
E-SCIENCE IN WESTERN FRANCE : THE BEGINNING Yvan Le Bras Olivier Collin Jacques Nicolas CNRS UMR 6074 IRISA-INRIA Context Now : Genomics : Next Generation Sequencing Now : Proteomics Next : Bio-imaging
More informationAlternative Deployment Models for Cloud Computing in HPC Applications. Society of HPC Professionals November 9, 2011 Steve Hebert, Nimbix
Alternative Deployment Models for Cloud Computing in HPC Applications Society of HPC Professionals November 9, 2011 Steve Hebert, Nimbix The case for Cloud in HPC Build it in house Assemble in the cloud?
More informationBig Data in BioMedical Sciences. Steven Newhouse, Head of Technical Services, EMBL-EBI
Big Data in BioMedical Sciences Steven Newhouse, Head of Technical Services, EMBL-EBI Big Data for BioMedical Sciences EMBL-EBI: What we do and why? Challenges & Opportunities Infrastructure Requirements
More informationSURFsara HPC Cloud Workshop
SURFsara HPC Cloud Workshop www.cloud.sara.nl Tutorial 2014-06-11 UvA HPC and Big Data Course June 2014 Anatoli Danezi, Markus van Dijk cloud-support@surfsara.nl Agenda Introduction and Overview (current
More informationA curated Domain centric shared Docker registry linked to the Galaxy toolshed
A curated Domain centric shared Docker registry linked to the Galaxy toolshed François Moreews 1, Olivier Sallou 2, Yvan le Bras 2, Marie Grosjean 3, Cyril Monjeaud 2, Thomas Darde 4, Olivier Collin 2,
More informationBig Data in BioMedical Sciences. Steven Newhouse, Head of Technical Services, EMBL-EBI
Big Data in BioMedical Sciences Steven Newhouse, Head of Technical Services, EMBL-EBI Big Data for BioMedical Sciences EMBL-EBI: What we do and why? Challenges & Opportunities Infrastructure Requirements
More informationSURFsara HPC Cloud Workshop
SURFsara HPC Cloud Workshop doc.hpccloud.surfsara.nl UvA workshop 2016-01-25 UvA HPC Course Jan 2016 Anatoli Danezi, Markus van Dijk cloud-support@surfsara.nl Agenda Introduction and Overview (current
More informationStratusLab project. Standards, Interoperability and Asset Exploitation. Vangelis Floros, GRNET
StratusLab project Standards, Interoperability and Asset Exploitation Vangelis Floros, GRNET EGI Technical Forum 2011 19-22 September 2011, Lyon, France StratusLab is co-funded by the European Community
More informationCloud Computing Solutions for Genomics Across Geographic, Institutional and Economic Barriers
Cloud Computing Solutions for Genomics Across Geographic, Institutional and Economic Barriers Ntinos Krampis Asst. Professor J. Craig Venter Institute kkrampis@jcvi.org http://www.jcvi.org/cms/about/bios/kkrampis/
More informationUGENE Quick Start Guide
Quick Start Guide This document contains a quick introduction to UGENE. For more detailed information, you can find the UGENE User Manual and other special manuals in project website: http://ugene.unipro.ru.
More informationThree data delivery cases for EMBL- EBI s Embassy. Guy Cochrane www.ebi.ac.uk
Three data delivery cases for EMBL- EBI s Embassy Guy Cochrane www.ebi.ac.uk EMBL European Bioinformatics Institute Genes, genomes & variation European Nucleotide Archive 1000 Genomes Ensembl Ensembl Genomes
More informationDATA MANAGEMENT PLAN IN THE REAL LIFE SCIENCES
DATA MANAGEMENT PLAN IN THE REAL LIFE SCIENCES Yvan Le Bras Cyril Monjeaud Olivier Collin Jacques Nicolas CNRS UMR 6074 IRISA-INRIA Context Now : Genomics : Next Generation Sequencing Now : Proteomics
More informationEoulsan Analyse du séquençage à haut débit dans le cloud et sur la grille
Eoulsan Analyse du séquençage à haut débit dans le cloud et sur la grille Journées SUCCES Stéphane Le Crom (UPMC IBENS) stephane.le_crom@upmc.fr Paris November 2013 The Sanger DNA sequencing method Sequencing
More informationE-SCIENCE IN WESTERN FRANCE :
E-SCIENCE IN WESTERN FRANCE : BEGINS Yvan Le Bras Cyril Monjeaud Olivier Collin & the GenOuest team CNRS UMR 6074 IRISA-INRIA Context Now : Genomics : Next Generation Sequencing Now : Proteomics Next :
More informationTowards a galaxy.prabi.fr
Towards a galaxy.prabi.fr IFB- galaxy Day 04/12/2013 Navra5l V., PhD, UCBL navra5l@prabi.fr www.prabi.fr One among the six IFB regional nodes Region: Rhône- Alpes Director: Guy Perrière 11 Research Team,
More informationBig Data and Cloud Computing for GHRSST
Big Data and Cloud Computing for GHRSST Jean-Francois Piollé (jfpiolle@ifremer.fr) Frédéric Paul, Olivier Archer CERSAT / Institut Français de Recherche pour l Exploitation de la Mer Facing data deluge
More informationCloud computing for research
& for research Cécile Cavet cecile.cavet at apc.univ-paris7.fr Centre François Arago (FACe), Laboratoire APC, Université Paris Diderot January 11, 2015 Plan & 1 2 & 3 4 What is cloud computing? &??? Provides
More informationEnabling multi-cloud resources at CERN within the Helix Nebula project. D. Giordano (CERN IT-SDC) HEPiX Spring 2014 Workshop 23 May 2014
Enabling multi-cloud resources at CERN within the Helix Nebula project D. Giordano (CERN IT-) HEPiX Spring 2014 Workshop This document produced by Members of the Helix Nebula consortium is licensed under
More informationHadoopizer : a cloud environment for bioinformatics data analysis
Hadoopizer : a cloud environment for bioinformatics data analysis Anthony Bretaudeau (1), Olivier Sallou (2), Olivier Collin (3) (1) anthony.bretaudeau@irisa.fr, INRIA/Irisa, Campus de Beaulieu, 35042,
More informationCloud services in PL-Grid and EGI Infrastructures
1 Cloud services in PL-Grid and EGI Infrastructures J. Meizner, M. Radecki, M. Pawlik, T. Szepieniec ACK Cyfronet AGH Cracow Grid Workshop 2012, Kraków, 22.10.2012 Overview 2 Different types of Compute
More informationSteven Newhouse, Head of Technical Services
Challenges at EMBL-EBI Steven Newhouse, Head of Technical Services European Bioinformatics Institute Outstation of the European Molecular Biology Laboratory International organisation created by treaty
More informationNew solutions for Big Data Analysis and Visualization
New solutions for Big Data Analysis and Visualization From HPC to cloud-based solutions Barcelona, February 2013 Nacho Medina imedina@cipf.es http://bioinfo.cipf.es/imedina Head of the Computational Biology
More informationPlanning, Provisioning and Deploying Enterprise Clouds with Oracle Enterprise Manager 12c Kevin Patterson, Principal Sales Consultant, Enterprise
Planning, Provisioning and Deploying Enterprise Clouds with Oracle Enterprise Manager 12c Kevin Patterson, Principal Sales Consultant, Enterprise Manager Oracle NIST Definition of Cloud Computing Cloud
More informationCloud BioLinux: Pre-configured and On-demand Bioinformatics Computing for the Genomics Community
Cloud BioLinux: Pre-configured and On-demand Bioinformatics Computing for the Genomics Community Ntinos Krampis Asst. Professor J. Craig Venter Institute kkrampis@jcvi.org http://www.jcvi.org/cms/about/bios/kkrampis/
More informationUniversity of Messina, Italy
University of Messina, Italy IEEE MoCS 2011 Kerkyra - Greece June 28, 2011 Dr. Massimo Villari mvillari@unime.it Cross Cloud Federation Federated Cloud Scenario Cloud Middleware Model: the Stack The CLEVER
More informationCloud BioLinux: Pre-configured and On-demand Bioinformatics Computing for the Genomics Community
Cloud BioLinux: Pre-configured and On-demand Bioinformatics Computing for the Genomics Community Ntinos Krampis Asst. Professor J. Craig Venter Institute kkrampis@jcvi.org http://www.jcvi.org/cms/about/bios/kkrampis/
More informationEuro-BioImaging European Research Infrastructure for Imaging Technologies in Biological and Biomedical Sciences
Euro-BioImaging European Research Infrastructure for Imaging Technologies in Biological and Biomedical Sciences WP11 Data Storage and Analysis Task 11.1 Coordination Deliverable 11.2 Community Needs of
More informationicer Bioinformatics Support Fall 2011
icer Bioinformatics Support Fall 2011 John B. Johnston HPC Programmer Institute for Cyber Enabled Research 2011 Michigan State University Board of Trustees. Institute for Cyber Enabled Research (icer)
More informationBuilding Storage Service in a Private Cloud
Building Storage Service in a Private Cloud Sateesh Potturu & Deepak Vasudevan Wipro Technologies Abstract Storage in a private cloud is the storage that sits within a particular enterprise security domain
More informationFinal Report on StratusLab Adoption
Final Report on StratusLab Adoption Charles Loomis, Mohammed Airaj, Marc-Elian Bégin, Christophe Blanchet, Evangelos Floros, Clément Gauthey To cite this version: Charles Loomis, Mohammed Airaj, Marc-Elian
More informationEMBL-EBI Web Services
EMBL-EBI Web Services Rodrigo Lopez Head of the External Services Team SME Workshop Piemonte 2011 EBI is an Outstation of the European Molecular Biology Laboratory. Summary Introduction The JDispatcher
More informationOPEN SOURCE AND BOTTOM-UP VRE APPROACH IN WESTERN FRANCE
OPEN SOURCE AND BOTTOM-UP VRE APPROACH IN WESTERN FRANCE Towards supporting accessible, reproducible, and transparent research in the life sciences Yvan Le Bras Cyril Monjeaud Olivier Collin, the GenOuest
More informationVirtualization & Cloud Computing (2W-VnCC)
Virtualization & Cloud Computing (2W-VnCC) DETAILS OF THE SYLLABUS: Basics of Networking Types of Networking Networking Tools Basics of IP Addressing Subnet Mask & Subnetting MAC Address Ports : Physical
More informationEGEE-2 NA4 Biomed Bioinformatics in CNRS
Enabling Grids for E-sciencE EGEE-2 NA4 Biomed Bioinformatics in CNRS Christophe Blanchet Institute of Biology and Chemistry of Proteins Lyon, April 28, 2006 www.eu-egee.org Enabling Grids for E-sciencE
More informationHPC Cloud. Focus on your research. Floris Sluiter Project leader SARA
HPC Cloud Focus on your research Floris Sluiter Project leader SARA Why an HPC Cloud? Christophe Blanchet, IDB - Infrastructure Distributing Biology: Big task to port them all to your favorite architecture
More informationSixSq Cloud Capabilities
SixSq Cloud Capabilities SlipStream: Mutli-cloud Management Platform Marc-Elian Bégin, CEO, Co-founder, SixSq HEPIA Cloud Masters, Lausanne, 2015 Locations Global Headquarters Geneva, Switzerland North
More informationCloud-Based Big Data Analytics in Bioinformatics
Cloud-Based Big Data Analytics in Bioinformatics Presented By Cephas Mawere Harare Institute of Technology, Zimbabwe 1 Introduction 2 Big Data Analytics Big Data are a collection of data sets so large
More informationDesign and Building of IaaS Clouds
21th May 2010 CloudViews 2010 Porto, Portugal Next Generation Data Center Summit Design and Building of IaaS Clouds Distributed Systems Architecture Research Group Universidad Complutense de Madrid This
More informationBioinformatics Grid - Enabled Tools For Biologists.
Bioinformatics Grid - Enabled Tools For Biologists. What is Grid-Enabled Tools (GET)? As number of data from the genomics and proteomics experiment increases. Problems arise for the current sequence analysis
More informationAnalysis of ChIP-seq data in Galaxy
Analysis of ChIP-seq data in Galaxy November, 2012 Local copy: https://galaxy.wi.mit.edu/ Joint project between BaRC and IT Main site: http://main.g2.bx.psu.edu/ 1 Font Conventions Bold and blue refers
More informationvnebula Cloud. Made Easy. Introducing vnebula from Stream Networks. A simple, self-service cloud portal for our partner community.
vnebula Cloud. Made Easy. Introducing vnebula from Stream Networks. A simple, self-service cloud portal for our partner community. Keep it simple Our partners asked us for a cloud portal that was simple
More informationDeploying Business Virtual Appliances on Open Source Cloud Computing
International Journal of Computer Science and Telecommunications [Volume 3, Issue 4, April 2012] 26 ISSN 2047-3338 Deploying Business Virtual Appliances on Open Source Cloud Computing Tran Van Lang 1 and
More informationExperiences and challenges in the development of the JASMIN cloud service for the environmental science community
JASMIN (STFC/Stephen Kill) Experiences and challenges in the development of the JASMIN cloud service for the environmental science community ECMWF Visualisa-on in Meteorology Week, 28 September 2015 Philip
More informationSolution for private cloud computing
The CC1 system Solution for private cloud computing 1 Outline What is CC1? Features Technical details System requirements and installation How to get it? 2 What is CC1? The CC1 system is a complete solution
More informationDutch HPC Cloud: flexible HPC for high productivity in science & business
Dutch HPC Cloud: flexible HPC for high productivity in science & business Dr. Axel Berg SARA national HPC & e-science Support Center, Amsterdam, NL April 17, 2012 4 th PRACE Executive Industrial Seminar,
More informationEGI services for distribution and federation of data and computing
EGI services for distribution and federation of data and computing Tiziana Ferrari Technical Director, EGI.eu tiziana.ferrari@egi.eu March 2014 EGI-InSPIRE RI-261323 1 Accelerating Excellent Science MISSION.
More informationLarge-scale Research Data Management and Analysis Using Globus Services. Ravi Madduri Argonne National Lab University of Chicago @madduri
Large-scale Research Data Management and Analysis Using Globus Services Ravi Madduri Argonne National Lab University of Chicago @madduri Outline Who we are Challenges in Big Data Management and Analysis
More informationENABLING DATA TRANSFER MANAGEMENT AND SHARING IN THE ERA OF GENOMIC MEDICINE. October 2013
ENABLING DATA TRANSFER MANAGEMENT AND SHARING IN THE ERA OF GENOMIC MEDICINE October 2013 Introduction As sequencing technologies continue to evolve and genomic data makes its way into clinical use and
More informationBioHPC Web Computing Resources at CBSU
BioHPC Web Computing Resources at CBSU 3CPG workshop Robert Bukowski Computational Biology Service Unit http://cbsu.tc.cornell.edu/lab/doc/biohpc_web_tutorial.pdf BioHPC infrastructure at CBSU BioHPC Web
More informationEnabling a federated environment to support biomedical research. Gianmauro Cuccuru CRS4
Enabling a federated environment to support biomedical research Gianmauro Cuccuru CRS4 ELIXIR connects national bioinformatics centres and EMBL- EBI into a sustainable European infrastructure for biological
More informationSolution for private cloud computing
The CC1 system Solution for private cloud computing 1 Outline What is CC1? Features Technical details Use cases By scientist By HEP experiment System requirements and installation How to get it? 2 What
More informationCloud Computing. What Are We Handing Over? Ganesh Shankar Advanced IT Core Pervasive Technology Institute
Cloud Computing What Are We Handing Over? Ganesh Shankar Advanced IT Core Pervasive Technology Institute Why is the Cloud Relevant to In the current research workflow. Medical Research? Data volumes are
More informationIntroduction to Cloud Computing
Introduction to Cloud Computing Shang Juh Kao Dept. of Computer Science and Engineering National Chung Hsing University 2011/10/27 CSE, NCHU 1 Table of Contents 1. Introduction ( 資 料 取 自 NCHC 自 由 軟 體 實
More informationOpen Cloud System. (Integration of Eucalyptus, Hadoop and AppScale into deployment of University Private Cloud)
Open Cloud System (Integration of Eucalyptus, Hadoop and into deployment of University Private Cloud) Thinn Thu Naing University of Computer Studies, Yangon 25 th October 2011 Open Cloud System University
More informationBuilding Bioinformatics Capacity in Africa. Nicky Mulder CBIO Group, UCT
Building Bioinformatics Capacity in Africa Nicky Mulder CBIO Group, UCT Outline What is bioinformatics? Why do we need IT infrastructure? What e-infrastructure does it require? How we are developing this
More informationCore Bioinformatics. Degree Type Year Semester. 4313473 Bioinformàtica/Bioinformatics OB 0 1
Core Bioinformatics 2014/2015 Code: 42397 ECTS Credits: 12 Degree Type Year Semester 4313473 Bioinformàtica/Bioinformatics OB 0 1 Contact Name: Sònia Casillas Viladerrams Email: Sonia.Casillas@uab.cat
More informationComparing Methods for Identifying Transcription Factor Target Genes
Comparing Methods for Identifying Transcription Factor Target Genes Alena van Bömmel (R 3.3.73) Matthew Huska (R 3.3.18) Max Planck Institute for Molecular Genetics Folie 1 Transcriptional Regulation TF
More informationManaging and Conducting Biomedical Research on the Cloud Prasad Patil
Managing and Conducting Biomedical Research on the Cloud Prasad Patil Laboratory for Personalized Medicine Center for Biomedical Informatics Harvard Medical School SaaS & PaaS gmail google docs app engine
More informationDevelopment of Bio-Cloud Service for Genomic Analysis Based on Virtual
Development of Bio-Cloud Service for Genomic Analysis Based on Virtual Infrastructure 1 Jung-Ho Um, 2 Sang Bae Park, 3 Hoon Choi, 4 Hanmin Jung 1, First Author Korea Institute of Science and Technology
More informationSURFsara Data Services
SURFsara Data Services SUPPORTING DATA-INTENSIVE SCIENCES Mark van de Sanden The world of the many Many different users (well organised (international) user communities, research groups, universities,
More informationEnhanced Research Data Management and Publication with Globus
Enhanced Research Data Management and Publication with Globus Vas Vasiliadis Jim Pruyne Presented at OR2015 June 8, 2015 Presentations and other useful information available at globus.org/events/or2015/tutorial
More informationAssignment # 1 (Cloud Computing Security)
Assignment # 1 (Cloud Computing Security) Group Members: Abdullah Abid Zeeshan Qaiser M. Umar Hayat Table of Contents Windows Azure Introduction... 4 Windows Azure Services... 4 1. Compute... 4 a) Virtual
More informationGlobus Genomics Tutorial GlobusWorld 2014
Globus Genomics Tutorial GlobusWorld 2014 Agenda Overview of Globus Genomics Example Collaborations Demonstration Globus Genomics interface Globus Online integration Scenario 1: Using Globus Genomics for
More informationTrials community. Yannick Legré. yannick.legre@egi.eu. www.egi.eu. EGI InSPIRE RI 261323
EGI InSPIRE InSPIRE EGI Federated cloud for the Clinical Trials community Yannick Legré yannick.legre@egi.eu ECRIN Workshop EGI European Grid Infrastructure Distributed, federated storage and compute facilities
More informationCloud Storage in a PaaS World
Cloud Storage in a PaaS World Susan Wu Oracle Group Manager, Product Management Cloud Storage runs Apps Just like a Platform runs Applications, there are many Applications that need to run in Cloud Storage
More information<Insert Picture Here> Private Cloud with Fusion Middleware
Private Cloud with Fusion Middleware Duško Vukmanović Principal Sales Consultant, Oracle dusko.vukmanovic@oracle.com The following is intended to outline our general product direction.
More informationINTRODUCTION TO CLOUD MANAGEMENT
CONFIGURING AND MANAGING A PRIVATE CLOUD WITH ORACLE ENTERPRISE MANAGER 12C Kai Yu, Dell Inc. INTRODUCTION TO CLOUD MANAGEMENT Oracle cloud supports several types of resource service models: Infrastructure
More informationCourse 20533: Implementing Microsoft Azure Infrastructure Solutions
Course 20533: Implementing Microsoft Azure Infrastructure Solutions Overview About this course This course is aimed at experienced IT Professionals who currently administer their on-premises infrastructure.
More informationMicrosoft Research Windows Azure for Research Training
Copyright 2013 Microsoft Corporation. All rights reserved. Except where otherwise noted, these materials are licensed under the terms of the Apache License, Version 2.0. You may use it according to the
More informationOpenNebula Cloud Innovation and Case Studies for Telecom
Telecom Cloud Standards Information Day Hyatt Regency, Santa Clara, CA, USA 6-7 December, 2010 OpenNebula Cloud Innovation and Case Studies for Telecom Constantino Vázquez Blanco DSA-Research.org Distributed
More informationOpenNebula Leading Innovation in Cloud Computing Management
OW2 Annual Conference 2010 Paris, November 24th, 2010 OpenNebula Leading Innovation in Cloud Computing Management Ignacio M. Llorente DSA-Research.org Distributed Systems Architecture Research Group Universidad
More informationCultural Heritage Institutions, Metadata Aggregators and The Cloud Aleksandra Nowak, Marcin Werla Poznań Supercomputing and Networking Center
Cultural Heritage Institutions, Metadata Aggregators and The Cloud Aleksandra Nowak, Marcin Werla Poznań Supercomputing and Networking Center ECloud and LoCloud are funded by the European Commission's
More informationEarly Cloud Experiences with the Kepler Scientific Workflow System
Available online at www.sciencedirect.com Procedia Computer Science 9 (2012 ) 1630 1634 International Conference on Computational Science, ICCS 2012 Early Cloud Experiences with the Kepler Scientific Workflow
More information1 P a g e Delivering Self -Service Cloud application service using Oracle Enterprise Manager 12c
Delivering Self-service Cloud application services using Oracle Enterprise Manager 12c Kai Yu, Senior Principal Engineer, Oracle Solutions Engineering, Dell Inc ABSTRACT Oracle Self-Service provisioning
More informationglobus online Cloud-based services for (reproducible) science Ian Foster Computation Institute University of Chicago and Argonne National Laboratory
globus online Cloud-based services for (reproducible) science Ian Foster Computation Institute University of Chicago and Argonne National Laboratory Computation Institute (CI) Apply to challenging problems
More informationAutomating Big Data Benchmarking for Different Architectures with ALOJA
www.bsc.es Jan 2016 Automating Big Data Benchmarking for Different Architectures with ALOJA Nicolas Poggi, Postdoc Researcher Agenda 1. Intro on Hadoop performance 1. Current scenario and problematic 2.
More informationHow To Run A Cloud Server On A Server Farm (Cloud)
StratusLab: Darn Simple Cloud Charles (Cal) Loomis (CNRS/LAL & SixSq Sàrl) FOSDEM 13: Cloud Devroom (3 February 2013) StratusLab What is it? Complete Infrastructure as a Service (IaaS) cloud distribution
More informationSUSE Cloud 2.0. Pete Chadwick. Douglas Jarvis. Senior Product Manager pchadwick@suse.com. Product Marketing Manager djarvis@suse.
SUSE Cloud 2.0 Pete Chadwick Douglas Jarvis Senior Product Manager pchadwick@suse.com Product Marketing Manager djarvis@suse.com SUSE Cloud SUSE Cloud is an open source software solution based on OpenStack
More informationCloud Computing. Chapter 1 Introducing Cloud Computing
Cloud Computing Chapter 1 Introducing Cloud Computing Learning Objectives Understand the abstract nature of cloud computing. Describe evolutionary factors of computing that led to the cloud. Describe virtualization
More informationMicrosoft Research Microsoft Azure for Research Training
Copyright 2014 Microsoft Corporation. All rights reserved. Except where otherwise noted, these materials are licensed under the terms of the Apache License, Version 2.0. You may use it according to the
More informationScientific and Technical Applications as a Service in the Cloud
Scientific and Technical Applications as a Service in the Cloud University of Bern, 28.11.2011 adapted version Wibke Sudholt CloudBroker GmbH Technoparkstrasse 1, CH-8005 Zurich, Switzerland Phone: +41
More informationINTRODUCTION APPLICATION DEPLOYMENT WITH ORACLE VIRTUAL ASSEMBLY
SIMPLIFYING APPLICATION DEPLOYMENT IN CLOUD USING VIRTUAL ASSEMBLIES AND EM 12C Kai Yu, Dell Inc. ABSTRACT Oracle virtual assemblies provide a great way to simply the deployment of enterprise-class multi-tier
More informationSilviu Panica, Marian Neagul, Daniela Zaharie and Dana Petcu (Romania)
Silviu Panica, Marian Neagul, Daniela Zaharie and Dana Petcu (Romania) Outline Introduction EO challenges; EO and classical/cloud computing; EO Services The computing platform Cluster -> Grid -> Cloud
More informationFuture computing platforms for biodiversity science
www.bsc.es Future computing platforms for biodiversity science Daniele Lezzi Rome, 5 September 2013 Motivation Lack of service integration and interoperability of research e- Infrastructure e-irg 2013
More informationImplementing Microsoft Azure Infrastructure Solutions
Course Code: M20533 Vendor: Microsoft Course Overview Duration: 5 RRP: 2,025 Implementing Microsoft Azure Infrastructure Solutions Overview This course is aimed at experienced IT Professionals who currently
More informationSC12 Cloud Compu,ng for Science Tutorial: Introduc,on to Infrastructure Clouds
SC12 Cloud Compu,ng for Science Tutorial: Introduc,on to Infrastructure Clouds John Breshnahan, Patrick Armstrong, Kate Keahey, Pierre Riteau Argonne National Laboratory Computation Institute, University
More informationGlobus Research Data Management: Introduction and Service Overview. Steve Tuecke Vas Vasiliadis
Globus Research Data Management: Introduction and Service Overview Steve Tuecke Vas Vasiliadis Presentations and other useful information available at globus.org/events/xsede15/tutorial 2 Thank you to
More informationLinux/Open Source and Cloud computing Wim Coekaerts Senior Vice President, Linux and Virtualization Engineering
Linux/Open Source and Cloud computing Wim Coekaerts Senior Vice President, Linux and Virtualization Engineering NIST Definition of Cloud Computing Cloud computing is a model for enabling convenient, on-demand
More informationPractical Course on. Bioinformatics and Comparative Genomes Analyses. University of La Réunion 2013 May 6-18
Practical Course on Bioinformatics and Comparative Genomes Analyses University of La Réunion 2013 May 6-18 http://www.pasteur.fr/~tekaia/bcga2013.html Abstract The course will be organized within the frame
More information-> Integration of MAPHiTS in Galaxy
Enabling NGS Analysis with(out) the Infrastructure, 12:0512 Development of a workflow for SNPs detection in grapevine From Sets to Graphs: Towards a Realistic Enrichment Analy species: MAPHiTS -> Integration
More information