Institut Français de Bioinformatique: A Cloud for the Life Sciences


1 Institut Français de Bioinformatique: A Cloud for the Life Sciences
Christophe Blanchet
Institut Français de Bioinformatique (IFB), French Institute of Bioinformatics, ELIXIR French Node
CNRS UMS, Gif-sur-Yvette, France

2 Sequencing data
Complete genome sequencing has become a lab commodity with NGS (cheap and efficient).
Sources: www.genomesonline.org, www.politigenomics.com/next-generation-

3 Other experimental data (France and Europe)

4 EMBL-EBI data resources growth
Source: EMBL-EBI Annual Report 2013

5 Experimental platforms in biology
National platforms (GIS IBISA):
  Platform type                     Number
  Cell imaging                      19
  Genomics, transcriptomics         16
  Proteomics                        13
  Structural biology, biophysics    11
Location of the platforms (source: omicsmaps.com)
Map legend: NGS, IMG, PRO = biological platforms (genomics, imaging, proteomics...); BI = bioinformatics center; C = cloud resources; scientists.
Intermediate sites spread the storage and compute load while keeping the resources close to the scientists.

6 Infrastructures in biology
Many bioinformatics tools and services are available to process and visualize biological data.

7 IFB mission
IFB is a national research infrastructure providing services in bioinformatics.
Mission: to make core bioinformatics resources available to the national life science research community:
- by providing support for biology programs
  - project support
  - user training
- by setting up an IT infrastructure devoted to the management and analysis of biological data
  - hardware resources: CPUs, disks, etc.
  - availability of biological data collections
  - deployment of bioinformatics tools (cloud)
- by acting as an intermediary between the needs of the life science community and the bioinformatics and computer science research community
- by being the ELIXIR French node

8 IFB structure
IFB consists of:
- a network of 6 regional centers (more than 20 platforms, PFs)
  - about 110 FTE permanent staff plus 70 fixed-term contract (FTC) staff
- a national node, IFB-core
  - 10 FTE permanent staff planned (currently 3) plus a few FTC staff

9 IFB-core: IFB's national hub
IFB-core consists of two teams:
- the "Communication, Training, Exploitation" unit, in charge of the web site and of a training program based on e-learning
- the "Infrastructure" unit, in charge of the IFB national IT infrastructure

10 IFB-core tasks
- to provide technical and administrative support to IFB
- to implement IFB's scientific policy and facilitate the dissemination of its actions
- to ensure effective coordination between the platforms (PFs)
- to serve as an interface by providing a single entry point for IFB partners (supervisory authorities, the life science community, European and national bioinformatics communities)
- to set up and manage IFB's national IT infrastructure
- to facilitate access to this IT infrastructure by deploying an academic cloud

11 IFB e-infrastructure
- Support: help members deploy and use their tools
- e-infrastructure: hardware, biological data collections, bioinformatics tools
- An academic cloud for life sciences
  - a core resource: IFB-core, hosted at the CNRS IDRIS supercomputing center (Paris)
  - plus regional resources: 6 regional bioinformatics centers with 2 clouds, 11,000 cores and more than 6 PB, but more than 20 bioinformatics platforms
  - create a federation of clouds for life sciences
- Technical organization
  - GRISBI: a national technical working group (all platforms)
  - participation in ELIXIR task forces
Map labels: RENABI-GO, RENABI-SO, APLIBIO, RENABI-NE, IFB-core, PRABI, RENABI-GS
Cloud resources
Columns: resource; location; compute cores; storage (TB); RAM (TB); max VM size; technology
  IFB-core: CNRS-IDRIS, Paris; c 256GB; StratusLab
  IFB-core 2015: CNRS-IDRIS, Paris; 3, c 1TB; StratusLab
  IFB-core 2016: CNRS-IDRIS, Paris; 10,000; 2,000-96c 2TB; StratusLab
  Genocloud: IFB-GO, Rennes; ONE

12 Cloud?
Deployment models: public, community, private, hybrid
Service models: SaaS, PaaS, IaaS
Acknowledgment: C. Loomis

13 A cloud driven through a web dashboard

14 Ready-to-use bioinformatics cloud appliances
Bioinformatics appliances are ordinary virtual machines:
- small: a few GB, easy to convert to most virtualization formats
- installed and preconfigured with bioinformatics tools, e.g. BLAST, ClustalW, ARIA, MEME, HMMer, TopHat, BWA, Samtools, etc.
- referenced in a marketplace devoted to bioinformatics tools
Diagram labels: tools (BLAST, FastA, SSearch, OMSSA, X!tandem, PeptideShaker, ClustalW2, Clustal Omega, Muscle, ARIA, BWA, HMMer, TopHat, samtools, fastqc, Galaxy, R) on a Linux system; Bioinformatics Marketplace (Structures, Sequences, Proteomics, Galaxy...); virtual machines; create new cloud services.

15 Current bioinformatics IFB
Diagram of the appliance stack:
- Scientific apps (CLI, virtual desktop, web): Galaxy MODAL, Proteomics, Galaxy, Galaxy AVIESAN 2013, RSAT, PhyML, RSAT mini, biocompute, Aria Node
- Utilities and data management: biodata, BioMaj, BlobSeer, biodata NFS, Cassandra, biohadoop
- Base OS: CentOS, Ubuntu

16 Run bioinformatics appliances
The bioinformatics marketplace is both:
- a virtual machine repository, which stores life science VMs
- a catalogue, which helps users select the appropriate VM for their analysis
Workflow shown in the diagram:
- (1) Select tools, (2) launch: scientists can filter the appliances through a web interface to identify and launch the appropriate ones.
- (3) Use tools: scientists have access to their own cloud resources through a web portal, a remote virtual desktop or SSH.
Diagram labels: Bioinformatics Marketplace (Structures, Sequences, Proteomics); public data (UNIPROT, EMBL, Genomes, PDB, PROSITE); IDB Cloud; tool VMs (BLAST, ClustalW2, Galaxy, etc.); move cloud virtual machines; analyze data.
Images are filtered with metadata related to bioinformatics: using the <bio:tool> attribute in VM manifests, scientists can select the appropriate appliance according to the tools required for their analyses, e.g. the BLAST tool.
Appliances can be deployed on several clouds.
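The metadata-based selection described on this slide can be sketched in a few lines of Python. This is only an illustration: the manifest layout, the namespace URI and the appliance names below are assumptions, and only the `<bio:tool>` tag name comes from the slide. Real marketplace manifests are likely to carry many more fields.

```python
import xml.etree.ElementTree as ET

# Hypothetical namespace URI: the slide only names the <bio:tool> tag.
BIO_NS = {"bio": "http://example.org/bio"}

# Hypothetical, heavily simplified manifests keyed by appliance name.
MANIFESTS = {
    "biocompute": """<manifest xmlns:bio="http://example.org/bio">
        <bio:tool>BLAST</bio:tool>
        <bio:tool>ClustalW2</bio:tool>
    </manifest>""",
    "proteomics-desktop": """<manifest xmlns:bio="http://example.org/bio">
        <bio:tool>OMSSA</bio:tool>
        <bio:tool>X!Tandem</bio:tool>
    </manifest>""",
}


def tools_in(manifest_xml):
    """Return the set of tool names declared in one manifest."""
    root = ET.fromstring(manifest_xml)
    return {el.text.strip() for el in root.findall("bio:tool", BIO_NS)}


def appliances_with(required_tools, manifests):
    """Return the sorted names of appliances that declare every required tool."""
    required = set(required_tools)
    return sorted(
        name for name, xml in manifests.items() if required <= tools_in(xml)
    )


print(appliances_with(["BLAST"], MANIFESTS))
```

Selecting on a set of required tools, rather than on a single name, lets a user ask for an appliance that covers a whole analysis (for example both OMSSA and X!Tandem).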

17 Storage for biological data
- Upload your data: sftp/http/s3, with command-line clients (scp, sftp) or GUI clients (Cyberduck, Transmit, Filezilla, ...)
- Public data sources (Genomes, EMBL, PDB, UNIPROT, PROSITE) are shared read-only over NFS and used by BLAST, Clustal, etc.
- Launch jobs over ssh (PaaS, IaaS)
- Get your results: sftp/http/s3, with the same command-line and GUI clients
Diagram labels: Bioinformatics Cloud portal; identity management (users j. doe, e. martin, you, chb, cg); virtual disks for user data; master and storage VM; worker VMs; shared file system; ARIA; CNS.
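As a rough illustration of the command-line path (scp) for uploading data and fetching results, the following Python sketch only builds the command lines. The user name, host name and paths are hypothetical placeholders, not actual IFB endpoints.

```python
import shlex


def upload_command(local_path, user, host, remote_dir):
    """Build the scp argument list that copies a local file to the VM."""
    return ["scp", local_path, f"{user}@{host}:{remote_dir}/"]


def download_command(remote_path, user, host, local_dir):
    """Build the scp argument list that copies a result file from the VM."""
    return ["scp", f"{user}@{host}:{remote_path}", local_dir]


# Hypothetical user, host and paths, for illustration only.
up = upload_command("reads.fastq", "jdoe", "vm.example.org", "/data")
down = download_command("/data/results.txt", "jdoe", "vm.example.org", ".")

print(shlex.join(up))
print(shlex.join(down))
```

The argument lists can be passed to `subprocess.run(cmd, check=True)` to perform the actual transfer once a VM is running and reachable over SSH.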

18 Monitor your usage

19 Moving VMs vs data
Diagram: biological platforms (NGS, IMG, PRO: genomics, imaging, proteomics...), bioinformatics centers (BI), cloud resources (C) and scientists, connected to the IFB life sciences marketplace and VM repository. It contrasts moving VMs from the repository to the sites with moving the NGS, IMG and PRO data.
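A back-of-the-envelope calculation illustrates why moving an appliance can be preferable to moving the data. All numbers below are assumptions chosen for illustration (a 5 GB appliance, in line with the "few GB" stated earlier, a 2 TB dataset and an ideal 1 Gbit/s link); none of them come from the slides.

```python
def transfer_seconds(size_gb, link_gbit_per_s):
    """Ideal transfer time in seconds, ignoring protocol overhead and contention."""
    size_gbit = size_gb * 8  # gigabytes to gigabits
    return size_gbit / link_gbit_per_s


APPLIANCE_GB = 5      # assumption: a "few GB" appliance
DATASET_GB = 2000     # assumption: 2 TB of sequencing data
LINK_GBIT_S = 1.0     # assumption: 1 Gbit/s between sites

vm_time = transfer_seconds(APPLIANCE_GB, LINK_GBIT_S)
data_time = transfer_seconds(DATASET_GB, LINK_GBIT_S)

print(f"appliance: {vm_time:.0f} s")
print(f"dataset:   {data_time / 3600:.1f} h")
```

Under these assumptions the appliance moves in 40 seconds while the dataset needs about 4.4 hours, a ratio equal to the size ratio (400).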

20 Case 1: Standard bioinformatics node
- Use your own instance(s)
- With the pre-installed standard appliance Biocompute
  - bioinformatics tools: BLAST, FastA, SSearch, HMM, ...; ClustalW2, Clustal-Omega, Muscle, ...; Bowtie(2), BWA, samtools, ...; MEME, R, etc.
- Connected to public reference data
  - Uniprot, EMBL, genomes, PDB, etc.
  - automatically shared with the VMs
- Cluster mode
  - turns several instances into a single virtual cluster
  - shared file system
  - batch scheduling
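One common way to exploit such a virtual cluster is to split a multi-sequence query file into one chunk per worker and submit one job per chunk to the batch scheduler. The slides do not describe this pattern, so the sketch below is an assumption about typical usage rather than the IFB setup; the function names are hypothetical.

```python
def read_fasta(text):
    """Parse FASTA text into a list of (header, sequence) tuples."""
    records, header, parts = [], None, []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # A new header closes the previous record, if any.
            if header is not None:
                records.append((header, "".join(parts)))
            header, parts = line[1:], []
        else:
            parts.append(line)
    if header is not None:
        records.append((header, "".join(parts)))
    return records


def split_round_robin(records, n_workers):
    """Distribute records over n_workers chunks, one record at a time."""
    chunks = [[] for _ in range(n_workers)]
    for i, record in enumerate(records):
        chunks[i % n_workers].append(record)
    return chunks


EXAMPLE = """>seq1
ACGT
ACGT
>seq2
GGCC
>seq3
TTAA
"""

chunks = split_round_robin(read_fasta(EXAMPLE), 2)
for worker, chunk in enumerate(chunks):
    print(worker, [header for header, _ in chunk])
```

Each chunk would then be written to the shared file system and processed by a separate job (for example one BLAST run per chunk), with the results concatenated at the end.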

21 Case 2: Cloud Galaxy portal
- Web interface portal
- Galaxy is widely used in the community
  - to analyse NGS data (mainly, but not only)
  - connected to community knowledge: data and indexes, tools, workflows
- Preserves workflows and results (cloud virtual disk)
- Helps integrate monthly updates and new tools
- The cloud allows different appliances to be built from the same base:
  - generic, with common tools for NGS
  - specific to a set of tools, e.g. MODAL (MOdels for Data Analysis and Learning)
  - for training: a special appliance with dedicated datasets, tools or workflows (French AVIESAN school 2013)
  - domain-specific appliances: RNA-seq, ChIP-seq, etc. (planned)

22 Galaxy IFB cloud
- Created by interactive (manual) installation on a Linux system
- Main appliance: Galaxy with standard tools
- Devoted appliances
  - for a specific event (training, demo, ...): Galaxy AVIESAN 2013
  - for tools: e.g. Galaxy MODAL
- Versions: 1.0, 2.0, ... N

23 Case 3: RSAT, a specialized software suite for the analysis of noncoding sequences
- motif discovery in promoters of co-expressed genes
- ChIP-seq analysis
- evolutionarily conserved motifs (phylogenetic footprints)
Contact: J. van Helden (TAGC). Used for the ECCB'14 tutorial T01.
RSAT offers a series of tools dedicated to the detection of regulatory signals in noncoding sequences. Starting from a list of genes of interest, you can:
- retrieve the upstream sequences over a desired distance,
- discover putative regulatory signals,
- search for the matching positions of these signals in your original dataset or in whole genomes,
- display the results graphically in the form of a feature map.
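To make the "search the matching positions" step concrete, here is a minimal Python sketch that scans a sequence for a motif written with IUPAC degenerate codes. It is not RSAT's implementation: it is a plain regular-expression scan of the forward strand only, and the function name and example sequences are made up for illustration.

```python
import re

# IUPAC nucleotide codes mapped to regular-expression character classes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]", "Y": "[CT]", "S": "[CG]", "W": "[AT]",
    "K": "[GT]", "M": "[AC]", "B": "[CGT]", "D": "[AGT]",
    "H": "[ACT]", "V": "[ACG]", "N": "[ACGT]",
}


def motif_positions(motif, sequence):
    """Return 0-based start positions of all matches, overlapping ones included."""
    pattern = "".join(IUPAC[code] for code in motif.upper())
    # The lookahead consumes no characters, so overlapping matches are reported.
    regex = re.compile(f"(?=({pattern}))")
    return [match.start() for match in regex.finditer(sequence.upper())]


print(motif_positions("TATA", "GGTATATACC"))
print(motif_positions("CACGTK", "AACACGTGCACGTT"))
```

The returned positions are what a feature map would plot along each upstream sequence. A fuller scan would also search the reverse complement.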

24 Case 4: Proteomics virtual desktop
Motivation
- collaboration with a mass spectrometry platform
- running out of space on their local resources
Protein identification tools
- mass experimental data
- reference databases: nr, Swiss-Prot
- reference screening tools: OMSSA, X!Tandem
User interface
- remote virtual desktop (NX)
- reference GUIs: SearchGUI, PeptideShaker
Source: PeptideShaker site

25 IFB: an academic cloud for life sciences
IFB's cloud
- simplifies access to biological data and tools
- integrates tools and pipelines in turnkey cloud appliances
- is tightly connected to existing bioinformatics resources, e.g. public reference data sources
- offers 16 bioinformatics appliances: standard compute nodes, proteomics virtual desktop, Galaxy portal, structural biology
- has more than 50 users from all IFB regional centers
Bioinformatics appliances registry
- references images related to life science
- helps users select the appropriate VM for their analysis
(The marketplace and workflow diagrams from slides 14 and 16 are repeated on this slide.)

26 Perspectives
- Create more bioinformatics appliances
  - built by the experts of each domain
  - published in the IFB marketplace to make them available to scientists
- IFB has established priorities for 5 scientific domains
  - microbial bioinformatics
  - evolutionary bioinformatics
  - plant bioinformatics
  - structural biology
  - NGS data processing
- and technical pilots
  - interoperability of appliances on different cloud infrastructures
  - registry of distributed multi-cloud datasets
  - live remote cloud processing of sequencing data

27 Questions?
Acknowledgments
- Clément Gauthey (CNRS IDRIS, formerly IDB-IBCP)
- Developers who integrated their tools as IFB cloud appliances: Samuel Blanck (Inria Lille), Jacques van Helden (TAGC), you?
- StratusLab members
- IFB's funding by the French program PIA INBS 2012

Sequencing data. And other experimental data. EMBL-EBI data resources growth

Sequencing data. And other experimental data. EMBL-EBI data resources growth Sequencing Institut Français de Bioinformatique, Un loud pour les Sciences du Vivant source: www.genomesonline.org source: www.politigenomics.com/next-generation- hristophe Blanchet Institut Français de

More information

Cloud pour la Bioinformatique

Cloud pour la Bioinformatique Cloud pour la Bioinformatique Christophe Blanchet Institut Français de Bioinformatique - IFB French Institute of Bioinformatics - ELIXIR French Node CNRS UMS3601 - Gif-sur-Yvette - FRANCE Sequencing data

More information

Cloud Ready for Bioinformatics?

Cloud Ready for Bioinformatics? IDB acknowledges co-funding by the European Community's Seventh Framework Programme (INFSO-RI-261552) and the French National Research Agency's Arpege Programme (ANR-10-SEGI-001) Cloud Ready for Bioinformatics?

More information

Bioinformatique sur Cloud Cas d usage avec le portail Galaxy

Bioinformatique sur Cloud Cas d usage avec le portail Galaxy Bioinformatique sur Cloud Cas d usage avec le portail Galaxy Christophe Blanchet Institute of Biology and Chemistry of Proteins Head of Service Infrastructure for Biology - IDB CNRS-IBCP FR3302 - LYON

More information

Une e-infrastructure nationale en bioinformatique

Une e-infrastructure nationale en bioinformatique Une e-infrastructure nationale en bioinformatique Christophe BLANCHET Institut Français de Bioinformatique - IFB French Institute of Bioinformatics - ELIXIR-FR CNRS UMS3601 - Gif-sur-Yvette - FRANCE JDEV

More information

IFB s e-infrastructure

IFB s e-infrastructure IFB s e-infrastructure Christophe Blanchet Institut Français de Bioinformatique - IFB French Institute of Bioinformatics - ELIXIR-FR CNRS UMS3601 - Gif-sur-Yvette - FRANCE Life Sciences Platforms in France

More information

Le cloud IFB et son instance Galaxy

Le cloud IFB et son instance Galaxy Le cloud IFB et son instance Galaxy Christophe BLANCHET Institut Français de Bioinformatique - IFB French Institute of Bioinformatics - ELIXIR-FR CNRS UMS3601 - Gif-sur-Yvette - FRANCE Ecole Bioinformatique

More information

Le cloud IFB et son instance Galaxy

Le cloud IFB et son instance Galaxy Le cloud IFB et son instance Galaxy Christophe BLANCHET Institut Français de Bioinformatique - IFB French Institute of Bioinformatics - ELIXIR-FR CNRS UMS3601 - Gif-sur-Yvette - FRANCE Ecole Bioinformatique

More information

Deployment of BioXSDenabled services on a Cloud. christophe.blanchet@ibcp.fr

Deployment of BioXSDenabled services on a Cloud. christophe.blanchet@ibcp.fr Deployment of BioXSDenabled services on a Cloud Outline IBCP, provider of BioXSD-enabled services Cloud Computing RENABI GRISBI, French infrastructure Bioinformatics Integrated s gbio-pbil.ibcp.fr/ws GBIO

More information

Ins$tut Français de Bioinforma$que Current situa+on and prospect. IFB General Assembly Gif- sur- Yve=e, January 9 2015

Ins$tut Français de Bioinforma$que Current situa+on and prospect. IFB General Assembly Gif- sur- Yve=e, January 9 2015 Ins$tut Français de Bioinforma$que Current situa+on and prospect IFB General Assembly Gif- sur- Yve=e, January 9 2015 Background 2010: Na+onal Infrastructures in Biology and Health call from the Investment

More information

E-SCIENCE IN WESTERN FRANCE : THE BEGINNING

E-SCIENCE IN WESTERN FRANCE : THE BEGINNING E-SCIENCE IN WESTERN FRANCE : THE BEGINNING Yvan Le Bras Olivier Collin Jacques Nicolas CNRS UMR 6074 IRISA-INRIA Context Now : Genomics : Next Generation Sequencing Now : Proteomics Next : Bio-imaging

More information

Alternative Deployment Models for Cloud Computing in HPC Applications. Society of HPC Professionals November 9, 2011 Steve Hebert, Nimbix

Alternative Deployment Models for Cloud Computing in HPC Applications. Society of HPC Professionals November 9, 2011 Steve Hebert, Nimbix Alternative Deployment Models for Cloud Computing in HPC Applications Society of HPC Professionals November 9, 2011 Steve Hebert, Nimbix The case for Cloud in HPC Build it in house Assemble in the cloud?

More information

Big Data in BioMedical Sciences. Steven Newhouse, Head of Technical Services, EMBL-EBI

Big Data in BioMedical Sciences. Steven Newhouse, Head of Technical Services, EMBL-EBI Big Data in BioMedical Sciences Steven Newhouse, Head of Technical Services, EMBL-EBI Big Data for BioMedical Sciences EMBL-EBI: What we do and why? Challenges & Opportunities Infrastructure Requirements

More information

SURFsara HPC Cloud Workshop

SURFsara HPC Cloud Workshop SURFsara HPC Cloud Workshop www.cloud.sara.nl Tutorial 2014-06-11 UvA HPC and Big Data Course June 2014 Anatoli Danezi, Markus van Dijk cloud-support@surfsara.nl Agenda Introduction and Overview (current

More information

A curated Domain centric shared Docker registry linked to the Galaxy toolshed

A curated Domain centric shared Docker registry linked to the Galaxy toolshed A curated Domain centric shared Docker registry linked to the Galaxy toolshed François Moreews 1, Olivier Sallou 2, Yvan le Bras 2, Marie Grosjean 3, Cyril Monjeaud 2, Thomas Darde 4, Olivier Collin 2,

More information

Big Data in BioMedical Sciences. Steven Newhouse, Head of Technical Services, EMBL-EBI

Big Data in BioMedical Sciences. Steven Newhouse, Head of Technical Services, EMBL-EBI Big Data in BioMedical Sciences Steven Newhouse, Head of Technical Services, EMBL-EBI Big Data for BioMedical Sciences EMBL-EBI: What we do and why? Challenges & Opportunities Infrastructure Requirements

More information

SURFsara HPC Cloud Workshop

SURFsara HPC Cloud Workshop SURFsara HPC Cloud Workshop doc.hpccloud.surfsara.nl UvA workshop 2016-01-25 UvA HPC Course Jan 2016 Anatoli Danezi, Markus van Dijk cloud-support@surfsara.nl Agenda Introduction and Overview (current

More information

StratusLab project. Standards, Interoperability and Asset Exploitation. Vangelis Floros, GRNET

StratusLab project. Standards, Interoperability and Asset Exploitation. Vangelis Floros, GRNET StratusLab project Standards, Interoperability and Asset Exploitation Vangelis Floros, GRNET EGI Technical Forum 2011 19-22 September 2011, Lyon, France StratusLab is co-funded by the European Community

More information

Cloud Computing Solutions for Genomics Across Geographic, Institutional and Economic Barriers

Cloud Computing Solutions for Genomics Across Geographic, Institutional and Economic Barriers Cloud Computing Solutions for Genomics Across Geographic, Institutional and Economic Barriers Ntinos Krampis Asst. Professor J. Craig Venter Institute kkrampis@jcvi.org http://www.jcvi.org/cms/about/bios/kkrampis/

More information

UGENE Quick Start Guide

UGENE Quick Start Guide Quick Start Guide This document contains a quick introduction to UGENE. For more detailed information, you can find the UGENE User Manual and other special manuals in project website: http://ugene.unipro.ru.

More information

Three data delivery cases for EMBL- EBI s Embassy. Guy Cochrane www.ebi.ac.uk

Three data delivery cases for EMBL- EBI s Embassy. Guy Cochrane www.ebi.ac.uk Three data delivery cases for EMBL- EBI s Embassy Guy Cochrane www.ebi.ac.uk EMBL European Bioinformatics Institute Genes, genomes & variation European Nucleotide Archive 1000 Genomes Ensembl Ensembl Genomes

More information

DATA MANAGEMENT PLAN IN THE REAL LIFE SCIENCES

DATA MANAGEMENT PLAN IN THE REAL LIFE SCIENCES DATA MANAGEMENT PLAN IN THE REAL LIFE SCIENCES Yvan Le Bras Cyril Monjeaud Olivier Collin Jacques Nicolas CNRS UMR 6074 IRISA-INRIA Context Now : Genomics : Next Generation Sequencing Now : Proteomics

More information

Eoulsan Analyse du séquençage à haut débit dans le cloud et sur la grille

Eoulsan Analyse du séquençage à haut débit dans le cloud et sur la grille Eoulsan Analyse du séquençage à haut débit dans le cloud et sur la grille Journées SUCCES Stéphane Le Crom (UPMC IBENS) stephane.le_crom@upmc.fr Paris November 2013 The Sanger DNA sequencing method Sequencing

More information

E-SCIENCE IN WESTERN FRANCE :

E-SCIENCE IN WESTERN FRANCE : E-SCIENCE IN WESTERN FRANCE : BEGINS Yvan Le Bras Cyril Monjeaud Olivier Collin & the GenOuest team CNRS UMR 6074 IRISA-INRIA Context Now : Genomics : Next Generation Sequencing Now : Proteomics Next :

More information

Towards a galaxy.prabi.fr

Towards a galaxy.prabi.fr Towards a galaxy.prabi.fr IFB- galaxy Day 04/12/2013 Navra5l V., PhD, UCBL navra5l@prabi.fr www.prabi.fr One among the six IFB regional nodes Region: Rhône- Alpes Director: Guy Perrière 11 Research Team,

More information

Big Data and Cloud Computing for GHRSST

Big Data and Cloud Computing for GHRSST Big Data and Cloud Computing for GHRSST Jean-Francois Piollé (jfpiolle@ifremer.fr) Frédéric Paul, Olivier Archer CERSAT / Institut Français de Recherche pour l Exploitation de la Mer Facing data deluge

More information

Cloud computing for research

Cloud computing for research & for research Cécile Cavet cecile.cavet at apc.univ-paris7.fr Centre François Arago (FACe), Laboratoire APC, Université Paris Diderot January 11, 2015 Plan & 1 2 & 3 4 What is cloud computing? &??? Provides

More information

Enabling multi-cloud resources at CERN within the Helix Nebula project. D. Giordano (CERN IT-SDC) HEPiX Spring 2014 Workshop 23 May 2014

Enabling multi-cloud resources at CERN within the Helix Nebula project. D. Giordano (CERN IT-SDC) HEPiX Spring 2014 Workshop 23 May 2014 Enabling multi-cloud resources at CERN within the Helix Nebula project D. Giordano (CERN IT-) HEPiX Spring 2014 Workshop This document produced by Members of the Helix Nebula consortium is licensed under

More information

Hadoopizer : a cloud environment for bioinformatics data analysis

Hadoopizer : a cloud environment for bioinformatics data analysis Hadoopizer : a cloud environment for bioinformatics data analysis Anthony Bretaudeau (1), Olivier Sallou (2), Olivier Collin (3) (1) anthony.bretaudeau@irisa.fr, INRIA/Irisa, Campus de Beaulieu, 35042,

More information

Cloud services in PL-Grid and EGI Infrastructures

Cloud services in PL-Grid and EGI Infrastructures 1 Cloud services in PL-Grid and EGI Infrastructures J. Meizner, M. Radecki, M. Pawlik, T. Szepieniec ACK Cyfronet AGH Cracow Grid Workshop 2012, Kraków, 22.10.2012 Overview 2 Different types of Compute

More information

Steven Newhouse, Head of Technical Services

Steven Newhouse, Head of Technical Services Challenges at EMBL-EBI Steven Newhouse, Head of Technical Services European Bioinformatics Institute Outstation of the European Molecular Biology Laboratory International organisation created by treaty

More information

New solutions for Big Data Analysis and Visualization

New solutions for Big Data Analysis and Visualization New solutions for Big Data Analysis and Visualization From HPC to cloud-based solutions Barcelona, February 2013 Nacho Medina imedina@cipf.es http://bioinfo.cipf.es/imedina Head of the Computational Biology

More information

Planning, Provisioning and Deploying Enterprise Clouds with Oracle Enterprise Manager 12c Kevin Patterson, Principal Sales Consultant, Enterprise

Planning, Provisioning and Deploying Enterprise Clouds with Oracle Enterprise Manager 12c Kevin Patterson, Principal Sales Consultant, Enterprise Planning, Provisioning and Deploying Enterprise Clouds with Oracle Enterprise Manager 12c Kevin Patterson, Principal Sales Consultant, Enterprise Manager Oracle NIST Definition of Cloud Computing Cloud

More information

Cloud BioLinux: Pre-configured and On-demand Bioinformatics Computing for the Genomics Community

Cloud BioLinux: Pre-configured and On-demand Bioinformatics Computing for the Genomics Community Cloud BioLinux: Pre-configured and On-demand Bioinformatics Computing for the Genomics Community Ntinos Krampis Asst. Professor J. Craig Venter Institute kkrampis@jcvi.org http://www.jcvi.org/cms/about/bios/kkrampis/

More information

University of Messina, Italy

University of Messina, Italy University of Messina, Italy IEEE MoCS 2011 Kerkyra - Greece June 28, 2011 Dr. Massimo Villari mvillari@unime.it Cross Cloud Federation Federated Cloud Scenario Cloud Middleware Model: the Stack The CLEVER

More information

Cloud BioLinux: Pre-configured and On-demand Bioinformatics Computing for the Genomics Community

Cloud BioLinux: Pre-configured and On-demand Bioinformatics Computing for the Genomics Community Cloud BioLinux: Pre-configured and On-demand Bioinformatics Computing for the Genomics Community Ntinos Krampis Asst. Professor J. Craig Venter Institute kkrampis@jcvi.org http://www.jcvi.org/cms/about/bios/kkrampis/

More information

Euro-BioImaging European Research Infrastructure for Imaging Technologies in Biological and Biomedical Sciences

Euro-BioImaging European Research Infrastructure for Imaging Technologies in Biological and Biomedical Sciences Euro-BioImaging European Research Infrastructure for Imaging Technologies in Biological and Biomedical Sciences WP11 Data Storage and Analysis Task 11.1 Coordination Deliverable 11.2 Community Needs of

More information

icer Bioinformatics Support Fall 2011

icer Bioinformatics Support Fall 2011 icer Bioinformatics Support Fall 2011 John B. Johnston HPC Programmer Institute for Cyber Enabled Research 2011 Michigan State University Board of Trustees. Institute for Cyber Enabled Research (icer)

More information

Building Storage Service in a Private Cloud

Building Storage Service in a Private Cloud Building Storage Service in a Private Cloud Sateesh Potturu & Deepak Vasudevan Wipro Technologies Abstract Storage in a private cloud is the storage that sits within a particular enterprise security domain

More information

Final Report on StratusLab Adoption

Final Report on StratusLab Adoption Final Report on StratusLab Adoption Charles Loomis, Mohammed Airaj, Marc-Elian Bégin, Christophe Blanchet, Evangelos Floros, Clément Gauthey To cite this version: Charles Loomis, Mohammed Airaj, Marc-Elian

More information

EMBL-EBI Web Services

EMBL-EBI Web Services EMBL-EBI Web Services Rodrigo Lopez Head of the External Services Team SME Workshop Piemonte 2011 EBI is an Outstation of the European Molecular Biology Laboratory. Summary Introduction The JDispatcher

More information

OPEN SOURCE AND BOTTOM-UP VRE APPROACH IN WESTERN FRANCE

OPEN SOURCE AND BOTTOM-UP VRE APPROACH IN WESTERN FRANCE OPEN SOURCE AND BOTTOM-UP VRE APPROACH IN WESTERN FRANCE Towards supporting accessible, reproducible, and transparent research in the life sciences Yvan Le Bras Cyril Monjeaud Olivier Collin, the GenOuest

More information

Virtualization & Cloud Computing (2W-VnCC)

Virtualization & Cloud Computing (2W-VnCC) Virtualization & Cloud Computing (2W-VnCC) DETAILS OF THE SYLLABUS: Basics of Networking Types of Networking Networking Tools Basics of IP Addressing Subnet Mask & Subnetting MAC Address Ports : Physical

More information

EGEE-2 NA4 Biomed Bioinformatics in CNRS

EGEE-2 NA4 Biomed Bioinformatics in CNRS Enabling Grids for E-sciencE EGEE-2 NA4 Biomed Bioinformatics in CNRS Christophe Blanchet Institute of Biology and Chemistry of Proteins Lyon, April 28, 2006 www.eu-egee.org Enabling Grids for E-sciencE

More information

HPC Cloud. Focus on your research. Floris Sluiter Project leader SARA

HPC Cloud. Focus on your research. Floris Sluiter Project leader SARA HPC Cloud Focus on your research Floris Sluiter Project leader SARA Why an HPC Cloud? Christophe Blanchet, IDB - Infrastructure Distributing Biology: Big task to port them all to your favorite architecture

More information

SixSq Cloud Capabilities

SixSq Cloud Capabilities SixSq Cloud Capabilities SlipStream: Mutli-cloud Management Platform Marc-Elian Bégin, CEO, Co-founder, SixSq HEPIA Cloud Masters, Lausanne, 2015 Locations Global Headquarters Geneva, Switzerland North

More information

Cloud-Based Big Data Analytics in Bioinformatics

Cloud-Based Big Data Analytics in Bioinformatics Cloud-Based Big Data Analytics in Bioinformatics Presented By Cephas Mawere Harare Institute of Technology, Zimbabwe 1 Introduction 2 Big Data Analytics Big Data are a collection of data sets so large

More information

Design and Building of IaaS Clouds

Design and Building of IaaS Clouds 21th May 2010 CloudViews 2010 Porto, Portugal Next Generation Data Center Summit Design and Building of IaaS Clouds Distributed Systems Architecture Research Group Universidad Complutense de Madrid This

More information

Bioinformatics Grid - Enabled Tools For Biologists.

Bioinformatics Grid - Enabled Tools For Biologists. Bioinformatics Grid - Enabled Tools For Biologists. What is Grid-Enabled Tools (GET)? As number of data from the genomics and proteomics experiment increases. Problems arise for the current sequence analysis

More information

Analysis of ChIP-seq data in Galaxy

Analysis of ChIP-seq data in Galaxy Analysis of ChIP-seq data in Galaxy November, 2012 Local copy: https://galaxy.wi.mit.edu/ Joint project between BaRC and IT Main site: http://main.g2.bx.psu.edu/ 1 Font Conventions Bold and blue refers

More information

vnebula Cloud. Made Easy. Introducing vnebula from Stream Networks. A simple, self-service cloud portal for our partner community.

vnebula Cloud. Made Easy. Introducing vnebula from Stream Networks. A simple, self-service cloud portal for our partner community. vnebula Cloud. Made Easy. Introducing vnebula from Stream Networks. A simple, self-service cloud portal for our partner community. Keep it simple Our partners asked us for a cloud portal that was simple

More information

Deploying Business Virtual Appliances on Open Source Cloud Computing

Deploying Business Virtual Appliances on Open Source Cloud Computing International Journal of Computer Science and Telecommunications [Volume 3, Issue 4, April 2012] 26 ISSN 2047-3338 Deploying Business Virtual Appliances on Open Source Cloud Computing Tran Van Lang 1 and

More information

Experiences and challenges in the development of the JASMIN cloud service for the environmental science community

Experiences and challenges in the development of the JASMIN cloud service for the environmental science community JASMIN (STFC/Stephen Kill) Experiences and challenges in the development of the JASMIN cloud service for the environmental science community ECMWF Visualisa-on in Meteorology Week, 28 September 2015 Philip

More information

Solution for private cloud computing

Solution for private cloud computing The CC1 system Solution for private cloud computing 1 Outline What is CC1? Features Technical details System requirements and installation How to get it? 2 What is CC1? The CC1 system is a complete solution

More information

Dutch HPC Cloud: flexible HPC for high productivity in science & business

Dutch HPC Cloud: flexible HPC for high productivity in science & business Dutch HPC Cloud: flexible HPC for high productivity in science & business Dr. Axel Berg SARA national HPC & e-science Support Center, Amsterdam, NL April 17, 2012 4 th PRACE Executive Industrial Seminar,

More information

EGI services for distribution and federation of data and computing

EGI services for distribution and federation of data and computing EGI services for distribution and federation of data and computing Tiziana Ferrari Technical Director, EGI.eu tiziana.ferrari@egi.eu March 2014 EGI-InSPIRE RI-261323 1 Accelerating Excellent Science MISSION.

More information

Large-scale Research Data Management and Analysis Using Globus Services. Ravi Madduri Argonne National Lab University of Chicago @madduri

Large-scale Research Data Management and Analysis Using Globus Services. Ravi Madduri Argonne National Lab University of Chicago @madduri Large-scale Research Data Management and Analysis Using Globus Services Ravi Madduri Argonne National Lab University of Chicago @madduri Outline Who we are Challenges in Big Data Management and Analysis

More information

ENABLING DATA TRANSFER MANAGEMENT AND SHARING IN THE ERA OF GENOMIC MEDICINE. October 2013

ENABLING DATA TRANSFER MANAGEMENT AND SHARING IN THE ERA OF GENOMIC MEDICINE. October 2013 ENABLING DATA TRANSFER MANAGEMENT AND SHARING IN THE ERA OF GENOMIC MEDICINE October 2013 Introduction As sequencing technologies continue to evolve and genomic data makes its way into clinical use and

More information

BioHPC Web Computing Resources at CBSU

BioHPC Web Computing Resources at CBSU BioHPC Web Computing Resources at CBSU 3CPG workshop Robert Bukowski Computational Biology Service Unit http://cbsu.tc.cornell.edu/lab/doc/biohpc_web_tutorial.pdf BioHPC infrastructure at CBSU BioHPC Web

More information

Enabling a federated environment to support biomedical research. Gianmauro Cuccuru CRS4

Enabling a federated environment to support biomedical research. Gianmauro Cuccuru CRS4 Enabling a federated environment to support biomedical research Gianmauro Cuccuru CRS4 ELIXIR connects national bioinformatics centres and EMBL- EBI into a sustainable European infrastructure for biological

More information
