Cloud pour la Bioinformatique
|
|
|
- Angelica James
- 10 years ago
- Views:
Transcription
1 Cloud pour la Bioinformatique Christophe Blanchet Institut Français de Bioinformatique - IFB French Institute of Bioinformatics - ELIXIR French Node CNRS UMS Gif-sur-Yvette - FRANCE
2 Sequencing data source: source: Complete genome sequencing become a lab commodity with NGS (cheap and efficient) Ecole IN2P3 Cloud 2014, 1er juillet let 2014, Lyon
3 And other experimental data FR - EU
4 EMBL-EBI data resources growth source: EMBL-EBI Annual Report 2013
5 Plateformes Expérimentales en Biologie Plateformes nationales (GIS IBISA) Nb Imagerie cellulaire 19 Génomique, Transcriptomique 16 Protéomique 13 Biologie structurale, biophysique 11 NGS BI C IMG Biological platform (Genomics, IMaGing, PROteomics...) Bioinformatics center Cloud resources Scientists PRO C NGS BI NGS PRO BI C Localisation des plateformes NGS PRO BI IMG PRO NGS PRO C IMG BI C C Source: omicsmaps.com Des sites intermédiaires permettent de répartir la charge en terme de stockage et de puissance de calcul tout en assurant une meilleure proximité avec les scientifiques
6 Infrastructures in Biology Lot of bioinformatics tools and services to treat and vizualize the biological data Ecole IN2P3 Cloud 2014, 1er juillet 2014, Lyon
7 Bioinformatics Today Biological data are big data 1552 online databases (NAR Database Issue 2014) Institut Sanger, UK, 5 PB - Beijing Genome Institute, China, 7 sites, 20.6 PB Big data in many places Analysing such data became difficult Scale-up of the analyses : gene/protein to complete genome/proteome,... Lot of different daily-used tools that need to be combined in workflows Usual interfaces: portals, Web services, Datacenters with ease of access/use C NGS NGS BI NGS BI PRO BI C IMG PRO IMG Distributed resources Experimental platforms: NGS, imaging,... Bioinformatics platforms Federation of datacenters C PRO C n
8 Cloud? Essential characteristics On-demand self-service No human intervention Broad network access Fast, reliable remote access Rapid elasticity Scale based on app. needs Resource pooling Multi-tenant sharing Measured service Direct or indirect economic model with measured use Deployment models Private Single administrative domain, limited Community Public number of users Different administrative domains with common interests & proc. People outside of institute s administrative domain Hybrid Federation via combination of other deployment models Service models Software as a Service (SaaS) Direct (scalable) hosting of end user applications Platform as a Service (PaaS) Framework and infrastructure for creating web applications Infrastructure as a Service (IaaS) Access to remote virtual Machines with root access nistpubs/ /sp pdf
9 Cloud IDB Cloud workbench for Biology Infrastructure Distributed for Biology Running since Sept IBCP FR3302 CNRS-Univ. lyon1, Lyon, France opened to Biology community 14 bioinformatics appliances: Galaxy portal, standard compute nodes, proteomics, virtual desktop, structural biology, users from all IFB regional centers PRABI 16, APLIBIO 28, RENABI-NE 13, -GO 7, -SO 2, - GS 5 VMs up to 32cores-768GB RAM Infrastructure Compute +900cores +4TB ram Standard nodes (32c-128GB) Bigmen nodes (64c 768GB) Storage +250TB Virtual disks, large-scale object storage (S3) Powered by StratusLab and CEPH RENATER Scientists 1giga Frontend 10giga eth IDB's Cloud Web portal Pdisk storage iscsi 10giga eth Object storage Cloud Hypervisors - std nodes: 32c 128GB - bigmem nodes: 64c 768 GB
10 French Institute of Bioinformatics - IFB Mission : to make available core bioinformatics resources to the national/ international life science research community. To provide support for national biology programs To provide an IT infrastructure devoted to management and analysis of biological data To act as a middleman between the life science community and the bioinformatics/computer science research community ELIXIR French Node optimizing the interactions and coordination between the national level and ELIXIR and other ESFRI infrastructures in biomedical and environmental field, promoting consistency and complementarities between the components offered by the ELIXIR French node and those of other European nodes
11 IFB e-infrastructure Support : help members to deploy and use their tools e-infrastructure: hardware, biology data collections, bioinformatics tools Academic cloud for life science a core ressource IFB-core hosted at CNRS IDRIS SC center (Paris) + regional resources 6 regional bioinformatics centers with 2 clouds 11,000 cores - 6 PB but +20 bioinformatics platforms Create a federation of clouds for life science Technical organization GRISBI: a national technical group (all national platforms) Participation to ELIXIR task forces RENABI-GO RENABI-SO APLIBIO RENABI-NE IFB-core PRABI RENABI-GS Cloud Ressources Location # Compute Cores # TB Storage # TB RAM Max VM size Technology IFB-core CNRS-IDRIS, Paris c 256GB StratusLab IFB-core 2014 CNRS-IDRIS, Paris 4, c 1TB StratusLab IFB-core 2015 CNRS-IDRIS, Paris 10,000 2,000-96c 2TB StratusLab idee-b PRABI-IBCP, Lyon 1, c 768GB StratusLab Genocloud IFB-GO, Rennes ONE
12 B B I Extended cloud functionalities for bioinformatics Bioinformatics appliances integrate bioinformatics tools and workflows Bioinformatics marketplace focus on bioinformatics appliances satisfy visibility contraints for some bioinformatics appliances (confidentiality) Bioinformatics metadata bio:tool annotate appliances with attributes related to bioinformatics tools help to select suitable bioinformatics appliances containing the required tools Integrated Web interface Native cloud services Authentication Virtual machine management Persistent disk service Client CLI VM & virtual disks management filter bioinformatics appliances with bio:tool etc. IDB CEPH storage backend large scale and distributed storage reliable by replication high-througput IO single unified storage cluster for all interfaces: block, object and file system
13 Bioinformatics cloud appliances Bioinformatics appliances are usual virtual machines tools R FastA OMSSA X!tandem Galaxy Muscle BLAST ClustalW2 SSearch ARIA PeptideShaker TopHat BWA HMMer samtools Clustal Omega fastqc small : few GB, easy to convert in most virtualization formats Installed and preconfigured with bioinformatics tools Create new cloud services Linux system e.g. BLAST, Clustalw, ARIA, MEME, HMMer, TopHat, BWA, Samtools, etc. Virtual Machines Bioinformatics Marketplace Recorded in a marketplace Structures Sequences Proteomics + Galaxy... devoted to bioinformatics
14 Run bioinformatics appliances Bioinformatics marketplace Store life science VMs and a catalogue both a virtual machines repository Help users to select the appropriate VM for their analysis Bioinformatics Marketplace BI Structures Galaxy Sequences Proteomics B A data public data UNIPROT EMBL Genomes PDB PROSITE Move cloud virtual machines Analyze data tools VM: BLAST, ClustalW2, etc.... (2) IDB Cloud (3) Select tools Scientists can filter (1) the appliances through a Web interface to identify and launch (2) the appropriate ones. (1) Use tools (3) Scientists have access to their own cloud resources through web portal, remote virtual desktop or SSH. Filter images with metadata related to bioinformatics attribute <bio:tool> in VM manifests scientists can select the appropriate appliance according to the tools required for their analyses e.g. the BLAST tool Deploy on several clouds
15 Appliances page List of existing appliances Appliance description and doc Direct launch Power button
16 Filter appliances with tools description
17 A cloud driven through a simple web interface
18 Connection to VMs ssh web NX (opennx)
19 Cloud Storage for Biological Data Upload your data Public Data sources Genomes EMBL PDB UNIPROT PROSITE shared (NFS ro) BLAST, Clustal, etc. PaaS sftp/http/s3 IaaS launch jobs ssh Shared FS j. doe e. martin you chb v. disk sambaa Portal Master & Storage VM ARIA Workers VM CNS Bioinformatics Cloud Identity Mgmt cg User data S3 sftp/http/s3 Get your results
20 Exchanging data with VMs CLI scp/sftp GUI: Cyberduck, Transmit Integrated: Galaxy Ecole IN2P3 Cloud 2014, 1er juillet let 2014, Lyon
21 Moving VMs vs Data NGS IMG PRO NGS Biological platform (Genomics, IMaGing, PROteomics...) BI C Bioinformatics center Cloud resources Scientists C BI NGS data PRO VM BI VM C VMs PRO IFB Bioinfor- matics marketplace & VMs repository data VM BI IMG PRO NGS PRO C IMG data BI C C
22 Case 1: Standard Bioinformatics node appliance Biocompute Use your own instance(s) With pre-installed standard bioinformatics tools BLAST, FastA, SSearch,HMM,... ClustalW2, Clustal-Omega, Muscle,.. Bowtie(2), BWA, samtools,... MEME, R, etc. Connected to public reference data Uniprot, EMBL, genomes, PDB, etc. Automaticaly shared to the VMs Cluster mode turn several instances in a single virtual cluster shared file system batch scheduling Ecole IN2P3 Cloud 2014, 1er juillet 2014, Lyon
23 Case 2: Cloud Galaxy portal for NGS analyses Analyse NGS data portal Galaxy is widely used in the community connected to large public data: sequences and indexes large user data (GBs) Preserve workflows and results (cloud virtual disk) Different domain-specific instances (RNAseq, CHIPseq, etc.) For training: create a special instance derived from the main instance but with dedicated datasets Help the integration of monthly updates
24 w Tools s hares nt text bases s.ib bcp.fr"\ Run your Galaxy Portal on Cloud Bioinformatics Marketplace Sequence Structure NGS Galaxy ARIA ( ) Stay Connected to Standard Data & Tools ay Connected to Stan Launch Instances User data: upload datafiles or attache pdisk Reference databases: mount biodata server shares Tools: used pre-installed ones or or install install yours yours User data: upload datafiles Reference databases: moun Tools: used pre-installed on Run instances with bioinformatics context stratus-run-instance\ with bi i nformatics ormatics co co --context="bio_db_server=idb-databases.ibcp.fr"\ MNb7A8XGnNwwJAaZUBj5fwYVoFf Shared FS BLAST, Clustal, etc. launch jobs ssh IaaS Master & Storage VM ARIA Workers VM CNS Portal IBCP's Cloud Resources PaaS
25 Connect to your Galaxy Portal
26 for development & operations (DevOps): different In collaboration with the French Institute of Bioinformatics Advantages of Cloud for Galaxy Added value of cloud for Galaxy, for scientific analyses: user-specific resources, isolated, different domain-specific instances (RNAseq, CHIPseq, Variants,...) for training: create a special instance derived from the main but with dedicated datasets Examples of training with Galaxy: Mai 2013 Galaxy Lille, Nov 2013 Aviesan Bioinformatics School For integration of monthly updates versions at the same time Bioinformatics cloud (e.g. IDB) Tightly connected to existing bioinformatics resources Linked to public biological databases
27 Case 3: Proteomics virtual desktop Motivation Collaboration with a mass spectroscopy platform Running out of space on their local resources Protein identification tools Mass experimental data Reference databases : nr, Swiss-Prot Reference screening tools: OMSSA, X!Tandem User interface Remote Virtual Desktop (NX) Reference GUIs SearchGUI PeptidShaker source: PeptideShaker esha site Ecole IN2P3 Cloud 2014, 1er juillet let 2014, Lyon
28 Case 4: Hadoop for Life Science Provide turnkey virtual machine with preconfigured mapreduce framework Accelerate biological bigdata analysis Hadoop MapReduce Appliances (2) HDFS with integrated bioinformatics tools Example of sequence similarity searching FastA & SSearch deploy database of sequences in HDFS compare each structure to others provide standard hadoop: including mapreduce and Databank Developed in the context of the French project MapReduce, ANR ARPEGE FastAMR splits the databank into subsets and puts them in the DFS along with the sequences file FastAMR subset #01 FastA #01 subset #02 Mappers FastA # Each mapper send the score and sequences to reducers Reducers Results score sequence score sequence... Users run the FastAMR script with its sequences and the databank User's Sequences Each mapper runs a FastA program on a part of the databank Reducers copy the best scores of the whole experiment in the DFS
29 Cloud it be done? IFB s cloud for life science simplify access to biological data and tools integrate tools and pipelines in turnkey cloud appliances is tightly connected to existing bioinformatics resources, e.g. public reference data sources 14 bioinformatics appliances: standard compute nodes, proteomics virtual desktop, Galaxy portal, structural biology +70 users from all IFB regional centers PRABI 16, APLIBIO 28, RENABI-NE 13, -GO 7, -SO 2, -GS 5 Bioinformatics marketplace store images related to life science help users to select the appropriate VM for their analysis BI tools BLAST FastA OMSSA ClustalW2 SSearch PeptideShaker ARIA BWA X!tandem HMMer TopHat samtools Galaxy Clustal Muscle fastqc Omega Create new cloud services Virtual Machines R + Linux system Bioinformatics Marketplace Bioinformatics Marketplace Structures Galaxy Galaxy Sequences... Proteomics Proteomics (2) B A data public data UNIPROT EMBL Genomes PDB PROSITE Move cloud virtual machines Analyze data tools VM: BLAST, ClustalW2, etc. IDB Cloud Use Scientists have access cloud resources throug Ecole IN2P3 Cloud 2014, 1er remote juillet virtual 2014, desktop Lyon o... (3) Selec Scientists can the appliances Web interface t and launch appropriate ones (1)
30 Perspectives Create bioinformatics appliances by the experts of the domains make them available to the scientists IFB established priorities: 5 scientific domains Microbial Bioinformatics Evolutionary bioinformatics Plant bioinformatics Structural Biology NGS data processing and 3 technical pilots Appliances interoperability between different cloud infrastructures Distributing biological data with distributed nosql engine Live remote cloud processing of sequencing data
31 Questions? Clément Gauthey (IDB-IBCP) StratusLab members IDB s co-funding by European Community's Seventh Framework Programme (INFSO-RI ) French National Research Agency's Arpege Programme (ANR-10-SEGI-001). IFB s funding by French program PIA INBS 2012 Acknowledgments
Bioinformatique sur Cloud Cas d usage avec le portail Galaxy
Bioinformatique sur Cloud Cas d usage avec le portail Galaxy Christophe Blanchet Institute of Biology and Chemistry of Proteins Head of Service Infrastructure for Biology - IDB CNRS-IBCP FR3302 - LYON
Cloud Ready for Bioinformatics?
IDB acknowledges co-funding by the European Community's Seventh Framework Programme (INFSO-RI-261552) and the French National Research Agency's Arpege Programme (ANR-10-SEGI-001) Cloud Ready for Bioinformatics?
Sequencing data. And other experimental data. EMBL-EBI data resources growth
Sequencing Institut Français de Bioinformatique, Un loud pour les Sciences du Vivant source: www.genomesonline.org source: www.politigenomics.com/next-generation- hristophe Blanchet Institut Français de
Une e-infrastructure nationale en bioinformatique
Une e-infrastructure nationale en bioinformatique Christophe BLANCHET Institut Français de Bioinformatique - IFB French Institute of Bioinformatics - ELIXIR-FR CNRS UMS3601 - Gif-sur-Yvette - FRANCE JDEV
IFB s e-infrastructure
IFB s e-infrastructure Christophe Blanchet Institut Français de Bioinformatique - IFB French Institute of Bioinformatics - ELIXIR-FR CNRS UMS3601 - Gif-sur-Yvette - FRANCE Life Sciences Platforms in France
Le cloud IFB et son instance Galaxy
Le cloud IFB et son instance Galaxy Christophe BLANCHET Institut Français de Bioinformatique - IFB French Institute of Bioinformatics - ELIXIR-FR CNRS UMS3601 - Gif-sur-Yvette - FRANCE Ecole Bioinformatique
Le cloud IFB et son instance Galaxy
Le cloud IFB et son instance Galaxy Christophe BLANCHET Institut Français de Bioinformatique - IFB French Institute of Bioinformatics - ELIXIR-FR CNRS UMS3601 - Gif-sur-Yvette - FRANCE Ecole Bioinformatique
StratusLab project. Standards, Interoperability and Asset Exploitation. Vangelis Floros, GRNET
StratusLab project Standards, Interoperability and Asset Exploitation Vangelis Floros, GRNET EGI Technical Forum 2011 19-22 September 2011, Lyon, France StratusLab is co-funded by the European Community
Building Storage Service in a Private Cloud
Building Storage Service in a Private Cloud Sateesh Potturu & Deepak Vasudevan Wipro Technologies Abstract Storage in a private cloud is the storage that sits within a particular enterprise security domain
OpenNebula Open Souce Solution for DC Virtualization. C12G Labs. Online Webinar
OpenNebula Open Souce Solution for DC Virtualization C12G Labs Online Webinar What is OpenNebula? Multi-tenancy, Elasticity and Automatic Provision on Virtualized Environments I m using virtualization/cloud,
Open Cloud System. (Integration of Eucalyptus, Hadoop and AppScale into deployment of University Private Cloud)
Open Cloud System (Integration of Eucalyptus, Hadoop and into deployment of University Private Cloud) Thinn Thu Naing University of Computer Studies, Yangon 25 th October 2011 Open Cloud System University
Alternative Deployment Models for Cloud Computing in HPC Applications. Society of HPC Professionals November 9, 2011 Steve Hebert, Nimbix
Alternative Deployment Models for Cloud Computing in HPC Applications Society of HPC Professionals November 9, 2011 Steve Hebert, Nimbix The case for Cloud in HPC Build it in house Assemble in the cloud?
SURFsara HPC Cloud Workshop
SURFsara HPC Cloud Workshop doc.hpccloud.surfsara.nl UvA workshop 2016-01-25 UvA HPC Course Jan 2016 Anatoli Danezi, Markus van Dijk [email protected] Agenda Introduction and Overview (current
Cloud Computing Solutions for Genomics Across Geographic, Institutional and Economic Barriers
Cloud Computing Solutions for Genomics Across Geographic, Institutional and Economic Barriers Ntinos Krampis Asst. Professor J. Craig Venter Institute [email protected] http://www.jcvi.org/cms/about/bios/kkrampis/
Getting Started Hacking on OpenNebula
LinuxTag 2013 Berlin, Germany, May 22nd Getting Started Hacking on OpenNebula Carlos Martín Project Engineer Acknowledgments The research leading to these results has received funding from Comunidad de
Assignment # 1 (Cloud Computing Security)
Assignment # 1 (Cloud Computing Security) Group Members: Abdullah Abid Zeeshan Qaiser M. Umar Hayat Table of Contents Windows Azure Introduction... 4 Windows Azure Services... 4 1. Compute... 4 a) Virtual
Eoulsan Analyse du séquençage à haut débit dans le cloud et sur la grille
Eoulsan Analyse du séquençage à haut débit dans le cloud et sur la grille Journées SUCCES Stéphane Le Crom (UPMC IBENS) [email protected] Paris November 2013 The Sanger DNA sequencing method Sequencing
Private Cloud Database Consolidation with Exadata. Nitin Vengurlekar Technical Director/Cloud Evangelist
Private Cloud Database Consolidation with Exadata Nitin Vengurlekar Technical Director/Cloud Evangelist Agenda Private Cloud vs. Public Cloud Business Drivers for Private Cloud Database Architectures for
OpenNebula Open Souce Solution for DC Virtualization
13 th LSM 2012 7 th -12 th July, Geneva OpenNebula Open Souce Solution for DC Virtualization Constantino Vázquez Blanco OpenNebula.org What is OpenNebula? Multi-tenancy, Elasticity and Automatic Provision
Boas Betzler. Planet. Globally Distributed IaaS Platform Examples AWS and SoftLayer. November 9, 2015. 20014 IBM Corporation
Boas Betzler Cloud IBM Distinguished Computing Engineer for a Smarter Planet Globally Distributed IaaS Platform Examples AWS and SoftLayer November 9, 2015 20014 IBM Corporation Building Data Centers The
OpenNebula Open Souce Solution for DC Virtualization
OSDC 2012 25 th April, Nürnberg OpenNebula Open Souce Solution for DC Virtualization Constantino Vázquez Blanco OpenNebula.org What is OpenNebula? Multi-tenancy, Elasticity and Automatic Provision on Virtualized
Cloud BioLinux: Pre-configured and On-demand Bioinformatics Computing for the Genomics Community
Cloud BioLinux: Pre-configured and On-demand Bioinformatics Computing for the Genomics Community Ntinos Krampis Asst. Professor J. Craig Venter Institute [email protected] http://www.jcvi.org/cms/about/bios/kkrampis/
Cloud BioLinux: Pre-configured and On-demand Bioinformatics Computing for the Genomics Community
Cloud BioLinux: Pre-configured and On-demand Bioinformatics Computing for the Genomics Community Ntinos Krampis Asst. Professor J. Craig Venter Institute [email protected] http://www.jcvi.org/cms/about/bios/kkrampis/
Big Data in BioMedical Sciences. Steven Newhouse, Head of Technical Services, EMBL-EBI
Big Data in BioMedical Sciences Steven Newhouse, Head of Technical Services, EMBL-EBI Big Data for BioMedical Sciences EMBL-EBI: What we do and why? Challenges & Opportunities Infrastructure Requirements
SUSE Cloud 2.0. Pete Chadwick. Douglas Jarvis. Senior Product Manager [email protected]. Product Marketing Manager djarvis@suse.
SUSE Cloud 2.0 Pete Chadwick Douglas Jarvis Senior Product Manager [email protected] Product Marketing Manager [email protected] SUSE Cloud SUSE Cloud is an open source software solution based on OpenStack
Towards a galaxy.prabi.fr
Towards a galaxy.prabi.fr IFB- galaxy Day 04/12/2013 Navra5l V., PhD, UCBL [email protected] www.prabi.fr One among the six IFB regional nodes Region: Rhône- Alpes Director: Guy Perrière 11 Research Team,
OpenNebula Leading Innovation in Cloud Computing Management
OW2 Annual Conference 2010 Paris, November 24th, 2010 OpenNebula Leading Innovation in Cloud Computing Management Ignacio M. Llorente DSA-Research.org Distributed Systems Architecture Research Group Universidad
The OpenNebula Cloud Platform for Data Center Virtualization
CloudOpen 2012 San Diego, USA, August 29th, 2012 The OpenNebula Cloud Platform for Data Center Virtualization Carlos Martín Project Engineer Acknowledgments The research leading to these results has received
Hadoopizer : a cloud environment for bioinformatics data analysis
Hadoopizer : a cloud environment for bioinformatics data analysis Anthony Bretaudeau (1), Olivier Sallou (2), Olivier Collin (3) (1) [email protected], INRIA/Irisa, Campus de Beaulieu, 35042,
SURFsara HPC Cloud Workshop
SURFsara HPC Cloud Workshop www.cloud.sara.nl Tutorial 2014-06-11 UvA HPC and Big Data Course June 2014 Anatoli Danezi, Markus van Dijk [email protected] Agenda Introduction and Overview (current
Course 20533: Implementing Microsoft Azure Infrastructure Solutions
Course 20533: Implementing Microsoft Azure Infrastructure Solutions Overview About this course This course is aimed at experienced IT Professionals who currently administer their on-premises infrastructure.
Using SUSE Cloud to Orchestrate Multiple Hypervisors and Storage at ADP
Using SUSE Cloud to Orchestrate Multiple Hypervisors and Storage at ADP Agenda ADP Cloud Vision and Requirements Introduction to SUSE Cloud Overview Whats New VMWare intergration HyperV intergration ADP
Three data delivery cases for EMBL- EBI s Embassy. Guy Cochrane www.ebi.ac.uk
Three data delivery cases for EMBL- EBI s Embassy Guy Cochrane www.ebi.ac.uk EMBL European Bioinformatics Institute Genes, genomes & variation European Nucleotide Archive 1000 Genomes Ensembl Ensembl Genomes
Cloud Computing Architecture with OpenNebula HPC Cloud Use Cases
NASA Ames NASA Advanced Supercomputing (NAS) Division California, May 24th, 2012 Cloud Computing Architecture with OpenNebula HPC Cloud Use Cases Ignacio M. Llorente Project Director OpenNebula Project.
Solution for private cloud computing
The CC1 system Solution for private cloud computing 1 Outline What is CC1? Features Technical details Use cases By scientist By HEP experiment System requirements and installation How to get it? 2 What
Implementing Microsoft Azure Infrastructure Solutions
Course Code: M20533 Vendor: Microsoft Course Overview Duration: 5 RRP: 2,025 Implementing Microsoft Azure Infrastructure Solutions Overview This course is aimed at experienced IT Professionals who currently
A curated Domain centric shared Docker registry linked to the Galaxy toolshed
A curated Domain centric shared Docker registry linked to the Galaxy toolshed François Moreews 1, Olivier Sallou 2, Yvan le Bras 2, Marie Grosjean 3, Cyril Monjeaud 2, Thomas Darde 4, Olivier Collin 2,
Virtualizing Apache Hadoop. June, 2012
June, 2012 Table of Contents EXECUTIVE SUMMARY... 3 INTRODUCTION... 3 VIRTUALIZING APACHE HADOOP... 4 INTRODUCTION TO VSPHERE TM... 4 USE CASES AND ADVANTAGES OF VIRTUALIZING HADOOP... 4 MYTHS ABOUT RUNNING
SOFTWARE DEFINED SOLUTIONS JEUDI 19 NOVEMBRE 2015. Nicolas EHRMAN Sr Presales SDS
SOFTWARE DEFINED SOLUTIONS JEUDI 19 NOVEMBRE 2015 Nicolas EHRMAN Sr Presales SDS Transform your Datacenter to the next level with EMC SDS EMC SOFTWARE DEFINED STORAGE, A SUCCESS STORY 5 ÈME ÉDITEUR MONDIAL
Cloud OS. Philip Meyer Partner Technology Specialist - Hosting
Cloud OS Philip Meyer Partner Technology Specialist - Hosting The New Era of Hosting 52.4% 68% 62.5% Customers Cloud Applications Grow their business or realign to new company strategy Plan to adopt hybrid
CloudStack and Big Data. Sebastien Goasguen @sebgoa May 22nd 2013 LinuxTag, Berlin
CloudStack and Big Data Sebastien Goasguen @sebgoa May 22nd 2013 LinuxTag, Berlin Google trends Start of Clouds Cloud computing trending down, while Big Data is booming. Virtualization BigData on the Trigger
Managing and Conducting Biomedical Research on the Cloud Prasad Patil
Managing and Conducting Biomedical Research on the Cloud Prasad Patil Laboratory for Personalized Medicine Center for Biomedical Informatics Harvard Medical School SaaS & PaaS gmail google docs app engine
Linux/Open Source and Cloud computing Wim Coekaerts Senior Vice President, Linux and Virtualization Engineering
Linux/Open Source and Cloud computing Wim Coekaerts Senior Vice President, Linux and Virtualization Engineering NIST Definition of Cloud Computing Cloud computing is a model for enabling convenient, on-demand
Integrated Rule-based Data Management System for Genome Sequencing Data
Integrated Rule-based Data Management System for Genome Sequencing Data A Research Data Management (RDM) Green Shoots Pilots Project Report by Michael Mueller, Simon Burbidge, Steven Lawlor and Jorge Ferrer
How To Run A Cloud Server On A Server Farm (Cloud)
StratusLab: Darn Simple Cloud Charles (Cal) Loomis (CNRS/LAL & SixSq Sàrl) FOSDEM 13: Cloud Devroom (3 February 2013) StratusLab What is it? Complete Infrastructure as a Service (IaaS) cloud distribution
Implementing Microsoft Azure Infrastructure Solutions 20533B; 5 Days, Instructor-led
Implementing Microsoft Azure Infrastructure Solutions 20533B; 5 Days, Instructor-led Course Description This course is aimed at experienced IT Professionals who currently administer their on-premises infrastructure.
Planning, Provisioning and Deploying Enterprise Clouds with Oracle Enterprise Manager 12c Kevin Patterson, Principal Sales Consultant, Enterprise
Planning, Provisioning and Deploying Enterprise Clouds with Oracle Enterprise Manager 12c Kevin Patterson, Principal Sales Consultant, Enterprise Manager Oracle NIST Definition of Cloud Computing Cloud
Course 20533B: Implementing Microsoft Azure Infrastructure Solutions
Course 20533B: Implementing Microsoft Azure Infrastructure Solutions Sales 406/256-5700 Support 406/252-4959 Fax 406/256-0201 Evergreen Center North 1501 14 th St West, Suite 201 Billings, MT 59102 Course
<Insert Picture Here> Private Cloud with Fusion Middleware
Private Cloud with Fusion Middleware Duško Vukmanović Principal Sales Consultant, Oracle [email protected] The following is intended to outline our general product direction.
Scientific and Technical Applications as a Service in the Cloud
Scientific and Technical Applications as a Service in the Cloud University of Bern, 28.11.2011 adapted version Wibke Sudholt CloudBroker GmbH Technoparkstrasse 1, CH-8005 Zurich, Switzerland Phone: +41
Cloud-Based Big Data Analytics in Bioinformatics
Cloud-Based Big Data Analytics in Bioinformatics Presented By Cephas Mawere Harare Institute of Technology, Zimbabwe 1 Introduction 2 Big Data Analytics Big Data are a collection of data sets so large
Steven Newhouse, Head of Technical Services
Challenges at EMBL-EBI Steven Newhouse, Head of Technical Services European Bioinformatics Institute Outstation of the European Molecular Biology Laboratory International organisation created by treaty
<Insert Picture Here> Cloud Computing Strategy
Cloud Computing Strategy Rex Wang VP Infrastructure and Management The following is intended to outline our general product direction. It is intended for information purposes only,
wu.cloud: Insights Gained from Operating a Private Cloud System
wu.cloud: Insights Gained from Operating a Private Cloud System Stefan Theußl, Institute for Statistics and Mathematics WU Wirtschaftsuniversität Wien March 23, 2011 1 / 14 Introduction In statistics we
Microsoft Research Windows Azure for Research Training
Copyright 2013 Microsoft Corporation. All rights reserved. Except where otherwise noted, these materials are licensed under the terms of the Apache License, Version 2.0. You may use it according to the
Open source Google-style large scale data analysis with Hadoop
Open source Google-style large scale data analysis with Hadoop Ioannis Konstantinou Email: [email protected] Web: http://www.cslab.ntua.gr/~ikons Computing Systems Laboratory School of Electrical
With Red Hat Enterprise Virtualization, you can: Take advantage of existing people skills and investments
RED HAT ENTERPRISE VIRTUALIZATION DATASHEET RED HAT ENTERPRISE VIRTUALIZATION AT A GLANCE Provides a complete end-toend enterprise virtualization solution for servers and desktop Provides an on-ramp to
Microsoft Research Microsoft Azure for Research Training
Copyright 2014 Microsoft Corporation. All rights reserved. Except where otherwise noted, these materials are licensed under the terms of the Apache License, Version 2.0. You may use it according to the
Building Storage as a Service with OpenStack. Greg Elkinbard Senior Technical Director
Building Storage as a Service with OpenStack Greg Elkinbard Senior Technical Director MIRANTIS 2012 PAGE 1 About the Presenter Greg Elkinbard Senior Technical Director at Mirantis Builds on demand IaaS
OpenNebula The Open Source Solution for Data Center Virtualization
LinuxTag April 23rd 2012, Berlin OpenNebula The Open Source Solution for Data Center Virtualization Hector Sanjuan OpenNebula.org 1 What is OpenNebula? Multi-tenancy, Elasticity and Automatic Provision
Introducing ScienceCloud
Zentrale Informatik Introducing ScienceCloud Sergio Maffioletti IS/Cloud S3IT: Service and Support for Science IT Zurich, 10.03.2015 What are we going to talk about today? 1. Why are we building ScienceCloud?
Leveraging BlobSeer to boost up the deployment and execution of Hadoop applications in Nimbus cloud environments on Grid 5000
Leveraging BlobSeer to boost up the deployment and execution of Hadoop applications in Nimbus cloud environments on Grid 5000 Alexandra Carpen-Amarie Diana Moise Bogdan Nicolae KerData Team, INRIA Outline
Euro-BioImaging European Research Infrastructure for Imaging Technologies in Biological and Biomedical Sciences
Euro-BioImaging European Research Infrastructure for Imaging Technologies in Biological and Biomedical Sciences WP11 Data Storage and Analysis Task 11.1 Coordination Deliverable 11.2 Community Needs of
DATA MANAGEMENT PLAN IN THE REAL LIFE SCIENCES
DATA MANAGEMENT PLAN IN THE REAL LIFE SCIENCES Yvan Le Bras Cyril Monjeaud Olivier Collin Jacques Nicolas CNRS UMR 6074 IRISA-INRIA Context Now : Genomics : Next Generation Sequencing Now : Proteomics
New solutions for Big Data Analysis and Visualization
New solutions for Big Data Analysis and Visualization From HPC to cloud-based solutions Barcelona, February 2013 Nacho Medina [email protected] http://bioinfo.cipf.es/imedina Head of the Computational Biology
BlobSeer: Towards efficient data storage management on large-scale, distributed systems
: Towards efficient data storage management on large-scale, distributed systems Bogdan Nicolae University of Rennes 1, France KerData Team, INRIA Rennes Bretagne-Atlantique PhD Advisors: Gabriel Antoniu
A Service for Data-Intensive Computations on Virtual Clusters
A Service for Data-Intensive Computations on Virtual Clusters Executing Preservation Strategies at Scale Rainer Schmidt, Christian Sadilek, and Ross King [email protected] Planets Project Permanent
OpenCB a next generation big data analytics and visualisation platform for the Omics revolution
OpenCB a next generation big data analytics and visualisation platform for the Omics revolution Development at the University of Cambridge - Closing the Omics / Moore s law gap with Dell & Intel Ignacio
Experiences and challenges in the development of the JASMIN cloud service for the environmental science community
JASMIN (STFC/Stephen Kill) Experiences and challenges in the development of the JASMIN cloud service for the environmental science community ECMWF Visualisa-on in Meteorology Week, 28 September 2015 Philip
<Insert Picture Here> Infrastructure as a Service (IaaS) Cloud Computing for Enterprises
Infrastructure as a Service (IaaS) Cloud Computing for Enterprises Speaker Title The following is intended to outline our general product direction. It is intended for information
ENABLING DATA TRANSFER MANAGEMENT AND SHARING IN THE ERA OF GENOMIC MEDICINE. October 2013
ENABLING DATA TRANSFER MANAGEMENT AND SHARING IN THE ERA OF GENOMIC MEDICINE October 2013 Introduction As sequencing technologies continue to evolve and genomic data makes its way into clinical use and
Open Source Cloud Computing Management with OpenNebula
CloudCamp Campus Party July 2011, Valencia Open Source Cloud Computing Management with OpenNebula Javier Fontán Muiños dsa-research.org Distributed Systems Architecture Research Group Universidad Complutense
Scalable Cloud Computing Solutions for Next Generation Sequencing Data
Scalable Cloud Computing Solutions for Next Generation Sequencing Data Matti Niemenmaa 1, Aleksi Kallio 2, André Schumacher 1, Petri Klemelä 2, Eija Korpelainen 2, and Keijo Heljanko 1 1 Department of
Building a BI Solution in the Cloud
Building a BI Solution in the Cloud Stacia Varga, Principal Consultant Email: [email protected] Twitter: @_StaciaV_ 2 SQLSaturday #467 Sponsors Stacia (Misner) Varga Over 30 years of IT experience,
Traditional v/s CONVRGD
Traditional v/s CONVRGD Traditional Virtualization Stack Converged Virtualization Infrastructure with HCE/HSE Data protection software applications PDU Backup Servers + Virtualization Storage Switch HA
Big Data and Cloud Computing for GHRSST
Big Data and Cloud Computing for GHRSST Jean-Francois Piollé ([email protected]) Frédéric Paul, Olivier Archer CERSAT / Institut Français de Recherche pour l Exploitation de la Mer Facing data deluge
Development of Bio-Cloud Service for Genomic Analysis Based on Virtual
Development of Bio-Cloud Service for Genomic Analysis Based on Virtual Infrastructure 1 Jung-Ho Um, 2 Sang Bae Park, 3 Hoon Choi, 4 Hanmin Jung 1, First Author Korea Institute of Science and Technology
Hadoop IST 734 SS CHUNG
Hadoop IST 734 SS CHUNG Introduction What is Big Data?? Bulk Amount Unstructured Lots of Applications which need to handle huge amount of data (in terms of 500+ TB per day) If a regular machine need to
Large-scale Research Data Management and Analysis Using Globus Services. Ravi Madduri Argonne National Lab University of Chicago @madduri
Large-scale Research Data Management and Analysis Using Globus Services Ravi Madduri Argonne National Lab University of Chicago @madduri Outline Who we are Challenges in Big Data Management and Analysis
Deploying Business Virtual Appliances on Open Source Cloud Computing
International Journal of Computer Science and Telecommunications [Volume 3, Issue 4, April 2012] 26 ISSN 2047-3338 Deploying Business Virtual Appliances on Open Source Cloud Computing Tran Van Lang 1 and
Building Bioinformatics Capacity in Africa. Nicky Mulder CBIO Group, UCT
Building Bioinformatics Capacity in Africa Nicky Mulder CBIO Group, UCT Outline What is bioinformatics? Why do we need IT infrastructure? What e-infrastructure does it require? How we are developing this
UGENE Quick Start Guide
Quick Start Guide This document contains a quick introduction to UGENE. For more detailed information, you can find the UGENE User Manual and other special manuals in project website: http://ugene.unipro.ru.
Building an Internal Cloud that is ready for the external Cloud
Building an Internal Cloud that is ready for the external Cloud Luca ZERMINIANI, Senior Systems Engineer, VMware Italy Athens, February 2010 2009 VMware Inc. All rights reserved Agenda How virtualization
1 Copyright 2011, Oracle and/or its affiliates. All rights reserved. Insert Information Protection Policy Classification from Slide 7
1 Copyright 2011, Oracle and/or its affiliates. All rights reserved. Insert Information Protection Policy Classification from Slide 7 Oracle Virtual Machine Server pre x86 Marián Kuna Technology Sales
