A curated Domain centric shared Docker registry linked to the Galaxy toolshed



Similar documents
Docker : devops, shared registries, HPC and emerging use cases. François Moreews & Olivier Sallou

Intro to Docker and Containers

e-biogenouest : The Tools

DATA MANAGEMENT PLAN IN THE REAL LIFE SCIENCES

Linstantiation of applications. Docker accelerate

Building a Continuous Integration Pipeline with Docker

The Virtualization Practice

Hadoopizer : a cloud environment for bioinformatics data analysis

Cisco Application-Centric Infrastructure (ACI) and Linux Containers

Platform as a Service and Container Clouds

RED HAT CONTAINER STRATEGY

Building a Kubernetes Cluster with Ansible. Patrick Galbraith, ATG Cloud Computing Expo, NYC, May 2016

The Definitive Guide To Docker Containers

STRATEGIC WHITE PAPER. The next step in server virtualization: How containers are changing the cloud and application landscape

Intro to Docker for CMS

Integrating Containers into Workflows: A Case Study Using Makeflow, Work Queue, and Docker

Linux A first-class citizen in Windows Azure. Bruno Terkaly bterkaly@microsoft.com Principal Software Engineer Mobile/Cloud/Startup/Enterprise

Private Cloud Management

THE EUCALYPTUS OPEN-SOURCE PRIVATE CLOUD

Experiences and challenges in the development of the JASMIN cloud service for the environmental science community

ISLET: Jon Schipp, Ohio Linux Fest An Attempt to Improve Linux-based Software Training

Cloud Computing Architecture with OpenNebula HPC Cloud Use Cases

the CONTAINER COLORING BOOK "Who's afraid of the big bad wolf?" MÁIRÍN DUFFY DAN WALSH illustrated by written by

White Paper Server. SUSE Linux Enterprise Server 12 Modules

Modern App Architecture for the Enterprise Delivering agility, portability and control with Docker Containers as a Service (CaaS)

UTILIZING THE PORTABILITY OF DOCKER TO DESIGN A PORTABLE AND SCALABLE CONTINUOUS INTEGRATION STACK

How Bigtop Leveraged Docker for Build Automation and One-Click Hadoop Provisioning

OpenShift. Marek Jelen, OpenShift, Red Hat

Cloud BioLinux: Pre-configured and On-demand Bioinformatics Computing for the Genomics Community

Cross-Platform ASP.NET 5 For the Cloud. Anthony Sneed

Containerisation and the PaaS Cloud

Benchmark Report: Univa Grid Engine, Nextflow, and Docker for running Genomic Analysis Workflows

Container Clusters on OpenStack

Modern Application Architecture for the Enterprise

Cloud Ready for Bioinformatics?

Introduction to Arvados. A Curoverse White Paper

Containerization and the PaaS Cloud

Do Containers fully 'contain' security issues? A closer look at Docker and Warden. By Farshad Abasi,

Use Cases for Docker in Enterprise Linux Environment CloudOpen North America, 2014 Linda Wang Sr. Software Engineering Manager Red Hat, Inc.

Chris Rosen, Technical Product Manager for IBM Containers, Lin Sun, Senior Software Engineer for IBM Containers,

Cloud BioLinux: Pre-configured and On-demand Bioinformatics Computing for the Genomics Community

Distributed Scheduling with Apache Mesos in the Cloud. PhillyETE - April, 2015 Diptanu Gon

Implementing Container-based Virtualization in a Hybrid Cloud

Azul pitches Docker as alternative to virtualization for heavy-duty Java applications

DOCLITE: DOCKER CONTAINER-BASED LIGHTWEIGHT BENCHMARKING ON THE CLOUD

How to Do/Evaluate Cloud Computing Research. Young Choon Lee

Sequencing data. And other experimental data. EMBL-EBI data resources growth

DevOps with Containers. for Microservices

Cloud Computing Solutions for Genomics Across Geographic, Institutional and Economic Barriers

What new with Informix Software as a Service and Bluemix? Brian Hughes IBM

Sistemi Operativi e Reti. Cloud Computing

Development of Bio-Cloud Service for Genomic Analysis Based on Virtual

Continuous Integration for XML and RDF Data

A lap around Team Foundation Server 2015 en Visual Studio 2015

WHITEPAPER INTRODUCTION TO CONTAINER SECURITY. Introduction to Container Security

Sriram Krishnan, Ph.D.

Virtualization and Containerization of Application Infrastructure: A Comparison

Kubernetes-Murano Integration in Mirantis OpenStack 7.0

Linux/Open Source and Cloud computing Wim Coekaerts Senior Vice President, Linux and Virtualization Engineering

Docker on OpenStack. August Author : Nitin Agarwal nitinagarwal3006@gmail.com. Supervisor(s) : Belmiro Moreira

Openshift for Continuous Integration

Building Bioinformatics Capacity in Africa. Nicky Mulder CBIO Group, UCT

Why the Datacenter needs an Operating System. Dr. Bernd Mathiske Senior Software Architect Mesosphere

12 Things about Oracle WebLogic Server

Deploying complex applications to Google Cloud. Olia Kerzhner

WHITE PAPER Redefining Monitoring for Today s Modern IT Infrastructures

Practical Solutions for Big Data Analytics

Bioinformatique sur Cloud Cas d usage avec le portail Galaxy

IFB s e-infrastructure

Index. Registry Report

Red Hat Cloud Ecosystem

Bringing Hadoop into Bioinformatics with Cloudgene and CloudMan

Integration of Virtualized Workernodes in Batch Queueing Systems The ViBatch Concept

Docker Containers. Marko Ambrož, Žiga Hudolin Ministrstvo za javno upravo DIREKTORAT ZA INFORMATIKO

Eoulsan Analyse du séquençage à haut débit dans le cloud et sur la grille

High-Availability Using Open Source Software

CONDOR CLUSTERS ON EC2

Towards a galaxy.prabi.fr

Transcription:

A curated Domain centric shared Docker registry linked to the Galaxy toolshed François Moreews 1, Olivier Sallou 2, Yvan le Bras 2, Marie Grosjean 3, Cyril Monjeaud 2, Thomas Darde 4, Olivier Collin 2, Christophe Blanchet 3 1 Genscale team -IRISA -Rennes, France 2 Genouest Bioinformatics facility INRIA/IRISA Rennes, France 3 French Institute of Bioinformatics CNRS IFB-Core UMS3601 Gif-sur-Yvette, France 4 INSERM U625 Rennes France

Docker : presentation Docker is an open-source engine to easily create lightweight, portable, self-sufficient containers from any application. The same container that a developer builds and test on a laptop can run at scale, in production, on Vms,[...], public clouds and more.

Docker : presentation Why using Docker containers to build, deploy and execute applications? Efficient (no virttualization) Isolation Build one time, execute anywhere, independently of the execution platform (laptops, clusters and clouds with linux kernels)...

Build : Dependencies & Dockerfile FROM ubuntu:12.0.4 ADD. /script ENTRYPOINT ["perl /script/analysis.pl"]

Run Docker docker run containeruniqueid k=31 i input.fastq o output.bam -The docker run command acts as a wrapper of the tool command line. -Host directories (input, output,work...) can be mounted inside the container.

Docker on the commercial Cloud Google Container Engine A container based cloud architecture With container-based computing, application developers can focus on their application code, instead of on deployments and integration into hosting environments.

Docker on academic HPC clusters Google Kubernetes : an open source technology for containers life cycle management. Docker Swarm : allows to create and access to a pool of Docker hosts. Genouest GO-DOCKER : a batch scheduler like SGE, submitting jobs in Docker containers on top of Swarm..

Bioinformatics tools benchmarks with Docker cami-challenge.org : Critical Assessment of Metagenomic Interpretation http://nucleotid.es : continuous, objective and reproducible evaluation of genome assemblers using docker containers bioboxes.org : interchangable bioinformatics software containers

Galaxy Docker integration Docker can be used in Galaxy to : manage tools dependencies : one tool, on Docker Distribute populated Galaxy Distribution related to one topic

Shared registries : Docker Hub Not structured Not curated Not domain centric Not community driven

Shared registries : BIOSHADOCK BIOSHADOCK An initiative of the French Bioinformatics Institut & the Genouest Bioinformatics Facility Goals : Federate bioinformatics tools deployment procedures for the IFB cloud infrastructure Generate customized Galaxy cloud instances on the fly. Docker image indexation (service registry & searches)

Shared registries : BIOSHADOCK

Shared registries : BIOSHADOCK

Shared registries : BIOSHADOCK BIOSHADOCK Focuses on the model on tool, one docker image Allows Dockerfile build Manages permissions (private/ public images) May integrate meta data to facilitate query and service registry searches One unique repositity for softwares with or without tool.xml => SAAS + CMD Integrated to Galaxy by redefining tools dependencies in a Toolshed

GO-DOCKER +SWARN

BIOSHADOCK TOOLSHED integration

BIOSHADOCK cluster /cloud integration using GO-DOCKER

Build it one time, use it as you want Command lines Galaxy tools & workflows BIOSHADOCK Other SAAS tools

References Genouest GO-DOCKER :http://www.genouest.org/?p=246 Google Kubernetes, Docker container cluster management : kubernetes.io BioShaDock, a Bioinformatics Shared Docker registry : http://dockerui.genouest.org GUGGO Galaxy Tooshed : http://toolshed.genouest.org Nucleotid.es, continuous, objective and reproducible evaluation of genome assemblers using docker containers : http://nucleotid.es ELIXIR Tools and Data Services Registry : https://elixir-registry.cbs.dtu.dk Bioboxes, a standard for creating interchangable bioinformatics software containers : http://bioboxes.org IFB academic Cloud : http://www.france-bioinformatique.fr/?q=en/core/einfrastructure-team/ifb-cloud