Experiences and challenges in the development of the JASMIN cloud service for the environmental science community



Similar documents
JASMIN Cloud ESGF and UV- CDAT Conference December 2014 STFC / Stephen Kill

The OPTIRAD Platform: Cloud-hosted IPython Notebooks for collaborative EO Data Analysis and Processing

Big Data and the Earth Observation and Climate Modelling Communities: JASMIN and CEMS

Unleash the IaaS Cloud About VMware vcloud Director and more VMUG.BE June 1 st 2012

SUSE Cloud 2.0. Pete Chadwick. Douglas Jarvis. Senior Product Manager Product Marketing Manager

Cloud Computing. Adam Barker

Data Centers and Cloud Computing

Virtualisation Cloud Computing at the RAL Tier 1. Ian Collier STFC RAL Tier 1 HEPiX, Bologna, 18 th April 2013

Data Centers and Cloud Computing. Data Centers

Boas Betzler. Planet. Globally Distributed IaaS Platform Examples AWS and SoftLayer. November 9, IBM Corporation

JASMIN: the Joint Analysis System for big data. Bryan Lawrence

OpenStack. Orgad Kimchi. Principal Software Engineer. Oracle ISV Engineering. 1 Copyright 2013, Oracle and/or its affiliates. All rights reserved.

Cloud 101. Mike Gangl, Caltech/JPL, 2015 California Institute of Technology. Government sponsorship acknowledged

2013 ovh.com. All rights reserved

Planning, Provisioning and Deploying Enterprise Clouds with Oracle Enterprise Manager 12c Kevin Patterson, Principal Sales Consultant, Enterprise

DISTRIBUTED SYSTEMS [COMP9243] Lecture 9a: Cloud Computing WHAT IS CLOUD COMPUTING? 2

CLOUD COMPUTING. When It's smarter to rent than to buy

Plan for Success with a Hybrid Cloud! Thanks to IBM Power Systems OpenStack SoftLayer and UrbanCode

See Appendix A for the complete definition which includes the five essential characteristics, three service models, and four deployment models.

Building an Internal Cloud that is ready for the external Cloud

IS PRIVATE CLOUD A UNICORN?

Virtualization & Cloud Computing (2W-VnCC)

Why Private Cloud? Nenad BUNCIC VPSI 29-JUNE-2015 EPFL, SI-EXHEB

Data Centers and Cloud Computing. Data Centers. MGHPCC Data Center. Inside a Data Center

Linux/Open Source and Cloud computing Wim Coekaerts Senior Vice President, Linux and Virtualization Engineering

Sistemi Operativi e Reti. Cloud Computing

Configuring and Managing a Private Cloud with Enterprise Manager 12c

Cloud Computing in the Enterprise An Overview. For INF 5890 IT & Management Ben Eaton 24/04/2013

Evaluation Methodology of Converged Cloud Environments

PRIVATE CLOUD PLATFORM OPTIONS. Stephen Lee CEO, ArkiTechs Inc.

VMware on VMware: Private Cloud Case Study Customer Presentation

Cloud Computing. Course: Designing and Implementing Service Oriented Business Processes

Clodoaldo Barrera Chief Technical Strategist IBM System Storage. Making a successful transition to Software Defined Storage

A Complete Open Cloud Storage, Virt, IaaS, PaaS. Dave Neary Open Source and Standards, Red Hat

With Red Hat Enterprise Virtualization, you can: Take advantage of existing people skills and investments

VMware Building Many Bridges to the Cloud

Red Hat enterprise virtualization 3.0 feature comparison

Cloud Computing. Chapter 1 Introducing Cloud Computing

Dell Cloud Services. Services

Virtualization and IaaS management

Installation Guide Avi Networks Cloud Application Delivery Platform Integration with Cisco Application Policy Infrastructure

Datacenters and Cloud Computing. Jia Rao Assistant Professor in CS

white paper Introduction to Cloud Computing The Future of Service Provider Networks

Content Distribution Management

Cloud Computing Architecture with OpenNebula HPC Cloud Use Cases

INTRODUCTION TO CLOUD MANAGEMENT

Cloud OS. Philip Meyer Partner Technology Specialist - Hosting

Essential Characteristics of Cloud Computing: On-Demand Self-Service Rapid Elasticity Location Independence Resource Pooling Measured Service

How To Use Arcgis For Free On A Gdb (For A Gis Server) For A Small Business

Data Centers and Cloud Computing. Data Centers

An Alternative to the VMware Tax...

vnebula Cloud. Made Easy. Introducing vnebula from Stream Networks. A simple, self-service cloud portal for our partner community.

Solution for private cloud computing

Building Out Your Cloud-Ready Solutions. Clark D. Richey, Jr., Principal Technologist, DoD

Cloud Optimize Your IT

Red Hat CloudForms for Cloud Management: Key Features & Roadmap

Using SUSE Cloud to Orchestrate Multiple Hypervisors and Storage at ADP

A Gentle Introduction to Cloud Computing

IBM EXAM QUESTIONS & ANSWERS

Scale Cloud Across the Enterprise

Cloud Computing. Karan Saxena * & Kritika Agarwal**

Equinox above the Cloud ( - Some call it Heaven!!)

Understand IBM Cloud Manager V4.2 for IBM z Systems

Fundamental Concepts and Models

STeP-IN SUMMIT June 18 21, 2013 at Bangalore, INDIA. Performance Testing of an IAAS Cloud Software (A CloudStack Use Case)

How To Protect Your Cloud From Attack

Cloud Models and Platforms

A Comparison of Clouds: Amazon Web Services, Windows Azure, Google Cloud Platform, VMWare and Others (Fall 2012)

Marco Mantegazza WebSphere Client Technical Professional Team IBM Software Group. Virtualization and Cloud

Cloud Computing in the Enterprise: A Question of Control.. And who has it. INF5210 Ben Eaton 12/11/2013

vcloud Air Simone Brunozzi, VP and Chief Technologist, vcloud 2014 VMware Inc. All rights reserved.

7 Ways OpenStack Enables Automation & Agility for KVM Environments

Assignment # 1 (Cloud Computing Security)

Why Cisco for Cloud? IT Service Delivery, Orchestration and Automation

Cloud Terminology Handbook

Intel IT Cloud 2013 and Beyond. Name Title Month, Day 2013


CompTIA Cloud+ 9318; 5 Days, Instructor-led

CEDA Storage. Dr Matt Pritchard. Centre for Environmental Data Archival (CEDA)

What is Cloud Computing? First, a little history. Demystifying Cloud Computing. Mainframe Era ( ) Workstation Era ( ) Xerox Star 1981!

CompTIA Cloud+ Course Content. Length: 5 Days. Who Should Attend:

Building Storage Service in a Private Cloud

Definitions. Hardware Full virtualization Para virtualization Hosted hypervisor Type I hypervisor. Native (bare metal) hypervisor Type II hypervisor

CHAPTER 8 CLOUD COMPUTING

How To Set Up Egnyte For Netapp Sync For Netapp

CloudCenter Full Lifecycle Management. An application-defined approach to deploying and managing applications in any datacenter or cloud environment

Deploying Your Application On Public Cloud

Certified Cloud Computing Professional Sample Material

Remote Voting Conference

Cluster, Grid, Cloud Concepts

SURFsara HPC Cloud Workshop

Transcription:

JASMIN (STFC/Stephen Kill) Experiences and challenges in the development of the JASMIN cloud service for the environmental science community ECMWF Visualisa-on in Meteorology Week, 28 September 2015 Philip Kershaw, CEDA Technical Manager Victoria BenneC, Jonathan Churchill, MarDn Juckes, Bryan Lawrence, CrisDna del Cano Novales, Sam Pepler, MaC Pritchard, MaC Pryor, Ag Stephens

IntroducDon What is cloud? How can it be applied for science applicadons Touch on reladonship with HPC, Grid PracDcal experience building a community cloud for JASMIN Example applicadon: IPython Notebook Challenges and next steps

The cloud and the hype Hype, ignorance, fear, misunderstanding and confladon with what has gone before (e.g. cloud = the Grid) An enabling and disrupdve technology Different communides and applicadon domains have reached different places on the curve Cloud for science a fast evoludon Magellan Report 2011 hcp://science.energy.gov/~/media/ascr/pdf/ program- documents/docs/ Magellan_Final_Report.pdf Nb. Technology driven development vs. user or science driven The Gartner hype curve e.g. Frankencloud J hcp://www.entrepreneur.com/ardcle/247140

Understand in order to exploit: a cloud definidon Cloud compudng is a model for enabling ubiquitous, convenient, on- demand network access to a shared pool of configurable compudng resources that can be rapidly provisioned and released with minimal management effort or service provider interacdon. NIST SP800-145 5 essendal characterisdcs On- demand self- service Broad network access Resource pooling Rapid elasdcity Measured service 3 service models IaaS (Infrastructure as a Service) PaaS (Plaform as a Service) SaaS (Sogware as a Service) 4 deployment models Private cloud Community cloud Public cloud Hybrid cloud

How can Cloud help the Research Community? Share of Cloud Resource Abstrac-on of Physical Resources Compute Network Storage OrganisaDon A OrganisaDon B Internal Services OrganisaDon C Address Big Data problem Bring users to the data PotenDal for near- limitless compute Long- tail research Provide data analysis downstream of primary producdon in a way that is customised for researchers e.g. virtualised desktops Resource pooling Divide up resources easily amongst different tenant research groups

JASMIN and Cloud <= Different slices thru the infrastructure => JASMIN Data Archive, Group workspaces and compute Bare Metal Compute Virtualisation High performance global file system Community Cloud Cloud Federation API Support a spectrum of usage models Cloud burst as demand requires External Cloud Providers Different classes of user A big data analysis facility for the environmental sciences 16 Petabytes high- performance disk 4000 compudng cores (HPC, VirtualisaDon) High- performance network design opdmised for i/o throughput VirtualisaDon Cloud - 200 socket Vmware vcloud licence (100 servers, 1600 cores) A combina0on of capabilides deliver what is needed

CEMS and JASMIN I: first steps with Cloud Deployed a private cloud based on VMware vcloud Director in common JASMIN- CEMS environment Goal: self- service configurable VMs for sciendfic analysis next to the data But, vcloud web portal too complicated for external users It couldn t be locked down sufficiently for deployment alongside the data archive and sensidve services Demand for processing with batch compute which didn t need the flexibility of cloud SoluDons 1) Build a custom cloud portal 2) The Inside- outside project

1) Custom Cloud Portal JASMIN Cloud Management Interface vcloud Client Implements VMware s vcloud API AbstracDon layer OpenStack Client AWS Client 0ther clients Building on top of vcloud Keep it simple: provide just enough funcdonality to provision and configure VMs Right tools for right users: sciendsts, developers and administrators AbstracDon from vcloud also provides a route to cloud federadon / bursdng Thin or thick client?

2) The Inside- Outside Project Create, an isolated part of the network inside JASMIN that would give users all the freedom they would have outside but the benefit of good bandwidth to the data archive. JASMIN Internal Network Managed Cloud - PaaS, SaaS Direct access to batch processing cluster Firewall Direct File System Access Firewall External Network inside JASMIN Unmanaged Cloud IaaS, PaaS, SaaS VM images Lotus Batch Compute Panasas storage Standard Remote Access Protocols gp, hcp, NetApp storage JASMIN

Managed Cloud - PaaS, SaaS Tenancy management and the VM deployment workflow JASMIN Cloud Portal External access Other JASMIN services NIC 0 NIC 1 Firewall + NAT Science Analysis Science VM Analysis VM Science Analysis VM Appliance template Catalogue Deploy appliance (VMs) Manage ssh access vcloud Director Web Service AuthenDcaDon with cloud services LDAP Puppet Master Request new VM setng guest customisa0on setngs Virtual OrganisaDon Panasas storage

Pooling RAM with EOS Cloud Desktop as a Service for environmental genomics hosted on JASMIN Cloud Exploits key characterisdcs of cloud: pooling and elasdcity The problem: bioinformadcs apps are memory hungry SoluDon: using virtualisadon seamlessly share access to a fat node with 512 GB RAM in the tenancy A token system allows users to boost their VM to use the addidonal memory for a metered period Credit Tim Booth, NERC CEH

Example Cloud- hosted ApplicaDon: IPython Notebook Provides Python kernels accessible via a web browser Sessions can be saved and shared Trivial access to parallel processing capabilides IPython.parallel New JupyterHub allows muld- user and notebook management Opportunity to open a middle ground in the applicadon space between batch compute / command line access: powerful but hard web portals: easy to use but less flexible OPTIRAD: ESA- funded project, land surface data assimiladon algorithms via notebooks on JASMIN cloud

JupyterHub, Swarm and Containers JupyterHub manages users and provision of notebooks JupyterHub Docker Container VM: Swarm pool 0 VM: Swarm pool 0 VM: Docker Swarm Container pool 0 IPython Notebook Kernel Docker Container Parallel Controller VM: slave 0 VM: Swarm VM: Parallel pool 0 Swarm Engine pool 0 Swarm IPython Notebook Kernel Parallel Controller Parallel Engine Swarm manages allocadon of containers for notebooks Notebooks and kernels in containers Nodes for parallel Processing

Conclusions Experiences from project delivery Importance of key skills Cross- cutng team spanning, developers, sys admins, DevOps EffecDve linking of hardware deployment, cloud middleware and applicadon layer development Documented repeatable processes for operadons Futures Challenge: how to bridge together different types of resource and service seamlessly whilst preserving performance and user segregadon Effect and influence of new technologies: containers, object stores

Further informadon JASMIN and CEDA: hcp://jasmin.ac.uk/ hcp://www.ceda.ac.uk JASMIN paper (Sept 2013) hcp://home.badc.rl.ac.uk/lawrence/stadc/2013/10/14/ LawEA13_Jasmin.pdf Cloud paper to follow soon @PhilipJKershaw