Integration of Virtualized Workernodes in Batch Queueing Systems: The ViBatch Concept




Dr. Armin Scheurer, Oliver Oberst, Prof. Günter Quast
Institut für Experimentelle Kernphysik, Fakultät für Physik
KIT, University of the State of Baden-Wuerttemberg and National Research Center of the Helmholtz Association
www.kit.edu

(Computer) Virtualization
- Sharing the resources of one physical machine between independent operating systems (OS) running in virtual machines (VMs).
- Virtual machines are decoupled from the underlying hardware, and (almost) arbitrary operating systems can be installed.
- Different virtualization techniques are provided by various vendors and open-source communities.
[Figure: one physical host machine with a virtualization layer running three VMs: VM server 1 (workernode, OS 1), VM server 2 (proxy server, OS 2), VM server 3 (user portal, OS 3).]

Why Virtualization?
- Offers independence from host systems and encapsulation of user interaction.
- Enables the use of special validated operating systems for high energy physics analysis.
- Enables the use of virtual appliances, e.g. CernVM (see later).
- Allows the dynamic partitioning of a shared HPC cluster:
  - Grants different setups for different user groups
  - No incompatibilities have to be considered
  - High flexibility

KVM - Kernel-based Virtual Machine
- KVM is implemented as a kernel module.
- The Linux kernel is the virtual machine monitor.
- VMs run as normal processes.
- Supports the native virtualization techniques AMD-V and Intel VT-x => very good performance!
[Figure: hardware at the bottom, the Linux kernel with the KVM module above it, and normal user processes running alongside VM 1 (Debian) and VM 2 (SuSE).]

libvirt - Virtualization API
- Interface to common VMMs/hypervisors such as KVM, Xen, VMware, UML.
- (Remote) management of virtual machines and storage.
- More information: http://libvirt.org

Dynamic Virtualization Project at KIT: HPC Cluster Models
[Figure: Group A, Group B and Group C mapped onto each of the three cluster models below.]

Isolated Computing Cluster
- Each group/institution has a separate cluster
- Administration overhead
- Cannot cover peak loads

Shared Computing Cluster
- All groups share one cluster
- Setup compromise not always possible
- Load balancing by fair-share

Dynamic Partitioned Cluster
- Configure the cluster in real time with VMs
- Allows any software/OS configuration
- Virtualization layer hidden
- Load balancing by fair-share

Dynamic Virtualization Project at KIT: ViBatch
- Lightweight tool enabling virtualization of job environments
- Can be integrated into arbitrary batch systems
- The batch system is not aware of the virtualization: no code modification is needed (only the configuration is adapted)
- The virtual environment is determined per job just by the queue the job is sent to (example: Torque PBS):
    qsub -q [normal_queue] job1.sh
    qsub -q [virtual_queue] job1.sh
- Job submission: only the queue changes!
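The queue-based selection described above can be illustrated with a minimal bash sketch. It is not taken from ViBatch itself: the function names and the `VIRTUAL_QUEUES` variable are hypothetical, and the queue names simply reuse the placeholders from the `qsub` lines.

```shell
#!/bin/bash
# Hypothetical list of queues whose jobs should run inside a VM.
# The queue name is an assumption; a real site would use its own names.
VIRTUAL_QUEUES="virtual_queue"

# Return 0 (true) if the given queue is listed as a virtual queue.
is_virtual_queue() {
    local queue="$1" q
    for q in $VIRTUAL_QUEUES; do
        [ "$q" = "$queue" ] && return 0
    done
    return 1
}

# Print which environment a job submitted to the given queue would get.
job_environment() {
    if is_virtual_queue "$1"; then
        echo "virtual"
    else
        echo "native"
    fi
}
```

Calling `job_environment virtual_queue` prints `virtual`, while any other queue name yields `native`. In a Torque setup the queue name would presumably be read from the arguments the batch system hands to the prologue script, so the job script and the batch system code stay unchanged.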

ViBatch - Workflow
[Figure: ViBatch workflow]

ViBatch - Lightweight
- Core components: just bash scripts (prologue, epilogue and remoteshell)
- Additional scripts for (almost) automatic installation on arbitrary clusters
- Cluster information and preferences in one config file
- Log files enable debugging and workload statistics
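As a rough illustration of how a prologue/epilogue pair could drive a VM, the sketch below starts a libvirt domain before a job and switches it off afterwards, writing a log line for each step. This is an assumption-laden sketch, not the ViBatch scripts: the VM naming scheme, the log file path and the function names are hypothetical, the remoteshell component is left out, and only the `virsh start` and `virsh destroy` commands are real libvirt calls.

```shell
#!/bin/bash
# Command used to talk to libvirt; overridable so the sketch can be dry-run.
VIRSH="${VIRSH:-virsh}"
# Hypothetical log file location.
VIBATCH_LOG="${VIBATCH_LOG:-/tmp/vibatch-sketch.log}"

# Append a timestamped message to the log file.
log() {
    echo "$(date '+%Y-%m-%d %H:%M:%S') $*" >> "$VIBATCH_LOG"
}

# Hypothetical naming scheme: one VM per job, named after the job id.
vm_name_for_job() {
    echo "vibatch-$1"
}

# Prologue step: start the (already defined) VM for this job.
prologue() {
    local vm
    vm="$(vm_name_for_job "$1")"
    log "prologue: starting $vm"
    $VIRSH start "$vm"
}

# Epilogue step: force the VM off once the job has finished.
epilogue() {
    local vm
    vm="$(vm_name_for_job "$1")"
    log "epilogue: stopping $vm"
    $VIRSH destroy "$vm"
}
```

Setting `VIRSH=echo` before calling `prologue 42` prints the command that would be run instead of contacting libvirt, which is a convenient way to check the logic on a machine without virtualization installed.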

ViBatch - Virtual Appliances: CernVM-FS
- Our VM image includes CernVM-FS, a remote file system accessed via HTTP and developed by the CernVM software appliance project (http://cernvm.cern.ch/portal).
- Provides LHC software installations (various VOs: CMS, ATLAS, ...), including the most common versions of the experiment software.
- We don't have to take care of our own installations!
- A simple Squid HTTP proxy server does the caching.
- Easy to install with the yum package manager.
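For orientation, a CernVM-FS client is typically pointed at its repositories and at the caching Squid proxy through a small key/value file. The fragment below is a sketch under assumptions: the proxy host name and the repository selection are hypothetical, the parameter names follow the CernVM-FS client documentation, and the configuration actually used inside the VM image may have differed.

```shell
# Sketch of /etc/cvmfs/default.local inside the VM image.
# Assumption: the client package was installed beforehand, e.g. via yum.

# Repositories to mount (hypothetical selection for CMS and ATLAS).
CVMFS_REPOSITORIES=cms.cern.ch,atlas.cern.ch

# Site-local Squid proxy that caches the HTTP traffic (hypothetical host).
CVMFS_HTTP_PROXY="http://squid.example.org:3128"
```

Because all VMs on the cluster fetch the same experiment software, routing them through one Squid instance means each file is downloaded from the remote servers only once.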

ViBatch in Operation at EKP, KIT
[Figure: Load of ViBatch (last 6 weeks); axis: # jobs]

ViBatch has already been used at EKP for several HEP analyses:
- Data skims for the Higgs TauTau analysis (see talk A. Burgmeier, T49.7)
- Monte Carlo generation for studies in the Higgs search (C. Hackstein, T49.1)
Running on the EKP production cluster in parallel to native job submission.

Performance (native vs. virtual)
- Monte Carlo simulation (vbfnlo): native, virtual
- CPU benchmark (whetstone): native, virtual
- CMSSW physics analysis: native not available, virtual
- Differences shown: +17 %, +12 %
- Depends on KVM tuning and host setup; currently being investigated and tuned (KSM, ...)
- SLE11 is not binary compatible with CMSSW

ViBatch in Operation at EKP, KIT

Shared Institutscluster IC1 at KIT
  Workernodes (EKP)   200 (25)
  CPU                 8 x 2.66 GHz Intel Xeon
  Memory              2 GB RAM per core
  Disc space          750 GB per node
  Storage             350 TB Lustre FS
  Network             40 Gbit/s InfiniBand

Our setup characteristics & problems
- Memory consumption: ~2 GB RAM per VM
- Currently no InfiniBand driver for our VMs => no native use of the Lustre file system possible
- Storage mounted via NFS export
- Compatibility problems between the kernel-space NFS daemon and the Lustre driver: unstable, a few nodes crashed
- Currently solved by using a user-space NFS daemon

Conclusion and outlook
- Extend operation to the whole cluster (200 nodes => 1600 VM slots)
- Provide detailed documentation
- Further simplify installation
- Burst into the cloud: connect with ROCED (talk S. Riedel, T 77.3)
[Figure: ViBatch, Cloud + ROCED]
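The figure of 1600 VM slots can be reproduced from the cluster numbers quoted for IC1. The short calculation below is a sketch that assumes one VM slot per CPU core and ignores the memory needed by the host system itself; the variable names are illustrative.

```shell
#!/bin/bash
NODES=200            # workernodes in the whole cluster
CORES_PER_NODE=8     # 8 x 2.66 GHz Intel Xeon per node
RAM_PER_CORE_GB=2    # 2 GB RAM per core
RAM_PER_VM_GB=2      # ~2 GB RAM consumed per VM

# Assumption: one VM slot per core.
VM_SLOTS=$((NODES * CORES_PER_NODE))

# Memory check: how many 2 GB VMs fit into one node's RAM.
RAM_PER_NODE_GB=$((CORES_PER_NODE * RAM_PER_CORE_GB))
VMS_PER_NODE_BY_RAM=$((RAM_PER_NODE_GB / RAM_PER_VM_GB))

echo "VM slots: $VM_SLOTS"
echo "RAM per node: $RAM_PER_NODE_GB GB"
echo "VMs per node by RAM: $VMS_PER_NODE_BY_RAM"
```

With these inputs the script reports 1600 VM slots, 16 GB of RAM per node and room for 8 VMs per node by memory, so the per-core and per-memory limits coincide under the stated assumptions.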