PES. Batch virtualization and Cloud computing. Part 1: Batch virtualization. Batch virtualization and Cloud computing

Similar documents
Cloud Computing through Virtualization and HPC technologies

HPC performance applications on Virtual Clusters

Computing in High- Energy-Physics: How Virtualization meets the Grid

OpenNebula Leading Innovation in Cloud Computing Management

Enabling Technologies for Distributed Computing

Investigation of storage options for scientific computing on Grid and Cloud facilities

Enabling Technologies for Distributed and Cloud Computing

Investigation of storage options for scientific computing on Grid and Cloud facilities

U-LITE Network Infrastructure

Cloud Server. Parallels. Key Features and Benefits. White Paper.

Intro to Virtualization

Cloud Computing Architecture with OpenNebula HPC Cloud Use Cases

IOS110. Virtualization 5/27/2014 1

Cloud Computing for Control Systems CERN Openlab Summer Student Program 9/9/2011 ARSALAAN AHMED SHAIKH

Hyper-V vs ESX at the datacenter

Virtualization Technologies and Blackboard: The Future of Blackboard Software on Multi-Core Technologies

Efficient Data Management Support for Virtualized Service Providers

Best Practices for Monitoring Databases on VMware. Dean Richards Senior DBA, Confio Software

Virtualization with Windows

OpenNebula Cloud Case Studies

Scientific Computing Data Management Visions

Lecture 2 Cloud Computing & Virtualization. Cloud Application Development (SE808, School of Software, Sun Yat-Sen University) Yabo (Arber) Xu

Solution for private cloud computing

MODULE 3 VIRTUALIZED DATA CENTER COMPUTE

MICROSOFT HYPER-V SCALABILITY WITH EMC SYMMETRIX VMAX

Virtualization Performance on SGI UV 2000 using Red Hat Enterprise Linux 6.3 KVM

Parallels Server 4 Bare Metal

Elastic Management of Cluster based Services in the Cloud

JOB ORIENTED VMWARE TRAINING INSTITUTE IN CHENNAI

White Paper. Recording Server Virtualization

Virtualization. Types of Interfaces

Intel Cloud Builder Guide: Cloud Design and Deployment on Intel Platforms

VirtualclientTechnology 2011 July

<Insert Picture Here> Introducing Oracle VM: Oracle s Virtualization Product Strategy

Best Practices for Optimizing Your Linux VPS and Cloud Server Infrastructure

HTCondor at the RAL Tier-1

ZEN LOAD BALANCER EE v3.04 DATASHEET The Load Balancing made easy

Managing a Tier-2 Computer Centre with a Private Cloud Infrastructure

2972 Linux Options and Best Practices for Scaleup Virtualization

Tier0 plans and security and backup policy proposals

Building a Private Cloud with Eucalyptus

SURFsara HPC Cloud Workshop

Enterprise Deployment: Laserfiche 8 in a Virtual Environment. White Paper

SQL Server Virtualization 101. David Klee, Group Principal and Practice Lead. SQL PASS Virtualization VC,

SURFsara HPC Cloud Workshop

StorPool Distributed Storage Software Technical Overview

Virtualization, Grid, Cloud: Integration Paths for Scientific Computing

IaaS Cloud Architectures: Virtualized Data Centers to Federated Cloud Infrastructures

AMD SEAMICRO OPENSTACK BLUEPRINTS CLOUD- IN- A- BOX OCTOBER 2013

Using SUSE Cloud to Orchestrate Multiple Hypervisors and Storage at ADP

STeP-IN SUMMIT June 18 21, 2013 at Bangalore, INDIA. Performance Testing of an IAAS Cloud Software (A CloudStack Use Case)

RED HAT ENTERPRISE VIRTUALIZATION

CS 6343: CLOUD COMPUTING Term Project

Open Source Cloud Computing Management with OpenNebula

Alternative models to distribute VO specific software to WLCG sites: a prototype set up at PIC

Windows Server 2008 R2 Hyper V. Public FAQ

Dell Virtualization Solution for Microsoft SQL Server 2012 using PowerEdge R820

Deploying Business Virtual Appliances on Open Source Cloud Computing

A quantitative comparison between xen and kvm

Virtualization in Linux a Key Component for Cloud Computing

Cloud Server. Parallels. An Introduction to Operating System Virtualization and Parallels Cloud Server. White Paper.

StACC: St Andrews Cloud Computing Co laboratory. A Performance Comparison of Clouds. Amazon EC2 and Ubuntu Enterprise Cloud

PARALLELS SERVER 4 BARE METAL README

Variations in Performance and Scalability when Migrating n-tier Applications to Different Clouds

An Experimental Study of Load Balancing of OpenNebula Open-Source Cloud Computing Platform

OGF25/EGEE User Forum Catania, Italy 2 March 2009

Making a Smooth Transition to a Hybrid Cloud with Microsoft Cloud OS

Integration of Virtualized Workernodes in Batch Queueing Systems The ViBatch Concept

ZEN LOAD BALANCER EE v3.02 DATASHEET The Load Balancing made easy

Scaling in a Hypervisor Environment

How To Build A Cloud Stack For A University Project

International Journal of Computer & Organization Trends Volume20 Number1 May 2015

Configuring and Managing a Private Cloud with Enterprise Manager 12c

PowerVC 1.2 Q Power Systems Virtualization Center

SQL Server Virtualization

Introduction to the NI Real-Time Hypervisor

With Red Hat Enterprise Virtualization, you can: Take advantage of existing people skills and investments

Ceph Distributed Storage for the Cloud An update of enterprise use-cases at BMW

Parallels Plesk Automation

PARALLELS CLOUD SERVER

WebLogic on Oracle Database Appliance: Combining High Availability and Simplicity

Operating Systems Virtualization mechanisms

Transcription:

Batch virtualization and Cloud computing Batch virtualization and Cloud computing Part 1: Batch virtualization Tony Cass, Sebastien Goasguen, Belmiro Moreira, Ewan Roche, Ulrich Schwickerath, Romain Wartel 2 nd Workshop on adapting applications and computing services to multi-core and virtualization See also related presentations: CloudViews2010 conference, Porto HEPIX spring and autumn meeting 2009, 2010 Virtualization vision, Grid Deployment Board (GDB) 9/9/2009 Batch virtualization at CERN, EGEE09 conference, Barcelona 1 st multi-threading and virtualization workshop, CERN, 2009

Outline and disclaimer Part 1: batch virtualization What we are doing Where we are right now First experiences and results (preliminary) Part 2: cloud computing Perspectives for new computing models Disclaimer: We are still in the testing and evaluation phase. No final decision has been taken yet on what we are going to use in the future. All given numbers and figures are preliminary

Why virtualization and cloud computing? Service consolidation: Improve resource usage by squeezing mostly unused machines onto single big hypervisors Decouple hardware life cycle from applications running on the box Ease management by supporting life migration Virtualization of batch resources: Decouple jobs and physical resources Ease management of the batch farm resources Enable the computer center for new computing models This presentation is about virtualization of batch resources only

Batch virtualization CERN batch farm lxbatch: ~3500 physical hosts ~20000 CPU cores >70 queues Current setup

What we are doing: Philosophy I Today Batch (SLC = Scientific Linux CERN) Physical SLC4 WN Near future: Physical SLC5 WN SLC4 WN SLC5 WN hypervisor cluster Batch Physical SLC4 WN (far) future? Physical SLC5 WN Batch T0 development other/cloud applications Internal cloud

Virtual batch: basic ideas Virtual batch worker nodes: Clones of real worker nodes, same setup Platform Infrastructure Sharing Facility Mix with physical resources (ISF) Dynamically join the batch farm as normal worker nodes For high level VM management Limited lifetime: stop accepting jobs after 24h, and then destroy when em Only one user job per VM at a time Note: The limited lifetime allows for a fully automated system which dynamically adapts to the current needs, and automatically deploys intrusive updates. Use case: intrusive updates on worker nodes (eg. Kernel upgrades)

Virtual batch: basic ideas, technical Images: staged on hypervisors Platform Infrastructure Sharing Facility (ISF) master images, instances use LVM snapshots For high level VM management Start with few different flavors only Image creation: Derived from a centrally managed golden node Regularly updated and distributed to get updates in

Virtual batch: basic ideas Images distribution: Platform Infrastructure Sharing Facility (ISF) Only shared file system available is AFS For high level VM management Prefer peer to peer methods SCP wave Rtorrent

Virtual batch: basic ideas VM placement and management system Use existing solutions Platform Infrastructure Sharing Facility (ISF) Testing both a free and a commercial For high solution level VM management OpenNebula (ONE) Platform's Infrastructure Sharing Facility (ISF) Both implement a request queue and a very basic scheduler to place VMs

Status of building blocks (test system) Submission and batch managemen Hypervisor cluster VM kiosk and image distribution VM management system Initial deployment OK OK OK OK Central management OK OK Mostly implemented ISF OK, ONE missing Monitoring and alarming OK (OK) Switched off for tests missing missing

Test setup for scalability tests Main goal: test batch system scalability using VMs Side effect tests: General infrastructure (eg network database) Image distribution VM provisioning systems (ONE, ISF) VM performance

Test setup for scalability tests Temporary access to 500 new worker nodes 2 x XEON L5520 @ 2.27GHz CPUs (8 cores) 24GB RAM ~1TB local disk space HS06 rating ~97 Software: SLC5/64bit with XEN

Performance tests: overview Image distribution: from repository to the hypervisors (ongoing) VM provisioning systems (ongoing) OpenNebula (ONE) (Open source solution) Infrastructure Sharing Facility (ISF), Platform computing HS06 rating of VMs Disk performance tests (ongoing) Network performance tests (todo)

Image distribution: SCP wave versus rtorrent Preliminary! Slow nodes, (under investigation) (BT = bit torrent)

Scalability tests: some first numbers One shot test with OpenNebula: Inject virtual machine requests And let them live and die, reduced life time 4h Record the number of alive machines seen by LSF every 30s Units: 1h5min

CPU performance: HS06 benchmark results (preliminary) Test case: 1 VM / hypervisor, raising the number of CPUs/hypervisor Physical box: 97 HS06 100 90 XEN Theoretical HS06 rating 80 70 60 50 Use case: VM for multithreaded apps preliminary 40 30 20 10 0 1 2 3 4 5 6 7 8 Number of CPUs per VM

CPU performance: HS06 benchmark results (preliminary) Test case : raise number of 1core VMs on a single hypervisor 14 12 XEN preliminary Working point HS06 rating per VM (average) 10 average HSP06 per VM 8 6 4 2 0 1 2 3 4 5 6 7 8 9 N VMs per hypervisor Number of VMs per hypervisor

Disk performance: iozone tests Test case: Raise the number of VMs running iozone on a single hypervisor Compare with running N iozone processes in parallel on a single physical server Write performance OK Read performance requires tuning Tuning: eg. change read-ahead size in /sys/block/svdx/queue/read_ahead_kb Command line parameters: iozone -Mce -r 256k -s 8g -f /root/iozone.dat -i0 -i1 -i

Disk performance: iozone tests read performance read [kb/s] Read [kb/s] 90000 80000 70000 60000 50000 40000 Too optimistic... Compare with physical node: ~100MB/s read ~82MB/s write ~20% performance penalty 30000 20000 10000 Needs further investigation! 0 1 2 3 4 5 6 7 8 9 number of VMs Command line parameters: iozone -Mce -r 256k -s 8g -f /root/iozone.dat -i0 -i1 -i

Network performance Still to do... no big worries here though

Technical issues and challenges Scalability General infrastructure Image distribution Provisioning system Batch system Networking / public IPs

Technical issues and challenges VM CPU accounting/ CPU factors VM monitoring VM read performance under load VM space requirements Scratch space Swap space AFS cache size

Batch virtualization END OF THIS PART

Batch virtualization: architecture Job submission CE/interactive Grey: physical resource Colored: different VMs Centrally managed Golden nodes VM kiosk Batch system management Hypervisors / HW resources VM worker nodes With limited lifetime VM management system