Managing a Tier-2 Computer Centre with a Private Cloud Infrastructure

Similar documents
The OpenNebula Cloud Platform for Data Center Virtualization

OpenNebula Open Souce Solution for DC Virtualization. C12G Labs. Online Webinar

Getting Started Hacking on OpenNebula

OpenNebula Open Souce Solution for DC Virtualization

OpenNebula Open Souce Solution for DC Virtualization

Cloud Computing Architecture with OpenNebula HPC Cloud Use Cases

Open Source Cloud Computing Management with OpenNebula

OpenNebula The Open Source Solution for Data Center Virtualization

Interoperating Cloud-based Virtual Farms

With Red Hat Enterprise Virtualization, you can: Take advantage of existing people skills and investments

ovirt and Gluster hyper-converged! HA solution for maximum resource utilization

ovirt and Gluster hyper-converged! HA solution for maximum resource utilization

Introduction to OpenStack

StorPool Distributed Storage Software Technical Overview

Why is a good idea to use OpenNebula in your VMware Infrastructure?

Cloud Computing for Control Systems CERN Openlab Summer Student Program 9/9/2011 ARSALAAN AHMED SHAIKH

Building Clouds with OpenNebula 2.2 and StratusLab

Scala Storage Scale-Out Clustered Storage White Paper

Building Storage Service in a Private Cloud

CernVM Online and Cloud Gateway a uniform interface for CernVM contextualization and deployment

Deployment of Private, Hybrid & Public Clouds with OpenNebula

PES. Batch virtualization and Cloud computing. Part 1: Batch virtualization. Batch virtualization and Cloud computing

SUSE Cloud 2.0. Pete Chadwick. Douglas Jarvis. Senior Product Manager Product Marketing Manager

OpenNebula Cloud Platform for Data Center Virtualization

IaaS Cloud Architectures: Virtualized Data Centers to Federated Cloud Infrastructures

Deep Dive on SimpliVity s OmniStack A Technical Whitepaper

2) Xen Hypervisor 3) UEC

High Availability Databases based on Oracle 10g RAC on Linux

Building Clouds with OpenNebula 3.4

FREE AND OPEN SOURCE SOFTWARE FOR CLOUD COMPUTING SERENA SPINOSO FULVIO VALENZA

VMware Software-Defined Storage Vision

Building Storage as a Service with OpenStack. Greg Elkinbard Senior Technical Director

WHITE PAPER. Software Defined Storage Hydrates the Cloud

A Complete Open Cloud Storage, Virt, IaaS, PaaS. Dave Neary Open Source and Standards, Red Hat

Mobile Cloud Computing T Open Source IaaS

Architecture des plates-formes IaaS Etat des lieux et perspectives

Comparing Open Source Private Cloud (IaaS) Platforms

What Is Microsoft Private Cloud Fast Track?

Using SUSE Cloud to Orchestrate Multiple Hypervisors and Storage at ADP

cloud functionality: advantages and Disadvantages

How To Compare Cloud Computing To Cloud Platforms And Cloud Computing

Deploying Business Virtual Appliances on Open Source Cloud Computing

Establishing Scientific Computing Clouds on Limited Resources using OpenStack

How To Create A Cloud Based System For Aaas (Networking)

Introduction to Cloud Computing

Service Catalogue. virtual services, real results

Performance Testing of a Cloud Service

Optimizing Large Arrays with StoneFly Storage Concentrators

DVS Enterprise. Reference Architecture. VMware Horizon View Reference

Solution for private cloud computing

OGF25/EGEE User Forum Catania, Italy 2 March 2009

EMC Virtual Infrastructure for Microsoft Applications Data Center Solution

MICROSOFT HYPER-V SCALABILITY WITH EMC SYMMETRIX VMAX

VMware Software-Defined Storage & Virtual SAN 5.5.1

Symantec NetBackup Appliances

Ubuntu OpenStack on VMware vsphere: A reference architecture for deploying OpenStack while limiting changes to existing infrastructure

PARALLELS CLOUD STORAGE

IBM PureData System for Transactions. Technical Deep Dive. Jonathan Rossi, PureSystems Specialist

INUVIKA TECHNICAL GUIDE

Building an AWS-Compatible Hybrid Cloud with OpenStack

The next step in Software-Defined Storage with Virtual SAN

What s New with VMware Virtual Infrastructure

MagFS: The Ideal File System for the Cloud

SUSE Cloud Installation: Best Practices Using an Existing SMT and KVM Environment

VMware Virtual Machine File System: Technical Overview and Best Practices

A virtual SAN for distributed multi-site environments

SUSE Cloud Installation: Best Practices Using a SMT, Xen and Ceph Storage Environment

ovirt and Gluster Hyperconvergence

Sistemi Operativi e Reti. Cloud Computing

Business Process Desktop: Acronis backup & Recovery 11.5 Deployment Guide

DIABLO TECHNOLOGIES MEMORY CHANNEL STORAGE AND VMWARE VIRTUAL SAN : VDI ACCELERATION

Technical White Paper Integration of ETERNUS DX Storage Systems in VMware Environments

Postgres on OpenStack

U-LITE Network Infrastructure

Deployment Guide. How to prepare your environment for an OnApp Cloud deployment.

Frequently Asked Questions: EMC UnityVSA

OpenNebula Leading Innovation in Cloud Computing Management

Zadara Storage Cloud A

Testing of several distributed file-system (HadoopFS, CEPH and GlusterFS) for supporting the HEP experiments analisys. Giacinto DONVITO INFN-Bari

OpenStack IaaS. Rhys Oxenham OSEC.pl BarCamp, Warsaw, Poland November 2013

Traditional v/s CONVRGD

MICROSOFT CLOUD REFERENCE ARCHITECTURE: FOUNDATION

FOR SERVERS 2.2: FEATURE matrix

Virtual Machine Management with OpenNebula in the RESERVOIR project

Transcription:

Managing a Tier-2 Computer Centre with a Private Cloud Infrastructure Stefano Bagnasco, Riccardo Brunetti, Stefano Lusso (INFN-Torino), Dario Berzano (CERN) ACAT2013 Beijing, May 16-21, 2013

motivation The amount of resources and the variety of applications is steadily increasing, manpower unfortunately is not" It is becoming almost mandatory to consolidate such resources to achieve scalability and economies-of-scale Separate application management from infrastructure management Our Data Centres need to become providers of computing and storage resources, not (only) of high level services The Cloud approach (IaaS) might help to better provision resources to the different scientific computing applications Grid Sites, small or medium computing farms, single users, Several cloud computing projects are starting at national and European level From definition of best practices and reference configurations to deployment of large-scale distributed infrastructures A local working cloud infrastructure will also allow to take immediately part in such activities ACAT2013 Beijing, May 16-21, 2013-2/417

philosophy Ensure interoperability Favour manageability and flexibility over performance Keep it simple Stay" mainstream Provide a production service to applications Be useroriented ACAT2013 Beijing, May 16-21, 2013-3/417

philosophy Don t use too many tools Develop as few pieces as possible Introduce features only when needed by applications Use few simple images plus contextualization Keep it simple Stay" mainstream Be useroriented ACAT2013 Beijing, May 16-21, 2013-4/417

philosophy Choose stable and widely used tools and components: OpenNebula cloud stack Common interfaces: " OCCI, EC2, OCA GlusterFS filesystem OpenWRT for network management Keep it simple Stay mainstream Be useroriented ACAT2013 Beijing, May 16-21, 2013-5/417

philosophy Adopt an agile development cycle Give resources to users as soon as possible Add functionalities as they become needed Keep it simple Stay mainstream Be useroriented ACAT2013 Beijing, May 16-21, 2013-6/417

two clusters Services VMs providing critical services: in- & out-bound connectivity public & private IP live migration no special I/O requirements Workers VMs providing computing workforce: example: Grid WNs private IP only. high storage I/O performance ACAT2013 Beijing, May 16-21, 2013-7/417

two clusters Server-class hardware Shared image repository Resiliency-optimized FS for shared system disks Currently 4 hosts Services Working-class hardware J Cached image repository Access to perfomanceoptimized FS for data needs Currently 35 hosts. Workers ACAT2013 Beijing, May 16-21, 2013-8/417

stakeholders Tier-2 Services Grid and PROOF User Interfaces Grid Tier-2 R&D Services Short-term needs PROOF Masters Workers Medical Image Processing BES-III Computing R&D ALICE Analysis Facility ACAT2013 Beijing, May 16-21, 2013-9/417

the ingredients Cloud management Toolkit: OpenNebula Open Source stack with a wide user community Modular architecture Already provides most of the required functionalities Uses standard interfaces (EC2, OCCI, OCA) Easy to customize (mostly shell and ruby scripts) OpenStack, now widely adopted in new projects, was too embrionic when we started and arguably* OpenNebula is better suited at Data Center Nebulization Currently using version 3.6, will migrate to 3.8 soon (or 4.0, available since last week) We use templates based on few very simple images plus full contextualization via context scripts and puppet (looking into CloudInit) * See e.g. blog.opennebula.org/?p=4042 ACAT2013 Beijing, May 16-21, 2013-10/417

the ingredients Backend storage: GlusterFS Easy to setup in a basic configuration Flexible enough to cater to different needs with a single tool (see next slides) Proven robustness and scalability VM network management: OpenWRT Light-weight Linux distribution for embedded systems Provides tools for network configuration and management We deploy VRouters for virtual clusters Again, OpenVSwitch was not integrated in OpenNebula when we started ACAT2013 Beijing, May 16-21, 2013-11/417

multipurpose storage: glusterfs GlusterFS mimics RAID functionalities at filesystem level by aggregating bricks on different machines: distributed replicated striped (can be combined) Horizontal scalability: no master host, all synchronizations are peer-to peer clients access data directly from the node hosting it Easy management: On-line addition, removal, replacement of bricks ACAT2013 Beijing, May 16-21, 2013-12/417

multipurpose storage: glusterfs Our use cases: VM image repository: one brick exported System datastore for service-class hosts: replicated on two servers for redundancy. Replica is synchronous, self-healing enabled. Continuous r/w occurs Experiment data pool of aggregated disks (currently ~50 TB). Very high throughput towards many concurrent clients ACAT2013 Beijing, May 16-21, 2013-13/417

multipurpose storage: glusterfs Two storage servers with 10Gbps interface provide some of the LUNs through GlusterFS All the virtual machines run on RAW or QCOW file images Services System Datastore is shared to allow live migration Workers System Datastore is local to the hypervisors to increase " I/O capacity. Images repository is locally cached on each hypervisor to reduce startup time. An ad-hoc script synchronizes the local copies using a custom torrent-like tool (scpwave + rsync) when new versions of the images are saved FC SAN Image Repository Datastore Gluster Replicated Volume Storage Servers Shared Datastore for running VMs Services Cluster Cache for imagerepo Datastore Workers Cluster ACAT2013 Beijing, May 16-21, 2013-14/417

networking Network Isolation (Level 2) Each user has its own Virtual Network, isolated using ebtables rules defined on the hypervisor bridge (OpenNebula V-net driver takes care of this). Virtual Routers (Level 3) Lightweight VM image (1 CPU, 150 MB Ram) with a Linux distribution designed for embedded systems DHCP Server, DNS Server, NAT Firewalling/Port Forwarding This provides the user with a dedicated fully featured class-c network while the connectivity remains under our control (the user has no access to the V-Router) V-Routers WAN V-Net 1 V-Net 2 ACAT2013 Beijing, May 16-21, 2013-15/417

vrouter gui ACAT2013 Beijing, May 16-21, 2013-16/417

sunstone dashboard Tier-2 services and worker node templates CERNVM-based templates ACAT2013 Beijing, May 16-21, 2013-17/417

tier-2 cpu efficiency ALICE overall average over same period ~85% http://alimonitor.cerm.ch ACAT2013 Beijing, May 16-21, 2013-18/417

the virtual analysis facility Dario.Berzano@cern.ch http://goo.gl/cfnmm ACAT2013 Beijing, May 16-21, 2013-19/417

the virtual analysis facility Dario.Berzano@cern.ch http://goo.gl/cfnmm ACAT2013 Beijing, May 16-21, 2013-20/417

the virtual analysis facility Dario.Berzano@cern.ch http://goo.gl/cfnmm ACAT2013 Beijing, May 16-21, 2013-21/417

future developments Understand the opportunities given by the CernVM ecosystem Study the integration of the OpenNebula Authn/ Authz system in a VO context or using federated authentication mechanisms. Explore the GlusterFS UFO Object Storage to provide a DropBox-like storage to users. Participate in upcoming projects aimed to develop a higher-level federated cloud infrastructure ACAT2013 Beijing, May 16-21, 2013-22/417

conclusions The infrastructure is in full production mode since more than one year The core software stack (OpenNebula + GlusterFS) proved itself stable and robust The management of the centre was actually simplified Trivial example: rolling updates Lots of room for improvement and optimization Example: there is no trivial method to optimize allocation of sets of identical machines on heterogeneous hypervisors (8, 12, 24 cores per host) Lots of room also for new features, extensions and integrations ACAT2013 Beijing, May 16-21, 2013-23/417

thank you! Questions? Stefano.Bagnasco@to.infn.it ACAT2013 Beijing, May 16-21, 2013-24/417