Operating an OpenStack Cloud

Similar documents
Openstack. Cloud computing with Openstack. Saverio Proto

Introducing ScienceCloud

Ceph. A complete introduction.

Installing Hadoop over Ceph, Using High Performance Networking

How to Deploy OpenStack on TH-2 Supercomputer Yusong Tan, Bao Li National Supercomputing Center in Guangzhou April 10, 2014

Running an OpenStack Cloud for several years and living to tell the tale. Alexandre Maumené Gaëtan Trellu Tokyo Summit, November 2015

Scyld Cloud Manager User Guide

CloudCIX Bootcamp. The essential IaaS getting started guide.

Sheepdog: distributed storage system for QEMU

AMD SEAMICRO OPENSTACK BLUEPRINTS CLOUD- IN- A- BOX OCTOBER 2013

SUSE Cloud Installation: Best Practices Using a SMT, Xen and Ceph Storage Environment

Bosch Video Management System High Availability with Hyper-V

Assignment # 1 (Cloud Computing Security)

STeP-IN SUMMIT June 18 21, 2013 at Bangalore, INDIA. Performance Testing of an IAAS Cloud Software (A CloudStack Use Case)

Ubuntu OpenStack on VMware vsphere: A reference architecture for deploying OpenStack while limiting changes to existing infrastructure

SYNNEFO: A COMPLETE CLOUD PLATFORM OVER GOOGLE GANETI WITH OPENSTACK APIs VANGELIS KOUKIS, TECH LEAD, SYNNEFO

PES. Ermis service for DNS Load Balancer configuration. HEPiX Fall Aris Angelogiannopoulos, CERN IT-PES/PS Ignacio Reguero, CERN IT-PES/PS

UZH Experiences with OpenStack

FIA Athens 2014 ~OKEANOS: A LARGE EUROPEAN PUBLIC CLOUD BASED ON SYNNEFO. VANGELIS KOUKIS, TECHNICAL LEAD, ~OKEANOS

How To Install Openstack On Ubuntu (Amd64)

The Greenplum Analytics Workbench

Service Catalogue. virtual services, real results

Scientific Computing Data Management Visions

Déployer son propre cloud avec OpenStack. GULL François Deppierraz

Red Hat CloudForms : OPENSTACK INFRASTRUCTURE MANAGEMENT. John Hardy Senior Principal Product Manager 24 rd June 2015

Reference Design: Scalable Object Storage with Seagate Kinetic, Supermicro, and SwiftStack

A Complete Open Cloud Storage, Virt, IaaS, PaaS. Dave Neary Open Source and Standards, Red Hat

Web Application Platform for Sandia

StorPool Distributed Storage Software Technical Overview

Altor Virtual Network Security Analyzer v1.0 Installation Guide

Hadoop: Embracing future hardware

PowerVC 1.2 Q Power Systems Virtualization Center

1.1 SERVICE DESCRIPTION

Vintage IT Services MSRP Pricing PaaS Catalog Pricing Service Categories MSRP Pricing

Prepared for: How to Become Cloud Backup Provider

OpenStack Awareness Session

Solution for private cloud computing

Develop a process for applying updates to systems, including verifying properties of the update. Create File Systems

CERN Cloud Architecture

Citrix Lab Manager 3.6 SP 2 Quick Start Guide

NephOS A Licensed End-to-end IaaS Cloud Software Stack for Enterprise or OEM On-premise Use.

SUSE Cloud Installation: Best Practices Using an Existing SMT and KVM Environment

University of Birmingham & CLIMB GPFS User Experience. Simon Thompson Research Support, IT Services University of Birmingham

Cloud Optimize Your IT

How to Backup and Restore a VM using Veeam

Vantage Report. Quick Start Guide

Features of AnyShare

Introduction to Cloud Computing

The last 18 months. AutoScale. IaaS. BizTalk Services Hyper-V Disaster Recovery Support. Multi-Factor Auth. Hyper-V Recovery.

Installing GFI Network Server Monitor

Migration of virtual machine to cloud using Openstack Python API Clients

SDN v praxi overlay sítí pro OpenStack Daniel Prchal daniel.prchal@hpe.com

Building a big IaaS cloud with Apache CloudStack

HP OpenStack & Automation

1 What is Cloud Computing? Cloud Infrastructures OpenStack Amazon EC CAMF Cloud Application Management

OpenStack Introduction. November 4, 2015

Technical Overview Simple, Scalable, Object Storage Software

Barracuda Backup Vx. Virtual Appliance Deployment. White Paper

Cloud Panel Service Evaluation Scenarios

Camilyo APS package by Techno Mango Service Provide Deployment Guide Version 1.0

POSIX and Object Distributed Storage Systems

Cloudified IP Multimedia Subsystem (IMS) for Network Function Virtualization (NFV)-based architectures

How Comcast Built An Open Source Content Delivery Network National Engineering & Technical Operations

Mirantis OpenStack Express: Security White Paper

FileCruiser Backup & Restoring Guide

Low-cost

In order to upload a VM you need to have a VM image in one of the following formats:

Introduction to OpenStack

CS 6343: CLOUD COMPUTING Term Project

Cloud.com CloudStack Release Notes

Installation Runbook for Avni Software Defined Cloud

Cloud on TEIN Part I: OpenStack Cloud Deployment. Vasinee Siripoonya Electronic Government Agency of Thailand Kasidit Chanchio Thammasat University

Virtual Appliance Setup Guide

Prepared for: How to Become Cloud Backup Provider

Open Source Virtualization with ovirt. DI (FH) René Koch Systems Engineer Siedl Networks GmbH Grazer Linuxtage,

On- Prem MongoDB- as- a- Service Powered by the CumuLogic DBaaS Platform

Application-Centric WLAN. Rob Mellencamp

Virtual Server and Storage Provisioning Service. Service Description

Release Notes for Fuel and Fuel Web Version 3.0.1

A Secure Intrusion detection system against DDOS attack in Wireless Mobile Ad-hoc Network Abstract

Using SUSE Cloud to Orchestrate Multiple Hypervisors and Storage at ADP

OnApp Cloud. The complete platform for cloud service providers. 114 Cores. 286 Cores / 400 Cores

Setting up your virtual infrastructure using FIWARE Lab Cloud

TANDBERG MANAGEMENT SUITE 10.0

Veeam Cloud Connect. Version 8.0. Administrator Guide

Building Storage Service in a Private Cloud

Monitor Open stack environments from the bottom up and front to back. Roger Ruttimann VP Engineering, GroundWork OpenSource November 17, 2015

Parallels Cloud Storage

Cloud Design and Deployment on Intel Xeon Processor-based Platforms

SUSE OpenStack Cloud 4 Private Cloud Platform based on OpenStack. Gábor Nyers Sales gnyers@suse.com

Installation Guide. Step-by-Step Guide for clustering Hyper-V virtual machines with Sanbolic s Kayo FS. Table of Contents

NTP Software File Reporter Analysis Server

Transcription:

Operating an OpenStack Cloud Learning from building & operating SWITCHengines SA7 T3, 26.11.2015 Jens-Christian Fischer jens-christian.fischer@switch.ch

SWITCH Non Profit Foundation IT Services for Universities and Higher Education Traditional Internet (SWITCHlan) Federated Single Sign On (SWITCHaai) Lecture Recording & Videos (SWITCHcast / SWITCHtube) Many, many more A partner to Acacdemia Registry for.ch (since 1987) And Domain registration until 2015 2

SWITCH Services Established services: Network Authentication and authorization infrastructure PKI, SWITCHham, SMS, videoconf Security Collaboration, e-learning services New(er) services: Procurement Cloud services Under development: Swiss edu-id, Information Security Management (ISMS) 3

Cloud Infrastructure as a Service SWITCHengines Software as a Service SWITCHdrive an Academic Sync & Share Service SWITCHfilesender overcome the 10 MB Mailbox limit SWITCHcast record and edit lectures SWITCHtube a secure video distribution tool 4

Timeline for IaaS Cloud Offering 2012: Start of Project BCC Building Cloud Competency 2013: Decision that SWITCH builds Swiss Academic Cloud services 2014: SCALE project. One of the deliverables build an initial cloud infrastructure - 64 Servers purchased and installed 2015: Pilot usage with production load. End of SCALE project Start of SCALE-UP project 2016: SWITCHengines an official SWITCH service 2017: End of SCALE-UP project 5

Current Status SWITCHengines has been in public beta and internal production since December 2014 Several SWITCH services run on it (SWITCHdrive, SWITCHengines, SWITCHfilesender, SWITCHtube) Over 630 individual users and around 40 research projects online Around 1000 VMs running on the cloud right now (2200 cores, 3700 GB RAM, 360 TB of storage allocated) Over 60 000 VMs have been created / decommissioned since the beginning 6

Cloud Usage 7

Cloud Usage 8

Expansion under way Double the number of compute nodes (and increase the number of CPUs by 150% - Moore s Law at Work) 2000 CPU cores Increase the storage capacity by 50% 2.25 PB raw storage Online end of 2015 9

Building https://www.flickr.com/photos/criminalintent/5101528210 10

Hardware is easy Commercial Off the Shelf Low Cost, Medium Performance (but lots of it) 11

Building SWITCHengines 2 Geographic Locations ZHdK UNIL 32 Storage Nodes 384 4TB SATA Disks (and a bunch of SSDs for Journals) 1.5 PB RAW Storage 41 Compute Nodes 1240 Cores 9.2 TB RAM 12

Software 13

But wait, there s more 14

Architecture 15

Infrastructure - Foreman 16

Ceph When 12 * 4 TB aren t enough, go distributed storage systems $ ceph -s cluster yyyyyyyyy-bdae-464b-a864-xxxxxxxxxxx health HEALTH_OK monmap e2: 3 mons at {zhdk0009=[2001:xxx::1009]:6789/0,zhdk0013=[2001:yyy::1013]:6 789/0,zhdk0025=[2001:yyy::1025]:6789/0}, election epoch 612, quorum 0,1,2 zhdk0009,zhdk0013,zhdk0025 osdmap e44762: 125 osds: 125 up, 125 in pgmap v20730639: 3336 pgs, 17 pools, 85088 GB data, 23864 kobjects 279 TB used, 174 TB / 454 TB avail 3317 active+clean 19 active+clean+scrubbing+deep client io 10031 kb/s rd, 14072 kb/s wr, 1472 op/s 17

Foreman Server Definition 18

Server Puppet Classes 19

Puppet Configuration Management http://puppetlabs.com https://github.com/puppetlabs/puppetlabs-openstack Modules for every component of OpenStack Generic Glue Modules that build the actual Cluster 20

Split between Code Puppet 21

Configuration - Hiera 22

Puppet Glue 23

Orchestrating the Orchestrator Ansible - http://www.ansible.com/home ansible-playbook disable_puppet.yml -i cloud --sudo --asksudo-pass --extra-vars "hosts=production ansible-playbook prepare_release.yml -i cloud --sudo --asksudo-pass --extra-vars "modules=v1.3 config=v1.3.1 ansible-playbook test_release.yml -i cloud --sudo --ask-sudopass --extra-vars "hosts=canary-vm-unil ansible-playbook do_release.yml -i cloud --sudo --ask-sudopass --extra-vars "hosts=zhdk-all" 24

Version? What version? 25

Discussions 26

Merge Requests 27

Monitoring 28

Usage 29

Usage (5 months later) 30

Performance 31

Monitoring 32

Reporting 33

User Interface 34

New UI 35

New UI 36

OpenStack ++ There is more to cloud than just cloud You need the tools You need the team And when you start to actually run the cloud, things get fun 37

When you build it, they will come Network simulations Digital Humanities Aerial Photography of Lake shores (Leman / Baikal) CoreOS / Docker Container orchestration Classroom VMs ATLAS computation nodes Lotus Domino Servers Web servers from University IT departments TOR exit nodes 38

Joys of running a cloud User chooses Public network, her VM doesn t launch Your network node takes down all VMs network connections A faulty disk causes major performance problems in a distributed, no SPOF storage system Your Performance test framework doesn t delete default security rules => IPTables with 70 000 entries Your keystone service catalog entries get duplicated, all API access breaks because Keystone tokens are hashed differently 39

Joys of running a cloud Deleting a Cinder volume takes down your whole Storage API services (bug in the Cinder/Ceph integration) Users enable password login to their VM (because: Easier) Your organisation changes the way the Firewall rules are distributed to the routers You need to manage access for 650 users and 50 groups/tenants Your users exhaust the number of floating (public) IP addresses you have assigned to the Cloud network 40

Watch the network traffic 41

Part of a DDOS attack Users VM was hacked (see above: User enables password login) 42

43 http://en.wikipedia.org/wiki/computing#/media/file:babbage_difference_engine_(being_utilised).jpg