wu.cloud: Insights Gained from Operating a Private Cloud System




Stefan Theußl, Institute for Statistics and Mathematics, WU Wirtschaftsuniversität Wien
March 23, 2011

Introduction

In statistics we increasingly face the following challenges:
(1) more accurate and more time-consuming models,
(2) computationally intensive applications, and/or
(3) large datasets.

Thus, one could
- just wait (1+2),
- reduce the problem size (3),
- (A) run similar tasks on independent processors in parallel,
- (B) load data onto multiple machines that work together in parallel, or
- (C) outsource computation.

In this talk we focus on option C: outsourcing computation.
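For contrast with option C, option A (similar tasks on independent processors) might be sketched as follows. This is a minimal illustration, not code from the talk: the task `simulate_mean`, the helper `run_parallel`, and the worker count are all assumptions chosen for the example.

```python
import random
from concurrent.futures import ProcessPoolExecutor


def simulate_mean(seed, n_draws=10_000):
    """One independent task: Monte Carlo estimate of E[U] for U ~ Uniform(0, 1)."""
    rng = random.Random(seed)  # private generator, so tasks share no state
    return sum(rng.random() for _ in range(n_draws)) / n_draws


def run_parallel(seeds, max_workers=4):
    """Distribute the independent tasks over separate worker processes."""
    with ProcessPoolExecutor(max_workers=max_workers) as pool:
        return list(pool.map(simulate_mean, seeds))


if __name__ == "__main__":
    # Eight replications, each with its own seed (hypothetical workload).
    print(run_parallel(range(8)))
```

Because every replication carries its own seed and generator, the tasks need no communication, which is what makes this kind of workload easy to spread over cores or machines.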

Requirements (Applications)

Statisticians need/want to
- run computationally intensive applications,
- process large data sets,
- run memory-demanding applications.

For example:
- Bayesian statistics (Gibbs sampling)
- Complex optimization problems
- Investigation of CDS/bond quotes/trades via a database backend
- MC simulation: hedging of options with Lévy processes
- Text mining on large data sets
- Topic models

Requirements (Software)

However, the scientific software employed is usually rather heterogeneous:
- R: want to use the current version and a complete development environment
- Compilers: GNU Compiler Collection, Intel Compiler, etc.
- Mathematica and gridMathematica
- Matlab
- Optimization: want to use state-of-the-art optimizers like CPLEX, GLPK, KNITRO, MOSEK, etc.

ideally on different platforms: Linux and Windows-based systems (32 and 64 bit),
using various editors: emacs, RStudio, WinEdt, nano, vi, etc.

Outsourcing Computation

If we want to outsource our daily (scientific) computation, at WU we can either
- buy new equipment to run the given application
  - not recommended; only if unavoidable
- use appropriately configured workstations/virtual (Xen) instances for different types of problems
  - almost perfect solution for a given application
  - not scalable
- use a 520-core cluster of workstations called cluster@wu
  - very scalable
  - however, applications must meet certain requirements, such as running on a specific OS as a batch job
- or move computations to the cloud
  - on-demand network access to a shared pool of configurable computing resources

The latter seems to be becoming a very popular vehicle for outsourcing computational tasks, as we will show in this talk.

Private Clouds

Why run a private cloud system?
- It emulates a public cloud on (existing) private resources,
- and thus provides the benefits of clouds (elasticity, dynamic provisioning, multi-OS/architecture operation, etc.),
- while maintaining control of resources.
- Moreover, there is always the option to scale out to the public cloud (going hybrid).

wu.cloud

From the NIST Definition of Cloud Computing, see http://csrc.nist.gov/groups/sns/cloud-computing/, we derived the following cloud model for wu.cloud:
- private cloud, operated solely for WU members and projects; thus, network access only via Intranet/VPN,
- on-demand self-service,
- resource pooling via virtualization,
- extensibility/elasticity,
- Infrastructure as a Service (IaaS),
- Platform as a Service (PaaS).

wu.cloud

- wu.cloud is a private cloud system based on the open source software package Eucalyptus (see http://open.eucalyptus.com/).
- It is accessible via http://cloud.wu.ac.at/.
- It consists of a frontend (website, management software) and a backend (providing resources) system.

wu.cloud Hardware

Backend system:
- 2x IBM X3850 X5
- 8x8 (64) core Intel Xeon CPUs, 2.26 GHz
- 1 TB RAM
- EMC² Storage Area Network: 7 TB fast + 4 TB slow disks
- Suse Linux Enterprise Server 11 SP1
- Xen 4.0.1
- Eucalyptus backend components (cluster, storage, node controller)

Frontend system:
- Virtual (Xen) instance
- Apache Webserver
- Eucalyptus frontend components (cloud controller, walrus)

[Image: (c) 2010 IBM Corporation, from Datasheet XSD03054-USEN-05]

wu.cloud Characteristics

wu.cloud aims at scaling in three different dimensions:
- Compute nodes: number of cloud instances and cores employed
- Memory: amount of memory requested per instance
- Software: Windows vs. Linux and the software packages installed

[Figure: example instance types placed along three axes. CPU axis: 0 to 35. RAM [GB] per instance axis: 1 to 256. Software axis: Linux base system, R/Mathematica/Matlab, R dev environment, GUI-based customized system, R dev environment, Matlab/PASW/Stata, Windows base system. Labelled instances: Debian/gridMathematica virtual cluster, Windows/R high CPU instance, Debian/R high memory instance.]
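The three scaling dimensions can be read as a filter over a catalogue of instance profiles. The sketch below is only an illustration of that idea: the profile names echo the labels in the figure, but the core counts, RAM sizes, and the `InstanceProfile` / `select_profiles` names are hypothetical and not taken from wu.cloud.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class InstanceProfile:
    name: str
    os: str              # software dimension: operating system
    cores: int           # compute dimension
    ram_gb: int          # memory dimension
    software: frozenset  # software dimension: installed packages


# Hypothetical catalogue; all sizes are made up for illustration.
PROFILES = [
    InstanceProfile("Windows/R high CPU instance", "Windows", 16, 8,
                    frozenset({"R"})),
    InstanceProfile("Debian/R high memory instance", "Linux", 4, 128,
                    frozenset({"R"})),
    InstanceProfile("Debian/gridMathematica virtual cluster", "Linux", 32, 64,
                    frozenset({"Mathematica"})),
]


def select_profiles(min_cores=1, min_ram_gb=1, os=None, software=()):
    """Return all profiles that satisfy a request in every dimension."""
    needed = frozenset(software)
    return [
        p for p in PROFILES
        if p.cores >= min_cores
        and p.ram_gb >= min_ram_gb
        and (os is None or p.os == os)
        and needed <= p.software  # every requested package is installed
    ]


if __name__ == "__main__":
    for profile in select_profiles(min_ram_gb=100, software=["R"]):
        print(profile.name)
```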

wu.cloud User Interface

Amazon EC2 API
- allows for using tools like ec2/euca2ools, hybridfox, etc., which were primarily designed for EC2
- transparent use of wu.cloud and EC2/S3 side by side

A remote connection to cloud instances can be established via
- Secure shell (ssh), PuTTY (http://www.chiark.greenend.org.uk/~sgtatham/putty/)
- VNC (Linux)
- Remote Desktop (Windows)
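As a rough sketch of this workflow, the snippet below only assembles the command lines a user might type; it does not contact any cloud. The image ID, key pair name, key file, and host name are placeholders, and the helper functions are hypothetical. The `-k`, `-t`, and `-n` options of `euca-run-instances` are assumed to mirror the EC2 command-line tools.

```python
import shlex


def run_instance_cmd(image_id, keypair, instance_type="m1.small", count=1):
    """Build a euca2ools command that would start `count` instances of an image."""
    return ["euca-run-instances", image_id,
            "-k", keypair,         # key pair to inject for ssh login
            "-t", instance_type,   # instance size
            "-n", str(count)]      # number of instances


def ssh_cmd(host, key_file, user="root"):
    """Build the ssh command for logging in to a running Linux instance."""
    return ["ssh", "-i", key_file, f"{user}@{host}"]


if __name__ == "__main__":
    # All identifiers below are placeholders, not real wu.cloud resources.
    print(shlex.join(run_instance_cmd("emi-00000000", "mykey", "m1.large")))
    print(shlex.join(ssh_cmd("instance.example", "mykey.pem")))
```

Building the commands as argument lists keeps quoting explicit; the lists could be passed to `subprocess.run` on a machine where euca2ools is installed and configured.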


Insights Gained and Outlook

Operating a private cloud environment is recommended under the following conditions:
- one wants to benefit from the cloud model (elasticity, dynamic provisioning, multi-OS/architecture operation, etc.) while maintaining control of resources,
- appropriate hardware is available/affordable,
- the pay-per-use model of public clouds is not an option.

The wu.cloud idea is easily advertised to researchers using the three dimensions of scalability: computing resources, RAM, and software packages employed.

Some users prefer to have full control over a given system (i.e., being root) rather than just outsourcing computations to a homogeneous system.

Nevertheless, it is very important to guide users into the cloud (manuals, lectures, etc.), and considerable resources have to be invested in providing several base images (Linux/Windows, R, Matlab, etc.).

Contact

Stefan Theußl
Institute for Statistics and Mathematics
email: cloud@wu.ac.at or Stefan.Theussl@wu.ac.at
URL: http://statmath.wu.ac.at/~theussl
WU Vienna, Augasse 2-6, A-1090 Wien