Open Cirrus: Towards an Open Source Cloud Stack



Similar documents
Elastic Cloud Computing in the Open Cirrus Testbed implemented via Eucalyptus

S06: Open-Source Stack for Cloud Computing

The KOALA Cloud Management Service

Private Clouds with Open Source

Performance measurement of a private Cloud in the OpenCirrus Testbed

An Introduction to Private Cloud

Assembling Cloud Infrastructures with Eucalyptus

Cloud Computing. Adam Barker

Introduction to Cloud Computing

Modernizing Your Data Warehouse for Hadoop

Open Cloud System. (Integration of Eucalyptus, Hadoop and AppScale into deployment of University Private Cloud)

Cloud Computing: Computing as a Service. Prof. Daivashala Deshmukh Maharashtra Institute of Technology, Aurangabad

Can High-Performance Interconnects Benefit Memcached and Hadoop?

Putchong Uthayopas, Kasetsart University

I/O Virtualization Using Mellanox InfiniBand And Channel I/O Virtualization (CIOV) Technology

CLEVER: a CLoud-Enabled Virtual EnviRonment

Department of Computer Science University of Cyprus EPL646 Advanced Topics in Databases. Lecture 14

BIG DATA TRENDS AND TECHNOLOGIES

How To Understand Cloud Computing

SMB Direct for SQL Server and Private Cloud

Cloud Computing Now and the Future Development of the IaaS

Cloud Computing Training

OpenNebula Leading Innovation in Cloud Computing Management

Big Data, Cloud Computing, Spatial Databases Steven Hagan Vice President Server Technologies

SR-IOV: Performance Benefits for Virtualized Interconnects!

Cloud Computing Submitted By : Fahim Ilyas ( ) Submitted To : Martin Johnson Submitted On: 31 st May, 2009

Fundamentals of Cloud Computing

Neptune. A Domain Specific Language for Deploying HPC Software on Cloud Platforms. Chris Bunch Navraj Chohan Chandra Krintz Khawaja Shams

High Performance OpenStack Cloud. Eli Karpilovski Cloud Advisory Council Chairman

Building a Private Cloud with Eucalyptus

Cloud computing - Architecting in the cloud

SURFsara HPC Cloud Workshop

Enabling High performance Big Data platform with RDMA

A.Prof. Dr. Markus Hagenbuchner CSCI319 A Brief Introduction to Cloud Computing. CSCI319 Page: 1

Sistemi Operativi e Reti. Cloud Computing

Hadoop. Sunday, November 25, 12

The Private Cloud Your Controlled Access Infrastructure

LS-DYNA Best-Practices: Networking, MPI and Parallel File System Effect on LS-DYNA Performance

An Introduction to Cloud Computing Concepts

Installing Hadoop over Ceph, Using High Performance Networking

FREE AND OPEN SOURCE SOFTWARE FOR CLOUD COMPUTING SERENA SPINOSO FULVIO VALENZA

Hadoop & Spark Using Amazon EMR

Mellanox Cloud and Database Acceleration Solution over Windows Server 2012 SMB Direct

IaaS Cloud Architectures: Virtualized Data Centers to Federated Cloud Infrastructures

What Is It? Business Architecture Research Challenges Bibliography. Cloud Computing. Research Challenges Overview. Carlos Eduardo Moreira dos Santos

Efficient Cloud Management for Parallel Data Processing In Private Cloud

Hadoop implementation of MapReduce computational model. Ján Vaňo

Automating Big Data Benchmarking for Different Architectures with ALOJA

Hadoop & its Usage at Facebook

FLOW-3D Performance Benchmark and Profiling. September 2012

Introduction to Cloud Computing

Cloud Computing: a Perspective Study

LS DYNA Performance Benchmarks and Profiling. January 2009

Cloud Design and Implementation. Cheng Li MPI-SWS Nov 9 th, 2010

Hadoop & its Usage at Facebook

Introduction to Cloud Design Four Design Principals For IaaS

Large scale processing using Hadoop. Ján Vaňo

Data Centers and Cloud Computing

Cloudera Enterprise Reference Architecture for Google Cloud Platform Deployments

Cloud Computing Architecture: A Survey

Cloud Infrastructure Pattern

Role of Cloud Computing in Big Data Analytics Using MapReduce Component of Hadoop

A Comparison of Clouds: Amazon Web Services, Windows Azure, Google Cloud Platform, VMWare and Others (Fall 2012)

CLOUD COMPUTING. When It's smarter to rent than to buy

Hadoop on the Gordon Data Intensive Cluster

Networking Issues For Big Data

CSE 590: Special Topics Course ( Supercomputing ) Lecture 10 ( MapReduce& Hadoop)

Introduction to Cloud Computing

Emerging Technology for the Next Decade

CHAPTER 8 CLOUD COMPUTING

Building a BI Solution in the Cloud

ECLIPSE Performance Benchmarks and Profiling. January 2009

How To Understand Cloud Computing

The Greenplum Analytics Workbench

Datacenters and Cloud Computing. Jia Rao Assistant Professor in CS

Architectural Implications of Cloud Computing

VMware on VMware: Private Cloud Case Study Customer Presentation

How To Understand Cloud Computing

Cloud Computing. Chapter 1 Introducing Cloud Computing

Oracle Applications and Cloud Computing - Future Direction

IPOP-TinCan: User-defined IP-over-P2P Virtual Private Networks

Agenda. HPC Software Stack. HPC Post-Processing Visualization. Case Study National Scientific Center. European HPC Benchmark Center Montpellier PSSC

STeP-IN SUMMIT June 18 21, 2013 at Bangalore, INDIA. Performance Testing of an IAAS Cloud Software (A CloudStack Use Case)

Comparing SMB Direct 3.0 performance over RoCE, InfiniBand and Ethernet. September 2014

Manjrasoft Market Oriented Cloud Computing Platform

Pilot-Streaming: Design Considerations for a Stream Processing Framework for High- Performance Computing

Transcription:

Open Cirrus: Towards an Open Source Cloud Stack Karlsruhe Institute of Technology (KIT) HPC2010, Cetraro, June 2010 Marcel Kunze KIT University of the State of Baden-Württemberg and National Laboratory of the Helmholtz Association www.kit.edu

Open Cirrus Cloud Computing Research Testbed http://opencirrus.org An open, internet-scale global testbed for cloud computing research A tool for collaborative research Data center management & cloud services Resources Multi-continent, multi-datacenter, cloud computing system Federated Centers of Excellence around the globe each with 100 400+ nodes and up to ~PB storage and running a suite of cloud services Structure Sponsors: HP Labs, Intel Research, Yahoo! Partners: CMU, ETRI, IDA Singapore, KIT, MIMOS, RAS, UIUC Discussion with: CESGA, UNIDO, China Mobile Cloud system and application development Cover story in IEEE Computer, vol 43, no 4, April 2010 2

Cloud Systems Research Open Cirrus is seeking research in the following areas: Cloud services Datacenter federation Datacenter management and automation Data-intensive applications and systems Research requirements Perform experiments also on a low system level Flexible cloud computing framework Compare different methodologies and implementations Simple, transparent, controllable cloud computing infrastructure 3

Proprietary Cloud Computing Stacks 4

Open SourceCloud Stack OpenCirrus researchers have complete access to the hardware and software platform 5

Open Cirrus Blueprint Cloud Application Services Virtual Resource Sets Cloud Infrastructure Services Eucalyptus OpenNebula IT-Infrastructure Layer (Physical Resource Sets) 6

HPC vs. HTC vs. MTC (Many Task Computing) MTC Traditional Cloud HPC HTC Source: I.Foster 7

Extension of the Cloud Space BDaaS HPCaaS Cloud development towards HPC and Big Data as a Service 8

Big Data as a Service: Tashi http://wiki.apache.org/incubator/tashiproposal Scheduler Virtualization Service Storage Service Cluster Manager Node Node Node Node Node Node Virtualization of CPU and storage Co-Scheduling of CPU and data 9

Big Data as a Service: Hadoop Reproduce the proprietary software infrastructure developed by Google as Open Source Hadoop implements Parallel programming model (MapReduce) Hadoop Distributed File System (HDFS) Parallel database (HBase) Programming environment (Pig) Data warehouse infrastructure (HIVE) Data collection and analysis (Chukwa) Machine based learning (Mahout) Largest Cluster at Yahoo!: 32.000 cores and 16 PetaByte storage (1PB/16h) 10

Hadoop Cluster at KIT Being commissioned in June 2010 448 Nehalem cores HP DL1000, 112 TeraByte online storage 11

Large Scale Data Facility: Digital Microscopy SAMBA Combination of GPFS file system and Hadoop cluster Real-time data taking and analysis Implement data service for desktop users LSDF C LSDF A Hadoop + OpenNebula 12

HPC Cluster at KIT (HC3) Infniband QDR Interconnect MSA2000fc 2656 Nehalem Cores / 332 Servers / 10 TB Memory Infiniband QDR Interconnect 192 TB Lustre Filesystem 13

HPC as a Service Traditional HPC Architecture is characterized by very specific computer clusters designed for special applications offers pre defined operation systems and user environments only serves one single application at a given time provides restricted user access provides management privileges exclusively to administrators Concept of HPCaaS Capability of using clustered servers and storage as resource pools, fully automated management Individual cluster configuration on demand Flexibility to serve multiple user groups and applications with varying requirements Customers gain resource management privileges 14

HPC as a Service: Virtualization of CPU and I/O Infiniband Single Root I/O Virtualization (SR-IOV) Virtual Machine 3 I/O Software Driver PCIe Card (SR IOV capable) Virtual Machine 3 I/O Software Driver Virtual Interface Virtual Machine 3 I/O Software Driver Physical Interface Virtual Machine Manager 15

InfiniBand Virtualization: SR-IOV Limits of traditional I/O virtualization Increased I/O latency: VMM must process and route every data packet and interrupt, leads to higher application response time Scalability limitations: software-based I/O processing consumes CPU cycles, reduces the processing capacity Solution: SR-IOV provides near-native performance Extension to the PCI Express specification suite Physical I/O resources are virtualized within the PCIe card, each card presents multiple virtual I/O interfaces Virtual Functions (VF) Provide all the functionality which is necessary for communication VMs interface directly with a VF without VMM intervention Physical Functions (PF) VMM interfaces with PF to configure and manage I/O resource sharing among the multiple VMs 16

Current InfiniBand Development First SR IOV supported IB Host Channel Adapters (HCAs) are already available by Mellanox Technologies: Model Type: ConnectX 2 SR IOV supported Drivers for the OFED Software Stack and Firmware are currently in development and will be available in the second half of 2010 Outlook The SR IOV standard will allow to use InfiniBand in virtual machines at near native performance Cloud frameworks like Eucalyptus or OpenNebula have to be extended to work with virtual resource bundles Customers will be able to configure and deploy complete HPC systems on demand according to their specific requirements! 17

Cloud Management Tools Three categories of management tools exist Browser based (e.g.: Elasticfox, Hybridfox) Proprietary SaaS by provider (Amazon AWS console) Proprietary command line based The optimum tool would Run out of the cloud as a service (No local installation) Run with any web browser (Pure HTML) Support standard protocols like EC2/S3/EBS and OCCI Be Open Source to cover a broad range of products KOALA: Karlsruhe Open Application (for) cloudadministration 18

Cloud Management with KOALA Management of public and private cloud services in a hybrid cloud: AWS, Eucalyptus, Nimbus, Tashi, ONE, Runs as a software service in Google App Engine Multi tenant management of CPU, storage and network Future perspective: Dynamic management of resource bundles MPI and Hadoop clusters in the cloud 19

KOALA running inside a Private Cloud PaaShttp://koalacloud.appspot.com http://code.google.com/p/koalacloud KOALA also runs in a Private Cloud PaaS based on AppScale AppScale is an Open Source re implementation of the Google App Engine IaaS PaaS KOALA Allows users to do cloud management inside their own Private Cloud in case ofconcerns regarding privacy and availability 20

Hybrid Clouds: Cloud Bursting Private Cloud Open Source EC2, S3, OCCI Transfer workloads and data transparently between clouds Example: Amazon Virtual Private Cloud 21

Summary Open Cirrus project offers interesting R&D opportunities Cloud systems development Cloud application development Accepting research proposals Open Source cloud stack Standards are important to foster a cloud computing market De-facto standards are around (e.g. Amazon EC2, S3) Standards are evolving (OCCI) Federation of private and public clouds seems important (Cloud bursting) 22

23