Next Generation Operating Systems

Similar documents
Datacenter Operating Systems

Data Center and Cloud Computing Market Landscape and Challenges

Computing at the HL-LHC

Infrastructure Matters: POWER8 vs. Xeon x86

Data Centric Systems (DCS)

Rackspace Cloud Databases and Container-based Virtualization

How Solace Message Routers Reduce the Cost of IT Infrastructure

Virtual Computing and VMWare. Module 4

COS 318: Operating Systems. Virtual Machine Monitors

ICRI-CI Retreat Architecture track

Windows Server 2008 R2 Hyper V. Public FAQ

Intel DPDK Boosts Server Appliance Performance White Paper

Seeking Opportunities for Hardware Acceleration in Big Data Analytics

Virtualization. Types of Interfaces

IT of SPIM Data Storage and Compression. EMBO Course - August 27th! Jeff Oegema, Peter Steinbach, Oscar Gonzalez

Eloquence Training What s new in Eloquence B.08.00

Petascale Software Challenges. Piyush Chaudhary High Performance Computing

Performance monitoring at CERN openlab. July 20 th 2012 Andrzej Nowak, CERN openlab

Cloud Computing through Virtualization and HPC technologies

Enhancing Hypervisor and Cloud Solutions Using Embedded Linux Iisko Lappalainen MontaVista

Amazon EC2 Product Details Page 1 of 5

News and trends in Data Warehouse Automation, Big Data and BI. Johan Hendrickx & Dirk Vermeiren

Parallels VDI Solution

The new frontier of the DATA acquisition using 1 and 10 Gb/s Ethernet links. Filippo Costa on behalf of the ALICE DAQ group

Delivering Quality in Software Performance and Scalability Testing

Windows Server Virtualization An Overview

Comparing SMB Direct 3.0 performance over RoCE, InfiniBand and Ethernet. September 2014

Enabling High performance Big Data platform with RDMA

COM 444 Cloud Computing

Virtualization for Cloud Computing

Scaling in a Hypervisor Environment

Networking Virtualization Using FPGAs

A Deduplication File System & Course Review

DIABLO TECHNOLOGIES MEMORY CHANNEL STORAGE AND VMWARE VIRTUAL SAN : VDI ACCELERATION

The QEMU/KVM Hypervisor

IO Visor: Programmable and Flexible Data Plane for Datacenter s I/O

Multi-core Programming System Overview

InfiniBand Software and Protocols Enable Seamless Off-the-shelf Applications Deployment

SMB Direct for SQL Server and Private Cloud

Enabling the Use of Data

Virtualization Technology. Zhiming Shen

RED HAT ENTERPRISE VIRTUALIZATION FOR SERVERS: COMPETITIVE FEATURES

Overview: X5 Generation Database Machines

Data Centers and Cloud Computing

High-Density Network Flow Monitoring

How To Build An Ark Processor With An Nvidia Gpu And An African Processor

Linux Performance Optimizations for Big Data Environments

OpenPOWER Outlook AXEL KOEHLER SR. SOLUTION ARCHITECT HPC

A general-purpose virtualization service for HPC on cloud computing: an application to GPUs

Cloud Sure - Virtual Machines

Full and Para Virtualization

Solution Guide Parallels Virtualization for Linux

Microkernels, virtualization, exokernels. Tutorial 1 CSC469

Evaluation Methodology of Converged Cloud Environments

State of the Art Cloud Infrastructure

FlashSoft Software from SanDisk : Accelerating Virtual Infrastructures

RAMCloud and the Low- Latency Datacenter. John Ousterhout Stanford University

Resource Efficient Computing for Warehouse-scale Datacenters

New Design and Layout Tips For Processing Multiple Tasks

Making Multicore Work and Measuring its Benefits. Markus Levy, president EEMBC and Multicore Association

Traditional v/s CONVRGD

M710 - Max 960 Drive, 8Gb/16Gb FC, Max 48 ports, Max 192GB Cache Memory

CS550. Distributed Operating Systems (Advanced Operating Systems) Instructor: Xian-He Sun

Cloud Simulator for Scalability Testing

ODP Application proof point: OpenFastPath. ODP mini-summit

12. Introduction to Virtual Machines

NVIDIA CUDA Software and GPU Parallel Computing Architecture. David B. Kirk, Chief Scientist

VMware and CPU Virtualization Technology. Jack Lo Sr. Director, R&D

FPGA Accelerator Virtualization in an OpenPOWER cloud. Fei Chen, Yonghua Lin IBM China Research Lab

Edge 2013 Comes to You. IBM Storwize Family. Maurizio Rizzi Storage Platform Leader. Title of presentation goes here IBM Corporation

A1 and FARM scalable graph database on top of a transactional memory layer

Cloud Data Center Acceleration 2015

Scaling Objectivity Database Performance with Panasas Scale-Out NAS Storage

Experience with the integration of distribution middleware into partitioned systems

Introduction to the NI Real-Time Hypervisor

ZEN LOAD BALANCER EE v3.04 DATASHEET The Load Balancing made easy

Xeon+FPGA Platform for the Data Center

IBM System x SAP HANA

Open Source Cloud Computing Management with OpenNebula

Virtualization with Windows

Proposal for Virtual Private Server Provisioning

GPU System Architecture. Alan Gray EPCC The University of Edinburgh

Flash Memory Arrays Enabling the Virtualized Data Center. July 2010

Cloud Computing. Adam Barker

Oracle Database Scalability in VMware ESX VMware ESX 3.5

Enterprise-Class Virtualization with Open Source Technologies

I/O Virtualization Using Mellanox InfiniBand And Channel I/O Virtualization (CIOV) Technology

Main Memory Data Warehouses

Mark Bennett. Search and the Virtual Machine

Virtualization. Dr. Yingwu Zhu

Chapter 7: Distributed Systems: Warehouse-Scale Computing. Fall 2011 Jussi Kangasharju

MODULE 3 VIRTUALIZED DATA CENTER COMPUTE

Chapter 1: Introduction. What is an Operating System?

Transcription:

Next Generation Operating Systems Zeljko Susnjar, Cisco CTG June 2015

The end of CPU scaling Future computing challenges Power efficiency Performance == parallelism Cisco Confidential 2

Paradox of the computing industry System software has not evolved at the same pace as HW Multiple applications Multiple applications Time-sharing OS Spatial computing platform Single core CPU Massively multicore CPU CPU FPGA GPU CPU Cisco Confidential 3

Server Virtualization is at a generational shift APIs and interfaces for containers management Operating System Operating System Virtual machine Virtual machine Container Container Hypervisor Hardware Hypervisors are still good, but have pitfalls Flexible, multi-os, application isolation / security Optimizations, IO handling, expensive license fees Operating System Hardware lication containers are the future of virtualization No hypervisor overhead, performance, fine grained resource control Linux ABI, security and app isolation Cisco Confidential 4

OS constraints are hard to overcome with current design Containers are built on Operating system L3 cache 30 cycles L2 cache 12 cycles CPU L1 cache 4 cycles Number of cores Packet scalability Kernel stack too complicated User mode networking stack like PF_RING/DPDK write your own driver Number of cores CPU Cores scalability Many task on one core one task on many cores Monolithic architecture based on time sharing DRAM 300 cycles Memory scalability CPU cache too small Cache misses due to scattered data in the memory Cisco Confidential 5

Solving the problem at the lowest level of abstraction Next-Gen Computing: Redesign of Operating System for energy efficient and linear scalable computing platform New architectural concept, greatly enhancing application performance in modern datacenters and fundamentally addressing the following challenges: o lication s parallelization driven by rapidly growing number of CPU cores per socket o Efficient use of heterogeneous resources o Data center wide system consistency Advantages of OS control/data plane separation o Greater scale-up bundled with smarter NIC processing o Reduce CPU kernel overhead o Higher network throughput by using multiple cores Ctrl Plane Data plane Hardware Hardware Hardware Hardware Cisco Confidential 6

Cisco Confidential 7

CERN Challenge: The Alice computing requirements Detector upgrade for Run 3 (2020) 100 increase in event rate 1TB/s raw data rate From Detector Readout to Analysis: What is the optimal computing architecture? Cisco Confidential 8

O2 facility: Highly specialized heterogeneous computing platform + 463 FPGAs Detector readout and fast cluster finder + 100 000 CPU cores To compress 1.1 TB/s data stream by overall factor 14 + 5000 GPUs To speed up the reconstruction + 50 PB of disk -------------------------------------------------------- = Considerable computing capacity that will be used for Online and Offline tasks Cisco Confidential 9

Data Plane Computing System DPCS: Openlab project investigating applicability of modern OS concepts in the ALICE O2 environment Data Plane OS concept I/O Virtualization Multicore scaling Heterogeneous compute Cisco Confidential 10

Thank you.

Data flow in O2 facility Detectors electronics Data of all interaction - continuous mode Computing farm Online data volume reduction by factor 14 (20 for the TPC data) Compressed data Data storage for 1 year of compressed data (60 PB) Compressed data Asynchronous event reconstruction A few hours after data taking 1.1 TB/s 90 GB/s Reconstructed events ~ 8000 optical links Read-out farm: 250 servers with FPGA acceleration Processing farm: 1500 servers with GPU acceleration Storage system 68 storage units with 34 data servers Cisco Confidential 12