Share and aggregate GPUs in your cluster. F. Silla Technical University of Valencia Spain



Similar documents
Panasas High Performance Storage Powers the First Petaflop Supercomputer at Los Alamos National Laboratory

I/O Considerations in Big Data Analytics

David. University Fabrication Lab: High Level Overview. Nanoscale Vacuum Electronics. Electrical Engineering Community. Co-Founder & CTO Libelium

Appro Supercomputer Solutions Best Practices Appro 2012 Deployment Successes. Anthony Kenisky, VP of North America Sales

ECDF Infrastructure Refresh - Requirements Consultation Document

Hortonworks Data Platform Reference Architecture

Healthcare Firm Gains More Efficiency, Cuts Costs with Private Cloud Environment

High Performance Computing in CST STUDIO SUITE

Grid vs. Cloud Computing

White Paper Solarflare High-Performance Computing (HPC) Applications

STORAGE HIGH SPEED INTERCONNECTS HIGH PERFORMANCE COMPUTING VISUALISATION GPU COMPUTING

IBM Deep Computing Visualization Offering

Pedraforca: ARM + GPU prototype

Available online at Available online at

MapR Enterprise Edition & Enterprise Database Edition

Building a Top500-class Supercomputing Cluster at LNS-BUAP

Numerix CrossAsset XL and Windows HPC Server 2008 R2

Lavastorm Resolution Center 2.2 Release Frequently Asked Questions

Het is een kleine stap naar een hybrid cloud

Introduction to Gluster. Versions 3.0.x

New Storage System Solutions

Advanced Server Virtualization: Vmware and Microsoft Platforms in the Virtual Data Center

Juniper Networks Secure Access. Initial Configuration User Records Synchronization

Accelerating High-Speed Networking with Intel I/O Acceleration Technology

LS-DYNA Best-Practices: Networking, MPI and Parallel File System Effect on LS-DYNA Performance

Abstract: Motivation: Description of proposal:

POWER ALL GLOBAL FILE SYSTEM (PGFS)

Shared Parallel File System

RevoScaleR Speed and Scalability

Optimizing GPU-based application performance for the HP for the HP ProLiant SL390s G7 server

Get Success in Passing Your Certification Exam at first attempt!

CHAPTER 2 BACKGROUND AND OBJECTIVE OF PRESENT WORK

LSKA 2010 Survey Report Job Scheduler

MCSE SYLLABUS. Exam : Managing and Maintaining a Microsoft Windows Server 2003:

From Ethernet Ubiquity to Ethernet Convergence: The Emergence of the Converged Network Interface Controller

3G Converged-NICs A Platform for Server I/O to Converged Networks

Agenda. HPC Software Stack. HPC Post-Processing Visualization. Case Study National Scientific Center. European HPC Benchmark Center Montpellier PSSC

Solving I/O Bottlenecks to Enable Superior Cloud Efficiency

Microsoft Technical Computing The Advancement of Parallelism. Tom Quinn, Technical Computing Partner Manager

TEAL: Transparent Archiving Library

- An Essential Building Block for Stable and Reliable Compute Clusters

Ecomm Enterprise High Availability Solution. Ecomm Enterprise High Availability Solution (EEHAS) Page 1 of 7

Simulation Platform Overview

Server Virtualization with Windows Server Hyper-V and System Center

IBM Global Technology Services November Successfully implementing a private storage cloud to help reduce total cost of ownership

Server Virtualization with Windows Server Hyper-V and System Center

Client/Server Computing Distributed Processing, Client/Server, and Clusters

Scaling Objectivity Database Performance with Panasas Scale-Out NAS Storage

OVERLAYING VIRTUALIZED LAYER 2 NETWORKS OVER LAYER 3 NETWORKS

Windows Server 2008 R2 Hyper V. Public FAQ

Simple Introduction to Clusters

The Greenplum Analytics Workbench

Computer Organization & Architecture Lecture #19

Cooling and thermal efficiently in

Trends in Embedded Software Development in Europe. Dr. Dirk Muthig

Collaborative Project in Cloud Computing

HPC Cluster Decisions and ANSYS Configuration Best Practices. Diana Collier Lead Systems Support Specialist Houston UGM May 2014

Interoperability between Sun Grid Engine and the Windows Compute Cluster

Building Multi-Site & Ultra-Large Scale Cloud with Openstack Cascading

ANALYSIS OF GRID COMPUTING AS IT APPLIES TO HIGH VOLUME DOCUMENT PROCESSING AND OCR

I N T E R S Y S T E M S W H I T E P A P E R INTERSYSTEMS CACHÉ AS AN ALTERNATIVE TO IN-MEMORY DATABASES. David Kaaret InterSystems Corporation

Unified Computing Systems

Mellanox Cloud and Database Acceleration Solution over Windows Server 2012 SMB Direct

Analysis and Research of Cloud Computing System to Comparison of Several Cloud Computing Platforms

Server Virtualization with Windows Server Hyper-V and System Center

2015 The MathWorks, Inc. 1

Pentaho High-Performance Big Data Reference Configurations using Cisco Unified Computing System

Cray DVS: Data Virtualization Service

HadoopTM Analytics DDN

big.little Technology Moves Towards Fully Heterogeneous Global Task Scheduling Improving Energy Efficiency and Performance in Mobile Devices

Get More Scalability and Flexibility for Big Data

Computational infrastructure for NGS data analysis. José Carbonell Caballero Pablo Escobar

High Performance Data-Transfers in Grid Environment using GridFTP over InfiniBand

Lavastorm Analytic Library Predictive and Statistical Analytics Node Pack FAQs

Hyper-V over SMB Remote File Storage support in Windows Server 8 Hyper-V. Jose Barreto Principal Program Manager Microsoft Corporation

Manjrasoft Market Oriented Cloud Computing Platform

CUTTING-EDGE SOLUTIONS FOR TODAY AND TOMORROW. Dell PowerEdge M-Series Blade Servers

On Cloud Computing Technology in the Construction of Digital Campus

HPC Software Requirements to Support an HPC Cluster Supercomputer

GHG Protocol Product Life Cycle Accounting and Reporting Standard ICT Sector Guidance. Chapter 7:

LLNL Redefines High Performance Computing with Fusion Powered I/O

Emerging Technology for the Next Decade

LS DYNA Performance Benchmarks and Profiling. January 2009

Windows Server 2012 R2 VDI - Virtual Desktop Infrastructure. Ori Husyt Agile IT Consulting Team Manager orih@agileit.co.il

MCSE Core exams (Networking) One Client OS Exam. Core Exams (6 Exams Required)

IBM DB2 Near-Line Storage Solution for SAP NetWeaver BW

Informix Dynamic Server May Availability Solutions with Informix Dynamic Server 11

A Cloud Test Bed for China Railway Enterprise Data Center

Next-Generation Federal Data Center Architecture

Avaya Call Recording Solution Configuration

Which ARM Cortex Core Is Right for Your Application: A, R or M?

"Charting the Course... MOC B Server Virtualization with Windows Hyper-V and System Center. Course Summary

MICROSOFT CERTIFIED SYSTEMS ENGINEER Windows 2003 Track

HGST Virident Solutions 2.0

Enterprise Desktop Virtualization

Purchase of High Performance Computing (HPC) Central Compute Resources by Northwestern Researchers

SAP HANA SAP s In-Memory Database. Dr. Martin Kittel, SAP HANA Development January 16, 2013

Transcription:

Share and aggregate s in your cluster F. Silla Technical University of Valencia Spain

... more technically... Remote virtualization F. Silla Technical University of Valencia Spain

Accelerating applications Many applications require a lot of computing resources Execution time is usually increased Applications are accelerated to get their execution time reduced

s as accelerators The basic building block is a node with one or more s

-accelerated clusters From the programming point of view, just a set of nodes: one or more s per node (probably with several cores per ) one or more s per node (typically between 1 and 4) an interconnection network

Getting benefit from accelerated clusters s only bring benefits for the right kind of code: There must be data parallelism in the code: this is the only way of taking benefit from the hundreds of processors in a Code with low amount of data parallelism Code with high amount of data parallelism Code with moderate amount of data parallelism

Getting benefit from accelerated clusters s only bring benefits for the right kind of code: There must be data parallelism in the code: this is the only way of taking benefit from the hundreds of processors in a Code with low amount of data parallelism Code with high amount of data parallelism Code with moderate amount of data parallelism Do you really need a -accelerated cluster?

Getting benefit from accelerated clusters s only bring benefits for the right kind of code: There must be data parallelism in the code: this is the only way of taking benefit from the hundreds of processors in a Code with low amount of data parallelism Code with high amount of data parallelism Code with moderate amount of data parallelism You can take a lot of benefit from your -accelerated cluster

Getting benefit from accelerated clusters s only bring benefits for the right kind of code: There must be data parallelism in the code: this is the only way of taking benefit from the hundreds of processors in a Code with low amount of data parallelism Code with high amount of data parallelism Code with moderate amount of data parallelism Applications will make a moderate use of the s in your cluster

Money leakage in current clusters? For applications with moderate levels of data parallelism, s in a cluster may be idle for long periods of time: Initial acquisition costs not amortized Space: s reduce density

Further concerns in accelerated clusters FACTS: Applications can only use the s within their node CONCERNS: For applications with moderate levels of data parallelism, many s in the cluster may be idle for long periods of time Multi- applications running on a subset of nodes cannot make use of the tremendous resources available at other cluster nodes (even if they are idle)

What is missing is...... some flexibility for using the s in the cluster

Addressing current concerns A way of addressing the first concern is by removing the s that are not really needed and sharing the remaining ones among all the nodes. For the second concern, all the s in the cluster could be granted to a single application

Remote virtualization This new configuration requires: A way of seamlessly sharing s across nodes in the cluster (remote virtualization) Enhanced job schedulers that take into account the new configuration

How virtualization adds value virtualization allows a new vision of a deployment, moving from the traditional cluster configuration: Interconnection to the following one.

How virtualization adds value Physical configuration Interconnection Logical configuration Logical connections Interconnection

Benefits from remote virtualization Remote virtualization increases utilization, also lowering power consumption, at the same time that initial acquisition costs are reduced Logical connections Interconnection

virtualization and power Remote virtualization can further reduce power consumption: enhancing the scheduling process so that servers are put into low-power sleeping modes when not needed + + + + +

The rcuda virtualization framework A framework enabling a CUDA-based application running in one (or some) node(s) to access s in other nodes It is useful for: Applications with moderate level of data parallelism Applications for multi- computing

About rcuda Available at developers website under request: High-performance InfiniBand communications library TCP/IP compatible for other networks Binary compatible with support for CUDA 5 ARM support SLURM support http://www.rcuda.net