High Performance Computing OpenStack Options. September 22, 2015



Similar documents
Block Storage in the Open Source Cloud called OpenStack

Cloud File Services: October 1, 2014

Cloud and Big Data initiatives. Mark O Connell, EMC

Visions for Ethernet Connected Drives. Vice President, Dell Oro Group March 25, 2015

Big Data Storage Options for Hadoop Sam Fineberg, HP Storage

Understanding Enterprise NAS

Building Storage as a Service with OpenStack. Greg Elkinbard Senior Technical Director

Trends in Application Recovery. Andreas Schwegmann, HP

Storage Cloud Environments. Alex McDonald NetApp

New Storage System Solutions

UNDERSTANDING DATA DEDUPLICATION. Thomas Rivera SEPATON

Accelerating Applications and File Systems with Solid State Storage. Jacob Farmer, Cambridge Computer

Clodoaldo Barrera Chief Technical Strategist IBM System Storage. Making a successful transition to Software Defined Storage

Scale and Availability Considerations for Cluster File Systems. David Noy, Symantec Corporation

How it can benefit your enterprise. Dejan Kocic Netapp

SMB 3.0 New Opportunities for Windows Environments

UNDERSTANDING DATA DEDUPLICATION. Tom Sas Hewlett-Packard

Storage Architectures for Big Data in the Cloud

ADVANCED DEDUPLICATION CONCEPTS. Larry Freeman, NetApp Inc Tom Pearce, Four-Colour IT Solutions

VDI Optimization Real World Learnings. Russ Fellows, Evaluator Group

How it can benefit your enterprise. Dejan Kocic Hitachi Data Systems (HDS)

LEVERAGING FLASH MEMORY in ENTERPRISE STORAGE. Matt Kixmoeller, Pure Storage

UNDERSTANDING DATA DEDUPLICATION. Jiří Král, ředitel pro technický rozvoj STORYFLEX a.s.

Sujee Maniyam, ElephantScale

Cloud Storage Clients. Rich Ramos, Individual

Deploying Public, Private, and Hybrid Storage Clouds. Marty Stogsdill, Oracle

Storage Clouds. Karthik Ramarao. Director of Strategy and Technology and CTO Asia Pacific, NetApp Board Director SNIA South Asia

Enabling High performance Big Data platform with RDMA

Cluster Implementation and Management; Scheduling

Enterprise Architecture and the Cloud. Marty Stogsdill, Oracle

Restoration Technologies. Mike Fishman / EMC Corp.

Cloud Data Management Interface (CDMI) The Cloud Storage Standard. Mark Carlson, SNIA TC and Oracle Chair, SNIA Cloud Storage TWG

Lustre * Filesystem for Cloud and Hadoop *

IBM Spectrum Protect in the Cloud

IOmark- VDI. Nimbus Data Gemini Test Report: VDI a Test Report Date: 6, September

WHITE PAPER. Software Defined Storage Hydrates the Cloud

How To Write On A Flash Memory Flash Memory (Mlc) On A Solid State Drive (Samsung)

Current Status of FEFS for the K computer

NetApp High-Performance Computing Solution for Lustre: Solution Guide

Building Storage Service in a Private Cloud

Object Storage: Out of the Shadows and into the Spotlight

IBM ELASTIC STORAGE SEAN LEE

Lustre failover experience

HP reference configuration for entry-level SAS Grid Manager solutions

WAN Optimization and Cloud Computing. Josh Tseng, Riverbed

SQL Server 2012 Performance White Paper

Best Practice and Deployment of the Network for iscsi, NAS and DAS in the Data Center

BlueArc unified network storage systems 7th TF-Storage Meeting. Scale Bigger, Store Smarter, Accelerate Everything

February, 2015 Bill Loewe

Cloud Optimize Your IT

WAN Optimization and Thin Client: Complementary or Competitive Application Delivery Methods? Josh Tseng, Riverbed

CERN Cloud Storage Evaluation Geoffray Adde, Dirk Duellmann, Maitane Zotes CERN IT

High Performance Computing Specialists. ZFS Storage as a Solution for Big Data and Flexibility

How To Understand And Understand The Risks Of Configuration Drift

Fundamental Approaches to WAN Optimization. Josh Tseng, Riverbed

10th TF-Storage Meeting

XtreemStore A SCALABLE STORAGE MANAGEMENT SOFTWARE WITHOUT LIMITS YOUR DATA. YOUR CONTROL

Introduction to NetApp Infinite Volume

Application Performance for High Performance Computing Environments

How To Migrate To A Network (Wan) From A Server To A Server (Wlan)

INCREASING EFFICIENCY WITH EASY AND COMPREHENSIVE STORAGE MANAGEMENT

Red Hat Storage Server

HADOOP ON ORACLE ZFS STORAGE A TECHNICAL OVERVIEW

Storage Clouds. Enterprise Architecture and the Cloud. Author and Presenter: Marty Stogsdill, Oracle

Enterprise Architecture and the Cloud. Marty Stogsdill, Oracle

Zadara Storage Cloud A

PCI Express IO Virtualization Overview

Introduction to Data Protection: Backup to Tape, Disk and Beyond. Michael Fishman, EMC Corporation

Technology Insight Series

Evaluation of Enterprise Data Protection using SEP Software

Moving Virtual Storage to the Cloud

Object storage in Cloud Computing and Embedded Processing

Hadoop & its Usage at Facebook

CONFIGURATION GUIDELINES: EMC STORAGE FOR PHYSICAL SECURITY

The Use of Flash in Large-Scale Storage Systems.

An Alternative Storage Solution for MapReduce. Eric Lomascolo Director, Solutions Marketing

Network Attached Storage. Jinfeng Yang Oct/19/2015

Hadoop & its Usage at Facebook

Developing High-Performance, Scalable, cost effective storage solutions with Intel Cloud Edition Lustre* and Amazon Web Services

Performance, Reliability, and Operational Issues for High Performance NAS Storage on Cray Platforms. Cray User Group Meeting June 2007

Transcription:

High Performance Computing OpenStack PRESENTATION TITLE GOES HERE Options September 22, 2015

Today s Presenters Glyn Bowden, SNIA Cloud Storage Initiative Board HP Helion Professional Services Alex McDonald, SNIA Cloud Storage Initiative Chair NetApp 2

SNIA Legal Notice! The material contained in this tutorial is copyrighted by the SNIA unless otherwise noted.! Member companies and individual members may use this material in presentations and literature under the following conditions: Any slide or slides used must be reproduced in their entirety without modification The SNIA must be acknowledged as the source of any material used in the body of any document containing material from these presentations.! This presentation is a project of the SNIA Education Committee.! Neither the author nor the presenter is an attorney and nothing in this presentation is intended to be, or should be construed as legal advice or an opinion of counsel. If you need legal advice or a legal opinion please contact your attorney.! The information presented herein represents the author's personal opinion and current understanding of the relevant issues involved. The author, the presenter, and the SNIA do not assume any responsibility or liability for damages arising out of any reliance on or use of this information. NO WARRANTIES, EXPRESS OR IMPLIED. USE AT YOUR OWN RISK. 3

Abstract Organisations are beginning to look to OpenStack to provide framework and tenancy controls around HPC workloads. The greatest gain in the multi-tenancy model is also the greatest challenge for storage; how to provide, reliable, high performance storage that is adequately segregated for the workloads in a cloud environment. This presentation looks at the options available within OpenStack and the Cloud Storage community.

Agenda!!!! HPC vs OpenStack What is High Performance Compute Specific Challenges Storage Options Block Object File! Example Scenario! Summary

HPC VS OPENSTACK

The logical architecture

..of OpenStack

..and MOAB HPC Suite

HPC and OpenStack Opposing Forces! Cloud Share Everything Generic Workloads Loosely Coupled Many small workloads

HPC and OpenStack Opposing Forces! Cloud Share Everything Generic Workloads Loosely Coupled Many small workloads! HPC Share Nothing Specific, Niche Workloads Tightly Coupled (RDMA) Few Large Distributed Workload

But the same! Cloud Highly Distributed Large Storage Pools Resource Management Key Performance Management! HPC Highly Distributed Large Storage Pools Resource Management Key Performance Management

WHAT IS HIGH PERFORMANCE COMPUTING?

Background on HPC! Two Major Types of HPC Analytics Big Data Sets Simple Operations repeated many times Aggregation of results Computationally Intensive Smaller Data Sets Very complex algorithms that need to be broken down Sequential processing and summary Often Latency Sensitive (RDMA, Lustre) or Bandwidth Sensitive (High Volume Filesystems, Large Files)

Background on HPC! Two types of Computational HPC Batch Processing Loosely coupled Embarrassingly Parallel Limited / No shared resources during jobs Realtime / Grid Computing Tightly Coupled Requires High Performance Networking for Remote Direct Memory Access (RDMA) Usually a high performance shared file system is required CPU and Memory Architecture more critical than batch

Why HPC and OpenStack THE CHALLENGES

The Challenges! Resource Management HPC clusters have always been very good at managing their own resources Challenge comes when security and multi-tenancy is required! Multi-Tenancy Drivers Genome Research driving separation Human data cannot be shared beyond proposed use Projects use flexible resources but on a fixed hardware platform

Storage Options EPHEMERAL STORAGE

Ephemeral Storage! What is it? Persists only as long as the VM exists Usually located locally on the compute server! HPC Use Cases? User scratch space Work scratch space Operating Environment

Storage Options BLOCK STORAGE

Block Storage! What is it? Persistent, non-shared block storage Can be provided by many sources, SAN Arrays, Local Disk etc.! HPC Use Cases? Supporting Databases High performance scratch space! OpenStack Project is CINDER A Large Disk Array attached by Fibre Channel SAN to all of the compute nodes OpenStack uses Cinder drivers to create, mount and protect LUNs for the guests. Guest is responsible for creating a file system on those LUNs Not shared with other guests

Storage Options OBJECT STORAGE

Object Storage! What is it? Persistent, scalable storage pools Access using a REST based API Not bound to an individual Guest! HPC Use Cases? Centralised Data Lakes Archives / Backups of source data! OpenStack Project is Swift Usually uses large pools of local disk attached directly to the object servers Uses metadata to index the data and locate object blocks from unique identifiers Lots of plugins for the various analytics engines that are expanding the adoption of object as a centralised data lake

Storage Options FILE STORAGE

File Storage! What is it? Shared, persistent storage Uses standard POSIX file system methods to access data! HPC Use Cases? User Home Directories Shared Project Data Scale out file systems!! OpenStack Project is Manila Manage the creation of storage pools on the provider service Create the shares and apply the correct permissions Mount those shares within the guests that need them Can be NFS or CIFS based today Plugin driven

AN EXAMPLE SCENARIO

Storage Use Case

OpenStack and Lustre Mapping

Lustre as a Service Stack OSS OSS OSS OSS OSS OSS CRAM CRAM CRAM CRAM CRAM CRAM ZFS ZFS ZFS ZFS ZFS ZFS Cinder Cinder Cinder Cinder Cinder Cinder OST OST OST

Lustre Components! Massively Parallel Filesystem made up of key components MGS Management Server MGT Management Target MDS Meta Data Server MDT Meta Data Target OSS Object Storage Server OST Object Storage Target! 1 File can be spread over up to 2000 objects! With ldiskfs, each of those each object can be up to 16 TB! That s 31.25 PB (Yes PETA bytes) for a single file using ldiskfs! Up to 4 Billion files per MDT! Up to 4096 MDTs!

What about ZFS? Why?! Lustre has limited data protection. RAID 0 Protection from Physical Infrastructure! Scale Out Easy, Scale UP Hard! ZFS has healing, snapshots (not necessarily a good idea here) and scale up!! ZFS Cache Pools for Meta-Data or even data sets, huge acceleration potential! Scale

Lustre + ZFS File Limits LDISKFS ZFS Object Size 16 TB 256 PB Maximum File Size 21.25 PB 8 EB (2^63) Max Files per MDT 4 Billion 4 Billion Max MDTs 4096 4096

Work in Progress!!!!!!!! Lustre can be for high bandwidth and low latency Low latency challenging in virtual environment High Bandwidth, easier (not simple though) Use OpenStack tools to provision Lustre Components Build small scale, segregated clusters for multi-tenancy Export via NFS with Manila on private networks Include ZFS and Compression Use Cinder Block for the shared storage element

SUMMARY

Summary! Initial interest of HPC on OpenStack is being driven by tenancy requirements! Managing flexible HPC resources has been tricky, OpenStack makes that easier for HPCaaS! Many areas are needed to work well together for success, OpenStack Community beginning to address that as we have seen.! HPC on OS is a reality and many are pushing the boundaries and committing back to the community.

After This Webcast! This webcast and a copy of the slides will be posted to the SNIA-CSI website and available on-demand! http://www.snia.org/forum/csi/knowledge/webcasts! A full Q&A from this webcast, including answers to questions we couldn't get to today, will be posted to the SNIA Cloud blog! http://www.sniacloud.com/! F ollow us on Twitter @SNIACloud! Upcoming SNIA Webcast: OpenStack Manila Oct. 7th! https://www.brighttalk.com/webcast/663/173013! Google Groups:! http://groups.google.com/group/snia-cloud 36

Conclusion Questions 37

Conclusion Thank You 38