HPC on AWS. Hiroshi Kobayashi, Dev./Lab. IT System HGST Japan, Ltd. Jun 3, 2015

HPC on AWS
- HPC = High Performance Computing
- AWS = Amazon Web Services

Agenda
- HGST
- Why choose Cloud?
- Performance
- Flexibility
- What's Next
- Summary

HGST Company Profile
- Founded in 2003 through the combination of the hard drive businesses of IBM, the inventor of the hard drive, and Hitachi, Ltd. ("Hitachi")
- Acquired by Western Digital in 2012
- Headquartered in San Jose, California
- Approximately 38,000 employees worldwide
- More than 4,700 active worldwide patents
- Develops innovative, advanced hard disk drives, enterprise-class solid state drives, external storage solutions and services
- Delivers intelligent storage devices that tightly integrate hardware and software to maximize solution performance

Broadening Lineup of Storage Solutions: Recent Innovations
- HDD storage solutions with HelioSeal technology: HGST 10TB SMR HDD, HGST Ultrastar He8
- Petabyte-scale data center storage solutions: Active Archive Platform
- Solid state storage solutions: FlashMAX III PCIe; Ultrastar SSD800MH.B, SSD1600MM & SSD1600MR SAS SSD; Ultrastar SN100 Series NVMe PCIe
- HGST storage software: HGST Virident Solutions, HGST Virident Space

HGST Active Archive System
Our first fully integrated system with 4.7PB raw capacity per rack!
- Complete scale-out object storage system for cloud data centers
- Optimized for active archive workloads
- Breakthrough TCO
- Highest density improves data center efficiency
- Lowest power per TB with fast data access
- Beats white box economics
- Scales to exabytes of capacity

Market Leadership

Why choose Cloud? Background
- A few years ago, HGST started an HPC implementation project.
- The project team investigated several cloud HPC services other than AWS, but none of them satisfied HGST's requirements.
- CIO Steve Phillpott recommended AWS for HPC. He had extensive experience with HPC on AWS in the life-science industry.
- Through several proof-of-concept projects, the team began to understand the pros and cons of on-premise and cloud HPC.
- Key factors: scalability, data transfer, remote visualization, commercial applications, cost

Scalability
- CD-adapco provided the benchmark data from their cluster.
- C3 instances significantly improve scalability: C3 is 1.81x faster than CR1.
- Still behind the physical cluster with InfiniBand: 1.70x slower.
Chart notes: (1) EN = Enhanced Networking; (2) placement group enabled; (3) evaluated by elapsed time; (4) only 200 steps
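The two ratios on this slide can be chained to see how far CR1 sits from the physical cluster. The sketch below is only an illustration: it assumes the "1.70x slower" label compares C3 with the InfiniBand cluster, and the CR1 elapsed time is a made-up value, since the slide gives ratios but no raw timings.

```python
def speedup(baseline_seconds: float, candidate_seconds: float) -> float:
    """Return how many times faster the candidate ran than the baseline."""
    if baseline_seconds <= 0 or candidate_seconds <= 0:
        raise ValueError("elapsed times must be positive")
    return baseline_seconds / candidate_seconds


# Ratios quoted on the slide.
C3_VS_CR1 = 1.81       # C3 is 1.81x faster than CR1
PHYSICAL_VS_C3 = 1.70  # assumption: "1.70x slower" compares C3 with the InfiniBand cluster

# Hypothetical CR1 elapsed time, chosen only to make the arithmetic concrete.
cr1_seconds = 1000.0
c3_seconds = cr1_seconds / C3_VS_CR1
physical_seconds = c3_seconds / PHYSICAL_VS_C3

print(f"C3 vs CR1:       {speedup(cr1_seconds, c3_seconds):.2f}x")
print(f"physical vs C3:  {speedup(c3_seconds, physical_seconds):.2f}x")
print(f"physical vs CR1: {speedup(cr1_seconds, physical_seconds):.2f}x")
```

Under that reading, the physical cluster would be roughly 1.81 x 1.70 times faster than CR1 on this benchmark, which is why the slide treats C3 as a large step forward but not yet a replacement for InfiniBand.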

Remote Visualization
- Result data is too large to download; transferring huge data is NOT an option.
- Remote visualization is required for huge result data.
Remote Desktop Console (diagram: the user's client connects by remote access via RDC/VNC to a G2 AWS graphics server)
- Consumes server-side GPU resources and license
- Performance is not good: slower response, slower rendering
Server-Client Mode (diagram: the user's client works against an AWS file server)
- Consumes client-side GPU resources
- Consumes server-side license
- Great performance!!! Almost the same performance as a local workstation with a high-end graphics card

Data Collaboration
- Transferring huge data is NOT an option.
- Even the 48TB of a d2.8xlarge may not be sufficient for a long-term / huge data repository.
- Re-computing a large-scale model is costly.
- Amazon Simple Storage Service (S3)
Diagram labels: Users; Client; job submission; Cluster Master; Computing Nodes; Shared storage; S3 bucket; small data back to client
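A rough calculation shows why moving result data back and forth is treated as off the table. The sketch below estimates how long a 48TB data set (the d2.8xlarge capacity mentioned on the slide) would take to transfer. The link speeds and the 80% efficiency factor are assumptions for illustration, not measurements from the presentation.

```python
def transfer_hours(size_tb: float, link_gbps: float, efficiency: float = 0.8) -> float:
    """Hours needed to move size_tb terabytes (10**12 bytes each) over a network link.

    efficiency is the assumed fraction of the nominal link speed actually achieved.
    """
    if size_tb < 0 or link_gbps <= 0 or not 0 < efficiency <= 1:
        raise ValueError("invalid size, link speed, or efficiency")
    bits = size_tb * 1e12 * 8
    usable_bits_per_second = link_gbps * 1e9 * efficiency
    return bits / usable_bits_per_second / 3600


# Hypothetical WAN link speeds in Gbit/s.
for gbps in (0.1, 1.0, 10.0):
    hours = transfer_hours(48, gbps)
    print(f"{gbps:>5} Gbit/s: {hours:8.1f} h ({hours / 24:.1f} days)")
```

Even on a fully used 1 Gbit/s link the transfer takes days, which supports keeping the data in S3 next to the compute nodes and sending only small results back to the client.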

Performance
- Scalability
  - C3.8xlarge improved scalability dramatically
  - Higher scalability is better
- Remote Visualization
  - Star-CCM+ is ready
  - Other applications are NOT ready
- Data Collaboration
  - No need to struggle with storage capacity and durability
AWS can support the whole simulation workflow!!!

Hybrid HPC Architecture
- Local + Cloud = Hybrid HPC environment
- AWS + Cycle Computing: http://www.cyclecomputing.com/
Diagram labels: Users; Client; HGST Local Cluster; Fixed Capability; Shared Storage; AWS; Virtual Private Cloud; Cluster Master; attached data I/O; Computing Nodes; Auto Scale Out / In; S3 bucket
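The "Auto Scale Out / In" idea can be sketched as a small sizing rule: the local cluster has fixed capacity, and only the demand it cannot absorb is sent to cloud nodes. The function below is a hypothetical illustration, not Cycle Computing's or AWS's actual API. The function name, the 32 cores per node, and the node cap are all assumed values.

```python
import math


def desired_cloud_nodes(queued_cores: int, local_free_cores: int,
                        cores_per_node: int, max_nodes: int) -> int:
    """Cloud nodes needed for the demand the fixed local cluster cannot absorb.

    Returns 0 when the local cluster can take the whole queue (scale in)
    and never exceeds max_nodes (a budget or quota cap).
    """
    if cores_per_node <= 0 or max_nodes < 0:
        raise ValueError("cores_per_node must be positive and max_nodes non-negative")
    overflow = max(0, queued_cores - local_free_cores)
    return min(max_nodes, math.ceil(overflow / cores_per_node))


# Hypothetical situation: 512 cores queued, 128 local cores free, 32-core cloud nodes.
print(desired_cloud_nodes(queued_cores=512, local_free_cores=128,
                          cores_per_node=32, max_nodes=64))
```

A scheduler would re-evaluate a rule like this periodically, starting nodes when the result rises and terminating idle ones when it falls.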

Shape Compute To Match Work To Be Done
- All jobs run in parallel on AWS
- 1.67x throughput improvement
Figure (timeline): Before, on the shared cluster computer, 512-core, 256-core and 128-core jobs wait for running jobs to finish. Today, on the AWS EC2 CC2 cluster (max total 512 cores), jobs run in parallel.
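The before/after picture can be approximated with a toy queue model: on a fixed shared cluster, jobs start strictly in order and wait for cores, while in the cloud each job is assumed to get its own right-sized cluster immediately. The job list and runtimes below are invented for illustration and are not meant to reproduce the 1.67x figure from the slide.

```python
import heapq


def fifo_makespan(jobs, total_cores):
    """Finish time of the last job when jobs start strictly in order on a fixed cluster.

    jobs is a list of (cores, hours) tuples.
    """
    running = []  # min-heap of (finish_time, cores) for jobs still holding cores
    free = total_cores
    now = 0.0
    makespan = 0.0
    for cores, hours in jobs:
        if cores > total_cores:
            raise ValueError("job needs more cores than the cluster has")
        # Wait for earlier jobs to finish until enough cores are free.
        while free < cores:
            finish, released = heapq.heappop(running)
            now = max(now, finish)
            free += released
        heapq.heappush(running, (now + hours, cores))
        free -= cores
        makespan = max(makespan, now + hours)
    return makespan


def elastic_makespan(jobs):
    """Finish time when every job starts at time zero on its own cluster (assumption)."""
    return max(hours for _, hours in jobs)


# Hypothetical workload: (cores, hours).
jobs = [(512, 4), (256, 2), (256, 2), (128, 1)]
shared = fifo_makespan(jobs, total_cores=512)
cloud = elastic_makespan(jobs)
print(f"shared cluster: {shared} h, elastic: {cloud} h, ratio {shared / cloud:.2f}x")
```

The gain depends entirely on how much queueing the workload suffers on the shared cluster, so a measured improvement such as the one on the slide has to come from real job logs.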

Shape Compute To Match Work To Be Done (Cont.)

Shape Storage To Match Work To Be Done
- No need to struggle with storage capacity and durability!!!
Diagram labels: Users; Client; job submission; Cluster Master; Computing Nodes; Shared storage; S3 bucket; small data back to client

Shape Cost To Match Work To Be Done
- Workload is NOT constant
- Server reservation discount = Reserved Instances (RI)
- Analyze the workload, utilize RIs, optimize cost
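The "analyze, utilize RI, optimize" loop can be illustrated with a small cost model. Reserved capacity is paid for every hour whether it is used or not, and demand above the reservation is bought on demand. The rates and the demand profile below are hypothetical and are not AWS prices. Real Reserved Instance pricing has upfront and term options that this sketch ignores.

```python
def total_cost(hourly_demand, reserved, on_demand_rate, reserved_rate):
    """Cost of a demand profile given a number of reserved instances.

    hourly_demand: instances needed in each hour.
    reserved_rate: assumed effective hourly price of one reserved instance.
    """
    reserved_cost = reserved * reserved_rate * len(hourly_demand)
    overflow_instance_hours = sum(max(0, d - reserved) for d in hourly_demand)
    return reserved_cost + overflow_instance_hours * on_demand_rate


def best_reservation(hourly_demand, on_demand_rate, reserved_rate):
    """Number of reserved instances that minimizes total cost for this profile."""
    if not hourly_demand:
        return 0
    candidates = range(max(hourly_demand) + 1)
    return min(candidates,
               key=lambda r: total_cost(hourly_demand, r, on_demand_rate, reserved_rate))


# Hypothetical day: a baseline of 2 instances with an 8-hour burst to 10.
demand = [2] * 16 + [10] * 8
best = best_reservation(demand, on_demand_rate=1.0, reserved_rate=0.6)
print(best, total_cost(demand, best, 1.0, 0.6))
```

With a bursty profile like this one, reserving only the steady baseline and buying the burst on demand is cheaper than either reserving for the peak or reserving nothing, which is the point of analyzing the workload first.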

What's next for Cloud HPC
- Computing Performance
  - More scalability, like InfiniBand
- Remote Visualization
  - Higher performance than RDC-TCP/IP
  - PC over IP? NICE DCV?
  - Star-CCM+ is ready!!!
- Commercial Application License
  - End User License Agreement (EULA)
  - Hybrid license server
  - Consumption-based license
  - Power On Demand!!!
  - Local license server

Summary
- At this moment, HPC on AWS is NOT perfect
  - Scalability, and remote visualization except for Star-CCM+
- HPC on AWS has extremely high flexibility
  - Hybrid HPC; shape compute/storage/cost to match work to be done
  - Flexibility will help in responding to changing business models
- The benefit of HPC on AWS should be verified for each application based on its characteristics
  - Requires collaboration with application vendors

Helping the World Harness the Power of Data with Smarter Storage Solutions