Scalability in the Cloud HPC Convergence with Big Data in Design, Engineering, Manufacturing



Similar documents
AWS Cloud for HPC and Big Data

AIST Data Symposium. Ed Lenta. Managing Director, ANZ Amazon Web Services

Thing Big: How to Scale Your Own Internet of Things.

Accelerating Innovation

Introduction to AWS in Higher Ed

Razvoj Java aplikacija u Amazon AWS Cloud: Praktična demonstracija

Introduction to Amazon Web Services! Leo Senior Solutions Architect

CLOUD COMPUTING & DIGITAL CUSTOMER EXPERIENCE. Nicola Previati Territory Manager Italy

Using GPUs in the Cloud for Scalable HPC in Engineering and Manufacturing March 26, 2014

How To Use Aws.Com

EEDC. Scalability Study of web apps in AWS. Execution Environments for Distributed Computing

Innovative Geschäftsmodelle Ermöglicht durch die AWS Cloud

Analyzing Cloud Costs

Scalable Application. Mikalai Alimenkou

AWS IaaS Services. Methods Digital GCloud Service Definition

Background on Elastic Compute Cloud (EC2) AMI s to choose from including servers hosted on different Linux distros

Amazon EC2 Product Details Page 1 of 5

LONDON. 2015, Amazon Web Services, Inc. or its affiliates. All rights reserved

AWS Performance Tuning

Amazon Web Services Annual ALGIM Conference. Tim Dacombe-Bird Regional Sales Manager Amazon Web Services New Zealand

Hadoop & Spark Using Amazon EMR

Amazon Web Services. Lawrence Berkeley LabTech Conference 9/10/15. Jamie Baker Federal Scientific Account Manager AWS WWPS

Cloud and the future of Unemployment Sean Rhody, CTO Capgemini Government Solutions

Netop Environment Security. Unified security to all Netop products while leveraging the benefits of cloud computing

Scalable Architecture on Amazon AWS Cloud

CLOUD COMPUTING FOR THE ENTERPRISE AND GLOBAL COMPANIES Steve Midgley Head of AWS EMEA

ur skills.com

CIS 4930/6930 Spring 2014 Introduction to Data Science Data Intensive Computing. University of Florida, CISE Department Prof.

Scaling in the Cloud with AWS. By: Eli White (CTO & mojolive) eliw.com - mojolive.com

Real Time Big Data Processing

Microservices on AWS

How To Manage An Orgsync Database On An Amazon Cloud 2 Instance

How AWS Pricing Works May 2015

Cloud Computing with Amazon Web Services and the DevOps Methodology.

An Introduction to High Performance Computing on AWS

Service Organization Controls 3 Report

Cloud Computing for Research. Jeff Barr - January 2011

SAS BIG DATA SOLUTIONS ON AWS SAS FORUM ESPAÑA, OCTOBER 16 TH, 2014 IAN MEYERS SOLUTIONS ARCHITECT / AMAZON WEB SERVICES

A Comparison of Clouds: Amazon Web Services, Windows Azure, Google Cloud Platform, VMWare and Others (Fall 2012)

Last time. Today. IaaS Providers. Amazon Web Services, overview

Running Oracle Applications on AWS

AWS Benefits, Regions & Across. Paul Yung Head of Territory Development HK, Macau & TW pyung@amazon.com

How AWS Pricing Works

Getting Started with SAP BI on AWS

Shifting cloud cover: The changing technological and legal landscape of cloud contracting

Financial Services Grid Computing on Amazon Web Services January 2013 Ian Meyers

Last time. Today. IaaS Providers. Amazon Web Services, overview

Big Data on AWS. Services Overview. Bernie Nallamotu Principle Solutions Architect

AWS Directory Service. Simple AD Administration Guide Version 1.0

Amazon Web Services Fredrik Rapp, Partner Manager. 2015, Amazon Web Services, Inc. or its affiliates. All rights reserved.

The Cloud as a Computing Platform: Options for the Enterprise

HADOOP BIG DATA DEVELOPER TRAINING AGENDA

Enterprise Cloud Computing with AWS. for internal partner use only

Preparing Your IT for the Holidays. A quick start guide to take your e-commerce to the Cloud

CAPTURING & PROCESSING REAL-TIME DATA ON AWS

Building your Big Data Architecture on Amazon Web Services

How to Leverage Cloud to Quickly Build Scalable Applications

Creating a Cloud Standard How to accelerate your business and be an IT hero

Design for Failure High Availability Architectures using AWS

CLOUD COMPUTING WITH AWS An INTRODUCTION. John Hildebrandt Solutions Architect ANZ

Uses, considerations and recommendations for AWS Scalar Decisions Inc. Not for distribution outside of intended audience.

Amazon.com, Inc. and its affiliates. All rights reserved.

PCI on Amazon Web Services (AWS) What You Need To Know Understanding the regulatory roadmap for PCI on AWS

CLOUD GAMING WITH NVIDIA GRID TECHNOLOGIES Franck DIARD, Ph.D., SW Chief Software Architect GDC 2014

Building Real-Time Analytics Into Big Data Applications

Cost Optimization with AWS

TECHNOLOGY WHITE PAPER Jun 2012

Amazon Web Services Yu Xiao

Amazon Elastic Beanstalk

The Scenario: Priority Matrix for Cloud Computing

Using ArcGIS for Server in the Amazon Cloud

Research on AWS. Genomics Research Statistical Analysis Mathematical Modeling Computational Fluid Dynamics Grid/Cluster/High Performance Computing

SOC on Amazon Web Services (AWS) What You Need To Know Understanding the regulatory roadmap for SOC on AWS

How To Set Up Wiremock In Anhtml.Com On A Testnet On A Linux Server On A Microsoft Powerbook 2.5 (Powerbook) On A Powerbook 1.5 On A Macbook 2 (Powerbooks)

Using ArcGIS for Server in the Amazon Cloud

Shadi Khalifa Database Systems Laboratory (DSL)

HPC on AWS. Hiroshi Kobayashi, Dev./Lab. IT System HGST Japan, Ltd. Jun 3, 2015

Amazon Web Services: a Case Study Course: Business Process for IT Services 2012, EPFL

AWS at Telenor Digital. Internet Services for the next Billion

How to Run Your Enterprise Applications on Cloud

ColdFusion 10 in the Amazon AWS Cloud. Sven Ramuschkat tecracer GmbH

Expand Your Infrastructure with the Elastic Cloud. Mark Ryland Chief Solutions Architect Jenn Steele Product Marketing Manager

Amazon Web Services Primer. William Strickland COP 6938 Fall 2012 University of Central Florida

AGENDA. Overview GPU Video Encoding NVIDIA Video Encoding Capabilities. Software API Performance & Quality. Kepler vs Maxwell GPU capabilities Roadmap

Cloud Computing Benefits for Educational Institutions

AWS Big Science Best Practices Dario Rivera AWS Solutions Architect Brendan Bouffler AWS BDM Scientific Computing

Intel HPC Distribution for Apache Hadoop* Software including Intel Enterprise Edition for Lustre* Software. SC13, November, 2013

Développement logiciel pour le Cloud (TLC)

TECHNOLOGY WHITE PAPER Jan 2016

CONNECTRIA MANAGED AMAZON WEB SERVICES (AWS)

Logentries Insights: The State of Log Management & Analytics for AWS

!"#$%&'()*'+),-./)0' 9##+':,%-.;),0'

EXECUTIVE SUMMARY CONTENTS. 1. Summary 2. Objectives 3. Methodology and Approach 4. Results 5. Next Steps 6. Glossary 7. Appendix. 1.

Financial Services Grid Computing on Amazon Web Services. January, 2016

Managed Amazon Web Services

Transcription:

Scalability in the Cloud HPC Convergence with Big Data in Design, Engineering, Manufacturing July 7, 2014 David Pellerin, Business Development Principal Amazon Web Services

What Do We Hear From Customers? Want to run larger HPC workloads, more often Need to scale up and scale out for faster results Want to run more jobs for higher quality of results Need to globally collaborate on critical data To enhance the productivity of distributed teams To operate globally, with greater data governance Need to manage large, diverse datasets Ingest and analyze, query, and take action Including connected devices, Internet-of-Things

A Cloud First Strategy at HGST HGST is using the cloud for a higher performance, lower cost, faster deployed solution vs buying a huge on-site cluster. - Steve Philpott, CIO HGST HPC and big data roadmap: Molecular dynamics simulations Collaboration tools for engineering Big data for manufacturing yield analysis CAD, CFD, EDA cloud first Every application presents unique challenges some technical, some business

Cloud Scalability for Simulations Computer-Aided Design, Simulation, Analysis, Visualization Across industries, the trend is Simulation-Driven Design and Discovery Examples in Design and Manufacturing Computer-Aided Design Electronic Design Automation Computational Fluid Dynamics Molecular Modeling Seismic Modeling and Reservoir Simulations Cloud enables scalable simulations for global manufacturers Cloud also enables HPC convergence with big data

Simulations for Electronic Product Design No job queue Using cloud, simulation sweeps are possible in hours instead of weeks Courtesy of Cypress Semiconductor

Job Queues Are Evil! Conflicting goals HPC users seek fastest possible time-to-results IT support team seeks highest possible utilization? Result: The job queue becomes the capacity buffer Users are frustrated and run fewer simulations, or they try to game the system Money is being saved, but for the wrong reasons!

Instead, run multiple clusters at the same time, on-demand

Match the Architectures to the Jobs

Automation allows easier management of clusters, and monitoring of jobs and costs n a secure Virtual Private Cloud

Many HPC Deployment Methods Traditional HPC schedulers and cluster managers MOAB/Torque, Bright LSF/OpenLava, Grid Engine, SLURM, Condor Born in the cloud tools MIT StarCluster Cycle Computing CycleServer AWS-provided tools and APIs Cloudformation, Auto Scaling cfncluster (github.com/awslabs/cfncluster)

Multiple Consumption Models On-Demand Reserved Spot Dedicated Pay for compute capacity by the hour with no long-term commitments Make a low, one-time payment and receive a significant discount on the hourly charge Bid for unused capacity, charged at a Spot Price which fluctuates based on supply and demand Launch instances within Amazon VPC that run on hardware dedicated to a single customer For spiky workloads, or to define needs For committed utilization For time-insensitive or transient workloads For highly sensitive or compliance related workloads

Spot Instances for Scale-Out Computing Mark E. Thompson University of Southern California 18 hours 205,000 materials analyzed 156,314 AWS Spot cores at peak 2.3M core-hours Total spending: $33K (Under 1.5 cents per core-hour)

The Democratizing of HPC Deployed in one AWS region (we have 10) In one availability zone in that region Using one placement group in that AZ Consisting entirely of one new instance type (c3.8xlarge) Brought up, benchmarked, and torn down in hours

Development & Deployment Monitoring CloudWatch Application Services Foundation Services RDS Customer Applications Built on Higher Level Services Deployment & Management AWS Global Infrastructure Applications AWS Global Infrastructure SES SNS SQS Compute Databases Dynamo BeanStalk OpsWork Elastic Transcoder CloudSearch SWF ElastiCache RedShift EMR DataPipeline Kinesis AWS Global Infrastructure Storage Cloud Formation Analytics CloudTrail IAM Networking Identity & Access Federation AppStream Content Delivery CloudFront API Human Interaction Support AWS Interaction Globa Web Console l Command Line Infras Libraries, SDK s truct ure EC2 WorkSpaces S3 EBS Glacier Storage Gateway VPC Direct Connect AWS Global Infrastructure ELB Route53 Regions AWS Global Availability Infrastructure Zones Edge Locations

Cloud Collaboration is Secure Collaboration

Collaboration is More Secure in the Cloud Bring the users to the data, don t send the data to the users

Collaboration is More Secure in the Cloud Bring the users to the data, don t send the data to the users

Collaboration with Secure Remote Access Data and computation hosted in a secure, customer-managed virtual private cloud, with controlled access via a wide variety of client devices. Virtual Private Cloud Image courtesy of Calgary Scientific

NVIDIA GRID K520 in AWS Cloud Product Name GRID K520 GPUs CUDA cores Core Clocks Memory Size HW Video Encoder Power Consumption Supported APIs 2 x GK104 GPUs 3,072 (1,536 per GPU) 800 MHz 8GB GDDR5 (4GB per GPU) 2x h.264 (1 per GPU) 225W OpenGL 4.3, DirectX 9, 10, 11, CUDA 5.5, OpenCL 1.1, NVFBC, NVIFR, NVENC

Application Streaming Middleware

Amazon AppStream Application Streaming Remote visualization Thin client 3D applications

Cloud Plus Big Data Equals Awesome

Integration of Big Data and HPC via Cloud Cloud provides a global, distributed, secure, and scalable environment for collaborative design and manufacturing

AWS Has Always Been About Big Data

Big Data is Everywhere!

Big Data in Manufacturing Managing big data for competitive advantage For design, engineering, production environments Quality and Yield Analysis, Statistical Process Control Structured and unstructured queries Input Manufacturing facility monitoring In-field device monitoring Logging Log4J Appender push to Kinesis Processing Elastic MapReduce pull from Yield analysis Hive Pig Cascading MapReduce

Sending & Reading Data from Kinesis Streams Sending Reading HTTP Post Get* APIs AWS SDK LOG4J Kinesis Client Library + Connector Library Apache Storm Flume Fluentd Amazon Elastic MapReduce

Summary Cloud for Scalability Cloud for Global Collaboration Cloud for Big Data