Performance Analysis of a Numerical Weather Prediction Application in Microsoft Azure

Similar documents
Cloud Computing. Alex Crawford Ben Johnstone

CloudFTP: A free Storage Cloud

Emerging Technology for the Next Decade

Virtual Machine Instance Scheduling in IaaS Clouds

DOCLITE: DOCKER CONTAINER-BASED LIGHTWEIGHT BENCHMARKING ON THE CLOUD

Het is een kleine stap naar een hybrid cloud

CUMULUX WHICH CLOUD PLATFORM IS RIGHT FOR YOU? COMPARING CLOUD PLATFORMS. Review Business and Technology Series

Cloud Computing Submitted By : Fahim Ilyas ( ) Submitted To : Martin Johnson Submitted On: 31 st May, 2009

Silviu Panica, Marian Neagul, Daniela Zaharie and Dana Petcu (Romania)

Data Sharing Options for Scientific Workflows on Amazon EC2

Cloud Computing An Elephant In The Dark

A Gentle Introduction to Cloud Computing

wu.cloud: Insights Gained from Operating a Private Cloud System

Amazon EC2 XenApp Scalability Analysis

Cloud computing - Architecting in the cloud

Alternative Deployment Models for Cloud Computing in HPC Applications. Society of HPC Professionals November 9, 2011 Steve Hebert, Nimbix

Public Cloud Offerings and Private Cloud Options. Week 2 Lecture 4. M. Ali Babar

IBM Platform Computing Cloud Service Ready to use Platform LSF & Symphony clusters in the SoftLayer cloud

Making a Smooth Transition to a Hybrid Cloud with Microsoft Cloud OS

Variations in Performance and Scalability when Migrating n-tier Applications to Different Clouds

Auto-Scaling Model for Cloud Computing System

CLOUD COMPUTING. A Primer

NASA's Strategy and Activities in Server Side Analytics

A Cost-Evaluation of MapReduce Applications in the Cloud

Demystifying the Cloud Computing

Sistemi Operativi e Reti. Cloud Computing

Open Cloud System. (Integration of Eucalyptus, Hadoop and AppScale into deployment of University Private Cloud)

<Insert Picture Here> Infrastructure as a Service (IaaS) Cloud Computing for Enterprises

ArcGIS for Server: In the Cloud

White Paper on CLOUD COMPUTING

Benchmarking Amazon s EC2 Cloud Platform

Cluster, Grid, Cloud Concepts

How to Do/Evaluate Cloud Computing Research. Young Choon Lee

Research Paper Available online at: A COMPARATIVE STUDY OF CLOUD COMPUTING SERVICE PROVIDERS


Cloud Computing Simulation Using CloudSim

VM Management for Green Data Centres with the OpenNebula Virtual Infrastructure Engine

On Demand Satellite Image Processing

Unleash the IaaS Cloud About VMware vcloud Director and more VMUG.BE June 1 st 2012

ACANO SOLUTION VIRTUALIZED DEPLOYMENTS. White Paper. Simon Evans, Acano Chief Scientist

Li Sheng. Nowadays, with the booming development of network-based computing, more and more

Today: Data Centers & Cloud Computing" Data Centers"

Data Centers and Cloud Computing. Data Centers. MGHPCC Data Center. Inside a Data Center

HPC ON WALL ST OPENSTACK AND BIG DATA. Brent Holden Chief Field Architect, Eastern US April 2014

CloudSim: A Toolkit for Modeling and Simulation of Cloud Computing Environments and Evaluation of Resource Provisioning Algorithms

High Performance Computing in CST STUDIO SUITE

Neptune. A Domain Specific Language for Deploying HPC Software on Cloud Platforms. Chris Bunch Navraj Chohan Chandra Krintz Khawaja Shams

Cloud Computing. Chapter 1 Introducing Cloud Computing

Automating Big Data Benchmarking for Different Architectures with ALOJA

CloudCenter Full Lifecycle Management. An application-defined approach to deploying and managing applications in any datacenter or cloud environment

OTM in the Cloud. Ryan Haney

Datasheet Fujitsu Cloud Infrastructure Management Software V1

Comparing major cloud-service providers: virtual processor performance. A Cloud Report by Danny Gee, and Kenny Li

Workshop on Parallel and Distributed Scientific and Engineering Computing, Shanghai, 25 May 2012

Evaluation Methodology of Converged Cloud Environments

Comparison of Windows IaaS Environments

Data Centers and Cloud Computing. Data Centers

Running R from Amazon's Elastic Compute Cloud

How To Compare Cloud Computing To Cloud Platforms And Cloud Computing

How To Create A Cloud Based System For Aaas (Networking)

Cloud Computing. Chapter 1 Introducing Cloud Computing

CHAPTER 8 CLOUD COMPUTING

SUSE Linux Enterprise Server for VMware

Cloud Computing Technology

What Is It? Business Architecture Research Challenges Bibliography. Cloud Computing. Research Challenges Overview. Carlos Eduardo Moreira dos Santos

Part V Applications. What is cloud computing? SaaS has been around for awhile. Cloud Computing: General concepts

Data Centers and Cloud Computing

Service-Oriented Cloud Automation. White Paper

There Are Clouds In Your Future. Jeff Barr Amazon Web (Twitter)

Reallocation and Allocation of Virtual Machines in Cloud Computing Manan D. Shah a, *, Harshad B. Prajapati b

Permanent Link:

Amazon Web Services. Elastic Compute Cloud (EC2) and more...

Eucalyptus: An Open-source Infrastructure for Cloud Computing. Rich Wolski Eucalyptus Systems Inc.

Benchmarking Large Scale Cloud Computing in Asia Pacific

Oracle Applications and Cloud Computing - Future Direction

Microsoft Private Cloud

DataCenter optimization for Cloud Computing

Datacenters and Cloud Computing. Jia Rao Assistant Professor in CS

Cloud Federation to Elastically Increase MapReduce Processing Resources

W H I T E P A P E R. Cloud Computing. Raising Geospatial Technology to the Cloud: Intergraph Strategy for Leveraging Cloud-based Resources

Elastic Cloud Computing in the Open Cirrus Testbed implemented via Eucalyptus

Cloud Computing. Adam Barker

Cloud Computing. Chapter 1 Introducing Cloud Computing

BITDEFENDER SECURITY FOR AMAZON WEB SERVICES

Figure 1. The cloud scales: Amazon EC2 growth [2].

Transcription:

Performance Analysis of a Numerical Weather Prediction Application in Microsoft Azure Emmanuell D Carreño, Eduardo Roloff, Jimmy V. Sanchez, and Philippe O. A. Navaux WSPPD 2015 - XIII Workshop de Processamento Paralelo e Distribuído Grupo de Processamento Paralelo e Distribuído

Introduction Numerical Weather Prediction (NWP) is one area of science that have evolved continuously with the help of computing. Every year datasets grows larger and computing demand reaches the limit of deployed architecture. High Performance Computing (HPC) allowed performing simulations on less time, but also has increased upfront costs. 2

Motivation Analyze the scaling performance of an HPC (NWP) application in Microsoft Azure Cloud Infrastructure. Why? HPC NWP Cloud Low priority jobs to the cloud Hardware investment budget Need for energy efficiency escience platforms On-demand severe weather alert systems Sharing results Previous work: small HPC benchmarks Optimize costs Reduce complexity 3

Outline Background Environment and Methodology Results Analysis Conclusion and Future Work 4

Cloud Computing Developed with the combination and evolution of distributed computing and virtualization. With contributions from grid and parallel computing. Essential Characteristics: On demand self-service Broad network access Resource Pooling Rapid elasticity Measured Service Infrastructure as a Service (IaaS): Control over operating system User managing the VM Storage Limited control networking Deployed applications 5

Azure Cloud computing infrastructure from Microsoft Second cloud provider by market share 19 Datacenters and Diversity of Instance Sizes Why Azure? Better performance than other IaaS Providers (Roloff et al, 2012) Adapted from Microsoft Ignite 2015 6

BRAMS Numerical weather prediction model used to simulate atmospheric conditions. Modified to simulate in a better way the tropical atmosphere, specially Brazil climate characteristics. Developed by the CPTEC-INPE. Used by Brazilian and South American regional weather centers. Source: Longo et al. 2013 7

BRAMS Workflow Time Stage 1 Download/ Convert Initial data Stage 2 Stage 3 Stage 4 Stage 5 Run MAKESFC stage Run MAKEVFILE stage Run INITIAL stage Run Postprocessing Content Size [GB] CLIMATOLOGY 0.52 DP 86.00 EMISSION-DATA 3.60 FIRES-DATA/DSA 0.05 FIRES-DATA/MODIS 0.18 SOIL-MOISTURE 23.00 SURFACE-DATA 8.80 Total 122.15 SST* Topo* Veg* Soil* Soil Humid* NDVI BRAMS MAKESFC BRAMS INITIAL Analysis History Post Output data Size [MB] MAKESFC & MAKEVFILE 12 Analysis 1100 History 147 Post-process 8 Total 1267 BRAMS MAKEVFILE Initial Conditions* Boundary Conditions Observational Data* 8

Outline Background Environment and Methodology Results Analysis Conclusion and Future Work 9

BRAMS: Cloud Lifecycle Controlling the cloud service deployment of BRAMS Initial cloud parameters Instantiable BRAMS VM Deploying VMs Workflow control Removing parts Create 1 Affinity Group 2 Storage Account 3 Cloud Service Base VM 1 Instantiate 2 Capture Instantiate 1 Frontend 2 Compute Nodes Reproducible experiments Reduce deployment time Reduce costs Automate service Usable in parallel applications Delete 1 Compute Nodes 2 Frontend 3 Base VM 4 Storage Account 5 Cloud Service 6 Affinity Group Control 1 Start Frontend 2 Stop Frontend 3 Start Compute nodes 4 Stop compute nodes 10

BRAMS Deployment Architecture Azure Virtual Machines Azure File Service Frontend VM Processing VM1 Processing VM2 Processing VM3 Processing VM N Frontend gets input data Storage using Azure File Service Processing VMs created on-demand Networking in the same datacenter location 11

Environments Characteristics Cloud Instance Sizes A4 A7 A8 A9 Processor Model Xeon E5-2660 Xeon E5-2670 Processor Speed (GHz) 2,2 2,6 Number of CPU Cores 8 8 8 16 Memory (GB) 14 56 56 112 Networking (Mbps) 800 2000 5000 10000 Forecast: 72h Resolution: 50 km Perimeter: 6000 km Area: 1500x1500 km 12

Outline Background Environment and Methodoloty Results Analysis Conclusion and Future Work 13

Performance - Scalability 168 cores, 20KM Low performance variation. Not scaling well beyond 88 cores. After 56 cores time reduction is less than two percent Caused by Networking? 14

Performance - Scalability 160 cores, 5KM Similar behavior Scaling after 80 cores minimal Application limitation, communication. 15

Post-processing Modifying the final part of the workflow. Using GrADS to generate contours or shaded plots. Using BING maps to obtain background Forecast: 72h Resolution: 50 km Area: 1500x1500 km 16

Outline Background Environment and Methodology Results Analysis Conclusion and Future Work 17

Conclusion Cloud infrastructure offers possibilities to move HPC applications from legacy code reducing. Variability in performance was low in the cloud, an important characteristic for HPC. Scaling BRAMS up to 168 cores, beyond 88 cores <2% exec time reduction Scaling model resolution from 50km to 5km in the cloud deployment. 18

Future Work Obtaining results from other cloud providers as Amazon EC2 and Google Compute. Capturing more metrics to cover all the aspects of the execution and try to improve the performance of this HPC application in a cloud environment. Parallelize some of the sequential steps in BRAMS workflow if possible. Test new instances like Linux VMs with Infiniband. 19

Performance Analysis of a Numerical Weather Prediction Application in Microsoft Azure Emmanuell D Carreño, Eduardo Roloff, Jimmy V. Sanchez, and Philippe O. A. Navaux WSPPD 2015 - XIII Workshop de Processamento Paralelo e Distribuído Grupo de Processamento Paralelo e Distribuído

THANK YOU! Performance Analysis of a Numerical Weather Prediction Application in Microsoft Azure 21