IBM Platform Computing : infrastructure management for HPC solutions on OpenPOWER Jing Li, Software Development Manager IBM



Similar documents
Managed Cloud Services

IBM Spectrum Scale vs EMC Isilon for IBM Spectrum Protect Workloads

Boas Betzler. Planet. Globally Distributed IaaS Platform Examples AWS and SoftLayer. November 9, IBM Corporation

Clodoaldo Barrera Chief Technical Strategist IBM System Storage. Making a successful transition to Software Defined Storage

IBM Platform Computing Cloud Service Ready to use Platform LSF & Symphony clusters in the SoftLayer cloud

Windows HPC Server 2008 R2 Service Pack 3 (V3 SP3)

High Performance Computing Cloud Offerings from IBM Technical Computing IBM Redbooks Solution Guide

EMC Data Protection Advisor 6.0

Planning, Provisioning and Deploying Enterprise Clouds with Oracle Enterprise Manager 12c Kevin Patterson, Principal Sales Consultant, Enterprise

Cloud Computing through Virtualization and HPC technologies

Bright Cluster Manager

Load DynamiX Storage Performance Validation: Fundamental to your Change Management Process

Cloud Computing for Control Systems CERN Openlab Summer Student Program 9/9/2011 ARSALAAN AHMED SHAIKH

<Insert Picture Here> Infrastructure as a Service (IaaS) Cloud Computing for Enterprises

Hadoop as a Service. VMware vcloud Automation Center & Big Data Extension

BMC Cloud Management Functional Architecture Guide TECHNICAL WHITE PAPER

Microsoft Private Cloud

Broadening Moab/TORQUE for Expanding User Needs

An enterprise- grade cloud management platform that enables on- demand, self- service IT operating models for Global 2000 enterprises

Operationalize Policies. Take Action. Establish Policies. Opportunity to use same tools and practices from desktop management in server environment

CloudCenter Full Lifecycle Management. An application-defined approach to deploying and managing applications in any datacenter or cloud environment

DATA CENTER INFRASTRUCTURE MANAGEMENT

IBM PureFlex System. The infrastructure system with integrated expertise

System Management Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice

Monitoring, Managing and Supporting Enterprise Clouds with Oracle Enterprise Manager 12c Name, Title Oracle

Hitachi Unified Compute Platform (UCP) Pro for VMware vsphere

Petascale Software Challenges. Piyush Chaudhary High Performance Computing

Enterprise IT is complex. Today, IT infrastructure spans the physical, the virtual and applications, and crosses public, private and hybrid clouds.

Cloud Lifecycle Management

Making a Smooth Transition to a Hybrid Cloud with Microsoft Cloud OS

Vistara Lifecycle Management

Private cloud computing advances

Using SUSE Cloud to Orchestrate Multiple Hypervisors and Storage at ADP

Lab Management, Device Provisioning and Test Automation Software

W H I T E PA P E R Intr t oduction t t o o t Cloud f C oud or HPC o C

Developing Parallel Applications with the Eclipse Parallel Tools Platform

How To Get The Most Out Of Sagecrm V7.1

Managed Hosting is a managed service provided by MN.IT. It is structured to help customers meet:

Load and Performance Load Testing. RadView Software October

Cisco Process Orchestrator Adapter for Cisco UCS Manager: Automate Enterprise IT Workflows

Cloud Computing. Adam Barker

WHITE PAPER: Egenera Cloud Suite

IBM Infrastructure Suite for z/vm and Linux

Introduction to Cloud for HPC

Integration and Automation with Lenovo XClarity Administrator

IBM EXAM QUESTIONS & ANSWERS

BI xpress Product Overview

WHITE PAPER: Egenera Cloud Suite for EMC VSPEX. The Proven Solution For Building Cloud Services

7/15/2011. Monitoring and Managing VDI. Monitoring a VDI Deployment. Veeam Monitor. Veeam Monitor

MICROSOFT CLOUD REFERENCE ARCHITECTURE: FOUNDATION

Data Integration Checklist

Total Cloud Control with Oracle Enterprise Manager 12c. Kevin Patterson, Principal Sales Consultant, Enterprise Manager Oracle

IBM ELASTIC STORAGE SEAN LEE

Empowering Private Cloud with Next Generation Infrastructure. Martin Ip, Head of Advanced Solutions and Services Macroview Telecom

The Top Six Advantages of CUDA-Ready Clusters. Ian Lumb Bright Evangelist

Track-It! 8.5. The World s Most Widely Installed Help Desk and Asset Management Solution

Understand IBM Cloud Manager V4.2 for IBM z Systems

Microsoft Cloud Platform System. powered by Dell

Veritas Storage Foundation High Availability for Windows by Symantec

SOLUTION WHITE PAPER. Building a flexible, intelligent cloud

Sistemi Operativi e Reti. Cloud Computing

Red Hat Satellite Management and automation of your Red Hat Enterprise Linux environment

Big data management with IBM General Parallel File System

User's Guide - Beta 1 Draft

Symantec Storage Foundation High Availability for Windows

Moving beyond Virtualization as you make your Cloud journey. David Angradi

Red Hat Network Satellite Management and automation of your Red Hat Enterprise Linux environment

Cloud Infrastructure Management - IBM VMControl

2015 LENOVO. ALL RIGHTS RESERVED. Isabel Zarate Lenovo EBG Leader

Cisco Tidal Enterprise Scheduler

A Gentle Introduction to Cloud Computing

Simplified Private Cloud Management

Why Cisco for Cloud? IT Service Delivery, Orchestration and Automation

Management Packs for Database

Accelerating Innovation with Self- Service HPC

AppStack Technology Overview Model-Driven Application Management for the Cloud

Automatizace Private Cloud. Petr Košec, Microsoft MVP, MCT, MCSE

OpenPOWER Outlook AXEL KOEHLER SR. SOLUTION ARCHITECT HPC

1 What is Cloud Computing? Cloud Infrastructures OpenStack Amazon EC CAMF Cloud Application Management

ProjExec Project Management for IBM Collaborative Platforms. Simple and effective project execution with collaboration for all project needs

SPEED your path to virtualization.

Jitterbit Technical Overview : Microsoft Dynamics CRM

Intel Service Assurance Administrator. Product Overview

Cloud Optimize Your IT

Virtualized, Converged Data Centers and Cloud Service Providers

GTC Presentation March 19, Copyright 2012 Penguin Computing, Inc. All rights reserved

IBM PureData System for Transactions. Technical Deep Dive. Jonathan Rossi, PureSystems Specialist

Cisco Enterprise Mobility Services Platform

Red Hat Enterprise Linux 6. Stanislav Polášek ELOS Technologies

Transcription:

IBM Platform Computing : infrastructure management for HPC solutions on OpenPOWER Jing Li, Software Development Manager IBM #OpenPOWERSummit Join the conversation at #OpenPOWERSummit 1

Scale-out and Cloud Infrastructure Management Needs Traditional HPC cluster Multi-tenants HPC environment Data Management Provisioning & monitoring system for scale-out computing infrastructures Technical computing capacity to multiple departments, projects, and users (multi-tenants) Self service capability Elastic Storage Management and monitoring Elastic Storage Appliance management and monitoring Cloud Infrastructure VM, virtual network, & virtual storage management Underlying infrastructure (undercloud) management Configuration management Capacity overflow Automated cloud bursting capabilities Flexibility and benefits of VMs Big data clusters Multi-tenancy Enterprise class system management capabilities (audit, Role Based Access Control etc.)

Architecture Overview IBM Platform LSF Family IBM Parallel Environment IBM Platform Cluster Manger Advanced Edition Unified Web-based Interface Cluster template designer IBM Spectrum Scale template IBM Platform LSF template Infrastructure Management Monitoring and Reporting Mellanox InfiniBand NVIDIA GPUs Hypervisor Infrastructure Services Join the conversation at #OpenPOWERSummit 3

IBM Platform Cluster Manager Overview Powerful lifecycle management for scale-out cluster environments Key Capabilities Simplified management with cluster template designer Scales from single clusters to complex multi-team environments Robust, scalable alerting and reporting Automated infrastructure management one-click cluster deployment Spectrum Scale cluster support Benefits Faster time to cluster readiness Unified interface for management and monitoring Increased administrator productivity Single infrastructure supporting multiple business needs

Infrastructure at a glance Single interface for management and monitoring of multiple clusters Dashboard provides overview of resources, allocations

Rackview graphical cluster overview 2D representation of the data center (racks, nodes) It allows administrator to quickly examine the status of individual nodes Admin can drill into node details by clicking the node Chassis console can be launched from the rack view Overview of entire cluster at a glance

Drag and drop cluster builder Out of the box templates for Platform LSF, Spectrum Scale (GPFS) clusters

Managing node personalities Provisioning templates, image and network profiles can be easily managed all through the GUI.

Hardware management and monitoring Node detail is monitored with system & performance Management capabilities include: power cycling, firmware updates, OS reboot, reprovision, synchronize, node LED control, BMC, SSH, VNC console Integrated network switch and chassis monitoring Integrated Spectrum Scale monitoring

Alerts highlight potential issues in the cluster Fully customizable; alerts can be defined using any monitored metrics Alert can trigger an automated pre-defined action Alert history shows the detail of the triggered alert

Resource Reporting Gauge utilization of the cluster Historical reports can be generated for Cluster availability Cluster performance and usage Free application licenses

Comprehensive HPC Software Stack Products Client Benefits Systems Management Platform Cluster Manager Advanced Edition Ease of Use: web portal Customizable: admin productivity Faster time to system productivity Robust monitoring Application Runtime PE Runtime Ed ESSL / PESSL Optimized Parallel Runtime Optimized LAPACK and ScaLPACK libraries User controlled workflow support Development Productivity PE Developer Ed XL Compiler Modern application development environment using Eclipse Performance analysis tools to help analyze applications Optimized compiler for Power Power Systems S824L Workload Management Platform LSF Optimized utilization of resources Policy, energy and resource aware scheduling Robust add-on features Data Management Spectrum Scale HPSS TSM Scalable/reliable storage for parallel filesystem (GSS) ILM for transparent migration of data from storage to tape and back Application Environment Platform Application Center Platform Process Manager Simplify job submission for repeatable workload: customization Customizable Faster time to system productivity Power Systems S822L

Pdb Debugger IBM Parallel Environment High Performance Execution Environment to Take Full Advantage of Scalable Compute Resources Parallel Operating Environment (POE) for submitting and managing jobs. IBM's MPI and PAMI libraries for communication between parallel tasks. A parallel debugger (pdb) for debugging parallel programs. IBM High Performance Computing Toolkit for analyzing performance of parallel and serial applications. Integrated with LSF to assist in resource management, job submission and node allocation Applications MPI PAMI POE What s New: Ubuntu 14.04.1 Little Endian NV (Non Virtulaized) MPICH as BASE and collective performance improvements MPI 3.0 (via MPICH) MPI I/O Improved Performance

IBM Platform LSF Most Complete Advanced, feature-rich workload scheduling Robust set of add-on features Integrated application support Most Powerful Policy & resource-aware scheduling Resource consolidation for max performance Advanced self-management Most Scalable Thousands of concurrent users & jobs Virtualized pool of shared resources Flexible control, multiple policies Best TCO Optimal utilization, less infrastructure cost Better productivity, faster time to result Robust capabilities, administrative productivity Join the conversation at #OpenPOWERSummit 14

IBM Platform Application Center Increase user productivity with browser based access for job submission, management Capture best practices with guided submission (templates) Enable access via mobile devices with web services Support for 2D/3D remote visualization Join the conversation at #OpenPOWERSummit 15

IBM Platform Process Manager Platform Process Manager Flow Editor Intuitive drag-and-drop interface Creates self-documenting flows Support for sub-flows, job arrays Rich error-handling / retry capability Save workflows in XML format Publish flows directly to Flow Manager Platform Process Manager Flow Manager Manages multiple flows for multiple users and groups simultaneously Monitor workflow execution graphically Trigger flows automatically through calendar events, the flow manager or the command line. Join the conversation at #OpenPOWERSummit 16

Genomic medicine reference architecture http://www.powergene.net Join the conversation at #OpenPOWERSummit 17

Links IBM Platform Computing product information (https://ibm.biz/bdxbdr) Service Management Connect Technical Computing Community (https://ibm.biz/bdfr8r) IBM Knowledge Center (https://ibm.biz/bdxbdx) Join the conversation at #OpenPOWERSummit 18

Contacts Development Manager: Jing Li (jingili@cn.ibm.com) Product Management: Mehdi Bozzo-Rey (mbozzore@ca.ibm.com) Product Marketing: Gabor Samu (gsamu@ca.ibm.com) Join the conversation at #OpenPOWERSummit 19

Q&A Join the conversation at #OpenPOWERSummit 20