Windows IB. Introduction to Windows 2003 Compute Cluster Edition. Eric Lantz Microsoft elantz@microsoft.com



Similar documents
Supercomputing on Windows. Microsoft (Thailand) Limited

Microsoft Compute Clusters in High Performance Technical Computing. Björn Tromsdorf, HPC Product Manager, Microsoft Corporation

Windows Compute Cluster Server Top 5 Most Common Challenges

Technical Overview of Windows HPC Server 2008

Interoperability between Sun Grid Engine and the Windows Compute Cluster

The GRID according to Microsoft

supercomputing. simplified.

"Charting the Course to Your Success!" MOC A Understanding and Administering Windows HPC Server Course Summary

Technical Overview of Windows HPC Server 2008

High Productivity Computing With Windows

On-Demand Supercomputing Multiplies the Possibilities

High Productivity Computing With Windows Windows HPC Server 2008

REFERENCE. Microsoft in HPC. Tejas Karmarkar, Solution Sales Professional, Microsoft

Microsoft Windows Compute Cluster Server 2003 Getting Started Guide

Agenda. HPC Software Stack. HPC Post-Processing Visualization. Case Study National Scientific Center. European HPC Benchmark Center Montpellier PSSC

Parallel application development

Accelerating From Cluster to Cloud: Overview of RDMA on Windows HPC. Wenhao Wu Program Manager Windows HPC team

InfiniBand Software and Protocols Enable Seamless Off-the-shelf Applications Deployment

Living in a mixed world -Interoperability in Windows HPC Server Steven Newhouse stevenn@microsoft.com

Windows Compute Cluster Server Miron Krokhmal CTO

DB2 Connect for NT and the Microsoft Windows NT Load Balancing Service

HPC Software Requirements to Support an HPC Cluster Supercomputer

Cluster Grid Interconects. Tony Kay Chief Architect Enterprise Grid and Networking

MPI / ClusterTools Update and Plans

Cisco Active Network Abstraction Gateway High Availability Solution

History of Windows. INLS 576 Spring 2009 Tuesday, 1/13/2009

Using the Windows Cluster

Quantum StorNext. Product Brief: Distributed LAN Client

Module I-7410 Advanced Linux FS-11 Part1: Virtualization with KVM

Hyper-V over SMB Remote File Storage support in Windows Server 8 Hyper-V. Jose Barreto Principal Program Manager Microsoft Corporation

<Insert Picture Here> Infrastructure as a Service (IaaS) Cloud Computing for Enterprises

PRIMERGY server-based High Performance Computing solutions

Finite Elements Infinite Possibilities. Virtual Simulation and High-Performance Computing

Cluster Implementation and Management; Scheduling

SRNWP Workshop. HP Solutions and Activities in Climate & Weather Research. Michael Riedmann European Performance Center

LS-DYNA Best-Practices: Networking, MPI and Parallel File System Effect on LS-DYNA Performance

Hyper-V over SMB: Remote File Storage Support in Windows Server 2012 Hyper-V. Jose Barreto Principal Program Manager Microsoft Corporation

Petascale Software Challenges. Piyush Chaudhary High Performance Computing

Mellanox Academy Course Catalog. Empower your organization with a new world of educational possibilities

10533A: Deploying, Configuring, and Administering Microsoft Lync Server 2010

Implementing, Managing, and Maintaining a Microsoft Windows Server 2003 Network Infrastructure

Introduction to Infiniband. Hussein N. Harake, Performance U! Winter School

ICONICS Choosing the Correct Edition of MS SQL Server

Kriterien für ein PetaFlop System

Oracle Communications WebRTC Session Controller: Basic Admin. Student Guide

Cloud Optimize Your IT

Mellanox Academy Online Training (E-learning)

Eine CAE Infrastruktur für LS-DYNA. unter Verwendung von. Microsoft Windows HPC Server 2008

Virtualization across the organization

What s New with VMware Virtual Infrastructure

BRIDGING EMC ISILON NAS ON IP TO INFINIBAND NETWORKS WITH MELLANOX SWITCHX

Microsoft Windows Storage Server 2003 R2

Putchong Uthayopas, Kasetsart University

CLOUD COMPUTING & WINDOWS AZURE

Windows Azure Pack Installation and Initial Configuration

Microsoft Exam

App Orchestration Setup Checklist

SMB Advanced Networking for Fault Tolerance and Performance. Jose Barreto Principal Program Managers Microsoft Corporation

PATROL Console Server and RTserver Getting Started

Windows Server 2008 R2 Essentials

Windows Server 2008 Essentials. Installation, Deployment and Management

MCSE SYLLABUS. Exam : Managing and Maintaining a Microsoft Windows Server 2003:

Migration from SharePoint 2007 to SharePoint 2010

Deploying Cisco Unified Contact Center Express Volume 1

Windows 2000 Security Architecture. Peter Brundrett Program Manager Windows 2000 Security Microsoft Corporation

CT LANforge-FIRE VoIP Call Generator

Apache CloudStack 4.x (incubating) Network Setup: excerpt from Installation Guide. Revised February 28, :32 pm Pacific

IT service for life science

STEALTHbits Technologies, Inc. StealthAUDIT v5.1 System Requirements and Installation Notes

Deploying, Configuring, and Administering Microsoft Lync Server 2010

A Tour of the Linux OpenFabrics Stack

VMware ESXi 3.5 update 2

Lustre Networking BY PETER J. BRAAM

Oracle Data Integrator 11g New Features & OBIEE Integration. Presented by: Arun K. Chaturvedi Business Intelligence Consultant/Architect

RED HAT: UNLOCKING THE VALUE OF THE CLOUD

Configuring Windows Server Clusters

xvm Server and xvm VirtualBox Christopher Beal Principal Engineer

Introducing. Markus Erlacher Technical Solution Professional Microsoft Switzerland

Cloud Services ADM. Agent Deployment Guide

MCSE Core exams (Networking) One Client OS Exam. Core Exams (6 Exams Required)

SO114 - Solaris 10 OE Network Administration

Cluster, Grid, Cloud Concepts

Troubleshooting BlackBerry Enterprise Service 10 version Instructor Manual

Active-Active and High Availability

OVERVIEW OF TYPICAL WINDOWS SERVER ROLES

Windows TCP Chimney: Network Protocol Offload for Optimal Application Scalability and Manageability

Siebel Installation Guide for UNIX. Siebel Innovation Pack 2013 Version 8.1/8.2, Rev. A April 2014

IBM Platform Computing : infrastructure management for HPC solutions on OpenPOWER Jing Li, Software Development Manager IBM

Assignment # 1 (Cloud Computing Security)

APPLICATION FOR BOARD APPROVAL. of Locally Developed Course. MCP CERTIFICATION 11/12a/12b/12c

Windows Server on WAAS: Reduce Branch-Office Cost and Complexity with WAN Optimization and Secure, Reliable Local IT Services

locuz.com HPC App Portal V2.0 DATASHEET

Citrix MetaFrame Presentation Server 3.0 and Microsoft Windows Server 2003 Value Add Feature Guide

Getting Started Guide

msuite5 & mdesign Installation Prerequisites

Comprehensive Resource for Licensing and Pricing

Data Center Op+miza+on

Centralizing Mac Home. Live Webinar David McNeely Centrify Geordie Korper Group Logic

Windows HPC 2008 Cluster Launch

DeployStudio Server Quick Install

Transcription:

Datacenter Fabric Workshop Windows IB Introduction to Windows 2003 Compute Cluster Edition Eric Lantz Microsoft elantz@microsoft.com August 22, 2005

What this talk is not about High Availability, Fail-over clustering Scaling out general business applications (ie Exchange, SQL, SAP, etc) August 22, 2005 Datacenter Fabric Workshop Windows IB Page 1 of 19

Agenda HPC Market Definition & Trends Microsoft s Compute Cluster Solution CCE Key Features Deployment Job Scheduling MPI & Networking Development Tools Q&A August 22, 2005 Datacenter Fabric Workshop Windows IB Page 2 of 19

HPC Market Definition & Trends August 22, 2005 Datacenter Fabric Workshop Windows IB Page 3 of 19

Top 500 Supercomputer Trends Industry usage rising Clusters over 60% GigE is leading, IB is growing IA is winning August 22, 2005 Datacenter Fabric Workshop Windows IB Page 4 of 19

HPC Systems are Affecting Every Vertical Leverage Volume Markets of Industry Standard Hardware and Software Rapid Procurement, Installation and Integration of systems Cluster Ready Applications Accelerating Market Growth Engineering Bioinformatics Oil & Gas Finance Government The convergence of affordable high performance hardware and commercial apps is making supercomputing personal August 22, 2005 Datacenter Fabric Workshop Windows IB Page 5 of 19

Supercomputing Goes Personal 1991 1998 2005 System Cray Y-MP C916 Sun HPC10000 Shuttle @ NewEgg.com Architecture OS GFlops Top500 # Price Customers Applications 16 x Vector 4GB, Bus UNICOS ~10 1 $40,000,000 Government Labs Classified, Climate, Physics Research 24 x 333MHz Ultra- SPARCII, 24GB, SBus Solaris 2.5.1 $1,000,000 (40x drop) Large Enterprises Manufacturing, Energy, Finance, Telecom 4 x 2.2GHz Athlon64 4GB, GigE Windows Server 2003 SP1 < $4,000 (250x drop) Every Engineer & Scientist Bioinformatics, Materials Sciences, Digital Media August 22, 2005 Datacenter Fabric Workshop Windows IB Page 6 of 19 ~10 500 ~10 N/A

Solution Requirements Customers require: An integrated supported solution stack Simplified job submission, status and progress monitoring Maximum compute performance and scalability Simplified environment from desktops to HPC clusters Administrators require: Better cluster monitoring and management for maximum resource utilization Flexible, extensible, policy-driven job scheduling and resource allocation Maximum node uptime Secure process startup and complete cleanup Developers Require: Programming environment that enables maximum productivity Availability and optimized compilers (Fortran) and math libraries Parallel debugger, profiler, and visualization tools Parallel programming models (MPI) August 22, 2005 Datacenter Fabric Workshop Windows IB Page 7 of 19

Microsoft Compute Cluster Solution August 22, 2005 Datacenter Fabric Workshop Windows IB Page 8 of 19

Key Scenarios Departmental Cluster: Conventional scenario IT owns large clusters due to complexity and allocates resources on per job basis Users submit batch jobs via scripts In-house and ISV apps, many based on MPI Very poor development tools Manual, batch execution IT Mgr Personal/Workgroup Cluster: Emerging scenario Clusters are pre-packaged OEM appliances, purchased and managed by end-users Desktop HPC applications transparently and interactively make use of cluster resources Desktop development tools integration Interactive Computation and Visualization August 22, 2005 Datacenter Fabric Workshop Windows IB Page 9 of 19

Design Goals Designing for (Corporate/Engineering IT) Appliance-like setup experience Clear, prescriptive setup guidance Simple deployment of head node and compute nodes Minimized complexity for corporate IT integration Most operations are scriptable Leverage existing infrastructure Use Active Directory for user, resource and access management Secure execution, resource access, management Allow customers to use existing deployment tools August 22, 2005 Datacenter Fabric Workshop Windows IB Page 10 of 19

Compute Cluster Solution Mission: Deliver the easiest to deploy and most cost effective solution for solving scaled-out business, engineering and scientific computational problems. Windows Server 2003, Compute Cluster Edition Support for high performance hardware (x64bit architecture) Compute Cluster Pack + = Support for Industry Standards MPI2, RDMA on Ethernet & Infiniband Integrated Job Scheduler Cluster Resource Management Tools Microsoft Compute Cluster Solution Integrated Solution out- of-the the-box Leverages investment in Windows administration and tools Makes cluster operation easy and secure as a single system August 22, 2005 Datacenter Fabric Workshop Windows IB Page 11 of 19

CCS Key Features Integration with existing Windows and management infrastructure Integrates with AD, Windows security and existing systems management and deployment tools Node Deployment and Administration Compute nodes automatically imaged and added to cluster Node Management through UI and command line To Do List to configure head node Extensible job scheduler 3 rd party extensibility at job submission and/or job assignment Examples: admission policies and license verification Submit jobs from command line, UI, or directly from applications Simple management, similar to print queue management Secure MPI User credentials secured in job scheduler and compute nodes Standardized MPI stack Microsoft provided stack reduces application/mpi incompatibility issues August 22, 2005 Datacenter Fabric Workshop Windows IB Page 12 of 19

Typical Cluster Topology AD DNS DHCP Corporate IT Infrastructure Windows Update Monitoring MOM 3 rd party Systems Management SMS 3 rd party Public Network Compute Cluster Private Network MPI Network CNn. 192.168.X.X CN1 192.168.0.X Head Node NAT is configured Compute Nodes and Head Node are member servers in a domain in a corp Active Directory Public Network: Required for connectivity with existing corp network Private Network: Required to separate cluster management and deployment traffic MPI Network: Optional high-speed interconnect network (IB, Gig-Ethernet with RDMA) to August 22, 2005 Datacenter Fabric Workshop Windows IB Page 13 of 19 separate the MPI traffic

Windows Cluster Computing Platform Head Node Active Directory Job Mgmt Cluster Mgmt Scheduling Resource Mgmt Web service User Job Policy, reports admin Console Admin Management Cmd line Job Input User Console Sensors, Workflow, Computation DB or FS Data Data mining, Visualization, Workflow Remote query Compute Node High speed, low latency interconnect (Ethernet over RDMA, Infiniband) MPI Datacenter Fabric Workshop Windows IB Page 14 of 19 Job Execution August 22, 2005 User App

What Happens After I Press Submit Job? Task Execution Client Scheduler orchestrates Node allocation to the tasks Head Node Timing, execution, and clean-up Error Recovery Scheduler Re-try Routing around un-responsive nodes Within a security context Compute Node Compute Node Compute nodes authenticate as the user App App Secure client-scheduler-computenode MPI MPI communication Network Driver Network Driver The Other Layers Application: [your program here] Fast Network Message Passing Interface (MPI): API for messaging between compute nodes cooperating on a task Networking: drivers that enable fast communication via WinSock Direct August 22, 2005 Datacenter Fabric Workshop Windows IB Page 15 of 19

Windows CCE Leverages Winsock Direct Architecture HPC Application User Mode MPI WinSock DLL Switch traffic based on sub-net Winsock Switch IB w/ RDMA GigE w/ RDMA Ethernet IB WinSock Provider DLL User API (verbs based) IP Kernel Mode NDIS Miniport (GigE) Virtual Bus Driver August 22, 2005 User Host Channel Adapter Driver TCP GigE RDMA WinSock Provider DLL Miniport (IPoIB) Kernel API (verbs based) Manage hardware resources in user space (eg. Send & receive queues) OS component Host Channel Adapter Driver Networking Hardware Datacenter Fabric Workshop Windows IB IHV-provided component Page 16 of 19

Associated MS Products Visual Studio Parallel Debugger Automatic attach to MPI processes from IDE Process level stepping Process breakpoints Process sensitive expression evaluation OpenMP support Services For Unix (SFU) Integrate Windows and UNIX/Linux environments Migrate UNIX applications to Windows Directory, File System and UNIX Subsystem Tested and supported by Microsoft August 22, 2005 Datacenter Fabric Workshop Windows IB Page 17 of 19

To Learn More To Learn More Microsoft HPC website: http://www.microsoft.com/hpc/ x64 Info: http://www.microsoft.com/windowsserversystem/64bit/ default.mspx August 22, 2005 Datacenter Fabric Workshop Windows IB Page 18 of 19

Q & A August 22, 2005 Datacenter Fabric Workshop Windows IB Page 19 of 19

Message Passing Interface What is it? Minimalist Answer Software which is automatically installed on each compute node by Windows CCE s node management It s plumbing it has to be there for your HPC apps to run MS MPI based on (and compatible with) an open-source, MPI reference implementation- Argonne National Lab s MPICH2 Why did the MS team choose MPI? MPI emerging as the dominant protocol for parallel compute messaging Do I have to use MS MPI if I use Windows CCP? No, you can use any MPI stack you choose. However, the security features MS HPC have added to MPICH may not be available in other MPI stacks. August 22, 2005 Datacenter Fabric Workshop Windows IB Page 20 of 19

MPI Description MPI is a standard specification, there are many implementations such as MPICH2, MS MPI, etc. MPI consists of 2 parts For ISVs: Full-featured API of160+ functions (can do much work with ~10 functions!) For Users: Command-line (mpiexec) or GUI tool to launch jobs Abstracts communication concepts so even I can create parallel programs! [But it s still not as simple as it must be for common usage] August 22, 2005 Datacenter Fabric Workshop Windows IB Page 21 of 19

MS MPI and MPICH2 MS HPC goal is maximum compatibility with MPICH2 Reference Implementation Full compatibility for ISV s using MPI API s. Exceptions made for with: CCP Scheduler incompatibilities CCP Security Goal incompatibilities Windows-based Performance improvements that do not affect the API s Thus, differences concentrated in job launch/mgmt: MPIExec, MPI Daemon (SMPD) pwdfile, delegate, impersonate, localroot, remove [uninstall smpd], sethosts, etc. August 22, 2005 Datacenter Fabric Workshop Windows IB Page 22 of 19

Parallel Debugger Basic features to debug MPI applications Automatic attach to MPI processes from IDE Process level stepping Process breakpoints Process sensitive expression evaluation August 22, 2005 Datacenter Fabric Workshop Windows IB Page 23 of 19

OpenMP A specification for multithreaded programs Helps hyperthreading Conformance to the OpenMP 2.0 standard Support for.net and OpenMP together Compiler generates MSIL for OpenMP code It consists of a set of simple #pragmas and runtime routines #pragma omp parallel A common technique: Start with sequential code and parallelize by adding #pragmas Most value, where? Parallelizing large loops without loop-dependencies Can do more, but that s the big win August 22, 2005 Datacenter Fabric Workshop Windows IB Page 24 of 19

Windows UNIX Interoperability The Challenge Enable user productivity via Windows for UNIX administrators and developers Approaches Use UNIX Interoperability Tools Compile and Configure UNIX Tools from Source Code Assemble a Collection of Third-party Tools August 22, 2005 Datacenter Fabric Workshop Windows IB Page 25 of 19

Microsoft's Solution - Services for UNIX: Services for UNIX v. 3.5 provides the tools and environment that IT professionals and developers need to: Integrate Windows and UNIX/Linux environments Migrate UNIX applications to Windows Services for UNIX is one of the most comprehensive interoperability solutions: Directory, File System and UNIX Subsystem Tested and supported by Microsoft Services for UNIX uniquely enables IT pros to easily extend the value of their knowledge and training August 22, 2005 Datacenter Fabric Workshop Windows IB Page 26 of 19

Services For UNIX 3.5 Focused on two major customer pain areas Seamless UNIX / Windows Interoperability File & data sharing NFS (Client, Server, Gateway) UNIX / Windows cross-platform management AD / NIS server directory services & interop Bidirectional Password Sync, user name mapping Remote exec tools, rlogin, xterm, telnet, UNIX scripting, Perl UNIX to Windows Application Portability UNIX Tools: C, C++, Fortran, scripts, build tools Interix UNIX subsystem Leverage existing UNIX skills, methods and code Best System Integration Award - LinuxWorld 2003 August 22, 2005 Datacenter Fabric Workshop Windows IB Page 27 of 19