Sun Grid Engine Package for OSCAR A Google SoC 2005 Project



Similar documents
Software Deployment and Configuration

Release Notes for Open Grid Scheduler/Grid Engine. Version: Grid Engine

Linux Distributions. What they are, how they work, which one to choose Avi Alkalay

How to Build an RPM OVERVIEW UNDERSTANDING THE PROCESS OF BUILDING RPMS. Author: Chris Negus Editor: Allison Pranger 09/16/2011

Dynamic Kernel Module Support. Matt Domsch, Software Architect Dell, Inc.

Simple Introduction to Clusters

TCB No September Technical Bulletin. GS FLX+ System & GS FLX System. Installation of 454 Sequencing System Software v2.

Managing your Red Hat Enterprise Linux guests with RHN Satellite

SGE Roll: Users Guide. Version Edition

Work Environment. David Tur HPC Expert. HPC Users Training September, 18th 2015

Grid Engine Basics. Table of Contents. Grid Engine Basics Version 1. (Formerly: Sun Grid Engine)

Grid Engine. Application Integration

Automating Big Data Benchmarking for Different Architectures with ALOJA

installation administration and monitoring of beowulf clusters using open source tools

High Performance Computing with Sun Grid Engine on the HPSCC cluster. Fernando J. Pineda

Grid Engine 6. Troubleshooting. BioTeam Inc.

Automated deployment of virtualization-based research models of distributed computer systems

How to Install an OSCAR Cluster Quick Start Install Guide Software Version 4.2 Documentation Version 4.2

Open Source Datacenter Conference 2011 System Management with RHN Satellite. Dirk Herrmann, Solution Architect, Red Hat

Automated build service to facilitate Continuous Delivery

LSKA 2010 Survey Report Job Scheduler

Introduction to Sun Grid Engine (SGE)

LAE Enterprise Server Installation Guide

Copyright by Parallels Holdings, Ltd. All rights reserved.

Sun Grid Engine, a new scheduler for EGEE

MATLAB Distributed Computing Server Installation Guide. R2012a

Improving software quality with an automated build process

MPI / ClusterTools Update and Plans

ABAQUS High Performance Computing Environment at Nokia

Oracle Grid Engine. Installation and Upgrade Guide Release 6.2 Update 7 E

GRID Computing: CAS Style

Kaspersky Endpoint Security 8 for Linux INSTALLATION GUIDE

Grid Engine Training Introduction

CycleServer Grid Engine Support Install Guide. version 1.25

User s Manual

Interoperability between Sun Grid Engine and the Windows Compute Cluster

Storage Management for the Oracle Database on Red Hat Enterprise Linux 6: Using ASM With or Without ASMLib

FleSSR Project: Installing Eucalyptus Open Source Cloud Solution at Oxford e- Research Centre

KonyOne Server Installer - Linux Release Notes

Sun Grid Engine Update

SSI-OSCAR Single System Image - Open Source Cluster Application Resources

Software configuration management

Acronis Backup & Recovery 10 Server for Linux. Installation Guide

IBM Platform Computing : infrastructure management for HPC solutions on OpenPOWER Jing Li, Software Development Manager IBM

HPSA Agent Characterization

Intellicus Enterprise Reporting and BI Platform

Grid Engine Users Guide p1 Edition

Architecting for the next generation of Big Data Hortonworks HDP 2.0 on Red Hat Enterprise Linux 6 with OpenJDK 7

Sun Grid Engine, a new scheduler for EGEE middleware

A Smart Way to Manage Packages in Yocto Project

An Open Source SCADA Toolkit

Apache 2.0 Installation Guide

IBM WebSphere Application Server Version 7.0

IBM Endpoint Manager Version 9.1. Patch Management for Red Hat Enterprise Linux User's Guide

CPSC 491. Today: Source code control. Source Code (Version) Control. Exercise: g., no git, subversion, cvs, etc.)

An Introduction to Mercurial Version Control Software

Red Hat Network Satellite (On System z) 18-JUNE CAVMEN Meeting

Analysis and Implementation of Cluster Computing Using Linux Operating System

Sample copy. Introduction To WebLogic Server Property of Web 10.3 Age Solutions Inc.

Acronis Backup & Recovery 10 Server for Linux. Update 5. Installation Guide

Automated Deployment of Oracle RAC Using Enterprise Manager Provisioning Pack

Oracle Net Service Name Resolution

ovirt self-hosted engine seamless deployment

INF-110. GPFS Installation

Continuous Integration and Automatic Testing for the FLUKA release using Jenkins (and Docker)

OFBiz Addons goals, howto use, howto manage. Nicolas Malin, Nov. 2012

JAMF Software Server Installation and Configuration Guide for OS X. Version 9.2

The SUN ONE Grid Engine BATCH SYSTEM

Best Practices for Deploying and Managing Linux with Red Hat Network

Streamline Computing Linux Cluster User Training. ( Nottingham University)

TUT19344 Managing RH/CentOS with SUSE Manager

Cisco ISE Command-Line Interface

Cloud Computing Solutions for Genomics Across Geographic, Institutional and Economic Barriers

Installing and Configuring Websense Content Gateway

Cloud Computing for Control Systems CERN Openlab Summer Student Program 9/9/2011 ARSALAAN AHMED SHAIKH

GroundWork Monitor Open Source Installation Guide

Continuous Integration. CSC 440: Software Engineering Slide #1

openmosix Live free() or die() A short intro to HPC Kris Buytaert buytaert@stone-it.be First Prev Next Last Go Back Full Screen Close Quit

INTRODUCTION TO CLOUD MANAGEMENT

My DevOps Journey by Billy Foss, Engineering Services Architect, CA Technologies

SRNWP Workshop. HP Solutions and Activities in Climate & Weather Research. Michael Riedmann European Performance Center

The Penguin in the Pail OSCAR Cluster Installation Tool

Intellicus Cluster and Load Balancing- Linux. Version: 7.3

Virtualization Management the ovirt way

Grid Engine Administration. Overview

Cisco UCS CPA Workflows

Transcription:

Sun Grid Engine Package for OSCAR A Google SoC 2005 Project Babu Sundaram, Barbara Chapman University of Houston Bernard Li, Mark Mayo, Asim Siddiqui, Steven Jones Canada s Michael Smith Genome Sciences Centre

Sun Grid Engine Distributed resource management and batch job queuing software Increase cluster utilization to maximum Precise control over resource usage, supports sophisticated scheduling policies Widely deployed at major institutions UH (COE) has a SGE cluster (~250 nodes) Open source software, community effort gridengine.sunsource.net

Typical SGE setup

The OSCAR Project a snapshot of the best known methods for building, programming, and using HPC clusters Easy to install software bundle Everything needed to install, build, maintain and use a Linux cluster Supports various distros such as Red Hat Enterprise Linux (and clones), Fedora Core, Mandriva Linux on x86, ia64, x86_64 architectures http://oscar.openclustergroup.org

What is an OSCAR Package? <packages dir> <package_name> * - mandatory config.xml * doc RPMS SRPMS scripts testing

OSCAR Package details config.xml XML file indicating package details, its version, dependencies (e.g., sge, ksh) and OS-, client-specific rpmlists doc Mostly help and README files RPMS pre-compiled binaries as RPMs SRPMS to allow building on other platforms testing tests after package installation

OSCAR Package scripts OSCAR framework recognizes a standard set of scripts and they have definitive purpose Seq# Script Name Description 1 setup Perform any package setup 2 pre_configure Prepare package config (dynamic user input) 3 post_configure Process results from package config 4 post_server_rpm_install Perform out of RPM operations on server 5 post_client_rpm_install Perform out of RPM operations on client 6 post_clients For configurations with knowledge about nodes 7 post_install For final config with fully install/booted nodes

OSCAR Package Configuration configurator.html - page with configuration settings to be used during the Configure Selected OSCAR Packages step Values stored in.configurator.values and used by scripts for setup

SGE Package for OSCAR Lots of interest for SGE OSCAR Package Provides an alternative Resource Manager to TORQUE Sets up SGE as part of cluster deployment or add-on after initial deployment

Tasks in SGE package creation Source RPM generation Binary RPM generation Server-, client- and GUI-specific RPMs Develop OSCAR configuration and scripts Implementation, Licensing, Documentation

RPM generation for SGE Source RPM generation was our first step SGE source rpm for version 6.0 update 4 At that time, ScalableSystems had a release ready Now, we have SRPM and RPM based on update 8 Some patches were identified earlier on and some were added later for correct compilation qtcsh, inst_sge, aimk, distinst, qmon icons Spec file modification and SGE binary RPM generation

Scripts for SGE-OSCAR Automates SGE install on the OSCAR cluster All perl scripts post_server_install Configures the overall SGE setup; Sets up SGE master with various values for the options SGE_ROOT, CELLNAME, FULLSERVER, GIDRANGE, SPOOLTYPE, PORTS oscar_cluster..conf is a file that gets generated at this stage to drive inst_sge m auto User input/customization happens at this stage (configurator.html) At the end of this step, the qmaster is up and running on the OSCAR head node post_clients Gets executed after clients are defined (not installed) Adds clients as admin hosts so they can be setup as exec hosts later get_machine_listing(); then, qconf ah $hostname;

SGE OSCAR scripts cont post_install All actions that can be done only after a full cluster install happen in this step qmaster already knows about the clients (from the definition step) and they are already admin hosts All settings (dir: cell_name) gets tarred and ready to get pushed to the clients during post_install Cannot assume NFS; So, the cell_name_dir.tar gets pushed to the clients and untarred Clients now know about the qmaster details Automated install of inst_sge x (patched in spec); Executed via cexec over ssh post_server_rpm_uninstall, post_client_rpm_uninstall Not much SGE-specific functionality, but there to allow clean SGE uninstall

Implementation details OSCAR s Subversion repository for code revision control http://svn.oscar.openclustergroup.org/oscar/ tmp/soc/sge Initial implementation was on FC2 x86 Basic tools involved: rpm, make, perl, diff/patch OSCAR-specific code is under GPL; SGE under SISSL

Where is the code now? Code integrated into OSCAR trunk, to be released in 5.0 Supported by all distributions on x86 and x86_64 (except for Mandriva) Parallel Environment integration: LAM/MPI, PVM, MPICH, Open MPI (only setup if parallel libraries are installed)

Acknowledgements Google Inc., OSCAR developers SGE developers (Ron, Fritz, Andreas ) Chandler Wilkerson, LAN admin, CS, UH ScalableSystems