Grid Engine Training Introduction




Grid Engine Training Jordi Blasco (jordi.blasco@xrqtc.org) 26-03-2012

Agenda
1. How it works?
2. History, current status and future; about the Grid Engine version of this training; documentation
3. Grid Engine internals: what it provides, features, key terms & concepts, tools

How it works?

A Batch Queue System (BQS) is a software application in charge of unattended background executions, commonly known for historical reasons as batch processing. (Source: Wikipedia)

What is a queue?
A queue offers a set of resources for similar jobs. Queues usually have limits so that the computational resources are used efficiently. Only a few BQS can control consumable resources, such as a limit on concurrent licenses.

How it works?
- Users submit their jobs with qsub, describing in detail the resources they need (memory, CPU time, disk, number of cores, licenses, ...).
- The manager registers the job.
- When all the resources are available, the manager sends the job to the execution nodes, following complex allocation rules (priority, urgency, etc.).
- Some BQS need a more complex scheduler layer, such as Maui or Moab Cluster Suite.
- Users can view the job status with qstat.
- Users can delete their jobs with qdel.
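The submission step above can be sketched as a small Python helper that only assembles a qsub command line; it does not run anything. The flags -N, -cwd, -l and -pe are standard qsub options, and h_rt and h_vmem are the usual Grid Engine resource names for wall-clock time and virtual memory, but whether a site accepts them depends on its configuration. The function name, the job script name and the parallel environment name "smp" are assumptions made for illustration.

```python
from typing import List, Optional


def build_qsub_command(
    script: str,
    name: str,
    runtime: str,
    memory: str,
    slots: int = 1,
    parallel_env: Optional[str] = None,
) -> List[str]:
    """Assemble a qsub command line for a job script (nothing is executed)."""
    # -N sets the job name, -cwd runs the job in the submission directory.
    cmd = ["qsub", "-N", name, "-cwd"]
    # Resource requests: wall-clock time limit and virtual memory limit.
    cmd += ["-l", f"h_rt={runtime}", "-l", f"h_vmem={memory}"]
    # Multi-slot jobs must name a parallel environment defined by the admin.
    if slots > 1:
        if parallel_env is None:
            raise ValueError("a parallel environment is required for slots > 1")
        cmd += ["-pe", parallel_env, str(slots)]
    cmd.append(script)
    return cmd


if __name__ == "__main__":
    # "job.sh" and the "smp" parallel environment are hypothetical names.
    print(" ".join(build_qsub_command("job.sh", "demo", "01:00:00", "2G", 4, "smp")))
```

On a real cluster the resulting list could be passed to subprocess.run; qstat and qdel would then be used with the job ID that qsub reports.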

History
- Previously known as CODINE (COmputing in DIstributed Networked Environments).
- In 2000, Sun acquired Gridware, Inc.
- In 2001, Sun made the source code available.
- In 2010, Oracle acquired Sun.
- In December 2010, Oracle announced that Grid Engine would no longer be freely available as an open-source product.
- In response, the Grid Engine community started to develop forked versions of Grid Engine.

Grid Engine Forks

Open Source and Free Grid Engine Forks
- OGS: Open Grid Scheduler
- SGE: Son of Grid Engine

Commercially Supported and Licensed Grid Engine Forks
- UGE: Univa Grid Engine
- OGS: Scalable Logic (Open Grid Scheduler)
- OGE: Oracle Grid Engine

Comparing Grid Engine at 1Q'12

                 UGE         OGE         SGE        OGS
Base Core        6.2U5       6.2U7       6.2U5 [a]  6.2U5
Current Version  8.0.1       6.2U5p2     8.0.0d     GE-2011.11
License          Commercial  Commercial  SISSL      SISSL
Support level    Enterprise  Enterprise  Community  Community* [b]
Activity         high        medium      medium     high
Source Access    yes*        yes*        yes        yes
Public Roadmap   yes         no          no         no

[a] Since 2011-09-29, SGE uses the Univa Grid Engine 8.0.0 source code as its core.
[b] In November 2011, Scalable Logic announced its intent to provide commercial support and consulting.

We suggest reading the article "Which Grid Engine" by Chris Dagdigian, consultant at BioTeam: http://www.bio-itworld.com/2012/02/15/which-grid-engine.html

What now?

Which flavor of Grid Engine will we use for training?
We will use Open Grid Scheduler to develop this training, because:
- It's 100% Open Source.
- It follows the same licensing policy as Sun (SISSL).
- If you need help, commercial support is available.
- The enterprise and the community versions are the same.
- The developers are active members of the mailing list.
- And... we have to choose only one :-)

Docs & Info
- http://arc.liv.ac.uk/sge/
- http://gridscheduler.sourceforge.net/documentation.html
- http://bioteam.net/2009/09/sge-training-slides/
- http://www.hpckp.org (coming soon)

What does Grid Engine provide?
- The BQS and the scheduler come in the same package
- You don't need extra software for scheduling
- Detailed job accounting
- Fine-grained allocation of computing resources to users
- Suspending, resuming and migrating jobs
- Checkpointing integration
- Parallel environment integration
- Job arrays
- Awesome policies to share resources
- APIs to easily develop third-party software
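As an illustration of the job arrays feature, here is a minimal Python sketch of how one task of an array job might choose its own input. Grid Engine exports the SGE_TASK_ID environment variable to each task of an array job (submitted with qsub -t); the function name and the input file names are assumptions made for this example.

```python
import os
from typing import List, Mapping


def pick_input(inputs: List[str], env: Mapping[str, str] = os.environ) -> str:
    """Return the input assigned to this array task (task IDs start at 1 here)."""
    raw = env.get("SGE_TASK_ID", "")
    # When SGE_TASK_ID is missing or not a number (for example when the
    # script runs outside an array job), fall back to the first task.
    task_id = int(raw) if raw.isdigit() else 1
    return inputs[task_id - 1]


if __name__ == "__main__":
    # Hypothetical input files, one per array task (qsub -t 1-3 ...).
    files = ["sample_a.dat", "sample_b.dat", "sample_c.dat"]
    print(pick_input(files))
```

With this pattern a single job script covers many inputs, and the scheduler treats each task as an independently schedulable unit.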

Most awesome features
- The BQS and the scheduler are integrated
- Job preemption
- Portable Hardware Locality library (hwloc) support
- GPU support
- ARM Linux port available
- Linux kernel 3.0 support

Key Terms
- Cluster
- Execution host
- Master (Shadow)
- Submit host
- Admin host
- Daemons (sge_qmaster, sge_execd)
- Queue instance
- Slots
- Jobs

Key Concepts
- Users describe the resources they need, and GE looks for the best host and queue instance.
- You don't have to send jobs to a particular queue.
- Each node can have one or more queue instances.
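The idea of matching a resource request against queue instances can be sketched with a toy Python model. This is not Grid Engine's actual scheduling algorithm, which also weighs load, policies and priorities; the "most free slots" rule, the class names, and the queue and host names below are assumptions chosen only to make the concept concrete. The queue@host naming follows the way Grid Engine labels queue instances.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class QueueInstance:
    queue: str        # cluster queue name (hypothetical)
    host: str         # execution host name (hypothetical)
    free_slots: int
    free_mem_gb: float

    @property
    def name(self) -> str:
        # A queue instance is one cluster queue on one host: queue@host.
        return f"{self.queue}@{self.host}"


@dataclass
class Request:
    slots: int
    mem_gb: float


def pick_instance(
    request: Request, instances: List[QueueInstance]
) -> Optional[QueueInstance]:
    """Return a queue instance that satisfies the request, or None."""
    candidates = [
        qi for qi in instances
        if qi.free_slots >= request.slots and qi.free_mem_gb >= request.mem_gb
    ]
    if not candidates:
        return None  # nothing fits yet: the job would stay pending
    # Toy "best" rule: prefer the instance with the most free slots.
    return max(candidates, key=lambda qi: qi.free_slots)


if __name__ == "__main__":
    # node01 carries two queue instances, one per cluster queue.
    instances = [
        QueueInstance("short.q", "node01", 2, 8.0),
        QueueInstance("long.q", "node01", 4, 8.0),
        QueueInstance("short.q", "node02", 8, 32.0),
    ]
    chosen = pick_instance(Request(slots=4, mem_gb=16.0), instances)
    print(chosen.name if chosen else "pending")
```

The user only states what the job needs; the choice of queue and host is left to the system.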

Key Terms & Concepts (figure; source: www.bioteam.net)

Setting up Grid Engine
Grid Engine is set up with qmon (GUI) and qconf (CLI).

Core user commands
- qstat & qhost: tools for monitoring.
- qsub: job submission tool.

Core admin commands
- qconf: admin tool for adding, changing and configuring the Grid Engine system.
- qstat & qhost: tools for monitoring.
- qmod: modify or disable an existing queue, clear error states, etc.
- qalter: change the attributes of a pending job.

Features missing from the main project code
- Accounting web interface (S-GAE - RDLab)
- Monitoring web interface (PHPQstat - XRQTC)
- GPU integration (Jose Alcantara scripts - XRQTC)
- Power control (CLUES - GRyCAP)
- Efficiency control (new XRQTC/HPCKP contribution)
- Dynamic quota (new XRQTC/HPCKP contribution)


References
- Sun Grid Engine Installation Guide
- Sun Grid Engine Administrator Guide
- Sun Grid Engine User Guide
- http://bioteam.net
- http://www.univa.com
- https://arc.liv.ac.uk/trac/sge
- http://gridengine.org
- http://gridscheduler.sourceforge.net
- http://gridengine.info

Let's go to Hands-On 1 (Install)