SGE Roll: Users Guide

Published Aug 2006
Copyright 2006 UC Regents, Scalable Systems
Table of Contents

Preface
1. Requirements
   1.1. Rocks Version
   1.2. Other Rolls
2. Installing the SGE Roll
   2.1. Adding the Roll
3. Using the SGE Roll
   3.1. How to use SGE
   3.2. Setting the SGE environment
   3.3. Submitting Batch Jobs to SGE
   3.4. Monitoring SGE Jobs
   3.5. Managing SGE queues
4. Copyrights
   4.1. Sun Grid Engine
Preface

The SGE Roll installs and configures the SUN Grid Engine scheduler. Please visit the SGE site to learn more about their release and the individual software components.
Chapter 1. Requirements

1.1. Rocks Version

The SGE Roll is for use with the Rocks release code-named Hallasan.

1.2. Other Rolls

The SGE Roll does not require any other Rolls (other than the HPC Roll) to be installed on the Frontend. Compatibility has been verified with the following Rolls:

HPC
Grid
Chapter 2. Installing the SGE Roll

2.1. Adding the Roll

The SGE Roll must be installed during the Frontend installation step of your cluster (refer to section 1.2 of the Rocks users guide). Future releases will allow the installation of the SGE Roll onto a running system.

The SGE Roll is added to a Frontend installation in exactly the same manner as the required HPC Roll. Specifically, after the HPC Roll is added the installer will once again ask if you have a Roll. Select Yes and insert the SGE Roll.
Chapter 3. Using the SGE Roll

3.1. How to use SGE

This section tells you how to get started using Sun Grid Engine (SGE). SGE is distributed resource management software; it allows the resources within the cluster (CPU time, software, licenses, etc.) to be used effectively. The SGE Roll also sets up Sun Grid Engine so that NFS is not needed for its operation. This provides a more scalable setup, but it does mean losing the high-availability benefits that an SGE-with-NFS setup offers. In addition, the Roll sets up generic queues automatically as new nodes are integrated into the Rocks cluster and booted up.

3.2. Setting the SGE environment

When you log into the cluster, the SGE environment will already have been set up for you. The SGE commands should have been automatically added to your $PATH.

[sysadm1@frontend-0 sysadm1]$ echo $SGE_ROOT
/opt/gridengine
[sysadm1@frontend-0 sysadm1]$ which qsub
/opt/gridengine/bin/glinux/qsub
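For reference only, and not something the Roll requires you to do, the SGE environment can also be loaded manually by sourcing the settings file that a standard SGE installation keeps under $SGE_ROOT. The sketch below assumes the Roll's SGE_ROOT of /opt/gridengine shown above and the default cell directory name "default".

# Hypothetical manual setup; the Roll normally does this for you at login.
# Assumes SGE_ROOT=/opt/gridengine and the default cell name "default".
. /opt/gridengine/default/common/settings.sh
echo $SGE_ROOT      # should print /opt/gridengine
which qsub          # should resolve to a path under $SGE_ROOT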
3.3. Submitting Batch Jobs to SGE

Batch jobs are submitted to SGE via scripts. Here is an example of a serial job script, sleep.sh (see examples/sleep.sh). It simply executes the sleep command.

[sysadm1@frontend-0 sysadm1]$ cat sleep.sh
#!/bin/bash
#
#$ -cwd
#$ -j y
#$ -S /bin/bash
#
date
sleep 10
date

Entries which start with #$ will be treated as SGE options:

-cwd means to execute the job from the current working directory.
-j y means to merge the standard error stream into the standard output stream instead of having two separate error and output streams.
-S /bin/bash specifies the interpreting shell for this job to be the Bash shell.

To submit this serial job script, use the qsub command.

[sysadm1@frontend-0 sysadm1]$ qsub sleep.sh
your job 16 ("sleep.sh") has been submitted

For a parallel MPI job script, take a look at linpack.sh (see examples/linpack.sh). Note that you need to put the two SGE variables, $NSLOTS and $TMP/machines, within the job script.

[sysadm1@frontend-0 sysadm1]$ cat linpack.sh
#!/bin/bash
#
#$ -cwd
#$ -j y
#$ -S /bin/bash
#
MPI_DIR=/opt/mpich/gnu/
HPL_DIR=/opt/hpl/mpich-hpl/
# OpenMPI part. Uncomment the following code and comment the above code
# to use OpenMPI rather than MPICH
# MPI_DIR=/opt/openmpi/
# HPL_DIR=/opt/hpl/openmpi-hpl/
$MPI_DIR/bin/mpirun -np $NSLOTS -machinefile $TMP/machines \
    $HPL_DIR/bin/xhpl

The command to submit an MPI parallel job script is similar to submitting a serial job script, but you will need to use the -pe mpich N option, where N refers to the number of processes that you want to allocate to the MPI program. Here is an example of submitting a two-process linpack run using this HPL.dat file (see examples/hpl.dat):

[sysadm1@frontend-0 sysadm1]$ qsub -pe mpich 2 linpack.sh
your job 17 ("linpack.sh") has been submitted

If you need to delete an already submitted job, you can use qdel, given its job id. Here is an example of deleting a fluent job under SGE:

[sysadm1@frontend-0 sysadm1]$ qsub fluent.sh
your job 31 ("fluent.sh") has been submitted
[sysadm1@frontend-0 sysadm1]$ qstat
job-id prior name      user    state submit/start at queue      master ja-task-id
31           fluent.sh sysadm1 t     12/24/ :10:28   comp-pvfs- MASTER
[sysadm1@frontend-0 sysadm1]$ qdel 31
sysadm1 has registered the job 31 for deletion
[sysadm1@frontend-0 sysadm1]$ qstat
[sysadm1@frontend-0 sysadm1]$
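As an extra illustration that is not part of the original guide, the sketch below adds two further standard qsub directives to the serial example: -N gives the job an explicit name shown by qstat, and -o writes the (merged, because of -j y) output stream to a chosen file. The script name hello.sh and its commands are placeholders.

#!/bin/bash
#
#$ -cwd
#$ -j y
#$ -S /bin/bash
#$ -N hello            # explicit job name shown by qstat
#$ -o hello.out        # write the merged output stream to this file
#
# Placeholder workload for illustration only.
echo "Running on $(hostname)"
date

It would be submitted in the same way as the serial example above, e.g. qsub hello.sh.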
Although the example job scripts are bash scripts, SGE can also accept other types of shell scripts. It is trivial to wrap serial programs into an SGE job script. Similarly, for MPI parallel jobs, you just need to use the correct mpirun launcher and add the two SGE variables, $NSLOTS and $TMP/machines, within the job script. For parallel jobs other than MPI, a Parallel Environment (PE) needs to be defined. This is covered within the SGE documentation.

3.4. Monitoring SGE Jobs

To monitor jobs under SGE, use the qstat command. When executed with no arguments, it will display a summarized list of jobs.

[sysadm1@frontend-0 sysadm1]$ qstat
job-id prior name     user    state submit/start at queue      master ja-task-id
             sleep.sh sysadm1 t     12/23/ :22:09   frontend-0 MASTER
21     0     sleep.sh sysadm1 t     12/23/ :22:09   frontend-0 MASTER
22     0     sleep.sh sysadm1 qw    12/23/ :22:06

Use qstat -f to display a more detailed list of jobs within SGE.

[sysadm1@frontend-0 sysadm1]$ qstat -f
queuename        qtype used/tot. load_avg arch   states
comp-pvfs-0-0.q  BIP   0/                 glinux
comp-pvfs-0-1.q  BIP   0/                 glinux
comp-pvfs-0-2.q  BIP   0/                 glinux
frontend-0.q     BIP   2/                 glinux
   23 0 sleep.sh sysadm1 t 12/23/ :23:40 MASTER
   24 0 sleep.sh sysadm1 t 12/23/ :23:40 MASTER
############################################################################
 - PENDING JOBS - PENDING JOBS - PENDING JOBS - PENDING JOBS - PENDING JOBS
############################################################################
   25 0 linpack.sh sysadm1 qw 12/23/ :23:32
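As a small addition that is not in the original guide, qstat output can be combined with ordinary shell tools. For example, the one-liner below counts jobs currently waiting in the qw state; it assumes the default qstat column layout shown above.

# Count pending (qw) jobs by matching the state column in the default output.
qstat | grep -c " qw "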
You can also use qstat to query the status of a job, given its job id. For this, use the -j N option, where N is the job id.

[sysadm1@frontend-0 sysadm1]$ qsub -pe mpich 1 single-xhpl.sh
your job 28 ("single-xhpl.sh") has been submitted
[sysadm1@frontend-0 sysadm1]$ qstat -j 28
job_number:             28
exec_file:              job_scripts/28
submission_time:        Wed Dec 24 01:00:
owner:                  sysadm1
uid:                    502
group:                  sysadm1
gid:                    502
sge_o_home:             /home/sysadm1
sge_o_log_name:         sysadm1
sge_o_path:             /opt/sge/bin/glinux:/usr/kerberos/bin:/usr/local/bin:/bin:/usr/bin:/usr/x
sge_o_mail:             /var/spool/mail/sysadm1
sge_o_shell:            /bin/bash
sge_o_workdir:          /home/sysadm1
sge_o_host:             frontend-0
account:                sge
cwd:                    /home/sysadm1
path_aliases:           /tmp_mnt/ * * /
merge:                  y
mail_list:              [email protected]
notify:                 FALSE
job_name:               single-xhpl.sh
shell_list:             /bin/bash
script_file:            single-xhpl.sh
parallel environment:   mpich range: 1
scheduling info:        queue "comp-pvfs-0-1.q" dropped because it is temporarily not available
                        queue "comp-pvfs-0-2.q" dropped because it is temporarily not available
                        queue "comp-pvfs-0-0.q" dropped because it is temporarily not available

3.5. Managing SGE queues

To display a list of queues within the Rocks cluster, use qconf -sql.

[sysadm1@frontend-0 sysadm1]$ qconf -sql
comp-pvfs-0-0.q
comp-pvfs-0-1.q
comp-pvfs-0-2.q
frontend-0.q

If there is a need to disable a particular queue for some reason, e.g. to schedule that node for maintenance, use qmod -d Q, where Q is the queue name. You will need to be an SGE manager, such as the root account, in order to disable a queue. You can also use wildcards to select a particular range of queues.

[sysadm1@frontend-0 sysadm1]$ qstat -f
queuename        qtype used/tot. load_avg arch   states
comp-pvfs-0-0.q  BIP   0/                 glinux
comp-pvfs-0-1.q  BIP   0/                 glinux
comp-pvfs-0-2.q  BIP   0/                 glinux
frontend-0.q     BIP   0/                 glinux
[sysadm1@frontend-0 sysadm1]$ su -
Password:
[root@frontend-0 root]# qmod -d comp-pvfs-0-0.q
Queue "comp-pvfs-0-0.q" has been disabled by [email protected]
[root@frontend-0 root]# qstat -f
queuename        qtype used/tot. load_avg arch   states
comp-pvfs-0-0.q  BIP   0/                 glinux d
comp-pvfs-0-1.q  BIP   0/                 glinux
comp-pvfs-0-2.q  BIP   0/                 glinux
frontend-0.q     BIP   0/                 glinux

To enable the queue again, you can use qmod -e Q. Here is an example of Q being specified as a range of queues via wildcards.

[root@frontend-0 root]# qmod -e comp-pvfs-*
Queue "comp-pvfs-0-0.q" has been enabled by [email protected]
root - queue "comp-pvfs-0-1.q" is already enabled
root - queue "comp-pvfs-0-2.q" is already enabled
[root@frontend-0 root]# qstat -f
queuename        qtype used/tot. load_avg arch   states
comp-pvfs-0-0.q  BIP   0/                 glinux
comp-pvfs-0-1.q  BIP   0/                 glinux
comp-pvfs-0-2.q  BIP   0/                 glinux
frontend-0.q     BIP   0/                 glinux
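Finally, as an illustration that is not part of the original guide, the qconf and qmod commands above can be combined in a small shell loop to disable every queue before cluster-wide maintenance. It assumes you are running as an SGE manager such as root.

# Illustrative sketch: disable every queue listed by qconf -sql.
# Run as an SGE manager (e.g. root).
for q in $(qconf -sql); do
    qmod -d "$q"
done

Re-enabling afterwards would use the same loop with qmod -e, or a wildcard as shown above.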
For more information on using SGE, please refer to the SGE documentation and the man pages.

Chapter 4. Copyrights

4.1. Sun Grid Engine

The project uses the Sun Industry Standards Source License, or SISSL. This license is recognized as a free and open license by the Free Software Foundation and the Open Source Initiative, respectively. It is a good license where interoperability and commercial considerations are important. Under the SISSL a user may do what they like with the source base, such as modifying it and extending it, but the user must maintain compatibility if they want to provide their modified version to others. It is not mandatory for the user to contribute code changes back to the project, though of course this is encouraged. If a user wishes to make available a modified version which is not compatible with the project, the license requires that the licensee provide a reference implementation of the sources which constitute the modification, thereby opening the details of any incompatibility or modification which has been introduced. Additionally, the non-compliant modified version cannot use the Grid Engine name.