Caltech Center for Advanced Computing Research System Guide: MRI2 Cluster (zwicky) January 2014


1. How to Get An Account

CACR Accounts

2. How to Access the Machine

Connect to the front end, zwicky.cacr.caltech.edu:

   ssh -l username zwicky.cacr.caltech.edu
   or
   ssh username@zwicky.cacr.caltech.edu

Edits, compiles, builds, and job submissions are done on the front end.

NOTE: Password-based authentication is not supported for connecting to the cluster; ssh public key authentication is the only way to connect. If you do not have an ssh keypair, please see the SSH key generation instructions.
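If you do not yet have a keypair, a minimal sketch of creating one on a Linux or macOS client follows; the key type, size, and comment are illustrative assumptions, and the public key must be installed as described in the SSH key generation instructions:

   # Generate a keypair on your local machine (accept the default path,
   # choose a passphrase); the key type and size here are only an example
   ssh-keygen -t rsa -b 4096 -C "username@example.edu"

   # After the *public* key (~/.ssh/id_rsa.pub) has been installed per the
   # SSH key generation instructions, connect as usual:
   ssh username@zwicky.cacr.caltech.edu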

3. Technical Summary & System Configuration Information

The MRI2 cluster, zwicky, is specifically configured to meet the needs of applications in Caltech's Theoretical Astrophysics group, in particular the simulation of black holes and other extreme spacetimes. The configuration, integrated by Hewlett-Packard and CACR's technical support team, consists of 2244 Intel X5650 compute cores plus 320 Intel E-series cores, connected via QDR InfiniBand. 160 TB of parallel file system space (Panasas PAS11) is mounted over InfiniBand. Typical applications running on zwicky are MPI based. Queuing policies allow for development and production cycles and are flexible enough to meet the project PIs' requirements.

Architecture
   Intel Westmere, RedHat Linux Cluster

Head Node
   2 Intel X5650 2.66 GHz hex-core processors - 12 cores
   48 GB ECC SDRAM

Compute Nodes
   187 dual-processor Intel X5650 2.66 GHz hex-core nodes
      2 GB ECC SDRAM/core - 24 GB/node
   20 dual-processor Intel E-series 8-core nodes
      64 GB/node 12800R ECC SDRAM
   2308 compute cores, 4.6 TB memory

Network Interconnect
   11 Voltaire InfiniBand 4x QDR 36-port managed switches

Storage
   40 GB local scratch/node
   ~160 TB, supplying /home and /panfs/ds06/sxs
   project work area, 44 TB:
      o /nfs/ds0[1,3,4]/sxs
      o /nfs/ds0[3,4]/sxs_bbhdata
   archival area, 180 TB:
      o /nfs/as[01..12]/sxs

Batch System
   Torque with Maui

Operating System
   RHEL 5.9

Compilers
   GNU, PGI, Open64, Intel

MPI
   Open MPI

4. Available Filesystems and Descriptions/Intended Usage

4.a Filesystems available on head node, compute nodes, and archival storage nodes:

   /home                  - RAID5/1  backed up
   /nfs/ds01/sxs          - RAID6    not backed up
   /nfs/ds03/sxs          - RAID6    not backed up
   /nfs/ds03/sxs_bbhdata  - RAID6    not backed up
   /nfs/ds04/sxs          - RAID5    not backed up
   /nfs/ds04/sxs_bbhdata  - RAID5    not backed up
   /panfs/ds06/sxs        - RAID5/1  not backed up

4.b Filesystems available on compute nodes only:

   /scratch - new filesystem created after each job (~40 GB), local to each compute node

4.c Filesystems available on zwicky head and archival storage nodes (tier2-storage-[a,b].cacr.caltech.edu):

   /nfs/as[01..12]/sxs    - RAID6/1  not backed up
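As a quick orientation from the head node, the standard Linux df and ls commands show how these areas are laid out; the username joeuser below is a placeholder:

   # Show size and current usage of the home and Panasas work areas
   df -h /home /panfs/ds06/sxs

   # Confirm your per-user directory in the Panasas work area
   ls -ld /panfs/ds06/sxs/joeuser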

All the filesystems listed above, except for /nfs/*/sxs_bbhdata and /scratch, have a per-user directory for each sxs project member.

Notes regarding archival storage nodes:

1. Currently, access to the archival storage nodes is available from the zwicky cluster as well as from external nodes via 'ssh' (as for the zwicky head node), e.g.

      ssh USERNAME@tier2-storage-b.cacr.caltech.edu

2. Please move all data that will not be immediately needed to the archival storage areas. Appropriate commands are:

      cp -r /panfs/ds06/sxs/username/yourfromdirectory/. \
            /nfs/as01/sxs/username/yourtodirectory

      ( cd /panfs/ds06/sxs/username/yourfromdirectory && tar cf - . ) | \
      ( cd /nfs/as02/sxs/username/yourtodirectory && tar xf - )

   as well as variants of 'rsync' and any other Linux copy command.

3. Remote archival area: /nfs/as[01..12]/sxs/<username>. For example: /nfs/as04/sxs/joeuser

Archival storage is currently accessible from the zwicky head node 'zwicky.cacr.caltech.edu' as well as from the dedicated archival access node 'tier2-storage.cacr.caltech.edu', which is an alias directing to tier2-storage-a.cacr.caltech.edu or tier2-storage-b.cacr.caltech.edu. This archival area is NOT currently available from the zwicky compute nodes.

Due to its connectivity, the performance of file copies between /panfs/* and the /nfs/as[01..12] areas will be much higher if done on 'tier2-storage.cacr.caltech.edu' than if done on 'zwicky.cacr.caltech.edu'. This area is intended for archival storage and, for performance reasons, is not meant to be used by MPI programs. Low-I/O MPI access from the zwicky head node is acceptable but should be limited.
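Note 2 above also mentions rsync; a minimal sketch of an equivalent copy into the archival area (the flags shown are an illustrative choice, not a site recommendation):

   # Mirror a results directory into the archival area, preserving
   # permissions and timestamps; -a = archive mode, -v = verbose
   rsync -av /panfs/ds06/sxs/username/yourfromdirectory/ \
             /nfs/as01/sxs/username/yourtodirectory/

   # Run this on tier2-storage.cacr.caltech.edu for best performance,
   # as noted above.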

Example 1:

   ssh tier2-storage.cacr.caltech.edu
   tier2-storage-a: pwd
   /home/joeuser
   tier2-storage-a: cd /panfs/ds06/sxs/joeuser
   tier2-storage-a: pwd
   /panfs/ds06/sxs/joeuser
   tier2-storage-a: cp filea /nfs/as01/sxs/joeuser
   tier2-storage-a: ls -l /nfs/as01/sxs/joeuser
   total 16
   -rw-r--r--  1 joeuser joeuser       Jul 27 10:23 filea

Example 2:

   ssh tier2-storage.cacr.caltech.edu
   tier2-storage-b: pwd
   /home/joeuser
   tier2-storage-b: cd /panfs/ds06/sxs/joeuser
   tier2-storage-b: ls -al
   total 64
   drwxrwsr-x  3 joeuser sxs     4096 Jul 27 10:37 ./
   drwxr-xr-x 67 root    root         Jul 11 14:35 ../
   drwxr-sr-x  2 joeuser sxs     4096 Jul 27 10:36 DirectoryA/
   tier2-storage-b: ls -al /nfs/as01/sxs/joeuser
   total 24
   drwxrwxr-x  2 joeuser joeuser 4096 Jul 27 10:37 ./
   drwxrwsr-x 67 root    root    4096 Jul 25 07:53 ../
   -rw-r--r--  1 joeuser joeuser      Jul 27 10:34 oldfile
   tier2-storage-b: tar cf - DirectoryA | ( cd /nfs/as01/sxs/joeuser && tar xpf - )
   tier2-storage-b: ls -al /nfs/as01/sxs/joeuser
   total 28
   drwxrwxr-x  3 joeuser joeuser 4096 Jul 27 10:37 ./
   drwxrwsr-x 67 root    root    4096 Jul 25 07:53 ../
   drwxr-sr-x  2 joeuser joeuser 4096 Jul 27 10:36 DirectoryA/
   -rw-r--r--  1 joeuser joeuser      Jul 27 10:34 oldfile

5. Available compilers, debuggers, libraries and other tools

Many useful tools, besides those coming with the standard RHEL distribution, are available on zwicky. 'module avail' will list the currently available packages; below is a typical collection:

   Compilers:     gcc, intel, open64, pgi, nasm
   Debuggers:     inspector, intel, totalview, valgrind
   Languages:     java, python
                  python-pkg: mpi4py, pytz, numarray, scipy, gnuplot, numpy, sip,
                  ipython, pymc, sympy, matplotlib, pyqt
   Libraries:     ATLAS, HDF5, fftw3, qt, mkl
   MPI Libraries/Tools: impi, openmpi, mvapich2, platform_mpi
   Tools:         autoconf, automake, binutils, cmake, dakota, krb, m4,
                  matlab, metis, octave, papi, petsc, svn, wireshark
   Visualization: gnuplot, grace, paraview, tecplot, visit
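Before moving on to the module command itself, here is an illustrative sketch of assembling a build environment from this collection; the package names come from the list above, but the idea of loading exactly these modules is an assumption rather than a site recommendation:

   # Illustrative build environment; check 'module avail' for the
   # versions actually installed before relying on these names
   module add gcc
   module add openmpi
   module add hdf5 fftw3

   # Confirm what is loaded and which compiler wrappers are on the PATH
   module list
   which mpicc mpif90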

5.1 Using Module

The software environment on the MRI2 cluster is managed with the module command. To get started, try 'module help' to see the available arguments.

To see all available packages:
   module avail

To add the default openmpi version:
   module add openmpi

You can see which version is the default for a package by looking at the output of:
   module avail

To use a specific version of openmpi:
   module add openmpi/1.4.1-gcc

The same package cannot be added twice, so when switching between different versions of the same package you will need to remove the loaded version (module del openmpi) before loading the new version (module add openmpi/1.4.1-gcc). Another way of doing this is to use the swap argument:
   module swap openmpi openmpi/1.4.1-gcc

Some packages have dependencies and will report this when a prerequisite is not met:

   module add python
   python/2.6.4(23):error:151: Module 'python/2.6.4' depends on one of the module(s)
   'openmpi/1.4.1-pgi openmpi/1.4.1-intel openmpi/1.4.1-gcc openmpi/1.4.1 openmpi/1.4-pgi
   openmpi/1.4-intel openmpi/1.4-gcc openmpi/1.4 openmpi/1.3.3-pgi openmpi/1.3.3-intel
   openmpi/1.3.3-gcc openmpi/1.3.3'
   python/2.6.4(23):error:102: Tcl command execution failed: prereq openmpi

After adding openmpi (module add openmpi), the module add python will succeed.

To see what modules are in your environment:
   module list

To clear all loaded packages from your environment:
   module clear

To get help for a module:
   module help openmpi

To display what a module will do to your environment:
   module display openmpi

Some users test package versions with the 'using' command in their Makefiles:

   MPIVERSIONREQUIRED=openmpi_1.4.1
   MPIVERSION := $(strip $(shell using | grep mpi))
   ifneq ($(MPIVERSION),$(MPIVERSIONREQUIRED))
   $(error You are using version $(MPIVERSION). \
     You need to be using $(MPIVERSIONREQUIRED) \
     for compilation to work. \
     Please put 'use $(MPIVERSIONREQUIRED)' in your .tcshrc, .cshrc, or .profile \
     [whichever is appropriate for your shell] and try again.)
   endif

With the environment module, the following can be done instead:

   MPIVERSIONREQUIRED=openmpi/1.4.1
   MPIVERSION := $(strip $(shell echo $$LOADEDMODULES | tr ':' '\n' | grep mpi))
   ifneq ($(MPIVERSION),$(MPIVERSIONREQUIRED))
   $(error You are using version $(MPIVERSION). \
     You need to be using $(MPIVERSIONREQUIRED) \
     for compilation to work. \
     Please put 'use $(MPIVERSIONREQUIRED)' in your .tcshrc, .cshrc, or .profile \
     [whichever is appropriate for your shell] and try again.)
   endif

6. Supported Debuggers and Debugging Tips

idb (Intel Debugger)

valgrind - To use valgrind with MPI jobs, run

   [l,h]mpirun -n X /usr/bin/valgrind --log-file=memlog a.out

where memlog is an arbitrary name; you will get one memlog.<pid> file per task with information about memory and pointer usage.

TotalView
   Documentation
      o Examples are located in /usr/local/totalview/toolworks/linux-x86-64/examples
   The license supports debugging of up to 32 threads or processes.
   Example of running TotalView in an interactive session:
      o Compile your code with the -g option.
      o Allocate a couple of interactive nodes for the debugging session:
           qsub -I -x -l nodes=2
        will land you on a compute node, where TotalView can be started:
           totalview &
      o From the TotalView GUI, choose the program to run.
      o Select the "Arguments" tab and fill in command-line arguments.
      o Select the "Parallel" tab and select the parallel system: Open MPI.
      o Choose how many tasks and how many nodes should be used for the debugging session.
      o Click OK to load the code for debugging.

7. Launching, Managing and Priorities of Parallel Jobs

7.1 Launching a job

As stated above, the zwicky cluster is currently a heterogeneous cluster made up of 207 compute nodes: 187 compute nodes each have 12 cores, and twenty compute nodes have 16 cores. Allocating any subset of the twenty E-series nodes (each with 16 cores) requires the special "core16" tag. When requesting compute nodes, core12 nodes are allocated if no core[12,16] tag is specified. See the runme.pbs example below for requesting a mix of X5650 and E-series nodes.

It is expected that users will compile and run small verification tests of their code on the head node, and then launch compute/memory-intensive tests and production runs on the backend (aka compute) nodes.
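A minimal sketch of that head-node workflow for a small MPI test; the source file name and the core count are illustrative:

   # Set up an MPI environment (see 'module avail' for installed versions)
   module add openmpi

   # Build a small MPI test program on the head node
   mpicc -g -o hello_world hello_world.c

   # Quick verification run on the head node only - keep it small;
   # production runs belong on the compute nodes via qsub
   mpirun -np 2 ./hello_world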

Access to the compute nodes is obtained via the Maui queuing software. Maui will allocate the requested nodes to the user for the requested time, based on a priority system. A user can obtain interactive access to a node (or set of nodes), or run a "job script" non-interactively on a node (or set of nodes).

Interactive jobs on the compute nodes are initiated from the head node via the command:

   qsub -I -l nodes=X -l walltime=HH:MM:SS

      X        = number of nodes
      HH:MM:SS = requested time (HH: hours, MM: minutes, SS: seconds)

Non-interactive or batch jobs are submitted via the command:

   qsub -V runme.pbs

where runme.pbs is the name of the batch script to run on the nodes, and the -V option to qsub passes all of your current environment variables to the programs run in the runme.pbs script.

Prior to submitting the job to PBS, set up the path:

   module add openmpi/1.3.3-gcc

(openmpi/1.3.3-gcc was used to compile the MPI program hello_world that is run in the runme.pbs script.)

A simple runme.pbs file could contain:

   #!/bin/csh -f
   # Request 2 nodes for 1 hour
   #PBS -l nodes=2
   #PBS -l walltime=01:00:00
   # Direct stdout/err to files in directories
   # below my home directory
   #PBS -o /home/my_unixid/examples/hello.out
   #PBS -e /home/my_unixid/examples/hello.err
   # To request a mix of core16 and core12 nodes instead:
   # #PBS -l nodes=4:core16+8:core12
   # Display the libraries my program will be using
   # note: this will be written to your stdout file.
   /usr/bin/ldd $HOME/examples/hello_world
   # Write the list of nodes allocated to me to stdout
   cat $PBS_NODEFILE
   # Run "hello_world" on 24 cores using the full pathname
   # mpirun -np 24 $PBS_O_HOME/examples/hello_world
   # Run "hello_world" on 12 cores using a relative pathname
   # (note: the 'cd')
   cd ${HOME}/examples
   mpirun -np 12 $PBS_O_HOME/examples/hello_world
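Putting the pieces together, a typical submit-and-monitor sequence from the head node might look like this; the job identifier shown is hypothetical:

   # Load the MPI stack used to build the program, then submit
   module add openmpi/1.3.3-gcc
   qsub -V runme.pbs
   # PBS prints a job identifier, e.g. 12345.zwicky (hypothetical)

   # Check the job's state (see the scheduler commands in the next section)
   qstat -a
   showq

   # Cancel it if something is wrong
   canceljob 12345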

7.2 Commonly Used Job Scheduler Commands

   Command     Description
   canceljob   Cancel a job, e.g. canceljob <PBS_JOBID>
   diagnose    Provide diagnostic reports for various aspects of resources, workload
               and scheduling, e.g. diagnose -f
   pbsnodes    Show compute node status, e.g. pbsnodes -a
   qstat       State of any job currently queued/running: qstat -a
               Note: when monitoring jobs with qstat, look at "Elap Time" (elapsed time)
               rather than "Time Use". "Elap Time" is the time since the job started,
               whereas "Time Use" is the CPU time used by the user process; this number
               is usually zero or close to it, since it counts the script that actually
               launches the MPI job, not the job itself.
               Show quick information about the server:      qstat -B
               Show all queues:                              qstat -q
               Show all jobs running on the system:          qstat -r
               Show detailed information for a given job:    qstat -f PBS_JOBID
   qdel        Another way to cancel a job, e.g. qdel <PBS_JOBID>
   showq       Jobs that are queued, running, or on hold, and how many nodes are in use/free.

For much more detailed documentation, see the "TORQUE Resource Manager" and "Maui Cluster Scheduler" sections of the Adaptive Computing documentation.

7.3 VM Limits

The zwicky compute nodes now have an upper bound (approximately 66 GBytes) on the total amount of virtual memory that can be used, cumulatively, by all processes running on the node. Since system processes consume a couple of GBytes, the total VM available on a node for a user job will be a bit more than 60 GBytes. Once that limit is hit, the brk() system call (and library routines which call it, e.g. sbrk() and malloc()) will fail and return an error.
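A hedged sketch of watching how close a job is to that bound, assuming you can log in to a compute node allocated to your job (node names are listed in $PBS_NODEFILE or by qstat -f):

   # On a compute node running your job:
   free -g          # total, used, and free memory on the node, in GB
   # Your processes, largest virtual size first (vsz is reported in kB)
   ps -u $USER -o pid,vsz,rss,comm --sort=-vsz | head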

8. Getting Help/Communicating with Other Users/Staff

CACR technical support is available during standard business hours (M-F, 8am to 5pm); after-hours responses are as time permits.

9. Expectations About System Down Times/News

Monday mornings are reserved for scheduled system downtime, starting no earlier than 7:00 and lasting until no later than noon. Sometimes the downtime will involve only the compute nodes (queues stopped, and no jobs scheduled); occasionally the downtime will require that users be prevented from using the head node as well. Some weeks no Monday-morning downtime will be taken. If you would like to schedule benchmarking or test runs that require dedicated access to the entire cluster, asking zwicky-help for a portion of the scheduled system time is fine and encouraged.

News about operational changes (e.g. system software upgrades, file system policy changes, dedicated runs) will be posted using news. New news items posted since you last logged in to the head node will be displayed automatically.

Using news: news -help
   Usage:
      o news     prints all new news items
      o news X   prints news item "X"
      o news -a  prints all news items
      o news -l  prints names of all news items
      o news -h  prints this message

10. Performance

HPL performance (GFLOPs peak/node):

   Cores             GFLOPs
   12   (1 node)     1.152e
   96   (8 nodes)    9.187e
   192  (16 nodes)   1.831e
   384  (32 nodes)   3.660e
   768  (64 nodes)   7.288e
   1536 (128 nodes)  1.452e
   2244 (187 nodes)  TBD

11. Policies

All users are expected to adhere to the CACR Computing Policies.

12. Accounting and Job Priority Policies

Jobs are scheduled according to "weight." Many factors are taken into consideration when determining a job's weight, including CPU time consumed recently by the user, time spent waiting in the queue, runtime, and node count, as well as a "Fairshare" job-scheduling algorithm. The intent of fairshare scheduling is to prevent a user from dominating compute resources; a balance is struck between utilization of CPU resources and job throughput.

12.1 Job request limitations

   Jobs can be requested for a maximum time of 48 hours.
   Jobs lasting 12 hours or less can request at most 64 nodes.
   Jobs lasting longer than 12 hours can request at most 22 nodes.

12.2 Job Priorities

   There are 4 nodes permanently reserved for jobs running 2 hours or less.
   No more than 22 nodes on the system can be running jobs that last more than 12 hours.

   All things being equal, a job lasting more than 12 hours will have priority over a job lasting 12 hours or less, but as stated above there will be no more than 22 nodes running such jobs.
   A fairshare algorithm is implemented between users:
      o A user's CPU-hours consumed are weighted over seven 24-hour intervals, and an attempt is made to balance this weighted usage equally among all users.
      o Each interval is weighted by (0.9)^n, where n is the n'th 24-hour interval from the present, starting with n=0. For example, usage from three days ago carries weight (0.9)^3, or about 0.73.
   Within the list of a user's jobs waiting to run, all other things being equal, the job queued the longest will run first.
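To make these limits concrete, the submissions below stay within the stated bounds; the node counts and walltimes are illustrative, not recommended sizes:

   # Long-running production job: at most 22 nodes, at most 48 hours
   qsub -V -l nodes=22 -l walltime=48:00:00 runme.pbs

   # Short job (12 hours or less) may request up to 64 nodes
   qsub -V -l nodes=64 -l walltime=12:00:00 runme.pbs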
