Rocoto. HWRF Python Scripts Training Miami, FL November 19, 2015
|
|
|
- Arnold Richardson
- 9 years ago
- Views:
Transcription
1 Rocoto HWRF Python Scripts Training Miami, FL November 19, 2015
2 Outline Introduction to Rocoto How it works Overview and description of XML Effectively using Rocoto (run, boot, stat, check, rewind, logs) Activities: Check status of run (Two cycles: one dead, one hung) Why did it hang? To boot or not to boot? How would you Change the dependencies that make a certain task run (e.g., vortex relocate can only run between 2 and 3 pm, or something else) Tinker with the number of processors used to run each job? 2
3 Rocoto s Job Workflow management A workflow is a collection of interconnected steps employed to accomplish an overall goal Rocoto is a workflow manager A means of defining a workflow Automation of workflow execution Rocoto is capable of Tracking dependencies Checking job status, including failures Resubmitting failed jobs (to a maximum number of attempts) Reliability Communication Completion of tasks Persistence Efficiency Order 3
4 How Rocoto operates Basic overview: Submits a task if its dependencies have been met Run again to check completion of jobs, and whether more jobs can be submitted Continue submitting until all tasks have completed Launcher init_gdas init_ocean init_gfs relocate gsi_d02 gsi_d03 Forecast merge unpost Post Archive Scrub 4 bufrprep Products Rocoto uses a custom XML language to define the workflow Tasks and interdependencies Runtime requirements (queueing, environment variables) Automation controls
5 How it works Rocoto XML introduction
6 XML Components Header Entities Important tags <workflow> Everything lives inside here <log> Defines the location of the Rocoto log file <cyclestr> References the current cycle at runtime <cycledef> Defines the set of cycles to be run for the workflow <task> Job submission portion of workflow <metatask> Collection of tasks 6 Taken from
7 Rocoto XML Environment Variables <?xml version="1.0"?> <!DOCTYPE workflow [ <!-- Scrub Times --> <!ENTITY COM_SCRUB_TIME "14400"> <!ENTITY WORK_SCRUB_TIME "1200"> <!ENTITY CYCLE_THROTTLE "4"> Header HWRF System Variables HWRF XML EXAMPLE parm/*.conf. <!-- External parameter entities --> <!ENTITY % SITES SYSTEM "sites/all.ent"> <!ENTITY % TASKS SYSTEM "tasks/all.ent"> <!ENTITY % STORMS SYSTEM "storms/h214.ent"> %SITES; %TASKS; %STORMS; <!ENTITY EXPT "trunk"> <!ENTITY SUBEXPT "trunk"> <!ENTITY HOMEhwrf "/pan2/projects/dtc-hurr/christina.holt/cc_rel_branch/&expt;"> <!ENTITY WORKhwrf "/pan2/projects/dtc-hurr/christina.holt/cc_rel_branch/pytmp/&subexpt;/@y@m@d@h/&sid;"> <!ENTITY COMhwrf "/pan2/projects/dtc-hurr/christina.holt/cc_rel_branch/pytmp/&subexpt;/com/@y@m@d@h/&sid;">. <!-- Enabling or disabling parts of the workflow: --> <!ENTITY RUN_GSI "YES"> <!ENTITY RUN_OCEAN "YES"> <!ENTITY RUN_RELOCATION "YES"> <!ENTITY EXTRA_TRACKERS "NO"> <! > <!-- External entities --> <!ENTITY ENV_VARS SYSTEM "env_vars.ent"> <!ENTITY cycling_condition SYSTEM "cycling_condition.ent">. 7 <!-- Workflow below here --> Variables for include files (rocoto/*) Variables for include files rocoto/* HWRF Config Variables
8 Rocoto XML - Workflow <!-- Workflow below here --> <workflow realtime="f" cyclethrottle="&cycle_throttle;" scheduler="&scheduler;" taskthrottle="20"> <cycledef> :00:00</cycledef> cyclethrottle and taskthrottle limit the number of cycles or tasks that run at one time Cycles to run <log><cyclestr>&loghwrf;/rocoto_&sid;_@y@m@[email protected]</cyclestr></log> <!-- Initialization tasks --> <metatask name="meta_init" mode="parallel"> <var name="ens">&ensids;</var> &launch_task; &bdy_task; &init_gfs_metatask; &init_gdas1_metatask; &ocean_init_task; &relocate_gfs_metatask; &relocate_gdas1_metatask; &gsi_metatask; &merge_task; </metatask>. </workflow> List of Rocoto tasks to run Log of submit statuses 8
9 Rocoto XML - Workflow <!-- Workflow below here --> <workflow realtime="f" cyclethrottle="&cycle_throttle;" scheduler="&scheduler;" taskthrottle="20"> <cycledef> :00:00</cycledef> cyclethrottle and taskthrottle limit the number of cycles or tasks that run at one time Cycles to run <log><cyclestr>&loghwrf;/rocoto_&sid;_@y@m@[email protected]</cyclestr></log> <!-- Initialization tasks --> <metatask name="meta_init" mode="parallel"> <var name="ens">&ensids;</var> &launch_task; &bdy_task; &init_gfs_metatask; &init_gdas1_metatask; &ocean_init_task; &relocate_gfs_metatask; &relocate_gdas1_metatask; &gsi_metatask; &merge_task; </metatask>. </workflow> Queue tags List of Rocoto tasks to run Log of submit statuses <task name="merge_e#ens#" maxtries="3"> Task <command>&exhwrf;/exhwrf_merge.py</command> <jobname>hwrf_merge_&sid;_<cyclestr>@y@m@d@h</cyclestr>_e#ens#</ jobname> <account>&account;</account> <queue>&pe;</queue> <nodes>1:ppn=1:tpp=&threads;</nodes> <envar> <name>total_tasks</name> <value>1</value> </envar> <walltime>00:39:00</walltime> <memory></memory> <stdout><cyclestr>&workhwrf;/hwrf_merge.out</cyclestr></stdout> <stderr><cyclestr>&workhwrf;/hwrf_merge.err</cyclestr></stderr> 9 Set environment variables Dependencies &ENV_VARS; &RESERVATION; &CORES_EXTRA; &REQUEST_THREADS; <dependency> <and> <metataskdep metatask="meta_gsi_e#ens#"/> <taskdep task="init_gfs_0_e#ens#"/> <streq><left>&run_gsi;</left><right>yes</right></streq> </and> </dependency> </task>
10 Types of Dependencies Task <taskdep>! cycle_offset: <taskdep task="wrfpost_f006 cycle_offset="-6:00:00"/> state: <taskdep state="succeeded" task="x"/>! Metatask <metataskdep>! Data <datadep>! age & minsize: Time <timedep>! Cycle exists <cycleexistdep>! Grep <sh> grep! tasks/gsi_post.ent! deps/cycling_condition.ent! tasks/launch.ent! tasks/forecast.ent tasks/launch.ent! 10
11 Activity 1 Describe the dependencies for the following tasks in words: 1. Relocate GFS 2. Uncoupled Forecast 3. Post_helper
12 Activity 2 Change the following tasks to have the corresponding dependencies: 1. post and products cannot run until forecast is complete 2. com_scrub should never run 3. relocate must only be scheduled between 2 and 3 pm
13 Activity: Create a simple XML Create an XML script to run HWRF.sh Environment variables required HWRF=1 BASIN=AL SID=11L Cycles:
14 Effectively Using Rocoto
15 To run the Rocoto XML Documentation available here: rocotorun w XMLFILE d DATABASEFILE Generates a database file the first time it s run Must run several times to complete the entire workflow Manually run while debugging Use cron during production Performs the following steps each time: Read the database file specified by d flag Query the batch system for current state of workflow Take action based on state of workflow Resubmit crashed jobs Submit jobs for tasks whose dependencies are now satisfied Save the current state of the workflow in the database file specified by d flag Quit qstat qstat u USERNAME Job ID Username Queue Jobname SessID NDS TSK Memory Time S Time jetbqs3 Christina.H batch hwrf_cpl_forecas :59:00 R 01:08: jetbqs3 Christina.H batch hwrf_post_18l_ :59:00 R 00:59: jetbqs3 15 Christina.H batch hwrf_post_helper :59:00 R 00:59: jetbqs3 Christina.H batch hwrf_products_ :59:00 R 00:54:11
16 16 rocotostat rocotostat w XMLFILE d DATABASEFILE c YYYYMMDDHHMM Check the status of a set of cycles CYCLE TASK JOBID STATE EXIT STATUS TRIES DURATION ================================================================================================================== launch_e SUCCEEDED bdy_e SUCCEEDED init_gfs_0_e SUCCEEDED init_gdas1_3_e SUCCEEDED init_gdas1_6_e SUCCEEDED init_gdas1_9_e SUCCEEDED ocean_init_e SUCCEEDED relocate_gfs_0_e relocate_gdas1_3_e SUCCEEDED relocate_gdas1_6_e SUCCEEDED relocate_gdas1_9_e SUCCEEDED gsi_d02_e SUCCEEDED gsi_d03_e SUCCEEDED merge_e SUCCEEDED check_init_e SUCCEEDED coupled_forecast_e RUNNING uncoupled_forecast_e unpost_e99 druby://fe3:37405 SUBMITTING post_e post_helper_e products_e tracker_d1_e tracker_d12_e output_e completion SUCCEEDED RUNNING SUBMITTING FAILED DEAD UNKNOWN
17 rocotocheck rocotocheck w XMLFILE d DATABASEFILE c YYYYMMDDHHMM t TASK qstat PARAFLAG ==> YES Detailed status info for a specific task in a specific cycle Task: ocean_init_e99 account: dtc-hurr command: /pan2/projects/dtc-hurr/christina.holt/cc_rel_branch/trunk/scripts/exhwrf_ocean_init.py cores: 9 final: false jobname: hwrf_ocean_init_18l_ _e99 maxtries: 3 memory: metatasks: meta_init name: ocean_init_e99 native: -l partition=ujet:tjet:vjet:sjet queue: batch seqnum: 5 stderr: /pan2/projects/dtc-hurr/christina.holt/cc_rel_branch/pytmp/trunk/ /18l/hwrf_ocean_init.err stdout: /pan2/projects/dtc-hurr/christina.holt/cc_rel_branch/pytmp/trunk/ /18l/hwrf_ocean_init.out throttle: walltime: 00:59:00 environment CONFhwrf ==> /pan2/projects/dtc-hurr/christina.holt/cc_rel_branch/pytmp/trunk/com/ /18l/storm1.conf HOMEhwrf ==> /pan2/projects/dtc-hurr/christina.holt/cc_rel_branch/trunk PYTHONPATH ==> /pan2/projects/dtc-hurr/christina.holt/cc_rel_branch/trunk/ush TOTAL_TASKS ==> 9 WORKhwrf ==> /pan2/projects/dtc-hurr/christina.holt/cc_rel_branch/pytmp/trunk/ /18l jlogfile ==> /pan2/projects/dtc-hurr/christina.holt/cc_rel_branch/pytmp/trunk/log/jlogfile dependencies AND is satisfied launch_e99 of cycle is SUCCEEDED 'YES'=='YES' is true Cycle: State: done Activated: Fri Oct 10 15:14:35 UTC 2014 Completed: Fri Oct 10 19:25:10 UTC 2014 Expired: - qstat u USERNAME 17 Job: State: SUCCEEDED (C) Exit Status: 0 Tries: 1 Unknown count: 0 Duration:
18 rocotoboot rocotoboot w XMLFILE d DATABASEFILE c YYYYMMDDHHMM t TASK Forces a task to run, regardless of dependencies rocotorewind rocotorewind w XMLFILE d DATABASEFILE c YYYYMMDDHHMM t TASK1 t TASK2 t TASK3 18 Clear the database of specified tasks Resubmit jobs that have dependencies met Kills jobs already running or in the queue Rewinding the launcher will delete com and work directories To rewind an entire cycle, use the a option
19 Activity 3 Use Rocoto utilities to find more information about failures
20 HWRF Layer to Configure XML HWRF is a complex system that has many configurable options Choice of configuration can change the steps and the dependencies of each step Rocoto does not have branching capabilities no logic structures Python layer on top of Rocoto layer Populates an XML template that matches your configuration Removes the burden of matching the workflow to the configuration from the user 20
21 hwrf/rocoto/ run_hwrf.py Get environment variables from confs Check for a TCVital record Generate xml from template (or use existing) Source the include file that loads modules Issue rocotorun command hwrf_workflow.xml.in Template for xml workflow sites/ Files containing variables specific to known machines Any machine can be added by copying and modifying one of the sites/ files storms/ Not currently used tasks/ Files defining Rocoto tasks specific to HWRF cycling_condition.ent Lists of dependencies for cycled runs (rocoto XML file) env_vars.ent List of environmental variables defining location of code, conf, output, etc. (rocoto XML file) 21
22 hwrf/rocoto/ run_hwrf.py Get environment variables from confs Check for a TCVital record Generate xml from template (or use existing) Source the include file that loads modules Issue rocotorun command hwrf_workflow.xml.in & hwrf_multistorm_workflow.xml.in Template for xml workflow runhwrf_wrapper sites/ Files containing variables specific to known machines Any machine can be added by copying and modifying one of the sites/ files storms/ Not currently used tasks/ & multistorm_tasks/ Files defining Rocoto tasks specific to HWRF deps/ Complex dependencies env_vars.ent, forecast_procs.ent, ms_vars.ent! Variable definitions 22
23 Running Rocoto for HWRF Arguments for run_hwrf.py are nearly the same as for exhwrf_launch.py./run_hwrf.py w {XMLfile} d {DBFILE} {DATE} n s sites/sjet.ent {STID} HISTORY config.expt={expt} config.run_gsi=no {XMLfile} is the XML file (optional) {DBFILE} is the database file (optional) {DATE}! YYYYMMDDHH-YYYYMMDDHH for a range of cycles YYYYMMDDHH for a single cycle YYYYMMDDHH YYYYMMDDHH for two specific cycles {STID} is the storm ID, i.e. 18L for Sandy {EXPT} is the name of parent directory of rocoto/! Can set any conf parameter in this line without editing a conf file e.g. add option: config.run_gsi=no! -n turns of invest renumbering -s to specify site file (optional) -f for running subsequent instances -m for running multistorm with a particular storm -M for running multistorm for a list of basins 23
24 Running Rocoto for HWRF The first instance of the run_hwrf.py Generates the xml code in rocoto/ Invokes rocotorun which generates database file in rocoto/ Run every few minutes using the f argument Checks for the completion of tasks Submits tasks when dependencies have been met Does not overwrite db and xml files when f option is used (asks otherwise) Run HWRF with a cron job (crontab e to edit your jobs) 24
25 Questions? Additional Resources: Rocoto for HWRF: Rocoto: Cron: Rocoto Help:
Miami University RedHawk Cluster Working with batch jobs on the Cluster
Miami University RedHawk Cluster Working with batch jobs on the Cluster The RedHawk cluster is a general purpose research computing resource available to support the research community at Miami University.
Job Scheduler Daemon Configuration Guide
Job Scheduler Daemon Configuration Guide A component of Mark Dickinsons Unix Job Scheduler This manual covers the server daemon component of Mark Dickinsons unix Job Scheduler. This manual is for version
Using Parallel Computing to Run Multiple Jobs
Beowulf Training Using Parallel Computing to Run Multiple Jobs Jeff Linderoth August 5, 2003 August 5, 2003 Beowulf Training Running Multiple Jobs Slide 1 Outline Introduction to Scheduling Software The
Introduction to Running Hadoop on the High Performance Clusters at the Center for Computational Research
Introduction to Running Hadoop on the High Performance Clusters at the Center for Computational Research Cynthia Cornelius Center for Computational Research University at Buffalo, SUNY 701 Ellicott St
High Performance Computing Facility Specifications, Policies and Usage. Supercomputer Project. Bibliotheca Alexandrina
High Performance Computing Facility Specifications, Policies and Usage Supercomputer Project Bibliotheca Alexandrina Bibliotheca Alexandrina 1/16 Topics Specifications Overview Site Policies Intel Compilers
JobScheduler Events Definition and Processing
JobScheduler - Job Execution and Scheduling System JobScheduler Events Definition and Processing Reference March 2015 March 2015 JobScheduler Events page: 1 JobScheduler Events - Contact Information Contact
Parallel Processing using the LOTUS cluster
Parallel Processing using the LOTUS cluster Alison Pamment / Cristina del Cano Novales JASMIN/CEMS Workshop February 2015 Overview Parallelising data analysis LOTUS HPC Cluster Job submission on LOTUS
SSHP Data Transfer. Description Data Source at RFC Destination table at WFO in IHFS DB SAC-SMA state variables
SSHP Data Transfer 1. Description of Algorithm http://www.weather.gov/ohd/hrl/nwsrfs/users_manual/part2/_pdf/24unithg.pdfin order to keep the SAC-SMA model states up-to-date and to provide other necessary
Overview of Archival and Purge Process in IBM Sterling B2B Integrator
Overview of Archival and Purge Process in IBM Sterling B2B Integrator - Bhavya M Reddy ([email protected]), Staff Software Engineer, IBM Sterling B2B Integrator L2 Support Table Of Contents Introduction
Managed File Transfer with Universal File Mover
Managed File Transfer with Universal File Mover Roger Lacroix [email protected] http://www.capitalware.com Universal File Mover Overview Universal File Mover (UFM) allows the user to combine
Quick Tutorial for Portable Batch System (PBS)
Quick Tutorial for Portable Batch System (PBS) The Portable Batch System (PBS) system is designed to manage the distribution of batch jobs and interactive sessions across the available nodes in the cluster.
SLURM Workload Manager
SLURM Workload Manager What is SLURM? SLURM (Simple Linux Utility for Resource Management) is the native scheduler software that runs on ASTI's HPC cluster. Free and open-source job scheduler for the Linux
PROCESSES LOADER 9.0 SETTING. Requirements and Assumptions: I. Requirements for the batch process:
SETTING UP DATA LOADER 9.0 FOR AUTO PROCESSES Requirements and Assumptions: 9.0 The purpose of this document is to document findings on the setup of Data Loader 9.0 for automated processes. I will be updating
Hodor and Bran - Job Scheduling and PBS Scripts
Hodor and Bran - Job Scheduling and PBS Scripts UND Computational Research Center Now that you have your program compiled and your input file ready for processing, it s time to run your job on the cluster.
Running applications on the Cray XC30 4/12/2015
Running applications on the Cray XC30 4/12/2015 1 Running on compute nodes By default, users do not log in and run applications on the compute nodes directly. Instead they launch jobs on compute nodes
Installation Manual v2.0.0
Installation Manual v2.0.0 Contents ResponseLogic Install Guide v2.0.0 (Command Prompt Install)... 3 Requirements... 4 Installation Checklist:... 4 1. Download and Unzip files.... 4 2. Confirm you have
Microsoft Visual Basic Scripting Edition and Microsoft Windows Script Host Essentials
Microsoft Visual Basic Scripting Edition and Microsoft Windows Script Host Essentials 2433: Microsoft Visual Basic Scripting Edition and Microsoft Windows Script Host Essentials (3 Days) About this Course
Embedded Event Manager Commands
This module describes the commands that are used to set the Embedded Event Manager (EEM) operational attributes and monitor EEM operations. The Cisco IOS XR software EEM functions as the central clearing
PBS Tutorial. Fangrui Ma Universit of Nebraska-Lincoln. October 26th, 2007
PBS Tutorial Fangrui Ma Universit of Nebraska-Lincoln October 26th, 2007 Abstract In this tutorial we gave a brief introduction to using PBS Pro. We gave examples on how to write control script, and submit
JobScheduler Web Services Executing JobScheduler commands
JobScheduler - Job Execution and Scheduling System JobScheduler Web Services Executing JobScheduler commands Technical Reference March 2015 March 2015 JobScheduler Web Services page: 1 JobScheduler Web
Grid Engine Users Guide. 2011.11p1 Edition
Grid Engine Users Guide 2011.11p1 Edition Grid Engine Users Guide : 2011.11p1 Edition Published Nov 01 2012 Copyright 2012 University of California and Scalable Systems This document is subject to the
www.novell.com/documentation Jobs Guide Identity Manager 4.0.1 February 10, 2012
www.novell.com/documentation Jobs Guide Identity Manager 4.0.1 February 10, 2012 Legal Notices Novell, Inc. makes no representations or warranties with respect to the contents or use of this documentation,
000-420. IBM InfoSphere MDM Server v9.0. Version: Demo. Page <<1/11>>
000-420 IBM InfoSphere MDM Server v9.0 Version: Demo Page 1. As part of a maintenance team for an InfoSphere MDM Server implementation, you are investigating the "EndDate must be after StartDate"
Log Analyzer Reference
IceWarp Unified Communications Log Analyzer Reference Version 10.4 Printed on 27 February, 2012 Contents Log Analyzer 1 Quick Start... 2 Required Steps... 2 Optional Steps... 3 Advanced Configuration...
Grid Engine 6. Troubleshooting. BioTeam Inc. [email protected]
Grid Engine 6 Troubleshooting BioTeam Inc. [email protected] Grid Engine Troubleshooting There are two core problem types Job Level Cluster seems OK, example scripts work fine Some user jobs/apps fail Cluster
Understanding Task Scheduler FIGURE 33.14. Task Scheduler. The error reporting screen.
1383 FIGURE.14 The error reporting screen. curring tasks into a central location, administrators gain insight into system functionality and control over their Windows Server 2008 R2 infrastructure through
NASA Workflow Tool. User Guide. September 29, 2010
NASA Workflow Tool User Guide September 29, 2010 NASA Workflow Tool User Guide 1. Overview 2. Getting Started Preparing the Environment 3. Using the NED Client Common Terminology Workflow Configuration
Submitting batch jobs Slurm on ecgate. Xavi Abellan [email protected] User Support Section
Submitting batch jobs Slurm on ecgate Xavi Abellan [email protected] User Support Section Slide 1 Outline Interactive mode versus Batch mode Overview of the Slurm batch system on ecgate Batch basic
Running on Blue Gene/Q at Argonne Leadership Computing Facility (ALCF)
Running on Blue Gene/Q at Argonne Leadership Computing Facility (ALCF) ALCF Resources: Machines & Storage Mira (Production) IBM Blue Gene/Q 49,152 nodes / 786,432 cores 768 TB of memory Peak flop rate:
White Paper. Fabasoft app.test Load Testing. Fabasoft app.test 2015 Update Rollup 2. Fabasoft app.test Load Testing 1
White Paper Fabasoft app.test Load Testing Fabasoft app.test 2015 Update Rollup 2 Fabasoft app.test Load Testing 1 Copyright Fabasoft R&D GmbH, Linz, Austria, 2015. All rights reserved. All hardware and
The RWTH Compute Cluster Environment
The RWTH Compute Cluster Environment Tim Cramer 11.03.2013 Source: D. Both, Bull GmbH Rechen- und Kommunikationszentrum (RZ) How to login Frontends cluster.rz.rwth-aachen.de cluster-x.rz.rwth-aachen.de
HIPAA Compliance Use Case
Overview HIPAA Compliance helps ensure that all medical records, medical billing, and patient accounts meet certain consistent standards with regard to documentation, handling, and privacy. Current Situation
Backing Up TestTrack Native Project Databases
Backing Up TestTrack Native Project Databases TestTrack projects should be backed up regularly. You can use the TestTrack Native Database Backup Command Line Utility to back up TestTrack 2012 and later
JOINUS AG. PowerPay Checkout. Magento Module User Manual. Support: [email protected]
PowerPay Checkout Magento Module User Manual Support: [email protected] This document explains installation procedure and configuration options for Joinus AG PowerPay checkout magento payment module.
HOD Scheduler. Table of contents
Table of contents 1 Introduction... 2 2 HOD Users... 2 2.1 Getting Started... 2 2.2 HOD Features...5 2.3 Troubleshooting... 14 3 HOD Administrators... 21 3.1 Getting Started... 22 3.2 Prerequisites...
SGE Roll: Users Guide. Version @VERSION@ Edition
SGE Roll: Users Guide Version @VERSION@ Edition SGE Roll: Users Guide : Version @VERSION@ Edition Published Aug 2006 Copyright 2006 UC Regents, Scalable Systems Table of Contents Preface...i 1. Requirements...1
How To Use Syntheticys User Management On A Pc Or Mac Or Macbook Powerbook (For Mac) On A Computer Or Mac (For Pc Or Pc) On Your Computer Or Ipa (For Ipa) On An Pc Or Ipad
SYNTHESYS MANAGEMENT User Management Synthesys.Net User Management 1 SYNTHESYS.NET USER MANAGEMENT INTRODUCTION...3 STARTING SYNTHESYS USER MANAGEMENT...4 Viewing User Details... 5 Locating individual
SLURM: Resource Management and Job Scheduling Software. Advanced Computing Center for Research and Education www.accre.vanderbilt.
SLURM: Resource Management and Job Scheduling Software Advanced Computing Center for Research and Education www.accre.vanderbilt.edu Simple Linux Utility for Resource Management But it s also a job scheduler!
Martinos Center Compute Clusters
Intro What are the compute clusters How to gain access Housekeeping Usage Log In Submitting Jobs Queues Request CPUs/vmem Email Status I/O Interactive Dependencies Daisy Chain Wrapper Script In Progress
Avira AntiVir MailGate 3.2 Release Notes
Release Notes 1. Features 1.1 Assigning recipient addresses to groups either by using Active Directory or a plain text file 1.1.1 Using a Active Directory server MailGate communicates with Active Directory
NYUAD HPC Center Running Jobs
NYUAD HPC Center Running Jobs 1 Overview... Error! Bookmark not defined. 1.1 General List... Error! Bookmark not defined. 1.2 Compilers... Error! Bookmark not defined. 2 Loading Software... Error! Bookmark
Tracking Network Changes Using Change Audit
CHAPTER 14 Change Audit tracks and reports changes made in the network. Change Audit allows other RME applications to log change information to a central repository. Device Configuration, Inventory, and
StreamServe Persuasion SP5 Document Broker Plus
StreamServe Persuasion SP5 Document Broker Plus User Guide Rev A StreamServe Persuasion SP5 Document Broker Plus User Guide Rev A 2001-2010 STREAMSERVE, INC. ALL RIGHTS RESERVED United States patent #7,127,520
An Introduction to High Performance Computing in the Department
An Introduction to High Performance Computing in the Department Ashley Ford & Chris Jewell Department of Statistics University of Warwick October 30, 2012 1 Some Background 2 How is Buster used? 3 Software
FioranoMQ 9. High Availability Guide
FioranoMQ 9 High Availability Guide Copyright (c) 1999-2008, Fiorano Software Technologies Pvt. Ltd., Copyright (c) 2008-2009, Fiorano Software Pty. Ltd. All rights reserved. This software is the confidential
Debugging and Profiling Lab. Carlos Rosales, Kent Milfeld and Yaakoub Y. El Kharma [email protected]
Debugging and Profiling Lab Carlos Rosales, Kent Milfeld and Yaakoub Y. El Kharma [email protected] Setup Login to Ranger: - ssh -X [email protected] Make sure you can export graphics
Work Environment. David Tur HPC Expert. HPC Users Training September, 18th 2015
Work Environment David Tur HPC Expert HPC Users Training September, 18th 2015 1. Atlas Cluster: Accessing and using resources 2. Software Overview 3. Job Scheduler 1. Accessing Resources DIPC technicians
Using the Yale HPC Clusters
Using the Yale HPC Clusters Stephen Weston Robert Bjornson Yale Center for Research Computing Yale University Oct 2015 To get help Send an email to: [email protected] Read documentation at: http://research.computing.yale.edu/hpc-support
CrontabFile Converter
JobScheduler - Job Execution and Scheduling System CrontabFile Converter Users Manual July 2014 July 2014 CrontabFile Converter page: 1 CrontabFile Converter - Contact Information Contact Information Software-
Exam Name: IBM InfoSphere MDM Server v9.0
Vendor: IBM Exam Code: 000-420 Exam Name: IBM InfoSphere MDM Server v9.0 Version: DEMO 1. As part of a maintenance team for an InfoSphere MDM Server implementation, you are investigating the "EndDate must
FIGURE 33.5. Selecting properties for the event log.
1358 CHAPTER 33 Logging and Debugging Customizing the Event Log The properties of an event log can be configured. In Event Viewer, the properties of a log are defined by general characteristics: log path,
Gentran Integration Suite. Archiving and Purging. Version 4.2
Gentran Integration Suite Archiving and Purging Version 4.2 Copyright 2007 Sterling Commerce, Inc. All rights reserved. Additional copyright information is located on the Gentran Integration Suite Documentation
Introduction to the SGE/OGS batch-queuing system
Grid Computing Competence Center Introduction to the SGE/OGS batch-queuing system Riccardo Murri Grid Computing Competence Center, Organisch-Chemisches Institut, University of Zurich Oct. 6, 2011 The basic
Extending Remote Desktop for Large Installations. Distributed Package Installs
Extending Remote Desktop for Large Installations This article describes four ways Remote Desktop can be extended for large installations. The four ways are: Distributed Package Installs, List Sharing,
Maintaining Non-Stop Services with Multi Layer Monitoring
Maintaining Non-Stop Services with Multi Layer Monitoring Lahav Savir System Architect and CEO of Emind Systems [email protected] www.emindsys.com The approach Non-stop applications can t leave on their
How To Run A Tompouce Cluster On An Ipra (Inria) 2.5.5 (Sun) 2 (Sun Geserade) 2-5.4 (Sun-Ge) 2/5.2 (
Running Hadoop and Stratosphere jobs on TomPouce cluster 16 October 2013 TomPouce cluster TomPouce is a cluster of 20 calcula@on nodes = 240 cores Located in the Inria Turing building (École Polytechnique)
Cache Configuration Reference
Sitecore CMS 6.2 Cache Configuration Reference Rev: 2009-11-20 Sitecore CMS 6.2 Cache Configuration Reference Tips and Techniques for Administrators and Developers Table of Contents Chapter 1 Introduction...
SIEBEL SERVER ADMINISTRATION GUIDE
SIEBEL SERVER ADMINISTRATION GUIDE VERSION 7.5.3 JULY 2003 12-FRLK3Z Siebel Systems, Inc., 2207 Bridgepointe Parkway, San Mateo, CA 94404 Copyright 2003 Siebel Systems, Inc. All rights reserved. Printed
CA Workload Automation Agent for Microsoft SQL Server
CA Workload Automation Agent for Microsoft SQL Server CLI User Guide r11.3.1 This Documentation, which includes embedded help systems and electronically distributed materials, (hereinafter referred to
PASS4TEST 専 門 IT 認 証 試 験 問 題 集 提 供 者
PASS4TEST 専 門 IT 認 証 試 験 問 題 集 提 供 者 http://www.pass4test.jp 1 年 で 無 料 進 級 することに 提 供 する Exam : C2090-420 Title : IBM InfoSphere MDM Server v9.0 Vendors : IBM Version : DEMO NO.1 Which two reasons would
ArpViewer Manual Version 1.0.6 Datum 30.9.2007
SWITCHaai ArpViewer Manual Version 1.0.6 Datum 30.9.2007 AAI C.Witzig Content 1. Introduction...3 2. Functional Description...3 3. Description of the Components...3 3.1. Arpfilter...3 3.2. Controller...4
Audit TM. The Security Auditing Component of. Out-of-the-Box
Audit TM The Security Auditing Component of Out-of-the-Box This guide is intended to provide a quick reference and tutorial to the principal features of Audit. Please refer to the User Manual for more
CAPIX Job Scheduler User Guide
CAPIX Job Scheduler User Guide Version 1.1 December 2009 Table of Contents Table of Contents... 2 Introduction... 3 CJS Installation... 5 Writing CJS VBA Functions... 7 CJS.EXE Command Line Parameters...
Tutorial: Packaging your server build
Tutorial: Packaging your server build This tutorial walks you through the steps to prepare a game server folder or package containing all the files necessary for your game server to run in Amazon GameLift.
ACE 2011 International
ACE 2011 International Understanding Federation and Web Services www. Copyright 2011 Aras All Rights Reserved. Welcome Session Goals Previous session covered overall Integration strategies and a focus
Job scheduler details
Job scheduler details Advanced Computing Center for Research & Education (ACCRE) Job scheduler details 1 / 25 Outline 1 Batch queue system overview 2 Torque and Moab 3 Submitting jobs (ACCRE) Job scheduler
Decision Support System to MODEM communications
Decision Support System to MODEM communications Guy Van Sanden [email protected] Decision Support System to MODEM communications by Guy Van Sanden This document describes how to set up the dss2modem communications
Siebel Business Process Framework: Workflow Guide. Siebel Innovation Pack 2013 Version 8.1/8.2 September 2013
Siebel Business Process Framework: Workflow Guide Siebel Innovation Pack 2013 Version 8.1/8.2 September 2013 Copyright 2005, 2013 Oracle and/or its affiliates. All rights reserved. This software and related
Windows PowerShell Cookbook
Windows PowerShell Cookbook Lee Holmes O'REILLY' Beijing Cambridge Farnham Koln Paris Sebastopol Taipei Tokyo Table of Contents Foreword Preface xvii xxi Part I. Tour A Guided Tour of Windows PowerShell
Job Scheduling with Moab Cluster Suite
Job Scheduling with Moab Cluster Suite IBM High Performance Computing February 2010 Y. Joanna Wong, Ph.D. [email protected] 2/22/2010 Workload Manager Torque Source: Adaptive Computing 2 Some terminology..
Troubleshooting WebSphere Application Server Start/Stop Issues
IBM Software Group Troubleshooting WebSphere Application Server Start/Stop Issues Ganesan Karuppaiah & Kumaran Nathan WebSphere Application Server L2 Support [email protected], [email protected] WebSphere
Version Control Your Jenkins Jobs with Jenkins Job Builder
Version Control Your Jenkins Jobs with Jenkins Job Builder Abstract Wayne Warren [email protected] Puppet Labs uses Jenkins to automate building and testing software. While we do derive benefit from
Parallel Debugging with DDT
Parallel Debugging with DDT Nate Woody 3/10/2009 www.cac.cornell.edu 1 Debugging Debugging is a methodical process of finding and reducing the number of bugs, or defects, in a computer program or a piece
Fair Scheduler. Table of contents
Table of contents 1 Purpose... 2 2 Introduction... 2 3 Installation... 3 4 Configuration...3 4.1 Scheduler Parameters in mapred-site.xml...4 4.2 Allocation File (fair-scheduler.xml)... 6 4.3 Access Control
Unix Scripts and Job Scheduling
Unix Scripts and Job Scheduling Michael B. Spring Department of Information Science and Telecommunications University of Pittsburgh [email protected] http://www.sis.pitt.edu/~spring Overview Shell Scripts
System Monitoring and Diagnostics Guide for Siebel Business Applications. Version 7.8 April 2005
System Monitoring and Diagnostics Guide for Siebel Business Applications April 2005 Siebel Systems, Inc., 2207 Bridgepointe Parkway, San Mateo, CA 94404 Copyright 2005 Siebel Systems, Inc. All rights reserved.
RA MPI Compilers Debuggers Profiling. March 25, 2009
RA MPI Compilers Debuggers Profiling March 25, 2009 Examples and Slides To download examples on RA 1. mkdir class 2. cd class 3. wget http://geco.mines.edu/workshop/class2/examples/examples.tgz 4. tar
Ra - Batch Scripts. Timothy H. Kaiser, Ph.D. [email protected]
Ra - Batch Scripts Timothy H. Kaiser, Ph.D. [email protected] Jobs on Ra are Run via a Batch System Ra is a shared resource Purpose: Give fair access to all users Have control over where jobs are run Set
Microsoft Windows PowerShell v2 For Administrators
Course 50414B: Microsoft Windows PowerShell v2 For Administrators Course Details Course Outline Module 1: Introduction to PowerShell the Basics This module explains how to install and configure PowerShell.
Products that are referred to in this document may be trademarks and/or registered trademarks of the respective owners.
2015 GEOVAP, spol. s r. o. All rights reserved. GEOVAP, spol. s r. o. Cechovo nabrezi 1790 530 03 Pardubice Czech Republic +420 466 024 618 http://www.geovap.cz Products that are referred to in this document
Introduction to Programming and Computing for Scientists
Oxana Smirnova (Lund University) Programming for Scientists Tutorial 7b 1 / 48 Introduction to Programming and Computing for Scientists Oxana Smirnova Lund University Tutorial 7b: Grid certificates and
Healthstone Monitoring System
Healthstone Monitoring System Patrick Lambert v1.1.0 Healthstone Monitoring System 1 Contents 1 Introduction 2 2 Windows client 2 2.1 Installation.............................................. 2 2.2 Troubleshooting...........................................
Backup Tab. User Guide
Backup Tab User Guide Contents 1. Introduction... 2 Documentation... 2 Licensing... 2 Overview... 2 2. Create a New Backup... 3 3. Manage backup jobs... 4 Using the Edit menu... 5 Overview... 5 Destination...
Workshop for WebLogic introduces new tools in support of Java EE 5.0 standards. The support for Java EE5 includes the following technologies:
Oracle Workshop for WebLogic 10g R3 Hands on Labs Workshop for WebLogic extends Eclipse and Web Tools Platform for development of Web Services, Java, JavaEE, Object Relational Mapping, Spring, Beehive,
WebSphere Commerce and Sterling Commerce
WebSphere Commerce and Sterling Commerce Inventory and order integration This presentation introduces WebSphere Commerce and Sterling Commerce inventory and order integration. Order_Inventory_Integration.ppt
Scheduling in SAS 9.3
Scheduling in SAS 9.3 SAS Documentation The correct bibliographic citation for this manual is as follows: SAS Institute Inc 2011. Scheduling in SAS 9.3. Cary, NC: SAS Institute Inc. Scheduling in SAS 9.3
System Administration
Performance Monitoring For a server, it is crucial to monitor the health of the machine You need not only real time data collection and presentation but offline statistical analysis as well Characteristics
Scheduling in SAS 9.4 Second Edition
Scheduling in SAS 9.4 Second Edition SAS Documentation The correct bibliographic citation for this manual is as follows: SAS Institute Inc. 2015. Scheduling in SAS 9.4, Second Edition. Cary, NC: SAS Institute
LATITUDE Patient Management System
LATITUDE PACEART INTEGRATION 1.01 GUIDE LATITUDE Patient Management System LATITUDE PACEART INTEGRATION SYSTEM DIAGRAM a. Patient environment b. LATITUDE environment c. Clinic environment d. Data retrieval
Using NeSI HPC Resources. NeSI Computational Science Team ([email protected])
NeSI Computational Science Team ([email protected]) Outline 1 About Us About NeSI Our Facilities 2 Using the Cluster Suitable Work What to expect Parallel speedup Data Getting to the Login Node 3 Submitting
Setting up PostgreSQL
Setting up PostgreSQL 1 Introduction to PostgreSQL PostgreSQL is an object-relational database management system based on POSTGRES, which was developed at the University of California at Berkeley. PostgreSQL
DCWorkflow makes a few simple assumptions about your workflow:
1 Purpose This documentation explains the purpose of the DCWorkflow product and how to make use of it. DCWorkflow is a product made to be used with Zope CMF from Zope Corporation. Familiarity with Zope
Solution White Paper Connect Hadoop to the Enterprise
Solution White Paper Connect Hadoop to the Enterprise Streamline workflow automation with BMC Control-M Application Integrator Table of Contents 1 EXECUTIVE SUMMARY 2 INTRODUCTION THE UNDERLYING CONCEPT
Troubleshooting Citrix MetaFrame Procedures
Troubleshooting Citrix MetaFrame Procedures Document name Troubleshooting a Citrix MetaFrame environment v1.0.doc Author Marcel van As Last Revision Date 28 February 2006 Edited and released by: www.dabcc.com
HP-UX Essentials and Shell Programming Course Summary
Contact Us: (616) 875-4060 HP-UX Essentials and Shell Programming Course Summary Length: 5 Days Prerequisite: Basic computer skills Recommendation Statement: Student should be able to use a computer monitor,
Getting Started with SandStorm NoSQL Benchmark
Getting Started with SandStorm NoSQL Benchmark SandStorm is an enterprise performance testing tool for web, mobile, cloud and big data applications. It provides a framework for benchmarking NoSQL, Hadoop,
Batch Systems. provide a mechanism for submitting, launching, and tracking jobs on a shared resource
PBS INTERNALS PBS & TORQUE PBS (Portable Batch System)-software system for managing system resources on workstations, SMP systems, MPPs and vector computers. It was based on Network Queuing System (NQS)
