Getting help - guide to the ticketing system. Thomas Röblitz, UiO/USIT/UAV/ITF/FI ;)



The world is perfect, isn't it? Why do we need a ticket system?

Example 1

User: I am having trouble logging on to abel this morning. My username is, and I have access to a home area and the project area. Do you know if this is a general problem, and whether it will be solved soon?
Very good - Good - Bad

Support: Hi, do you have a little more information? E.g., some error message...
User: -bash: warning: setlocale: LC_CTYPE: cannot change locale (UTF-8) and then I do not get a new bash prompt.
Support: It might be a problem with some environment variable on your computer. Please try the following:
    export LC_CTYPE=en_US.UTF-8
Then try to log in again.
User: It worked! Thanks a lot!
Timeline: user (0), support (+7 min), user (+6 min), support (+68 min), user (+45 min) => ~ 2 h
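A warning like the one in this example typically appears when the local terminal sets LC_CTYPE to a bare codeset such as UTF-8 and the SSH client forwards that value to the server. That cause is an assumption here; the exchange itself only shows the fix. The following is a minimal sketch of a check to run on the local machine before logging in. The function names is_full_locale and check_lc_ctype are hypothetical, and the pattern is a simplification that only recognises two-letter language codes.

```shell
#!/bin/sh
# Hypothetical helper: succeeds if the value is C/POSIX or starts with
# language_TERRITORY (for example en_US, en_US.UTF-8, de_DE@euro).
# Simplification: only two-letter language codes are recognised.
is_full_locale() {
    case "$1" in
        C|POSIX|C.*) return 0 ;;
        [a-z][a-z]_[A-Z][A-Z]*) return 0 ;;
        *) return 1 ;;
    esac
}

# Report on a given LC_CTYPE value without changing anything.
check_lc_ctype() {
    if [ -z "$1" ]; then
        echo "LC_CTYPE is not set"
    elif is_full_locale "$1"; then
        echo "LC_CTYPE=$1 looks like a full locale name"
    else
        echo "LC_CTYPE=$1 is not a full locale name; try: export LC_CTYPE=en_US.UTF-8"
    fi
}

# Check the value of the current shell.
check_lc_ctype "${LC_CTYPE:-}"
```

Pasting the printed line into the first message of a ticket could save the support team one round of questions.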

Example 2

User: I am experiencing severe problems in using XY and compiling files on Abel. I had a working setup on Titan but now it seems impossible to source and run XY on Abel.
Very good - Good - Bad

Support: What were you actually trying? E.g., which commands did you run, and what was their output? How do you identify the problem?
User: never responded!
Timeline: user (0), support (+30 min), closed (+10 days)

Goals
- understand how we process tickets
- provide guidelines for the interactions and the information needed by the support team
- get the help needed

Abel (overview diagram)
- Users: 875; core support team: ~10 (tasks: tickets, devel, projects); several other units involved
- Compute nodes: Intel SNB, 2x 8 core; 10K cores; hugemem nodes (1 TiB); GPU nodes (nvidia GPU); special purpose nodes
- Interconnect: Mellanox FDR Infiniband, 56 Gbps; IB + GbE
- Storage: FhGFS global parallel filesystem, 400 TiB; Scratch, /home, Data (NorStore/UiOStore/Local)
- Workload: SLURM resource manager; 300K jobs per month; ~130 sw packages
- Frontends / mgmnt: login + portals
- Tickets: RT system; ~6K tickets over 8 years, 800 for Abel
- Further labels in the diagram: new par FS, IO compute nodes, cloud, grid

Ticket stats
- ~6K HPC tickets, ~800 since Abel
- ~3 days (median) to process a ticket

How do we process a ticket?

Known/easy issue
- new user
- reset password
- program not found
- recently solved issue
- ...
Usually a short time to process.

Unknown/complex issue
- which parts of Abel are involved?
- what actually happens/happened?
To provide a (good) solution, we usually want to reproduce what the user sees (or does not see). This can be quite a long procedure.

Reproducing the problem
- Try to run a minimal sequence of commands that leads to the problem
- Verify that the problem exists
- Understand the problem (better)
- Adapt the environment, fix sw pkgs, change parameters to provide a solution
- Test with the sequence!

Guidelines
- Is it a simple (UNIX) or generic issue? Try (1) Google, books; (2) colleagues; (3) houston.
- Is it HPC (Abel) specific? Did you check our documentation? If that does not help, issue a ticket.

Information for a new ticket
Some observations:
- often too little and too imprecise information
- a (long) procedure to figure out what the core of the issue is
Remember: what, when, where, who, ...

What?
- try to be as precise and expressive as possible (though this is not always possible)
- run the commands with the tool script to record a sequence of commands and outputs that leads to the problem
- either the root cause becomes obvious OR the issue can be reproduced
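Recording commands together with their output can be sketched as follows. Interactively, the script tool does this for a whole session (see the comment at the top of the block). For a non-interactive reproduction, a small wrapper can produce a similar log. The helper name run_logged and the log file name are hypothetical and only serve as an illustration.

```shell
#!/bin/sh
# Interactive recording with the script tool:
#   script mysession.log    # start recording
#   ...run the commands that lead to the problem...
#   exit                    # stop recording, then attach mysession.log

# Hypothetical helper for non-interactive use: append the command line,
# its combined stdout/stderr, and its exit status to a log file.
run_logged() {
    log="$1"
    shift
    printf '$ %s\n' "$*" >> "$log"
    status=0
    "$@" >> "$log" 2>&1 || status=$?
    printf '[exit status: %s]\n' "$status" >> "$log"
    return "$status"
}

# Example: record a short sequence into one file (file name is arbitrary).
LOG="${TMPDIR:-/tmp}/ticket-session.$$.log"
run_logged "$LOG" uname -s
# This command is expected to fail; the failure is what gets documented.
run_logged "$LOG" ls /nonexistent-directory-for-demo || true
echo "Session written to $LOG"
```

The resulting file contains each command, what it printed, and how it ended, which is usually enough for someone else to repeat the same sequence.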

Where?
- which Abel machine: login, compute, appnode, bioportal
- which remote machine (e.g. when logging in) and which operating system (Win, OS X, Linux)
- which path or file: HOME/myrun/..., PROJECTS/..., WORK/...

When?
- which day, time: yesterday, this morning, now, ...
- which job: job id(s)

Who?
- myself
- myself + my colleague(s) {whom precisely?}
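Much of the where/when/who information can be collected with standard commands on the machine where the problem occurs. The following is a minimal sketch. The function name ticket_context and the output layout are hypothetical. SLURM_JOB_ID is the variable SLURM sets inside a running job, so outside a job the placeholder text is printed instead.

```shell
#!/bin/sh
# Hypothetical helper: print where/when/who details to paste into a ticket.
ticket_context() {
    echo "Host:      $(uname -n)"
    echo "System:    $(uname -s)"
    echo "Directory: $(pwd)"
    echo "Time:      $(date)"
    echo "User:      $(id -un)"
    # Set by SLURM inside a job; unset on login nodes and local machines.
    echo "Job id:    ${SLURM_JOB_ID:-none (not inside a job)}"
}

ticket_context
```

If the problem shows up while logging in, running the same function on the local computer covers the remote machine and its operating system. Colleagues who are also affected still have to be named by hand.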

Other infos / recommendations
- known previous/similar issue (refer to the ticket with URL/id)
- try to limit the scope of a single ticket
- do not reopen a resolved ticket with a follow-up issue

Interacting with hpc-drift
- Do not address a specific member of the support team (unless you know that person is the only one who can help...)
- Help them by providing additional information (sometimes timing is critical)
- Please, no URGENT, !!!, ??? ;)
- Try not to blame other users for wrongdoing (nobody is perfect)

Resolved?
We are here to help you. Do not hesitate to ask!
Remember that we need your help too,
- to provide good solutions in reasonable time ...
- to be pointed to issues we may overlook ...