UK HEP System Manager Meeting. 30 June 3 July 2009 RAL. Diego Bellini



Similar documents
NEXTGEN v5.8 HARDWARE VERIFICATION GUIDE CLIENT HOSTED OR THIRD PARTY SERVERS

Parallels Plesk Automation

Priority Pro v17: Hardware and Supporting Systems

Red Hat Enterprise Virtualization - KVM-based infrastructure services at BNL

Department of Technology Services UNIX SERVICE OFFERING

DAS (Direct Attached Storage)

Priority Zoom v17: Hardware and Supporting Systems

Report from SARA/NIKHEF T1 and associated T2s

Belgacom Group Carrier & Wholesale Solutions. ICT to drive Your Business. Hosting Solutions. Datacenter Services

DEDICATED MANAGED SERVER PROGRAM

The Ultimate Business & Enterprise Hosting Solutions.

out of this world guide to: POWERFUL DEDICATED SERVERS

Copyright by Parallels Holdings, Ltd. All rights reserved.

RICOH Data Center Services

DocuShare Installation Guide

Cisco Prime Home 5.0 Minimum System Requirements (Standalone and High Availability)

Diploma in Computer Science

Powerful Dedicated Servers

Small Business Server Part 1

This guide specifies the required and supported system elements for the application.

Virtualisation Cloud Computing at the RAL Tier 1. Ian Collier STFC RAL Tier 1 HEPiX, Bologna, 18 th April 2013

Reboot the ExtraHop System and Test Hardware with the Rescue USB Flash Drive

SAN TECHNICAL - DETAILS/ SPECIFICATIONS

Computing Support Services at the Hawai`i Institute of Geophysics and Planetology

Sage Grant Management System Requirements

napp-it ZFS Storage principles for Video Performance Dual Xeon, 40 GB RAM Intel X540, mtu 9000 (JumboFrames) Server OS: Oracle Solaris 11.3 SMB 2.

Servers, Clients. Displaying max. 60 cameras at the same time Recording max. 80 cameras Server-side VCA Desktop or rackmount form factor

Software Environment. Options. Service guarantee:. 24/7 Hardware Support. 99% uptime

Vembu NetworkBackup v3.1.1 GA

NETGEAR SMB Storage Line Update and ReadyNAS 2100 Introduction

Office 365 Migration Performance & Server Requirements

OIS. Update on Windows 7 at CERN & Remote Desktop Gateway. Operating Systems & Information Services CERN IT-OIS

Microsoft Office Outlook 2013: Part 1

GridKa site report. Manfred Alef, Andreas Heiss, Jos van Wezel. Steinbuch Centre for Computing

DocuShare Installation Guide

ORACLE OPS CENTER: PROVISIONING AND PATCH AUTOMATION PACK

Building a Linux Cluster

LabStats 5 System Requirements

Virtualization Infrastructure at Karlsruhe

112 Linton House Union Street London SE1 0LH T: F:

insync Installation Guide

HAMBURG ZEUTHEN. DESY Tier 2 and NAF. Peter Wegner, Birgit Lewendel for DESY-IT/DV. Tier 2: Status and News NAF: Status, Plans and Questions

v7.1 Technical Specification

WINNEBAGO COUNTY INFORMATION TECHNOLOGY STANDARDS 2014/2015

Deploying Cloudera CDH (Cloudera Distribution Including Apache Hadoop) with Emulex OneConnect OCe14000 Network Adapters

MedInformatix System Requirements

The Technical Realization of the National Analysis Facility

EIOBoard Intranet Installer Guide

A Highly Versatile Virtual Data Center Ressource Pool Benefits of XenServer to virtualize services in a virtual pool

Staying afloat in today s Storage Pools. Bob Trotter IGeLU 2009 Conference September 7-9, 2009 Helsinki, Finland

RO-11-NIPNE, evolution, user support, site and software development. IFIN-HH, DFCTI, LHCb Romanian Team

File server

vrealize Business System Requirements Guide

Ignify ecommerce. Item Requirements Notes

Appendix 3. Specifications for e-portal

Xserve Transition Guide. November 2010

Implementing Enterprise Disk Arrays Using Open Source Software. Marc Smith Mott Community College - Flint, MI Merit Member Conference 2012

QUESTIONS & ANSWERS. ItB tender 72-09: IT Equipment. Elections Project

Adapt Support Managed Service Programs

StruxureWare Data Center Expert Release Notes

NEWSTAR ENTERPRISE and NEWSTAR Sales System Requirements

Introduction to Cloud Computing

- Brazoria County on coast the north west edge gulf, population of 330,242

SMALL BUSINESS OUTSOURCING

Hardware Requirements

Solution Brief: Creating Avid Project Archives

Chapter 2 Installing ShareScope

Computing in High- Energy-Physics: How Virtualization meets the Grid

vnas Series All-in-one NAS with virtualization platform

Dell KACE K1000 Management Appliance. Administrator Guide. Release 5.3. Revision Date: May 16, 2011

HARDWARE AND SOFTWARE REQUIREMENTS

MoversSuite by EWS. System Requirements

Novell Open Workgroup Suite

Hardware and Software Requirements for Installing California.pro

Enabling Technologies for Distributed Computing

Terms of Reference Microsoft Exchange and Domain Controller/ AD implementation

Sun Constellation System: The Open Petascale Computing Architecture

Minimum Hardware Specifications Upgrades

Checklist Florin / Calacsy 6.x / 7.x Small Business model. Document versie: 0.2

System Environment Specifications Network, PC, Peripheral & Server Requirements

DIVISION OF ENGINEERING COMPUTING SERVICES DECS SERVICE DESK. Fall & Spring: Monday Thursday 8am to 9pm. Summer & Breaks:

QPS 9.2 ReadMe...5. QPS components...6

Server Recommendations August 28, 2014 Version 1.22

Introduction to Mirametrix EyeTracker

Network Guidelines and Hardware Requirements

For Hyper-V Edition Practical Operation Seminar. 4th Edition

DOCSVAULT Document Management System for everyone

System Requirements. 60GB free after OS and Updates, Raid 5 or Hybrid SSD array

Main Memory Data Warehouses

Virtualization Technologies and Blackboard: The Future of Blackboard Software on Multi-Core Technologies

HP Universal CMDB. Software Version: Support Matrix

IBM System x SAP HANA

Installation, Report Environment Set-up and DB Maintenance

Virtual Private Servers

Workstation Virtualization Software Review. Matthew Smith. Office of Science, Faculty and Student Team (FaST) Big Bend Community College

Vintage IT Services MSRP Pricing PaaS Catalog Pricing Service Categories MSRP Pricing

White paper. ATCA Compute Platforms (ACP) Use ACP to Accelerate Private Cloud Deployments for Mission Critical Workloads. Rev 01

Tier0 plans and security and backup policy proposals

OBSERVEIT DEPLOYMENT SIZING GUIDE

Transcription:

UK HEP System Manager Meeting 30 June 3 July 2009 RAL 1

System Admin Staff: (new admin staff member) Simon George Barry Green (me) Govind Songara: Grid support (started around March) 2

Some General Data: ~ ~ ~ ~ ~ ~ ~ 11 PP Academic Staff 15 PP Research Staff 11 PP Research Students 20 desktops (SL4/SL5) 21 servers (SL4, Win 2003, Solaris 8/10) 5 laptops at disposition to all the Group 32 personal laptop for research purpose 3

Grid Systems: Faraday and Newton 4

Grid Systems (cont d): A. Faraday - Supplier: Compusys Location: RHUL PP computer room 5 years old support ended March 2009 75 worker nodes with Intel Xeon (i686) 3GHz x2 cpu/node, 1GB mem/node => 163 ksi2k - 3 storage nodes, total 8 TB RAID6. - o/s: RHEL3 - Upgrade plan: SL4 (since glite i686-sl5 not supported) 5

Grid Systems(cont'd): B. Newton - Supplier: Clustervision Installed December 07 Current location: hosted by Imperial College Relocating to new RHUL machine room August 09 50 worker nodes with 2x Intel E5345 2.3GHz (Clovertown) (total 8 cores), 16GB mem/node => 960 ksi2k - 15 storage nodes total 320 TB RAID6 - o/s: SL4 C. Next system: CIF/GridPP funded, planned for 2010. 6

Grid: Mini Test Cluster - From 8 nodes of Faraday built up test cluster with SL4 32 bit for training & testing purposes - Installing CreamCE, lcg-ce, bdii, torque, maui, worker nodes, etc. - Testing different monitoring and management tools like Ganglia, cfengine, nagios, MonAMI, etc... - Will use to pilot SL4 upgrade of full cluster 7

ATLAS analysis (Tier3) - Now: System for local interactive use and Grid UI. - Future: share of next cluster - Specified and ordered: - 8 core processing node with 16GB memory (Nehalem) - 60TB nfs scratch space for data - New dual CPU system - Vibration problems with 2TB disks 8

Power incident at IC Friday 16 May 2009 8am: Aircon failure in ICT machine room Room overheating rapidly Operators need to cut power Go down aisles pulling power connections (IEC-C13) How we saw it ssh prompt froze (Duncan and Simon were connected) Nagios errors RHUL network manager noticed firewall/router down Rapid response from our IC contact explaing the situation. 9

Power incident at IC (cont d) Following Monday, Govind, Duncan and Simon discovered the full horror & spent a few hours reconnecting power cables. Thank goodness for: Clustervision cable labeling & db on master node Rack with broken lock: door could not be opened so remained cabled as an example. Needless to say we are reviewing procedures. CV/APC integration makes it easy to test a node is connected in the right place by powering on/off. 10

Since last September: Ranking group web server (www.pp.rhul.ac.uk) - google search engine oriented - google analytics tool - distinguish web site areas - page title, meta tag, site-maps,... 11

Windows server - upgraded old win 2000 to win 2003 - authentication managed through Samba - eliminated active directory 12

Started migration from SL4 to SL5 - desktop: planned to complete by the end of August - server: rolling upgrade over next year, already underway - cern-castor-client: group not recognized in kickstart's %packages section for i386 installation=> install in a later step through yum groupinstall (is there any motivation?) 13

Migration to outsourced spam filter (webroot) - in progress - some problems due to specific Particle Physics situation: - own DNS service for @pp.rhul.ac.uk - own mail service 14

File server migration - /home nfs server - hardware: pc system, 2 Ethernet 1Gb (trunked), raid 5 - o.s.: Solaris 10 with zfs (zraid) - Backup system A. first file server - hardware: pc system, 2 Ethernet 1Gb (trunked), raid controller as jbod - o.s.: Solaris 10 with zfs (zraid2) B. second file server (off-site) located at new RHUL off-site machine room o/s & config to be decided - storage of dump file no more tapes 15

File server migration - Problems - slower performance if NVRAM is attached to raid controller We've tried to investigate but we didn't understand, anyway for our purpose it's good enough - switching to jbod raid controller feature: Raid controller couldn't reboot => upgraded firmware - reboot randomly doesn't work properly => open call with sun support 16

Plans for the future: - Complete desktop upgrades - Complete file server migration/new backup - Complete spam filter migration - Upgrade video room from fedora 8 to 11 - upgrade desktop hardware: - about 5 year old - dual core 64 bit: AMD Athlon 64 X2 5200+ - ram 2Gb 17

Email migration - We run a simple smtp & SSL imap server - MTA: exim - CC encourage us to move to their central Exchange2007 system - Our academic staff want a shared calendar integration solution => motivation to move to them - We are doing some trials now 18

Email migration open issues - migration will cost us several x annual effort running exim - policy and technical issues with forwarding from CC to external address - Anyone decent linux client for Exchange2007 over MAPI? - Variety of personal calendar solutions already in use, complex syncing may be needed 19