Standby Services or Reliance on Experts for Accelerator control?




Claude-Henri Sicard, AB/CO
ATC/ABOC Days 2007

Plan
- PS Complex Controls Standby service: case study
  - Organisation
  - Domain of intervention
  - Statistics of interventions
  - Tools
- Some comparative evaluation
- Conclusions

Organisation
- Team of 5 to 6 technicians
- Each member is on service for one week at a time
- Callable by CCC Operation 24/24 during the accelerator run (~32 weeks)
- Applies to standard CO controls (hardware/software), mostly front-end
- Manages spare parts
- Tracing: e-logbook, follow-ups
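The weekly rotation described above can be sketched as a simple round-robin. This is only an illustration: the technician names, the team size of six and the strict round-robin order are assumptions, not details given in the talk.

```python
from collections import Counter
from itertools import cycle, islice


def weekly_rota(members, n_weeks):
    """Assign one member per week, cycling through the team in order."""
    return list(islice(cycle(members), n_weeks))


# Hypothetical team of six technicians covering a ~32-week run.
team = [f"tech{i}" for i in range(1, 7)]
rota = weekly_rota(team, 32)

# Number of standby weeks each member carries over the run.
load = Counter(rota)
```

With six members and 32 weeks, each technician would be on call five or six weeks per run under this scheme.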

Requirements
For proper functioning, this service needs:
- Training: knowledge of geographical and technical details
- Regular information from CO sections (SW or HW updates, new installations, planned interruptions)
- Weekly contact with the Operations team (planned changes, follow-ups)

Domain of intervention
- Quality assurance: ensure that new systems put into operation are correctly delivered (files, startup), configured and documented
- Diagnostics: identify causes of failure within the different layers of the control system
- Procedures: non-destructive resets, set-ups
- Hardware interventions: identify and replace failing components, re-initialize systems
- Software: restore operational data, correct the configuration of front-end equipment or generic applications, FE startup sequences

Significant Numbers
Columns: Domain, Accel, FECs, Camac loops, Camac crates, 1553 loops, 1553 crates, GPIB crates, Devices, Description
PS    ADE   24 3 3 12 189 9 267     Antiproton Decelerator
PS    CPS   63 5 8 29 393 4 4453    CERN Proton Synchrotron & beam transfer lines
PS    LEI   32 5 58 1157            LEIR Low Energy Ion Ring
PS    LN3   1 6 16 1 427            Lead Ion Linac
PS    ISO   6 2 3 4 65              ISOLDE facility
PS    LIN   1 2 4 9 156 1 956       Proton Linac
PS    PSB   56 6 9 12 231 8 3648    Proton Synchrotron Booster
TEST  CTF   21 2 11 13 115 6228     CLIC Test Facility
TEST  REX   4 122                   REX facility
GEN   MCR   11 195                  Equipment common to several accelerators
Total PS Complex   237 18 35 88 1251 27 1993

Operational indicators
An ideal list would include:
- Number and duration of interventions outside working hours
- Effectiveness of interventions
- Beam time lost due to controls
- Manpower cost involved
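As an illustration of how such indicators could be derived from an intervention log, here is a minimal sketch. The record fields (`duration_h`, `outside_hours`, `solved`, `beam_lost_h`) and the sample entries are hypothetical and are not taken from the actual e-logbook; manpower cost is left out because it would need rate data that the talk does not give.

```python
from dataclasses import dataclass


@dataclass
class Intervention:
    duration_h: float    # time spent on the intervention, in hours
    outside_hours: bool  # True if it took place outside working hours
    solved: bool         # True if the piquet solved it without escalation
    beam_lost_h: float   # beam time lost due to the fault, in hours


def indicators(log):
    """Compute a few operational indicators from a list of interventions."""
    off = [i for i in log if i.outside_hours]
    return {
        "n_outside_hours": len(off),
        "hours_outside_hours": sum(i.duration_h for i in off),
        "effectiveness": sum(i.solved for i in log) / len(log) if log else 0.0,
        "beam_lost_h": sum(i.beam_lost_h for i in log),
    }


# Hypothetical sample log, for illustration only.
sample = [
    Intervention(1.5, True, True, 0.5),
    Intervention(2.0, False, True, 0.0),
    Intervention(1.0, True, False, 1.0),
    Intervention(0.5, False, True, 0.0),
]
result = indicators(sample)
```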

Main HW intervention areas
Front-end hardware: diagnosis/replacement of:
- Crate
- Power supply
- CPU
- CO standard cards (list TBD, test programs)
Timing:
- Distribution: repeaters, cables
- Reception (TG8/CTRx)
- Specific LTIM config check (no changes)
Communication / fieldbuses: diagnosis/replacement of CO-specific parts (bus drivers, repeaters, RTI cards):
- Mil1553
- FIP
- Ethernet (->PLCs)
(NOTE: fieldbus agents are normally not a CO responsibility)

[Chart: Piquet interventions by type (year 2006), total: 268. Categories: comm sw; ext-cond; network; timing config; cable; alim; gfa; sampler; applic; mil-1553; oasis; vme hw; auxil hw; central services; distrib timing; operation; vme alim; central tim; generic apps; data-table; camac; plc; power; server; fe on/off, reset; fe sw]

Interventions (2004/2006 figures)
Year:                        2004    2006
Yearly total (registered):   n/a     268
Outside working hours*:      43      35
Duration (h):                66      53
Mean duration:               1h30    1h30
Requiring follow-up:         ---     67
* not counting issues solved by phone/rlogin
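The mean duration can be cross-checked from the total duration and the number of interventions outside working hours. The sketch below does this, taking the two columns as the years 2004 and 2006 and assuming that the "Duration (h)" row is the total for those out-of-hours interventions; the helper names are illustrative.

```python
def mean_duration_minutes(total_hours, n_interventions):
    """Mean duration of one intervention, in minutes."""
    return total_hours * 60 / n_interventions


def as_h_mm(minutes):
    """Format a duration in minutes as e.g. '1h32'."""
    m = round(minutes)
    return f"{m // 60}h{m % 60:02d}"


# (total duration in hours, interventions outside working hours)
figures = {2004: (66, 43), 2006: (53, 35)}
means = {year: as_h_mm(mean_duration_minutes(h, n)) for year, (h, n) in figures.items()}
```

Both years come out at roughly one and a half hours per intervention.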

Tools & Technologies
- Shared knowledge via:
  - E-logbook intervention list
  - Web-based tips and tricks
- Currently a collection of separate diagnostic tools
  - Building a unified set of tools is a main work area this year
  - Could become usable by operators
- New/extended tools needed for the LHC domain: FIP, FESA, PLCs, industrial controls, 3-tier

CO operational responsibilities per section
All CO sections have activities related to Exploitation.
[Diagram of CO sections and their responsibilities. Labels: HT; Timing/Sequencing; Remote reset FE; FE; FESA; FE Installation; FIP; AP; Java Applic. FW; High level applications for LEIR, LHC; HC; IN; Servers; FE (via xcluc); CMW; LASER; DM; PVSS; IEPLC; CRYO; IS; MA; Test benches; MI; Logging; Configuration DB; ABCAM; LAYOUT DB; Test bench; Machine Interlocks]

Positive aspects of Standby service
- Guaranteed response and a single entry point for OP
- Acts as a link between sections (if the piquet is spread across the whole group)
- Pushes for better, common processes, documentation and diagnostic tools
- Gives piquet team members a wider view of the control system
- Globally more efficient in CERN resources (the CO piquet can solve basic FE problems for all equipment domains conforming to the standard)
- Better spread of the exploitation load among sections (reduces the risk of overloaded exploitation experts)

Negative aspects of Standby service
- Experts need to provide documents and non-expert tools
- May add delays if the piquet has to call an expert
- Piquet team members only productive 8% of their time
- CO sections (and Eq groups) delegate (drop?) some of their responsibility
- For efficiency, OP needs similar services from the main equipment groups

Pros and cons of on-call experts
- OP may need to call several numbers to get an answer
- OP must first diagnose the right domain
- More in-depth knowledge => faster repair
- No need for Eq. groups to provide centralised documentation or diagnostics
- One call list may cover all machines and domains

Concluding comments
- A coherent view (across machines) is needed for OP and other equipment groups (as aimed at by the Control Coordination Committee)
- The piquet team builds competence across control domains (within CO)
- Overlap between fields (outside CO):
  - CO piquet supports other groups (mostly PO): could be reduced
  - But in any case, efficient support needs some knowledge outside one's own field
- Should investigate possible common domains with OP/TI