Scalable Multi-Node Event Logging System for Ba Bar



Similar documents
The Google File System

Maurice Askinazi Ofer Rind Tony Wong. Cornell Nov. 2, 2010 Storage at BNL

High Availability Databases based on Oracle 10g RAC on Linux

SAN TECHNICAL - DETAILS/ SPECIFICATIONS

Xanadu 130. Business Class Storage Solution. 8G FC Host Connectivity and 6G SAS Backplane. 2U 12-Bay 3.5 Form Factor

Cloud Storage. Parallels. Performance Benchmark Results. White Paper.

How To Fix A Powerline From Disaster To Powerline

AFS Usage and Backups using TiBS at Fermilab. Presented by Kevin Hill

Panasas at the RCF. Fall 2005 Robert Petkus RHIC/USATLAS Computing Facility Brookhaven National Laboratory. Robert Petkus Panasas at the RCF

Ultra-Scalable Storage Provides Low Cost Virtualization Solutions

Configuring Apache Derby for Performance and Durability Olav Sandstå

CYBERNETICS. Virtualization of Tape Storage

The Microsoft Large Mailbox Vision

Site Recovery Manager Installation and Configuration

SYSTEM SETUP FOR SPE PLATFORMS

Installation Guide July 2009

Oracle Maximum Availability Architecture with Exadata Database Machine. Morana Kobal Butković Principal Sales Consultant Oracle Hrvatska

DELL TM PowerEdge TM T Mailbox Resiliency Exchange 2010 Storage Solution

NEXTGEN v5.8 HARDWARE VERIFICATION GUIDE CLIENT HOSTED OR THIRD PARTY SERVERS

Storage node capacity in RAID0 is equal to the sum total capacity of all disks in the storage node.

Red Hat Enterprise linux 5 Continuous Availability

ADVANCED NETWORK CONFIGURATION GUIDE

Protect Data... in the Cloud

XFS File System and File Recovery Tools

Identifying the Hidden Risk of Data Deduplication: How the HYDRAstor TM Solution Proactively Solves the Problem

Huawei N2000 NAS Storage System Technical White Paper

Business-centric Storage FUJITSU Hyperscale Storage System ETERNUS CD10000

RAID Level Descriptions. RAID 0 (Striping)

Perforce Backup Strategy & Disaster Recovery at National Instruments

Keys to Successfully Architecting your DSI9000 Virtual Tape Library. By Chris Johnson Dynamic Solutions International

Installing the Operating System with Express Setup

SUN ORACLE DATABASE MACHINE

PARALLELS CLOUD STORAGE

About Backing Up a Cisco Unity System

Practical issues in DIY RAID Recovery

PADS GPFS Filesystem: Crash Root Cause Analysis. Computation Institute

Informix Dynamic Server May Availability Solutions with Informix Dynamic Server 11

Avamar. Technology Overview

OVERVIEW. CEP Cluster Server is Ideal For: First-time users who want to make applications highly available

Building Storage Clouds for Online Applications A Case for Optimized Object Storage

How To Create A Multi Disk Raid

RFP - Equipment for the Replication of Critical Systems at Bank of Mauritius Tower and at Disaster Recovery Site. 06 March 2014

Get Success in Passing Your Certification Exam at first attempt!

LSKA 2010 Survey Report Job Scheduler

Certification Document macle GmbH Grafenthal-S1212M 24/02/2015. macle GmbH Grafenthal-S1212M Storage system

IBM ^ xseries ServeRAID Technology

Input / Ouput devices. I/O Chapter 8. Goals & Constraints. Measures of Performance. Anatomy of a Disk Drive. Introduction - 8.1

(Scale Out NAS System)

SUN ORACLE DATABASE MACHINE

Trends in Enterprise Backup Deduplication

The safer, easier way to help you pass any IT exams. Exam : Storage Sales V2. Title : Version : Demo 1 / 5

Planning Domain Controller Capacity

MedInformatix System Requirements

Price/performance Modern Memory Hierarchy

A SURVEY OF POPULAR CLUSTERING TECHNOLOGIES

Performance, Reliability, and Operational Issues for High Performance NAS Storage on Cray Platforms. Cray User Group Meeting June 2007

Technical White Paper. Symantec Backup Exec 10d System Sizing. Best Practices For Optimizing Performance of the Continuous Protection Server

QuickSpecs. HP Smart Array 5312 Controller. Overview

EMC Unified Storage for Microsoft SQL Server 2008

How To Make A Backup System More Efficient

Cost Effective Backup with Deduplication. Copyright 2009 EMC Corporation. All rights reserved.

Upgrading Small Business Client and Server Infrastructure E-LEET Solutions. E-LEET Solutions is an information technology consulting firm

Milestone Solution Partner IT Infrastructure MTP Certification Report Scality RING Software-Defined Storage

Virtualizing SQL Server 2008 Using EMC VNX Series and Microsoft Windows Server 2008 R2 Hyper-V. Reference Architecture

Avid ISIS

SanDisk ION Accelerator High Availability

Implementing Network Attached Storage. Ken Fallon Bill Bullers Impactdata

Cisco Active Network Abstraction Gateway High Availability Solution

Best Practices for Virtualised SharePoint

Long term retention and archiving the challenges and the solution

Chapter 12 Network Administration and Support

High Availability of the Polarion Server

POWER ALL GLOBAL FILE SYSTEM (PGFS)

Availability Digest. Stratus Avance Brings Availability to the Edge February 2009

RAID Manual. Edition. Trademarks V1.0 P/N: CK8-A5-0E

June Blade.org 2009 ALL RIGHTS RESERVED

Cloud Computing with Microsoft Azure

SCALABILITY AND AVAILABILITY

Software-defined Storage at the Speed of Flash

Symantec Backup Exec 2014 TM Licensing Guide

Volume Replication INSTALATION GUIDE. Open-E Data Storage Server (DSS )

Backup architectures in the modern data center. Author: Edmond van As Competa IT b.v.

How To Run Apa Hadoop 1.0 On Vsphere Tmt On A Hyperconverged Network On A Virtualized Cluster On A Vspplace Tmter (Vmware) Vspheon Tm (

How to Choose your Red Hat Enterprise Linux Filesystem

Frequently Asked Questions: EMC UnityVSA

The functionality and advantages of a high-availability file server system

External Storage 200 Series. User s Manual

The VMware Administrator s Guide to Hyper-V in Windows Server Brien Posey Microsoft

Step-by-Step Guide. to configure Open-E DSS V7 Active-Active iscsi Failover on Intel Server Systems R2224GZ4GC4. Software Version: DSS ver. 7.

RAID User Guide. Edition. Trademarks V1.0 P/N: C51GME0-00

A Practical Guide to Backup and Recovery of IBM DB2 for Linux, UNIX and Windows in SAP Environments Part 1 Backup and Recovery Overview

ORACLE RMAN DESIGN BEST PRACTICES WITH EMC DATA DOMAIN

Transcription:

A New Scalable Multi-Node Event Logging System for BaBar James A. Hamilton Steffen Luitz For the BaBar Computing Group

Original Structure Raw Data Processing Level 3 Trigger Mirror Detector Electronics Raw Data Processing Raw Data Processing Switch Level 3 Trigger Level 3 Trigger Event Logging Tape Archive 2

Performance Requirements Current 30 kilobyte events X 300 events/sec. = 9 megabytes/sec. Server limited to ~ 10 megabytes/sec. Projected by 2007 75 kilobyte events X 500 events/sec. = 37.5 megabytes/sec. 3

Options Could do a simple HW/SW upgrade But a new architecture can also get: Scalability in case we guess wrong Fault tolerance (RAID array not enough because of long disk reconstruction time) High burst rate to allow passthrough runs So we chose the new architecture 4

New Architecture Take advantage of multiple level 3 systems (currently 30) Write data in parallel to local disks on level 3 systems Merge to file server at end of run BaBar B-factory runs continuously with ~1 hour runs Uses trickle injection, so data-taking never stops Multiple file servers for finished runs 5

*, - / "! ( " )!( ( Data Paths Level 3 Trigger Level 3 Trigger 10 GB 10 GB Merge Server 1.6 TB Tape Archive Merge Server 1.6 TB. ( *! Level 3 Trigger 10 GB Local Disks '&!#" %$ " " "!#" +*! 6

Server Performance Each merge server is equipped with: 1.6 terabyte raid disk array at ~100 megabytes/sec. Gigabit ethernet Dual 3GHz P4 running Linux & XFS Merge operations average 80 megabytes/sec. transfer rate Over 1000 events/sec. At 75 kilobytes per event Use second server for fault tolerance 7

1 1 0 Level 3 System Performance Each level 3 system has: Two 18 gigabyte mirrored disks at ~50 megabytes/sec. Leaves ~10 gigabytes for event data enough for ~10 hours at current rates Need larger disks here already Gigabit ethernet to dataflow 100 megabit ethernet to merge servers Each merger takes 2.6 megabytes/sec. (80/30) 5 % of disk bandwidth 21 % of network bandwidth ~20,000 events/sec. burst, at 75 kilobytes/event Have done 3,000 events/sec. in production 8

0 Component Architecture Farm Node Level 3 Local Event File Source Server Merger Controller GUI Merged Event File Run Control Controller Merge Manager Run Control spawns Merge Server 9

0 Controller GUI 10

2 2 2 2 2 0 Failure Modes Loss of network connection Unrecoverable single disk error Processor failure Component crash or restart Must recover from these without loss of data 11

3 4 4 3 4 4 3 4 3 4 0 Merge Failure Recovery Repair Retry Replace failing drive (recover via RAID) or processor Continue taking data during repair Source data never deleted until merge succeeds Manual and automatic Scavenge Controller queries source servers and merge manager Redirect Sources can be renamed or omitted 12

2 5 5 5 2 0 Conclusions We have developed a new event logging system that is: Scalable meets projected throughput requirements for lifetime of BaBar Fault tolerant Supports high burst-rate data taking In successful production use for last two weeks of Run 4. Will continue in Run 5. 13

6 Scalability Timelines Data taking Run 1 Run 2 Run 3 Merge Server 1 Idle Run 1 Idle Run 3 Merge Server 2 Idle Run 2 14