CHESS DAQ* Introduction



Similar documents
Solution Brief: Creating Avid Project Archives

High Availability Databases based on Oracle 10g RAC on Linux

Backup and Recovery 1

ZFS Backup Platform. ZFS Backup Platform. Senior Systems Analyst TalkTalk Group. Robert Milkowski.

Modification after decommission of AX100 SAN (RVN00-FILEDR) /3/2009 Deputy IT Operations Manager

Best practices for operational excellence (SharePoint Server 2010)

Ignify ecommerce. Item Requirements Notes

WHITE PAPER BRENT WELCH NOVEMBER

Disaster Recovery Strategies: Business Continuity through Remote Backup Replication

SAN TECHNICAL - DETAILS/ SPECIFICATIONS

Disk-to-Disk-to-Offsite Backups for SMBs with Retrospect

Every organization has critical data that it can t live without. When a disaster strikes, how long can your business survive without access to its

The safer, easier way to help you pass any IT exams. Exam : Storage Sales V2. Title : Version : Demo 1 / 5

Protecting Information in a Smarter Data Center with the Performance of Flash

Backup Exec 2012 Agents and Options

IBM Spectrum Scale vs EMC Isilon for IBM Spectrum Protect Workloads

Acronis Backup & Recovery for Mac. Acronis Backup & Recovery & Acronis ExtremeZ-IP REFERENCE ARCHITECTURE

risk of failure of or access to system applications. Backup software used for backup management is Bakbone Netvault : Backup.

Technical White Paper for the Oceanspace VTL6000

DATA BACKUP & RESTORE

Using Symantec NetBackup with Symantec Security Information Manager 4.5

CENTER FOR NUCLEAR WASTE REGULATORY ANALYSES

XenData Archive Series Software Technical Overview

FileCruiser Backup & Restoring Guide

Version: Page 1 of 5

The LCO Group. Backup & Disaster Recovery. Protect your data. Protect your business.

Backup and Recovery Solutions for Exadata. Ľubomír Vaňo Principal Sales Consultant

Archive Data Retention & Compliance. Solutions Integrated Storage Appliances. Management Optimized Storage & Migration

Powerful Management of Financial Big Data

Virtual Server and Storage Provisioning Service. Service Description

We look beyond IT. Cloud Offerings

Business Process Desktop: Acronis backup & Recovery 11.5 Deployment Guide

Disaster Recovery Checklist Disaster Recovery Plan for <System One>

Maurice Askinazi Ofer Rind Tony Wong. Cornell Nov. 2, 2010 Storage at BNL

ENTERPRISE DATA CENTER BACKUP AND RECOVERY OVERVIEW

C CMRR Computer Resources Overview

Microsoft Hyper-V chose a Primary Server Virtualization Platform

PrimeArray Data Storage Solutions Network Attached Storage (NAS) iscsi Storage Area Networks (SAN) Optical Storage Systems (CD/DVD)

DAS (Direct Attached Storage)

<Insert Picture Here> Refreshing Your Data Protection Environment with Next-Generation Architectures

Library Recovery Center

Perforce Backup Strategy & Disaster Recovery at National Instruments

Tandberg Data AccuVault RDX

The syslog-ng Store Box 3 F2

Overview of HPC Resources at Vanderbilt

WHITE PAPER PPAPER. Symantec Backup Exec Quick Recovery & Off-Host Backup Solutions. for Microsoft Exchange Server 2003 & Microsoft SQL Server

Tiered Data Protection Strategy Data Deduplication. Thomas Störr Sales Director Central Europe November 8, 2007

PADS GPFS Filesystem: Crash Root Cause Analysis. Computation Institute

U-LITE Network Infrastructure

OPTIONS / AGENTS DESCRIPTION BENEFITS

Unitrends Recovery-Series: Addressing Enterprise-Class Data Protection

Symantec NetBackup 5000 Appliance Series

Symantec Backup Exec 2014 TM Licensing Guide

MEDIAROOM. Products Hosting Infrastructure Documentation. Introduction. Hosting Facility Overview

Evolved Backup Features Computer Box 220 5th Ave South Clinton, IA

November 2013 Webinar Backup & Replication!

Symantec NetBackup for Microsoft SharePoint Server Administrator s Guide

Tier0 plans and security and backup policy proposals

IMPLEMENTING GREEN IT

Best Practices for Using Symantec Online Storage for Backup Exec

Symantec Backup Appliances

AlphaTrust PRONTO - Hardware Requirements

TELSTRA CLOUD SERVICES CLOUD INFRASTRUCTURE PRICING GUIDE SINGAPORE

Total Cost of Ownership Analysis

DEDUPLICATION BASICS

Implementing a Digital Video Archive Using XenData Software and a Spectra Logic Archive

Solution for private cloud computing

Open Source Mainframe Backup with Hercules

Turnkey Deduplication Solution for the Enterprise

NETGEAR SMB Storage Line Update and ReadyNAS 2100 Introduction

How To Backup An Exchange Server With 25Gb And More On A Microsoft Smartfiler With A Backup From A Backup To A Backup Point Set On A Flash Drive On A Pc Or Macbook Or Ipad On A Cheap Computer (For A

An Affordable Commodity Network Attached Storage Solution for Biological Research Environments.

Implementing an Automated Digital Video Archive Based on the Video Edition of XenData Software

DEDICATED MANAGED SERVER PROGRAM

About Backing Up a Cisco Unity System

Panasas at the RCF. Fall 2005 Robert Petkus RHIC/USATLAS Computing Facility Brookhaven National Laboratory. Robert Petkus Panasas at the RCF

EMC DATA DOMAIN OPERATING SYSTEM

White Paper. Recording Server Virtualization

Best Practices Guide. Symantec NetBackup with ExaGrid Disk Backup with Deduplication ExaGrid Systems, Inc. All rights reserved.

TELSTRA CLOUD SERVICES CLOUD INFRASTRUCTURE PRICING GUIDE AUSTRALIA

Performance characterization report for Microsoft Hyper-V R2 on HP StorageWorks P4500 SAN storage

CRIBI. Calcolo Scientifico e Bioinformatica oggi Università di Padova 13 gennaio 2012

SAN/iQ Remote Copy Networking Requirements OPEN iscsi SANs 1

Data Backup and Restore (DBR) Overview Detailed Description Pricing... 5 SLAs... 5 Service Matrix Service Description

Evaluation of Enterprise Data Protection using SEP Software

EMC AVAMAR. a reason for Cloud. Deduplication backup software Replication for Disaster Recovery

Transcription:

CHESS DAQ* Introduction Werner Sun (for the CLASSE IT group), Cornell University * DAQ = data acquisition https://en.wikipedia.org/wiki/data_acquisition

Big Data @ CHESS Historically, low data volumes: Less than 1 GB per user group Different storage solutions for each staff scientist But things are changing: Higher flux with undulators More advanced area detectors Maia detector Time-resolved experiments Up to 400 MB/s (burst), 1 TB/week NSF Data Sharing Policy Requires paradigm shift: Central data storage On-site data reduction/analysis Not unique to CHESS 2

CLASSE: History of Big Data CLEO experiment, 1979-2008 Collected billions of e+e- collision events ~200 people, ~20 institutions, 500+ publications Types of data: Raw data: ~20 kb/event Centrally reduced data: < 10 kb/event Simulated data (typically 10-20x real data) Personal analysis data Total 2000-2008: Raw data 80+ TB Reduced + simulated data ~40 TB Data analysis software developed by CLEO collaborators, analysis jobs run at Cornell CesrTA follows similar model to CLEO 3

Maia Detector Key driver of growth in CHESS data volume Designed by particle physicists at CSIRO and BNL Non-trivial data acquisition pipeline, tightly integrated with CLASSE computing Binary logger daemon (blogd) runs on the CLASSE central infrastructure. Receives data from detector (HYMOD) Writes data to disk (CHESS DAQ filesystem) Data processed with GeoPIXE on the CLASSE Compute Farm Compute Farm has direct access to data (no file transfers necessary) Average ~500 GB / user group / run 8+ TB collected since early 2014 4

CHESS DAQ Basics Unlike CLEO, many data sources at CHESS, many types of user groups All data producers (eventually) write to two dedicated filesystems: CHESS_DAQ: 30 TB CHESS_MAIA: 10 TB [ CHESS_RAW: 10 TB, read-only interface, restore point for archived data ] Additional filesystems for previous run(s) PREVIOUSDAQ (30 TB) and PREVIOUSMAIA (10 TB) Auxiliary (meta)data, processed data, user data CHESS_AUX: 10 TB Software packages for CHESS CHESS_OPT: 500 GB Anaconda, WIEN2k, GeoPIXE, tomopy, RAW (BioSAXS) See https://wiki.classe.cornell.edu/chess/datastoragemanagement 5

CHESS DAQ Data Flow A1 A2 B1 C1 D1 F1 F2 F3 G1 G2 10 Gbps Maia G3 } CHESS DAQ network read-write access for data producers CHESS_DAQ 30 TB CHESS_RAW CHESS_MAIA 10 TB 10 TB Archival tape library read-only access for data consumers CLASSE Public network { 1 Gbps CLASSE Desktops CLASSE Compute Farm GeoPIXE, MATLAB, FEpX SSH SFTP SCP HTTPS CHESS Users onsite or offsite CHESS_AUX 10 TB Nightly/monthly backup 6

Data Archival and Rotation At least two runs on disk at all times (current + previous) Archival and backup handled by Symantec NetBackup Can request files to be restored from tape Complies with NSF open access mandate and CHESS Data Management Plan Data can be kept on disk upon request CHESS_DAQ / CHESS_MAIA: After each run, two copies of data written to tape One copy stored off-site (disaster recovery copy) Data is archived indefinitely Before next run: Move data to PREVIOUSDAQ / PREVIOUSMAIA, make read-only Prepare CHESS_DAQ / CHESS_MAIA directories for next run, update softlinks CHESS_AUX and CHESS_OPT: Monthly full + nightly incremental backups 7

More Details Permissions CHESS_DAQ / CHESS_MAIA: Network-based permissions Can be written to only from CHESS DAQ subnet Analysis systems are on CLASSE Public network, read-only access CHESS_AUX and CHESS_OPT: Group-based permissions Open to anyone in the "chess" security group Currently includes staff, students, postdocs, users For CHESS users: data transfer kiosks at CHESS ops and reception Can be unlocked by anyone with a CLASSE account Some station/analysis computers are now CLASSE-managed 8

Data Access & CLASSE Computing Centralized computing based on Linux (currently Scientific Linux 6) Similar models at SLAC, FNAL, CERN, BNL, ESRF, APS, etc. CLASSE accounts: single sign-on for all central resources CLASSE Linux systems give direct access to: CHESS DAQ filesystems (data, software, project space) Compute Farm (includes MATLAB, Mathematica, python, etc.) Personal web pages Access: Log directly into Linux: ssh <username>@lnx201.classe.cornell.edu On Windows, use PuTTY or X2Go (see Computing homepage) cd /nfs/chess/raw Graphical file access from Windows: use Samba Connect to the CLASSE VPN (see Computing homepage) In Windows Explorer, browse to \\chesssamba.classe.cornell.edu\raw 9

CLASSE Connection Diagram NOT BACKED UP ON CLASSE SYSTEMS End-user onsite computers (incl. CHESS) SSH SFTP SCP Central Linux nodes lnx201 CLASSE compute farm } local disk local disk Linux Windows Samba remote file server Central Filesystems home directories user disk project areas CHESS_{DAQ, MAIA, RAW, AUX, OPT} local disk Mac } BACKED UP VPN RedRover/eduroam & offsite SSH SFTP SCP 10

Usage Patterns Example #1 Log into lnx201 Develop C++ or python analysis on CHESS_AUX Submit data analysis batch jobs to Compute Farm Write output to CHESS_AUX Further analysis with interactive jobs on Linux Files backed up by CLASSE IT Example #2 Log into lnx201 Run interactive data processing/ visualization Stage output to CHESS_AUX Use Samba/SFTP/ SCP to copy files to personal computer User responsible for backing up files Example #3 Use Samba/SFTP/ SCP to copy data to personal computer Analyze data locally User responsible for backing up files 11

CHESS DAQ Hardware 10 Gb storage area network (SAN) Enterprise-class storage devices, servers, network switches 2 x Infortrend iscsi storage devices Dual controllers, redundant power 24 x 4 TB drives/device configured in 2 x RAID 6 Total 128 TB usable Files served by CHESS DAQ cluster 5 x IBM x3550 M4 servers Each has 2 x 6-core Intel Xeon, 128 GB RAM Networking: 10 Gb IBM Blade switches New multi-mode optical fiber runs throughout CHESS Throughput: Up to 900-1000 MB/s writes (sustained 600 MB/s) Average 200-300 MB/s reads IBM tape library, total capacity 250 TB (uncompressed) 12

Current Status & Plans First deployment of CHESS DAQ in October 2014 24 TB written so far CHESS DAQ network available at all beamlines 7 of 11 beamlines are now writing to central storage Missing: B1, C1, and MacCHESS (except BioSAXS @ G1) Archived: 2014-3, 2015-1, plus some legacy data Plans for summer down: Archive 2015-2 data Investigate networking bottleneck at A2 with Pilatus 300K Organize network switches on experimental floor Convert B1 and C1 (possibly) Longer term: develop central software repository for CHESS 13

Additional Resources CHESS DAQ documentation: https://wiki.classe.cornell.edu/chess/datastoragemanagement Lyris mailing list: chess-daqadmin-l@cornell.edu Meeting next Monday 27 July, 2 PM, Wilson 374 CLASSE computing homepage: https://wiki.classe.cornell.edu/computing/webhome Linux @ CLASSE: https://wiki.classe.cornell.edu/computing/linuxsupport Central file systems: https://wiki.classe.cornell.edu/computing/datastewardship CLASSE Compute Farm: https://wiki.classe.cornell.edu/computing/computefarmintro 14