MX Data (& Sample) Handling (& Tracking) at the ESRF Gordon Leonard ESRF Macromolecular Crystallography Group



Similar documents
Control Software at ESRF beamlines

Biomile Network - Design and Implementation

Development of a Standardized Data-Backup System for Protein Crystallography (PX-DBS) Michael Hellmig, BESSY GmbH

Cryo SAXS sample environment Project update 02/2007

CHESS DAQ* Introduction

Beamline Automation at the APS

Data-analysis scheme and infrastructure at the X-ray free electron laser facility, SACLA

User Autonomy Darren Spruce. Head of Kontrols & IT services (KITS)

CRISP WP18. Requirements for Data Recording to Storage Media. CRISP Milestone 3. CRISP_MS3.doc

Real Time Analysis of Advanced Photon Source Data

CARPS: An integrated proposal and data collection system

Preparing for remote data collection at NE-CAT

Data Sheet FUJITSU Storage ETERNUS LT260 Tape System

e-science Technologies in Synchrotron Radiation Beamline - Remote Access and Automation (A Case Study for High Throughput Protein Crystallography)

Linking raw data with scientific workflow and software repository: some early

Datasheet Fujitsu ETERNUS LT20 S2

Automation and Remote Synchrotron Data Collection

Backup & Restore. Backing Up and Restoring STX Databases For Non-OLS Subscribers

How To Back Up A Computer To A Backup On A Hard Drive On A Microsoft Macbook (Or Ipad) With A Backup From A Flash Drive To A Flash Memory (Or A Flash) On A Flash (Or Macbook) On

BIG data big problems big opportunities Rudolf Dimper Head of Technical Infrastructure Division ESRF

Remote Data Collection and Analysis Tom Worlton Argonne National Laboratory

How To Backup An Exchange Server With 25Gb And More On A Microsoft Smartfiler With A Backup From A Backup To A Backup Point Set On A Flash Drive On A Pc Or Macbook Or Ipad On A Cheap Computer (For A

Administrative Features

High-Capacity, High-Throughput: New Satellite Systems Meet Big Oil s Big Data

AFS Usage and Backups using TiBS at Fermilab. Presented by Kevin Hill

AFS Usage and Backups using TiBS at Fermilab. Presented by Kevin Hill

Eurobackup PRO Exchange Databases Backup Best Practices

Storage of the Experimental Data at SOLEIL. Computing and Electronics

Protecting SQL Server Databases Software Pursuits, Inc.

UW-IT Backups & Archives

2012 CASE STUDY ON QR CODE (2D) BASED ASSET INFORMATION MANAGEMENT SOLUTION (R- AIMS)

Using the NDMP File Service for DMA- Driven Replication for Disaster Recovery. Hugo Patterson

Data acquisition/analysis software for Diffraction Beam Lines at SSRL SPECPLOT. spec ICS. WebSpec. WxDiff

research papers 1162 doi: /s Acta Cryst. (2006). D62, Introduction

XenData Product Brief: SX-550 Series Servers for LTO Archives

FTP Use. Internal NPS FTP site instructions using Internet Explorer:

IBM TSM DISASTER RECOVERY BEST PRACTICES WITH EMC DATA DOMAIN DEDUPLICATION STORAGE

- Brazoria County on coast the north west edge gulf, population of 330,242

Innovative, High-Density, Massively Scalable Packet Capture and Cyber Analytics Cluster for Enterprise Customers

CONSTRUCTION / SERVICE BILLING SYSTEM SPECIFICATIONS

TEAL: Transparent Archiving Library

Deploying Riverbed wide-area data services in a LeftHand iscsi SAN Remote Disaster Recovery Solution

Basics Webmail versus Internet Mail

Sentinel DX4000 8TB Storage Server - 4X2TB Drives included, 2x Gigabit Ethernet Ports, 2x USB 3.0 Ports, Secure Remote Access

Diamond Macromolecular Crystallography Beamlines: An Update. Ralf Flaig

611 Tradewind Dr. Suite 100, Ancaster ON, L9G 4V5 (905) ext 244

XenData Video Edition. Product Brief:

SSRL Remote Synchrotron Access Workshop, February 9, University of Melbourne.

The safer, easier way to help you pass any IT exams. Exam : Storage Sales V2. Title : Version : Demo 1 / 5

Network Performance Optimisation and Load Balancing. Wulf Thannhaeuser

Data Storage at IBT. Topics. Storage, Concepts and Guidelines

ENTERPRISE DATA CENTER BACKUP AND RECOVERY OVERVIEW

Data Storage Solutions

Sawmill Log Analyzer Best Practices!! Page 1 of 6. Sawmill Log Analyzer Best Practices

Data Data Everywhere, We are now in the Big Data era

Product Brief: XenData X2500 LTO-6 Digital Video Archive System

Data Sheet FUJITSU Storage ETERNUS LT20 S2 Tape System

Optimizing LTO Backup Performance

Evaluation of Enterprise Data Protection using SEP Software

Tier3 Network Issues. Richard Carlson May 19, 2009

Drobo How-To Guide. What You Will Need. Use Drobo and SmartSync for Site-to-Site Synchronization

Tape Storage at Sun. Dwayne Edling Manager Archive Development Sun Microsystems, Inc.

Backup and Recovery 1

Intelligent disaster recovery. Dell DL backup to Disk Appliance powered by Symantec

APS Windows Servers Argonne National Laboratory is managed by The University of Chicago for the U.S. Department of Energy

EMC arhiviranje. Lilijana Pelko Primož Golob. Sarajevo, Copyright 2008 EMC Corporation. All rights reserved.

Implementing a Digital Video Archive Using XenData Software and a Spectra Logic Archive

Backup to attached Tape Library using Open-E DSS V6

Solution Brief: Creating Avid Project Archives

OPTIMIZING EXCHANGE SERVER IN A TIERED STORAGE ENVIRONMENT WHITE PAPER NOVEMBER 2006

City of Montpelier Requests for Proposal. Purchase of Onsite and Remote File / Data Backup Devices

EMC CLARiiON Backup Storage Solutions

The ANKA Archiving System

Forschungszentrum Karlsruhe in der Helmholtz-Gemeinschaft. dcache Introduction

We look beyond IT. Cloud Offerings

MX Breakout Session. Bob Fischetti. Workshop on Science Opportunities with an MBA Lattice Advanced Photon Source. October 21-22, 2013

Tissue Culture 1 Cell/ Microplates 2 HTS- 3 Immunology/ HLA 4 Microbiology/ Bacteriology Purpose Beakers 5 Tubes/Multi-

EMC Backup and Recovery for Microsoft SQL Server 2008 Enabled by EMC Celerra Unified Storage

Storage Techniques for RHIC Accelerator Data*

Functionality. Overview. Broad Compatibility. 125 TB LTO Archive System Fast Robotics for Restore Intensive Applications

GroupWise Archiving. For additional help on setting up an archive, please contact TSC Help Desk at ext

Integrating SQL LiteSpeed in Your Existing Backup Infrastructure

DNS (Domain Name System) is the system & protocol that translates domain names to IP addresses.

White paper 200 Camera Surveillance Video Vault Solution Powered by Fujitsu

Turnkey Deduplication Solution for the Enterprise

Joining. Domain. Windows XP Pro

A pretty picture, or a measurement? Retinal Imaging

Guide for SMBs. Seagate NAS: Finding Your Best Storage Fit A Network Attached Storage Decision Guide for Your Small-to-Midsize Business

Using HP StoreOnce Backup Systems for NDMP backups with Symantec NetBackup

LTO4 Hardware Encryption Best Practices. By Vic Ludlam Distributed by Dynamic Solutions International

Operating Systems and Networks Sample Solution 1

A STORAGE SYSTEM JUST LIKE THE ONE YOU HAVE TODAY A STORAGE SYSTEM NOTHING LIKE THE ONE YOU HAVE TODAY.

Storage in Database Systems. CMPSCI 445 Fall 2010

How To Manage The Sas Metadata Server With Ibm Director Multiplatform

Scalar i500. The Intelligent Midrange Library Platform FEATURES AND BENEFITS

Data Analysis Sequencing - EDNA. Olof Svensson Data Analysis Unit ISDD ESRF

School Management System

MEDIAROOM. Products Hosting Infrastructure Documentation. Introduction. Hosting Facility Overview

OPERATIONAL CAPABILITY TECHNOLOGY QUESTIONNAIRE

Backup to attached Tape Drive using Open-E DSS V6

Transcription:

MX Data (& Sample) Handling (& Tracking) at the ESRF Gordon Leonard ESRF Macromolecular Crystallography Group Slide: 1 Gordon Leonard, 3-Way Meeting, APS, March 2008

Data production at the ESRF Data Storage 10 years evolution Yearly Data Creation on NICE 350 300 Data Production in 2007: 300 TB (MX BLs ~45TB) 250 200 150 100 50 TBytes (1TB = 2 40 Bytes)) Series1 The data rate has increased 260-fold over the last ten years! Stored on central servers not on individual beam-lines 1997 1998 1999 2000 2001 2002 2003 2004 2005 2006 2007 Year 0 Slide: 2

Data from ESRF MX Beamlines in 2007 # images 07 3 fixed λ BL : ID14-1 419,000 ID14-2 335,000 ID14-3* 223,000 3 MAD BL : ID14-4* 290,000 ID23-1 907,000 ID29 720,000 1 μfocus BL : ID23-2 382,000 TOTAL 3,276,000 High throughput. During data collection images produced every few seconds. Need ability to process/analyse/track these as they are produced Slide: 3 Gordon Leonard, 3-Way Meeting, APS, March 2008

Data Handling at the MX Beamlines Beamline Networked Interactive Computing Environment 1 Gbps Internet User data Q315r 6kx6k pixels 35 MB/s 9x1Gbps 10 Gbps 40 Gbps Computing Clusters IHR data FC attachments Slide: 4 30 days 100 days 6 month Tape backup Allows real-time data reduction Different servers for different BLs. Only 10 day live storage for MX users. Longer term archiving is left to the users. Situation O.K. for now Network Attached RAID Storage Central Building Computer Room 150 m 2, 75 kw Tape Robot (archiving) CTRM Computer Room 120 m 2, 75 kw

Pixel detectors may change things PILATUS 6M detector Digital gating will allow shutter-less data collection. Fine phi-slicing will become routine. Image production rates will soar Will no longer be able to send images to central storage during data collection (?) Pixel size 172 x 172 µm 2 Readout time <3.6 ms Framing rate 10 Hz http://pilatus.web.psi.ch/pilatus.htm Slide: 5 Gordon Leonard, 3-Way Meeting, APS, March 2008

Data and sample tracking why? MX BAGs collect ~25,000 data sets/year also ~90,000 screening sets BAG User (MX-999) ID14-1 ID14-2 ID14-3 ID14-4 ID29 ID23-1 ID23-2 Where s my data? Where did I collect it? When did I collect it? Slide: 6 Gordon Leonard, 3-Way Meeting, APS, March 2008

Remote access makes this more necessary Slide: 7 Gordon Leonard, 3-Way Meeting, APS, March 2008

MX data & sample tracking at the ESRF SC EH DNA MxCuBE GUI (all MX beam-lines) Database Slide: 8 Gordon Leonard, 3-Way Meeting, APS, March 2008

The ISpyB LIMS http://ispyb.esrf.fr (Remote) user ESRF Pre-frozen sample samplesheets Web services User Office Database User Office, Safety FedEx Information about proteins, dewars, samples, experiments to be performed LIMS web pages Reports about experiments, results, samples Experiment Database ESRF staff / on site user Beamline DNA MXCube Slide: 9

More about ISpyB Slide: 10

Barcodes make sample tracking easier European SPINE standard Full specifications at http://www.spineurope.org, protocols menu 10 Character identification code: - DataMatrix on the base of Caps - Clear code near the DatatMatrix DataMatrices read by SC3 sample changer Hampton Datamatrices for SC3 pucks as well Also need to consider tracking dewars inside & outside ESR. 850 dewars / year (2007). Need to make sure they re delivered to the right beamline. Will be done via ISPyB Hampton Slide: 11

Dewar tracking is not trivial Hampton Hampton Slide: 12

MX mail-in data collection at the ESRF MxPress activity in 2007 by quarter with totals The ESRF introduced the MXpress service in 2002 offering companies the option of a mail in data collection service. MXpress offers clients an efficient & cost effective alternative to sending their staff to the synchrotron for data collection sessions that often lasts only a few hours. Currently, MXpress has a client base of around 20 companies with over 1000 samples being processed each year. # samples screened 368 595 580 516 ~2,500 data set 94 154 151 196 595 data set special 32 36 29 38 135 Evolution of MXPress Service 700 600 500 The provision of an efficient Mxpress service has been one of the main driving forces for the automation of the MX beamlines at the ESRF. MXpress has helped with the testing and development of sample changing robots, the automatic data collection system DNA & the ISPyB LIMS which is critical for sample tracking and subsequent reporting of results. All of these are now in routine use at the ESRF. unit 400 300 200 100 number of sample Data set Data set spec Puck screen Fluo scan shift Beamtime 0 Q1 Q2 Q3 Q4 Q1 Q2 Q3 Q4 Q1 Q2 Q3 Q4 2005 2006 2007 Slide: 13

Summary MX data handling & storage at ESRF sufficient for current needs Needs will change if (when?) pixel detectors are installed on the BLs Initial storage at beam-lines? Faster computers for on-site processing & analysis? ~25,000 datasets and ~90,000 screening sets per year Data & sample tracking becomes vital ISPyB database/electronic logbook indispensible for this Barcodes are a good way of ensuring proper tracking Individual samples Pucks containing many samples Dewars containing several pucks Slide: 14