New Design and Layout Tips For Processing Multiple Tasks
|
|
- Marlene Fletcher
- 3 years ago
- Views:
Transcription
1 Novel, Highly-Parallel Software for the Online Storage System of the ATLAS Experiment at CERN: Design and Performances Tommaso Colombo a,b Wainer Vandelli b a Università degli Studi di Pavia b CERN IEEE Real-Time Conference, 12 June 2012 T. Colombo, W. Vandelli (Pavia U., CERN) ATLAS Highly-parallel Data-Logging System Real-Time Conference, 12 Jun / 13
2 The ATLAS Trigger & DAQ System Event rates design (2012 peak) 40 MHz (20 MHz) 2.5 µs 75 khz (~65 khz) Level 1 Trigger Custom Hardware Regions of Interest Level 1 Accept Calo/ Muon FE DAQ Other FE Other FE ROD ROD ROD Detector Readout Data rates design (2012 peak) ATLAS Event 1.5 MB/25 ns (1.6 MB/50ns) ~40 ms (~45 ms) 3 khz (~5.5 khz) ~4 s (~1 s) ~200 Hz (~800 Hz) Level 2 Event Filter ~5000 dsa Processing Unit ~5000 Processing Unit ROI data L2 Accept Full events EF Accept Data Collection Network Event Filter Network CERN Permanent Storage ~150 Readout System ~100 Event Builder 5 Data Logger Data Flow ~110 GB/s (~105 GB/s) ~4.5 GB/s (~9 GB/s) ~300 MB/s (~1100 MB/s) T. Colombo, W. Vandelli (Pavia U., CERN) ATLAS Highly-parallel Data-Logging System Real-Time Conference, 12 Jun / 13
3 The current Data Logging system: overview Purpose 5 PCs receive data from the Event Filter system and write it to local disks. Each event is: analyzed to determine the tags applied by the Event Filter trigger processed (e.g. compressed) written to appropriate file(s) according to the tags Details The event tags are determined by the trigger algorithms based on the event content To facilitate off-line data distribution, every event is written to multiple files, one per each of its tags File checksum is calculated while writing CPU-intensive! T. Colombo, W. Vandelli (Pavia U., CERN) ATLAS Highly-parallel Data-Logging System Real-Time Conference, 12 Jun / 13
4 The current Data Logging system: overview Purpose 5 PCs receive data from the Event Filter system and write it to local disks. Each event is: analyzed to determine the tags applied by the Event Filter trigger processed (e.g. compressed) written to appropriate file(s) according to the tags Details The event tags are determined by the trigger algorithms based on the event content To facilitate off-line data distribution, every event is written to multiple files, one per each of its tags File checksum is calculated while writing CPU-intensive! T. Colombo, W. Vandelli (Pavia U., CERN) ATLAS Highly-parallel Data-Logging System Real-Time Conference, 12 Jun / 13
5 The current Data Logging system: limitations The current Data Logger implementation is essentially single-threaded: multiple threads receive the events from the EF a single thread does the processing and writing This design is very unlikely to scale: maximum processing throughput: ~ 500 MB/s comparable with I/O (network and disks) limits Network I/O Thread Events Q. ev ev ev put event get event Processing and Writing Thread It is a major blocker for the addition of new features requiring more CPU power than a single core can provide due to this event-level data compression currently needs to be performed off-line, as an additional step T. Colombo, W. Vandelli (Pavia U., CERN) ATLAS Highly-parallel Data-Logging System Real-Time Conference, 12 Jun / 13
6 The current Data Logging system: limitations The current Data Logger implementation is essentially single-threaded: multiple threads receive the events from the EF a single thread does the processing and writing This design is very unlikely to scale: maximum processing throughput: ~ 500 MB/s comparable with I/O (network and disks) limits Network I/O Thread Events Q. ev ev ev put event get event Processing and Writing Thread It is a major blocker for the addition of new features requiring more CPU power than a single core can provide due to this event-level data compression currently needs to be performed off-line, as an additional step T. Colombo, W. Vandelli (Pavia U., CERN) ATLAS Highly-parallel Data-Logging System Real-Time Conference, 12 Jun / 13
7 The current Data Logging system: limitations The current Data Logger implementation is essentially single-threaded: multiple threads receive the events from the EF a single thread does the processing and writing This design is very unlikely to scale: maximum processing throughput: ~ 500 MB/s comparable with I/O (network and disks) limits Network I/O Thread Events Q. ev ev ev put event get event Processing and Writing Thread It is a major blocker for the addition of new features requiring more CPU power than a single core can provide due to this event-level data compression currently needs to be performed off-line, as an additional step T. Colombo, W. Vandelli (Pavia U., CERN) ATLAS Highly-parallel Data-Logging System Real-Time Conference, 12 Jun / 13
8 New design: general considerations The data processing workload is embarrassingly parallel: the incoming data are already divided in events Constraint The raw data file format is strictly sequential It is impossible to do concurrent writes to the same file Necessary to calculate the overall file checksum before writing to disk, aiding in the detection of write errors Keeps the format complexity to a minimum Multiple events can be written to different raw data files concurrently, but no more than one event can be written to each data file at once T. Colombo, W. Vandelli (Pavia U., CERN) ATLAS Highly-parallel Data-Logging System Real-Time Conference, 12 Jun / 13
9 New design: general considerations The data processing workload is embarrassingly parallel: the incoming data are already divided in events Constraint The raw data file format is strictly sequential It is impossible to do concurrent writes to the same file Necessary to calculate the overall file checksum before writing to disk, aiding in the detection of write errors Keeps the format complexity to a minimum Multiple events can be written to different raw data files concurrently, but no more than one event can be written to each data file at once T. Colombo, W. Vandelli (Pavia U., CERN) ATLAS Highly-parallel Data-Logging System Real-Time Conference, 12 Jun / 13
10 New design: general considerations The data processing workload is embarrassingly parallel: the incoming data are already divided in events Constraint The raw data file format is strictly sequential It is impossible to do concurrent writes to the same file Necessary to calculate the overall file checksum before writing to disk, aiding in the detection of write errors Keeps the format complexity to a minimum Multiple events can be written to different raw data files concurrently, but no more than one event can be written to each data file at once T. Colombo, W. Vandelli (Pavia U., CERN) ATLAS Highly-parallel Data-Logging System Real-Time Conference, 12 Jun / 13
11 New design: idea Split the workload in tasks For each event: one task does the processing multiple tasks do the writing (for each tag, a task writes the event to the corresponding file). Use a single thread pool to execute the tasks Schedule the tasks cleverly to avoid locking At any given time: any number of processing tasks can run for each raw data file, only one task writing to it can run T. Colombo, W. Vandelli (Pavia U., CERN) ATLAS Highly-parallel Data-Logging System Real-Time Conference, 12 Jun / 13
12 New design: idea Split the workload in tasks For each event: one task does the processing multiple tasks do the writing (for each tag, a task writes the event to the corresponding file). Use a single thread pool to execute the tasks Schedule the tasks cleverly to avoid locking At any given time: any number of processing tasks can run for each raw data file, only one task writing to it can run T. Colombo, W. Vandelli (Pavia U., CERN) ATLAS Highly-parallel Data-Logging System Real-Time Conference, 12 Jun / 13
13 New design: idea Split the workload in tasks For each event: one task does the processing multiple tasks do the writing (for each tag, a task writes the event to the corresponding file). Use a single thread pool to execute the tasks Schedule the tasks cleverly to avoid locking At any given time: any number of processing tasks can run for each raw data file, only one task writing to it can run T. Colombo, W. Vandelli (Pavia U., CERN) ATLAS Highly-parallel Data-Logging System Real-Time Conference, 12 Jun / 13
14 New design: finally, a diagram! Events Raw file manager File eγ Q. File μ Q. File jets Q. PT 14 PT 13 PT 12 WT 7-eγ WT 6-eγ WT 5-eγ WT 7-μ WT 6-μ WT 6-jets WT 5-jets schedule schedule only one notify completion Execution Q. PT 11 PT 10 WT 2-jets PT 9 PT nn WT nn-aa Processing task for event nn Writing task for stream aa of event nn enqueue run Thread Thread Thread WT 8-jets WT 1-eγ WT 1-μ PT 8 WT 8-μ T. Colombo, W. Vandelli (Pavia U., CERN) ATLAS Highly-parallel Data-Logging System Real-Time Conference, 12 Jun / 13
15 New design: implementation with Threading Building Blocks Events Raw file manager File eγ Q. File μ Q. File jets Q. PT 14 PT 13 PT 12 WT 7-eγ WT 6-eγ WT 5-eγ WT 7-μ WT 6-μ WT 6-jets WT 5-jets schedule schedule only one notify completion Execution Q. PT 11 PT 10 Writing task The new design was implemented WT 2-jets WT nn-aa for stream using aa of event nn (and inspired PT by) 9 the open source C++ library Thread run Thread PT nn Intel Threading Building Blocks Thread Processing task for event nn WT 8-jets enqueue WT 1-eγ WT 1-μ PT 8 WT 8-μ T. Colombo, W. Vandelli (Pavia U., CERN) ATLAS Highly-parallel Data-Logging System Real-Time Conference, 12 Jun / 13
16 New design: implementation with Threading Building Blocks Events PT 14 PT 13 PT 12 File eγ Q. WT 7-eγ File μ Q. Task based multi-threading Raw file manager File jets Q. WT 6-eγ WT 7-μ WT 6-jets WT 5-eγ WT 6-μ WT 5-jets schedule Task execution queue schedule only one notify completion Execution Q. PT 11 PT 10 WT 2-jets PT 9 PT nn WT nn-aa Processing task for event nn Writing task for stream aa of event nn enqueue Thread pool run Thread Thread Thread WT 8-jets WT 1-eγ WT 1-μ PT 8 WT 8-μ T. Colombo, W. Vandelli (Pavia U., CERN) ATLAS Highly-parallel Data-Logging System Real-Time Conference, 12 Jun / 13
17 New design: implementation with Threading Building Blocks Events Raw file manager File eγ Q. File μ Q. File jets Q. PT 14 PT 13 PT 12 WT 7-eγ WT 6-eγ WT 5-eγ WT 7-μ WT 6-μ WT 6-jets WT 5-jets schedule Concurrent queue notify completion Thread Execution Q. PT 11 PT 10 WT 2-jets PT 9 run Thread schedule only one Concurrent hash map PT nn WT nn-aa Thread Processing task for event nn Writing task for stream aa of event nn WT 8-jets enqueue Thread-safe containers optimized for concurrency WT 1-eγ WT 1-μ PT 8 WT 8-μ T. Colombo, W. Vandelli (Pavia U., CERN) ATLAS Highly-parallel Data-Logging System Real-Time Conference, 12 Jun / 13
18 Performance evaluation: resource utilization The new implementation was tested and compared with the old one in the current production system in a testbed with older hardware A single Data Logger machine was operated at saturation The dataset consisted of actual event data, with 1 to 4 tags assigned to each event changing the number of tags per event changes the number of files the Data Logger has to write each event to does not change the required network bandwidth Testbed Data Logger PC 2x dual-core Xeon GB RAM 3x 3ware RAID5 array 2x GbE NIC Production Data Logger PC 2x quad-core Xeon E GB RAM 3x Adaptec RAID5 array 2x GbE NIC T. Colombo, W. Vandelli (Pavia U., CERN) ATLAS Highly-parallel Data-Logging System Real-Time Conference, 12 Jun / 13
19 Performance evaluation: resource utilization The new implementation was tested and compared with the old one in the current production system in a testbed with older hardware A single Data Logger machine was operated at saturation The dataset consisted of actual event data, with 1 to 4 tags assigned to each event changing the number of tags per event changes the number of files the Data Logger has to write each event to does not change the required network bandwidth T. Colombo, W. Vandelli (Pavia U., CERN) ATLAS Highly-parallel Data-Logging System Real-Time Conference, 12 Jun / 13
20 Performance evaluation: resource utilization The new implementation was tested and compared with the old one in the current production system in a testbed with older hardware A single Data Logger machine was operated at saturation The dataset consisted of actual event data, with 1 to 4 tags assigned to each event changing the number of tags per event changes the number of files the Data Logger has to write each event to does not change the required network bandwidth T. Colombo, W. Vandelli (Pavia U., CERN) ATLAS Highly-parallel Data-Logging System Real-Time Conference, 12 Jun / 13
21 Performance evaluation: resource utilization The new implementation was tested and compared with the old one in the current production system in a testbed with older hardware A single Data Logger machine was operated at saturation The dataset consisted of actual event data, with 1 to 4 tags assigned to each event changing the number of tags per event changes the number of files the Data Logger has to write each event to does not change the required network bandwidth T. Colombo, W. Vandelli (Pavia U., CERN) ATLAS Highly-parallel Data-Logging System Real-Time Conference, 12 Jun / 13
22 Performance evaluation: resource utilization The new implementation was tested and compared with the old one in the current production system in a testbed with older hardware A single Data Logger machine was operated at saturation The dataset consisted of actual event data, with 1 to 4 tags assigned to each event changing the number of tags per event changes the number of files the Data Logger has to write each event to does not change the required network bandwidth T. Colombo, W. Vandelli (Pavia U., CERN) ATLAS Highly-parallel Data-Logging System Real-Time Conference, 12 Jun / 13
23 Performance evaluation: resource utilization The new implementation was tested and compared with the old one in the current production system in a testbed with older hardware A single Data Logger machine was operated at saturation The dataset consisted of actual event data, with 1 to 4 tags assigned to each event changing the number of tags per event changes the number of files the Data Logger has to write each event to does not change the required network bandwidth T. Colombo, W. Vandelli (Pavia U., CERN) ATLAS Highly-parallel Data-Logging System Real-Time Conference, 12 Jun / 13
24 Performance evaluation: resource utilization The hard limit on the throughput of a single Data Logger is given by the network bandwidth: 2 Gb/s 250 MB/s Old single-threaded implementation Can operate at network saturation only for a single tag per event Above 2 tags per event, the load generated by its single thread exceeds what a single CPU core can take The throughput decreases accordingly T. Colombo, W. Vandelli (Pavia U., CERN) ATLAS Highly-parallel Data-Logging System Real-Time Conference, 12 Jun / 13
25 Performance evaluation: resource utilization The hard limit on the throughput of a single Data Logger is given by the network bandwidth: 2 Gb/s 250 MB/s New multi-threaded implementation The throughput is almost unaffected by the load Its 4 threads spread the workload on the 4 CPU cores: none of them uses more than 60% of a core Leaves plenty of headroom for additional CPU intensive processing compression T. Colombo, W. Vandelli (Pavia U., CERN) ATLAS Highly-parallel Data-Logging System Real-Time Conference, 12 Jun / 13
26 Performance evaluation: scalability (in testbed) On-line event compression (with zlib) radically changes the landscape The time spent compressing events (~ 50 ms per MB) dominates the rest of the processing (~ 2 ms per MB) throughput is much lower all workloads saturate the CPU One can examine the throughput as a function of the number of CPU cores (threads) used scaling is (almost) linear T. Colombo, W. Vandelli (Pavia U., CERN) ATLAS Highly-parallel Data-Logging System Real-Time Conference, 12 Jun / 13
27 Performance evaluation: scalability (in testbed) On-line event compression (with zlib) radically changes the landscape The time spent compressing events (~ 50 ms per MB) dominates the rest of the processing (~ 2 ms per MB) throughput is much lower all workloads saturate the CPU One can examine the throughput as a function of the number of CPU cores (threads) used scaling is (almost) linear T. Colombo, W. Vandelli (Pavia U., CERN) ATLAS Highly-parallel Data-Logging System Real-Time Conference, 12 Jun / 13
28 Performance evaluation: scalability (in production) T. Colombo, W. Vandelli (Pavia U., CERN) ATLAS Highly-parallel Data-Logging System Real-Time Conference, 12 Jun / 13
29 Performance evaluation: scalability (in production) T. Colombo, W. Vandelli (Pavia U., CERN) ATLAS Highly-parallel Data-Logging System Real-Time Conference, 12 Jun / 13
30 Conclusions A novel design for the ATLAS Data Logging application was implemented and thoroughly tested The performance of the new software is very satisfactory: taps into the full power of modern CPUs future-proofs the Data Logger enables the addition of computationally-intensive features It will be one of the essential components of the evolved system currently being developed to meet the challenges of LHC data-taking in 2014 and beyond T. Colombo, W. Vandelli (Pavia U., CERN) ATLAS Highly-parallel Data-Logging System Real-Time Conference, 12 Jun / 13
31 Conclusions A novel design for the ATLAS Data Logging application was implemented and thoroughly tested The performance of the new software is very satisfactory: taps into the full power of modern CPUs future-proofs the Data Logger enables the addition of computationally-intensive features It will be one of the essential components of the evolved system currently being developed to meet the challenges of LHC data-taking in 2014 and beyond T. Colombo, W. Vandelli (Pavia U., CERN) ATLAS Highly-parallel Data-Logging System Real-Time Conference, 12 Jun / 13
32 Conclusions A novel design for the ATLAS Data Logging application was implemented and thoroughly tested The performance of the new software is very satisfactory: taps into the full power of modern CPUs future-proofs the Data Logger enables the addition of computationally-intensive features It will be one of the essential components of the evolved system currently being developed to meet the challenges of LHC data-taking in 2014 and beyond T. Colombo, W. Vandelli (Pavia U., CERN) ATLAS Highly-parallel Data-Logging System Real-Time Conference, 12 Jun / 13
33 Backup T. Colombo, W. Vandelli (Pavia U., CERN) ATLAS Highly-parallel Data-Logging System Real-Time Conference, 12 Jun / 13
34 Other design constraints Operations are driven by the received event data The Data Logger can only rely on the information it gathers by examining the received events No assumptions about the data flow Cannot assume that the rate of received events is somehow balanced across the spectrum of the possible tags The flow of events with one tag can vary during a run and even stop completely T. Colombo, W. Vandelli (Pavia U., CERN) ATLAS Highly-parallel Data-Logging System Real-Time Conference, 12 Jun / 13
35 Other design constraints Operations are driven by the received event data The Data Logger can only rely on the information it gathers by examining the received events No assumptions about the data flow Cannot assume that the rate of received events is somehow balanced across the spectrum of the possible tags The flow of events with one tag can vary during a run and even stop completely T. Colombo, W. Vandelli (Pavia U., CERN) ATLAS Highly-parallel Data-Logging System Real-Time Conference, 12 Jun / 13
36 for ask access 1 Stream 1 Lumiblock Event Queue 1 Stream 2 Lumiblock save event Other possible designs: thread pool with locking get event Processing Thread Raw File Manager Stream 2 Stream 3 T. Colombo, W. Vandelli (Pavia U., CERN) ATLAS Highly-parallel Data-Logging System Real-Time Conference, 12 Jun / 13
37 Event Queue 1 Stream 2 Lumiblock 1 Stream 1 Lumiblock Other possible designs: chain of responsibility Processing Thread Processing Thread Processing Thread Stream 2 Risk of starvation! T. Colombo, W. Vandelli (Pavia U., CERN) ATLAS Highly-parallel Data-Logging System Real-Time Conference, 12 Jun / 13
38 1 Stream 2 Lumiblock Event Queue 1 Stream 2 Lumiblock Other possible designs: one thread pool per file Processing Thread Processing Thread Processing Thread Stream 2 Too many threads! T. Colombo, W. Vandelli (Pavia U., CERN) ATLAS Highly-parallel Data-Logging System Real-Time Conference, 12 Jun / 13
39 zlib performance evaluation T. Colombo, W. Vandelli (Pavia U., CERN) ATLAS Highly-parallel Data-Logging System Real-Time Conference, 12 Jun / 13
How To Test For Performance And Scalability On A Server With A Multi-Core Computer (For A Large Server)
Scalability Results Select the right hardware configuration for your organization to optimize performance Table of Contents Introduction... 1 Scalability... 2 Definition... 2 CPU and Memory Usage... 2
More informationNext Generation Operating Systems
Next Generation Operating Systems Zeljko Susnjar, Cisco CTG June 2015 The end of CPU scaling Future computing challenges Power efficiency Performance == parallelism Cisco Confidential 2 Paradox of the
More informationNetwork Performance Optimisation and Load Balancing. Wulf Thannhaeuser
Network Performance Optimisation and Load Balancing Wulf Thannhaeuser 1 Network Performance Optimisation 2 Network Optimisation: Where? Fixed latency 4.0 µs Variable latency
More informationSoftware design ideas for SoLID
Software design ideas for SoLID Ole Hansen Jefferson Lab EIC Software Meeting Jefferson Lab September 25, 2015 Ole Hansen (Jefferson Lab) Software design ideas for SoLID Sept 25, 2015 1 / 10 The SoLID
More informationDSS. High performance storage pools for LHC. Data & Storage Services. Łukasz Janyst. on behalf of the CERN IT-DSS group
DSS High performance storage pools for LHC Łukasz Janyst on behalf of the CERN IT-DSS group CERN IT Department CH-1211 Genève 23 Switzerland www.cern.ch/it Introduction The goal of EOS is to provide a
More informationIT of SPIM Data Storage and Compression. EMBO Course - August 27th! Jeff Oegema, Peter Steinbach, Oscar Gonzalez
IT of SPIM Data Storage and Compression EMBO Course - August 27th Jeff Oegema, Peter Steinbach, Oscar Gonzalez 1 Talk Outline Introduction and the IT Team SPIM Data Flow Capture, Compression, and the Data
More informationPerformance Counters. Microsoft SQL. Technical Data Sheet. Overview:
Performance Counters Technical Data Sheet Microsoft SQL Overview: Key Features and Benefits: Key Definitions: Performance counters are used by the Operations Management Architecture (OMA) to collect data
More informationComputing at the HL-LHC
Computing at the HL-LHC Predrag Buncic on behalf of the Trigger/DAQ/Offline/Computing Preparatory Group ALICE: Pierre Vande Vyvre, Thorsten Kollegger, Predrag Buncic; ATLAS: David Rousseau, Benedetto Gorini,
More informationFPGA-based Multithreading for In-Memory Hash Joins
FPGA-based Multithreading for In-Memory Hash Joins Robert J. Halstead, Ildar Absalyamov, Walid A. Najjar, Vassilis J. Tsotras University of California, Riverside Outline Background What are FPGAs Multithreaded
More informationDELL TM PowerEdge TM T610 500 Mailbox Resiliency Exchange 2010 Storage Solution
DELL TM PowerEdge TM T610 500 Mailbox Resiliency Exchange 2010 Storage Solution Tested with: ESRP Storage Version 3.0 Tested Date: Content DELL TM PowerEdge TM T610... 1 500 Mailbox Resiliency
More informationThe new frontier of the DATA acquisition using 1 and 10 Gb/s Ethernet links. Filippo Costa on behalf of the ALICE DAQ group
The new frontier of the DATA acquisition using 1 and 10 Gb/s Ethernet links Filippo Costa on behalf of the ALICE DAQ group DATE software 2 DATE (ALICE Data Acquisition and Test Environment) ALICE is a
More informationWindows Server Performance Monitoring
Spot server problems before they are noticed The system s really slow today! How often have you heard that? Finding the solution isn t so easy. The obvious questions to ask are why is it running slowly
More informationBenchmarking Cassandra on Violin
Technical White Paper Report Technical Report Benchmarking Cassandra on Violin Accelerating Cassandra Performance and Reducing Read Latency With Violin Memory Flash-based Storage Arrays Version 1.0 Abstract
More informationReadout Unit using Network Processors
Readout Unit using Network Processors FEM FEM FEM FEM DAQ RU GbE GbE GbE GbE IBM NP4GS3 Phy GMII Phy GMII Phy GMII Phy GMII 4 x 1Gb full duplex Ethernet MACs ECS Ethernet CC-PC PCI Switch Bus IBM NP4GS3
More informationPerformance Characteristics of VMFS and RDM VMware ESX Server 3.0.1
Performance Study Performance Characteristics of and RDM VMware ESX Server 3.0.1 VMware ESX Server offers three choices for managing disk access in a virtual machine VMware Virtual Machine File System
More informationIPRO ecapture Performance Report using BlueArc Titan Network Storage System
IPRO ecapture Performance Report using BlueArc Titan Network Storage System Allen Yen, BlueArc Corp Jesse Abrahams, IPRO Tech, Inc Introduction IPRO ecapture is an e-discovery application designed to handle
More informationPerformance monitoring at CERN openlab. July 20 th 2012 Andrzej Nowak, CERN openlab
Performance monitoring at CERN openlab July 20 th 2012 Andrzej Nowak, CERN openlab Data flow Reconstruction Selection and reconstruction Online triggering and filtering in detectors Raw Data (100%) Event
More informationAccelerating Server Storage Performance on Lenovo ThinkServer
Accelerating Server Storage Performance on Lenovo ThinkServer Lenovo Enterprise Product Group April 214 Copyright Lenovo 214 LENOVO PROVIDES THIS PUBLICATION AS IS WITHOUT WARRANTY OF ANY KIND, EITHER
More informationCSE-E5430 Scalable Cloud Computing Lecture 2
CSE-E5430 Scalable Cloud Computing Lecture 2 Keijo Heljanko Department of Computer Science School of Science Aalto University keijo.heljanko@aalto.fi 14.9-2015 1/36 Google MapReduce A scalable batch processing
More informationOnline Performance Monitoring of the Third ALICE Data Challenge (ADC III)
EUROPEAN ORGANIZATION FOR NUCLEAR RESEARCH European Laboratory for Particle Physics Publication ALICE reference number ALICE-PUB-1- version 1. Institute reference number Date of last change 1-1-17 Online
More informationTier0 plans and security and backup policy proposals
Tier0 plans and security and backup policy proposals, CERN IT-PSS CERN - IT Outline Service operational aspects Hardware set-up in 2007 Replication set-up Test plan Backup and security policies CERN Oracle
More information15-418 Final Project Report. Trading Platform Server
15-418 Final Project Report Yinghao Wang yinghaow@andrew.cmu.edu May 8, 214 Trading Platform Server Executive Summary The final project will implement a trading platform server that provides back-end support
More informationPerformance and scalability of a large OLTP workload
Performance and scalability of a large OLTP workload ii Performance and scalability of a large OLTP workload Contents Performance and scalability of a large OLTP workload with DB2 9 for System z on Linux..............
More informationVirtuoso and Database Scalability
Virtuoso and Database Scalability By Orri Erling Table of Contents Abstract Metrics Results Transaction Throughput Initializing 40 warehouses Serial Read Test Conditions Analysis Working Set Effect of
More informationIntroduction 1 Performance on Hosted Server 1. Benchmarks 2. System Requirements 7 Load Balancing 7
Introduction 1 Performance on Hosted Server 1 Figure 1: Real World Performance 1 Benchmarks 2 System configuration used for benchmarks 2 Figure 2a: New tickets per minute on E5440 processors 3 Figure 2b:
More informationDelivering Quality in Software Performance and Scalability Testing
Delivering Quality in Software Performance and Scalability Testing Abstract Khun Ban, Robert Scott, Kingsum Chow, and Huijun Yan Software and Services Group, Intel Corporation {khun.ban, robert.l.scott,
More informationPerformance Guideline for syslog-ng Premium Edition 5 LTS
Performance Guideline for syslog-ng Premium Edition 5 LTS May 08, 2015 Abstract Performance analysis of syslog-ng Premium Edition Copyright 1996-2015 BalaBit S.a.r.l. Table of Contents 1. Preface... 3
More informationChronicle: Capture and Analysis of NFS Workloads at Line Rate
Chronicle: Capture and Analysis of NFS Workloads at Line Rate Ardalan Kangarlou, Sandip Shete, and John Strunk Advanced Technology Group 1 Motivation Goal: To gather insights from customer workloads via
More informationDSS. Diskpool and cloud storage benchmarks used in IT-DSS. Data & Storage Services. Geoffray ADDE
DSS Data & Diskpool and cloud storage benchmarks used in IT-DSS CERN IT Department CH-1211 Geneva 23 Switzerland www.cern.ch/it Geoffray ADDE DSS Outline I- A rational approach to storage systems evaluation
More informationObject Database Scalability for Scientific Workloads
Object Database Scalability for Scientific Workloads Technical Report Julian J. Bunn Koen Holtman, Harvey B. Newman 256-48 HEP, Caltech, 1200 E. California Blvd., Pasadena, CA 91125, USA CERN EP-Division,
More informationDirect NFS - Design considerations for next-gen NAS appliances optimized for database workloads Akshay Shah Gurmeet Goindi Oracle
Direct NFS - Design considerations for next-gen NAS appliances optimized for database workloads Akshay Shah Gurmeet Goindi Oracle Agenda Introduction Database Architecture Direct NFS Client NFS Server
More informationSeptember 25, 2007. Maya Gokhale Georgia Institute of Technology
NAND Flash Storage for High Performance Computing Craig Ulmer cdulmer@sandia.gov September 25, 2007 Craig Ulmer Maya Gokhale Greg Diamos Michael Rewak SNL/CA, LLNL Georgia Institute of Technology University
More informationVery Large Enterprise Network Deployment, 25,000+ Users
Very Large Enterprise Network Deployment, 25,000+ Users Websense software can be deployed in different configurations, depending on the size and characteristics of the network, and the organization s filtering
More informationXenDesktop 7 Database Sizing
XenDesktop 7 Database Sizing Contents Disclaimer... 3 Overview... 3 High Level Considerations... 3 Site Database... 3 Impact of failure... 4 Monitoring Database... 4 Impact of failure... 4 Configuration
More information11.1 inspectit. 11.1. inspectit
11.1. inspectit Figure 11.1. Overview on the inspectit components [Siegl and Bouillet 2011] 11.1 inspectit The inspectit monitoring tool (website: http://www.inspectit.eu/) has been developed by NovaTec.
More informationInitial Hardware Estimation Guidelines. AgilePoint BPMS v5.0 SP1
Initial Hardware Estimation Guidelines Document Revision r5.2.3 November 2011 Contents 2 Contents Preface...3 Disclaimer of Warranty...3 Copyright...3 Trademarks...3 Government Rights Legend...3 Virus-free
More informationMuse Server Sizing. 18 June 2012. Document Version 0.0.1.9 Muse 2.7.0.0
Muse Server Sizing 18 June 2012 Document Version 0.0.1.9 Muse 2.7.0.0 Notice No part of this publication may be reproduced stored in a retrieval system, or transmitted, in any form or by any means, without
More informationInterScan Web Security Virtual Appliance
InterScan Web Security Virtual Appliance Sizing Guide for version 6.0 July 2013 TREND MICRO INC. 10101 N. De Anza Blvd. Cupertino, CA 95014 www.trendmicro.com Toll free: +1 800.228.5651 Fax: +1 408.257.2003
More informationRAID. RAID 0 No redundancy ( AID?) Just stripe data over multiple disks But it does improve performance. Chapter 6 Storage and Other I/O Topics 29
RAID Redundant Array of Inexpensive (Independent) Disks Use multiple smaller disks (c.f. one large disk) Parallelism improves performance Plus extra disk(s) for redundant data storage Provides fault tolerant
More informationAgenda. Enterprise Application Performance Factors. Current form of Enterprise Applications. Factors to Application Performance.
Agenda Enterprise Performance Factors Overall Enterprise Performance Factors Best Practice for generic Enterprise Best Practice for 3-tiers Enterprise Hardware Load Balancer Basic Unix Tuning Performance
More informationVirtualCenter Database Performance for Microsoft SQL Server 2005 VirtualCenter 2.5
Performance Study VirtualCenter Database Performance for Microsoft SQL Server 2005 VirtualCenter 2.5 VMware VirtualCenter uses a database to store metadata on the state of a VMware Infrastructure environment.
More informationDatacenter Operating Systems
Datacenter Operating Systems CSE451 Simon Peter With thanks to Timothy Roscoe (ETH Zurich) Autumn 2015 This Lecture What s a datacenter Why datacenters Types of datacenters Hyperscale datacenters Major
More informationImprove Business Productivity and User Experience with a SanDisk Powered SQL Server 2014 In-Memory OLTP Database
WHITE PAPER Improve Business Productivity and User Experience with a SanDisk Powered SQL Server 2014 In-Memory OLTP Database 951 SanDisk Drive, Milpitas, CA 95035 www.sandisk.com Table of Contents Executive
More informationTHE ATLAS DISTRIBUTED DATA MANAGEMENT SYSTEM & DATABASES
THE ATLAS DISTRIBUTED DATA MANAGEMENT SYSTEM & DATABASES Vincent Garonne, Mario Lassnig, Martin Barisits, Thomas Beermann, Ralph Vigne, Cedric Serfon Vincent.Garonne@cern.ch ph-adp-ddm-lab@cern.ch XLDB
More informationAmadeus SAS Specialists Prove Fusion iomemory a Superior Analysis Accelerator
WHITE PAPER Amadeus SAS Specialists Prove Fusion iomemory a Superior Analysis Accelerator 951 SanDisk Drive, Milpitas, CA 95035 www.sandisk.com SAS 9 Preferred Implementation Partner tests a single Fusion
More informationDELL s Oracle Database Advisor
DELL s Oracle Database Advisor Underlying Methodology A Dell Technical White Paper Database Solutions Engineering By Roger Lopez Phani MV Dell Product Group January 2010 THIS WHITE PAPER IS FOR INFORMATIONAL
More informationDistributed applications monitoring at system and network level
Distributed applications monitoring at system and network level Monarc Collaboration 1 Abstract Most of the distributed applications are presently based on architectural models that don t involve real-time
More informationWITH A FUSION POWERED SQL SERVER 2014 IN-MEMORY OLTP DATABASE
WITH A FUSION POWERED SQL SERVER 2014 IN-MEMORY OLTP DATABASE 1 W W W. F U S I ON I O.COM Table of Contents Table of Contents... 2 Executive Summary... 3 Introduction: In-Memory Meets iomemory... 4 What
More informationAdonis Technical Requirements
Information Sheet Adonis Technical Requirements CONTENTS Contents... 1 Adonis Project Implementation... 1 Host Installation / Onboard Installation Full replication (LARGER Vessels):... 1 Onboard installation
More informationVery Large Enterprise Network, Deployment, 25000+ Users
Very Large Enterprise Network, Deployment, 25000+ Users Websense software can be deployed in different configurations, depending on the size and characteristics of the network, and the organization s filtering
More informationData Deduplication HTBackup
Data Deduplication HTBackup HTBackup and it s Deduplication technology is touted as one of the best ways to manage today's explosive data growth. If you're new to the technology, these key facts will help
More informationBenchmark Study on Distributed XML Filtering Using Hadoop Distribution Environment. Sanjay Kulhari, Jian Wen UC Riverside
Benchmark Study on Distributed XML Filtering Using Hadoop Distribution Environment Sanjay Kulhari, Jian Wen UC Riverside Team Sanjay Kulhari M.S. student, CS U C Riverside Jian Wen Ph.D. student, CS U
More informationRemoving Performance Bottlenecks in Databases with Red Hat Enterprise Linux and Violin Memory Flash Storage Arrays. Red Hat Performance Engineering
Removing Performance Bottlenecks in Databases with Red Hat Enterprise Linux and Violin Memory Flash Storage Arrays Red Hat Performance Engineering Version 1.0 August 2013 1801 Varsity Drive Raleigh NC
More informationSun 8Gb/s Fibre Channel HBA Performance Advantages for Oracle Database
Performance Advantages for Oracle Database At a Glance This Technical Brief illustrates that even for smaller online transaction processing (OLTP) databases, the Sun 8Gb/s Fibre Channel Host Bus Adapter
More informationOBJECTIVE ANALYSIS WHITE PAPER MATCH FLASH. TO THE PROCESSOR Why Multithreading Requires Parallelized Flash ATCHING
OBJECTIVE ANALYSIS WHITE PAPER MATCH ATCHING FLASH TO THE PROCESSOR Why Multithreading Requires Parallelized Flash T he computing community is at an important juncture: flash memory is now generally accepted
More informationBenchmarking Hadoop & HBase on Violin
Technical White Paper Report Technical Report Benchmarking Hadoop & HBase on Violin Harnessing Big Data Analytics at the Speed of Memory Version 1.0 Abstract The purpose of benchmarking is to show advantages
More informationOptimization of Cluster Web Server Scheduling from Site Access Statistics
Optimization of Cluster Web Server Scheduling from Site Access Statistics Nartpong Ampornaramveth, Surasak Sanguanpong Faculty of Computer Engineering, Kasetsart University, Bangkhen Bangkok, Thailand
More informationWHITE PAPER BRENT WELCH NOVEMBER
BACKUP WHITE PAPER BRENT WELCH NOVEMBER 2006 WHITE PAPER: BACKUP TABLE OF CONTENTS Backup Overview 3 Background on Backup Applications 3 Backup Illustration 4 Media Agents & Keeping Tape Drives Busy 5
More informationIntroduction to the NI Real-Time Hypervisor
Introduction to the NI Real-Time Hypervisor 1 Agenda 1) NI Real-Time Hypervisor overview 2) Basics of virtualization technology 3) Configuring and using Real-Time Hypervisor systems 4) Performance and
More informationA Deduplication File System & Course Review
A Deduplication File System & Course Review Kai Li 12/13/12 Topics A Deduplication File System Review 12/13/12 2 Traditional Data Center Storage Hierarchy Clients Network Server SAN Storage Remote mirror
More informationPerformance measurements of syslog-ng Premium Edition 4 F1
Performance measurements of syslog-ng Premium Edition 4 F1 October 13, 2011 Abstract Performance analysis of syslog-ng Premium Edition Copyright 1996-2011 BalaBit IT Security Ltd. Table of Contents 1.
More informationDevelopment of nosql data storage for the ATLAS PanDA Monitoring System
Development of nosql data storage for the ATLAS PanDA Monitoring System M.Potekhin Brookhaven National Laboratory, Upton, NY11973, USA E-mail: potekhin@bnl.gov Abstract. For several years the PanDA Workload
More informationStreaming and Virtual Hosted Desktop Study: Phase 2
IT@Intel White Paper Intel Information Technology Computing Models April 1 Streaming and Virtual Hosted Desktop Study: Phase 2 Our current findings indicate that streaming provides better server loading
More informationHP ProLiant Gen8 vs Gen9 Server Blades on Data Warehouse Workloads
HP ProLiant Gen8 vs Gen9 Server Blades on Data Warehouse Workloads Gen9 Servers give more performance per dollar for your investment. Executive Summary Information Technology (IT) organizations face increasing
More informationDeliverable 2.1.4. 150 Billion Triple dataset hosted on the LOD2 Knowledge Store Cluster. LOD2 Creating Knowledge out of Interlinked Data
Collaborative Project LOD2 Creating Knowledge out of Interlinked Data Project Number: 257943 Start Date of Project: 01/09/2010 Duration: 48 months Deliverable 2.1.4 150 Billion Triple dataset hosted on
More informationHadoop Architecture. Part 1
Hadoop Architecture Part 1 Node, Rack and Cluster: A node is simply a computer, typically non-enterprise, commodity hardware for nodes that contain data. Consider we have Node 1.Then we can add more nodes,
More informationDeep Dive: Maximizing EC2 & EBS Performance
Deep Dive: Maximizing EC2 & EBS Performance Tom Maddox, Solutions Architect 2015, Amazon Web Services, Inc. or its affiliates. All rights reserved What we ll cover Amazon EBS overview Volumes Snapshots
More informationTechnical White Paper. Symantec Backup Exec 10d System Sizing. Best Practices For Optimizing Performance of the Continuous Protection Server
Symantec Backup Exec 10d System Sizing Best Practices For Optimizing Performance of the Continuous Protection Server Table of Contents Table of Contents...2 Executive Summary...3 System Sizing and Performance
More informationShoal: IaaS Cloud Cache Publisher
University of Victoria Faculty of Engineering Winter 2013 Work Term Report Shoal: IaaS Cloud Cache Publisher Department of Physics University of Victoria Victoria, BC Mike Chester V00711672 Work Term 3
More informationVI Performance Monitoring
VI Performance Monitoring Preetham Gopalaswamy Group Product Manager Ravi Soundararajan Staff Engineer September 15, 2008 Agenda Introduction to performance monitoring in VI Common customer/partner questions
More informationOnline data handling with Lustre at the CMS experiment
Online data handling with Lustre at the CMS experiment Lavinia Darlea, on behalf of CMS DAQ Group MIT/DAQ CMS September 17, 2015 1 / 29 CERN 2 / 29 CERN CERN was founded 1954: 12 European States Science
More informationWHITE PAPER FUJITSU PRIMERGY SERVER BASICS OF DISK I/O PERFORMANCE
WHITE PAPER BASICS OF DISK I/O PERFORMANCE WHITE PAPER FUJITSU PRIMERGY SERVER BASICS OF DISK I/O PERFORMANCE This technical documentation is aimed at the persons responsible for the disk I/O performance
More informationSTI Hardware Specifications for PCs
Local School Fileserver with STIOffice & STIClassroom WIN 1GHz or Higher 512MB 1GB Free Space Windows 2000/2003 Server, Novell Netware v4.11 or greater. Local School Fileserver with STIOffice, STIClassroom
More informationIntel Data Direct I/O Technology (Intel DDIO): A Primer >
Intel Data Direct I/O Technology (Intel DDIO): A Primer > Technical Brief February 2012 Revision 1.0 Legal Statements INFORMATION IN THIS DOCUMENT IS PROVIDED IN CONNECTION WITH INTEL PRODUCTS. NO LICENSE,
More informationFTK the online Fast Tracker for the ATLAS upgrade
FTK the online Fast Tracker for the ATLAS upgrade Kostas Kordas Aristotle University of Thessaloniki Annual EESFYE mtg, University of the Aegean, Chios, 25-29 April 2013 Overview ATLAS Trigger and DAQ
More informationImproving Microsoft Exchange Performance Using SanDisk Solid State Drives (SSDs)
WHITE PAPER Improving Microsoft Exchange Performance Using SanDisk Solid State Drives (s) Hemant Gaidhani, SanDisk Enterprise Storage Solutions Hemant.Gaidhani@SanDisk.com 951 SanDisk Drive, Milpitas,
More informationEvaluation Report: Supporting Microsoft Exchange on the Lenovo S3200 Hybrid Array
Evaluation Report: Supporting Microsoft Exchange on the Lenovo S3200 Hybrid Array Evaluation report prepared under contract with Lenovo Executive Summary Love it or hate it, businesses rely on email. It
More informationHigh Performance Tier Implementation Guideline
High Performance Tier Implementation Guideline A Dell Technical White Paper PowerVault MD32 and MD32i Storage Arrays THIS WHITE PAPER IS FOR INFORMATIONAL PURPOSES ONLY, AND MAY CONTAIN TYPOGRAPHICAL ERRORS
More informationEstimate Performance and Capacity Requirements for Workflow in SharePoint Server 2010
Estimate Performance and Capacity Requirements for Workflow in SharePoint Server 2010 This document is provided as-is. Information and views expressed in this document, including URL and other Internet
More informationLSI MegaRAID CacheCade Performance Evaluation in a Web Server Environment
LSI MegaRAID CacheCade Performance Evaluation in a Web Server Environment Evaluation report prepared under contract with LSI Corporation Introduction Interest in solid-state storage (SSS) is high, and
More informationAn Oracle White Paper December 2011. Oracle Virtual Desktop Infrastructure: A Design Proposal for Hosted Virtual Desktops
An Oracle White Paper December 2011 Oracle Virtual Desktop Infrastructure: A Design Proposal for Hosted Virtual Desktops Introduction... 2! Design Goals... 3! Architectural Overview... 5! Logical Architecture...
More informationEvent Logging and Distribution for the BaBar Online System
LAC-PUB-8744 August 2002 Event Logging and Distribution for the BaBar Online ystem. Dasu, T. Glanzman, T. J. Pavel For the BaBar Prompt Reconstruction and Computing Groups Department of Physics, University
More informationEnhance Service Delivery and Accelerate Financial Applications with Consolidated Market Data
White Paper Enhance Service Delivery and Accelerate Financial Applications with Consolidated Market Data What You Will Learn Financial market technology is advancing at a rapid pace. The integration of
More informationIntroduction. Application Performance in the QLinux Multimedia Operating System. Solution: QLinux. Introduction. Outline. QLinux Design Principles
Application Performance in the QLinux Multimedia Operating System Sundaram, A. Chandra, P. Goyal, P. Shenoy, J. Sahni and H. Vin Umass Amherst, U of Texas Austin ACM Multimedia, 2000 Introduction General
More informationMicrosoft Exchange Solutions on VMware
Design and Sizing Examples: Microsoft Exchange Solutions on VMware Page 1 of 19 Contents 1. Introduction... 3 1.1. Overview... 3 1.2. Benefits of Running Exchange Server 2007 on VMware Infrastructure 3...
More informationIntel Cloud Builders Guide to Cloud Design and Deployment on Intel Platforms
Intel Cloud Builders Guide Intel Xeon Processor-based Servers RES Virtual Desktop Extender Intel Cloud Builders Guide to Cloud Design and Deployment on Intel Platforms Client Aware Cloud with RES Virtual
More informationSQL Server Consolidation Using Cisco Unified Computing System and Microsoft Hyper-V
SQL Server Consolidation Using Cisco Unified Computing System and Microsoft Hyper-V White Paper July 2011 Contents Executive Summary... 3 Introduction... 3 Audience and Scope... 4 Today s Challenges...
More informationEMC Business Continuity for Microsoft SQL Server 2008
EMC Business Continuity for Microsoft SQL Server 2008 Enabled by EMC Celerra Fibre Channel, EMC MirrorView, VMware Site Recovery Manager, and VMware vsphere 4 Reference Architecture Copyright 2009, 2010
More informationOracle Database Reliability, Performance and scalability on Intel Xeon platforms Mitch Shults, Intel Corporation October 2011
Oracle Database Reliability, Performance and scalability on Intel platforms Mitch Shults, Intel Corporation October 2011 1 Intel Processor E7-8800/4800/2800 Product Families Up to 10 s and 20 Threads 30MB
More informationReference Design: Scalable Object Storage with Seagate Kinetic, Supermicro, and SwiftStack
Reference Design: Scalable Object Storage with Seagate Kinetic, Supermicro, and SwiftStack May 2015 Copyright 2015 SwiftStack, Inc. swiftstack.com Page 1 of 19 Table of Contents INTRODUCTION... 3 OpenStack
More informationParallel Replication for MySQL in 5 Minutes or Less
Parallel Replication for MySQL in 5 Minutes or Less Featuring Tungsten Replicator Robert Hodges, CEO, Continuent About Continuent / Continuent is the leading provider of data replication and clustering
More informationMicrosoft SharePoint Server 2010
Microsoft SharePoint Server 2010 Small Farm Performance Study Dell SharePoint Solutions Ravikanth Chaganti and Quocdat Nguyen November 2010 THIS WHITE PAPER IS FOR INFORMATIONAL PURPOSES ONLY, AND MAY
More informationAnalyzing the Virtualization Deployment Advantages of Two- and Four-Socket Server Platforms
IT@Intel White Paper Intel IT IT Best Practices: Data Center Solutions Server Virtualization August 2010 Analyzing the Virtualization Deployment Advantages of Two- and Four-Socket Server Platforms Executive
More informationSoftware-defined Storage Architecture for Analytics Computing
Software-defined Storage Architecture for Analytics Computing Arati Joshi Performance Engineering Colin Eldridge File System Engineering Carlos Carrero Product Management June 2015 Reference Architecture
More informationSawmill Log Analyzer Best Practices!! Page 1 of 6. Sawmill Log Analyzer Best Practices
Sawmill Log Analyzer Best Practices!! Page 1 of 6 Sawmill Log Analyzer Best Practices! Sawmill Log Analyzer Best Practices!! Page 2 of 6 This document describes best practices for the Sawmill universal
More informationDistribution One Server Requirements
Distribution One Server Requirements Introduction Welcome to the Hardware Configuration Guide. The goal of this guide is to provide a practical approach to sizing your Distribution One application and
More informationDell Microsoft Business Intelligence and Data Warehousing Reference Configuration Performance Results Phase III
White Paper Dell Microsoft Business Intelligence and Data Warehousing Reference Configuration Performance Results Phase III Performance of Microsoft SQL Server 2008 BI and D/W Solutions on Dell PowerEdge
More informationPerformance Analysis and Capacity Planning Whitepaper
Performance Analysis and Capacity Planning Whitepaper Contents P E R F O R M A N C E A N A L Y S I S & Executive Summary... 3 Overview... 3 Product Architecture... 4 Test Environment... 6 Performance Test
More informationIOmark- VDI. HP HP ConvergedSystem 242- HC StoreVirtual Test Report: VDI- HC- 150427- b Test Report Date: 27, April 2015. www.iomark.
IOmark- VDI HP HP ConvergedSystem 242- HC StoreVirtual Test Report: VDI- HC- 150427- b Test Copyright 2010-2014 Evaluator Group, Inc. All rights reserved. IOmark- VDI, IOmark- VM, VDI- IOmark, and IOmark
More informationMain Memory Data Warehouses
Main Memory Data Warehouses Robert Wrembel Poznan University of Technology Institute of Computing Science Robert.Wrembel@cs.put.poznan.pl www.cs.put.poznan.pl/rwrembel Lecture outline Teradata Data Warehouse
More information