CERNBox + EOS: Cloud Storage for Science

Size: px
Start display at page:

Download "CERNBox + EOS: Cloud Storage for Science"

Transcription

1 Data & Storage Services CERNBox + EOS: Cloud Storage for Science CERN IT Department CH-1211 Geneva 23 Switzerland Presenter: Luca Masce. Thanks to: Jakub T. Mościcki, Andreas J. Peters, Hugo G. Labrador, Massimo Lamanna CERN/IT- DSS

2 Content What we have done What we do What we will do CERNBox 2

3 The origins of the CERNBox project Missing link? CERNBox 4500 dissnct IPs in DNS from cern.ch to *.dropbox.com (daily...) What we are missing easy access cloud storage for end users files go automascally to the cloud and are available always everywhere broken laptop data lost offline access to data work on the plane and rsync when back online keep files in sync across devices access on mobile clients (easy) sharing of files with colleagues ssll surprisingly difficult Can we have this? for documents (small files, oben ppts, text, ) for science data (integrated into data processing workflows and exissng infrastructure) 3

4 Original architecture (CERNBox beta service) USER Sync client (webdav) Web access (hgps) HTTPS LB Data flow Metadata flow Apache, PHP 5.4 (SCL1.0) mod_proxy_balancer 64 core, 64GB RAM AS OC AS OC AS OC Setup 100% RH6 on standard hardware Based on owncloud Guaranteed failover (redundant nodes) Image courtesy of Keeps track of sync state for every file in the system SQL overheads (Hz metadata ops) DB MySQL server 48GB RAM filesystem (POSIX) Files not exposed directly to the user STORAGE NFS servers, async, SW RAID 1 IniSal space: 20 TB

5 Usage of the beta service CERNBox Beta 2014 March April May June October users 190 (*) files 191K 907K 1.6M 2.7M 6.4M size 480GB 1TB 1.5TB 1.9TB 3.4TB (*) users inherited from the inisal prototype deployment 15% 1% Size per user 84% Avg ~5GB <10GB >10GB up to 100GB Files per user 1% 5% 94% Avg ~10K files < 5K 5K- 20K up to 100K 5

6 File access patterns GET/PUT raso: 2/1 File type distribuson: 1200 different file extensions! 30%.c.h.C 30%.jpg.png 15% no extension (UNIX world!) 25% other:.pdf,.txt,.ppt,.docx,.root,.py,.eps,.tex ~100 URL shares, ~40 synced shares UNICODE filenames: greek, russian, thai(?) 6

7 Pilot limitations Move On the origin client move is propagated to the server On the other clients it is propagated as COPY/DELETE (subopsmal) Symlinks are not supported Ignored files:, :? * < > We currently recommend one sync folder setup: ~/cernbox High per- file overhead Expect 2-5Hz PUT Expect ~10Hz GET Transfer rates Expect 10-30MB download Expect 5-10MB upload Larger files: 400MB file on standard desktop hgps/upload: ~25MB/s, hgps/download: ~60MB/s For wireless devices, laptops, phones do we care about transfer rates? 7

8 Towards large-scale data sync and share Currently deployed CERNBox beta works OK so far for the classical Dropbox use- case low- frequency document sync and share But can we bring this system to the next level? Our core- business and large- scale workloads expose PBs of exis%ng data from day 1 integrason into physics data processing eco- system central services: batch, interacsve data analysis applicasons sync higher data volumes at higher rates Can we ssll keep the simplicity of cloud storage access? 8

9 Massive scaling at reduced cost? No need to keep track of all files and directories in the database avoids explosive growth of your DB infrastructure Our file number essmate? With 10K users we have 2.5 billion files in AFS already! What is your number for 100K users? Before we start throwing hardware at the problem consider the cost of running the service Fixed: hardware purchase, service deployment, infrastructure Scaling: hardware incidents, user support; backup; integrity checks; upgrades Infrastructure: space, electricity and cooling in the data center For massive scaling we need to keep TCO under control profit from exissng large- scale operasons and support of our storage services exploit economies of scale 9

10 Integration Started in May 2014 FuncSonality Enable sync and share for exissng data in EOS Without exporsng data to another storage Direct access to data with efficient sync behind OperaSons NFS/async backend server is a temporary soluson EOS offer virtually unlimited cloud storage for end- users Fold- in the operason cost into EOS But: Integrate as transparently as possible most users don t care about storage backend Fully working soluson compasble with owncloud clients we don t want to end up with half- working CERN- specific soluson 10

11 EOS Integration Details Understanding sync protocol and underlying semanscs. Add a few consistency features to EOS (e.g. atomic upload) Adding few new features to EOS or libing restricsons (e.g. UTF8 support) Beef- up the webdav endpoint to allow owncloud clients to talk directly to it Integrate web- access and sharing funcsonality Web fronted: develop new plugins Nice integrason of trashbin, versions and sharing: Fusion between owncloud model and EOS model (Hugo G. Labrador) Making more robust less stressed parts in EOS (hgp/webdav) Lots, lots of tessng. 11

12 CERNBox 2.0 Architecture Sync client (webdav) Web access (hgps) HTTPS HTTPS LB HTTPS LB HTTPS LB LB Data flow Metadata flow Data directly accessible by the user USER hgp (public data) hgps (private data) hgp (internal) KHz metadata ops OC fuse All sync state as metadata in the storage STORAGE (EOS) Files wriyen with USER credenzals disk servers (1000s) IO redirect namespace

13 Prototype deployment on EOSPPS /eos/user/<u>/<username> this is the default sync and web- enabled folder as an advanced user you may add arbitrary folder from EOS very easy to implement a folder shared by an e- group We can also allow transparent access to different instances 13

14 First performance numbers User- perceived performance (client) Metadata operason (pycurl with SSL sessions) PROPFIND with 1 entry: 90 Hz PROPFIND with 1K entries: 8.5 KHz PROPFIND with 10K entries: 10KHz nice speed e.g. kernel src tree upload (50K files, 500MB) ~ 1h from laptop/wifi at home, download ~20 min ops/s Small files (10KB) pycurl seq Download Upload Delete pycurl P=10 pycurl P=50 owncloud sync client 14

15 Summary Working and usable beta service Useful for ge.ng experience, user feedback and understanding what we want / don t want in the final producson system based on EOS CERNBox Advanced integrason into EOS will open up new possibilises but there is no free lunch: we will have to adapt to evolving owncloud clients, etc. heading towards large sync and share layer for science research all our data exposed from day 1 massive scalability, high performance integrated into exissng workflows - new capabilises! small overhead on top of our exissng operasons and development TCO control and ssll as easy to use as Dropbox.com 15

16 Integrated storage ecosystem for scientific research sync / share / offline access 2.0 USER webdav & hgps:// online file- system access fuse CERNBox Analysis cluster Central Services high- performance applicason access xrootd:// batch access xrdcopy LARGE- SCALE STORAGE

17 agenda full ~35 parscipants Tracks Keynote B.Pierce Technology Users Site reports Vendor talks IBM Powerfolder SeaFile PyDio Owncloud 17

18 CERNBox 2.0 some numbers Advanced prototype stage Adapted exissng webdav interface in EOS to be compasble with owncloud sync clients Test environment (EOSPPS) standard hardware namespace node with Xeon 2.2GHz, 16 cores, 24GB RAM 50 disk servers: cheap JBODs (1000 disks), total 800TB usable space Storage layout: 2 replicas in RAIN mode à every file PUT = 2 copies of the file on two independent storage nodes (with adler32 checksums of content) Event- based hgp(s) load- balancer (nginx) Underlying storage scalability (EOS Prod) Max observed IO: ~40GB/s on a single instance (eosatlas => ) Max observed file stats: 10s KHz Thousands of connected clients Server should never be a bogleneck for CERNBox 18

Prototyping a file sharing and synchronisation platform with owncloud

Prototyping a file sharing and synchronisation platform with owncloud Data & Storage Services Prototyping a file sharing and synchronisation platform with owncloud CERN IT Department CH-1211 Geneva 23 Switzerland www.cern.ch/it Jakub T. Moscicki Massimo Lamanna CERN IT-

More information

DSS. High performance storage pools for LHC. Data & Storage Services. Łukasz Janyst. on behalf of the CERN IT-DSS group

DSS. High performance storage pools for LHC. Data & Storage Services. Łukasz Janyst. on behalf of the CERN IT-DSS group DSS High performance storage pools for LHC Łukasz Janyst on behalf of the CERN IT-DSS group CERN IT Department CH-1211 Genève 23 Switzerland www.cern.ch/it Introduction The goal of EOS is to provide a

More information

CERN Cloud Storage Evaluation Geoffray Adde, Dirk Duellmann, Maitane Zotes CERN IT

CERN Cloud Storage Evaluation Geoffray Adde, Dirk Duellmann, Maitane Zotes CERN IT SS Data & Storage CERN Cloud Storage Evaluation Geoffray Adde, Dirk Duellmann, Maitane Zotes CERN IT HEPiX Fall 2012 Workshop October 15-19, 2012 Institute of High Energy Physics, Beijing, China SS Outline

More information

DSS. The Data Storage Services (DSS) Strategy at CERN. Jakub T. Moscicki. (Input from J. Iven, M. Lamanna A. Pace, A. Peters and A.

DSS. The Data Storage Services (DSS) Strategy at CERN. Jakub T. Moscicki. (Input from J. Iven, M. Lamanna A. Pace, A. Peters and A. The Data Storage Services () Strategy at CERN Jakub T. Moscicki (Input from J. Iven, M. Lamanna A. Pace, A. Peters and A. Wiebalck) HEPiX Spring 2012 Workshop Prague, April 2012 The big picture Situation

More information

Book of Abstracts. 2016 CS3 Workshop

Book of Abstracts. 2016 CS3 Workshop Book of Abstracts 2016 CS3 Workshop Experiences of Cloud Storage Service Monitoring: Performance Assessment and Comparison E. Bocchi (Politecnico Torino) Enrico Bocchi 1,2, Idilio Drago 1, Marco Mellia

More information

owncloud Enterprise Edition on IBM Infrastructure

owncloud Enterprise Edition on IBM Infrastructure owncloud Enterprise Edition on IBM Infrastructure A Performance and Sizing Study for Large User Number Scenarios Dr. Oliver Oberst IBM Frank Karlitschek owncloud Page 1 of 10 Introduction One aspect of

More information

Storage strategy and cloud storage evaluations at CERN Dirk Duellmann, CERN IT

Storage strategy and cloud storage evaluations at CERN Dirk Duellmann, CERN IT SS Data & Storage Storage strategy and cloud storage evaluations at CERN Dirk Duellmann, CERN IT (with slides from Andreas Peters and Jan Iven) 5th International Conference "Distributed Computing and Grid-technologies

More information

Scala Storage Scale-Out Clustered Storage White Paper

Scala Storage Scale-Out Clustered Storage White Paper White Paper Scala Storage Scale-Out Clustered Storage White Paper Chapter 1 Introduction... 3 Capacity - Explosive Growth of Unstructured Data... 3 Performance - Cluster Computing... 3 Chapter 2 Current

More information

Data and Storage Services

Data and Storage Services Data and Storage Services G. Cancio, D. Duellmann, J. Iven, M. Lamanna, A. Pace, A.J. Peters, R.Toebbicke CERN IT Department CH-1211 Genève 23 Switzerland www.cern.ch/it CERN IT Department CH-1211 Genève

More information

DSS. Diskpool and cloud storage benchmarks used in IT-DSS. Data & Storage Services. Geoffray ADDE

DSS. Diskpool and cloud storage benchmarks used in IT-DSS. Data & Storage Services. Geoffray ADDE DSS Data & Diskpool and cloud storage benchmarks used in IT-DSS CERN IT Department CH-1211 Geneva 23 Switzerland www.cern.ch/it Geoffray ADDE DSS Outline I- A rational approach to storage systems evaluation

More information

Implementing Internet Storage Service Using OpenAFS. Sungjin Chun(chunsj@embian.com) Dongguen Choi(eastroot@embian.com) Arum Yoon(toy7777@embian.

Implementing Internet Storage Service Using OpenAFS. Sungjin Chun(chunsj@embian.com) Dongguen Choi(eastroot@embian.com) Arum Yoon(toy7777@embian. Implementing Internet Storage Service Using OpenAFS Sungjin Chun(chunsj@embian.com) Dongguen Choi(eastroot@embian.com) Arum Yoon(toy7777@embian.com) Overview Introduction Implementation Current Status

More information

Storage Virtualization. Andreas Joachim Peters CERN IT-DSS

Storage Virtualization. Andreas Joachim Peters CERN IT-DSS Storage Virtualization Andreas Joachim Peters CERN IT-DSS Outline What is storage virtualization? Commercial and non-commercial tools/solutions Local and global storage virtualization Scope of this presentation

More information

owncloud Architecture Overview

owncloud Architecture Overview owncloud Architecture Overview Time to get control back Employees are using cloud-based services to share sensitive company data with vendors, customers, partners and each other. They are syncing data

More information

Panasas at the RCF. Fall 2005 Robert Petkus RHIC/USATLAS Computing Facility Brookhaven National Laboratory. Robert Petkus Panasas at the RCF

Panasas at the RCF. Fall 2005 Robert Petkus RHIC/USATLAS Computing Facility Brookhaven National Laboratory. Robert Petkus Panasas at the RCF Panasas at the RCF HEPiX at SLAC Fall 2005 Robert Petkus RHIC/USATLAS Computing Facility Brookhaven National Laboratory Centralized File Service Single, facility-wide namespace for files. Uniform, facility-wide

More information

owncloud Architecture Overview

owncloud Architecture Overview owncloud Architecture Overview owncloud, Inc. 57 Bedford Street, Suite 102 Lexington, MA 02420 United States phone: +1 (877) 394-2030 www.owncloud.com/contact owncloud GmbH Schloßäckerstraße 26a 90443

More information

Patrick Fuhrmann. The DESY Storage Cloud

Patrick Fuhrmann. The DESY Storage Cloud The DESY Storage Cloud Patrick Fuhrmann The DESY Storage Cloud Hamburg, 2/3/2015 for the DESY CLOUD TEAM Content > Motivation > Preparation > Collaborations and publications > What do you get right now?

More information

Analisi di un servizio SRM: StoRM

Analisi di un servizio SRM: StoRM 27 November 2007 General Parallel File System (GPFS) The StoRM service Deployment configuration Authorization and ACLs Conclusions. Definition of terms Definition of terms 1/2 Distributed File System The

More information

GeoCloud Project Report USGS/EROS Spatial Data Warehouse Project

GeoCloud Project Report USGS/EROS Spatial Data Warehouse Project GeoCloud Project Report USGS/EROS Spatial Data Warehouse Project Description of Application The Spatial Data Warehouse project at the USGS/EROS distributes services and data in support of The National

More information

EMC SYNCPLICITY FILE SYNC AND SHARE SOLUTION

EMC SYNCPLICITY FILE SYNC AND SHARE SOLUTION EMC SYNCPLICITY FILE SYNC AND SHARE SOLUTION Automated file synchronization Flexible, cloud-based administration Secure, on-premises storage EMC Solutions January 2015 Copyright 2014 EMC Corporation. All

More information

Introduction to Gluster. Versions 3.0.x

Introduction to Gluster. Versions 3.0.x Introduction to Gluster Versions 3.0.x Table of Contents Table of Contents... 2 Overview... 3 Gluster File System... 3 Gluster Storage Platform... 3 No metadata with the Elastic Hash Algorithm... 4 A Gluster

More information

The cloud storage service bwsync&share at KIT

The cloud storage service bwsync&share at KIT The cloud storage service bwsync&share at KIT Alexander Yasnogor, Nico Schlitter, Andreas Petzold @CERN, Workshop on Cloud Services for File Synchronisation and Sharing STEINBUCH CENTRE FOR COMPUTING -

More information

Scientific Storage at FNAL. Gerard Bernabeu Altayo Dmitry Litvintsev Gene Oleynik 14/10/2015

Scientific Storage at FNAL. Gerard Bernabeu Altayo Dmitry Litvintsev Gene Oleynik 14/10/2015 Scientific Storage at FNAL Gerard Bernabeu Altayo Dmitry Litvintsev Gene Oleynik 14/10/2015 Index - Storage use cases - Bluearc - Lustre - EOS - dcache disk only - dcache+enstore Data distribution by solution

More information

WOS Cloud. ddn.com. Personal Storage for the Enterprise. DDN Solution Brief

WOS Cloud. ddn.com. Personal Storage for the Enterprise. DDN Solution Brief DDN Solution Brief Personal Storage for the Enterprise WOS Cloud Secure, Shared Drop-in File Access for Enterprise Users, Anytime and Anywhere 2011 DataDirect Networks. All Rights Reserved DDN WOS Cloud

More information

Business and enterprise cloud sync, backup and sharing solutions

Business and enterprise cloud sync, backup and sharing solutions Business and enterprise cloud sync, backup and sharing solutions Private Cloud Synchronization Collaboration Backup / Versioning Web Access Mobile Access LDAP / Active Directory Federated RADIUS Single

More information

Investigation of storage options for scientific computing on Grid and Cloud facilities

Investigation of storage options for scientific computing on Grid and Cloud facilities Investigation of storage options for scientific computing on Grid and Cloud facilities Overview Context Test Bed Lustre Evaluation Standard benchmarks Application-based benchmark HEPiX Storage Group report

More information

HSS: A simple file storage system for web applications

HSS: A simple file storage system for web applications HSS: A simple file storage system for web applications Abstract AOL Technologies has created a scalable object store for web applications. The goal of the object store was to eliminate the creation of

More information

GPFS Storage Server. Concepts and Setup in Lemanicus BG/Q system" Christian Clémençon (EPFL-DIT)" " 4 April 2013"

GPFS Storage Server. Concepts and Setup in Lemanicus BG/Q system Christian Clémençon (EPFL-DIT)  4 April 2013 GPFS Storage Server Concepts and Setup in Lemanicus BG/Q system" Christian Clémençon (EPFL-DIT)" " Agenda" GPFS Overview" Classical versus GSS I/O Solution" GPFS Storage Server (GSS)" GPFS Native RAID

More information

Reference Design: Scalable Object Storage with Seagate Kinetic, Supermicro, and SwiftStack

Reference Design: Scalable Object Storage with Seagate Kinetic, Supermicro, and SwiftStack Reference Design: Scalable Object Storage with Seagate Kinetic, Supermicro, and SwiftStack May 2015 Copyright 2015 SwiftStack, Inc. swiftstack.com Page 1 of 19 Table of Contents INTRODUCTION... 3 OpenStack

More information

IBM ELASTIC STORAGE SEAN LEE

IBM ELASTIC STORAGE SEAN LEE IBM ELASTIC STORAGE SEAN LEE Solution Architect Platform Computing Division IBM Greater China Group Agenda Challenges in Data Management What is IBM Elastic Storage Key Features Elastic Storage Server

More information

Parallels Plesk Automation

Parallels Plesk Automation Parallels Plesk Automation Contents Compact Configuration: Linux Shared Hosting 3 Compact Configuration: Mixed Linux and Windows Shared Hosting 4 Medium Size Configuration: Mixed Linux and Windows Shared

More information

Google File System. Web and scalability

Google File System. Web and scalability Google File System Web and scalability The web: - How big is the Web right now? No one knows. - Number of pages that are crawled: o 100,000 pages in 1994 o 8 million pages in 2005 - Crawlable pages might

More information

Big Science and Big Data Dirk Duellmann, CERN Apache Big Data Europe 28 Sep 2015, Budapest, Hungary

Big Science and Big Data Dirk Duellmann, CERN Apache Big Data Europe 28 Sep 2015, Budapest, Hungary Big Science and Big Data Dirk Duellmann, CERN Apache Big Data Europe 28 Sep 2015, Budapest, Hungary 16/02/2015 Real-Time Analytics: Making better and faster business decisions 8 The ATLAS experiment

More information

Introduction to the EIS Guide

Introduction to the EIS Guide Introduction to the EIS Guide The AirWatch Enterprise Integration Service (EIS) provides organizations the ability to securely integrate with back-end enterprise systems from either the AirWatch SaaS environment

More information

Storage solutions for a. infrastructure. Giacinto DONVITO INFN-Bari. Workshop on Cloud Services for File Synchronisation and Sharing

Storage solutions for a. infrastructure. Giacinto DONVITO INFN-Bari. Workshop on Cloud Services for File Synchronisation and Sharing Storage solutions for a productionlevel cloud infrastructure Giacinto DONVITO INFN-Bari Synchronisation and Sharing 1 Outline Use cases Technologies evaluated Implementation (hw and sw) Problems and optimization

More information

HDFS Users Guide. Table of contents

HDFS Users Guide. Table of contents Table of contents 1 Purpose...2 2 Overview...2 3 Prerequisites...3 4 Web Interface...3 5 Shell Commands... 3 5.1 DFSAdmin Command...4 6 Secondary NameNode...4 7 Checkpoint Node...5 8 Backup Node...6 9

More information

http://docs.trendmicro.com

http://docs.trendmicro.com Trend Micro Incorporated reserves the right to make changes to this document and to the products described herein without notice. Before installing and using the product, please review the readme files,

More information

The Ultimate Business & Enterprise Hosting Solutions. www.radonhosting.com

The Ultimate Business & Enterprise Hosting Solutions. www.radonhosting.com The Ultimate Business & Enterprise Hosting Solutions Radon is for businesses that demand high performance, versatile and scalable solutions. From hosting your Website, Email, Voice, SMS and Business Applications,

More information

www.thinkparq.com www.beegfs.com

www.thinkparq.com www.beegfs.com www.thinkparq.com www.beegfs.com KEY ASPECTS Maximum Flexibility Maximum Scalability BeeGFS supports a wide range of Linux distributions such as RHEL/Fedora, SLES/OpenSuse or Debian/Ubuntu as well as a

More information

Self service for software development tools

Self service for software development tools Self service for software development tools Michal Husejko, behalf of colleagues in CERN IT/PES CERN IT Department CH-1211 Genève 23 Switzerland www.cern.ch/it Self service for software development tools

More information

Datacenter Operating Systems

Datacenter Operating Systems Datacenter Operating Systems CSE451 Simon Peter With thanks to Timothy Roscoe (ETH Zurich) Autumn 2015 This Lecture What s a datacenter Why datacenters Types of datacenters Hyperscale datacenters Major

More information

http://docs.trendmicro.com

http://docs.trendmicro.com Trend Micro Incorporated reserves the right to make changes to this document and to the products described herein without notice. Before installing and using the product, please review the readme files,

More information

Hadoop & its Usage at Facebook

Hadoop & its Usage at Facebook Hadoop & its Usage at Facebook Dhruba Borthakur Project Lead, Hadoop Distributed File System dhruba@apache.org Presented at the Storage Developer Conference, Santa Clara September 15, 2009 Outline Introduction

More information

Object storage in Cloud Computing and Embedded Processing

Object storage in Cloud Computing and Embedded Processing Object storage in Cloud Computing and Embedded Processing Jan Jitze Krol Systems Engineer DDN We Accelerate Information Insight DDN is a Leader in Massively Scalable Platforms and Solutions for Big Data

More information

AIX NFS Client Performance Improvements for Databases on NAS

AIX NFS Client Performance Improvements for Databases on NAS AIX NFS Client Performance Improvements for Databases on NAS October 20, 2005 Sanjay Gulabani Sr. Performance Engineer Network Appliance, Inc. gulabani@netapp.com Diane Flemming Advisory Software Engineer

More information

White Paper. Anywhere, Any Device File Access with IT in Control. Enterprise File Serving 2.0

White Paper. Anywhere, Any Device File Access with IT in Control. Enterprise File Serving 2.0 White Paper Enterprise File Serving 2.0 Anywhere, Any Device File Access with IT in Control Like it or not, cloud- based file sharing services have opened up a new world of mobile file access and collaborative

More information

Ignify ecommerce. Item Requirements Notes

Ignify ecommerce. Item Requirements Notes wwwignifycom Tel (888) IGNIFY5 sales@ignifycom Fax (408) 516-9006 Ignify ecommerce Server Configuration 1 Hardware Requirement (Minimum configuration) Item Requirements Notes Operating System Processor

More information

File Sharing and Network Marketing

File Sharing and Network Marketing GRAU DataSpace 2.0 THE SECURE COMMUNICATION PLATFORM FOR ENTERPRISES AND ORGANIZATIONS YOUR DATA. YOUR CONTROL Introduction Global file sharing is a mega trend Sync & Share is a new product category Consumer

More information

Distributed Data Storage Based on Web Access and IBP Infrastructure. Faculty of Informatics Masaryk University Brno, The Czech Republic

Distributed Data Storage Based on Web Access and IBP Infrastructure. Faculty of Informatics Masaryk University Brno, The Czech Republic Distributed Data Storage Based on Web Access and IBP Infrastructure Lukáš Hejtmánek Faculty of Informatics Masaryk University Brno, The Czech Republic Summary New web based distributed data storage infrastructure

More information

Hadoop Distributed File System. T-111.5550 Seminar On Multimedia 2009-11-11 Eero Kurkela

Hadoop Distributed File System. T-111.5550 Seminar On Multimedia 2009-11-11 Eero Kurkela Hadoop Distributed File System T-111.5550 Seminar On Multimedia 2009-11-11 Eero Kurkela Agenda Introduction Flesh and bones of HDFS Architecture Accessing data Data replication strategy Fault tolerance

More information

Enterprise Private Cloud Storage

Enterprise Private Cloud Storage Enterprise Private Cloud Storage The term cloud storage seems to have acquired many definitions. At Cloud Leverage, we define cloud storage as an enterprise-class file server located in multiple geographically

More information

files without borders

files without borders files without borders exploring Internet-connected storage for research Fabio Hernandez fabio@in2p3.fr IN2P3 / CNRS computing center, Lyon, France FJPPL compu+ng workshop, Lyon, March 11th 2015 2 Preamble

More information

www.basho.com Technical Overview Simple, Scalable, Object Storage Software

www.basho.com Technical Overview Simple, Scalable, Object Storage Software www.basho.com Technical Overview Simple, Scalable, Object Storage Software Table of Contents Table of Contents... 1 Introduction & Overview... 1 Architecture... 2 How it Works... 2 APIs and Interfaces...

More information

Michael Thomas, Dorian Kcira California Institute of Technology. CMS Offline & Computing Week

Michael Thomas, Dorian Kcira California Institute of Technology. CMS Offline & Computing Week Michael Thomas, Dorian Kcira California Institute of Technology CMS Offline & Computing Week San Diego, April 20-24 th 2009 Map-Reduce plus the HDFS filesystem implemented in java Map-Reduce is a highly

More information

HP reference configuration for entry-level SAS Grid Manager solutions

HP reference configuration for entry-level SAS Grid Manager solutions HP reference configuration for entry-level SAS Grid Manager solutions Up to 864 simultaneous SAS jobs and more than 3 GB/s I/O throughput Technical white paper Table of contents Executive summary... 2

More information

Oracle TimesTen In-Memory Database on Oracle Exalogic Elastic Cloud

Oracle TimesTen In-Memory Database on Oracle Exalogic Elastic Cloud An Oracle White Paper July 2011 Oracle TimesTen In-Memory Database on Oracle Exalogic Elastic Cloud Executive Summary... 3 Introduction... 4 Hardware and Software Overview... 5 Compute Node... 5 Storage

More information

Migration and Building of Data Centers in IBM SoftLayer with the RackWare Management Module

Migration and Building of Data Centers in IBM SoftLayer with the RackWare Management Module Migration and Building of Data Centers in IBM SoftLayer with the RackWare Management Module June, 2015 WHITE PAPER Contents Advantages of IBM SoftLayer and RackWare Together... 4 Relationship between

More information

Performance Analysis and Capacity Planning Whitepaper

Performance Analysis and Capacity Planning Whitepaper Performance Analysis and Capacity Planning Whitepaper Contents P E R F O R M A N C E A N A L Y S I S & Executive Summary... 3 Overview... 3 Product Architecture... 4 Test Environment... 6 Performance Test

More information

SYSTEM SETUP FOR SPE PLATFORMS

SYSTEM SETUP FOR SPE PLATFORMS BEST PRACTICE SYSTEM SETUP FOR SPE PLATFORMS Product Snow License Manager Version 7.0 Content System requirements SQL Server configuration Maintenance Test environment Document date 2015-10-15 ABOUT THIS

More information

Apache Hadoop FileSystem and its Usage in Facebook

Apache Hadoop FileSystem and its Usage in Facebook Apache Hadoop FileSystem and its Usage in Facebook Dhruba Borthakur Project Lead, Apache Hadoop Distributed File System dhruba@apache.org Presented at Indian Institute of Technology November, 2010 http://www.facebook.com/hadoopfs

More information

Migration and Building of Data Centers in IBM SoftLayer with the RackWare Management Module

Migration and Building of Data Centers in IBM SoftLayer with the RackWare Management Module Migration and Building of Data Centers in IBM SoftLayer with the RackWare Management Module June, 2015 WHITE PAPER Contents Advantages of IBM SoftLayer and RackWare Together... 4 Relationship between

More information

The OpenStack TM Object Storage system

The OpenStack TM Object Storage system The OpenStack TM Object Storage system Deploying and managing a scalable, open- source cloud storage system with the SwiftStack Platform By SwiftStack, Inc. contact@swiftstack.com Contents Introduction...

More information

Server Installation Manual 4.4.1

Server Installation Manual 4.4.1 Server Installation Manual 4.4.1 1. Product Information Product: BackupAgent Server Version: 4.4.1 2. Introduction BackupAgent Server has several features. The application is a web application and offers:

More information

Large Scale file storage with MogileFS. Stuart Teasdale Lead System Administrator we7 Ltd

Large Scale file storage with MogileFS. Stuart Teasdale Lead System Administrator we7 Ltd Large Scale file storage with MogileFS Stuart Teasdale Lead System Administrator we7 Ltd About We7 A web based streaming music service 6.5 million tracks 192kbps and 320kbps mp3s Sending over a gigabit

More information

Deploying Business Virtual Appliances on Open Source Cloud Computing

Deploying Business Virtual Appliances on Open Source Cloud Computing International Journal of Computer Science and Telecommunications [Volume 3, Issue 4, April 2012] 26 ISSN 2047-3338 Deploying Business Virtual Appliances on Open Source Cloud Computing Tran Van Lang 1 and

More information

WHITE PAPER: BEST PRACTICES. Sizing and Scalability Recommendations for Symantec Endpoint Protection. Symantec Enterprise Security Solutions Group

WHITE PAPER: BEST PRACTICES. Sizing and Scalability Recommendations for Symantec Endpoint Protection. Symantec Enterprise Security Solutions Group 2 WHITE PAPER: BEST PRACTICES Sizing and Scalability Recommendations for Symantec Rev 2.3 Symantec Enterprise Security Solutions Group White Paper: Symantec Best Practices Contents Introduction... 4 The

More information

How swift is your Swift? Ning Zhang, OpenStack Engineer at Zmanda Chander Kant, CEO at Zmanda

How swift is your Swift? Ning Zhang, OpenStack Engineer at Zmanda Chander Kant, CEO at Zmanda How swift is your Swift? Ning Zhang, OpenStack Engineer at Zmanda Chander Kant, CEO at Zmanda 1 Outline Build a cost-efficient Swift cluster with expected performance Background & Problem Solution Experiments

More information

Data storage services at CC-IN2P3

Data storage services at CC-IN2P3 Centre de Calcul de l Institut National de Physique Nucléaire et de Physique des Particules Data storage services at CC-IN2P3 Jean-Yves Nief Agenda Hardware: Storage on disk. Storage on tape. Software:

More information

DESYcloud: an owncloud & dcache update

DESYcloud: an owncloud & dcache update : an owncloud & dcache update Paul Millar (on behalf of team) : an owncloud & dcache update Cloud Services for Synchronisation and Sharing Zürich, Switzerland. 2016-01-18 2016-01-19 http://cs3.ethz.ch/

More information

Floating on a Hybrid Cloud: SQL Server 2012 & Microsoft Azure. Timothy P. McAliley Microsoft Premier Field Engineer SQL Server

Floating on a Hybrid Cloud: SQL Server 2012 & Microsoft Azure. Timothy P. McAliley Microsoft Premier Field Engineer SQL Server Floating on a Hybrid Cloud: SQL Server 2012 & Microsoft Azure Timothy P. McAliley Microsoft Premier Field Engineer SQL Server Agenda Hybrid cloud solutions Geo-replication Restore to VM Running SQL Server

More information

Cloud Optimize Your IT

Cloud Optimize Your IT Cloud Optimize Your IT Windows Server 2012 The information contained in this presentation relates to a pre-release product which may be substantially modified before it is commercially released. This pre-release

More information

SQL Server Hybrid Features. Luis Vargas, Senior Program Manager Lead Xin Jin, Program Manager @ Microsoft

SQL Server Hybrid Features. Luis Vargas, Senior Program Manager Lead Xin Jin, Program Manager @ Microsoft SQL Server Hybrid Features Luis Vargas, Senior Program Manager Lead Xin Jin, Program Manager @ Microsoft Key Takeaway SQL Server integrates cloud services from Azure providing end-to-end experiences that

More information

Server Scalability and High Availability

Server Scalability and High Availability Server Scalability and High Availability GO!Enterprise GLOBO Plc. March 2015 Copyright Notice and Usage Terms This guide is Copyright 2012 GLOBO. All Rights Reserved. Permission is granted to make and

More information

Alfresco Enterprise on AWS: Reference Architecture

Alfresco Enterprise on AWS: Reference Architecture Alfresco Enterprise on AWS: Reference Architecture October 2013 (Please consult http://aws.amazon.com/whitepapers/ for the latest version of this paper) Page 1 of 13 Abstract Amazon Web Services (AWS)

More information

MDMCenter Hardware Specification

MDMCenter Hardware Specification MDMCenter Hardware Specification 2011 This document provides the recommended specifications for the hardware required to run MDMCenter on different environments. It also provides a list of prerequisite

More information

SHAREPOINT 2010 REMOTE BLOB STORES WITH EMC ISILON NAS AND METALOGIX STORAGEPOINT

SHAREPOINT 2010 REMOTE BLOB STORES WITH EMC ISILON NAS AND METALOGIX STORAGEPOINT White Paper SHAREPOINT 2010 REMOTE BLOB STORES WITH EMC ISILON NAS AND METALOGIX STORAGEPOINT Abstract This white paper describes how to externalize Microsoft SharePoint Server 2010 BLOB stores to EMC

More information

Four Reasons To Start Working With NFSv4.1 Now

Four Reasons To Start Working With NFSv4.1 Now Four Reasons To Start Working With NFSv4.1 Now PRESENTATION TITLE GOES HERE Presented by: Alex McDonald Hosted by: Gilles Chekroun Ethernet Storage Forum Members The SNIA Ethernet Storage Forum (ESF) focuses

More information

NoSQL Performance Test In-Memory Performance Comparison of SequoiaDB, Cassandra, and MongoDB

NoSQL Performance Test In-Memory Performance Comparison of SequoiaDB, Cassandra, and MongoDB bankmark UG (haftungsbeschränkt) Bahnhofstraße 1 9432 Passau Germany www.bankmark.de info@bankmark.de T +49 851 25 49 49 F +49 851 25 49 499 NoSQL Performance Test In-Memory Performance Comparison of SequoiaDB,

More information

THE ATLAS DISTRIBUTED DATA MANAGEMENT SYSTEM & DATABASES

THE ATLAS DISTRIBUTED DATA MANAGEMENT SYSTEM & DATABASES THE ATLAS DISTRIBUTED DATA MANAGEMENT SYSTEM & DATABASES Vincent Garonne, Mario Lassnig, Martin Barisits, Thomas Beermann, Ralph Vigne, Cedric Serfon Vincent.Garonne@cern.ch ph-adp-ddm-lab@cern.ch XLDB

More information

Cisco UCS and Fusion- io take Big Data workloads to extreme performance in a small footprint: A case study with Oracle NoSQL database

Cisco UCS and Fusion- io take Big Data workloads to extreme performance in a small footprint: A case study with Oracle NoSQL database Cisco UCS and Fusion- io take Big Data workloads to extreme performance in a small footprint: A case study with Oracle NoSQL database Built up on Cisco s big data common platform architecture (CPA), a

More information

ViewBox: Integrating Local File System with Cloud Storage Service

ViewBox: Integrating Local File System with Cloud Storage Service ViewBox: Integrating Local File System with Cloud Storage Service FAST 2014 Yupu Zhang +, Chris Dragga +*, Andrea Arpaci-Dusseau +, RemziArpaci-Dusseau + University of Wisconsin-Madison 1 Outline Introduction

More information

Scientific Computing Data Management Visions

Scientific Computing Data Management Visions Scientific Computing Data Management Visions ELI-Tango Workshop Szeged, 24-25 February 2015 Péter Szász Group Leader Scientific Computing Group ELI-ALPS Scientific Computing Group Responsibilities Data

More information

OIS. Update on Windows 7 at CERN & Remote Desktop Gateway. Operating Systems & Information Services CERN IT-OIS

OIS. Update on Windows 7 at CERN & Remote Desktop Gateway. Operating Systems & Information Services CERN IT-OIS Operating Systems & Information Services Update on Windows 7 at CERN & Remote Desktop Gateway CERN IT-OIS Tim Bell, Michal Kwiatek, Michal Budzowski, Andreas Wagner HEPiX Fall 2010 Workshop 4th November

More information

BlackBerry Enterprise Server for Microsoft Exchange. Version: 5.0 Service Pack: 4. Upgrade Guide

BlackBerry Enterprise Server for Microsoft Exchange. Version: 5.0 Service Pack: 4. Upgrade Guide BlackBerry Enterprise Server for Microsoft Exchange Version: 5.0 Service Pack: 4 Upgrade Guide Published: 2014-01-16 SWD-20140116175501016 Contents 1 Overview: BlackBerry Enterprise Server...7 2 Planning

More information

Hadoop IST 734 SS CHUNG

Hadoop IST 734 SS CHUNG Hadoop IST 734 SS CHUNG Introduction What is Big Data?? Bulk Amount Unstructured Lots of Applications which need to handle huge amount of data (in terms of 500+ TB per day) If a regular machine need to

More information

Best Practices Guide. McAfee epolicy Orchestrator 5.0.0 Software

Best Practices Guide. McAfee epolicy Orchestrator 5.0.0 Software Best Practices Guide McAfee epolicy Orchestrator 5.0.0 Software COPYRIGHT Copyright 2013 McAfee, Inc. Do not copy without permission. TRADEMARK ATTRIBUTIONS McAfee, the McAfee logo, McAfee Active Protection,

More information

Francesco.diaz@insight.com. http://francescodiaz.azurewebsites.net

Francesco.diaz@insight.com. http://francescodiaz.azurewebsites.net Francesco.diaz@insight.com http://francescodiaz.azurewebsites.net Agenda SQL workloads on Azure Microsoft Azure IaaS concepts Best Practices for SQL running on Azure VMs HA/DR SQL DB HA/DR Dedicated Higher

More information

Windows Server 2012 授 權 說 明

Windows Server 2012 授 權 說 明 Windows Server 2012 授 權 說 明 PROCESSOR + CAL HA 功 能 相 同 的 記 憶 體 及 處 理 器 容 量 虛 擬 化 Windows Server 2008 R2 Datacenter Price: NTD173,720 (2 CPU) Packaging All features Unlimited virtual instances Per processor

More information

Experience with Server Self Service Center (S3C)

Experience with Server Self Service Center (S3C) Experience with Server Self Service Center (S3C) Juraj Sucik, Sebastian Bukowiec IT Department, CERN, CH-1211 Genève 23, Switzerland E-mail: juraj.sucik@cern.ch, sebastian.bukowiec@cern.ch Abstract. CERN

More information

Software Environment. Options. Service guarantee:. 24/7 Hardware Support. 99% uptime

Software Environment. Options. Service guarantee:. 24/7 Hardware Support. 99% uptime Hosting : VPS 35 Service Specifications 35GB of Disk Space 2GB of RAM 100GB of Bandwidth 1 IP Address Included Hardware Specifications CPU: Xeon Lynnfield Quad-Core X3430 2.4GHz Hard drives: 2 x SAS SATA

More information

SAP HANA Operation Expert Summit BUILD - High Availability & Disaster Recovery

SAP HANA Operation Expert Summit BUILD - High Availability & Disaster Recovery SAP HANA Operation Expert Summit BUILD - High Availability & Disaster Recovery Dr. Ralf Czekalla/SAP HANA Product Management May 09, 2014 Customer Disclaimer This presentation outlines our general product

More information

Experience in integrating enduser cloud storage for CMS Analysis

Experience in integrating enduser cloud storage for CMS Analysis Experience in integrating enduser cloud storage for CMS Analysis Hassen Riahi CERN IT CS3 Workshop Zürich, 18 th January 2016 1/18/16 H. Riahi; CS3 workshop 2 Outline Overview Goals Integration architecture

More information

msuite5 & mdesign Installation Prerequisites

msuite5 & mdesign Installation Prerequisites CommonTime Limited msuite5 & mdesign Installation Prerequisites Administration considerations prior to installing msuite5 and mdesign. 7/7/2011 Version 2.4 Overview... 1 msuite version... 1 SQL credentials...

More information

IRODS use case : Ciment, the Univ. Grenoble-Alpes HPC center. B.Bzeznik / X.Briand Irods users group meeting 11/06/2015

IRODS use case : Ciment, the Univ. Grenoble-Alpes HPC center. B.Bzeznik / X.Briand Irods users group meeting 11/06/2015 IRODS use case : Ciment, the Univ. Grenoble-Alpes HPC center B.Bzeznik / X.Briand Irods users group meeting 11/06/2015 IRODS rocks! We like rocks here... Irods is used (famous) in the French Alps since

More information

Fileweave. Large File Transfer. Seamless Microsoft Outlook add-in. Simple drag and drop functionality

Fileweave. Large File Transfer. Seamless Microsoft Outlook add-in. Simple drag and drop functionality Overview The sparkweave suite of applications is next-generation secure communication and collaboration technology for the changing needs of the enterprise. Sparkweave combines the best attributes of consumer

More information

Storage Made Easy Enterprise File Share and Sync (EFSS) Cloud Control Gateway Architecture

Storage Made Easy Enterprise File Share and Sync (EFSS) Cloud Control Gateway Architecture Storage Made Easy Enterprise File Share and Sync (EFSS) Architecture Software Stack The SME platform is built using open Internet technologies. The base operating system used s hardened Linux CentOS. HTTPD

More information

Integrated Challan cum Return (ICR) - Scope of Work for ICR Platform

Integrated Challan cum Return (ICR) - Scope of Work for ICR Platform Integrated Challan cum Return (ICR) - Scope of Work for ICR Platform 1. Proposed System Modules: a) User Portal b) ICR Management Platform c) Integration with Bankers Core Banking platform d) Messaging

More information

Pre-Migration Assessment Report SAMPLE REPORT. Number and type of source servers: 6 Windows Number of type of destination servers: 3 windows

Pre-Migration Assessment Report SAMPLE REPORT. Number and type of source servers: 6 Windows Number of type of destination servers: 3 windows MIGRATION PLAN OVERVIEW Number and type of source servers: 6 Windows Number of type of destination servers: 3 windows Destination server use (i.e., DB2, Test/QA3): Web, DB, App (GP Dynamics) Notable changes

More information

CLOUD BASED SERVICE (CBS STORAGE)

CLOUD BASED SERVICE (CBS STORAGE) CLOUD BASED SERVICE (CBS STORAGE) Defining next generation of cloud based grid Power All Networks Ltd. Technical Whitepaper September 2008, version 1.04 PAGE 1 Table of Content Cloud Based Services (CBS

More information

Building low cost disk storage with Ceph and OpenStack Swift

Building low cost disk storage with Ceph and OpenStack Swift Background photo from: http://edelomahony.com/2011/07/25/loving-money-doesnt-bring-you-more/ Building low cost disk storage with Ceph and OpenStack Swift Paweł Woszuk, Maciej Brzeźniak TERENA TF-Storage

More information

Microsoft SharePoint Server 2010

Microsoft SharePoint Server 2010 Microsoft SharePoint Server 2010 Small Farm Performance Study Dell SharePoint Solutions Ravikanth Chaganti and Quocdat Nguyen November 2010 THIS WHITE PAPER IS FOR INFORMATIONAL PURPOSES ONLY, AND MAY

More information