Tactical Storage: Simple, Secure, and Semantic Access to Remote Data. Prof. Douglas Thain University of Notre Dame

Size: px
Start display at page:

Download "Tactical Storage: Simple, Secure, and Semantic Access to Remote Data. Prof. Douglas Thain University of Notre Dame http://www.cse.nd."

Transcription

1 Tactical Storage: Simple, Secure, and Semantic Access to Remote Data Prof. Douglas Thain University of Notre Dame

2

3

4 Plentiful Computing Power As of 25 April Condor Worldwide: 56,682 CPUs /??? TB / 1758 sites Teragrid 15,328 CPUs / 220 TB / 6 sites Open Science Grid 21,156 CPUs / 83 TB / 61 sites EGEE Grid Lots???

5 Complex Ecology of Storage Shared Filesystem shared disk private disk private disk private disk HTTP, FTP, RFIO, glite, SRB, SCP, RSYNC, HTTP... private disk Independent Cluster Disks shared disk

6 Problems Accessing Data Large Burden on the User User may not be able/willing to state s in advance. Different services/protocols available at different sites. Programs not modified to take advantage of services. Different access modes for different purposes. File transfer: preparing system for intended use. File system: access to data for running jobs. Resources go unused. Disks on each node of a cluster. Unorganized resources in a department/lab. Would like to combine disks into larger structures. A global system can t t satisfy everyone! (Global means different things to different people.) Both a technical and social problem.

7 What s s the Problem? We often assume that the site administrator is responsible for making the site comfortable for the user. (Not possible on the grid!) Rather, the user should be able to bring along a mechanism to access multiple independent (remote?) data sources. Of course, we have to make it easy!

8 Tactical Storage Systems (TSS) A TSS allows any node to serve as a or as a system client. All components can be deployed without special privileges but with security. Users can build up complex structures. Filesystems,, databases, caches,... Admins need not know/care about larger structures. Two Independent Concepts: Resources The raw storage to be used. Abstractions The organization of storage.

9 App transfer App Parrot??? Simple Filesystem App Parrot Parrot Distributed Filesystem Abstraction Distributed Database Abstraction UNIX UNIX UNIX UNIX UNIX UNIX UNIX system system UNIX UNIX UNIX UNIX UNIX UNIX UNIX system 3PT system system system system Workstations owners control policy on each machine. Cluster administrator controls policy on all storage in cluster

10 Key Properties Tactical Storage is Simple: Appears as an ordinary system. Applies to unmodified applications and data w/out code changes, relinking,, kernel modules, etc... Tactical Storage is Secure: Authentication with standard GSI or Kerberos. Rich distributed access control system. Tactical Storage is Semantic: Name data by meaning, not by location. Supports external name resolution mechanisms.

11

12

13

14

15

16

17 Access Control in File Servers Unix Security is not Sufficient No global user database possible/desirable. Mapping external credentials to Unix gets messy. Instead, Make External Names First-Class Perform access control on remote, not local, names. Types: Globus,, Kerberos, Unix, Hostname, Address Each directory has an ACL: globus:/o= :/O=NotreDame/CN=DThain hostname:*.cs.nd.edu RWLA RWL RL address: * RWLA

18 Distributed Group ACLs App App App App UNIX UNIX UNIX UNIX UNIX UNIX UNIX system system system system system system system Chemistry Group List data Lab 5 Group List data Physics Group List ACL: Lab 5 RW Chemistry R ACL: Physics RW Lab 5 R

19 Semantic Data Access Appl Parrot /usr/local = /chirp/host5.nd.edu/software /tmp = /chirp/host9.nd.edu/scratch /data = /gsiftp/ftp.nd.edu/mydata /db = resolver:find_db Where is /db/dir/523? find_db /usr/local /data /tmp It s at /ftp/ftp.infn.it/db/xz host5 host9 FTP

20

21 Credit: Sander NIKHEF Remote Database Access GSI HEP Simulation Needs Direct DB Access App linked against Objectivity DB. Objectivity accesses system directly. How to distribute application securely? Solution: Remote Root Mount via Parrot: script Parrot Simple FS parrot M M /=/chirp//rootdir rootdir DB code can read/write/lock s directly. WAN GSI Auth system DB data libdb.so sim.exe

22 Credit: Igor Fermi National Lab Remote Application Loading Modular Simulation Needs Many Libraries Devel.. on workstations, then ported to grid. Selection of library depends on analysis tech. Constraint: Must use HTTP for access. Solution: Dynamic Link with TSS+HTTP: /home/cdfsoft -> > /http/dcaf.fnal.gov/cdfsoft appl Parrot HTTP proxy proxy HTTP system liba.so libb.so libc.so select several MB from 60 GB of libraries

23 Technical Problem HTTP is not a system!! (No directories) Advantages: Firewalls, caches, admins. Appl GET /home HTTP/1.0 HTTP Server Parrot opendir(/home) <HTML> <HEAD> <H1> root opendir(/home) home etc bin HTTP Module alice babar cms

24 Technical Problem Solution: Turn the directories into s. Can be cached in ordinary proxies! Hierarchical SHA1 integrity check. Appl Parrot opendir(/home) GET /home/.dir HTTP/1.0 alice babar cms HTTP Server root make httpfs opendir(/home).dir home etc bin HTTP Module alice babar cms.dir

25 Logical Access to Bio Data Many databases of biological data in different formats around the world: Archives: Swiss-Prot, TreMBL,, NCBI, etc... Replicas: Public, Shared, Private,??? Users and applications want to refer to data objects by logical name, not location! Access the nearest copy of the non-redundant protein database, don t t care where it is. Solution: EGEE data management system maps logical names (LFNs( LFNs) ) to physical names (SFNs( SFNs). Credit: Christophe Blanchet, Bioinformatics Center of Lyon, CNRS IBCP, France Christophe.Blanchet@ibcp.fr

26 Logical Access to Bio Data Run BLAST on LFN://ncbi.gov/nr.data BLAST open(lfn://ncbi.gov/nr.data) Parrot Where is LFN://ncbi.gov/nr.data? EGEE File Location Service Find it at: FTP://ibcp.fr/nr.data glite Server nr.data Chirp Server nr.data open(ftp://ibcp.fr/nr.data) RFIO glite HTTP FTP RETR nr.data FTP Server nr.data

27 % ( - / ( 3 2 % - / 3 2 Performance of Bio Apps on EGEE Performance of Bio Apps on EGEE -, )*+ ( + &' % $# -, )*+ +.* -, )*+ + & &10 *+ (2,4 5 &' % $# (2,4 5.* (2,4 5 & &10 *+ "!

28 Credit: John Notre Dame Astrophysics Dept. Expandable Filesystem for Experimental Data Project GRAND Can only analyze the most recent data. 2 GB/day today could be lots more! 30-year archive analysis code buffer disk daily tape daily daily tape daily tape daily tape tape

29 Credit: John Notre Dame Astrophysics Dept. Expandable Filesystem for Experimental Data Can analyze all data over large time scales. Project GRAND analysis code Adapter Distributed Shared Filesystem buffer disk 2 GB/day today could be lots more! daily tape 30-year archive daily daily tape daily tape daily tape tape

30 Current Work Now that we can easily use any storage... Much easier to arrange data/jobs arbitrarily. Idea: combine cluster storage / cluster comp! Goal: keep jobs close to data that they need. PINS: Processing in STorage Example: GEMS Distributed Databank Facility for creating, storing, and analyzing molecular dynamics data in a cluster. Goal: Be able to easily scale both CPU and storage capacity by adding commodity nodes. Credit: Jesus Izaguirre and Aaron Notre Dame

31 meta-data database D1 D1 D3 D4 Fetch D1 Compute F(D1) Query (Mol== CH4 ) && (T>300K) App Adapter Distributed Filesystem Abstraction D2 D3 D4 UNIX UNIX UNIX UNIX UNIX UNIX UNIX D1 D2 D3 D4 system D1 system F(D1) system D2 system D3 system D4 system system J1 F J2 J3 J4

32 More Open Problems Resource Management How to prevent overcommitment -> badput? Security How to easily express complex policies for sharing and controlling combined cpu/disk? Reliability How to deal with disconnection, erasure, rejection, unexpected performance, etc... Garbage Collection What s s to prevent me from filling every disk everywhere with computations that I might need? Debugging How do we dig out of numerous, noisy, distributed logs that state relevant to a complex workflow?

33 Conclusion Tactical storage allows end users to build large structures out of simple building blocks without getting stuck on the ugly details.

34 Acknowledgments Science Collaborators: Christophe Blanchet Patrick Flynn Sander Klous Peter Kunzst Erwin Laure John Poirier Igor Sfiligoi CS Collaborators: Jesus Izaguirre Aaron Striegel CS Students: Paul Brenner James Fitzgerald Jeff Hemmes Paul Madrid Chris Moretti Gerhard Niederwieser Phil Snowberger Justin Wozniak

35 For more information... Cooperative Computing Lab Cooperative Computing Tools Douglas Thain

36

37 Problem: Shared Namespace globus:/o=notredame/* RWLAX test.c a.out test.dat cms.exe

38 Solution: Reservation (V) Right /O=NotreDame/CN=Monk mkdir only! /O=NotreDame/CN=Ted O=NotreDame/CN=* V(RWLA) mkdir mkdir /O=NotreDame/CN=Monk RWLA /O=NotreDame/CN=Ted RWLA test.c a.out test.c a.out

CDF software distribution on the Grid using Parrot

CDF software distribution on the Grid using Parrot CDF software distribution on the Grid using Parrot G Compostella 1, S Pagan Griso 2, D Lucchesi 2, I Sfiligoi 3 and D Thain 4 1 INFN-CNAF, viale Berti Pichat 6/2, 40127 Bologna, ITALY 2 Dipartimento di

More information

Chirp: A Practical Global Filesystem for Cluster and Grid Computing

Chirp: A Practical Global Filesystem for Cluster and Grid Computing Chirp: A Practical Global Filesystem for Cluster and Grid Computing Douglas Thain, Christopher Moretti, and Jeffrey Hemmes Department of Computer Science and Engineering, University of Notre Dame Abstract.

More information

Attaching Cloud Storage to a Campus Grid Using Parrot, Chirp, and Hadoop

Attaching Cloud Storage to a Campus Grid Using Parrot, Chirp, and Hadoop Attaching Cloud Storage to a Campus Grid Using Parrot, Chirp, and Hadoop Patrick Donnelly, Peter Bui, Douglas Thain Computer Science and Engineering University of Notre Dame pdonnel3@nd.edu pbui@nd.edu

More information

The Consequences of Decentralized Security in a Cooperative Storage System

The Consequences of Decentralized Security in a Cooperative Storage System The Consequences of Decentralized Security in a Cooperative Storage System Douglas Thain, Christopher Moretti, Paul Madrid, Philip Snowberger, and Jeffrey Hemmes Department of Computer Science and Engineering

More information

Access Control for a Replica Management Database

Access Control for a Replica Management Database Access Control for a Replica Management Database Justin M. Wozniak Dept. of Computer Science and Engineering University of Notre Dame Notre Dame, Indiana jwozniak@nd.edu Paul Brenner Dept. of Computer

More information

EGEE-2 NA4 Biomed Bioinformatics in CNRS

EGEE-2 NA4 Biomed Bioinformatics in CNRS Enabling Grids for E-sciencE EGEE-2 NA4 Biomed Bioinformatics in CNRS Christophe Blanchet Institute of Biology and Chemistry of Proteins Lyon, April 28, 2006 www.eu-egee.org Enabling Grids for E-sciencE

More information

File Transfer Best Practices

File Transfer Best Practices File Transfer Best Practices David Turner User Services Group NERSC User Group Meeting October 2, 2008 Overview Available tools ftp, scp, bbcp, GridFTP, hsi/htar Examples and Performance LAN WAN Reliability

More information

campus Grid Management and the Chirp Filesystem

campus Grid Management and the Chirp Filesystem Attaching Cloud Storage to a Campus Grid Using Parrot, Chirp, and Patrick Donnelly, Peter Bui, Douglas Thain Computer Science and Engineering University of Notre Dame pdonnel3, pbui, dthain@nd.edu Abstract

More information

Data Management. Network transfers

Data Management. Network transfers Data Management Network transfers Network data transfers Not everyone needs to transfer large amounts of data on and off a HPC service Sometimes data is created and consumed on the same service. If you

More information

Storage Virtualization. Andreas Joachim Peters CERN IT-DSS

Storage Virtualization. Andreas Joachim Peters CERN IT-DSS Storage Virtualization Andreas Joachim Peters CERN IT-DSS Outline What is storage virtualization? Commercial and non-commercial tools/solutions Local and global storage virtualization Scope of this presentation

More information

The dcache Storage Element

The dcache Storage Element 16. Juni 2008 Hamburg The dcache Storage Element and it's role in the LHC era for the dcache team Topics for today Storage elements (SEs) in the grid Introduction to the dcache SE Usage of dcache in LCG

More information

CERN Cloud Storage Evaluation Geoffray Adde, Dirk Duellmann, Maitane Zotes CERN IT

CERN Cloud Storage Evaluation Geoffray Adde, Dirk Duellmann, Maitane Zotes CERN IT SS Data & Storage CERN Cloud Storage Evaluation Geoffray Adde, Dirk Duellmann, Maitane Zotes CERN IT HEPiX Fall 2012 Workshop October 15-19, 2012 Institute of High Energy Physics, Beijing, China SS Outline

More information

Data Management in an International Data Grid Project. Timur Chabuk 04/09/2007

Data Management in an International Data Grid Project. Timur Chabuk 04/09/2007 Data Management in an International Data Grid Project Timur Chabuk 04/09/2007 Intro LHC opened in 2005 several Petabytes of data per year data created at CERN distributed to Regional Centers all over the

More information

EXPLORING LINUX KERNEL: THE EASY WAY!

EXPLORING LINUX KERNEL: THE EASY WAY! EXPLORING LINUX KERNEL: THE EASY WAY! By: Ahmed Bilal Numan 1 PROBLEM Explore linux kernel TCP/IP stack Solution Try to understand relative kernel code Available text Run kernel in virtualized environment

More information

Installing and Configuring Websense Content Gateway

Installing and Configuring Websense Content Gateway Installing and Configuring Websense Content Gateway Websense Support Webinar - September 2009 web security data security email security Support Webinars 2009 Websense, Inc. All rights reserved. Webinar

More information

IBM Smart Business Storage Cloud

IBM Smart Business Storage Cloud GTS Systems Services IBM Smart Business Storage Cloud Reduce costs and improve performance with a scalable storage virtualization solution SoNAS Gerardo Kató Cloud Computing Solutions 2010 IBM Corporation

More information

Data Movement and Storage. Drew Dolgert and previous contributors

Data Movement and Storage. Drew Dolgert and previous contributors Data Movement and Storage Drew Dolgert and previous contributors Data Intensive Computing Location Viewing Manipulation Storage Movement Sharing Interpretation $HOME $WORK $SCRATCH 72 is a Lot, Right?

More information

irods at CC-IN2P3: managing petabytes of data

irods at CC-IN2P3: managing petabytes of data Centre de Calcul de l Institut National de Physique Nucléaire et de Physique des Particules irods at CC-IN2P3: managing petabytes of data Jean-Yves Nief Pascal Calvat Yonny Cardenas Quentin Le Boulc h

More information

Lab : Planning and Implementing a Virtual Machine Deployment and Management Strategy

Lab : Planning and Implementing a Virtual Machine Deployment and Management Strategy MS 20414A: Implementing an Advanced Server Infrastructure Description: Days: 5 Prerequisites: In this course, students will learn how to plan and implement some of the more advanced features available

More information

Alternative models to distribute VO specific software to WLCG sites: a prototype set up at PIC

Alternative models to distribute VO specific software to WLCG sites: a prototype set up at PIC EGEE and glite are registered trademarks Enabling Grids for E-sciencE Alternative models to distribute VO specific software to WLCG sites: a prototype set up at PIC Elisa Lanciotti, Arnau Bria, Gonzalo

More information

RH033 Red Hat Linux Essentials or equivalent experience with Red Hat Linux..

RH033 Red Hat Linux Essentials or equivalent experience with Red Hat Linux.. RH131 Red Hat Linux System Administration Course Summary For users of Linux (or UNIX) who want to start building skills in systems administration on Red Hat Linux, to a level where they can attach and

More information

Data storage services at CC-IN2P3

Data storage services at CC-IN2P3 Centre de Calcul de l Institut National de Physique Nucléaire et de Physique des Particules Data storage services at CC-IN2P3 Jean-Yves Nief Agenda Hardware: Storage on disk. Storage on tape. Software:

More information

An objective comparison test of workload management systems

An objective comparison test of workload management systems An objective comparison test of workload management systems Igor Sfiligoi 1 and Burt Holzman 1 1 Fermi National Accelerator Laboratory, Batavia, IL 60510, USA E-mail: sfiligoi@fnal.gov Abstract. The Grid

More information

Investigation of storage options for scientific computing on Grid and Cloud facilities

Investigation of storage options for scientific computing on Grid and Cloud facilities Investigation of storage options for scientific computing on Grid and Cloud facilities Overview Context Test Bed Lustre Evaluation Standard benchmarks Application-based benchmark HEPiX Storage Group report

More information

Data Management using irods

Data Management using irods Data Management using irods Fundamentals of Data Management September 2014 Albert Heyrovsky Applications Developer, EPCC a.heyrovsky@epcc.ed.ac.uk 2 Course outline Why talk about irods? What is irods?

More information

Contents III: Contents II: Contents: Rule Set Based Access Control (RSBAC) 4.2 Model Specifics 5.2 AUTH

Contents III: Contents II: Contents: Rule Set Based Access Control (RSBAC) 4.2 Model Specifics 5.2 AUTH Rule Set Based Access Control (RSBAC) Linux Kernel Security Extension Tutorial Amon Ott Contents: 1 Motivation: Why We Need Better Security in the Linux Kernel 2 Overview of RSBAC 3 How

More information

Centralized Mac Home Directories On Windows Servers: Using Windows To Serve The Mac

Centralized Mac Home Directories On Windows Servers: Using Windows To Serve The Mac Making it easy to deploy, integrate and manage Macs, iphones and ipads in a Windows environment. Centralized Mac Home Directories On Windows Servers: Using Windows To Serve The Mac 2011 ENTERPRISE DEVICE

More information

File server infrastructure @NIKHEF

File server infrastructure @NIKHEF File server infrastructure @NIKHEF CT system support June 2003 1 CT NIKHEF Outline Protocols Naming scheme (Unix, Windows) Backup and archiving Server systems Disk quota policy AFS June 2003 2 CT NIKHEF

More information

SINGLE COURSE. 136 Total Hours. After completing this course, students will be able to:

SINGLE COURSE. 136 Total Hours. After completing this course, students will be able to: NH204-1314 Designing and Implementing Server Infrastructure 136 Total Hours COURSE TITLE: Designing and Implementing Server Infrastructure COURSE OVERVIEW: After completing this course, students will be

More information

The Greenplum Analytics Workbench

The Greenplum Analytics Workbench The Greenplum Analytics Workbench External Overview 1 The Greenplum Analytics Workbench Definition Is a 1000-node Hadoop Cluster. Pre-configured with publicly available data sets. Contains the entire Hadoop

More information

files without borders

files without borders files without borders exploring Internet-connected storage for research Fabio Hernandez fabio@in2p3.fr IN2P3 / CNRS computing center, Lyon, France FJPPL compu+ng workshop, Lyon, March 11th 2015 2 Preamble

More information

Fine-Grained Access Control in the Chirp Distributed File System

Fine-Grained Access Control in the Chirp Distributed File System Fine-Grained Access Control in the Chirp Distributed File System Patrick Donnelly and Douglas Thain Department of Computer Science and Engineering University of Notre Dame Notre Dame, Indiana 46556 Email:

More information

(Possible) HEP Use Case for NDN. Phil DeMar; Wenji Wu NDNComm (UCLA) Sept. 28, 2015

(Possible) HEP Use Case for NDN. Phil DeMar; Wenji Wu NDNComm (UCLA) Sept. 28, 2015 (Possible) HEP Use Case for NDN Phil DeMar; Wenji Wu NDNComm (UCLA) Sept. 28, 2015 Outline LHC Experiments LHC Computing Models CMS Data Federation & AAA Evolving Computing Models & NDN Summary Phil DeMar:

More information

Implementing an Advanced Server Infrastructure

Implementing an Advanced Server Infrastructure Page 1 of 9 Overview Who should attend? Get hands-on instruction and practice planning, designing and deploying a physical and logical Windows Server 2012 R2 enterprise infrastructure in this 5-day Microsoft

More information

Deploying a distributed data storage system on the UK National Grid Service using federated SRB

Deploying a distributed data storage system on the UK National Grid Service using federated SRB Deploying a distributed data storage system on the UK National Grid Service using federated SRB Manandhar A.S., Kleese K., Berrisford P., Brown G.D. CCLRC e-science Center Abstract As Grid enabled applications

More information

Big Science and Big Data Dirk Duellmann, CERN Apache Big Data Europe 28 Sep 2015, Budapest, Hungary

Big Science and Big Data Dirk Duellmann, CERN Apache Big Data Europe 28 Sep 2015, Budapest, Hungary Big Science and Big Data Dirk Duellmann, CERN Apache Big Data Europe 28 Sep 2015, Budapest, Hungary 16/02/2015 Real-Time Analytics: Making better and faster business decisions 8 The ATLAS experiment

More information

Virtual Managment Appliance Setup Guide

Virtual Managment Appliance Setup Guide Virtual Managment Appliance Setup Guide 2 Sophos Installing a Virtual Appliance Installing a Virtual Appliance As an alternative to the hardware-based version of the Sophos Web Appliance, you can deploy

More information

This section is intended to provide sample configurations and script examples common to long-term operation of a Jive SBS installation.

This section is intended to provide sample configurations and script examples common to long-term operation of a Jive SBS installation. Operations Cookbook Contents Operations Cookbook...2 Enabling SSL Encryption... 2 Disabling the Local Jive System Database... 2 Changing the Configuration of an Existing Instance... 3 Performing a Jive

More information

Websense Support Webinar: Questions and Answers

Websense Support Webinar: Questions and Answers Websense Support Webinar: Questions and Answers Configuring Websense Web Security v7 with Your Directory Service Can updating to Native Mode from Active Directory (AD) Mixed Mode affect transparent user

More information

SUSE Manager in the Public Cloud. SUSE Manager Server in the Public Cloud

SUSE Manager in the Public Cloud. SUSE Manager Server in the Public Cloud SUSE Manager in the Public Cloud SUSE Manager Server in the Public Cloud Contents 1 Instance Requirements... 2 2 Setup... 3 3 Registration of Cloned Systems... 6 SUSE Manager delivers best-in-class Linux

More information

Globus Striped GridFTP Framework and Server. Raj Kettimuthu, ANL and U. Chicago

Globus Striped GridFTP Framework and Server. Raj Kettimuthu, ANL and U. Chicago Globus Striped GridFTP Framework and Server Raj Kettimuthu, ANL and U. Chicago Outline Introduction Features Motivation Architecture Globus XIO Experimental Results 3 August 2005 The Ohio State University

More information

Roberto Barbera. Centralized bookkeeping and monitoring in ALICE

Roberto Barbera. Centralized bookkeeping and monitoring in ALICE Centralized bookkeeping and monitoring in ALICE CHEP INFN 2000, GRID 10.02.2000 WP6, 24.07.2001 Roberto 1 Barbera ALICE and the GRID Phase I: AliRoot production The GRID Powered by ROOT 2 How did we get

More information

Concepts and Architecture of Grid Computing. Advanced Topics Spring 2008 Prof. Robert van Engelen

Concepts and Architecture of Grid Computing. Advanced Topics Spring 2008 Prof. Robert van Engelen Concepts and Architecture of Grid Computing Advanced Topics Spring 2008 Prof. Robert van Engelen Overview Grid users: who are they? Concept of the Grid Challenges for the Grid Evolution of Grid systems

More information

SGFS: Secure, Flexible, and Policy-based Global File Sharing

SGFS: Secure, Flexible, and Policy-based Global File Sharing SGFS: Secure, Flexible, and Policy-based Global File Sharing Vishal Kher Eric Seppanen Cory Leach Yongdae Kim {vkher,seppanen,leach,kyd}@cs.umn.edu University of Minnesota Motivation for Network attached

More information

Virtual Web Appliance Setup Guide

Virtual Web Appliance Setup Guide Virtual Web Appliance Setup Guide 2 Sophos Installing a Virtual Appliance Installing a Virtual Appliance This guide describes the procedures for installing a Virtual Web Appliance. If you are installing

More information

THE CCLRC DATA PORTAL

THE CCLRC DATA PORTAL THE CCLRC DATA PORTAL Glen Drinkwater, Shoaib Sufi CCLRC Daresbury Laboratory, Daresbury, Warrington, Cheshire, WA4 4AD, UK. E-mail: g.j.drinkwater@dl.ac.uk, s.a.sufi@dl.ac.uk Abstract: The project aims

More information

Simplest Scalable Architecture

Simplest Scalable Architecture Simplest Scalable Architecture NOW Network Of Workstations Many types of Clusters (form HP s Dr. Bruce J. Walker) High Performance Clusters Beowulf; 1000 nodes; parallel programs; MPI Load-leveling Clusters

More information

Hadoop Distributed File System. T-111.5550 Seminar On Multimedia 2009-11-11 Eero Kurkela

Hadoop Distributed File System. T-111.5550 Seminar On Multimedia 2009-11-11 Eero Kurkela Hadoop Distributed File System T-111.5550 Seminar On Multimedia 2009-11-11 Eero Kurkela Agenda Introduction Flesh and bones of HDFS Architecture Accessing data Data replication strategy Fault tolerance

More information

How To Manage File Access On Data Ontap On A Pc Or Mac Or Mac (For A Mac) On A Network (For Mac) With A Network Or Ipad (For An Ipad) On An Ipa (For Pc Or

How To Manage File Access On Data Ontap On A Pc Or Mac Or Mac (For A Mac) On A Network (For Mac) With A Network Or Ipad (For An Ipad) On An Ipa (For Pc Or Clustered Data ONTAP 8.3 File Access Management Guide for NFS NetApp, Inc. 495 East Java Drive Sunnyvale, CA 94089 U.S. Telephone: +1 (408) 822-6000 Fax: +1 (408) 822-4501 Support telephone: +1 (888) 463-8277

More information

Docker : devops, shared registries, HPC and emerging use cases. François Moreews & Olivier Sallou

Docker : devops, shared registries, HPC and emerging use cases. François Moreews & Olivier Sallou Docker : devops, shared registries, HPC and emerging use cases François Moreews & Olivier Sallou Presentation Docker is an open-source engine to easily create lightweight, portable, self-sufficient containers

More information

Automated Deployment of Oracle RAC Using Enterprise Manager Provisioning Pack

Automated Deployment of Oracle RAC Using Enterprise Manager Provisioning Pack Automated Deployment of Oracle RAC Using Enterprise Manager Provisioning Pack By Kai Yu As a part of the Oracle Enterprise Manager s lifecycle management solutions, the Oracle Enterprise Manager Provisioning

More information

Quick Introduction to HPSS at NERSC

Quick Introduction to HPSS at NERSC Quick Introduction to HPSS at NERSC Nick Balthaser NERSC Storage Systems Group nabalthaser@lbl.gov Joint Genome Institute, Walnut Creek, CA Feb 10, 2011 Agenda NERSC Archive Technologies Overview Use Cases

More information

The GENIUS Grid Portal

The GENIUS Grid Portal The GENIUS Grid Portal (*) work in collaboration with A. Falzone and A. Rodolico EGEE NA4 Workshop, Paris, 18.12.2003 CHEP 2000, 10.02.2000 Outline Introduction Grid portal architecture and requirements

More information

Implementing Network Attached Storage. Ken Fallon Bill Bullers Impactdata

Implementing Network Attached Storage. Ken Fallon Bill Bullers Impactdata Implementing Network Attached Storage Ken Fallon Bill Bullers Impactdata Abstract The Network Peripheral Adapter (NPA) is an intelligent controller and optimized file server that enables network-attached

More information

Detailed Outline of Hadoop. Brian Bockelman

Detailed Outline of Hadoop. Brian Bockelman Detailed Outline of Hadoop Brian Bockelman Outline of Hadoop Before we dive in to an installation, I wanted to survey the landscape. HDFS Core Services Grid services HDFS Aux Services Putting it all together

More information

Experiences with the GLUE information schema in the LCG/EGEE production Grid

Experiences with the GLUE information schema in the LCG/EGEE production Grid Experiences with the GLUE information schema in the LCG/EGEE production Grid Stephen Burke, Sergio Andreozzi and Laurence Field CHEP07, Victoria, Canada www.eu-egee.org EGEE and glite are registered trademarks

More information

Cloud Ready for Bioinformatics?

Cloud Ready for Bioinformatics? IDB acknowledges co-funding by the European Community's Seventh Framework Programme (INFSO-RI-261552) and the French National Research Agency's Arpege Programme (ANR-10-SEGI-001) Cloud Ready for Bioinformatics?

More information

MarkLogic Server. Connector for SharePoint Administrator s Guide. MarkLogic 8 February, 2015

MarkLogic Server. Connector for SharePoint Administrator s Guide. MarkLogic 8 February, 2015 Connector for SharePoint Administrator s Guide 1 MarkLogic 8 February, 2015 Last Revised: 8.0-1, February, 2015 Copyright 2015 MarkLogic Corporation. All rights reserved. Table of Contents Table of Contents

More information

DeployStudio Server Quick Install

DeployStudio Server Quick Install DeployStudio Server Quick Install v1.7.0 The DeployStudio Team info@deploystudio.com Requirements OS X 10.7.5 to 10.11.1 DeployStudioServer_v1.7.x.pkg and later NetBoot based deployment 100 Mb/s switched

More information

External Data Connector (EMC Networker)

External Data Connector (EMC Networker) Page 1 of 26 External Data Connector (EMC Networker) TABLE OF CONTENTS OVERVIEW SYSTEM REQUIREMENTS INSTALLATION (WINDOWS) INSTALLATION (UNIX) GETTING STARTED Perform a Discovery Perform a Migration ADVANCED

More information

Measurement of BeStMan Scalability

Measurement of BeStMan Scalability Measurement of BeStMan Scalability Haifeng Pi, Igor Sfiligoi, Frank Wuerthwein, Abhishek Rana University of California San Diego Tanya Levshina Fermi National Accelerator Laboratory Alexander Sim, Junmin

More information

Web Service Robust GridFTP

Web Service Robust GridFTP Web Service Robust GridFTP Sang Lim, Geoffrey Fox, Shrideep Pallickara and Marlon Pierce Community Grid Labs, Indiana University 501 N. Morton St. Suite 224 Bloomington, IN 47404 {sblim, gcf, spallick,

More information

Running a Workflow on a PowerCenter Grid

Running a Workflow on a PowerCenter Grid Running a Workflow on a PowerCenter Grid 2010-2014 Informatica Corporation. No part of this document may be reproduced or transmitted in any form, by any means (electronic, photocopying, recording or otherwise)

More information

CMS Tier-3 cluster at NISER. Dr. Tania Moulik

CMS Tier-3 cluster at NISER. Dr. Tania Moulik CMS Tier-3 cluster at NISER Dr. Tania Moulik What and why? Grid computing is a term referring to the combination of computer resources from multiple administrative domains to reach common goal. Grids tend

More information

www.mvatcybernet.com PRODUCT VERSION: LYNC SERVER 2010, LYNC SERVER 2013, WINDOWS SERVER 2008

www.mvatcybernet.com PRODUCT VERSION: LYNC SERVER 2010, LYNC SERVER 2013, WINDOWS SERVER 2008 PRODUCT VERSION: LYNC SERVER 2010, LYNC SERVER 2013, WINDOWS SERVER 2008 With Forefront Threat Management Gateway 2010 now discontinued, we sought a suitable reverse proxy solution that works with Lync

More information

A Web Services Data Analysis Grid *

A Web Services Data Analysis Grid * A Web Services Data Analysis Grid * William A. Watson III, Ian Bird, Jie Chen, Bryan Hess, Andy Kowalski, Ying Chen Thomas Jefferson National Accelerator Facility 12000 Jefferson Av, Newport News, VA 23606,

More information

dcache - Managed Storage - LCG Storage Element - HSM optimizer Patrick Fuhrmann, DESY for the dcache Team

dcache - Managed Storage - LCG Storage Element - HSM optimizer Patrick Fuhrmann, DESY for the dcache Team dcache - Managed Storage - LCG Storage Element - HSM optimizer, DESY for the dcache Team dcache is a joint effort between the Deutsches Elektronen Synchrotron (DESY) and the Fermi National Laboratory (FNAL)

More information

MS 20414 Implementing an Advanced Server Infrastructure

MS 20414 Implementing an Advanced Server Infrastructure MS 20414 Implementing an Advanced Server Infrastructure P a g e 1 of 10 About this Course In this course, students will learn how to plan and implement some of the more advanced features available in Windows

More information

Clustered Data ONTAP 8.2

Clustered Data ONTAP 8.2 Updated for 8.2.1 Clustered Data ONTAP 8.2 File Access Management Guide for NFS NetApp, Inc. 495 East Java Drive Sunnyvale, CA 94089 U.S. Telephone: +1 (408) 822-6000 Fax: +1 (408) 822-4501 Support telephone:

More information

v7.8.2 Release Notes for Websense Content Gateway

v7.8.2 Release Notes for Websense Content Gateway v7.8.2 Release Notes for Websense Content Gateway Topic 60086 Web Security Gateway and Gateway Anywhere 12-Mar-2014 These Release Notes are an introduction to Websense Content Gateway version 7.8.2. New

More information

LifeSize UVC Video Center Deployment Guide

LifeSize UVC Video Center Deployment Guide LifeSize UVC Video Center Deployment Guide November 2013 LifeSize UVC Video Center Deployment Guide 2 LifeSize UVC Video Center LifeSize UVC Video Center records and streams video sent by LifeSize video

More information

WebSphere Portal 8 Using GPFS file sharing in a Portal Farm. Test Configuration

WebSphere Portal 8 Using GPFS file sharing in a Portal Farm. Test Configuration WebSphere Portal 8 Using GPFS file sharing in a Portal Farm Test Configuration Owner: Mark E. Blondell Configuration name: Test Infrastructure: WebSphere Portal 8 Using GPFS file sharing in a Portal Farm

More information

Solution Brief: Creating Avid Project Archives

Solution Brief: Creating Avid Project Archives Solution Brief: Creating Avid Project Archives Marquis Project Parking running on a XenData Archive Server provides Fast and Reliable Archiving to LTO or Sony Optical Disc Archive Cartridges Summary Avid

More information

UNISOL SysAdmin. SysAdmin helps systems administrators manage their UNIX systems and networks more effectively.

UNISOL SysAdmin. SysAdmin helps systems administrators manage their UNIX systems and networks more effectively. 1. UNISOL SysAdmin Overview SysAdmin helps systems administrators manage their UNIX systems and networks more effectively. SysAdmin is a comprehensive system administration package which provides a secure

More information

CS 356 Lecture 25 and 26 Operating System Security. Spring 2013

CS 356 Lecture 25 and 26 Operating System Security. Spring 2013 CS 356 Lecture 25 and 26 Operating System Security Spring 2013 Review Chapter 1: Basic Concepts and Terminology Chapter 2: Basic Cryptographic Tools Chapter 3 User Authentication Chapter 4 Access Control

More information

OPAS Prerequisites. Prepared By: This document contains the prerequisites and requirements for setting up OPAS.

OPAS Prerequisites. Prepared By: This document contains the prerequisites and requirements for setting up OPAS. OPAS Prerequisites This document contains the prerequisites and requirements for setting up OPAS. Prepared By: Luke Swords Principal Consultant 24/06/2015 Version 1.0 Contact Information Infront Consulting

More information

Managed Storage @ GRID or why NFSv4.1 is not enough. Tigran Mkrtchyan for dcache Team

Managed Storage @ GRID or why NFSv4.1 is not enough. Tigran Mkrtchyan for dcache Team Managed Storage @ GRID or why NFSv4.1 is not enough Tigran Mkrtchyan for dcache Team What the hell do physicists do? Physicist are hackers they just want to know how things works. In moder physics given

More information

Spectrum Scale HDFS Transparency Guide

Spectrum Scale HDFS Transparency Guide Spectrum Scale Guide Spectrum Scale BDA 2016-1-5 Contents 1. Overview... 3 2. Supported Spectrum Scale storage mode... 4 2.1. Local Storage mode... 4 2.2. Shared Storage Mode... 4 3. Hadoop cluster planning...

More information

This document presents the new features available in ngklast release 4.4 and KServer 4.2.

This document presents the new features available in ngklast release 4.4 and KServer 4.2. This document presents the new features available in ngklast release 4.4 and KServer 4.2. 1) KLAST search engine optimization ngklast comes with an updated release of the KLAST sequence comparison tool.

More information

DSS. High performance storage pools for LHC. Data & Storage Services. Łukasz Janyst. on behalf of the CERN IT-DSS group

DSS. High performance storage pools for LHC. Data & Storage Services. Łukasz Janyst. on behalf of the CERN IT-DSS group DSS High performance storage pools for LHC Łukasz Janyst on behalf of the CERN IT-DSS group CERN IT Department CH-1211 Genève 23 Switzerland www.cern.ch/it Introduction The goal of EOS is to provide a

More information

ESX 4 Patch Management Guide ESX 4.0

ESX 4 Patch Management Guide ESX 4.0 ESX 4 Patch Management Guide ESX 4.0 This document supports the version of each product listed and supports all subsequent versions until the document is replaced by a new edition. To check for more recent

More information

Roadmap for Applying Hadoop Distributed File System in Scientific Grid Computing

Roadmap for Applying Hadoop Distributed File System in Scientific Grid Computing Roadmap for Applying Hadoop Distributed File System in Scientific Grid Computing Garhan Attebury 1, Andrew Baranovski 2, Ken Bloom 1, Brian Bockelman 1, Dorian Kcira 3, James Letts 4, Tanya Levshina 2,

More information

Distributed File Systems

Distributed File Systems Distributed File Systems Paul Krzyzanowski Rutgers University October 28, 2012 1 Introduction The classic network file systems we examined, NFS, CIFS, AFS, Coda, were designed as client-server applications.

More information

Workflow Templates Library

Workflow Templates Library Workflow s Library Table of Contents Intro... 2 Active Directory... 3 Application... 5 Cisco... 7 Database... 8 Excel Automation... 9 Files and Folders... 10 FTP Tasks... 13 Incident Management... 14 Security

More information

Deployment of BioXSDenabled services on a Cloud. christophe.blanchet@ibcp.fr

Deployment of BioXSDenabled services on a Cloud. christophe.blanchet@ibcp.fr Deployment of BioXSDenabled services on a Cloud Outline IBCP, provider of BioXSD-enabled services Cloud Computing RENABI GRISBI, French infrastructure Bioinformatics Integrated s gbio-pbil.ibcp.fr/ws GBIO

More information

Archival Storage At LANL Past, Present and Future

Archival Storage At LANL Past, Present and Future Archival Storage At LANL Past, Present and Future Danny Cook Los Alamos National Laboratory dpc@lanl.gov Salishan Conference on High Performance Computing April 24-27 2006 LA-UR-06-0977 Main points of

More information

Introduction to HDFS. Prasanth Kothuri, CERN

Introduction to HDFS. Prasanth Kothuri, CERN Prasanth Kothuri, CERN 2 What s HDFS HDFS is a distributed file system that is fault tolerant, scalable and extremely easy to expand. HDFS is the primary distributed storage for Hadoop applications. HDFS

More information

HDFS Under the Hood. Sanjay Radia. Sradia@yahoo-inc.com Grid Computing, Hadoop Yahoo Inc.

HDFS Under the Hood. Sanjay Radia. Sradia@yahoo-inc.com Grid Computing, Hadoop Yahoo Inc. HDFS Under the Hood Sanjay Radia Sradia@yahoo-inc.com Grid Computing, Hadoop Yahoo Inc. 1 Outline Overview of Hadoop, an open source project Design of HDFS On going work 2 Hadoop Hadoop provides a framework

More information

NIIT Education and Training, Doha, Qatar - www.niitqatar.com Contact: +974-44551796/1798; 50656051

NIIT Education and Training, Doha, Qatar - www.niitqatar.com Contact: +974-44551796/1798; 50656051 414: Implementing an Advanced Server Infrastructure Duration: 40 Hours Overview About this Course Get hands-on instruction and practice planning, designing and deploying a physical and logical Windows

More information

Michael Thomas, Dorian Kcira California Institute of Technology. CMS Offline & Computing Week

Michael Thomas, Dorian Kcira California Institute of Technology. CMS Offline & Computing Week Michael Thomas, Dorian Kcira California Institute of Technology CMS Offline & Computing Week San Diego, April 20-24 th 2009 Map-Reduce plus the HDFS filesystem implemented in java Map-Reduce is a highly

More information

A Uniform Job Monitoring Service in Multiple Job Universes

A Uniform Job Monitoring Service in Multiple Job Universes A Uniform Job Monitoring Service in Multiple Job Universes Aleš Křenek, Miroslav Ruda CESNET, EGEE JRA1 www.eu-egee.org Information Society Grid Monitoring Workshop, June 25, 2007 1 Outline Logging and

More information

Esqu Science Experiments For Computer Network

Esqu Science Experiments For Computer Network Vangelis Floros, GRNET S.A. 3 rd Open Source Software Conference March 22, 2008 NTUA, Athens Greece Introduction What is a Grid? What is escience? Large Scientific Grids The example of EGEE Building Grid

More information

Parallels Cloud Server 6.0

Parallels Cloud Server 6.0 Parallels Cloud Server 6.0 Getting Started Guide March 17, 2015 Copyright 1999-2015 Parallels IP Holdings GmbH and its affiliates. All rights reserved. Parallels IP Holdings GmbH Vordergasse 59 8200 Schaffhausen

More information

Computer Networks 1 (Mạng Máy Tính 1) Lectured by: Dr. Phạm Trần Vũ MEng. Nguyễn CaoĐạt

Computer Networks 1 (Mạng Máy Tính 1) Lectured by: Dr. Phạm Trần Vũ MEng. Nguyễn CaoĐạt Computer Networks 1 (Mạng Máy Tính 1) Lectured by: Dr. Phạm Trần Vũ MEng. Nguyễn CaoĐạt 1 Lecture 10: Application Layer 2 Application Layer Where our applications are running Using services provided by

More information

Technical. Overview. ~ a ~ irods version 4.x

Technical. Overview. ~ a ~ irods version 4.x Technical Overview ~ a ~ irods version 4.x The integrated Ru e-oriented DATA System irods is open-source, data management software that lets users: access, manage, and share data across any type or number

More information

MEGA Web Application Architecture Overview MEGA 2009 SP4

MEGA Web Application Architecture Overview MEGA 2009 SP4 Revised: September 2, 2010 Created: March 31, 2010 Author: Jérôme Horber CONTENTS Summary This document describes the system requirements and possible deployment architectures for MEGA Web Application.

More information

Volume SYSLOG JUNCTION. User s Guide. User s Guide

Volume SYSLOG JUNCTION. User s Guide. User s Guide Volume 1 SYSLOG JUNCTION User s Guide User s Guide SYSLOG JUNCTION USER S GUIDE Introduction I n simple terms, Syslog junction is a log viewer with graphing capabilities. It can receive syslog messages

More information

ACE Management Server Deployment Guide VMware ACE 2.0

ACE Management Server Deployment Guide VMware ACE 2.0 Technical Note ACE Management Server Deployment Guide VMware ACE 2.0 This technical note provides guidelines for the deployment of VMware ACE Management Servers, including capacity planning and best practices.

More information

Mac OS X Directory Services

Mac OS X Directory Services Mac OS X Directory Services Agenda Open Directory Mac OS X client access Directory services in Mac OS X Server Redundancy and replication Mac OS X access to other directory services Active Directory support

More information

Phire Architect Hardware and Software Requirements

Phire Architect Hardware and Software Requirements Phire Architect Hardware and Software Requirements Copyright 2014, Phire. All rights reserved. The Programs (which include both the software and documentation) contain proprietary information; they are

More information