Distributed Data Storage Based on Web Access and IBP Infrastructure. Faculty of Informatics Masaryk University Brno, The Czech Republic



Similar documents
Grid Video Processing

Scalable Sharing of Wide Area Storage Resources

Distributed Data Storage with Data Versioning

Large Data Visualization using Shared Distributed Resources

Distributed File System. MCSN N. Tonellotto Complements of Distributed Enabling Platforms

CERN Cloud Storage Evaluation Geoffray Adde, Dirk Duellmann, Maitane Zotes CERN IT

Performance and scalability of a large OLTP workload

Configuration Maximums

Network Attached Storage. Jinfeng Yang Oct/19/2015

Sage CRM Technical Specification

Hadoop Distributed File System. T Seminar On Multimedia Eero Kurkela

Panasas at the RCF. Fall 2005 Robert Petkus RHIC/USATLAS Computing Facility Brookhaven National Laboratory. Robert Petkus Panasas at the RCF

Enabling Technologies for Distributed Computing

Client-aware Cloud Storage

Aspera Direct-to-Cloud Storage WHITE PAPER

Configuration Maximums VMware Infrastructure 3

Optimizing Large Arrays with StoneFly Storage Concentrators

Using Linux Clusters as VoD Servers

Hyper-V over SMB Remote File Storage support in Windows Server 8 Hyper-V. Jose Barreto Principal Program Manager Microsoft Corporation

Ignify ecommerce. Item Requirements Notes

Ahsay Online Backup Suite v5.0. Whitepaper Backup speed

Enabling Technologies for Distributed and Cloud Computing

Hadoop & its Usage at Facebook

IT of SPIM Data Storage and Compression. EMBO Course - August 27th! Jeff Oegema, Peter Steinbach, Oscar Gonzalez

The Google File System

Storage Solutions For the DIY-types

Microsoft SQL Server 2005 on Windows Server 2003

Microsoft Azure. IaaS Networking Storage. Stefan Geiger Gerry

COSC 6374 Parallel Computation. Parallel I/O (I) I/O basics. Concept of a clusters

Reference Architecture. EMC Global Solutions. 42 South Street Hopkinton MA

Open Text Archive Server and Microsoft Windows Azure Storage

High Availability Databases based on Oracle 10g RAC on Linux

NERSC File Systems and How to Use Them

Using Linux Clusters as VoD Servers

Parallels Cloud Storage

Configuration Maximums

Zadara Storage Cloud A

Hyper-V over SMB: Remote File Storage Support in Windows Server 2012 Hyper-V. Jose Barreto Principal Program Manager Microsoft Corporation

Fluke Networks NetFlow Tracker

New Storage System Solutions

Servers, Clients. Displaying max. 60 cameras at the same time Recording max. 80 cameras Server-side VCA Desktop or rackmount form factor

KFUPM Enterprise Network. Sadiq M. Sait

Informatica Data Director Performance

Best Practices for Data Sharing in a Grid Distributed SAS Environment. Updated July 2010

How to Choose your Red Hat Enterprise Linux Filesystem

High Performance Computing OpenStack Options. September 22, 2015

Release 8.2 Hardware and Software Requirements. PowerSchool Student Information System

When EP terminates the use of Hosting CC OG, EP is required to erase the content of CC OG application at its own cost.

COSC 6374 Parallel Computation. Parallel I/O (I) I/O basics. Concept of a clusters

HP Intelligent Management Center User Access Management Software

SPECTRUM Network Operations Center (NOC) Manager

WHITE PAPER. How To Build a SAN. The Essential Guide for Turning Your Windows Server Into Shared Storage on Your IP Network

Hadoop Distributed File System. Dhruba Borthakur Apache Hadoop Project Management Committee

Cloud Storage. Parallels. Performance Benchmark Results. White Paper.

Bosch Video Management System High Availability with Hyper-V

Patriot Hardware and Systems Software Requirements

Minimum Hardware Configurations for EMC Documentum Archive Services for SAP Practical Sizing Guide

Disaster Recovery Checklist Disaster Recovery Plan for <System One>

WanVelocity. WAN Optimization & Acceleration


Hadoop & its Usage at Facebook

Quantum StorNext. Product Brief: Distributed LAN Client

Implementing a Digital Video Archive Based on XenData Software

Large Scale Storage. Orlando Richards, Information Services LCFG Users Day, University of Edinburgh 18 th January 2013

Redundancy in enterprise storage networks using dual-domain SAS configurations

Milestone Solution Partner IT Infrastructure Components Certification Summary

Softline VIP Payroll System Requirements v2.9a January 2010

Reference Design: Scalable Object Storage with Seagate Kinetic, Supermicro, and SwiftStack

Ultra-Scalable Storage Provides Low Cost Virtualization Solutions

Agenda. Enterprise Application Performance Factors. Current form of Enterprise Applications. Factors to Application Performance.

Data Movement and Storage. Drew Dolgert and previous contributors

LS-DYNA Best-Practices: Networking, MPI and Parallel File System Effect on LS-DYNA Performance

HP reference configuration for entry-level SAS Grid Manager solutions

Violin: A Framework for Extensible Block-level Storage

CloudSync Mobile Device Management Technical Overview

POWER ALL GLOBAL FILE SYSTEM (PGFS)

DSS. High performance storage pools for LHC. Data & Storage Services. Łukasz Janyst. on behalf of the CERN IT-DSS group

Scala Storage Scale-Out Clustered Storage White Paper

THE EXPAND PARALLEL FILE SYSTEM A FILE SYSTEM FOR CLUSTER AND GRID COMPUTING. José Daniel García Sánchez ARCOS Group University Carlos III of Madrid

Volume Replication INSTALATION GUIDE. Open-E Data Storage Server (DSS )

Linux Powered Storage:

Server and Storage Virtualization. Virtualization. Overview. 5 Reasons to Virtualize

<Insert Picture Here> Btrfs Filesystem

Lecture 5: GFS & HDFS! Claudia Hauff (Web Information Systems)! ti2736b-ewi@tudelft.nl

BookKeeper overview. Table of contents

SAN RFP Questions and Answers

ZFS Backup Platform. ZFS Backup Platform. Senior Systems Analyst TalkTalk Group. Robert Milkowski.

Architecting a High Performance Storage System

IT Business Management System Requirements Guide

RSA Security Analytics Virtual Appliance Setup Guide

StorPool Distributed Storage Software Technical Overview

Performance, Reliability, and Operational Issues for High Performance NAS Storage on Cray Platforms. Cray User Group Meeting June 2007

Transcription:

Distributed Data Storage Based on Web Access and IBP Infrastructure Lukáš Hejtmánek Faculty of Informatics Masaryk University Brno, The Czech Republic

Summary New web based distributed data storage infrastructure built on the top of an IBP (Internet Backplane Protocol) Model overview distributed I-Node concept and directory services Security aspects Prototype implementation overview Performance tests Pilot groups

Pilot groups Multimedia applications read performance read and write access, no rewriting

Model Overview I-Nodes IBP byte array XML serialization (exnode) End-to-End services (encryption, checksums, compression) Stored on AFS volume one AFS server shared by all IBP depots

Model Overview Directory services Collection of I-Nodes ACL based security CVS-like versioning (no rewrite in IBP)

Model Overview Web based directory service upload download remove rename attributes Metadata permission XML Meta data FILE Data Authorization Service AFS storage IBP depots Web browser lors tools user

Internet Backplane Protocol developed at University of Tennessee Best effort block storage service Time limited storage Supported operations: read, append, manage Security based on capabilities User space solution

Network Storage Stack Applications LoRS Logistical Runtime System L-Bone exnode IBP Internet Backplane Protocol exnode Metadata collection of capabilities of allocated IBP byte arrays essential for file access IBP I-Nodes

exnode capabilities <exnode:metadata name="exnode_offset" type="integer">0</exnode:metadata> <exnode:metadata name="logical_length" type="integer">1347975</exnode:metadata> <exnode:read>ibp://test5.fi.muni.cz:6714/ibp-896306328... 7028/798203447/READ</exnode:read> <exnode:write>ibp://test5.fi.muni.cz:6714/ibp-896306322... 7028/679207605/WRITE</exnode:write> <exnode:manage>ibp://test5.fi.muni.cz:6714/ibp-896306320... 8870/1089394976/MANAGE</exnode:manage>

exnode End-to-End services <exnode:function name="aes_encrypt"> <exnode:argument name="key" type="string">a0209b203920e9206e20e820c5208920 </exnode:argument> <exnode:argument name="blocksize" type="integer">1048592 </exnode:argument> </exnode:function>

Security Aspects Evil Internet assumption Insecure low level media Different from local storage HTTPS connections into Web infrastructure IPsec tunnels among Web servers and IBP depots AES/DES encryption of IBP byte arrays Security model available to all users

Prototype Implementation Overview User AFS Web Servers lors tools IBP IBP IBP I Nodes Directory Service L Bone IBP depot pool

Prototype Implementation Overview User interface Web based file manipulation Upload and Download Permission changes IBP related changes duration, replication level

Prototype Implementation Overview Software implementation IBP depots modified L-Bone server (load balancing) Web server with mod backhand IPsec tunnels lors tools

IPsec HTTPS User Web Servers IPsec lors tools IPsec AFS IBP IBP IBP L Bone IBP depot pool

Prototype Implementation Overview IBP access library libxio provides standard UNIX I/O interface open(2) read(2) write(2) lseek(2) stat(2), fstat(2), lstat(2) close(2)

Crash scenarios AFS server crash no files available through web access all files available through lors tools if user has XML metadata L-Bone server crash all files available for read, writing not possible IBP server crash only particular files unavailable for read, writing possible

Performance tests Testbed 3 IBP depots and Web servers running on Intel Pentium 4 with 1GB RAM internal hardware RAID 5 SCSI disk array internal hardware RAID 5 PATA disk array external hardware RAID 5 PATA disk array 5TB disk capacity 1Gb ethernet connection up to 32 CPUs from Meta Center cluster as clients focus on read performance

Brno Cl. #2 Cl. #1 Cl. #3 Cl. #4 1Gb 1Gb 1Gb 1Gb router 400Mb Gb Switch 1Gb 1Gb WAN 1Gb Gb Switch 1Gb Prague

800 700 Aggregate bandwidth [Mbps] 600 500 400 300 200 100 0 2 4 8 16 32 64 128 Number of clients

Pilot groups Lecture recording at Faculty of Informatics, Masaryk University National Library in Prague storage of large electronic archive of books, maps, etc. National Library in Prague storage of MrSID files Offer storage capacities to all the Czech academic community

Future work Integrating web access with Linux filesystem via LUFS project Generic server interface (i.e. processing video file formats) Enhance reduced semantic to be close to distributed filesystem like AFS

Conclusions Scalable distributed infrastructure based on web access and IBP Suitable for number of clients Large capacity Easy access