From Research To Transfer: User-Controllable Cloud Storage

Similar documents
Safe File Storage and Databases

FlexCloud: Reliable and Secure Cloud Overlay Infrastructures

peacs - Performance and Efficiency Analysis for Cloud Storage

How to Store Data to the Cloud without giving the Cloud anything Jason Resch Cleversafe

Information Dispersion over Redundant Arrays of Optimal Cloud Storage for Desktop Users

MPSTOR Technology Overview

Orchestration of Distributed Storage Targets through Storage Flows

Flexible Service Ecosystems The serviceplatform.org perspective. Josef Spillner

DepSky Dependable and Secure Storage in a Cloud-of-Clouds Alysson Bessani, Miguel Correia, Bruno Quaresma, Fernando André, Paulo Sousa

Secure Cloud Computing with FlexCloud

IMPLEMENTING DATA SECURITY IN MULTI CLOUD

High Performance Computing OpenStack Options. September 22, 2015

The Pros and Cons of Erasure Coding & Replication vs. RAID in Next-Gen Storage Platforms. Abhijith Shenoy Engineer, Hedvig Inc.

RAIC Integration for Network Storages on Mobile Devices

Building low cost disk storage with Ceph and OpenStack Swift

Future Generation Computer Systems

Gladinet Cloud Backup V3.0 User Guide

High Performance Computing. Course Notes High Performance Storage

Cloud Storage. Parallels. Performance Benchmark Results. White Paper.

How To Use Attix5 Pro For A Fraction Of The Cost Of A Backup

Introduction to Gluster. Versions 3.0.x

EMC AVAMAR. Deduplication backup software and system. Copyright 2012 EMC Corporation. All rights reserved.

Trends in Application Recovery. Andreas Schwegmann, HP

Exercise 2 : checksums, RAID and erasure coding

owncloud Architecture Overview

CERNBox + EOS: Cloud Storage for Science

owncloud Architecture Overview

Cloud Gateway. Agenda. Cloud concepts Gateway concepts My work. Monica Stebbins

IBM TSM DISASTER RECOVERY BEST PRACTICES WITH EMC DATA DOMAIN DEDUPLICATION STORAGE

Veeam Backup and Replication Architecture and Deployment. Nelson Simao Systems Engineer

Course 20533: Implementing Microsoft Azure Infrastructure Solutions

Trust in your own cloud

HP StorageWorks Automated Storage Manager User Guide

CONFIGURATION GUIDELINES: EMC STORAGE FOR PHYSICAL SECURITY

Cloud File System Gateway & Cloud Data Management Interface (CDMI)

DeployStudio Server Quick Install

Providing an Enterprise File Share and Sync Solution for

Designing a Cloud Storage System

Module: Business Continuity

LEARNING SOLUTIONS website milner.com/learning phone

STRATEGIC PLANNING ASSUMPTION(S)

Implementing Microsoft Azure Infrastructure Solutions

Building Storage Service in a Private Cloud

Data Storage Solution Using PL-Grid UNICORE Infrastructure

How to use Cloud Solutions by Swisscom for Disaster Recovery. Whitepaper. Fabian Haldimann Stefan Lengacher Thomas Gfeller

ntier Verde Simply Affordable File Storage

VMware Virtual SAN Backup Using VMware vsphere Data Protection Advanced SEPTEMBER 2014

Red Hat Storage Server

A File Storage Service on a Cloud Computing Environment for Digital Libraries

BlueArc unified network storage systems 7th TF-Storage Meeting. Scale Bigger, Store Smarter, Accelerate Everything

WHITE PAPER. Software Defined Storage Hydrates the Cloud

Service Overview CloudCare Online Backup

TERENA Trusted Cloud Drive

WHITEPAPER: Understanding Pillar Axiom Data Protection Options

2011 FileTek, Inc. All rights reserved. 1 QUESTION

Reliability and Fault Tolerance in Storage

How To Manage Cloud Service Provisioning And Maintenance

Practical Data Integrity Protection in Network-Coded Cloud Storage

Storing Data: Disks and Files

Cloud Attached Storage 5.0

Leveraging BlobSeer to boost up the deployment and execution of Hadoop applications in Nimbus cloud environments on Grid 5000

Best Practices for Deploying Citrix XenDesktop on NexentaStor Open Storage

High Availability Databases based on Oracle 10g RAC on Linux

SoftLayer Fundamentals. Storage and Backup. August, 2014

Secure Framework for Data Storage from Single to Multi clouds in Cloud Networking

Implementing a Digital Video Archive Using XenData Software and a Spectra Logic Archive

Openstack. Cloud computing with Openstack. Saverio Proto

Unitrends Virtual Backup Installation Guide Version 8.0

Protecting the Microsoft Data Center with NetBackup 7.6

Enterprise Backup and Restore technology and solutions

Long term retention and archiving the challenges and the solution

Cloud Storage and Backup

Comprehending the Tradeoffs between Deploying Oracle Database on RAID 5 and RAID 10 Storage Configurations. Database Solutions Engineering

Large Scale Storage. Orlando Richards, Information Services LCFG Users Day, University of Edinburgh 18 th January 2013

The safer, easier way to help you pass any IT exams. Exam : E Backup Recovery - Avamar Expert Exam for Implementation Engineers.

Feature and Technical

RAID Storage, Network File Systems, and DropBox

Object Oriented Storage and the End of File-Level Restores

UNDERSTANDING DATA DEDUPLICATION. Thomas Rivera SEPATON

Aspera Direct-to-Cloud Storage WHITE PAPER

Get Success in Passing Your Certification Exam at first attempt!

Solution for private cloud computing

CERN Cloud Storage Evaluation Geoffray Adde, Dirk Duellmann, Maitane Zotes CERN IT

Service Level Agreement (SLA) Arcplace Backup Enterprise Service

How To Encrypt Data With A Power Of N On A K Disk

e Number: Passing Score: 800 Time Limit: 120 min File Version: 1.0

A Distributed Storage Architecture based on a Hybrid Cloud Deployment Model

ESG Lab Review Symantec NetBackup 5230 Appliance Author: Vinny Choinski, Senior Lab Analyst, and Kerry Dolan, Lab Analyst.

Implementing Microsoft Azure Infrastructure Solutions 20533B; 5 Days, Instructor-led

A Secure And Reliable Storage Above Public Clouds Maxim Schnjakin

Course 20533B: Implementing Microsoft Azure Infrastructure Solutions

Testing ARES on the GTS framework: lesson learned and open issues. Mauro Femminella University of Perugia

Remote/Branch Office IT Consolidation with Lenovo S2200 SAN and Microsoft Hyper-V

How To Backup With Ec Avamar

Getting performance & scalability on standard platforms, the Object vs Block storage debate. Copyright 2013 MPSTOR LTD. All rights reserved.

Peer-to-peer Cooperative Backup System

RAID Overview

Deep Dive on SimpliVity s OmniStack A Technical Whitepaper

Prototyping a file sharing and synchronisation platform with owncloud

ADVANCED DEDUPLICATION CONCEPTS. Larry Freeman, NetApp Inc Tom Pearce, Four-Colour IT Solutions

Transcription:

Department of Computer Science Institute of Systems Architecture Chair of Computer Networks From Research To Transfer: User-Controllable Cloud Storage Josef Spillner mailto:josef.spillner@tu-dresden.de xmpp:josef.spillner@jabber.org 14th TERENA TF-Storage Meeting February 10-11, 2014, Zurich, Switzerland

The Trouble with Cloud Storage proprietary client backup/sync vs. storage quality? trust? institutional utility billing web access only pricing: progressive/ degressive file size limit 3 months limit; tariff choice

Taming the Trouble: RAICs, RAOCs proxy cloud controller/ gateway storage targets: services, devices service properties service bundles integration scheduling optimisation auto-selection auto-configuration auto-repair extension backup sync sharing...

The Cloud Storage Lab @ TU Dresden Short history: http://lab.nubisave.org/ research prototypical software development 2010 ideas and service modelling 2011 NubiSave storage controller started UCC paper on Cloud storage controllers 2012 Usable software, Debian packaging 2013 NubiVis; π-box personal Cloud VM FGCS article on optimal storage SUCRE summer school tutorial at KIT 2014 NubiGate VM experiments and pilot installations transfer widespread production deployments

Research

Storage Service Ecosystem Excursus: Service Science -> Bundles -> RAOC functionality super-domain Infrastructure domain 2 domain 1 domain Cloud Storage non-functional properties nfp 1 sub-domain File Storage nfp 2 consumer client SLAs protocol description service interface messages service instance service impl engineer service hosting provider nfp backup

Storage Service Descriptions Base Ontology (WSML) concept MeasurementUnit conversionfactor impliestype _double concept SpaceUnit subconceptof MeasurementUnit instance GB memberof SpaceUnit conversionfactor hasvalue 1024.0 Instance Ontology webservice GoogleStorage importsontology { _"urn:ontology:conqo/cloudqos.wsml#" } capability ServiceCapability postcondition definedby?servicetype memberof cloud#cloudstorage. instance PricePerData memberof { cloud#priceperdata, qos#servicespec } qos#value hasvalue 0.17 qos#unit hasvalue qos#euro

Storage Service Bundles $$ a Service bundle x replication (k+m) erasure coding s1 s2 s3 s4 s5 s1 secret sharing s6 s1 s2 s3 s4 s5 s6

Storage Service Markets

Storage Flows sink transport source Storage Targets Data Flow Integration USB stick local directory NFS export WebDAV Commercial (Dropbox) block modifier (compress) Splitter/ Multiplexer File modifier (enc,dedup) local directory block modifier (stego) CIFS proxy Splitter/ Multiplexer Splitter/ Multiplexer file modifier (enc) Splitter/ Multiplexer web application

Reconfigurable Storage Flows... alias Software-Defined Storage Modules (i.e. FUSE modules): - Splitter (1:n) - Modifiers (1:1) * encryption * compression *... - Transports * Directory * CIFS, NFS * WebDAV * SSHfs * S3fs *... * CloudFusion - SDS propagation

Multi-User Storage Gateways FlexDDPL: Flexible Data Distribution Policy Language scopes applied in contexts (e.g. user database) ~user!negation T:22:00 06:00 @group fragment mime:message/rfc822 rules applied to targets (e.g. storage provider) store control adapt

Feature: Algorithms Selection Background secret sharing forward error correction / replication Striping ( RAID0) capacity 1.0, safety 1.0, performance 2.0 Mirroring ( RAID1) capacity 0.5, safety 2.0, performance 1.0 XOR parity ( RAID5, RAID6) 4 disks: capacity 0.75, safety 1.33, performance 1.0 Erasure coding ( RAIDn) n storage targets, k significant, m redundant: capacity k/m, safety 1+m/k (Rotated) Reed-Solomon: maximum-[hamming-]distance separable (MDS) property Cauchy-Reed-Solomon, Vandermonde-Reed-Solomon: practically secure; CRS faster due to XOR operations compared to GF(2w) AONT: information-theoretically secure AONT-RS: blend of AONT with (C)RS; alternative: encrypted fragments

Feature: Nested Contexts Configuration Data context»normal«context»private«

Feature: Weights Assigning weights to the storage targets to fully utilise the capacity to exploit faster upload connections

Feature: Recursion Saving the database which contains all file and fragment metadata... to avoid single point of failure to allow for selective sharing

Feature: Streaming (1:x) Continuous operation during read or write of large file buffers on controller device can be kept small improved performance through parallel coding and transmission

Feature: Batching (x:1) Encoding speed of different batch sizes 15 nodes 100000 90000 80000 Encoding [MBs] 70000 M=0 M=4 M=8 M=12 60000 50000 40000 30000 20000 10000 0 0 1000 2000 3000 4000 5000 Filechunk size [KB] 6000 7000 8000 9000

Experiments Nubisave Read Performance - 1MB File - UseAllInParallel Read in KB/s 13000 12000 11000 10000 9000 8000 7000 6000 5000 14000 12000 10000 8000 6000 4000 0 10 20 30 40 50 60 Redundancy 70 peacs: Performance and Efficiency Analsis for Cloud Storage Experiment Week Feb 7-14 2014 3 80 90 100 10 9 8 7 6 4 5 Storages 2 1

Autonomous Storage Clients Scenario: A robot wants to store some data... Labcast available: http://www.youtube.com/ watch?v=ndln0fg9okk

Autonomous Storage Clients Scenario: Pervasive storage on all devices Credentials may be auto generated

Software

Comparison of Storage Controllers Prototype Origin Authors Approach Integration Public Availability Dependable Sky ('11) T-Clouds, Univ. of Lisbon Bessani, Correia, Quaresma, André, Sousa data units, byzantine fail, Rabin erasure application library no ('11) HPI, Potsdam Maxim Schnjakin, Christoph Meinel NFPs, propr. platform, Liberation erasure web interface no TrustedSafe ('10) egovcd, Berlin/ FhG FOKUS (no publication) n fragments: one on USB, others remote Windows plugin no Cloud Shredder ('11) Chinese Academy of Sciences Nan Zang, Jiwu Jing, Peng Liu Two fragments: one local, one remote (not yet known) no SecCSIE ('11) T-Systems MMS, Dresden Ronny Seiger, Stephan Groß Cachy ReedSolomon erasure CIFS proxy no RACS ('10) Cornell University Abu-Libdeh, Princehouse, Weatherspoon erasure coding: Zfec ReedSolomon S3 proxy, can be distributed w/ Zookeeper yes (www.cs.cornell.edu) since 2012, community projects: plus numerous commercial implementations: NetApp StorageGRID, Cleversafe, Trusted Safe etc. Tahoe-LAFS externals, TCD, OpenStack/Intel - PyEC2Lib

OpenStack Intermezzo

NubiSave Cloud Storage Controller Installation of NubiSave Git repository Debian packages git://nubisave.org/git/nubisave http://nubisave.org/packages/ Integrated into π-box or NubiGate VMs http://nubisave.org/downloads/

NubiSave Storage Flow Editor

NubiSave Configuration

NubiSave Configuration

NubiSave Tools Controller operation Storages and database $ nubisave [<instance>] $ nubisave-status Master script which starts both of the below combined. Lists all splitter instances and storage targets attached to them or as part of any storage flow. $ nubisave headless Starts a new instance of the splitter/dispersion file system. $ nubisave gui Starts the storage flow editor with storage integration configuration. $ nubisave-mounter [<module>] Mounts all splitters and/or storage targets. $ nubisave-unmounter [<module>] Unmounts; inverse of the above. $ nubisave stop $ nubisave-database [...] Stops the splitter. Inspection of storage metadata.

Splitter-NG Framework Plugins: Jerasure, JSharing, RAID-1, more coming...

Transport Module: CloudFusion

Additional Implementations

NubiVis: Distributed Data Visualisation

NubiVis Map View

NubiGate VM: Everything in one Box

NubiGate VM: OwnCloud Frontend Our instance RN-Dropbox 141.76.41.232/owncloud Your instance...? 141.76.41.206 141.76.41.207

Future Vision

Future Work What's next in our lab Software improvements Configuration through the web interface (student thesis) Git-based service registry query Research: MapReduce-like processing over dispersed data Scaling large with more users and storage providers, HA setups etc. Federation between research institutions: reciprocative/collaborative storage resource sharing <-- role of NRENs

Publications [SS13] Josef Spillner, Alexander Schill: Orchestration of Distributed Storage Targets through Storage Flows. 5th IEEE International Conference on Cloud Computing Technology and Science (CloudCom), Bristol, UK, December 2013. [SQ+13] Josef Spillner, Maximilian Quellmalz, Martin Friedrich, Alexander Schill: peacs - Performance and Efficiency Analysis for Cloud Storage. Workshop on Cloud Storage Optimisation (CLOUSO) @ 2nd ESOCC, Málaga, Spain, September 2013. [SMS13] Josef Spillner, Johannes Müller, Alexander Schill: Creating Optimal Cloud Storage Systems. Elsevier Future Generation Computer Systems (FGCS), Issue 29(4), p. 1062-1072, June 2013. DOI 10.1016/j.future.2012.06.004. [SS12] Josef Spillner, Alexander Schill: Flexible Data Distribution Policy Language and Gateway Architecture. 1st IEEE Latin American Conference on Cloud Computing and Communications (LatinCloud), Porto Alegre, Rio Grande do Sul, Brazil, November 2012. [SP+12] Josef Spillner, Christian Piechnick, Claas Wilke, Uwe Aßmann, Alexander Schill: Autonomous Participation in Cloud Services. 2nd International Workshop on Intelligent Techniques and Architectures for Autonomic Clouds (ITAAC) @ 5th IEEE/ACM UCC, Chicago, Illinois, USA, November 2012. [SG+11] Josef Spillner, Gerd Bombach, Steffen Matthischke, Johannes Müller, Rico Tzschichholz, Alexander Schill: Information Dispersion over Redundant Arrays of Optimal Cloud Storage for Desktop Users. 4th IEEE/ACM International Conference on Utility and Cloud Computing (UCC), Melbourne, Australia, December 2011. [Best Paper Award]