ViewBox: Integrating Local File System with Cloud Storage Service




FAST 2014
Yupu Zhang+, Chris Dragga+*, Andrea Arpaci-Dusseau+, Remzi Arpaci-Dusseau+
+University of Wisconsin-Madison

Outline
  Introduction
  Motivation
  Design and Implementation
  Evaluation
  Conclusion

Introduction
  Cloud-based file synchronization services have become enormously popular in recent years
    Numerous providers: Dropbox, Google Drive, SkyDrive
    Large user base: Dropbox has more than 100 million users
  Promising benefits
    Reliable backup in the cloud
    Automatic synchronization across clients/devices

Motivation: Data Corruption
  Corrupt data is uploaded from the local machine to the cloud
  The corruption is then propagated to other devices/clients

Data Corruption: Experiment
  Inject corruption into a synchronized file on disk by flipping bits through the device file of the underlying disk
  Execute both data operations and metadata-only operations on the corrupt file
  Check whether the corruption is propagated
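The bit-flip injection described above can be illustrated with a minimal sketch. It works on an in-memory byte string rather than on a disk device file, and the helper name `flip_bit` is an assumption made for illustration, not part of the original experiment. It shows why a monitor that does not inspect content cannot notice this kind of change, while a content checksum can.

```python
import zlib


def flip_bit(data: bytes, bit_index: int) -> bytes:
    """Return a copy of data with one bit inverted (simulated silent corruption)."""
    buf = bytearray(data)
    buf[bit_index // 8] ^= 1 << (bit_index % 8)
    return bytes(buf)


original = b"synchronized file contents"
corrupted = flip_bit(original, 3)

# The length is unchanged, so a size-based check sees nothing unusual.
same_length = len(original) == len(corrupted)

# A content checksum differs, which is the signal a checksumming
# file system can use to detect the corruption locally.
checksum_differs = zlib.crc32(original) != zlib.crc32(corrupted)
```

Flipping the same bit a second time restores the original bytes, which makes the helper convenient for repeatable tests.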

Data Corruption: Experiment (results)
  L: corruption remains local
  LG: corruption is propagated (global)
  Because ZFS detects local corruption, none of the synchronization clients propagates corruption on ZFS

Data Corruption: Lessons
  Where do synchronization services fail?
    They rely on file-level monitoring mechanisms, e.g., inotify
    They cannot distinguish legitimate changes from corruption
  Where do file systems fail?
    Many file systems do not checksum data

Motivation: Crash Inconsistency
  Crash inconsistency
  Out-of-sync synchronization

Crash Inconsistency: Experiment
  A file is synchronized at V0 on disk and in the cloud
  Update the file from V0 to V1
  Inject a crash and observe the sync client's behavior

Crash Inconsistency: Experiment (results)
  OOS: out-of-sync
  Services on ext4 (ordered) produce erratic and inconsistent behavior
  All three services behave correctly on ZFS and on ext4 with data journaling

Crash Inconsistency: Lessons
  Where do synchronization services fail?
    They depend on their own metadata tracking
    That metadata becomes inconsistent with file system metadata after a crash
  Where do file systems fail?
    Metadata journaling cannot provide data consistency

Motivation: Causal Inconsistency
  Files are uploaded out of order
  Cloud state does not match a valid FS state

Causal Inconsistency: Lessons
  Where do synchronization services fail?
    They synchronize files out of order
  Where do file systems fail?
    They offer no efficient mechanism to provide a static and consistent view to sync services

Summary
  The sense of safety provided by synchronization services is largely illusory
  Both file systems and sync services are responsible for these failures
    Many file systems lack strong reliability mechanisms: file system state ≠ correct state
    What sync clients see differs from what local file systems see: cloud state ≠ file system state

Design and Implementation: ViewBox
  Based on ext4, Dropbox, and Seafile
  Goals
    Integrity
    Consistency
    Recoverability
    Performance

ViewBox Overview
  Local detection (Ext4-cksum)
    No corruption/inconsistency is spread
  View-based synchronization (View Manager)
    Present the file system's view to the sync service
    Basis for consistency and correct recovery
  Cloud-aided recovery (Cloud Helper)
    Restore the file system to a correct state upon failure

ViewBox Architecture (figure)

Ext4-cksum: Local Detection
  Ext4-cksum stores data checksums in a fixed-size checksum region immediately after the inode table
  32-bit CRC checksum per 4KB block
  128KB checksum region for a 128MB block group
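The region size follows directly from the other two figures: a 128MB block group holds 32,768 blocks of 4KB, and 4 bytes of checksum per block gives 128KB. The sketch below checks that arithmetic and computes one checksum per block. It uses `zlib.crc32` as a stand-in for whatever 32-bit CRC ext4-cksum actually uses, and the function name `block_checksums` is an assumption for illustration.

```python
import zlib
from typing import List

BLOCK_SIZE = 4 * 1024            # 4KB data block
GROUP_SIZE = 128 * 1024 * 1024   # 128MB block group
CHECKSUM_SIZE = 4                # one 32-bit CRC per block, in bytes

blocks_per_group = GROUP_SIZE // BLOCK_SIZE        # 32768 blocks
region_bytes = blocks_per_group * CHECKSUM_SIZE    # 131072 bytes = 128KB


def block_checksums(data: bytes) -> List[int]:
    """Compute one CRC32 per 4KB block (stand-in for the on-disk checksum)."""
    return [
        zlib.crc32(data[offset:offset + BLOCK_SIZE])
        for offset in range(0, len(data), BLOCK_SIZE)
    ]
```

The space overhead is therefore 4 bytes per 4096 bytes of data, or roughly 0.1% of the block group.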

View Manager
  Creates file system views
  Uploads views to the cloud through the sync client
  Challenge 1: How to provide consistency?
  Challenge 2: How to create views efficiently?

How to Guarantee Consistency?
  Cloud journaling
    Treat cloud storage as an external journal
    Synchronize local changes to the cloud at FS epochs, i.e., when ext4-cksum performs a journal commit
  Three types of views
    Active view (local): current FS state
    Frozen view (local): last FS snapshot, kept in memory
    Synced views (on cloud): previously uploaded views
  Roll back to the latest synced view upon failure

Synchronizing Frozen Views
  Create a new frozen view once the previous frozen view has been synchronized and the FS reaches an epoch
  The state of a frozen view is always static
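The life cycle of the three view types can be sketched as a small state machine. This is an illustration under stated assumptions, not the ViewBox implementation: the class and method names are hypothetical, a view is modeled as a plain dictionary from path to contents, and the frozen view is made by copying, which the real system avoids.

```python
from typing import Dict, List, Optional

View = Dict[str, bytes]  # hypothetical model of a view: path -> file contents


class ViewManagerSketch:
    def __init__(self) -> None:
        self.active: View = {}               # current FS state
        self.frozen: Optional[View] = None   # last snapshot awaiting upload
        self.synced: List[View] = []         # views already on the cloud

    def write(self, path: str, data: bytes) -> None:
        """Local changes only ever touch the active view."""
        self.active[path] = data

    def on_epoch(self) -> bool:
        """At a journal commit, freeze only if no frozen view is pending."""
        if self.frozen is not None:
            return False
        self.frozen = dict(self.active)  # copy for simplicity in this sketch
        return True

    def on_upload_complete(self) -> None:
        """The uploaded frozen view becomes the latest synced view."""
        if self.frozen is not None:
            self.synced.append(self.frozen)
            self.frozen = None

    def roll_back(self) -> None:
        """On failure, return to the latest synced view (empty if none)."""
        self.active = dict(self.synced[-1]) if self.synced else {}
        self.frozen = None
```

Because writes go only to the active view, the frozen view stays static while it is being uploaded, which is the property the slide relies on.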

Multi-client Consistency
  (a) The client directly applies the changes in view 1 to its frozen view and propagates those changes to the active view.
  (b) The client downloads view 1 first, then merges the two views.

How to Efficiently Freeze a View?
  A frozen view is short-lived and kept only in memory
  Incremental snapshotting
    Dirty table: tracks which files and directories are modified in the active view
    Operation log: records all successful namespace operations (e.g., create, mkdir, unlink, rmdir, and rename) in the active view
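A minimal sketch of the two tracking structures follows. The class name, method names, and the choice to hand both structures over and start fresh ones at freeze time are assumptions made for illustration. The point it shows is that freezing only transfers bookkeeping, with no file data copied.

```python
from typing import FrozenSet, List, Set, Tuple

# Namespace operations named on the slide.
NAMESPACE_OPS = {"create", "mkdir", "unlink", "rmdir", "rename"}


class ActiveViewTracker:
    def __init__(self) -> None:
        self.dirty: Set[str] = set()             # dirty table
        self.op_log: List[Tuple[str, ...]] = []  # operation log

    def record_write(self, path: str) -> None:
        """Mark a file or directory as modified in the active view."""
        self.dirty.add(path)

    def record_namespace_op(self, op: str, *paths: str) -> None:
        """Append a successful namespace operation to the log."""
        if op not in NAMESPACE_OPS:
            raise ValueError(f"not a namespace operation: {op}")
        self.op_log.append((op, *paths))

    def freeze(self) -> Tuple[FrozenSet[str], Tuple[Tuple[str, ...], ...]]:
        """Hand the tracked changes to the frozen view and start empty ones."""
        snapshot = (frozenset(self.dirty), tuple(self.op_log))
        self.dirty = set()
        self.op_log = []
        return snapshot
```

A caller would invoke `freeze()` at an epoch and pass the returned dirty set and log to whatever uploads the frozen view.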

Incremental Snapshotting (figure)

Cloud Helper
  A user-level daemon
    Talks to the local FS through ioctl
    Communicates with the server through a web API
  Upon data corruption
    Fetches the correct block from the cloud
  After a crash, two types of recovery
    Recovers damaged files
    Rolls back the entire file system to the latest synced view
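The corruption path can be sketched as a checksum-verified read with a cloud fallback. Everything here is a simplified assumption: the disk, checksum region, and cloud are modeled as dictionaries keyed by block number, `zlib.crc32` stands in for the real checksum, and the extra verification of the cloud copy is a choice made in this sketch rather than something the slides state.

```python
import zlib
from typing import Dict


def read_block(
    block_no: int,
    disk: Dict[int, bytes],
    checksums: Dict[int, int],
    cloud: Dict[int, bytes],
) -> bytes:
    """Return a verified block, repairing the local copy from the cloud if needed."""
    data = disk[block_no]
    if zlib.crc32(data) == checksums[block_no]:
        return data  # local copy is intact

    # Local detection fired: fetch the block from the (hypothetical) cloud store.
    good = cloud[block_no]
    if zlib.crc32(good) != checksums[block_no]:
        raise IOError(f"block {block_no}: cloud copy does not match checksum")

    disk[block_no] = good  # repair the local copy in place
    return good
```

The corrupt data is never returned to the caller, so it cannot be picked up and uploaded by a sync client.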

Evaluation
  3.3GHz Intel Quad Core CPU, 16 GB memory
  1TB Hitachi hard drive
  Linux kernel 3.6.11 (64-bit)
  Dropbox client 1.6.0
  Seafile client and server 1.8.0

Cloud Helper (results)
  Data corruption: D = detected, R = recovered
  Crash consistency: Yes = occurred, No = did not occur

Ext4-cksum
  The performance overhead is minimal

View Manager
  As shown under the "After COW" column, the overhead is negligible because no data copying is performed.

View Manager (figure: frozen view with F1, F2; active view with F1, F2)

ViewBox with Dropbox and Seafile
  The workload runtime on ViewBox is at most 5% slower, and sometimes faster, than on the unmodified ext4 setup
  For iphoto view and iphoto edit, the synchronization time on ViewBox with Dropbox is much greater than on ext4, because Dropbox lacks proper interface support for views

Conclusion
  Problem: cloud storage services and file systems fail to protect data
    Many copies do NOT always make data safe
    cloud state ≠ file system state ≠ correct state
  Solution: ViewBox
    Enhance local file systems with data checksumming
    Present the file system's view to the sync service
    cloud state = file system state = correct state