Decentralized Deduplication in SAN Cluster File Systems

Similar documents
Decentralized Deduplication in SAN Cluster File Systems

Decentralized Deduplication in SAN Cluster File Systems

WHITE PAPER 1

Flash Storage Roles & Opportunities. L.A. Hoffman/Ed Delgado CIO & Senior Storage Engineer Goodwin Procter L.L.P.

Virtual server management: Top tips on managing storage in virtual server environments

Direct NFS - Design considerations for next-gen NAS appliances optimized for database workloads Akshay Shah Gurmeet Goindi Oracle

Nimble Storage for VMware View VDI

Increasing Storage Performance, Reducing Cost and Simplifying Management for VDI Deployments

Using VMware VMotion with Oracle Database and EMC CLARiiON Storage Systems

Optimize VDI with Server-Side Storage Acceleration

VMware View Design Guidelines. Russel Wilkinson, Enterprise Desktop Solutions Specialist, VMware

Storage Considerations for VMware View BEST PRACTICES

IOmark- VDI. Nimbus Data Gemini Test Report: VDI a Test Report Date: 6, September

DIABLO TECHNOLOGIES MEMORY CHANNEL STORAGE AND VMWARE VIRTUAL SAN : VDI ACCELERATION

GRIDCENTRIC VMS TECHNOLOGY VDI PERFORMANCE STUDY

Nimble Storage VDI Solution for VMware Horizon (with View)

FlashSoft Software from SanDisk : Accelerating Virtual Infrastructures

Low-Cost Data Deduplication for Virtual Machine Backup in Cloud Storage

Optimized Storage Solution for Enterprise Scale Hyper-V Deployments

Benchmarking Hadoop & HBase on Violin

NETAPP SYNCSORT INTEGRATED BACKUP. Technical Overview. Peter Eicher Syncsort Product Management

NetApp Data Compression and Deduplication Deployment and Implementation Guide

ALL-FLASH STORAGE ARRAY. A Hyper-Converged Infrastructure for High I/O Applications and Virtual Desktops

File System & Device Drive. Overview of Mass Storage Structure. Moving head Disk Mechanism. HDD Pictures 11/13/2014. CS341: Operating System

Nutanix Complete Cluster Reference Architecture for Virtual Desktop Infrastructure

Speeding Up Cloud/Server Applications Using Flash Memory

A Deduplication File System & Course Review

Deploying and Optimizing SQL Server for Virtual Machines

Journey to the All-Flash Data Center

Data Compression and Deduplication. LOC Cisco Systems, Inc. All rights reserved.

VDI Optimization Real World Learnings. Russ Fellows, Evaluator Group

Storage Sizing Issue of VDI System

WHITE PAPER Optimizing Virtual Platform Disk Performance

Chapter 12: Mass-Storage Systems

Virtualizing Performance-Critical Database Applications in VMware vsphere VMware vsphere 4.0 with ESX 4.0

HIGHLY AVAILABLE MULTI-DATA CENTER WINDOWS SERVER SOLUTIONS USING EMC VPLEX METRO AND SANBOLIC MELIO 2010

Cloud Optimize Your IT

Benchmarking Cassandra on Violin

TECHNICAL PAPER. Veeam Backup & Replication with Nimble Storage

Improve Business Productivity and User Experience with a SanDisk Powered SQL Server 2014 In-Memory OLTP Database

W H I T E P A P E R. Performance and Scalability of Microsoft SQL Server on VMware vsphere 4

The next step in Software-Defined Storage with Virtual SAN

IOmark- VDI. HP HP ConvergedSystem 242- HC StoreVirtual Test Report: VDI- HC b Test Report Date: 27, April

Flash Storage Optimizing Virtual Desktop Deployments

ORACLE 11g AND 12c DATABASE CONSOLIDATION AND WORKLOAD SCALABILITY WITH EMC XTREMIO 3.0

EMC XTREMIO EXECUTIVE OVERVIEW

idedup Latency-aware inline deduplication for primary workloads Kiran Srinivasan, Tim Bisson Garth Goodson, Kaladhar Voruganti

Maximizing VMware ESX Performance Through Defragmentation of Guest Systems. Presented by

WITH A FUSION POWERED SQL SERVER 2014 IN-MEMORY OLTP DATABASE

All Flash Arrays in Healthcare

VMware Software-Defined Storage & Virtual SAN 5.5.1

IOmark-VM. DotHill AssuredSAN Pro Test Report: VM a Test Report Date: 16, August

Configuring Apache Derby for Performance and Durability Olav Sandstå

Nutanix Tech Note. Configuration Best Practices for Nutanix Storage with VMware vsphere

Accelerating Applications and File Systems with Solid State Storage. Jacob Farmer, Cambridge Computer

Desktop Virtualization with VMware Horizon View 5.2 on Dell EqualLogic PS6210XS Hybrid Storage Array

Solution Paper. Virtualization security solutions provide a competitive advantage to service providers IaaS, PaaS and SaaS

COLO: COarse-grain LOck-stepping Virtual Machine for Non-stop Service

VIDEO SURVEILLANCE WITH SURVEILLUS VMS AND EMC ISILON STORAGE ARRAYS

The future is in the management tools. Profoss 22/01/2008

Xen and XenServer Storage Performance

VDI Without Compromise with SimpliVity OmniStack and Citrix XenDesktop

NetApp Deduplication for FAS and V-Series Deployment and Implementation Guide

Data Backup and Archiving with Enterprise Storage Systems

EMC FLASH STRATEGY. Flash Everywhere - XtremIO. Massimo Marchetti. Channel Business Units Specialty Sales EMC massimo.marchetti@emc.

Virtual Desktop Infrastructure (VDI) Overview

Enterprise Storage Solution for Hyper-V Private Cloud and VDI Deployments using Sanbolic s Melio Cloud Software Suite April 2011

Deduplication in VM Environments Frank Bellosa Konrad Miller Marc Rittinghaus

SanDisk SSD Boot Storm Testing for Virtual Desktop Infrastructure (VDI)

Storage Solutions to Maximize Success in VDI Environments

Merge Healthcare Virtualization

VMware vsphere Data Protection 6.0

Virtualize Without Compromise. Protecting and Storing Virtualized Data

BEST PRACTICES GUIDE: VMware on Nimble Storage

NetApp Deduplication for FAS and V-Series Deployment and Implementation Guide

RevDedup: A Reverse Deduplication Storage System Optimized for Reads to Latest Backups

DELL. Dell Microsoft Windows Server 2008 Hyper-V TM Reference Architecture VIRTUALIZATION SOLUTIONS ENGINEERING

Microsoft Hyper-V chose a Primary Server Virtualization Platform

NEXENTA S VDI SOLUTIONS BRAD STONE GENERAL MANAGER NEXENTA GREATERCHINA

Azure VM Performance Considerations Running SQL Server

ISE 820 All Flash Array. Performance Review. Performance Review. March 2015

Storage for VDI Environments

White Paper. NexentaConnect TM Technology Review

Dell Compellent Storage Center SAN & VMware View 1,000 Desktop Reference Architecture. Dell Compellent Product Specialist Team

Leith Automotive Group: Private Hybrid Cloud Enables Company-Wide Desktop Virtualization

Investing Strategically in All Flash Arrays

Driving Big Data with OCZ Enterprise SSDs

CONSOLIDATING MICROSOFT SQL SERVER OLTP WORKLOADS ON THE EMC XtremIO ALL FLASH ARRAY

McAfee MOVE / VMware Collaboration Best Practices

EMC AVAMAR. a reason for Cloud. Deduplication backup software Replication for Disaster Recovery

VNX HYBRID FLASH BEST PRACTICES FOR PERFORMANCE

Virtual Desktops at the Speed of Memory

EMC VFCACHE ACCELERATES ORACLE

RED HAT ENTERPRISE VIRTUALIZATION

Why Choose VMware vsphere for Desktop Virtualization? WHITE PAPER

SDFS Overview. By Sam Silverberg

PERFORMANCE STUDY. NexentaConnect View Edition Branch Office Solution. Nexenta Office of the CTO Murat Karslioglu

Chapter 10: Mass-Storage Systems

Version 2.1. Copyright 2014 DataCore Software Corp. All Rights Reserved. Copyright 2014 DataCore Software Corp. All Rights Reserved.

Transcription:

Decentralized Deduplication in SAN Cluster File Systems Austin T. Clements Irfan Ahmad Murali Vilayannur Jinyuan Li VMware, Inc. MIT CSAIL

Storage Area Networks

Storage Area Networks

Storage Area Networks

Storage Area Networks

Storage Area Networks 1.3 TB

Storage Area Networks 237 GB 1.3 TB

Storage Area Networks 237 GB 5X reduction 1.3 TB

Deduplication

Deduplication

Deduplication

Deduplication

Deduplication 0e7a26.. 15ba2b.. 6cd412.. ab7373.. bc6887.. c277d6.. d5e341.. f2a4d2.. f4bea9..

Deduplication 0e7a26.. 15ba2b.. 6cd412.. ab7373.. bc6887.. c277d6.. d5e341.. f2a4d2.. f4bea9..

Deduplication 0e7a26.. 15ba2b.. 6cd412.. ab7373.. bc6887.. c277d6.. d5e341.. f2a4d2.. f4bea9..

Deduplication 0e7a26.. 15ba2b.. 6cd412.. ab7373.. bc6887.. c277d6.. d5e341.. f2a4d2.. f4bea9..

The Problem Slow Lots of IO in the write path. Can t cache the index.

The Problem Slow Lots of IO in the write path. Can t cache the index. Very Slow Writes require allocation and thus coordination. No hope of disk locality.

The Problem Slow Lots of IO in the write path. Can t cache the index. Very Slow Writes require allocation and thus coordination. No hope of disk locality. Hopelessly Slow Multi-host lock contention on shared index.

Three-Stage Deduplication Decentralized Deduplication

Three-Stage Deduplication DeDe

Three-Stage Deduplication DeDe

Three-Stage Deduplication DeDe Out-of-band deduplication of live, primary storage Process duplicates efficiently, in large batches Minimize contention on the index Resilient to stale index information Unique blocks remain mutable and sequential = No overhead for blocks that don t benefit from deduplication

Three-Stage Deduplication DeDe

Three-Stage Deduplication DeDe

Three-Stage Deduplication DeDe

Three-Stage Deduplication DeDe

Three-Stage Deduplication DeDe f4bea9.. f2a4d2.. 0e7a26.. 15ba2b.. d5e341.. bc6887.. ab7373.. 4ee207.. 9b1f28.. 9b575b.. 94c9c9.. 288bc7..

Three-Stage Deduplication DeDe f4bea9.. f2a4d2.. 0e7a26.. 15ba2b.. d5e341.. bc6887.. ab7373.. 4ee207.. 9b1f28.. 9b575b.. 94c9c9.. 288bc7..

Three-Stage Deduplication DeDe f4bea9.. f2a4d2.. 0e7a26.. 15ba2b.. d5e341.. bc6887.. ab7373.. 4ee207.. 9b1f28.. 9b575b.. 94c9c9.. 288bc7.. 0e7a26.. 15ba2b.. 6cd412.. ab7373.. bc6887.. c277d6.. d5e341.. f2a4d2.. f4bea9..

Three-Stage Deduplication DeDe f4bea9.. f2a4d2.. 0e7a26.. 15ba2b.. d5e341.. bc6887.. ab7373.. 4ee207.. 9b1f28.. 9b575b.. 94c9c9.. 288bc7.. 0e7a26.. 15ba2b.. 6cd412.. ab7373.. bc6887.. c277d6.. d5e341.. f2a4d2.. f4bea9..

Three-Stage Deduplication DeDe ab7373.. 4ee207.. 9b1f28.. 9b575b.. 94c9c9.. 288bc7.. 0e7a26.. 15ba2b.. 6cd412.. ab7373.. bc6887.. c277d6.. d5e341.. f2a4d2.. f4bea9..

Three-Stage Deduplication DeDe ab7373.. 4ee207.. 9b1f28.. 9b575b.. 94c9c9.. 288bc7.. 0e7a26.. 15ba2b.. 6cd412.. ab7373.. bc6887.. c277d6.. d5e341.. f2a4d2.. f4bea9..

Three-Stage Deduplication DeDe ab7373.. 4ee207.. 9b1f28.. 9b575b.. 94c9c9.. 288bc7.. 0e7a26.. 15ba2b.. 6cd412.. ab7373.. bc6887.. c277d6.. d5e341.. f2a4d2.. f4bea9..

Three-Stage Deduplication DeDe 0e7a26.. 15ba2b.. 6cd412.. ab7373.. bc6887.. c277d6.. d5e341.. f2a4d2.. f4bea9..

Three-Stage Deduplication DeDe 0e7a26.. 15ba2b.. 6cd412.. ab7373.. bc6887.. c277d6.. d5e341.. f2a4d2.. f4bea9..

Three-Stage Deduplication DeDe? 0e7a26.. 15ba2b.. 6cd412.. ab7373.. bc6887.. c277d6.. d5e341.. f2a4d2.. f4bea9..???

Three-Stage Deduplication DeDe? 0e7a26.. 15ba2b.. 6cd412.. ab7373.. bc6887.. c277d6.. d5e341.. f2a4d2.. f4bea9..???

Three-Stage Deduplication DeDe? 0e7a26.. 15ba2b.. 6cd412.. ab7373.. bc6887.. c277d6.. d5e341.. f2a4d2.. f4bea9..???

Implementation

Implementation

Implementation

Implementation 0e7a26.. 15ba2b.. 6cd412.. ab7373.. bc6887.. c277d6.. d5e341.. f2a4d2.. f4bea9..

Implementation 0e7a26.. 15ba2b.. 6cd412.. ab7373.. bc6887.. c277d6.. d5e341.. f2a4d2.. f4bea9..

VMFS Compare-and-share

VMFS Compare-and-share?

VMFS Compare-and-share

VMFS Compare-and-share DeDe finds duplicates. VMFS eliminates them.

Write Monitor

Write Monitor A lightweight kernel module monitors writes, computes hashes

Write Monitor A lightweight kernel module monitors writes, computes hashes It buffers the write log in userspace before writing it to disk

Write Monitor A lightweight kernel module monitors writes, computes hashes It buffers the write log in userspace before writing it to disk

Write Monitor A lightweight kernel module monitors writes, computes hashes It buffers the write log in userspace before writing it to disk Safe to buffer the log because index is resilient

Write Monitor A lightweight kernel module monitors writes, computes hashes It buffers the write log in userspace before writing it to disk Safe to buffer the log because index is resilient 150 MB of regular writes 1 MB sequential log write

The Index 0e7a26.. 15ba2b.. 6cd412.. ab7373.. bc6887.. c277d6.. d5e341.. f2a4d2.. f4bea9.. Map from hashes to block locators, list sorted by hash

The Index 0e7a26.. 15ba2b.. 6cd412.. ab7373.. bc6887.. c277d6.. d5e341.. f2a4d2.. f4bea9.. Unique Shared Map from hashes to block locators, list sorted by hash

The Index 0e7a26.. 15ba2b.. 6cd412.. ab7373.. bc6887.. c277d6.. d5e341.. f2a4d2.. f4bea9.. Unique Shared Map from hashes to block locators, list sorted by hash Unique blocks are located in files and remain mutable

The Index 0e7a26.. 15ba2b.. 6cd412.. ab7373.. bc6887.. c277d6.. d5e341.. f2a4d2.. f4bea9.. Unique Shared Map from hashes to block locators, list sorted by hash Unique blocks are located in files and remain mutable

The Index 0e7a26.. 15ba2b.. 6cd412.. ab7373.. bc6887.. c277d6.. d5e341.. f2a4d2.. f4bea9.. Unique Shared Map from hashes to block locators, list sorted by hash Unique blocks are located in files and remain mutable

The Index 0e7a26.. 15ba2b.. 6cd412.. ab7373.. bc6887.. c277d6.. d5e341.. f2a4d2.. f4bea9.. Unique Shared Map from hashes to block locators, list sorted by hash Unique blocks are located in files and remain mutable A virtual arena stores COW references to all shared blocks

The Index 0e7a26.. 15ba2b.. 6cd412.. ab7373.. bc6887.. c277d6.. d5e341.. f2a4d2.. f4bea9.. Unique Shared Map from hashes to block locators, list sorted by hash Unique blocks are located in files and remain mutable A virtual arena stores COW references to all shared blocks

The Index 0e7a26.. 15ba2b.. 6cd412.. ab7373.. bc6887.. c277d6.. d5e341.. f2a4d2.. f4bea9.. Unique Shared Map from hashes to block locators, list sorted by hash Unique blocks are located in files and remain mutable A virtual arena stores COW references to all shared blocks

Indexing and Duplicate Elimination 0e7a26.. 15ba2b.. 6cd412.. ab7373.. bc6887.. c277d6.. d5e341.. f2a4d2.. f4bea9.. Unique Shared

Indexing and Duplicate Elimination 0e7a26.. 15ba2b.. 6cd412.. ab7373.. bc6887.. c277d6.. d5e341.. f2a4d2.. f4bea9.. Unique Shared

Indexing and Duplicate Elimination 0e7a26.. 15ba2b.. 6cd412.. ab7373.. bc6887.. c277d6.. d5e341.. f2a4d2.. f4bea9.. Unique Shared

Indexing and Duplicate Elimination 0e7a26.. 15ba2b.. 6cd412.. ab7373.. bc6887.. c277d6.. d5e341.. f2a4d2.. f4bea9.. 6cd412.. 12067c.. c277d6.. Unique Shared

Indexing and Duplicate Elimination 0e7a26.. 15ba2b.. 6cd412.. ab7373.. bc6887.. c277d6.. d5e341.. f2a4d2.. f4bea9.. 12067c.. 6cd412.. c277d6.. Unique Shared

Indexing and Duplicate Elimination 0e7a26.. 15ba2b.. 6cd412.. ab7373.. bc6887.. c277d6.. d5e341.. f2a4d2.. f4bea9.. 12067c.. 6cd412.. c277d6.. Unique Shared

Indexing and Duplicate Elimination 0e7a26.. 15ba2b.. 6cd412.. ab7373.. bc6887.. c277d6.. d5e341.. f2a4d2.. f4bea9.. 12067c.. 6cd412.. c277d6.. Unique Shared

Indexing and Duplicate Elimination 0e7a26.. 12067c.. 15ba2b.. 6cd412.. ab7373.. bc6887.. c277d6.. d5e341.. f2a4d2.. f4bea9.. 6cd412.. c277d6.. Unique Shared

Indexing and Duplicate Elimination 0e7a26.. 12067c.. 15ba2b.. 6cd412.. ab7373.. bc6887.. c277d6.. d5e341.. f2a4d2.. f4bea9.. 6cd412.. c277d6.. Unique Shared

Indexing and Duplicate Elimination 0e7a26.. 12067c.. 15ba2b.. 6cd412.. ab7373.. bc6887.. c277d6.. d5e341.. f2a4d2.. f4bea9.. 6cd412.. c277d6.. Unique Shared

Indexing and Duplicate Elimination 0e7a26.. 12067c.. 15ba2b.. 6cd412.. ab7373.. bc6887.. c277d6.. d5e341.. f2a4d2.. f4bea9.. c277d6.. Unique Shared

Indexing and Duplicate Elimination 0e7a26.. 12067c.. 15ba2b.. 6cd412.. ab7373.. bc6887.. c277d6.. d5e341.. f2a4d2.. f4bea9.. c277d6.. Unique Shared

Indexing and Duplicate Elimination 0e7a26.. 12067c.. 15ba2b.. 6cd412.. ab7373.. bc6887.. c277d6.. d5e341.. f2a4d2.. f4bea9.. c277d6.. Unique Shared

Indexing and Duplicate Elimination 0e7a26.. 12067c.. 15ba2b.. 6cd412.. ab7373.. bc6887.. c277d6.. d5e341.. f2a4d2.. f4bea9.. c277d6..? Unique Shared

Indexing and Duplicate Elimination 0e7a26.. 12067c.. 15ba2b.. 6cd412.. ab7373.. bc6887.. c277d6.. d5e341.. f2a4d2.. f4bea9.. c277d6.. Unique Shared

Indexing and Duplicate Elimination 0e7a26.. 12067c.. 15ba2b.. 6cd412.. ab7373.. bc6887.. c277d6.. d5e341.. f2a4d2.. f4bea9.. c277d6.. Unique Shared

Indexing and Duplicate Elimination 0e7a26.. 12067c.. 15ba2b.. 6cd412.. ab7373.. bc6887.. c277d6.. d5e341.. f2a4d2.. f4bea9.. Unique Shared

Indexing and Duplicate Elimination 0e7a26.. 12067c.. 15ba2b.. 6cd412.. ab7373.. bc6887.. c277d6.. d5e341.. f2a4d2.. f4bea9.. Unique Shared

Indexing and Duplicate Elimination 0e7a26.. 12067c.. 15ba2b.. 6cd412.. ab7373.. bc6887.. c277d6.. d5e341.. f2a4d2.. f4bea9.. Unique Shared

Indexing and Duplicate Elimination 0e7a26.. 12067c.. 15ba2b.. 6cd412.. ab7373.. bc6887.. c277d6.. d5e341.. f2a4d2.. f4bea9.. Unique Shared

Phew.

Evaluation

Evaluation How much space does DeDe save?

Evaluation How much space does DeDe save? How much overhead does DeDe introduce?

Evaluation How much space does DeDe save? How much overhead does DeDe introduce? How fast can DeDe deduplicate?

Space Savings: VDI Cluster Corporate Virtual Desktop Infrastructure cluster Desktop XP VM s 6 12 months of active use Originally cloned from small number of base images...

Space Savings: VDI Cluster Corporate Virtual Desktop Infrastructure cluster Desktop XP VM s 6 12 months of active use Originally cloned from small number of base images 1.9 TB 113 VM s...

Space Savings: VDI Cluster Corporate Virtual Desktop Infrastructure cluster Desktop XP VM s 6 12 months of active use Originally cloned from small number of base images 1.9 TB 1.3 TB 113 VM s...

Space Savings: VDI Cluster 1.3 TB

Space Savings: VDI Cluster 237 GB 1.3 TB

Space Savings: VDI Cluster 237 GB 1.3 TB 173 GB unique 61 GB shared

Space Savings: VDI Cluster 237 GB 1.3 TB 2.7 GB 173 GB unique 61 GB shared

Space Savings: VDI Cluster 237 GB 1.3 TB 2.7 GB 173 GB unique 61 GB shared 1.3 GB index file 194 MB v. arena 1.1 GB FS metadata

Runtime Effects Write monitoring Disk array caching

Runtime Effects Write monitoring Disk array caching

Runtime Effects Write monitoring Disk array caching EMC CLARiiON CX3-40

Runtime Overhead: Write Monitoring Worst-case benchmark: 100% sequential write IO, No computation

Runtime Overhead: Write Monitoring Worst-case benchmark: 100% sequential write IO, No computation Baseline Write Monitor CPU 33% 220%

Runtime Overhead: Write Monitoring Worst-case benchmark: 100% sequential write IO, No computation Baseline Write Monitor CPU 33% 220% Bandwidth (MB/s) 233 233 Latency (ms) 8.6 8.6

Runtime Overhead: Write Monitoring Worst-case benchmark: 100% sequential write IO, No computation Baseline Write Monitor CPU 33% 220% Bandwidth (MB/s) 233 233 Latency (ms) 8.6 8.6 (See paper for database application benchmark)

Runtime Gains: Disk Array Caching Reduced storage footprint Better caching Less IO

Runtime Gains: Disk Array Caching Reduced storage footprint Better caching Less IO Average boot time (secs) 90 80 70 60 50 40 30 20 10 0 VDI "boot storm" 2 4 6 8 10 12 14 16 18 20 # VMs booting concurrently

Runtime Gains: Disk Array Caching Reduced storage footprint Better caching Less IO Average boot time (secs) 90 80 70 60 50 40 30 20 10 0 VDI "boot storm" Full copies 2 4 6 8 10 12 14 16 18 20 # VMs booting concurrently

Runtime Gains: Disk Array Caching Reduced storage footprint Better caching Less IO Average boot time (secs) 90 80 70 60 50 40 30 20 10 0 VDI "boot storm" Full copies Deduplicated 2 4 6 8 10 12 14 16 18 20 # VMs booting concurrently

Out-of-band Deduplication Rate Index scan COW sharing

Out-of-band Deduplication Rate Index scan COW sharing 6.6 GB/sec Virtually no cost for unique blocks

Out-of-band Deduplication Rate Index scan 6.6 GB/sec COW sharing 2.6 MB/sec Virtually no cost for unique blocks

Out-of-band Deduplication Rate Index scan 6.6 GB/sec COW sharing 2.6 MB/sec (It s a prototype!) Virtually no cost for unique blocks

Out-of-band Deduplication Rate Index scan 6.6 GB/sec COW sharing 2.6 MB/sec (It s a prototype!) Virtually no cost for unique blocks 9 GB of new shared blocks per hour (And provisioning can be special-cased)

Related Work Centralized archival Venti Data Domain Foundation Centralized primary storage NetApp ASIS Microsoft Single Instance Store Distributed Farsite SAN with Coordinator DDE

Conclusion Decentralized, out-of-band, live file system deduplication.

Conclusion Decentralized, out-of-band, live file system deduplication. Deduplication is effective.

Conclusion Decentralized, out-of-band, live file system deduplication. Deduplication is effective. Deduplication is hard.

Conclusion Decentralized, out-of-band, live file system deduplication. Deduplication is effective. Deduplication is hard. Three-stage deduplication has only modest performance overhead.

Conclusion Decentralized, out-of-band, live file system deduplication. Deduplication is effective. Deduplication is hard. Three-stage deduplication has only modest performance overhead. Thank you.