FAST 11. Yongseok Oh <ysoh@uos.ac.kr> University of Seoul. Mobile Embedded System Laboratory

Similar documents

CAFTL: A Content-Aware Flash Translation Layer Enhancing the Lifespan of Flash Memory based Solid State Drives

Offline Deduplication for Solid State Disk Using a Lightweight Hash Algorithm

Speeding Up Cloud/Server Applications Using Flash Memory

MAD2: A Scalable High-Throughput Exact Deduplication Approach for Network Backup Services

ChunkStash: Speeding up Inline Storage Deduplication using Flash Memory

Top Ten Questions. to Ask Your Primary Storage Provider About Their Data Efficiency. May Copyright 2014 Permabit Technology Corporation

A Data De-duplication Access Framework for Solid State Drives

Inline Deduplication

ioscale: The Holy Grail for Hyperscale

Using Synology SSD Technology to Enhance System Performance Synology Inc.

Best Practices for Optimizing SQL Server Database Performance with the LSI WarpDrive Acceleration Card

SOS: Software-Based Out-of-Order Scheduling for High-Performance NAND Flash-Based SSDs

A Deduplication File System & Course Review

Tradeoffs in Scalable Data Routing for Deduplication Clusters

Solid State Drive (SSD) FAQ

SkimpyStash: RAM Space Skimpy Key-Value Store on Flash-based Storage

Deduplication in SSDs: Model and Quantitative Analysis

SSD Performance Tips: Avoid The Write Cliff

Theoretical Aspects of Storage Systems Autumn 2009

Distributed File Systems

Using Synology SSD Technology to Enhance System Performance Synology Inc.

The Curious Case of Database Deduplication. PRESENTATION TITLE GOES HERE Gurmeet Goindi Oracle

Block-level Inline Data Deduplication in ext3

Using Synology SSD Technology to Enhance System Performance. Based on DSM 5.2

A Novel Way of Deduplication Approach for Cloud Backup Services Using Block Index Caching Technique

A SCALABLE DEDUPLICATION AND GARBAGE COLLECTION ENGINE FOR INCREMENTAL BACKUP

Key Components of WAN Optimization Controller Functionality

A Survey on Aware of Local-Global Cloud Backup Storage for Personal Purpose

Physical Data Organization

Indexing on Solid State Drives based on Flash Memory

Read Performance Enhancement In Data Deduplication For Secondary Storage

DEXT3: Block Level Inline Deduplication for EXT3 File System

Understanding endurance and performance characteristics of HP solid state drives

Condusiv s V-locity Server Boosts Performance of SQL Server 2012 by 55%

The What, Why and How of the Pure Storage Enterprise Flash Array

Building a High Performance Deduplication System Fanglu Guo and Petros Efstathopoulos

In-Block Level Redundancy Management for Flash Storage System

Flash Memory Technology in Enterprise Storage

WHITE PAPER FUJITSU PRIMERGY SERVER BASICS OF DISK I/O PERFORMANCE

IS IN-MEMORY COMPUTING MAKING THE MOVE TO PRIME TIME?

Data Backup and Archiving with Enterprise Storage Systems

An Exploration of Hybrid Hard Disk Designs Using an Extensible Simulator

Databases Acceleration with Non Volatile Memory File System (NVMFS) PRESENTATION TITLE GOES HERE Saeed Raja SanDisk Inc.

Hardware Configuration Guide

Evaluation Report: Database Acceleration with HP 3PAR StoreServ 7450 All-flash Storage

Nutanix Tech Note. Configuration Best Practices for Nutanix Storage with VMware vsphere

Multi-level Metadata Management Scheme for Cloud Storage System

SOLUTION BRIEF. Resolving the VDI Storage Challenge

Calsoft Webinar - Debunking QA myths for Flash- Based Arrays

Solid State Storage in Massive Data Environments Erik Eyberg

Impact of Stripe Unit Size on Performance and Endurance of SSD-Based RAID Arrays

09'Linux Plumbers Conference

Contents. WD Arkeia Page 2 of 14

FlashSoft Software from SanDisk : Accelerating Virtual Infrastructures

Accelerating Server Storage Performance on Lenovo ThinkServer

FlashTier: a Lightweight, Consistent and Durable Storage Cache

CONSOLIDATING MICROSOFT SQL SERVER OLTP WORKLOADS ON THE EMC XtremIO ALL FLASH ARRAY

SH-Sim: A Flexible Simulation Platform for Hybrid Storage Systems

COS 318: Operating Systems

ABSTRACT 1 INTRODUCTION

Avoiding the Disk Bottleneck in the Data Domain Deduplication File System

The Classical Architecture. Storage 1 / 36

CURRENTLY, the enterprise data centers manage PB or

Why Computers Are Getting Slower (and what we can do about it) Rik van Riel Sr. Software Engineer, Red Hat

VMware Virtual SAN Backup Using VMware vsphere Data Protection Advanced SEPTEMBER 2014

File System Management

Boost SQL Server Performance Buffer Pool Extensions & Delayed Durability

Distributed File System. MCSN N. Tonellotto Complements of Distributed Enabling Platforms

Deduplication Demystified: How to determine the right approach for your business

IBM Systems and Technology Group May 2013 Thought Leadership White Paper. Faster Oracle performance with IBM FlashSystem

A Deduplication-based Data Archiving System

Accelerating Enterprise Applications and Reducing TCO with SanDisk ZetaScale Software

The assignment of chunk size according to the target data characteristics in deduplication backup system

File-System Implementation

VDI Optimization Real World Learnings. Russ Fellows, Evaluator Group

Accelerating I/O- Intensive Applications in IT Infrastructure with Innodisk FlexiArray Flash Appliance. Alex Ho, Product Manager Innodisk Corporation

89 Fifth Avenue, 7th Floor. New York, NY White Paper. HP 3PAR Thin Deduplication: A Competitive Comparison

A Context Aware Block Layer: The Case for Block Layer Deduplication

FUSION iocontrol HYBRID STORAGE ARCHITECTURE 1

A Comparison of Client and Enterprise SSD Data Path Protection

Design and Implementation of a Storage Repository Using Commonality Factoring. IEEE/NASA MSST2003 April 7-10, 2003 Eric W. Olsen

Boosting Database Batch workloads using Flash Memory SSDs

All-Flash Arrays: Not Just for the Top Tier Anymore

DEDUPLICATION has become a key component in modern

WAN Optimized Replication of Backup Datasets Using Stream-Informed Delta Compression

Implementation of Buffer Cache Simulator for Hybrid Main Memory and Flash Memory Storages

A High-Throughput In-Memory Index, Durable on Flash-based SSD

File Systems for Flash Memories. Marcela Zuluaga Sebastian Isaza Dante Rodriguez

A Content-Based Load Balancing Algorithm for Metadata Servers in Cluster File Systems*

Byte-index Chunking Algorithm for Data Deduplication System

Introduction Disks RAID Tertiary storage. Mass Storage. CMSC 412, University of Maryland. Guest lecturer: David Hovemeyer.

ACHIEVING STORAGE EFFICIENCY WITH DATA DEDUPLICATION

VDI Without Compromise with SimpliVity OmniStack and Citrix XenDesktop

Data De-duplication Methodologies: Comparing ExaGrid s Byte-level Data De-duplication To Block Level Data De-duplication

Benchmarking Cassandra on Violin

idedup: Latency-aware, inline data deduplication for primary storage

WHITE PAPER Improving Storage Efficiencies with Data Deduplication and Compression

EMC VNXe File Deduplication and Compression

SOLID STATE DRIVES AND PARALLEL STORAGE

Transcription:

CAFTL: A Content-Aware Flash Translation Layer Enhancing the Lifespan of flash Memory based Solid State Drives FAST 11 Yongseok Oh <ysoh@uos.ac.kr> University of Seoul Mobile Embedded System Laboratory 1

Introduction The limited lifespan is the Achilles hill of Flash Memory based SSDs Not-so-well-known reasons As bit density increases, flash memory chips become less reliable and less durable Traditional redundancy solutions are considered less effective for SSDs Some prior research work has presented empirical and modeling based studies on the lifespan of flash memories The lifespan of SSDs is a function of three factors The amount of incoming write traffic The size of over-provisioned flash space The efficiency of garbage collection and wear-leveling mechanisms Mobile Embedded System Laboratory 2

Data Duplication is Common Kernel developers can have multiple kernel sources Word tools often automatically save a copy of documents Mobile Embedded System Laboratory 3

Contributions Study of data duplications for improving endurance of SSDs in file systems and various workloads Design of a content-aware FTL to extend the SSD lifespan by removing duplicate writes (up to 24.2%) and redundant data (up to 31.2%) with minimal overhead. Acceleration methods for in-line deduplication in SSD devices Implementation of CAFTL in the DiskSim simulator Mobile Embedded System Laboratory 4

Technical Challenges Limited resources CAFTL is designed for running in an SSD device with limited memory space and computing power, rather than running on a dedicated powerful enterprise server. Relatively lower redundancy CAFTL mostly handles regular file system workloads, which have an impressive but much lower duplication rate than that of backup streams with high redundancy (often 10 times or even higher). Lack of semantic hints CAFTL works at the device level and only sees a sequence of logical blocks without any semantic hints from host file systems. Low overhead requirement CAFTL must retain high data access performance for regular workloads, while this is a less stringent requirement in backup systems that can run during out-of-office hours. Mobile Embedded System Laboratory 5

The Design of CAFTL Tree critical objectives Reducing unnecessary write traffic Extending available flash space Retaining access performance Mobile Embedded System Laboratory 6

Hashing and Fingerprint Store A fixed-sized chunking approach (e.g. 4KB) The basic operation unit in flash is a page (Mapping policy) SHA1 (160-bit) Fingerprint store in memory Only 10-20% of the fingerprints are highly deduplicated (skewed) Most fingerprints are unique A complete search in the fingerprint store would incur high lookup latencies We should only store and search in the most likely-to-be-duplicated fingerprint in memory 15 Disks Mobile Embedded System Laboratory 7

Hashing and Fingerprint Store {fingerprint,(location, reference)} Fingerprint f Hash Function f mod N Highly Duplicated Fingerprints are kept in RAM Optimization techniques (1) Range Check (2) Hotness-based Reorganization (3) Bucket level binary Search Seg 0 Seg N-1 Bucket Memory Flash If a fingerprint cannot be found in Fingerprint Store, Out-of-line scanning can still identify these duplicates later Mobile Embedded System Laboratory 8

Indirect Mapping Two-level indirect mapping Mobile Embedded System Laboratory 9

Acceleration Methods Fingerprinting is the key bottleneck of the in-line deduplicaton in CAFTL Three acceleration methods have been proposed Sampling for hashing Light-weight pre-hashing Dynamic switches Mobile Embedded System Laboratory 10

Sampling for hashing We selectively pick only one page as a sample page for fingerprint We use this sample fingerprint to query the fingerprint store We propose to use Content-based Sampling We select the first four bytes, called sample bytes Mobile Embedded System Laboratory 11

Light-weight pre-hashing Producing a 32-bit CRC32 hash value is over 10 times faster than computing a 160-bit SHA-1 hash value Reducing the hash strength would not incur a significant increase of false positives for a typical SSD capacity Light-weight pre-hashing Compute a CRC32 and query the fingerprint store If a mach is found Generate SHA1 fingerprint and confirm it Otherwise, write data to flash We have also considered using a Bloom filter Multiple hashings Summary vector cannot be updated when a fingerprint is removed Mobile Embedded System Laboratory 12

Dynamic switches Incoming requests may wait for available buffer space to be released by previous requests Dynamic switch Set high watermark and low watermark to turn on/off inine deduplication If the percentage of the occupied cache space reaches a high watermark (95%) We disable in-line deduplication to flush writes quickly to flash Once the low watermark (50%) is hit, we re-enable the in-line deduplication. Mobile Embedded System Laboratory 13

Out-of-line Deduplication CAFTL does not pursue a perfect in-line deduplication An internal routine is periodically launched to perform out-ofline fingerprinting and out-of-line deduplication during the device idle Mobile Embedded System Laboratory 14

Performance Evaluation Experimental Systems SSD Simulator Microsoft Research SSD extension for the DiskSim Write buffering SSD Configurations Workloads and trace collection Desktop Hadoop TPC-H Transaction TPC-C Mobile Embedded System Laboratory 15

Effective of Deduplication Removing duplicate writes Mobile Embedded System Laboratory 16

Effective of Deduplication Extending flash space Mobile Embedded System Laboratory 17

Performance Impact Cache size Mobile Embedded System Laboratory 18

Performance Impact Hashing speed Mobile Embedded System Laboratory 19

Performance Impact Fingerprint searching Mobile Embedded System Laboratory 20

Acceleration Methods Sampling Because of the significantly reduced waiting time for the buffer Mobile Embedded System Laboratory 21

Acceleration Methods Light-weight pre-hashing This workload is write intensive and has a long waiting queue, which makes queuing effect particularly significant Mobile Embedded System Laboratory 22

Acceleration Methods Dynamic switch Mobile Embedded System Laboratory 23

Conclusions CAFTL can remove duplicate writes Enhance the lifespan of SSDs while retaining high data access performance If budget allows, we would suggest maintaining the fingerprint store fully in memory, which not only improves deduplication rate but also simplifies designs PCM into the SSDs to maintain the metadata Which can remove much design complexity On-line compression into SSDs Mobile Embedded System Laboratory 24