PLC-Cache: Endurable SSD Cache for Deduplication-based Primary Storage



Similar documents
Speeding Up Cloud/Server Applications Using Flash Memory

Why are state-of-the-art flash-based multi-tiered storage systems performing poorly for HTTP video streaming?

Read Performance Enhancement In Data Deduplication For Secondary Storage

Main Memory Data Warehouses

Why Computers Are Getting Slower (and what we can do about it) Rik van Riel Sr. Software Engineer, Red Hat

White paper. QNAP Turbo NAS with SSD Cache

Using Synology SSD Technology to Enhance System Performance Synology Inc.

An Overview of Flash Storage for Databases

Using Synology SSD Technology to Enhance System Performance Synology Inc.

High Performance Computing Specialists. ZFS Storage as a Solution for Big Data and Flexibility

Sistemas Operativos: Input/Output Disks

Secondary Storage. Any modern computer system will incorporate (at least) two levels of storage: magnetic disk/optical devices/tape systems

Operating Systems. Virtual Memory

Hyper-converged IT drives: - TCO cost savings - data protection - amazing operational excellence

Hardware Configuration Guide

Theoretical Aspects of Storage Systems Autumn 2009

Flash 101. Violin Memory Switzerland. Violin Memory Inc. Proprietary 1

Accelerating Server Storage Performance on Lenovo ThinkServer

MAD2: A Scalable High-Throughput Exact Deduplication Approach for Network Backup Services

SOLUTION BRIEF. Resolving the VDI Storage Challenge

Solid State Storage in Massive Data Environments Erik Eyberg

Using Synology SSD Technology to Enhance System Performance. Based on DSM 5.2

How To Get The Most Out Of An Ecm Xtremio Flash Array

A Novel Way of Deduplication Approach for Cloud Backup Services Using Block Index Caching Technique

Fusion iomemory iodrive PCIe Application Accelerator Performance Testing

FAST 11. Yongseok Oh University of Seoul. Mobile Embedded System Laboratory

Fragmentation in in-line. deduplication backup systems

Assuring Demanded Read Performance of Data Deduplication Storage with Backup Datasets

idedup: Latency-aware, inline data deduplication for primary storage

COS 318: Operating Systems. Storage Devices. Kai Li Computer Science Department Princeton University. (

SSD Performance Tips: Avoid The Write Cliff

hybridfs: Integrating NAND Flash-Based SSD and HDD for Hybrid File System

Cost Effective Backup with Deduplication. Copyright 2009 EMC Corporation. All rights reserved.

Flash for Databases. September 22, 2015 Peter Zaitsev Percona

Bringing Greater Efficiency to the Enterprise Arena

HP Z Turbo Drive PCIe SSD

A Survey on Aware of Local-Global Cloud Backup Storage for Personal Purpose

Practical Applications of Lustre/ZFS Hybrid Systems LUG 2014 Miami FL

Evaluation Report: Supporting Microsoft Exchange on the Lenovo S3200 Hybrid Array

Open Source Data Deduplication

Frequently Asked Questions August s840z SAS SSD Enterprise-Class Solid-State Device. Frequently Asked Questions

FREE VOICE CALLING IN WIFI CAMPUS NETWORK USING ANDROID

VDI Without Compromise with SimpliVity OmniStack and Citrix XenDesktop

Software Environment. Options. Service guarantee:. 24/7 Hardware Support. 99% uptime

CONSOLIDATING MICROSOFT SQL SERVER OLTP WORKLOADS ON THE EMC XtremIO ALL FLASH ARRAY

The What, Why and How of the Pure Storage Enterprise Flash Array

ntier Verde Simply Affordable File Storage

Maximizing Your Server Memory and Storage Investments with Windows Server 2012 R2

STORAGE. Buying Guide: TARGET DATA DEDUPLICATION BACKUP SYSTEMS. inside

SDFS Overview. By Sam Silverberg

Frequently Asked Questions December ZeusRAM Solid-State Drive. Frequently Asked Questions

Indexing on Solid State Drives based on Flash Memory

E4 UNIFIED STORAGE powered by Syneto

Intel Solid- State Drive Data Center P3700 Series NVMe Hybrid Storage Performance

09'Linux Plumbers Conference

Arif Goelmhd Goelammohamed Solutions Hyperconverged Infrastructure: The How-To and Why Now?

NV-DIMM: Fastest Tier in Your Storage Strategy

Data Backup and Archiving with Enterprise Storage Systems

What is the real cost of Commercial Cloud provisioning? Thursday, 20 June 13 Lukasz Kreczko - DICE 1

SCI Briefing: A Review of the New Hitachi Unified Storage and Hitachi NAS Platform 4000 Series. Silverton Consulting, Inc.

A+ Guide to Managing and Maintaining Your PC, 7e. Chapter 1 Introducing Hardware

Filesystems Performance in GNU/Linux Multi-Disk Data Storage

Comparison of Hybrid Flash Storage System Performance

Reference Guide WindSpring Data Management Technology (DMT) Solving Today s Storage Optimization Challenges

Multi-level Metadata Management Scheme for Cloud Storage System

Delivering SDS simplicity and extreme performance

Solid State Drive Technology

Outline. CS 245: Database System Principles. Notes 02: Hardware. Hardware DBMS Data Storage

WAVE: Popularity-based and Collaborative In-network Caching for Content-Oriented Networks

The new frontier of the DATA acquisition using 1 and 10 Gb/s Ethernet links. Filippo Costa on behalf of the ALICE DAQ group

Demystifying Deduplication for Backup with the Dell DR4000

Brainlab Node TM Technical Specifications

Evaluation of the deduplication file system "Opendedup"

Analyzing Big Data with Splunk A Cost Effective Storage Architecture and Solution

Real-Time Storage Tiering for Real-World Workloads

Cloud Storage. Parallels. Performance Benchmark Results. White Paper.

<Insert Picture Here> Refreshing Your Data Protection Environment with Next-Generation Architectures

Analysis of VDI Storage Performance During Bootstorm

Database!Fatal!Flash!Flaws!No!One! Talks!About!!!

Flash s Role in Big Data, Past Present, and Future OBJECTIVE ANALYSIS. Jim Handy

FLASH IMPLICATIONS IN ENTERPRISE STORAGE ARRAY DESIGNS

For Hyper-V Edition Practical Operation Seminar. 4th Edition

EMC Unified Storage for Microsoft SQL Server 2008

DEDUPLICATION has become a key component in modern

QUESTIONS & ANSWERS. ItB tender 72-09: IT Equipment. Elections Project

The Why and How of SSD Performance Benchmarking. Esther Spanjer, SMART Modular Easen Ho, Calypso Systems

Backup Software Data Deduplication: What you need to know. Presented by W. Curtis Preston Executive Editor & Independent Backup Expert

Using SSD to boost performance on Infortrend NAS storage systems

DEPLOYING HYBRID STORAGE POOLS With Sun Flash Technology and the Solaris ZFS File System. Roger Bitar, Sun Microsystems. Sun BluePrints Online

Advancements in Storage QoS Management in National Data Storage

Data Deduplication in Tivoli Storage Manager. Andrzej Bugowski Spała

Transcription:

PLC-Cache: Endurable SSD Cache for Deduplication-based Primary Storage Jian Liu, Yunpeng Chai, Yuan Xiao Renmin University of China Xiao Qin Auburn University Speaker: Tao Xie MSST, June 5, 2014

Deduplication Age of Big Data Bad news: Goble data will be more than 40 ZB in 2020 [IDC]; Capacity and price of disks changes slowly after reaching TB-level Good new: 75% data is redundant [IDC] Dedup improves space-efficiency Secondary Dedup: backup, archive systems (most of previous work on dedup) Primary Dedup: online storage systems (e.g., email, database, web, etc.) [idedup@fast 12, ZFS, SDFS, LessFS, LBFS]

Primary Deduplication Reasons of limited performance of Dedup Additional metadata accesses Data Fragmentation on disks Defragmentation: Limited containers of each file [Capping@FAST 13] Storing sequence of chunks [idedup@fast 12] SSD data cache to boost [SAR@NAS 12] (focus of this paper) Large, Fast, Energy-efficient

SSD Cache to Boost Container-level caching instead of chunk-level caching [e.g. SAR] Reading less containers from HDD (as the example) Write amplification reduction Containers size match erase unit of Flash chips (several MB)

Challenge of SSD Cache Too much writes on SSD cache leads to Low performance caused by request congestion; Importantly, VERY SHORT lifetime of SSD cache Cache Algorithms The measured writing speed of SSD Expected 60GB Intel 520 SSD lifetime Expected 60GB Intel 910 SSD lifetime FIFO 56.7 MB/s 7.6 Days 162 Days LRU 50.7 MB/s 8.5 Days 181 Days LFU 53.1 MB/s 8.1 Days 173 Days Intel 520 SSD, 60 GB TBW: 36 TB (used in experiment) Intel 910 SSD, 400 GB TBW: 5 PB (enterprise SSD) Results are from an experiment based on a dedup system with 60GB SSD cache

Analysis Analyze the composition of written data set Is all the written data NECESSARY? Four quadrants Caching target: PLC data (Best): QA NOT necessary QC/QD: few benefits QB: repeat writes Loop: evict-enter-evict-enter-

Analysis (Cont.) Trace Analysis PLC data (QA): low percentage Chunk -> container: QA (PLC data) QC+QD QB FIFO, LRU, LFU have similar results Other real-world traces lead to similar observations

Basic Idea Strategies to cache PLC data Exclude QC and QD data with low benefits Convert QB1 data with long-term hot potential to PLC data Exclude QB2 data with similar popularity with cached one

PLC-Cache Three modules (steps) 1. Popularity Filter excludes QC and QD data; 2. Delayed Eviction converts QB1 data into PLC data 3. In-Out Comparison excludes QB2 data

Example (Popularity Filter) A missed container b b.ac < PF.thr: not replace b.ac >= PF.thr: SSD Cache is not full -> b enter SSD cache; SSD cache is full -> Container s properties: ac: access_count

Example (Delayed Eviction) Selecting victim in SSD cache Pointer scans, a.ip-- a.ip == 0: a.dc--; a.dc > 0 -> not replace a.dc == 0 -> Container s properties: ac: access_count +1 when hit ip: inter_priority dc: delay_count

Example (In-Out Comparison) Comparison between container b and a (b.rc > a.rc) or (b.ac > IOC.m*a.ac) -> replace else -> not replace Container s properties: ac: access_count ip: inter_priority dc: delay_count rc: reference_count Related file count of dedup

Evaluations Experiment Setup Platform Primary dedup enhancement based on Destor (https://github.com/fomy/destor) Hardware Intel Core (TM) i7-2600 quad-core CPU, 3.40GHz 4GB RAM, 2TB Western Digital hard disk 60GB Intel 520 SSD Data Collected in the laboratory server of RUC, 200GB Access Randomly read 520GB data (Zipf distribution)

Overall Results PLC-Cache vs. HDD-only Performance: 41.9% SSD: 20% capacity of HDD data PLC-Cache vs. traditional Cache Performance: 23.4% SSD written amount: >15x

Distributions of four quadrants Amount of SSD written data LRU: 48.77 GB PLC-Cache: 3.18 GB 15.34x Four quadrants QA (PLC data): 0% 81.63% QB: 26.23% 18.37% QC+QD: 73.77% 0% LRU PLC-Cache

Under Various SSD Cache Size Performance PLC-Cache always outperforms LRU Amount of SSD written data PLC-Cache is steadily very low LRU writes less for larger SSD, because of high hit rate.

Under Various α of Zipf Performance PLC-Cache always outperforms LRU Higher performance for a larger α (i.e. more concentrated access) Amount of SSD written data PLC-Cache is steadily very low

Under Various Container Size Performance PLC-Cache is better than LRU Bad performance for too large container Amount of SSD written data PLC-Cache is always much lower than LRU Container-level caching Reduce erase operations by 54.4% compared with SAR By reading S.M.A.R.T. info of SSD before and after experiment.

Under Various Chunk Size PLC-Cache outperforms LRU in both performance and SSD writes. Chunk size does not have obvious impacts Only too small chunk leads to worse performance, because of more metadata to access for more chunks

Impacts of PF threshold Low threshold of PF means more QC/QD data enters SSD cache More SSD writes Request congestion Performance and SSD writing amount exhibit similar pattern

Summarizing SSD-age caching Previous cache algorithms Unlimited write: FIFO, LRU, LFU, MQ, LIRS, ARC, Simply remove some writes: LARC, SieveStore, EMC FastCache, SUN L2ARC PLC-Cache: Comprehensive analysis of cache s written data set A series method to improve distribution of 4 quadrants Excellent results: performance 23.4% amount of SSD written data 15x

Thank You! ypchai@ruc.edu.cn