Why are state-of-the-art flash-based multi-tiered storage systems performing poorly for HTTP video streaming?

Similar documents
Impact of Flash Memory on Video-on-Demand Storage: Analysis of Tradeoffs

An Overview of Flash Storage for Databases

The Data Placement Challenge

Flash-Friendly File System (F2FS)

Accelerating Enterprise Applications and Reducing TCO with SanDisk ZetaScale Software

PLC-Cache: Endurable SSD Cache for Deduplication-based Primary Storage

USB Flash Drives as an Energy Efficient Storage Alternative

White Paper. Educational. Measuring Storage Performance

Indexing on Solid State Drives based on Flash Memory

Accelerating Server Storage Performance on Lenovo ThinkServer

Parallel Processing and Software Performance. Lukáš Marek

Scalus Winter School Storage Systems

Lab Evaluation of NetApp Hybrid Array with Flash Pool Technology

Best Practices for Increasing Ceph Performance with SSD

Flash Performance in Storage Systems. Bill Moore Chief Engineer, Storage Systems Sun Microsystems

Using Synology SSD Technology to Enhance System Performance Synology Inc.

Caching Mechanisms for Mobile and IOT Devices

LSI MegaRAID CacheCade Performance Evaluation in a Web Server Environment

Flash for Databases. September 22, 2015 Peter Zaitsev Percona

Database Hardware Selection Guidelines

89 Fifth Avenue, 7th Floor. New York, NY White Paper. HP 3PAR Adaptive Flash Cache: A Competitive Comparison

CAVE: Channel-Aware Buffer Management Scheme for Solid State Disk

Solid State Storage in Massive Data Environments Erik Eyberg

Flash 101. Violin Memory Switzerland. Violin Memory Inc. Proprietary 1

High Performance Computing Specialists. ZFS Storage as a Solution for Big Data and Flexibility

SSD Write Performance IOPS Confusion Due to Poor Benchmarking Techniques

Using Synology SSD Technology to Enhance System Performance Synology Inc.

Virtuoso and Database Scalability

The What, Why and How of the Pure Storage Enterprise Flash Array

DIABLO TECHNOLOGIES MEMORY CHANNEL STORAGE AND VMWARE VIRTUAL SAN : VDI ACCELERATION

Enabling Technologies for Distributed Computing

N /150/151/160 RAID Controller. N MegaRAID CacheCade. Feature Overview

Dualog Connection Suite Hardware and Software Requirements

Managing Storage Space in a Flash and Disk Hybrid Storage System

p-oftl: An Object-based Semantic-aware Parallel Flash Translation Layer

High-Performance SSD-Based RAID Storage. Madhukar Gunjan Chakhaiyar Product Test Architect

Muse Server Sizing. 18 June Document Version Muse

SALSA Flash-Optimized Software-Defined Storage

Benchmarking Cassandra on Violin

SSDs and 4D v11 SQL. By Josh Fletcher, Technical Services Team Member, 4D Inc. Technical Note 10-09

DEPLOYING HYBRID STORAGE POOLS With Sun Flash Technology and the Solaris ZFS File System. Roger Bitar, Sun Microsystems. Sun BluePrints Online

Sistemas Operativos: Input/Output Disks

File System Reliability (part 2)

Linux flash file systems JFFS2 vs UBIFS

LSI MegaRAID FastPath Performance Evaluation in a Web Server Environment

Speeding Up Cloud/Server Applications Using Flash Memory

Enabling Technologies for Distributed and Cloud Computing

Advances in Flash Memory Technology & System Architecture to Achieve Savings in Data Center Power and TCO

Fjord: Informed Storage Management for Smartphones

SOLID STATE DRIVES AND PARALLEL STORAGE

Using Synology SSD Technology to Enhance System Performance. Based on DSM 5.2

Implementation of Buffer Cache Simulator for Hybrid Main Memory and Flash Memory Storages

Journal of Systems Architecture

Optimizing SQL Server Storage Performance with the PowerEdge R720

The Economics of Intelligent Hybrid Storage. An Enmotus White Paper Sep 2014

Datacenter Operating Systems

Integrating Flash-based SSDs into the Storage Stack

Hybrid Storage Management for Database Systems

Evaluation Report: Accelerating SQL Server Database Performance with the Lenovo Storage S3200 SAN Array

COS 318: Operating Systems. Storage Devices. Kai Li Computer Science Department Princeton University. (

A Close Look at PCI Express SSDs. Shirish Jamthe Director of System Engineering Virident Systems, Inc. August 2011

Very Large Enterprise Network Deployment, 25,000+ Users

How it can benefit your enterprise. Dejan Kocic Netapp

hybridfs: Integrating NAND Flash-Based SSD and HDD for Hybrid File System

How To Evaluate A Buffer Cache Performance On A Flash Storage Device

Data Distribution Algorithms for Reliable. Reliable Parallel Storage on Flash Memories

Capturing Client IO Workloads for SSD Performance Evaluation

Microsoft Windows Server 2003 with Internet Information Services (IIS) 6.0 vs. Linux Competitive Web Server Performance Comparison

Embedded Operating Systems in a Point of Sale Environment. White Paper

Bringing Greater Efficiency to the Enterprise Arena

Don t Let RAID Raid the Lifetime of Your SSD Array

Intel RAID Performance 12Gb/s SAS RAID Controllers

SOLUTION BRIEF. Resolving the VDI Storage Challenge

Integrating Flash-based SSDs into the Storage Stack

Methods to achieve low latency and consistent performance

Testing SSD s vs. HDD s. The Same But Different. August 11, 2009 Santa Clara, CA Anthony Lavia President & CEO

Future Prospects of Scalable Cloud Computing

Accelerate Oracle Performance by Using SmartCache of T Series Unified Storage

Very Large Enterprise Network, Deployment, Users

Database!Fatal!Flash!Flaws!No!One! Talks!About!!!

ACM, This is the author's version of the work. It is posted here by permission of ACM for your personal use. Not for redistribution.

Benefits of Solid-State Storage

Building a Private Cloud with Eucalyptus

Essentials Guide CONSIDERATIONS FOR SELECTING ALL-FLASH STORAGE ARRAYS

White paper. QNAP Turbo NAS with SSD Cache

Cartal-Rijsbergen Automotive Improves SQL Server Performance and I/O Database Access with OCZ s PCIe-based ZD-XL SQL Accelerator

Solid State Technology What s New?

Accomplish Optimal I/O Performance on SAS 9.3 with

Buffer-Aware Garbage Collection for NAND Flash Memory-Based Storage Systems

In-Block Level Redundancy Management for Flash Storage System

COS 318: Operating Systems. Storage Devices. Kai Li and Andy Bavier Computer Science Department Princeton University

A High-Throughput In-Memory Index, Durable on Flash-based SSD

Nasir Memon Polytechnic Institute of NYU

Performance of vsphere Flash Read Cache in VMware vsphere 5.5

IBM Systems and Technology Group May 2013 Thought Leadership White Paper. Faster Oracle performance with IBM FlashSystem

Certification Document macle GmbH Grafenthal-S1212M 24/02/2015. macle GmbH Grafenthal-S1212M Storage system

Comparison of Hybrid Flash Storage System Performance

Best Practices for Optimizing SQL Server Database Performance with the LSI WarpDrive Acceleration Card

Delivering Quality in Software Performance and Scalability Testing

OCZ s NVMe SSDs provide Lower Latency and Faster, more Consistent Performance

Transcription:

Why are state-of-the-art flash-based multi-tiered storage systems performing poorly for HTTP video streaming? Moonkyung Ryu Hyojun Kim Umakishore Ramachandran Georgia Institute of Technology

Contents Background Flash Memory SSD Adap:ve HTTP Video Streaming Why Mul:- :ered Storage System? State- of- the- art Mul:- :ered Storage Systems Evalua:on Analysis Recommenda:ons Conclusion

Flash Memory Access Time Type Page Read (us) Page Write (us) Block Erase (ms) SLC 25 250 1.5 MLC 60 900 3.5 Page size 4 ~ 8 KB Block size 256 ~ 512 pages Constraints Read/Write in a PAGE unit Erase in a BLOCK unit No in- place update No write on a page unless it s clean Limited erase count SLC (100,000), MLC (10,000) Wear- leveling

Flash Memory Flash Transla:on Layer Mapping table on RAM Virtual address (exposed to upper level) à physical address (on flash chips) Page/Block/Hybrid mapping Hides erase opera:on to upper level Logging technique Over- provisioning 5 ~ 10 % for low- end SSD 30 ~ 50 % for high- end SSD Garbage Collec:on Cri:cal to performance

Flash Memory Garbage Collec:on V I I V I I V I V V I V V V C I V C V V V V I C This block is to be recycled. (3 valid pages and 5 invalid pages) C V V C V V C I Valid page Invalid page Clean page

Flash Memory Garbage Collec:on I I I I I I I I V V I V V V V I Valid pages are copied to clean pages. V C V V V V I V V V V C V V C I Valid page Invalid page Clean page

Flash Memory Garbage Collec:on C C C C C C C C V V I V V V V I V C V V V V I V The block is then erased. Overheads: Valid page copying Block erasing Mapping table update V V V C V V C I Valid page Invalid page Clean page

Flash Memory Pros Fast random read Fast sequen:al read/write Low power/heat No vibra:on or noise Cons Poor small random write Limited life :me Unpredictable/Uncontrollable GC Higher cost/gb than HDDs

Adaptive HTTP Streaming Streaming by WEB servers Client adap:vely selects different bitrate segments NETFLIX CDN Caching Servers HTTP GET

Why Multi-tiered Storage? Highly skewed video access paiern Cheap MLC Flash Memory Low power and heat 4 3.5 3 stream / $ 2.5 2 1.5 1 0.5 0 HDD only HDD + DRAM HDD + Flash SSD Reference: Ryu et al., Impact of Flash Memory on Video- on- Demand Storage, In Proc. of MMSys, 2011

Flash Caching Systems Flashcache Developed by Facebook Block- device solu:on ZFS Developed by SUN for SOLARIS File- system solu:on L2ARC is read cache ZIL is write cache

Evaluation Apache web server Linux kernel 2.6.32 Workload Video sequence Valkaama (78 mins, 1.06 GB) 2 Mbps on average 10 sec segments (466 total segments) 300 video objects Zipf 0.271 Hardware Xeon 2.26 GHz Quad core 4 GB RAM Two 7200 RPM HDDs Linux Soqware RAID- 0 SSDs INTEL X25- M G1 (60 / 80 GB) OCZ Core V2 (60 / 124 GB)

Performance Flashcache and ZFS are even worse than HDDs Only!

Flashcache Analysis Severe small random reads/writes Read Write

Flashcache Analysis Severe small random reads/writes Frequent GC in SSD

Flashcache Analysis Cache hit ra:o Cold cache: 0.7% Warm cache: 25.7% Reasons for poor performance Writes to flash memory is on the cri:cal path Small random writes fire frequent GC No priority for reads over writes

ZFS L2ARC ARC MFU ARC MRU Headroom L2ARC Feed Thread L2ARC Write Hand L2ARC Evicts ahead Flash writes are separated from data serving FIFO replacement policy

ZFS Analysis Good design decisions Write access paiern to flash is sequen:al Read Write Flash writes are not on the cri:cal path

ZFS Analysis Reasons for poor performance FIFO replacement policy for L2ARC Ideal for flash memory write throughput Avoids frequent GC However, does not capture video popularity Low hit ra:o (11.1 %)

Recommendations No small random writes No flash writes on the cri:cal path Higher priority for reads Flash reads (serving video) should have higher pri ority than flash writes (caching to flash) Object- level caching MLC flash memory is sufficiently cost- effec:ve to hold hot videos as a whole Very good hit ra:o (55.4 %) with skewed video acc ess paiern (zipf 0.271)

Conclusion Flashcache and ZFS are not properly designed for HTTP video streaming New design for mul:- :ered video storage syst em is needed

Thank You!