LEVERAGING FLASH MEMORY in ENTERPRISE STORAGE. Matt Kixmoeller, Pure Storage

Similar documents
How it can benefit your enterprise. Dejan Kocic Hitachi Data Systems (HDS)

How it can benefit your enterprise. Dejan Kocic Netapp

Accelerating Applications and File Systems with Solid State Storage. Jacob Farmer, Cambridge Computer

How To Write On A Flash Memory Flash Memory (Mlc) On A Solid State Drive (Samsung)

Application-Focused Flash Acceleration

Solid State Storage in a Hard Disk Package. Brian McKean, LSI Corporation

SSD and Deduplication The End of Disk?

WHITE PAPER. Drobo TM Hybrid Storage TM

HPe in Datacenter HPe3PAR Flash Technologies

Lab Evaluation of NetApp Hybrid Array with Flash Pool Technology

A Close Look at PCI Express SSDs. Shirish Jamthe Director of System Engineering Virident Systems, Inc. August 2011

Flash-optimized Data Progression

Flash Memory Arrays Enabling the Virtualized Data Center. July 2010

Automated Data-Aware Tiering

The Data Placement Challenge

VDI Optimization Real World Learnings. Russ Fellows, Evaluator Group

June Blade.org 2009 ALL RIGHTS RESERVED

DIABLO TECHNOLOGIES MEMORY CHANNEL STORAGE AND VMWARE VIRTUAL SAN : VDI ACCELERATION

FLASH IMPLICATIONS IN ENTERPRISE STORAGE ARRAY DESIGNS

Understanding Enterprise NAS

HP 3PAR StoreServ 8000 Storage - what s new

Deploying Flash in the Enterprise Choices to Optimize Performance and Cost

Flash In The Enterprise

The Revival of Direct Attached Storage for Oracle Databases

Leveraging EMC Fully Automated Storage Tiering (FAST) and FAST Cache for SQL Server Enterprise Deployments

High Performance Computing OpenStack Options. September 22, 2015

Accelerating Server Storage Performance on Lenovo ThinkServer

White Paper. Educational. Measuring Storage Performance

Flash 101. Violin Memory Switzerland. Violin Memory Inc. Proprietary 1

Make A Right Choice -NAND Flash As Cache And Beyond

Solid State Drive Architecture

MaxDeploy Ready. Hyper- Converged Virtualization Solution. With SanDisk Fusion iomemory products

Visions for Ethernet Connected Drives. Vice President, Dell Oro Group March 25, 2015

The Technologies & Architectures. President, Demartek

SUN STORAGE F5100 FLASH ARRAY

Sistemas Operativos: Input/Output Disks

FUSION iocontrol HYBRID STORAGE ARCHITECTURE 1

Microsoft SQL Server 2014 Fast Track

Introduction to Data Protection: Backup to Tape, Disk and Beyond. Michael Fishman, EMC Corporation

THE SUMMARY. ARKSERIES - pg. 3. ULTRASERIES - pg. 5. EXTREMESERIES - pg. 9

Scaling from Datacenter to Client

Total Cost of Solid State Storage Ownership

EMC XtremSF: Delivering Next Generation Performance for Oracle Database

Creating a Catalog for ILM Services. Bob Mister Rogers, Application Matrix Paul Field, Independent Consultant Terry Yoshii, Intel

How To Create A Flash-Enabled Storage For Virtual Desktop 2.5 (Vdi) And 3.5D (Vdi) With Nimble Storage

SQL Server Virtualization

All-Flash Storage Solution for SAP HANA:

Introduction to Data Protection: Backup to Tape, Disk and Beyond. Michael Fishman, EMC Corporation

Accelerating Enterprise Applications and Reducing TCO with SanDisk ZetaScale Software

Taking Linux File and Storage Systems into the Future. Ric Wheeler Director Kernel File and Storage Team Red Hat, Incorporated

Restoration Technologies. Mike Fishman / EMC Corp.

SNIA Solid State Storage Performance Test Specification. Easen Ho CTO, Calypso Systems, Inc.

Comparison of Hybrid Flash Storage System Performance

Intel RAID SSD Cache Controller RCS25ZB040

89 Fifth Avenue, 7th Floor. New York, NY White Paper. HP 3PAR Adaptive Flash Cache: A Competitive Comparison

Intel Solid- State Drive Data Center P3700 Series NVMe Hybrid Storage Performance

Direct NFS - Design considerations for next-gen NAS appliances optimized for database workloads Akshay Shah Gurmeet Goindi Oracle

Understanding endurance and performance characteristics of HP solid state drives

Solid State Drive vs. Hard Disk Drive Price and Performance Study

COS 318: Operating Systems. Storage Devices. Kai Li and Andy Bavier Computer Science Department Princeton University

Everything you need to know about flash storage performance

EMC VFCACHE ACCELERATES ORACLE

AirWave 7.7. Server Sizing Guide

Trends in Application Recovery. Andreas Schwegmann, HP

Evaluation Report: Supporting Microsoft Exchange on the Lenovo S3200 Hybrid Array

BUYING PROCESS FOR ALL-FLASH SOLID-STATE STORAGE ARRAYS

Evaluation Report: Accelerating SQL Server Database Performance with the Lenovo Storage S3200 SAN Array

Converged storage architecture for Oracle RAC based on NVMe SSDs and standard x86 servers

Sujee Maniyam, ElephantScale

Flash at the price of disk Redefining the Economics of Storage. Kris Van Haverbeke Enterprise Marketing Manager Dell Belux

HyperQ Storage Tiering White Paper

Improving IT Operational Efficiency with a VMware vsphere Private Cloud on Lenovo Servers and Lenovo Storage SAN S3200

Scale and Availability Considerations for Cluster File Systems. David Noy, Symantec Corporation

Understanding the Economics of Flash Storage

21 st Century Storage What s New and What s Changing

EMC XTREMIO EXECUTIVE OVERVIEW

WHITE PAPER Addressing Enterprise Computing Storage Performance Gaps with Enterprise Flash Drives

Deploying Affordable, High Performance Hybrid Flash Storage for Clustered SQL Server

EMC XtremSF: Delivering Next Generation Storage Performance for SQL Server

Maxta Storage Platform Enterprise Storage Re-defined

How To Scale Myroster With Flash Memory From Hgst On A Flash Flash Flash Memory On A Slave Server

MS Exchange Server Acceleration

Analyzing Big Data with Splunk A Cost Effective Storage Architecture and Solution

Assessing RAID ADG vs. RAID 5 vs. RAID 1+0

EMC SOLUTION FOR SPLUNK

Transcription:

LEVERAGING FLASH MEMORY in ENTERPRISE STORAGE Matt Kixmoeller, Pure Storage

SNIA Legal Notice The material contained in this tutorial is copyrighted by the SNIA unless otherwise noted. Member companies and individual members may use this material in presentations and literature under the following conditions: Any slide or slides used must be reproduced in their entirety without modification The SNIA must be acknowledged as the source of any material used in the body of any document containing material from these presentations. This presentation is a project of the SNIA Education Committee. Neither the author nor the presenter is an attorney and nothing in this presentation is intended to be, or should be construed as legal advice or an opinion of counsel. If you need legal advice or a legal opinion please contact your attorney. The information presented herein represents the author's personal opinion and current understanding of the relevant issues involved. The author, the presenter, and the SNIA do not assume any responsibility or liability for damages arising out of any reliance on or use of this information. NO WARRANTIES, EXPRESS OR IMPLIED. USE AT YOUR OWN RISK. 2

Abstract This session is for Storage Administrators and Application Architects seeking to understand how to best take advantage of flash memory in enterprise storage environments. The relative advantages of flash tiering, caching and all-flash approaches will be considered, across the dimensions of performance, cost, reliability and predictability. 3

Agenda The Storage I/O Crisis How to Evaluate Flash Solutions Approaches for Leveraging Flash PCI cards Flash caching in arrays Flash tiering in arrays All-flash arrays Analyzing Flash ROI 4

Moore s Law vs. Newton s Law Moore s Law: 58% CAGR HDD Areal Density: 40-100% CAGR 10 HDD Latency (Seek Time) -3% CAGR ms 5 Sources: Intel, IDEMA, IBM Research 0 2006 2008 2011 2013 5

Rotating Disk Challenges Deconstructing a Random Read Typical Resulting Duty Pattern: 1 Spin disk ~2ms 3 Read Data ~0.04ms Data Data Data Data S E E K S E E K S E E K 2 Actuate arm ~4ms Time Disks spend >95% of their time seeking and rotating, not delivering data!

Randomization: The I/O Blender Traditional Architecture Virtualized / Consolidated Architecture Single-workload Serialized Cached Optimized Multi-workload No cross-vm optimization Highly randomized 7

Summary: The Storage I/O Crisis Performance Demands Randomization: The I/O Blender IOPS / TB + + Multi-core CPUs Data growth Instant user expectations Virtualization Data consolidation Cloud architectures 95% time seeking/rotating IOPS/TB dropping 8

Understanding Your Application s I/O Fingerprint 1Ks-10Ks IOPS I/O Load 10Ks-100Ks IOPS Small Block <8K Block Size Large Block 10sK - Ms Sequential, Predictable, Cacheable Locality of Access Random, Unpredictable Write- Centric Access Pattern Read- Centric 10s of ms, Variability OK Latency Sensitivity <1-3 ms, Consistency Key 9

How to Evaluate Flash Alternatives Cost: $/GB? $/GB: Managed + Operations team $/GB: Operational + Datacenter space + Power / Cooling $/GB: Protected + Snapshot / replication software + Snapshot / replication copies + Backup software and media Performance Read IOPS Write IOPS Latency Bandwidth $/IOP Protection How to make HA? How to backup? How to manage drive loss? Power & Size Rackspace / floor space Power consumption Cooling $/GB: Usable + Cost of RAID parity + Over-provisioning waste + Management software $/GB: Raw + Cost of raw disk / flash Operational Simplicity Makes operations more or less complex? Increases TBs/admin managed? Integration Fits in current architecture? Requires process change? 10

Flash Architecture Approaches Server-Attach PCI Flash Array with Flash Cache Array with Flash Tier All-Flash LUN or Array PCI card in application server Host CPU typically used for flash management 100s of GB Flash as a read and/or write cache in array, or to cache FS metadata All data persisted to spinning disk Typically <1% - 5% of total capacity Sub-LUN/FS tiering, where a LUN or FS is spread across flash and disk Hot blocks/files moved to flash, cold left on disk Typically 1% - 10% of total capacity 100% flash LUN or array May or may not use DRAM for read/write caching 11

Host-Based PCI Flash Overview: Host-based PCI flash cards, typically 100s of GB in size Either non-raided, or two cards mirrored in the server Either non-ha, or application clustered across multiple servers Leverages host CPU for flash management Server-Attach PCI Flash High-level Benefits: Highest-performance flash architecture possible Eliminates the cost/burden of shared storage Ultra-small footprint Potentially allows for the reduction of host DRAM High-level Challenges: Very high-cost, creates islands of expensive flash in every server Requires a re-architecture for most enterprises Expensive and difficult to protect (HA, backup) Requires mirroring for HA across multiple servers Requires app-level or external replication for DR 12

Array with Flash Cache Overview: Expansion of array s DRAM cache with flash Implemented either via controller-connected PCI flash cards, or SSDs in drive bays Typically read-cache only, although some implementations of write cache as well Cache page size / caching scheme varies by array (typically 4-16K) Array with Flash Cache High-level Benefits: Expands array s cache buffer from <<1% of total storage to 1-5% 10 latency improvement on IOs which hit the flash cache (10+ms <1ms) Off-loading these IOs from disk reduces the load on disk and also makes the disk perform better Result: depends completely on cacheability of I/O stream, but typically 30-80% performance improvement High-level Challenges: Improvement depends completely on cacheability of the I/O stream, results vary Flash as a cache heavily exercised: requires SLC flash Very expensive: $50-150/GB list prices typical from major vendors Cache requires time to warm to see the performance benefit, often doesn t persist across re-boots or HA events Minimal benefit for truly random I/O streams: random I/O can t be cached, it is random 13

Array with Flash Tier Overview: Creation of multiple tiers of storage (flash, enterprise FC/SAS HDD, SATA HDD) with the ability to spread a LUN / FS across them Typically implemented via 3.5 or 2.5 SSDs in the drive bays/shelves Generally 5-10% Flash, 90-95% disk in typical implementations Back-end storage is virtualized, and blocks/files are migrated up to flash or down to disk, based upon access patterns Chunk size for migration varies by vendor: typically MBs to GBs large Frequency varies by vendor: typically daily during low I/O windows Array with Flash Tier High-level Benefits: Allows one to leverage flash for IOPs, HDDs for capacity Less cost than an all-flash solution, blends the economics of flash and HDD Suitable for MLC and SLC implementations of flash Potentially allows for significantly less disk / footprint / power if disk has been over-built for performance High-level Challenges: Success depends on the predictability/randomness of the I/O stream Only works well if the hot blocks are consistent Current implementation block sizes are large; a single hot block could pin a whole MB or GB chunk in flash Not a real-time adaptable solution, chunks only promoted/demoted once daily Difficult to setup and manage is it worth the complexity of another tier to manage? 14

All-Flash LUN or Array Overview: Creation of a LUN on 100% flash Typically implemented via 3.5 or 2.5 SSDs in the drive bays/shelves Suitable for MLC or SLC flash implementations High-level Benefits: 10x+ potential performance improvement vs. 1-2x for cached/tiered solutions Much more consistent latency: no cache-miss penalty, all IOs at the speed of flash (typically <1ms) Dramatically smaller size, allows for reduction of storage footprint by 4-5x Flash fast enough to eliminate the need for significant DRAM caching in the array RAID re-build times greatly improved due to the speed of flash (hours 10s of minutes) All-Flash LUN or Array High-level Challenges: Cost: flash varies from $20/GB - $150/GB depending on the vendor and type (MLC vs. SLC) Connectivity: most flash arrays have limited connectivity options compared to their more mature disk array alternatives HA & DR: varies by vendor, but the HA and DR models of these arrays are less mature than existing disk array alternatives 15

Understanding Flash ROI Cost: $/GB? Flash Impacts $/GB: Managed + Operations team $60 / GB 1-2x Reduction in management cost (performance troubleshooting) $/GB: Operational + Datacenter space + Power / Cooling $45 / GB 1-2x 2-5x reduction in space 2-5x reduction in power $/GB: Protected + Snapshot / replication software + Snapshot / replication copies + Backup software and media $30 / GB 2-3x 10x+ $/GB: Usable + Cost of RAID parity + Over-provisioning waste + Management software $10 / GB 2x 2-3x reduction in disk over-provisioning for performance $/GB: Raw + Cost of raw disk / flash $5 / GB 5-10x the raw cost Source: averaged / anonymized customer examples 16

Summary Datacenters are facing a storage I/O crisis + + Increasing Demand Randomization Shrinking IOPS/TB Understanding your application's I/O fingerprint is key to choosing the best flash strategy for your environment Flash should be evaluated on several dimensions, it s not all about performance Performance Cost Power & Size Integration Protection Operations Simplicity Look at the full picture (beyond $/GB raw!) to build your flash ROI 17

Visit the Hands-On Lab Check out the Hands-On Lab: Solid State Storage in the Enterprise 18

Q&A / Feedback Please send any questions or comments on this presentation to SNIA: tracksolidstate@snia.org Matt Kixmoeller Many thanks to the following individuals for their contributions to this tutorial. - SNIA Education Committee 19

Q&A / Feedback Please send any questions or comments on this presentation to SNIA: tracksolidstate@snia.org 20