1
VNX2 BEST PRACTICES FOR EFFICIENCIES: FLASH, DEDUPLICATION & COMPRESSION RYAN POULIN CORPORATE SYSTEMS ENGINEERING CORE TECHNOLOGIES DIVISION, VNX BU 2
ROADMAP INFORMATION DISCLAIMER EMC makes no representation and undertakes no obligations with regard to product planning information, anticipated product characteristics, performance specifications, or anticipated release dates (collectively, Roadmap Information ). Roadmap Information is provided by EMC as an accommodation to the recipient solely for purposes of discussion and without intending to be bound thereby. Roadmap information is EMC Restricted Confidential and is provided under the terms, conditions and restrictions defined in the EMC Non- Disclosure Agreement in place with your organization. 3
VNX Unified Hybrid Flash! All Workloads No need for Workload Segregation Unified Storage File and Block Cloud Ready Private or Hybrid Cloud Hybrid Flash Performance of Flash with the Cost of Disk 4
AGENDA Introduction VNX Efficiency Technologies VNX File Deduplication & Compression VNX Block Compression VNX2 Block Deduplication New Feature Coming Soon! 5
STORAGE EFFICIENCY CHALLENGES Relentless growth High storage growth rate Overallocation Unused, allocated storage Inefficient utilization Allocated storage used to store inactive or duplicate files Relentless information growth 50%+ 100%+ 50%+ 1996 2000 Future High percentage of stranded storage Storage allocated for unused file system space Free space ~30% Used space ~70% Thin Provisioning High percentage of wasted space Storage consumed by inactive files and multiple copies of identical files FAST VP ~90% inactive data ~10% active data 6
STORAGE EFFICIENCY SOLUTIONS THIN PROVISIONING, COMPRESSION, & DEDUPLICATION Thin Provisioning Compression/Deduplication Storage allocated on as-needed basis Unallocated 30 GB Allocated 70 GB Redundant data minimized Unallocated space 30 GB Freed space 30 GB Used space 40 GB 100 GB = 70 GB! 100 GB = 40 GB! 7
VNX FILE DEDUPLICATION & COMPRESSION 8
VNX FILE DEDUPLICATION METHOD SINGLE INSTANCE STORAGE OF FILES Stores one copy of each unique file Identifies matching files using digest or bit compare Stores file in hidden area Replaces individual copies with pointers to the hidden copy No processing required on read Write-split to avoid changing shared data 9
FILE DEDUPLICATION AND COMPRESSION Intelligent data selection Typically avoids active data Option to target active files Compression support for VMs through the VSI Plug-in for VMware vsphere End-user file-level activation Tunable options by file system Size of files Age of files File extension Directory filtering ~100 GB Active data ~100 GB Active data 1 TB Traditional file data ~400 GB Dedupe and compressed data ~900 GB Inactive, aged data, or specifically targeted data Up to 500 GB savings Internal policy engine Runs in the background Throttles to avoid negative impact on client services ~ 500 GB File-level Deduplicationenabled file system Up to 50% savings 10
STORAGE OPTIMIZATION WITH VMWARE PLUG-IN INCREASE STORAGE EFFICIENCY WHILE MAINTAINING OPERATION INTEGRITY VMDK Files Read request Write operation VMware Administrator selects those virtual machine(s) to be optimized Leverages compression engine active file support VM file store is compressed by up to 50% of original size Reads are performed by uncompressing only the portion of the file store requested Writes are committed to a set-aside file Periodically, writes are compressed into the original file Performance impact is typically less than 10% 11
FILE DEDUPLICATION AND COMPRESSION CONFIGURING WITH WINDOWS EXPLORER 12
VNX BLOCK COMPRESSION 13
COMPRESSION Shrinks existing data Manipulates data Identifies bit patterns in data Replaces with run length encoding Stored as offset, length pairs Requires decompression on read May reduce data read from disk 14
VNX BLOCK COMPRESSION OVERVIEW & BEST PRACTICES Can enable Compression on Storage Pool and Classic LUNs Thick LUNs become Thin Start with a Thin LUN if planning to use Block Compression Classic LUNs will be migrated to a pool Data is compressed during migration to pool 64 KB increments Will only compress data if 8 KB or more can be saved Intended for relatively inactive LUNs, such as backup copies, archives, and static data repositories Host reads are decompressed in memory, but left compressed on disk New host writes are not compressed on the fly Recompressed as background task 15
BLOCK COMPRESSION LIFECYCLE Initial Compression Turn on compression Compress entire LUN Decompression Turn off compression Uncompress whole LUN Uncompressed Compressed Decompressed Recompression Triggered by threshold Initial compression R1 R2 R3 Decompression t 0 16
VNX2 BLOCK DEDUPLICATION 17
VNX2 BLOCK DEDUPLICATION OVERVIEW HOW IS VNX2 BLOCK DEDUPLICATION IMPLEMENTED? Run on a pool by pool basis C D B A Utilizes fixed-block deduplication 8 KB blocks are compared and duplicates are eliminated A C B D A C B D LUN mappings are updated to point to the new location of the data Enabled on a LUN by LUN basis Pool LUNs only LUN becomes Thin A C B D 18
VNX2 BLOCK DEDUPLICATION OVERVIEW COMPONENTS Private Deduplication Space SPA SPB Storage Pool 1 LUN 0 LUN 2 LUN 4 LUN 1 LUN 3 LUN 5 Deduplication Container LUN 6 LUN 7 = SPA LUN = SPB LUN 19
VNX2 BLOCK DEDUPLICATION OVERVIEW THE VNX2 BLOCK DEDUPLICATION PROCESS Matching Hash 00f576947c1ab678900e 52036cc456171514b011 00f576947c1ab678900e f5483c51110d14455712 4bb363867cd111998240 4bb363867cd111998240 5672755bb23451576b12 52036cc456171514b011 5672755bb23451576b12 52036cc456171514b011 f5483c51110d14455712 5672755bb23451576b12 00f576947c1ab678900e 4bb363867cd111998240 5672755bb23451576b12 52036cc456171514b011 f5483c51110d14455712 00f576947c1ab678900e f5483c51110d14455712 Pages LUN map with is duplicate updated Digests of 8 KB areas The digests to reflect are compared shared are digests calculated then and sorted blocks When at the and enough bit de-allocation stored compared level blocks to ensure of are for space freed, duplicates no on collisions slices slice(s) are 1011000110110110000 returned to the pool 1011000110110110000 Deduplication Container Pool LUNs (Mapping) LUN 6 1 0 10 LUN 7 Slices (256MB) 20
VNX2 BLOCK DEDUPLICATION BEST PRACTICES Containing large amounts of: Duplicate and/or static data Experiencing less than 30% writes Where the workload is not: Sequential Large Block (>32 KB) Random IO Where Thin LUN performance is acceptable 21
IT S ALL ABOUT BALANCE = SPA LUN = SPB LUN SPA SPB Storage Pool 2 LUN 6 LUN 8 LUN 10 LUN 7 LUN 9 LUN 11 Deduplication Container Deduplication Private Space Storage Pool 1 Deduplication Private Space LUN 0 LUN 2 LUN 4 LUN 1 LUN 3 LUN 5 Deduplication Container 22
CREATING THE DEDUPLICATION CONTAINER 23
DEDUPLICATION SUMMARY WINDOW 24
STORAGE POOL PROPERTIES WINDOW DEDUPLICATION TAB 25
CREATING DEDUPLICATION ENABLED LUNS 26
ENABLE BLOCK DEDUPLICATION EXISTING LUNS 27
ENABLE BLOCK DEDUPLICATION EXISTING LUNS Private Deduplication Space SPA SPB Storage Pool 1 LUN 0 LUN 1 Deduplication Container LUN 1 Migrations complete. Deduplication is enabled on LUN 1. = SPA LUN = SPB LUN 28
ENABLE BLOCK DEDUPLICATION EXISTING LUNS 29
ENABLE VNX2 BLOCK DEDUPLICATION LUN OWNERSHIP 30
VNX2 BLOCK DEDUPLICATION FOR FILE COMING SOON TO VNX2 31
BLOCK DEDUPLICATION FOR VNX2 FILE OVERVIEW Provide the ability to utilize VNX2 Block Deduplication enabled LUNs as the storage for File based data Enabling/disabling of Block Deduplication is controlled by VNX2 File Removed the ability to enable/disable Block Deduplication on File LUNs directly Deduplication savings achieved at the Block Storage Pool level 32
VNX2 BLOCK DEDUPLICATION FOR FILE File System A E C E C F D B D G Pool LUNs (Mapping) Thin LUN Slice (256MB) A F E D C B E D C G 33
WHICH METHOD DO YOU CHOOSE? FILE-LEVEL DEDUPLICATION VS. FIXED-BLOCK DEDUPLICATION File-Level Deduplication Fixed-Block Deduplication Single instance storing of files Policy based deduplication - (Age of files/size of file/filtering) Savings returned to the File System Savings cannot span across File Systems Deduplication & Compression 8KB Fixed-Block deduplication Deduplication across all data regardless of age or usage Savings returned to the Deduplication LUNs or Pool Savings across all deduplication enabled LUNs within a pool - Potentially across multiple File Systems Deduplication Only 34
WANT TO LEARN MORE? EMC Online Support: EMC VNX2 Deduplication and Compression (White Paper) EMC VNX Unified Best Practices for Performance (White Paper) EMC VNX2 Block Deduplication Basic Overview YouTube: Next-Gen VNX Video - File Deduplication and Compression Next-Gen VNX Video - Block Data Compression Next-Gen VNX Video - Block Deduplication Overview Next-Gen VNX Video - Block Deduplication - Configuring Deduplication 35
VISIT EMC Products & Solutions Booth #813 RACE A CAR TO WIN RECHARGE YOUR DEVICES AND YOURSELF TAKE A DATA CENTER TOUR MEET WITH THE EXPERTS TAKE THE HANDS ON LABS* COLLECT YOUR EMC FLAIR TO WIN JOIN THE CONVERSATION @EMCProtectData @EMCStorage @EMCXtremIO * Located in Village Area 36
RELATED EMC WORLD SESSIONS Title Date Time VNX Hybrid Flash: Designing For FAST Cache & FAST VP On Unified Storage Monday Wednesday 12:00 PM 3:00 PM VNX Hybrid Flash File Deep Dive Monday Thursday 1:30 PM 10:00 AM 37
Session Name Date Time VNX & VNXe Disaster Recovery Strategies (On-Array, Off-Array & DR to the Cloud) VNX: Effectively Monitoring & Reporting On Your VNX Family Systems Best Practices For Deploying VNX In The Hybrid Cloud VNX Family For Virtualized Environments: New Features, Solutions, & Future Directions Virtual VNX: Overview, Architecture & Use Cases VNX Family: Systems, Architecture & Solutions VNX Hybrid Flash Best Practices For Performance VNX Hybrid Flash File Deep Dive VNX Data Mobility Strategies, Tools, & Considerations VNX Best Practices For Efficiencies: Flash, Deduplication & Compression VNX Hybrid Flash: Designing For FAST Cache & FAST VP On Unified Storage VNX & VNXe Security Best Practices VNX Best Practices For Running Microsoft Applications 5/5 (Tuesday) 5/7 (Thursday) 8:30AM 9:30 AM 11:30 AM 12:30 PM 5/4 (Monday) 3:00PM 4:00 PM 5/6 (Wednesday) 8:30AM 9:30 AM 5/5 (Tuesday) 3:00PM 4:00 PM 5/6 (Wednesday) 3:00PM 4:00 PM 5/4 (Monday) 4:30PM 5:30PM 5/7 (Thursday) 8:30AM 9:30 AM 5/5 (Tuesday) 12:00PM 1:00PM 5/6 (Wednesday) 4:30PM 5:30PM 5/4 (Monday) 12:00PM 1:00PM 5/6 (Wednesday) 12:00PM 1:00PM 5/4 (Monday) 3:00PM 4:00 PM 5/6 (Wednesday) 8:30AM 9:30 AM 5/4 (Monday) 1:30PM 2:30PM 5/7 (Thursday) 10:00AM 11:00AM 5/5 (Tuesday) 3:00PM 4:00 PM 5/7 (Thursday) 11:30 AM 12:30 PM 5/6 (Wednesday) 12:00PM 1:00PM 5/7 (Thursday) 8:30AM 9:30 AM 5/4 (Monday) 12:00PM 1:00PM 5/6 (Wednesday) 3:00PM 4:00 PM 5/4 (Monday) 8:30AM 9:30 AM 5/7 (Thursday) 1:00PM 2:00PM 5/5 (Tuesday) 4:30PM 5:30PM 5/7 (Thursday) 1:00PM 2:00PM 38
QUESTIONS? 39