Understanding Enterprise NAS

Similar documents
ADVANCED DEDUPLICATION CONCEPTS. Larry Freeman, NetApp Inc Tom Pearce, Four-Colour IT Solutions

Accelerating Applications and File Systems with Solid State Storage. Jacob Farmer, Cambridge Computer

Introduction to NetApp Infinite Volume

Introduction to Data Protection: Backup to Tape, Disk and Beyond. Michael Fishman, EMC Corporation

UNDERSTANDING DATA DEDUPLICATION. Thomas Rivera SEPATON

High Performance Computing OpenStack Options. September 22, 2015

Storage Management Best Practices & Tips

UNDERSTANDING DATA DEDUPLICATION. Tom Sas Hewlett-Packard

How it can benefit your enterprise. Dejan Kocic Hitachi Data Systems (HDS)

UNDERSTANDING DATA DEDUPLICATION. Jiří Král, ředitel pro technický rozvoj STORYFLEX a.s.

VDI Optimization Real World Learnings. Russ Fellows, Evaluator Group

Introduction to Data Protection: Backup to Tape, Disk and Beyond. Michael Fishman, EMC Corporation

Scale and Availability Considerations for Cluster File Systems. David Noy, Symantec Corporation

BlueArc unified network storage systems 7th TF-Storage Meeting. Scale Bigger, Store Smarter, Accelerate Everything

How it can benefit your enterprise. Dejan Kocic Netapp

Cloud and Big Data initiatives. Mark O Connell, EMC

StarWind Virtual SAN for Microsoft SOFS

High Performance Server SAN using Micron M500DC SSDs and Sanbolic Software

Key Messages of Enterprise Cluster NAS Huawei OceanStor N8500

Restoration Technologies. Mike Fishman / EMC Corp.

Introduction to Data Protection: Backup to Tape, Disk and Beyond

THE EMC ISILON STORY. Big Data In The Enterprise. Copyright 2012 EMC Corporation. All rights reserved.

Storage Cloud Environments. Alex McDonald NetApp

LEVERAGING FLASH MEMORY in ENTERPRISE STORAGE. Matt Kixmoeller, Pure Storage

MaxDeploy Hyper- Converged Reference Architecture Solution Brief

Creating a Catalog for ILM Services. Bob Mister Rogers, Application Matrix Paul Field, Independent Consultant Terry Yoshii, Intel

Protect Data... in the Cloud

How To Write On A Flash Memory Flash Memory (Mlc) On A Solid State Drive (Samsung)

Red Hat Storage Server

HADOOP SOLUTION USING EMC ISILON AND CLOUDERA ENTERPRISE Efficient, Flexible In-Place Hadoop Analytics

Big Data Storage Options for Hadoop Sam Fineberg, HP Storage

The BIG Data Era has. your storage! Bratislava, Slovakia, 21st March 2013

EMC ISILON OneFS OPERATING SYSTEM Powering scale-out storage for the new world of Big Data in the enterprise

Symantec NetBackup Appliances

Trends in Application Recovery. Andreas Schwegmann, HP

10th TF-Storage Meeting

Best Practice and Deployment of the Network for iscsi, NAS and DAS in the Data Center

SSD and Deduplication The End of Disk?

IBM Storwize V7000 Unified and Storwize V7000 storage systems

VDI Without Compromise with SimpliVity OmniStack and Citrix XenDesktop

Cloud-integrated Storage What & Why

Backup for branch offices and compartment backups. Måns Höiom & Rikard Lindkvist

Actifio Big Data Director. Virtual Data Pipeline for Unstructured Data

Cloud-integrated Enterprise Storage. Cloud-integrated Storage What & Why. Marc Farley

Maxta Storage Platform Enterprise Storage Re-defined

MaxDeploy Ready. Hyper- Converged Virtualization Solution. With SanDisk Fusion iomemory products

Solid State Storage in a Hard Disk Package. Brian McKean, LSI Corporation

EMC ISILON SCALE-OUT STORAGE PRODUCT FAMILY

Building Optimized Scale-Out NAS Solutions with Avere and Arista Networks

WHITE PAPER. Get Ready for Big Data:

Microsoft SQL Server 2008 R2 Enterprise Edition and Microsoft SharePoint Server 2010

Microsoft Private Cloud Fast Track

Microsoft Windows Server Hyper-V in a Flash

Distributed File System Choices: Red Hat Storage, GFS2 & pnfs

Sujee Maniyam, ElephantScale

NEXENTA S VDI SOLUTIONS BRAD STONE GENERAL MANAGER NEXENTA GREATERCHINA

Backup to Tape, Disk and Beyond. Jason Iehl, NetApp

NEXT GENERATION EMC: LEAD YOUR STORAGE TRANSFORMATION. Copyright 2013 EMC Corporation. All rights reserved.

New Features in PSP2 for SANsymphony -V10 Software-defined Storage Platform and DataCore Virtual SAN

Demystifying Deduplication for Backup with the Dell DR4000

WAN Optimization and Cloud Computing. Josh Tseng, Riverbed

Nexenta Performance Scaling for Speed and Cost

How To Migrate To A Network (Wan) From A Server To A Server (Wlan)

EMC Unified Storage for Microsoft SQL Server 2008

Simple. Extensible. Open.

Server and Storage Consolidation with iscsi Arrays. David Dale, NetApp

Whitepaper. NexentaConnect for VMware Virtual SAN. Full Featured File services for Virtual SAN

How To Test A Flash Storage Array For A Health Care Organization

June Blade.org 2009 ALL RIGHTS RESERVED

Cloud Storage Clients. Rich Ramos, Individual

ntier Verde Simply Affordable File Storage

Deploying Public, Private, and Hybrid Storage Clouds. Marty Stogsdill, Oracle

Technology Insight Series

Business Benefits of Data Footprint Reduction

Cloud OS Vision. Modern platform for the world s apps

Choosing Storage Systems

Future-Proofed Backup For A Virtualized World!

DataStax Enterprise, powered by Apache Cassandra (TM)

Seagate Cloud Systems & Solutions

Top Ten Questions. to Ask Your Primary Storage Provider About Their Data Efficiency. May Copyright 2014 Permabit Technology Corporation

INCREASING EFFICIENCY WITH EASY AND COMPREHENSIVE STORAGE MANAGEMENT

EMC VNX FAMILY. Copyright 2011 EMC Corporation. All rights reserved.

Everything you need to know about flash storage performance

RIVERBED EXTENDS FROM WAN OPTIMIZATION TO EDGE VIRTUAL SERVER INFRASTRUCTURE (EDGE-VSI)

FAS6200 Cluster Delivers Exceptional Block I/O Performance with Low Latency

EMC ISILON INSIGHTIQ Customizable analytics platform to accelerate workflows and applications on Isilon clusters

Transcription:

Anjan Dave, Principal Storage Engineer LSI Corporation Author: Anjan Dave, Principal Storage Engineer, LSI Corporation

SNIA Legal Notice The material contained in this tutorial is copyrighted by the SNIA unless otherwise noted. Member companies and individual members may use this material in presentations and literature under the following conditions: Any slide or slides used must be reproduced in their entirety without modification The SNIA must be acknowledged as the source of any material used in the body of any document containing material from these presentations. This presentation is a project of the SNIA Education Committee. Neither the author nor the presenter is an attorney and nothing in this presentation is intended to be, or should be construed as legal advice or an opinion of counsel. If you need legal advice or a legal opinion please contact your attorney. The information presented herein represents the author's personal opinion and current understanding of the relevant issues involved. The author, the presenter, and the SNIA do not assume any responsibility or liability for damages arising out of any reliance on or use of this information. NO WARRANTIES, EXPRESS OR IMPLIED. USE AT YOUR OWN RISK. 2

Abstract Understanding Enterprise Network Attached Storage (NAS) With the continuous growth of unstructured data, it has become paramount for enterprise storage stakeholders to understand the features and benefits of enterprise NAS solutions, and differentiate between the values of scale-out and scale-up NAS storage. This tutorial will help the audience gain insight into the usefulness and effectiveness of some of the key differentiating features of today s enterprise NAS offerings 3

Related Tutorials Check out SNIA Tutorial: What's Old is New Again - Storage Tiering Check out SNIA Tutorial: The File Systems Evolution Check out SNIA Tutorial: pnfs & NFSv4.2; a Filesystem for Grid, Virtualization and Database 4

Agenda NAS Overview Enterprise NAS Flavors Enterprise NAS Requirements Performance, Scalability Caching in NAS Storage Tiering in NAS Scale-Up NAS Solutions Scale-Out NAS Solutions About De-duplication 5

NAS Overview Traditional NAS what is it? Storage accessible over the network, usually over IP From simple partition sharing to traditional file servers to dedicated NAS appliances Well suited for workgroups/departments, SMB environments Can be multi-purpose (i.e. combine with email, print, other services) Not scalable in performance or capacity 6

NAS Overview Enterprise NAS what is it? Purpose built - to serve structured & unstructured data over one or more protocols Scalable in capacity and performance Higher performance Can support large number of clients Better redundancy Enterprise features - tiering, caching, de-duplication, multitenancy, replication, multi-protocol support, etc Well suited for high-performance needs, large datasets, large number of clients 7

NAS Overview Types of Enterprise NAS Solutions Traditional Scale up (add disks and upgrade NAS "head") Appliance based (with integrated or 3 rd party storage) Proprietary solutions (native file processing and storage) Open-Source software (with commodity hardware & storage) Clustered NAS Scale-Out Software based (or software on appliance) using commodity hardware (Commercial and Open-Source) Proprietary Integrated File Processing and Storage Special purpose Some software programmed into Integrated Circuits NAS Cloud Gateway NOTE: NAS Head refers to NAS Processing Unit, Understanding wherever Enterprise applicable NAS 8

NAS Overview - Examples Scale-Up Integrated NAS Scale-Out NAS Using Proprietary Clustered Nodes Each node has Disk Drives and does File & Network Processing Private Cluster Network Scale-Up Gateway NAS LAN Scale-Out NAS Using Commodity Servers and Storage Storage Network Storage Network 9

NAS Overview Enterprise NAS Features Today Clustered NAS (scale-up or scale-out) PetaByte scale, 1000s of disk drives Support for drive mixing Storage Tiering Enhanced Caching De-duplication (file or block level, or both) and compression Single Name Space Multi-Tenancy 10

Enterprise NAS Requirements Performance Requirements Types of workloads High Disk IOPs driven applications (disk driven) Very High NFS (or CIFS) Operations ('head' driven) Cache friendly or unfriendly workloads Small files Vs large files and Sequential Vs Random workflows Replication and Backup workloads De-duplication and other overheads 11

Enterprise NAS Requirements Performance Areas to Watch "Head" Vs "Disk" contention Dedup, replication, backups could cause contention on head or disks or both Head memories - how much, where? File Processing, FC, CPU, Network processing, NDMP Caching (more on this in other slides) 'Front-end' Cache to accelerate performance (comes in various forms) could become an overhead Are the workloads cache friendly? Does the system allow cache utilization reporting? Is it tunable? 12

Enterprise NAS Requirements Performance Areas to Watch Contention on the LAN Load balancing across multiple 1GigE trunks using Link Aggregation Use 10GigE if possible, combine with 1GigE for failover 13

Enterprise NAS Requirements How to gauge performance? Create your own performance benchmarks to find a baseline Find worst case and best case scenarios based on system configurations Put it to test for worst real-world workloads Do this in "production" configuration, not in test configuration Test for performance during a component failure scenario Combine it with backups, replication, dedup, etc Understand the penalties of failover of "head" 14

Enterprise NAS Requirements Enterprise NAS Requirements - Scalability Long Term Concerns What is your growth rate? Can you visualize your NAS/Storage infrastructure in next 5 years? How do you plan to do Data Management across the entire NAS footprint? How would you manage your NAS Infrastructure today and in 5 years? 15

Enterprise NAS Requirements Scale-Up NAS Silos... Scale-Up NAS system 1 25TB SAS Only Feature set 1 Scale-Up NAS system 2 30TB SAS+NL-SAS Feature set 2 Scale-Up NAS system 3 35TB SAS Only Scale-Up NAS system 4 60TB Scale-Up NAS system 255 30TB Feature set N Separate NAS Entities resulting in storage inefficiencies and sprawl over time (Example capacities) LAN 16

Enterprise NAS Requirements Short Term Concerns Are you adding to or replacing existing solution? How much performance and disk capacity are you planning for? How would you add more performance? How would you add more capacity? Independently or tied to performance? How much capacity is too much on a single system? 17

Caching in NAS About "Front-End" Cache Implementations Examples - External Caching Device - between clients and NAS Internal, based on PCIe SSD adapter Internal as a set of SSDs External Metadata Acceleration External Extended NAS configuration (hub and spoke) 18

Caching in NAS "Front-End" Cache Considerations Usually, 'learning' or adaptation period is involved Can take minutes to weeks depending on workloads Size of the cache is a factor Sizing front-end cache (if it's an option) against spinning disks can be challenging - due to costs Determine Workload suitability for "hit" rates Known or predictable Vs unknown workloads can impact cache efficiency Structured and unstructured data Reads Vs writes, random Vs sequential Serves Reads, Writes, or both? 19

Caching in NAS "Front-End" Cache Considerations PCIe and SSD implementations have difference in performance/scalability PCIe based usually is faster, but not scalable Is it volatile to power or system exceptions? Re-learning in case of power failures (can take a long time to relearn, volatile Vs non-volatile) Data promotion/eviction or maintaining multiple cache copies across cluster nodes could become an overhead for certain workloads 20

Caching in NAS About Storage Tiering and Caching Both could be SSD implementations Cache is reactive, tiering can be both reactive and proactive based on policies Caching is to Accelerate Data, Tiering is to Manage Data Caching is workload dependent, Tiering is time dependent Caching and Tiering complement each other, not compete 21

Storage Tiering In NAS Can be block or file based, automatic or manual Beneficial if system intelligently places data for you Even better if it works across discrete systems - automatically! Far better if tiered storage is file-system aware Backup Performance in block-based tiered Storage can be a challenge Sizing the disk tiers for balance of performance and cost is the key - be conservative initially 22

Storage Tiering In NAS Tier0 in NAS (Tier0 = SSD for this discussion) Best if Tier0 is space is managed automatically Estimating Tier0 capacity is the key Estimate against lower tiers Between 1 to 10% being standard Significant cost differences between 3% and 10% Tier0 Sizing active data appropriately is important Getting numbers from backup related stats (or replication deltas) could be useful 23

Storage Tiering In NAS Reporting & Monitoring of Automatic Tiering How effective is Tier0 (since it's most expensive)? Consider Hits and Capacity Utilization Which workloads are being benefitted? Can you establish a correlation between blocks and files? Are there levers to fine-tune the process and policies? 24

Scale-Up NAS Scale-up or Vertically Scalable (typically only disks) For high performance cannot add large amount of disks Overruns the Storage/NAS Processor Storage Processor and/or NAS Heads do not scale Could be upgraded though Creates a big sprawl of independent NAS systems Results in disk space inefficiencies Good all-in-one/multi-purpose solutions Multi-protocol support is good Not focused on one type of workload Could be designed for specific workloads 25

Scale-Up Feature rich - snapshots, dedup, replication, VMware integration, etc Multiple storage buckets, not all in one solution Flexibility with tiered storage Block and file can be unified Can be open-source, commercial or a combination Tiering across individual NAS systems is a challenge 26

Scale-Out NAS Scale-out or Horizontally scalable (typically capacity and performance) - usually are clustered NAS Multiple redundant nodes performing file processing and/or disk functions Scale-out NAS origins Typical Use Cases in - Media, Entertainment, Oil & Gas exploration, Imaging (high bandwidth) Usually associated with High-Performance Computing Usually distributed filesystem Performance independent redundancy Data striped across multiple nodes 27

Scale-Out NAS Data and metadata could be separated One large storage bucket Could use commodity hardware and storage Can be open-source, commercial, or a combination Could provide higher aggregate performance Single-client performance usually not impacted much High-performance, mainly benefiting large files Massive scalability for disk capacity Usually serves a single purpose 28

Scale-Out NAS May increase networking requirements (due to more nodes) May increase power/cooling needs depending on the configuration Could get better disk utilization as the system grows 29

Scale-Up Vs Scale-Out NAS When investigating scale-out Vs scale-up, check your requirements and concerns What type of performance are you looking for (small file, large file workloads) Snapshot capabilities Uptime requirements (SLAs), and vendor support Are you combining file and block on the same system Replication, Mirroring Protocol support (NFS, CIFS, FTP, HTTP, etc) Backing up the data to tape 30

Scale-Up Vs Scale-Out NAS Existing Infrastructure and protocols in place for LAN/SAN, and Application layer) the ones that are in use and IT teams are familiar with - or not in place, i.e., additional costs to support it Multi-tenancy support De-duplication, compression, Virtualization environment integration 31

De-duplication on NAS De-duplication & Compression Know the differences Compression - removal of redundant bits File Dedup - removal of redundant files Block/Data Dedup - removal of redundant blocks Check what is offered Dedup only (file or block) - good Dedup (file or block) + compression - better Dedup (File or block) + compression - best 32

De-duplication on NAS De-duplication not for everyone May put performance penalty on CPU and memory May impact user response times on active filesystems Storage space gains can be realized for less active filesystems, but could be at the cost of additional data management 33

De-duplication on NAS Challenging to track on/off/status/gains for 100s of active filesystems Under what scenarios (backups, mirroring, snapshots) and workloads hydration is triggered Space gains from file Vs block dedup could vary additional 2% gain in a large environment is significant 34

In Summary Many Enterprise NAS flavors available today Know your requirements to find the right solution Plan for long term - scalability, stability, data management Scale-up and Scale-out both have their use cases 35

Q&A / Feedback Many thanks to Many thanks to the following individuals for their contributions to this tutorial. - SNIA Education Committee Dr. Joseph White Simon Gordon Send any questions or comments on this presentation to SNIA: tracktutorials@snia.org 36