Storage Management Best Practices & Tips



Similar documents
Author: Mario Vosschmidt, SNIA Europe Director, LSI Presenter: Tomáš Martínek, Interim Chairman of Czech and Slovak Country Committe, NetApp

Understanding Enterprise NAS

Accelerating Applications and File Systems with Solid State Storage. Jacob Farmer, Cambridge Computer

Scale and Availability Considerations for Cluster File Systems. David Noy, Symantec Corporation

How To Write On A Flash Memory Flash Memory (Mlc) On A Solid State Drive (Samsung)

How it can benefit your enterprise. Dejan Kocic Netapp

ADVANCED DEDUPLICATION CONCEPTS. Larry Freeman, NetApp Inc Tom Pearce, Four-Colour IT Solutions

LEVERAGING FLASH MEMORY in ENTERPRISE STORAGE. Matt Kixmoeller, Pure Storage

The IntelliMagic White Paper: Storage Performance Analysis for an IBM Storwize V7000

FlexArray Virtualization

Server and Storage Virtualization with IP Storage. David Dale, NetApp

How it can benefit your enterprise. Dejan Kocic Hitachi Data Systems (HDS)

June Blade.org 2009 ALL RIGHTS RESERVED

Leveraging EMC Fully Automated Storage Tiering (FAST) and FAST Cache for SQL Server Enterprise Deployments

The IntelliMagic White Paper on: Storage Performance Analysis for an IBM San Volume Controller (SVC) (IBM V7000)

UNDERSTANDING DATA DEDUPLICATION. Thomas Rivera SEPATON

CASS COUNTY GOVERNMENT. Data Storage Project Request for Proposal

Introduction to Data Protection: Backup to Tape, Disk and Beyond. Michael Fishman, EMC Corporation

Introduction to Data Protection: Backup to Tape, Disk and Beyond. Michael Fishman, EMC Corporation

UNDERSTANDING DATA DEDUPLICATION. Tom Sas Hewlett-Packard

Agenda. Enterprise Application Performance Factors. Current form of Enterprise Applications. Factors to Application Performance.

Restoration Technologies. Mike Fishman / EMC Corp.

VDI Optimization Real World Learnings. Russ Fellows, Evaluator Group

WHITEPAPER: Understanding Pillar Axiom Data Protection Options

High Performance Computing OpenStack Options. September 22, 2015

MaxDeploy Ready. Hyper- Converged Virtualization Solution. With SanDisk Fusion iomemory products

The Data Placement Challenge

Trends in Application Recovery. Andreas Schwegmann, HP

The Revival of Direct Attached Storage for Oracle Databases

New Features in SANsymphony -V10 PSP1 Software-defined Storage Platform

IOmark- VDI. Nimbus Data Gemini Test Report: VDI a Test Report Date: 6, September

Quantum StorNext. Product Brief: Distributed LAN Client

Best Practices for Optimizing SQL Server Database Performance with the LSI WarpDrive Acceleration Card

UNDERSTANDING DATA DEDUPLICATION. Jiří Král, ředitel pro technický rozvoj STORYFLEX a.s.

Best Practice and Deployment of the Network for iscsi, NAS and DAS in the Data Center

Introduction to Data Protection: Backup to Tape, Disk and Beyond

Performance, Reliability, and Operational Issues for High Performance NAS Storage on Cray Platforms. Cray User Group Meeting June 2007

Data Center Convergence. Ahmad Zamer, Brocade

Solid State Storage in a Hard Disk Package. Brian McKean, LSI Corporation

VIDEO SURVEILLANCE WITH SURVEILLUS VMS AND EMC ISILON STORAGE ARRAYS

Enhancements of ETERNUS DX / SF

OPTIMIZING EXCHANGE SERVER IN A TIERED STORAGE ENVIRONMENT WHITE PAPER NOVEMBER 2006

Technology Insight Series

Using HP StoreOnce Backup systems for Oracle database backups

MaxDeploy Hyper- Converged Reference Architecture Solution Brief

Configuring Celerra for Security Information Management with Network Intelligence s envision

Maxta Storage Platform Enterprise Storage Re-defined

Using HP StoreOnce Backup Systems for NDMP backups with Symantec NetBackup

Trends in Data Protection and Restoration Technologies. Mike Fishman, EMC 2 Corporation (Author and Presenter)

ntier Verde Simply Affordable File Storage

Scalable NAS for Oracle: Gateway to the (NFS) future

EMC Backup and Recovery for Microsoft SQL Server 2008 Enabled by EMC Celerra Unified Storage

BlueArc unified network storage systems 7th TF-Storage Meeting. Scale Bigger, Store Smarter, Accelerate Everything

SoftLayer Fundamentals. Storage and Backup. August, 2014

IBM TSM DISASTER RECOVERY BEST PRACTICES WITH EMC DATA DOMAIN DEDUPLICATION STORAGE

Addendum No. 1 to Packet No Enterprise Data Storage Solution and Strategy for the Ingham County MIS Department

Direct NFS - Design considerations for next-gen NAS appliances optimized for database workloads Akshay Shah Gurmeet Goindi Oracle

EqualLogic PS Series Load Balancers and Tiering, a Look Under the Covers. Keith Swindell Dell Storage Product Planning Manager

Big Data Storage Options for Hadoop Sam Fineberg, HP Storage

SAN Conceptual and Design Basics

Q & A From Hitachi Data Systems WebTech Presentation:

Overview of I/O Performance and RAID in an RDBMS Environment. By: Edward Whalen Performance Tuning Corporation

Server and Storage Consolidation with iscsi Arrays. David Dale, NetApp

EMC Unified Storage for Microsoft SQL Server 2008

STORAGE CENTER. The Industry s Only SAN with Automated Tiered Storage STORAGE CENTER

Dell Compellent Storage Center SAN & VMware View 1,000 Desktop Reference Architecture. Dell Compellent Product Specialist Team

(Scale Out NAS System)

HPe in Datacenter HPe3PAR Flash Technologies

Virtual server management: Top tips on managing storage in virtual server environments

Violin Memory Arrays With IBM System Storage SAN Volume Control

Analyzing Big Data with Splunk A Cost Effective Storage Architecture and Solution

Calsoft Webinar - Debunking QA myths for Flash- Based Arrays

Pivot3 Desktop Virtualization Appliances. vstac VDI Technology Overview

Choosing Storage Systems

Luxembourg June

EMC XTREMIO EXECUTIVE OVERVIEW

How To Connect Virtual Fibre Channel To A Virtual Box On A Hyperv Virtual Machine

Creating a Catalog for ILM Services. Bob Mister Rogers, Application Matrix Paul Field, Independent Consultant Terry Yoshii, Intel

Flash Storage Roles & Opportunities. L.A. Hoffman/Ed Delgado CIO & Senior Storage Engineer Goodwin Procter L.L.P.

Taking Linux File and Storage Systems into the Future. Ric Wheeler Director Kernel File and Storage Team Red Hat, Incorporated

Virtualization, Business Continuation Plan & Disaster Recovery for EMS -By Ramanj Pamidi San Diego Gas & Electric

Comparison of Hybrid Flash Storage System Performance

Increasing Storage Performance

RFP-MM Enterprise Storage Addendum 1

nexsan NAS just got faster, easier and more affordable.

Performance characterization report for Microsoft Hyper-V R2 on HP StorageWorks P4500 SAN storage

Oracle Database Deployments with EMC CLARiiON AX4 Storage Systems

FAS6200 Cluster Delivers Exceptional Block I/O Performance with Low Latency

Hitachi Path Management & Load Balancing with Hitachi Dynamic Link Manager and Global Link Availability Manager

Transcription:

Storage Management Best Practices & Tips Considerations for file & block storage provisioning Anjan Dave, Principal Storage Engineer LSI Corporation

SNIA Legal Notice The material contained in this tutorial is copyrighted by the SNIA unless otherwise noted. Member companies and individual members may use this material in presentations and literature under the following conditions: Any slide or slides used must be reproduced in their entirety without modification The SNIA must be acknowledged as the source of any material used in the body of any document containing material from these presentations. This presentation is a project of the SNIA Education Committee. Neither the author nor the presenter is an attorney and nothing in this presentation is intended to be, or should be construed as legal advice or an opinion of counsel. If you need legal advice or a legal opinion please contact your attorney. The information presented herein represents the author's personal opinion and current understanding of the relevant issues involved. The author, the presenter, and the SNIA do not assume any responsibility or liability for damages arising out of any reliance on or use of this information. NO WARRANTIES, EXPRESS OR IMPLIED. USE AT YOUR OWN RISK. 2

Abstract File and Block Storage Deployment in Enterprise Environments The non-stop growth in the amount of information generated and stored today presents a significant challenge in storing the data and managing the associated storage infrastructure. This tutorial presents some of the challenges in file storage management, and provides some best practices and tips that apply to managing medium and large sized Network Attached Storage based environments. This tutorial also provides some practical tips for managing block-based storage for business critical applications and databases. 3

File Storage Purchase Planning Anytime you plan to purchase NAS storage or related solutions, plan for Storage Capacity Planning This is about the amount of storage you are dealing with Storage Performance Planning This is about the performance of the amount of storage you are dealing with File Storage Management Basic things to keep in mind when managing a large NAS environment Also keep in mind data center power, cooling, floor space, LAN/WAN bandwidth, available admin-power, training, and future costs of supporting the hardware, software and licenses (not covered in this presentation) 4

File Storage Purchase Planning Storage Capacity Planning How well do you know your data? Classify your data (age, size, type, etc) so you know where to spend the money (figure 1) What % of your total data is active? 20% is not unusual How will you or the solution deal with the remaining 80% Today s active data will become inactive tomorrow Policy based data migration tools are there, but does a tool fit in your environment? Depending on your workflows and processes, how much of the inactive data can be parked for long term on secondary storage? You may NOT want to track, chart, report, backup, increase performance, provide high redundancy, occupy ports, maintain support (e.g., 4-hr support), buy a solution, or spin expensive disks for data that is hardly accessed 5

Addressing Active Data NAS Acceleration or Compression or other Appliances Figure 1: Spend your money on what matters 6

File Storage Purchase Planning Storage Capacity Planning Assuming you HAVE to buy more disks One Storage Array could hold PetaBytes of storage. What happens when you need to do maintenance on it? Spreading the disks on more controllers may make more sense Know how you count your disk spaces Available or Usable Disk Space can mean several things (see graphic) Consider disk overheads formatted capacity, spares, parity disks, snapshots, etc. Don t forget disks for replication You are not just buying disks! Do you have the additional ports on your fiber switch/director? Do you have additional ports on the LAN side? How about free ports on patch panels? Are electrical circuits available to power the new equipment? 7

File Storage Purchase Planning Cold Capacity Formatted Disk Capacity What to count, where? Hot Spares RAID-Parity Space File System Overhead Snapshot/other reserves Quoted space Non-quoted space Archive space Compressed data Thin provisioning Purchased (TB) Spinning Usable (TB) Disk Space Report Example Spinning Used (TB) End-user Usable (GB) End-user Used (GB) 8

File Storage Purchase Planning Storage Capacity Planning When was your last disk/node purchase? Will you be pairing the new (faster) controller with the old one? Can you accommodate a new generation node in an old cluster? How will you address the increasing disk drive densities? E.g., new 600GB drives Vs your existing 300GB drives Does the solution require you to grow disk capacity in specific capacities only? Check with your vendor Drive mixing Does the solution allow mixing SATA and FC drives in same cabinet, same disk module? How about SSDs? 9

File Storage Purchase Planning Storage Performance Planning Back to know your data & end-users Do you know the life cycle of your data? Origin, growth, decay, expiry One file or a set of data will be accessed simultaneously? By how many users or applications? How many people will login simultaneously? What is the average file size and how many? Dealing with lots of tiny files is far different than large files Who is the customer? Online consumer, report generator, partner, developer, or a community? What is their workflow? 10

File Storage Purchase Planning Storage Performance Planning Know your applications Understand Read-Only Vs Read-Write applications and what are the influencing factors (what, when and how it becomes RO). Is any content static, such as graphics? Can it be compressed? One file or a set of data will be accessed simultaneously? By how many users or applications? Sequential data versus non-sequential data can be planned for, but can you aggregate data with similar characteristics? Other performance factors to consider How many host ports, disk ports, Ethernet ports are available, what speeds, and what are the options for port aggregation? If considering new storage networking technology, review your existing investment in the storage networking infrastructure 11

File Storage Purchase Planning Storage Performance Planning Other performance factors to consider Incremental growth mind the performance when adding disk space (concatenation Vs re-striping on the added space) Know what protocol(s) are strongly used in your env., and whether supporting that protocol is a strong trait of the product. CIFS is supported widely, but every vendor s implementation could be different 12

File Storage Purchase Planning Storage Performance Planning Other performance factors to consider Understand how the file system works a powerful hardware is not of much use with weak file system Know how much flexibility you have in fine-tuning the system for performance. Ask what options are available, more the better. Out of box settings work only to some extent To deliver highest performance, the vendor may want you to configure, operate and upgrade their product or solution in a specific manner. Ask for details and whether it is feasible for you Does the system allow you to configure the back-end disk storage? Make sure you understand the meaning of front-end grows independently of back-end 13

File Storage Purchase Planning File Storage Management Dealing with Data Data migration you may need to transfer lot of data on the new platform what are your options, especially if downtime is hard to buy from users? NFS and CIFS coexistence how well is this implemented? Don t just know what protocols are to be supported, rather how widely and heavily each is used (e.g., 90% NFS, 10% CIFS) Single-pane of management if you consider deploying at multiple locations, you ll need it Will the solution cut down the usage of in-house scripts/apps? How does the system handle file system corruptions? Does the product fix the file systems in mounted state? 14

File Storage Purchase Planning File Storage Management Reporting Reporting can the system report the metrics across your full deployment landscape? Does reporting need it s own operating infrastructure? Consider following Can the system report performance related numbers such as wait times, IOPs, throughput at the system and port level and device level? Can the system report system and user level statistics for NFS/CIFS and other protocols? Can the system report on it s CPU, Memory, Cache utilization? Can the system tell if it s overloaded in some way? Or it leaves you wondering? Can the system report hotspots on disks? 15

File Storage Purchase Planning File Storage Management Understand the Specs of the platform There s a difference between published specs and recommended specs Things to consider What is the largest size of the volume/file system you can create? Practical max number of LUNs the system can address Max and recommended number of LUNs that make up the volume/file system Practical max number of volumes/file systems per node Maximum NFS and/or CIFS Ops/sec Max number of NDMP sessions the system can handle per node 16

File Storage Purchase Planning Storage Management and SNIA s SMI-S SMI-S is a standard for storage management defining the communication between management applications (such as a Storage Resource Management application) and instrumentation (software/firmware components) SMI-S can help you achieve a single pane of glass view into your storage landscape, managing, monitoring, and reporting via a single interface Verify following beforehand- Are the existing arrays, new arrays and the SRM tools SMI-S compliant? Understand what features do the arrays from different vendors support via SMI-S Are you able to collect meaningful information (e.g., performance statistics) in a uniform way from all arrays? All arrays may not report the stats you need Understand any limits imposed by a particular storage/san vendor (e.g., number of arrays, volumes, etc) that can affect your deployment You may need to upgrade or install a specific version of firmware/software on arrays or other devices this may affect your compatibility In general, how much can you manage and/or monitor and report, end-to-end from host to SAN to Storage via SMI-S? 17

File Storage Deployment Planning File Storage Management What data will you migrate to FC/SAS, SATA, SSD areas? Distribute the data of I/O intensive apps across spindles to avoid hotspots Know how the file system lays the data If you are allowed to build the back-end disk, keep in mind - the amount of data, the read Vs write, random Vs sequential, active Vs inactive data, snapshot reserve space, optimum use of disk space, disk types in the array, etc 18

File Storage Deployment Planning File Storage Management What other factors can you consider? IOPs, controller cache settings, RAID level, meta-volumes, stripe size, number of spindles, controller ownership of lun, number of hot spares, etc What are your options to build the file system? Understand how the back-end design factors such as number of luns, lun size, stripe size, etc affect the performance Consider NFS mount options (udp/tcp), journaling and a- time updates, their impact to end-users 19

File Storage Deployment Planning File Storage Management The largest file system you can create (given backup windows, number/size of files, snapshots, file system checks, performance, etc) Test the network (client connectivity) configuration options. Port aggregation works differently depending on your LAN switches. Can you use Jumbo Frames? Consider number of snapshots, their location. Are snapshots based on volumes? Can snapshot creation and deletion be staggered? 20

Block Storage Deployment Planning Know all the requirements beforehand Details of all disk spaces needed Hardware details of all storage, SAN and host resources available GROWTH in databases for next 1 to 2 yrs List of all applications, databases, corresponding host names Availability requirements, SLAs, DR (replication, clustering) Performance requirements for each DB, and application 21

Block Storage Deployment Planning Know all the requirements beforehand Processes needing storage team involvement such as DB upgrades, refreshes, patching, etc needing snapshots, or backup/restores Don t forget impact of Disk/Tape backup, de-duplication especially for 5. and 6 Tiered storage, SSD, Virtualization at Host and Storage level, Caching, Compression usage of these technologies and their expected benefits/impacts should be well understood 22

Block Storage Deployment Planning Know all the requirements beforehand For optimum disk layout for databases, check the vendor best practices in addition to internal standards Pick an application first. You can t club the best practices for a Report Generation application and an OLTP together Review DB tunable parameters first they are usually tuned with Application in mind, not just storage Then match DB parameters with storage array settings (e.g., IO size pre-fetch unit could be a multiple of segment size) 23

Block Storage Deployment Planning There may be specific parameters for hosting a particular RDBMS on a storage platform from certain vendor get all parties in agreement, same for applications Understand the COMPLETE landscape of your block storage needs, not just the need at a time Pool apps with similar requirements on one array or a set of spindles Boot from SAN Don t put OS and Swap luns on same disks Segregate OS/Swap luns for different hosts on separate disks Test path failover thoroughly Put the OS related luns in beginning of the RAID-set 24

Block Storage Deployment Planning Process for deploying new block storage for applications/databases List of Apps, Type of Apps Disk Space Needs Today + Next 1-2 yrs + The Surprise Factor! Space Reserve for App/DB Processes (Upgrades, DB Refresh cycles, etc) Space Reserve or Loss for Snapshots, Short-stroking, Max Disk Utilization Limit Vendor Disk Layout Recommendations Internal or Vendor IOPs Needs Array (HSP), Disk (format), File system, RAID Overheads 25

Block Storage Deployment Planning Existing Cold Space Replication/Mirroring New Storage Net Cold Space to Purchase Internal and Vendor Performance Best Practices (Stripe size, cache, etc) Vendor Recommendations Specific to Storage Array Configure Storage Array(s) Note Virtualization, SSDs, Automated Tiered Storage can change the above process 26

Block Storage Deployment Planning Inside the Array Segregate like-minded applications on separate disk arrays/pools Set array and LUN level cache settings. Check with Vendor first Review array maintenance related background activities settings RAID Level Review DB, Storage vendor best practices, and consult your DBAs General Rule of Thumb - Choose RAID-10 over RAID-5 for write-heavy usage Consider usage of RAID-5 Vs RAID-10 for table spaces, transaction logs, archive logs, Indexes, Temp space, Sort space, etc For # of spindles, understand the IOPs requirements for reads/writes as well as the size of the IO Keep in mind the IOPs needs for backups + transactions Pick a suitable segment size based on each app/db needs Can you stripe on top of a group of LUNs? 27

Block Storage Deployment Planning Inside the Array Standardize on 2 or 3 sizes of LUNs Smaller LUNs for binaries, OS, Swap, etc Larger luns for Database files Maintain a balance of ownership of luns among the controllers Allocate LUNs from different RAID-sets (i.e., spread the IO) If you re not short-stroking, then keep utilization below 80% For Redirect-on-write operation, make SURE the disks you allocate for the deltas are configured same as the original LUNs For Copy-on-write operation, make sure the disks used for Deltas are separate spindles, and not shared with original disks Standardize on Host/Host Group naming style (match it with zones) Check out SNIA Tutorial: Storage Performance 101 28

Block Storage Deployment Planning At the Host level Set the optimum HBA driver settings SAN Topology Queue Depth FiberChannel Speed Test path failover break the path in every possible combination Controller failure, just pull it out! Manual LUN failover to alternate controller Block port on the switch/director Unplug fiber cable Know if LUN names are visible by the OS, it helps Make sure LUNs are ingested correctly i.e., a LOG lun is not configured as a database lun Know the limits Max # of luns, paths, LUN Zero requirement 29

Planning for Storage Virtualization If Storage Virtualization is involved Understand the new storage terminology against existing LUN/Volume/Slice/Partition/Stripe/Pool/Reserved_pool/disk_group, etc, etc Compatibility check will be more complex, but do it For new install, pick one OS platform and test all features/functions Test how you ll virtualize existing data/storage Test volume expansion/shrinking Document physical to logical (NPIV) mappings of the WWNs do this end-to-end Test path failover break the path in every possible combination Design several storage pools with different characteristics RAID-5, RAID-10, etc Based on number of underlying disks Based on disk type/capacity Based on disk groups having hot spares Based on array specs/type Check out SNIA Tutorial: Five Best Practices In Virtualization 30

Planning for SSDs If you plan to have SSDs in the environment Make sure (evaluate) that the specs meet your workload/workflow Refer to the SNIA SSS Performance Test Suite Specification Know the costs for YOUR platform of choice, compared to disks Refer to the SNIA Enterprise TCO Calculator Know your data! What to place on the SSDs What policies, processes and tools will you employ to Put data on SSDs (active and/or performance-critical) Move data out of SSDs (to Tier1 or SATA) Size the controller/cpu for # of SSD and Disk drives in the array Make sure you factor in the write amplification in your tests Know the Erase Block Size and align the partitions accordingly Make sure you can measure/quantify the expected performance gains 31

Q&A / Feedback Please send any questions or comments on this presentation to SNIA: trackstoragemgmt@snia.org Anjan Dave Many thanks to the following individuals for their contributions to this tutorial. - SNIA Education Committee 32