OPTIMIZING VIRTUAL TAPE PERFORMANCE: IMPROVING EFFICIENCY WITH DISK STORAGE SYSTEMS



Similar documents
Overview of I/O Performance and RAID in an RDBMS Environment. By: Edward Whalen Performance Tuning Corporation

Best Practices RAID Implementations for Snap Servers and JBOD Expansion

RAID Basics Training Guide

RAID HARDWARE. On board SATA RAID controller. RAID drive caddy (hot swappable) SATA RAID controller card. Anne Watson 1

RAID Overview: Identifying What RAID Levels Best Meet Customer Needs. Diamond Series RAID Storage Array

How To Create A Multi Disk Raid

Q & A From Hitachi Data Systems WebTech Presentation:

WHITE PAPER FUJITSU PRIMERGY SERVER BASICS OF DISK I/O PERFORMANCE

Guide to SATA Hard Disks Installation and RAID Configuration

IBM ^ xseries ServeRAID Technology

Getting Started With RAID

Data Storage - II: Efficient Usage & Errors

Definition of RAID Levels

VERY IMPORTANT NOTE! - RAID

PIONEER RESEARCH & DEVELOPMENT GROUP

File System & Device Drive. Overview of Mass Storage Structure. Moving head Disk Mechanism. HDD Pictures 11/13/2014. CS341: Operating System

RAID technology and IBM TotalStorage NAS products

RAID OPTION ROM USER MANUAL. Version 1.6

RAID Levels and Components Explained Page 1 of 23

Filing Systems. Filing Systems

Guide to SATA Hard Disks Installation and RAID Configuration

RAID Performance Analysis

Sonnet Web Management Tool User s Guide. for Fusion Fibre Channel Storage Systems

RAID Technology White Paper

RAID Level Descriptions. RAID 0 (Striping)

RAID Made Easy By Jon L. Jacobi, PCWorld

NVIDIA RAID Installation Guide

Keys to Successfully Architecting your DSI9000 Virtual Tape Library. By Chris Johnson Dynamic Solutions International

Storage node capacity in RAID0 is equal to the sum total capacity of all disks in the storage node.

How to choose the right RAID for your Dedicated Server

VERITAS Volume Management Technologies for Windows

CONFIGURATION GUIDELINES: EMC STORAGE FOR PHYSICAL SECURITY

A Virtual Tape Library Architecture & Its Benefits

Storage Technologies for Video Surveillance

DAS to SAN Migration Using a Storage Concentrator

VMware Virtual SAN Backup Using VMware vsphere Data Protection Advanced SEPTEMBER 2014

VIA RAID configurations

RAID Utility User Guide. Instructions for setting up RAID volumes on a computer with a Mac Pro RAID Card or Xserve RAID Card

Dynamode External USB3.0 Dual RAID Encloure. User Manual.

SAN Conceptual and Design Basics

WHITE PAPER PPAPER. Symantec Backup Exec Quick Recovery & Off-Host Backup Solutions. for Microsoft Exchange Server 2003 & Microsoft SQL Server

Flexible backups to disk using HP StorageWorks Data Protector Express white paper

RAID Storage System of Standalone NVR

USER S GUIDE. MegaRAID SAS Software. June 2007 Version , Rev. B

Using HP StoreOnce Backup systems for Oracle database backups

Chapter 12: Mass-Storage Systems

Chapter 10: Mass-Storage Systems

TECHNOLOGY BRIEF. Compaq RAID on a Chip Technology EXECUTIVE SUMMARY CONTENTS

RAID Technology Overview

Data Storage Solutions

DELL RAID PRIMER DELL PERC RAID CONTROLLERS. Joe H. Trickey III. Dell Storage RAID Product Marketing. John Seward. Dell Storage RAID Engineering

UK HQ RAID Chunk Size T F ISO 14001

Chapter 9: Peripheral Devices: Magnetic Disks

PrimeArray Data Storage Solutions Network Attached Storage (NAS) iscsi Storage Area Networks (SAN) Optical Storage Systems (CD/DVD)

BrightStor ARCserve Backup for Windows

technology brief RAID Levels March 1997 Introduction Characteristics of RAID Levels

Using RAID6 for Advanced Data Protection

How To Make A Backup System More Efficient

RAID Utility User s Guide Instructions for setting up RAID volumes on a computer with a MacPro RAID Card or Xserve RAID Card.

Lecture 36: Chapter 6

Price/performance Modern Memory Hierarchy

MICROSOFT EXCHANGE best practices BEST PRACTICES - DATA STORAGE SETUP

Windows Server Performance Monitoring

Dependable Systems. 9. Redundant arrays of. Prof. Dr. Miroslaw Malek. Wintersemester 2004/05

EMC Backup and Recovery for Microsoft SQL Server 2008 Enabled by EMC Celerra Unified Storage

SATA II 4 Port PCI RAID Card RC217 User Manual

Nutanix Tech Note. Failure Analysis All Rights Reserved, Nutanix Corporation

Input / Ouput devices. I/O Chapter 8. Goals & Constraints. Measures of Performance. Anatomy of a Disk Drive. Introduction - 8.1

Milestone Solution Partner IT Infrastructure Components Certification Summary

2-Bay Raid Sub-System Smart Removable 3.5" SATA Multiple Bay Data Storage Device User's Manual

Chapter Introduction. Storage and Other I/O Topics. p. 570( 頁 585) Fig I/O devices can be characterized by. I/O bus connections

Database Management Systems

Backup and Recovery 1

Storing Data: Disks and Files

Introduction. What is RAID? The Array and RAID Controller Concept. Click here to print this article. Re-Printed From SLCentral

FANTEC MR-35DU3-6G USER MANUAL

Introduction Disks RAID Tertiary storage. Mass Storage. CMSC 412, University of Maryland. Guest lecturer: David Hovemeyer.

June Blade.org 2009 ALL RIGHTS RESERVED

Using HP StoreOnce Backup Systems for NDMP backups with Symantec NetBackup

HP Smart Array Controllers and basic RAID performance factors

EMC CLARiiON Backup Storage Solutions

A Dell Technical White Paper Dell Compellent

WHITEPAPER: Understanding Pillar Axiom Data Protection Options

VMware Virtual Machine File System: Technical Overview and Best Practices

Product Brief: XenData X2500 LTO-6 Digital Video Archive System

Protect Microsoft Exchange databases, achieve long-term data retention

Benefits of Intel Matrix Storage Technology

SATARAID5 Serial ATA RAID5 Management Software

Enterprise Backup and Restore technology and solutions

Guide to SATA Hard Disks Installation and RAID Configuration

AMD RAID Installation Guide

WHITE PAPER: ENTERPRISE SECURITY. Symantec Backup Exec Quick Recovery and Off-Host Backup Solutions

Configuring RAID for Optimal Performance

DF to-2 SATA II RAID Box

Transcription:

W H I T E P A P E R OPTIMIZING VIRTUAL TAPE PERFORMANCE: IMPROVING EFFICIENCY WITH DISK STORAGE SYSTEMS By: David J. Cuddihy Principal Engineer Embedded Software Group June, 2007 155 CrossPoint Parkway Amherst, NY 14068 P.716.691.1999 F.716.691.9353

TABLE OF CONTENTS TABLE OF CONTENTS...1 ABSTRACT...2 THEORY...3 NUMBER OF VOLUMES (TAPES)...4 NUMBER OF VIRTUAL DRIVES...5 RECOMMENDED CONFIGURATIONS...5 A - Use 3 rd Party Disk Array as JBOD...5 B - Use 3 rd Party Disk Array as RAID 1...5 C - Use 3 rd Party Disk Array as RAID 10...5 D - Use 3 rd Party Disk Array as RAID 5 or 6...6 CONCLUSION...7

ABSTRACT The FastStream Virtual Tape appliance is a standalone storage peripheral which emulates a tape library while storing data to a standard SAS, Fibre Channel, SATA or SCSI disk array. The virtual tape appliance is flexible enough to allow you to use existing storage - JBOD or RAID arrays - to create virtual tape libraries. Since the virtual tape has been designed to interoperate with a large number of third party disk arrays, the configuration of both the virtual tape appliance and the drive array can have significant impacts on performance. This document will present general guidelines for configuring the virtual tape appliance in a wide variety of environments. It will describe the theory of how a virtual tape appliance writes to disks, present guidelines for number of tape volumes and number of drives, then discuss configuration of disk arrays and virtual tape appliances to maximize performance. While a virtual tape appliance performs quite well when treated like just another tape library, it is important to remember that even greater efficiency can be attained by reconfiguring servers for numerous simultaneous backups. See ATTO s Tech Brief: Faster backups using Virtual Tape for help in optimizing the backup of an entire data center.

THEORY Before diving into the format of the virtual tape data, it is helpful to remember how data is stored on a disk drive. A hard drive is made up of a number of spinning platters (disks) covered with magnetic material. These disks are read by a read-head on the end of an actuator arm. Before reading or writing to a specific portion of the disk, the hard drive must move the actuator arm to the proper spot on the disk (known as seek time), then wait for the platter to rotate around to the exact position (known as rotational delay) for the data to be stored. The seek time for a given transfer varies based on how far the actuator arm must move before reading or writing the data. Although modern hard drives are designed to reduce the delays, improperly utilizing the hard drive can cause too much seeking on the disk, seriously degrading performance. If reads or writes to a disk have sequential block addresses, the actuator arm will already be in position, minimizing seek time. If the block addresses used for disk access vary wildly, seek time will go up with a corresponding slump in performance. Prudent users of virtual tape technology will want to store the virtual tape data with RAID. The algorithms for RAID 0, 5 and 10 use a technique called data striping, in which a single write is segmented into multiple pieces, each written on a separate disk. The size of each piece to be written to disk is called the interleave. The group of the interleaved units across all data drives is called a stripe, while the size of the stripe is called the stripe width. RAID 5 uses one of the disk writes for storing parity information. The parity information is written to different disks for adjacent writes (distributed across the disks) for better failure protection. A 1 A 2 A 3 A p B 2 B 3 B p B 1 C 3 C p C 1 C 2 D p D 1 D 2 D 3 Figure 1 Figure 1 depicts RAID 5 on a four-disk array. There are four different stripes, lettered A through D. Each stripe is composed of three pieces of data, numbered 1-3, plus a parity block, labeled with a p. The format of the virtual tape data on the disk is relatively straightforward. Each virtual tape volume is allocated a contiguous set of logical block addresses. The virtual tape data is stored in two parts: tape format data, known as metadata, and backup data. The metadata describes block lengths, filemarks, etc., and is placed at the beginning of the tape region. Backup data is placed on the volume (logically) after the metadata, in sequential logical block addresses. During a standard backup and restore the 3

metadata written to the disks is negligible the real efficiency is gained by writing the backup data as fast as possible. Volume 1 Metadata Volume 1 Backup Data Volume 2 Metadata Volume 2 Backup Data Figure 2 Figure 2 diagrams two virtual tape volumes on a disk. Each volume has space set aside for metadata followed by a large logically-contiguous block of backup data. NUMBER OF VOLUMES (TAPES) The FastStream Virtual Tape appliance lets the user choose the number of volumes created on the disk array. The number and size of tape volumes relies on the backup scenario for your particular installation. Some backup administrators want virtual tapes to closely match the physical tape used for archiving. Other backup administrators want a large number of small virtual tapes for incremental backups, and others want a small number of large virtual tapes for ease of management. Regardless of the usage model, creating the right number of tapes can improve backup performance on a JBOD or RAID 1 (mirrored) array. 1 We ve seen how virtual tape data is written to the disk: each virtual tape volume gets a contiguous set of logical block addresses. If these volumes are too small, many volumes may fit onto the same physical disk, causing several backup streams to be simultaneously written to the same disk. Writing simultaneous backup streams to the same disk, as we have seen, causes the disk to perform many seeks, seriously degrading backup performance. If the tape volumes are too large the opposite effect can occur. Volumes which are too large only allow a small number of physical disks to be active at one time. Other disks in the drive array may be idle, wasting disk bandwidth. When backing up to a JBOD or RAID 1 array it is important to use all of the physical disks simultaneously. The best way to use all of the disks while minimizing seek time is to create virtual tapes which are about the same size as each physical disk and assign each virtual tape volume to its own virtual tape drive. It follows, then, that a JBOD array should be configured so that each LUN presented by the JBOD array is the same size as a physical drive (no drive concatenation). Following this same formula, a RAID 1 1 If RAID striping is used, the number of volumes does not significantly affect performance. 4

(mirrored) array should present one LUN for each mirrored pair of disks. This will allow the FastStream Virtual Tape appliance to use all of the available disk bandwidth for a backup. NUMBER OF VIRTUAL DRIVES For best performance the number of virtual tape drives should be configured to maximize the number of disks being used at once, without performing too many simultaneous backups to the same disk drive. This needs to be balanced with the number of servers which being backed up at your installation. As a rule of thumb, if the virtual tape and disk array are JBOD or RAID 1, there should be one virtual drive for each LUN presented to the virtual tape appliance. The disk array should be configured so that each LUN corresponds to a drive (JBOD) or pair of drives (RAID 1). Some recommended configurations are presented below. RECOMMENDED CONFIGURATIONS Reference Performance Level Disk Array Configuration A High JBOD No Drive Concatenation RAID 5 A High JBOD No Drive Concatenation RAID 10 B Medium-High RAID 1 (Mirrored) RAID 0 C Medium RAID 10 (Mirrored + Striped) RAID 0 D Medium RAID 5 JBOD D Medium RAID 6 JBOD ATTO Configuration Note: The ATTO virtual tape RAID 0/5/10 interleave will be 128 Ki bytes (128 * 1024 bytes). A - Use 3 rd Party Disk Array as JBOD ATTO RAID 5, RAID 10 This scenario will typically yield the best performance, since the ATTO RAID algorithms and the FastStream Virtual Tape were designed to work together. When configuring the disk array, create a single JBOD partition for each disk drive and map each JBOD partition as a separate Logical Unit (LUN). Do not concatenate the drives in the disk array. Configure the virtual tape as RAID 5 or RAID 10. B - Use 3 rd Party Disk Array as RAID 1 ATTO RAID 0 Some disk arrays cannot be configured in JBOD mode. If RAID 1 (mirroring) is available on the disk array, configure the mirroring so that each mirrored volume is the size of a single disk drive. Map each mirrored volume to a separate Logical Unit (LUN), which will effectively match LUNs to mirrored pairs of drives. Configure the virtual tape as RAID 0 (striped). This allows the virtual tape to maximize the number of active disks at a single time. C - Use 3 rd Party Disk Array as RAID 10 ATTO RAID 0 In this scenario, the disk array is configured as RAID 10 (mirrored + striped). If the stripe width is configurable, set it to an even multiple of 128 Ki bytes times the number of drive mirrors. Create stripe partitions equal to the number of drive mirrors, each mapped to a separate Logical Unit (LUN). For 5

example, in a disk array with 16 drives: configure the stripe width to 512 Ki bytes (128 Ki bytes * 4), and split the array into four striped partitions, each mapped to its own LUN. Configure the virtual tape as RAID 0 (striped). D - Use 3 rd Party Disk Array as RAID 5 or RAID 6 ATTO JBOD This scenario is useful since it provides RAID 5/6 parity protection on the disk array but it may incur the penalty of reduced virtual tape performance. To maximize performance in this setup, it is useful to know the data block size of the backup software package. These block sizes are typically around 64Ki (65536) bytes, although many packages use 63 Ki (64512) bytes. Set the RAID interleave to approximately twice the backup software block size 2. Partition the RAID 5/6 array to equal the number of data drives ( [total number of drives]-1 for RAID 5, [total number]-2 for RAID 6), with each partition mapped to a separate LUN. Configure the ATTO Virtual Tape in JBOD mode. 2 Some disk arrays do not allow a user-configured interleave size. In this case, it may be advantageous to use ATTO RAID 0 instead of JBOD. The ATTO RAID 0 interleave will be 128Ki bytes. 6

CONCLUSION ATTO Technology s RAID functionality was developed for use in demanding digital video and audio applications. These applications, by definition, read and write large chunks of sequential data. The ATTO Virtual Tape solution was designed to take advantage of ATTO Technology s RAID algorithms, coupling the streaming nature of tape reads and writes with ATTO s best-in-class streaming RAID solution. The end result is a virtual tape solution with most consistent performance when the external disk array can be configured as just a bunch of disks (JBOD) with the ATTO FastStream providing RAID protection. If this is not desirable, it is essential to choose a FastStream-to-disk array pairing which reduces disk seek time while keeping all of the disk drives busy. The configuration will take into account the number of disk array LUNs, virtual tape volumes, and virtual tape drives to achieve the best performance from the ATTO FastStream Virtual Tape appliance. 7