Staying afloat in today s Storage Pools. Bob Trotter IGeLU 2009 Conference September 7-9, 2009 Helsinki, Finland



Similar documents
Agenda. Enterprise Application Performance Factors. Current form of Enterprise Applications. Factors to Application Performance.

SAN TECHNICAL - DETAILS/ SPECIFICATIONS

RAID technology and IBM TotalStorage NAS products

Overview of I/O Performance and RAID in an RDBMS Environment. By: Edward Whalen Performance Tuning Corporation

<Insert Picture Here> Refreshing Your Data Protection Environment with Next-Generation Architectures

Implementing a Digital Video Archive Using XenData Software and a Spectra Logic Archive

UN 4013 V - Virtual Tape Libraries solutions update...

VTrak SATA RAID Storage System

CONFIGURATION GUIDELINES: EMC STORAGE FOR PHYSICAL SECURITY

ZFS Administration 1

WHITEPAPER: Understanding Pillar Axiom Data Protection Options

Keys to Successfully Architecting your DSI9000 Virtual Tape Library. By Chris Johnson Dynamic Solutions International

Best Practices RAID Implementations for Snap Servers and JBOD Expansion

File System & Device Drive. Overview of Mass Storage Structure. Moving head Disk Mechanism. HDD Pictures 11/13/2014. CS341: Operating System

HITACHI VIRTUAL STORAGE PLATFORM FAMILY MATRIX

Operating System Concepts. Operating System 資 訊 工 程 學 系 袁 賢 銘 老 師

A Virtual Tape Library Architecture & Its Benefits

technology brief RAID Levels March 1997 Introduction Characteristics of RAID Levels

Implementing a Digital Video Archive Based on XenData Software

Large Scale Storage. Orlando Richards, Information Services LCFG Users Day, University of Edinburgh 18 th January 2013

The Raw Truth about Storage Capacity. A Real-World Look at the Capacity You Buy vs. the Capacity You Can Use

ZFS Backup Platform. ZFS Backup Platform. Senior Systems Analyst TalkTalk Group. Robert Milkowski.

HITACHI VIRTUAL STORAGE PLATFORM FAMILY MATRIX

Optimizing LTO Backup Performance

Chapter 12: Secondary-Storage Structure

Evaluation Report: Accelerating SQL Server Database Performance with the Lenovo Storage S3200 SAN Array

Distribution One Server Requirements

What is RAID--BASICS? Mylex RAID Primer. A simple guide to understanding RAID

Storage Technologies for Video Surveillance

High Availability Databases based on Oracle 10g RAC on Linux

PARALLELS CLOUD STORAGE

IBM System Storage DS5020 Express

Implementing an Automated Digital Video Archive Based on the Video Edition of XenData Software

Get Success in Passing Your Certification Exam at first attempt!

E4 UNIFIED STORAGE powered by Syneto

EMC Backup and Recovery for Microsoft SQL Server 2008 Enabled by EMC Celerra Unified Storage

Best Practices for Data Sharing in a Grid Distributed SAS Environment. Updated July 2010

Optimizing Large Arrays with StoneFly Storage Concentrators

Hyper ISE. Performance Driven Storage. XIO Storage. January 2013

Overcoming Backup & Recovery Challenges in Enterprise VMware Environments

The functionality and advantages of a high-availability file server system

Chapter 12: Mass-Storage Systems

SAN Conceptual and Design Basics

The safer, easier way to help you pass any IT exams. Exam : Storage Sales V2. Title : Version : Demo 1 / 5

Best in Class Storage Solutions for BS2000/OSD

ntier Verde Simply Affordable File Storage

STORAGE CENTER. The Industry s Only SAN with Automated Tiered Storage STORAGE CENTER

IBM TSM DISASTER RECOVERY BEST PRACTICES WITH EMC DATA DOMAIN DEDUPLICATION STORAGE

IOmark- VDI. Nimbus Data Gemini Test Report: VDI a Test Report Date: 6, September

SUN HARDWARE FROM ORACLE: PRICING FOR EDUCATION

Best Practices for Optimizing Storage for Oracle Automatic Storage Management with Oracle FS1 Series Storage ORACLE WHITE PAPER JANUARY 2015

Dynamode External USB3.0 Dual RAID Encloure. User Manual.

Data Storage Solutions

Customized storage solutions for versatile usage scenarios Efficiently managed data growth leveraging enterprise features

FlexArray Virtualization

Delivery Method Chart: Replacement Parts and Installation of Integrated Software Updates

Dell PowerVault MD Family. Modular storage. The Dell PowerVault MD storage family

June Blade.org 2009 ALL RIGHTS RESERVED

Introduction to Data Protection: Backup to Tape, Disk and Beyond. Michael Fishman, EMC Corporation

Serial ATA in Servers and Networked Storage

BMC Recovery Manager for Databases: Benchmark Study Performed at Sun Laboratories

Firebird and RAID. Choosing the right RAID configuration for Firebird. Paul Reeves IBPhoenix. mail:

Next Generation NAS: A market perspective on the recently introduced Snap Server 500 Series

DELL RAID PRIMER DELL PERC RAID CONTROLLERS. Joe H. Trickey III. Dell Storage RAID Product Marketing. John Seward. Dell Storage RAID Engineering

BlueArc unified network storage systems 7th TF-Storage Meeting. Scale Bigger, Store Smarter, Accelerate Everything

SMB Direct for SQL Server and Private Cloud

Advanced Knowledge and Understanding of Industrial Data Storage

Choosing and Architecting Storage for Your Environment. Lucas Nguyen Technical Alliance Manager Mike DiPetrillo Specialist Systems Engineer

Automated Data-Aware Tiering

Storage Backup and Disaster Recovery: Using New Technology to Develop Best Practices

Striped Set, Advantages and Disadvantages of Using RAID

Slash Costs and Improve Operations with Server, Storage and Backup Virtualization. December 2008

Tape. Emerging Storage Technology Panel IEEE/NASA Conference MSST2008 September 25, 2008

Increasing Storage Performance

AFS Usage and Backups using TiBS at Fermilab. Presented by Kevin Hill

Oracle Database - Engineered for Innovation. Sedat Zencirci Teknoloji Satış Danışmanlığı Direktörü Türkiye ve Orta Asya

Network Attached Storage Common Configuration for Entry-Level

IBM and VERITAS Practical disaster recovery

Frequently Asked Questions: EMC UnityVSA

Cloud Storage. Parallels. Performance Benchmark Results. White Paper.

SLIDE 1 Previous Next Exit

THE SUMMARY. ARKSERIES - pg. 3. ULTRASERIES - pg. 5. EXTREMESERIES - pg. 9

Intel Solid- State Drive Data Center P3700 Series NVMe Hybrid Storage Performance

The Revival of Direct Attached Storage for Oracle Databases

Implementing Enterprise Disk Arrays Using Open Source Software. Marc Smith Mott Community College - Flint, MI Merit Member Conference 2012

The Microsoft Large Mailbox Vision

WHITE PAPER. Drobo TM Hybrid Storage TM

RAID5 versus RAID10. First let's get on the same page so we're all talking about apples.

Chapter 10: Mass-Storage Systems

HP StorageWorks P2000 G3 and MSA2000 G2 Arrays

Ultra-Scalable Storage Provides Low Cost Virtualization Solutions

Cisco Small Business NSS2000 Series Network Storage System

Addendum No. 1 to Packet No Enterprise Data Storage Solution and Strategy for the Ingham County MIS Department

OPTIMIZING VIRTUAL TAPE PERFORMANCE: IMPROVING EFFICIENCY WITH DISK STORAGE SYSTEMS

ClearPath Storage Update Data Domain on ClearPath MCP

Driving the New Paradigm of Software Defined Storage Solutions White Paper July 2014

Disk-to-Disk Backup and Restore Solution

Analyzing Big Data with Splunk A Cost Effective Storage Architecture and Solution

ETERNUS for small and medium-sized businesses Reliable Storage Solutions for your Dynamic Infrastructures

Archive Data Retention & Compliance. Solutions Integrated Storage Appliances. Management Optimized Storage & Migration

Transcription:

Staying afloat in today s Storage Pools Bob Trotter IGeLU 2009 Conference September 7-9, 2009 Helsinki, Finland

Background Storage Today What we do Storage Issues Trends Topics to Discuss

Background We don t have DigiTool Digital Library of Georgia (DLG) Archive W. J. Brown Media Archives Special Collections Building Technical Stuff

Background: DLG Archive Georgia Government Publications State Depository Digitizing since 1994 Around 40,000 documents Georgia HomePLACE Image Collections from around Georgia Funded by IMLS Digitization Guidelines Image from Sanborn Maps Collection

DLG Archive

Newspapers DLG Archive Red and Black student newspaper Georgia Newspaper Project Since the early 1950s Microfilm scanners for digitization

DLG Archive ELUNA 2009 Conference, Richmond, Virginia

Video DLG Archive Civil Rights Digital Library (CRDL) Digitized newsfilm, radio, and images

Background: Media Archives Peabody Award Collection Over 40,000 Award winning Titles Radio programs dating from 1940 Television programs starting in 1948 Newsfilm Collections WSB (Atlanta) over 5 million feet of film WALB (Albany) over 1,600 cans of film

Digitized Video SAMMA Lite system for video digitization Creates streaming derivative, and archival copy Archival copy straight to LTO tape Grant funded project working on 2,200 hours Total of Around 60,000 hours of video Will require 600-800 TB archival storage

New Special Collections Building Break ground this summer Huge archival vault for collections and archival copies Data center Processing/storing archival data I have already been asked to provide specs for what hardware will be installed

New Special Collections Building

Background: Technical Stuff Talk about this a little later

Storage Today: What Is It Storage Array: The basic unit of storage Controller tray Contains the brains; the storage OS that handles the I/O, caching, RAID, etc. Also has disks JBOD: Just a Bunch Of Disks Has to be attached to a Controller tray Depending on Array type: 2 7 / controller Available now with 1 Terabyte drives Multiple Terabyte coming

Storage Today Many different layouts DAS: Direct Attached Storage Usually over Fibre Channel SAN: Storage Area Network Switch based collection of arrays NAS: Network Attached Storage Large arrays with built in computer front end Accessable via NFS, Samba, etc.

Storage Today Tiered Storage / Hierarchical Storage Data storage environment consisting of two or more kinds of storage delineated by differences in at least one of these four attributes: Price, Performance, Capacity and Function.

Performance / Price Architecture of array Tiers FC-AL: Fibre Channel Arbitrated Loop Fastest most expensive SAS: Serial Attached SCSI Not quite as fast or expensive SATA: Serial ATA Least Expensive; slightly slower than SAS

Tiers Example of 3 Tiered Storage Teir 1: FC-AL array(s): DAS or SAN Sun StorageTek 6140s or 6540 Tier 2: SATA array(s): DAS or NAS Sun X4500(s) Tier 3: Tape Robotic device Sun SL3000 Modular Library System

1 TB!= 1 TB Storage Today Vendor TB based on 1,000 Byte K Actual TB (what computer reports) based 1,024 byte K

DLG Storage Sun Storage arrays: 3310, 3511, 6140 Exceeded backup window When expanded with 3511s (~2005) Had just recently retrofitted a room in MLC to expand our data center Purchased two 3511s and another server Set up an offsite mirror in data center at MLC Use modified rsync for nightly syncing of offsite mirror

DLG Storage Also set up automated zipping of archive files Provide staff with scripts to unzip when needed Weren t sure how much this would help, but on scanned text documents we get about 40% compression

Current setup (DAS) DLG Storage Each side of mirror is: One - 6140 controller tray with 16 x 500GB SATA drives One 16 TB JBOD (16 x 1 TB SATA drives) Just purchased two 12 TB Sun 2540s (12 x 1 TB SATA drives) for expansion

DLG Storage 24 TB total raw space for each side of mirror 21.84 TB as the computer sees it Each mirror Comprised of 3 RAID5 (single parity) volumes With hot spares and parity drives comes to 16.1 TB usable storage Average of 30% of space taken by recoverability overhead

Media Archival Storage Grant funded project working on 2,200 hours of the Peabody Awards collection Originally designed to write archival version to LTO tape Many problems with both creating tar files and writing the LTO tapes from windows box Total archival output of project estimated at around 24 TB Had money left in grant so started looking for dense, inexpensive arrays

Media Archival Storage Settled on 48 TB Sun X4500 Storage Server (Thumper) and purchased two It is an array with built in server front end: 2 x dual core AMD; 16 GB Memory; 8 onboard disk controllers; customized version of Solaris x86; ZFS

Configuring the X4500 Now we had to configure it Two drives used for mirrored root file system for server. Found lots of information on web and settled on a configuration recommended by several experts. One large zpool with four 11 disk raidz2 groups, and two hot spares. This gives you a total of just over 32 TB of available space (reported by the system).

Configuring the X4500 That s right, it went from 48 to 32 TB. First 2 went to the root filesystem Another 2 for hot spares With raidz2 you have 2 parity drives for each of the four groups = 8 more Total of 12, but 48 12 = 36. But remember 48!= 48; 48 ~= 43.8; and 12 ~= 10.9. So, 43.8 10.9 = 32.9

Capacity/Recoverability Dilemma With that much capacity loss, I wanted to try other configurations. We came up with a configuration that had the least redundancy we could stand. Only gained back ~3.6 TB Decided it was not worth the recoverability hit

Our Future Consolidate all archives to 3 Tiered Storage configuration. With Hierarchical Storage Management software controlling it. Like the 3 Tiered configuration I mentioned earlier. Sun s SAM-FS (Storage Archive Management-File System)

Tiered Storage Archiving and Staging

Trends Proactive data protection Self Healing Copy on Write Scrubbing

Integration of Solid State Devices (SSD) Much Faster Also Much more expensive ($10,000 for 100 GB Read Flash Accelerator) Trends

More Topics Storage Virtualization the process of abstracting logical storage from physical storage iscsi, IP SAN Deduping

Eat up with Acronyms RAID Redundant Array of Inexpensive Disks SCSI Small Computer System Interface JBOD Just a Bunch Of Disks SATA Serial Advanced Technology Attachment SAS Serial Attached SCSI iscsi SCSI over internet connection HBA Host Bus Adapter

Isilon Systems OneFS OS Other Companies On each node Vertical Striping

Xiotech Other Companies Intelligent Storage Element Sealed DataPac Factory remanufacturing Comes with 5 Year Warranty

Questions? rwt@mail.libs.uga.edu