Low-cost BYO Mass Storage Project
James Cizek, Unix Systems Manager
Academic Computing and Networking Services

The Problem
- Reduced budget
- Storage needs growing
- Storage needs changing (tiered storage)
- "I NEED MORE DISK SPACE!" (DBAs!!)
- Current commercial offerings do not address this problem without major budget implications

November 17, 2010, Subnet Managers Meeting

Projected Needs (2009 Survey)
Research data storage need (TBytes):
- Now: 219
- 2 years: 1,384
- 5 years: 4,914

The Goal
- Find a mass storage solution that won't break the bank
- CSU applied for an NSF grant to meet this need ($1 million for 500 TB x 2), but was not awarded the grant (1,000 1 TB drives!!!)
- Vendors sell high-speed, costly systems (suitable for Amazon, Google, etc.), but we want slower, low-cost storage
- After looking at vendor offerings, we decided to roll our own
- Maximize TB/$ with reasonable assurance that data are redundant and safe

Some Understandings
- Project approached as secondary or "Tier 2" type storage, not intended to replace extremely fast, ultra-reliable, expensive disk systems
- Device management, support, and component failure need to be addressed

A starting point
- Online backup company Backblaze open-sourced their storage pod design; see https://www.backblaze.com/petabytes-on-a-budget-howto-build-cheap-cloud-storage.html
- Starting with a proven design would eliminate many unknowns and speed up our design process
- It turned out to be helpful, but we still ran into many headaches of our own

The BackBlaze design

BackBlaze vs. CSU design goals
- Realized that the BackBlaze design didn't exactly meet our requirements:
  - No redundant power supplies
  - Cheap SATA cards didn't take advantage of the performance available from a large number of spinning hard drives
  - Case too small to accommodate a server-class motherboard
  - Single system hard drive is a single point of failure
- Realized the need to over-engineer cooling and vibration reduction (two major contributors to drive failure)
- Chassis was red instead of CSU green!

CSU design changes
- Lengthened the case by 3 inches to accommodate a dual-CPU server-class motherboard
- Added more RAM for file system buffering (6 GB compared to BackBlaze's 4 GB)
- Added larger, redundant power supplies; an individual supply can run the entire case
- Used enterprise-grade drives instead of consumer-grade, after much research
  - Drives selected have vibration sensing / damping
- Replaced cheap SATA cards with high-performance PCI-e cards

CSU chassis nearing completion

Costs
- Case: $700
- 1 TB drives: $100 x 45 ($4,500)
  - Drives were purchased earlier this year; $100 now buys 1.5 TB
- Motherboard / processors / memory: $900
- Power supplies: $200
- SATA cards: $300
- Ethernet card with iSCSI offload: $350
- SATA multipliers: $45 x 9 ($405)
- Fans / cables / hardware / DVD / mounts / etc.: $1,000
- Total: 45 raw TB for $8,355!
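
As a quick check on the total, the line items from the Costs slide can be summed and turned into a cost per raw TB. This is only an arithmetic sketch; the dictionary keys and variable names are illustrative, and the prices are the ones quoted on the slide.

```python
# Line items (USD) copied from the Costs slide; key names are illustrative.
parts = {
    "case": 700,
    "drives (45 x 1 TB at $100)": 45 * 100,
    "motherboard / processors / memory": 900,
    "power supplies": 200,
    "SATA cards": 300,
    "Ethernet card with iSCSI offload": 350,
    "SATA multipliers (9 at $45)": 9 * 45,
    "fans / cables / hardware / DVD / mounts": 1000,
}

total_usd = sum(parts.values())
raw_tb = 45  # 45 drives x 1 TB each, before any RAID overhead
usd_per_raw_tb = total_usd / raw_tb

print(f"Total: ${total_usd:,} for {raw_tb} raw TB "
      f"(${usd_per_raw_tb:,.2f} per raw TB)")
```

The figure is per raw TB; usable capacity after RAID parity is lower, so the cost per usable TB is somewhat higher.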

Testing Environment
- Testing was done with both small files and large files (larger than the largest memory buffer)
- The same data was used for all tests, which allowed us to validate results from the various benchmark utilities against each other
- All RAID configurations were built in multiples of 3 to spread load across as many backplanes as possible
- All test data below assume the worst case (reads all random, writes all continuous)
- Highest recorded temperature (excluding the CPU exhaust fan) under full load was 100 °F, with ambient office air at the intake; we should see even more improvement in a datacenter

Initial Performance
- Internal performance (using dd), 11 GB dataset on an 18-drive RAID 6:
  - Read: 472 MB/s
  - Write: 162 MB/s
- Over a 4 Gb Fibre Channel connection, 11 GB dataset on an 18-drive RAID 6:
  - Read: 115 MB/s
  - Write: 98 MB/s
- RAID sets of fewer than 6 drives showed degraded performance; RAID sets above 18 drives showed only small performance benefits
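
The RAID set sizes discussed here also determine how much raw capacity is given up to parity. The sketch below is a hypothetical helper, not part of the CSU tooling. It assumes a single RAID 6 set with two drives' worth of parity and 1 TB drives as on the Costs slide. The 6- and 18-drive sizes come from the slide; 9 and 45 are added only for illustration.

```python
def raid6_usable_tb(drives: int, drive_tb: float = 1.0) -> float:
    """Usable capacity of one RAID 6 set: two drives' worth goes to parity."""
    if drives < 4:
        raise ValueError("RAID 6 needs at least 4 drives")
    return (drives - 2) * drive_tb


# 6 and 18 are the set sizes named on the slide; 9 and 45 are illustrative.
for n in (6, 9, 18, 45):
    usable = raid6_usable_tb(n)
    print(f"{n:2d} drives: {usable:4.1f} TB usable, {usable / n:.0%} of raw")
```

This covers capacity only. It says nothing about throughput, which the measurements above show levelling off beyond 18 drives.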

Cost / Performance Comparison
- We use IBM DS4300 and DS4700 Fibre Channel disk systems as Tier 1 disk in the Unix environment. These take up 18U and cost nearly $100K
- We use EqualLogic (various models) iSCSI arrays for Tier 1 disk in the Windows environment. The P6500E model holds 48 TB but runs near $80K
- We use JetStor SATA-based products for Tier 2: 16 TB capacity for $8,000
  - Although Fibre Channel capable, they cannot present disk space standalone (i.e., they must be connected to a server)
- The DIY disk box is 45 (67) TB for $8,300 in 4U
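
To put the systems on one scale, the quoted prices can be divided by the quoted capacities. This is a rough sketch using only the round numbers on the slide. The IBM systems are left out because the slide gives no capacity for them. The 67 TB figure is assumed here to mean the same box populated with 1.5 TB drives at the same price, which the slide does not state explicitly.

```python
# (label, capacity in TB, approximate price in USD), as quoted on the slide.
systems = [
    ("EqualLogic P6500E (Tier 1)", 48, 80_000),
    ("JetStor SATA (Tier 2)", 16, 8_000),
    ("DIY disk box, 45 TB", 45, 8_300),
    # Assumption: 67 TB is the same box with 1.5 TB drives at the same price.
    ("DIY disk box, 67 TB", 67, 8_300),
]


def usd_per_tb(capacity_tb: float, price_usd: float) -> float:
    """Purchase price divided by quoted capacity."""
    return price_usd / capacity_tb


# List the systems from cheapest to most expensive per TB.
for name, tb, usd in sorted(systems, key=lambda s: usd_per_tb(s[1], s[2])):
    print(f"{name:28s} ${usd_per_tb(tb, usd):8,.0f} per TB")
```

These are purchase prices per quoted TB only; support contracts, rack space, power, and staff time are not included.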

Configuration
- Much was learned during testing
- RAID levels: 5 and 6 tested; 5 is faster, but not by enough to disregard the added safety of 6. 1 and 10 not considered
- Operating system: Debian 64-bit Server
- Performance testing: Unix dd, IOmeter, IOzone
  - Consistent data obtained from all tools
- Connection offerings (Fibre, iSCSI, NFS, AoE):
  - Fibre
  - iSCSI
  - NFS (SLOW!!!)
  - AoE (working out kinks)

Challenges ahead
- Support management (what happens when a disk fails?)
- Backup and protection of stored data
- Mirroring units
- Avoid backing up to the enterprise backup system
- Data storage and protection policies
- Parallel file system

Where will this be useful?
- Library digital repository
- Research computing: HPC, Tier 2
- Campus-wide cloud storage
- Second- or third-tier storage for your enterprise backups
- Email/file archiving
- Database snapshots kept for the long term (LMS)

What other possibilities?
- Very large JBOD (directly attached to a server)
- Linux server offering CIFS/NFS NAS capability
- iSCSI target
- Direct connection via Fibre Channel (HPC)
- VMware ESXi: standalone VM cluster with massive attached storage
- 4U server running Windows/Linux/FreeNAS/OpenFiler

Where are we today?

Next steps at CSU
- Collect the final parts list for a complete build
- Documentation
- Put it into semi-production and see how it performs under real-world conditions

Resources
- http://blog.backblaze.com/2009/09/01/petabytes-on-a-budget-how-to-build-cheap-cloud-storage/ (original BackBlaze project)
- http://www.ctcustomfab.com (cases)
- http://www.chyangfun.com (SATA multipliers)
- http://www.colostate.edu/curtisb/mass_storage (wiki on CSU progress)

Questions?
james@colostate.edu