S 2 -RAID: A New RAID Architecture for Fast Data Recovery



Similar documents
Supplemental File of S 2 -RAID: Parallel RAID Architecture for Fast Data Recovery

Exploring RAID Configurations

Online Remote Data Backup for iscsi-based Storage Systems

Praktijkexamen met Project VRC. Virtual Reality Check

RAID technology and IBM TotalStorage NAS products

Cisco Prime Home 5.0 Minimum System Requirements (Standalone and High Availability)

Comparing Dynamic Disk Pools (DDP) with RAID-6 using IOR

RAID is the de facto storage architecture [16] that

DELL RAID PRIMER DELL PERC RAID CONTROLLERS. Joe H. Trickey III. Dell Storage RAID Product Marketing. John Seward. Dell Storage RAID Engineering

RAID 5 rebuild performance in ProLiant

IBM ^ xseries ServeRAID Technology

Agenda. Enterprise Application Performance Factors. Current form of Enterprise Applications. Factors to Application Performance.

Certification Document macle GmbH GRAFENTHAL R2208 S2 01/04/2016. macle GmbH GRAFENTHAL R2208 S2 Storage system

Cloud Storage. Parallels. Performance Benchmark Results. White Paper.

Performance Characteristics of VMFS and RDM VMware ESX Server 3.0.1

Oracle Database Scalability in VMware ESX VMware ESX 3.5

VTrak SATA RAID Storage System

Areas Covered. Chapter 1 Features (Overview/Note) Chapter 2 How to Use WebBIOS. Chapter 3 Installing Global Array Manager (GAM)

High Performance Tier Implementation Guideline

Test Report Newtech Supremacy II NAS 05/31/2012. Newtech Supremacy II NAS storage system

Recommended hardware system configurations for ANSYS users

RAID. RAID 0 No redundancy ( AID?) Just stripe data over multiple disks But it does improve performance. Chapter 6 Storage and Other I/O Topics 29

Maximizing SQL Server Virtualization Performance

Optimizing LTO Backup Performance

Microsoft SharePoint Server 2010

HP ProLiant Gen8 vs Gen9 Server Blades on Data Warehouse Workloads

System Requirements Table of contents

Chapter 2 Array Configuration [SATA Setup Utility] This chapter explains array configurations using this array controller.

Certification Document bluechip STORAGEline R54300s NAS-Server 03/06/2014. bluechip STORAGEline R54300s NAS-Server system

Building High-Performance iscsi SAN Configurations. An Alacritech and McDATA Technical Note

Intel Solid-State Drives Increase Productivity of Product Design and Simulation

Fusionstor NAS Enterprise Server and Microsoft Windows Storage Server 2003 competitive performance comparison

I3: Maximizing Packet Capture Performance. Andrew Brown

Deployments and Tests in an iscsi SAN

Optimization of Cluster Web Server Scheduling from Site Access Statistics

Intel RAID SSD Cache Controller RCS25ZB040

IT Business Management System Requirements Guide

IOmark- VDI. Nimbus Data Gemini Test Report: VDI a Test Report Date: 6, September

Virtualized Exchange 2007 Archiving with EMC Xtender/DiskXtender to EMC Centera

Maximize System Performance

Minimum Hardware Configurations for EMC Documentum Archive Services for SAP Practical Sizing Guide

AlphaTrust PRONTO - Hardware Requirements

vrealize Business System Requirements Guide

Oracle Database Deployments with EMC CLARiiON AX4 Storage Systems

Scaling in a Hypervisor Environment

Summer Student Project Report

Models Smart Array 6402A/128 Controller 3X-KZPEC-BF Smart Array 6404A/256 two 2 channel Controllers

DELL TM PowerEdge TM T Mailbox Resiliency Exchange 2010 Storage Solution

Step-by-Step Guide. to configure Open-E DSS V7 Active-Active iscsi Failover on Intel Server Systems R2224GZ4GC4. Software Version: DSS ver. 7.

Configuring Apache Derby for Performance and Durability Olav Sandstå

Scaling from Datacenter to Client

Microsoft Exchange Server 2003 Deployment Considerations

Sistemas Operativos: Input/Output Disks

The Monitis Monitoring Agent ver. 1.2

How To Create A Multi Disk Raid

RAID Implementation for StorSimple Storage Management Appliance

Quiz for Chapter 6 Storage and Other I/O Topics 3.10

UCS M-Series Modular Servers

Performance Comparison of SQL based Big Data Analytics with Lustre and HDFS file systems

EMC Avamar Backup Solutions for VMware ESX Server on Celerra NS Series 2 Applied Technology

Performance Analysis of RAIDs in Storage Area Network

Virtuoso and Database Scalability

RAID: Redundant Arrays of Independent Disks

Definition of RAID Levels

Best Practices RAID Implementations for Snap Servers and JBOD Expansion

Benchmarking FreeBSD. Ivan Voras

VIA Fedora Linux Core 8 (x86&x86_64) VT8237R/VT8237A/VT8237S/VT8251/CX700/VX800 V-RAID V3.10 Driver Installation Guide

RAMDISK Benchmarks Test system: Software OS Setup Hypothesis License Author

Using Red Hat Network Satellite Server to Manage Dell PowerEdge Servers

Introduction to MPIO, MCS, Trunking, and LACP

Architecting a High Performance Storage System

Delivering Accelerated SQL Server Performance with OCZ s ZD-XL SQL Accelerator

Performance characterization report for Microsoft Hyper-V R2 on HP StorageWorks P4500 SAN storage

Microsoft SharePoint Server 2010

Cray DVS: Data Virtualization Service

Improving Lustre OST Performance with ClusterStor GridRAID. John Fragalla Principal Architect High Performance Computing

Using Synology SSD Technology to Enhance System Performance Synology Inc.

SMB Direct for SQL Server and Private Cloud

WHITE PAPER Optimizing Virtual Platform Disk Performance

QuickSpecs. HP Smart Array 5312 Controller. Overview

RAID HARDWARE. On board SATA RAID controller. RAID drive caddy (hot swappable) SATA RAID controller card. Anne Watson 1

PARALLELS CLOUD STORAGE

Software-defined Storage at the Speed of Flash

Hyper-V over SMB Remote File Storage support in Windows Server 8 Hyper-V. Jose Barreto Principal Program Manager Microsoft Corporation

H ARDWARE C ONSIDERATIONS

White Paper. Recording Server Virtualization

Understanding the Benefits of IBM SPSS Statistics Server

Performance Tuning and Optimizing SQL Databases 2016

IMPLEMENTING GREEN IT

Post-production Video Editing Solution Guide with Microsoft SMB 3 File Serving AssuredSAN 4000

Transcription:

S 2 -RAID: A New RAID Architecture for Fast Data Recovery Jiguang Wan*, Jibin Wang*, Qing Yang+, and Changsheng Xie* *Huazhong University of Science and Technology, China +University of Rhode Island,USA

Overview A reconstruction solution-s 2 -RAID Using parallel data layout to boost data construction Online reconstruction performance Average user response time Shorten reconstruction time by a factor of 3~6 Comparing with the traditional RAID 1

Outline Reconstruction background Data layout strategy S 2 -RAID prototype Evaluation results Performance analyse 2

Background High-capacity disk keep increasing. Offline reconstruction is result in service down time. Existing reconstruction solutions Long reconstruction time and Average user response time 3

S 2 -RAID Idea Our goals Reducing construction time sharply Maximizing Parallel reconstruction Minimizing the impact on front end performance. S 2 -RAID data layout Parallel reconstruction model Using subraid concept Each subraid uses standard RAID 4

Traditional RAID 5 reconstruction + Failed Disk 0 Disk 1 Disk 2 Spare disk Single reconstruction stream long reconstruction time 5

S 2 -RAID data layout subraid0 0.0 0.1 0.2 Standard RAID 5 D0 D3 D6 0.0 0.1 0.2 partition D0 D1 D2 D3 D4 D5 D6 D7 D8 6

S 2 -RAID data layout structure D0 D1 D2 D3 D4 D5 D6 D7 D8 0 1 2 0 1 2 0 1 2 3 4 5 5 3 4 4 5 3 6 7 8 7 8 6 8 6 7 Group 0 Group 1 Group 2 0 1 2 0 1 2 0 1 2 0 3 4 5 shift 5 3 4 shift 4 5 3 1 6 7 8 7 8 6 8 6 7 2 Init. matrix M 0 M 1 M 2 7

S 2 -RAID data layout structure P i,j :subraid numbers of the (j+1) th partition on disks of (i+1) th group in the RAID K: the partition number of the disk Note: the size of the group must be a prime number 8

S 2 -RAID data layout subraid0 0.0 1.0 2.0 0.1 1.1 2.1 0.2 1.2 2.2 3.0 4.0 5.0 5.1 3.1 4.1 4.2 5.2 3.2 6.0 7.0 8.0 7.1 8.1 6.1 8.2 6.2 7.2 9 D0 D1 D2 D3 D4 D5 D6 D7 subraid8 S 2 -RAID structure 9 disks 9 subraids RAID type RAID 5 RAID 10 RAID 6 etc. D8

S 2 -RAID 5 reconstruction D0 D1 D2 D3 D4 D5 D6 D7 D8 0.0 1.0 2.0 0.1 1.1 2.1 0.2 1.2 2.2 3.0 4.0 5.0 5.1 3.1 4.1 4.2 5.2 3.2 6.0 7.0 8.0 7.1 8.1 6.1 8.2 6.2 7.2 X.0 8.1 3.1 1.1 X.2 D4 was divided into 3 partitions Reconstruction speed! No bottleneck in reconstruction No operation conflict(write or read) SD0 SD1 SD2 10

S 2 -RAID 5 reconstruction D0 D1 D2 D3 D4 D5 D6 D7 D8 0.0 1.0 2.0 0.1 1.1 2.1 0.2 1.2 2.2 3.0 4.0 5.0 5.1 3.1 4.1 4.2 5.2 3.2 6.0 7.0 8.0 7.1 8.1 6.1 8.2 6.2 7.2 X.0 X.2 Free disk can serve front end and Spare disk 3.1 SD0 1.1 SD1 8.1 SD2 11

S 2 -RAID prototype structure iscsi Initator Config iscsi Target S 2 -RAID mdadm MD IO Schedule Disk Disk Disk 12 S 2 -RAID prototype based on MD, are using the open source The iscsi target module modifies the IET SCSI command handling and disk IO parts. The Config module provides RAID setup and configuration functions using mdadm commands to realize different S 2 -RAID subraid functions. The S 2 -RAID module realizes the basic functions of RAID10 and RAID5 including RAID rebuilder based on MD.

Experimental Setup Hardware of server and client Evaluation tools of the storage server and client OS Fedora Core 8.0 OS Fedora Core 8.0 blktrace blktrace 1.0 postmark postmark1.5.1 disks TPC-C TPC-C tpccuva-1.2.3 Disks postgresql postgresql 8.1.15 gnuplot mainboard gnuplot 4.2.5 Mainboard TPC-W TPC-W 1.5 Jdk CPU jdk 1.5.0.06 CPU Tomcat tomcat 5.5 Mysql mysql 5.0.45 NIC NIC 1 Seagate ST3160023AS, 160GB, 7200RPM. 12 Seagate Seagate ST3500320AS, ST3160023AS, 500GB, 160GB, 7200RPM. 7200RPM SUPER X7DVL-I GA-945GCMX-S2 Intel(R) Celeron(R) CPU 2.80GHz Intel(R) Xeon(R) CPU 5110 @ 1.60GHz Tigon3 Intel PRO/1000 iscsi Memory initator memory iscsi-initiator-util 1GB 512MB DDR2 6.2.0865 DDR2 HBA Highpoint 2240 RAID, client tools server 13

S 2 -RAID 5 reconstruction performance 16 Two evaluation parameters Average User Response Time Reconstruction Time 14 12 MD S2-RAID Trace File Write Ratio Ave Size: KB Req Total Req 10 8 6 4 2 0 Fin1 Fin2 Web Average User Response Time(ms) Financial-1 76.84% 3.38 5,334,987 Financial-2 17.65% 2.39 3,699,195 Websearch 0% 15.07 4,579,809 SPC trace characteristics 14

S 2 -RAID 5 reconstruction performance 10000 9000 8000 7000 6000 5000 4000 MD S2-RAID 3000 2000 1000 0 Fin1 Fin2 Web Reconstruction Time(seconds) 15

S 2 -RAID 5 Normal Performance 16 14 12 10 32% 8 6 MD S2-RAID 4 2 0 Fin1 Fin2 Web Average User Response Time(ms) 16

S 2 -RAID 5 Degraded Performance 18 16 14 12 10 8 MD S2-RAID 6 4 2 0 Fin1 Fin2 Web Average User Response Time(ms) 17

Speedup Other Benchmark Performance (MD vs S 2 -RAID) Average User Response Time Reconstruction Time 8 7 6 5 4 3 2 1 0 TPCC TPCW Postmark TPCC:20 warehouses with 10 terminals per warehouse interval of 120 minutes TPCW:150 emulated browsers Postmark:20,000 files of size 4KB to 500KB and to perform 100,000 transactions 18

Sensitivity Parameters for Reconstruction Some sensitivity parameters Reconstruction speed bandwidth I/O request block size Number of spare disk(additional disk not system disk) 19

Speedup Reconstruction speed bandwidth Average User Response Time Reconstruction Time 7 6 Speed threshold:1mb/s~200mb/s 5 4 3 2 1 0 1 10 50 Speed Limit Min (MB/s) The result is based on Financial-1 traces 20

Speedup Number of spare disk Average User Response Time Reconstruction Time 7 6 5 4 3 2 1 0 1 2 3 No. of Spare Disks Speed bandwidth and No. of spare disk is insensitive to s 2 -RAID 21

Data Transfer Rate (MB/s) I/O request block size RAID5-random S2-RAID5-random RAID5-sequence S2-RAID5-sequence 120 100 S 2 -RAID has smaller stripe span 80 60 40 20 0 1 4 16 64 256 1024 I/O Request Block Size (KB) 22

Conclusion A parallel reconstruction data layout Implement the s 2 -RAID prototype and evaluation of this structure S 2 -Raid reduces the reconstruction time greatly. User response time of S 2 -Raid is comparable to that of MD. Optimization? Embedding existing rebuilding process (distributed sparing)------reduce the number of disks Tolerate the mulit-disk failures. 23

Thank you for your attention! Questions?