RAID Overview 91.520



Similar documents
Storing Data: Disks and Files

Lecture 36: Chapter 6

Database Management Systems

Definition of RAID Levels

How To Create A Multi Disk Raid

Input / Ouput devices. I/O Chapter 8. Goals & Constraints. Measures of Performance. Anatomy of a Disk Drive. Introduction - 8.1

CS420: Operating Systems

RAID Overview: Identifying What RAID Levels Best Meet Customer Needs. Diamond Series RAID Storage Array

What is RAID? data reliability with performance

Disks and RAID. Profs. Bracy and Van Renesse. based on slides by Prof. Sirer

File System Design and Implementation

How To Write A Disk Array

Chapter 6 External Memory. Dr. Mohamed H. Al-Meer

Reliability and Fault Tolerance in Storage

An Introduction to RAID. Giovanni Stracquadanio

RAID Performance Analysis

Striped Set, Advantages and Disadvantages of Using RAID

Fault Tolerance & Reliability CDA Chapter 3 RAID & Sample Commercial FT Systems

CS161: Operating Systems

Operating Systems. RAID Redundant Array of Independent Disks. Submitted by Ankur Niyogi 2003EE20367

Why disk arrays? CPUs improving faster than disks

Lecture 23: Multiprocessors

PIONEER RESEARCH & DEVELOPMENT GROUP

technology brief RAID Levels March 1997 Introduction Characteristics of RAID Levels

RAID. Contents. Definition and Use of the Different RAID Levels. The different RAID levels: Definition Cost / Efficiency Reliability Performance

RAID. Storage-centric computing, cloud computing. Benefits:

CS 153 Design of Operating Systems Spring 2015

Improving Lustre OST Performance with ClusterStor GridRAID. John Fragalla Principal Architect High Performance Computing

Outline. Database Management and Tuning. Overview. Hardware Tuning. Johann Gamper. Unit 12

RAID. Tiffany Yu-Han Chen. # The performance of different RAID levels # read/write/reliability (fault-tolerant)/overhead

RAID technology and IBM TotalStorage NAS products

Price/performance Modern Memory Hierarchy

1 Storage Devices Summary

RAID Level Descriptions. RAID 0 (Striping)

COSC 6374 Parallel Computation. Parallel I/O (I) I/O basics. Concept of a clusters

Module 6. RAID and Expansion Devices

Sistemas Operativos: Input/Output Disks

Q & A From Hitachi Data Systems WebTech Presentation:

Dependable Systems. 9. Redundant arrays of. Prof. Dr. Miroslaw Malek. Wintersemester 2004/05

RAID Technology. RAID Overview

Why disk arrays? CPUs speeds increase faster than disks. - Time won t really help workloads where disk in bottleneck

Data Storage - II: Efficient Usage & Errors

Chapter Introduction. Storage and Other I/O Topics. p. 570( 頁 585) Fig I/O devices can be characterized by. I/O bus connections

Comprehending the Tradeoffs between Deploying Oracle Database on RAID 5 and RAID 10 Storage Configurations. Database Solutions Engineering

Filing Systems. Filing Systems

Hard Disk Drives and RAID

COS 318: Operating Systems. Storage Devices. Kai Li Computer Science Department Princeton University. (

Overview of I/O Performance and RAID in an RDBMS Environment. By: Edward Whalen Performance Tuning Corporation

How To Make A Backup System More Efficient

Disk Array Data Organizations and RAID

Chapter 9: Peripheral Devices: Magnetic Disks

CS 61C: Great Ideas in Computer Architecture. Dependability: Parity, RAID, ECC

Assessing RAID ADG vs. RAID 5 vs. RAID 1+0

Eliminating Backup System Bottlenecks: Taking Your Existing Backup System to the Next Level. Jacob Farmer, CTO, Cambridge Computer

RAID: Redundant Arrays of Independent Disks

Using RAID6 for Advanced Data Protection

CSE 120 Principles of Operating Systems

How To Improve Performance On A Single Chip Computer

RAID. RAID 0 No redundancy ( AID?) Just stripe data over multiple disks But it does improve performance. Chapter 6 Storage and Other I/O Topics 29

RAID Basics Training Guide

VERITAS Volume Management Technologies for Windows

03 infra TI RAID. MTBF; RAID Protection; Mirroring and Parity; RAID levels; write penalty

Identifying the Hidden Risk of Data Deduplication: How the HYDRAstor TM Solution Proactively Solves the Problem

Read this before starting!

Guide to SATA Hard Disks Installation and RAID Configuration

Block1. Block2. Block3. Block3 Striping

CS 6290 I/O and Storage. Milos Prvulovic

W4118: RAID. Instructor: Junfeng Yang

California Software Labs

What is RAID and how does it work?

Object Storage, Cloud Storage, and High Capacity File Systems

Guide to SATA Hard Disks Installation and RAID Configuration

RAID 5 rebuild performance in ProLiant

Summer Student Project Report

Storage. The text highlighted in green in these slides contain external hyperlinks. 1 / 14

The Advantages of Using RAID

HP Smart Array Controllers and basic RAID performance factors

Introduction to I/O and Disk Management

Introduction. What is RAID? The Array and RAID Controller Concept. Click here to print this article. Re-Printed From SLCentral

RAID Rebuilding. Objectives CSC 486/586. Imaging RAIDs. Imaging RAIDs. Imaging RAIDs. Multi-RAID levels??? Video Time

Oracle Database 10g: Performance Tuning 12-1

William Stallings Computer Organization and Architecture 7 th Edition. Chapter 6 External Memory

How to choose the right RAID for your Dedicated Server

Alternatives to Big Backup

HARD DRIVE CHARACTERISTICS REFRESHER

Review. Lecture 21: Reliable, High Performance Storage. Overview. Basic Disk & File System properties CSC 468 / CSC /23/2006

COS 318: Operating Systems. Storage Devices. Kai Li and Andy Bavier Computer Science Department Princeton University

RAID HARDWARE. On board SATA RAID controller. RAID drive caddy (hot swappable) SATA RAID controller card. Anne Watson 1

Introduction Disks RAID Tertiary storage. Mass Storage. CMSC 412, University of Maryland. Guest lecturer: David Hovemeyer.

Exploring RAID Configurations

Data Corruption In Storage Stack - Review

Transcription:

RAID Overview 91.520 1

The Motivation for RAID Computing speeds double every 3 years Disk speeds can t keep up Data needs higher MTBF than any component in system IO Performance and Availability Issues! 2

PERFORMANCE Parallelism Load Balancing AVAILABILITY RAID to the Rescue! Redundancy: Mirroring, or Striping with Parity FLEXIBILITY Selectable Performance/Availability/Cost 3

What is RAID Technology? Redundant Array of Independent Disks (a.k.a. Disk Array ) Multiple drives, single host disk unit Provides opportunity to increase: Performance via Parallelism Data Availability via Redundancy Cheap cost via commodity disks 4

Performance: Disk Striping Chunk size tuneable for BW vs Thruput tradeoffs Large Chunk High Throughput (IO/sec) Small Chunk High Bandwidth (MB/sec) 1 2 3 4 5 Chunk 0 Chunk 1 Chunk 2 Chunk 3 Chunk 4 Chunk 5 Chunk Parity6 Chunk 7 Chunk 8 Chunk 9 Chunk 10 Chunk 11 Chunk 12 Chunk 13 Chunk 14 Chunk 15 Chunk 16 Chunk 17 Chunk 18 Chunk 19 Chunk 20 Chunk 21 Chunk 22 Chunk 23 Chunk 24 5

Availability: Redundancy Mirroring Same data written to both disks. Single Host Unit Striping with Parity Data Stream, OR IO Stream, can be multiplexed across multiple disks, depending on BW vs Thruput. Parity data is also stored on disk. > Add XOR d parity for increased availability. 6

Parity Redundancy Parity = XOR of data from every disk in the RAID unit DATA 0 1 1 0 0 parity - Any single disk s data can be recovered by XOR ing the data of the surviving disks. 7

RAID Levels Many to choose from Each offers unique tradeoffs Performance Availability Costs We will consider 0, 1, 3, 5, 10 8

RAID 0 Disk Striping with No Redundancy High Performance; Low Availability Data Striped on Multiple Disks Multi-threaded Access Data Stream, OR IO Stream, can be Striped across multiple disks (BW vs Thruput). 9

RAID 0 Striping Chunk size tuneable for BW or Thruput No redundancy 1 2 3 4 5 Chunk 0 Chunk 1 Chunk 2 Chunk 3 Chunk 4 Chunk 5 Chunk Parity6 Chunk 7 Chunk 8 Chunk 9 Chunk 10 Chunk 11 Chunk 12 Chunk 13 Chunk 14 Chunk 15 Chunk 16 Chunk 17 Chunk 18 Chunk 19 Chunk 20 Chunk 21 Chunk 22 Chunk 23 Chunk 24 10

RAID 1 Disk Mirroring Single-disk Performance; Expensive Availability Data 100% duplicated across both spindles. Single-threaded access SAME data written to BOTH disks -- no segmenting. Single Host Unit 11

RAID 1/0 Striped Mirrors Highest Performance; Most Expensive Availability Multi-threaded Access Single Host Unit Data Stream, OR IO Stream, can be Striped across multiple disks (BW vs Thruput). 12

RAID 3 Disk Striping with dedicated parity drive High BW Performance; Cheap Availability Sector-granular data striping Single-threaded Access Data Stream is Striped across N-1 disks for high bandwidth. XOR Parity Data 13

RAID 3 Striping Chunk size = single sector (pure RAID 3 would be single byte) All parity data on same spindle 1 2 3 4 5 Chunk 0 Chunk 1 Chunk 2 Chunk 3 Parity Chunk 4 Chunk Parity5 Chunk 6 Chunk 7 Parity Chunk 8 Chunk 9 Chunk 10 Chunk 11 Parity Chunk 12 Chunk 13 Chunk 14 Chunk 15 Parity Chunk 16 Chunk 17 Chunk 18 Chunk 19 Parity 14

RAID 5 Disk Striping with rotating parity drive High Read Performance, expensive Write performance; Cheap Availability Tuneable Stripe granularity Optimized for multi-thread access IO Stream is Striped across N-1 disks for high IOs per second (thruput). XOR Parity Data 15

RAID 5 Striping Chunk size is tuned such that typical IO aligns on single disk. Parity rotates amongst disks to avoid write bottleneck 1 2 3 4 5 Chunk 0 Chunk 1 Chunk 2 Chunk 3 Parity Chunk 4 Chunk Parity5 Chunk 6 Parity Chunk 7 Chunk 8 Chunk 9 Parity Chunk 10 Chunk 11 Chunk 12 Parity Chunk 13 Chunk 14 Chunk 15 Parity Chunk 16 Chunk 17 Chunk 18 Chunk 19 16

RAID 5 - Write Operation 1. Read old data. 2. Write new data 3. XOR old and new data to create Partial Product. 4. Read old parity data. 5. Xor old parity with partial product, writing out result as new parity. } Old New P. P. Old P. XOR } XOR New P. 1 2 3 4 5 Chunk 0 Chunk 1 Chunk 2 Chunk 3 Parity 17

RAID Level Review RAID 0 - Data striping, Non-redundant. High Performance, Low Availability RAID 1 - Mirroring Moderate Performance, Expensive High Availability RAID 1/0 - Striping and Mirroring High Performance, Expensively High Availability RAID 3 - Striping, single parity disk. High Bandwidth Performance, Cheap Availability RAID 5 - Striping, rotating parity disk. High Thruput Performance, Cheap Availability 18

Summary Increasing performance gap between CPU and IO Data availability a priority RAID meets the IO challenge: Performance via parallelism Data Availability via redundancy Flexibility via multiple RAID levels, each offer unique performance/availability/cost tradeoffs 19