W4118: RAID. Instructor: Junfeng Yang



Similar documents
How To Write A Disk Array

Lecture 36: Chapter 6

Operating Systems. RAID Redundant Array of Independent Disks. Submitted by Ankur Niyogi 2003EE20367

CS161: Operating Systems

Input / Ouput devices. I/O Chapter 8. Goals & Constraints. Measures of Performance. Anatomy of a Disk Drive. Introduction - 8.1

Disk Array Data Organizations and RAID

Reliability and Fault Tolerance in Storage

Storing Data: Disks and Files

CS 6290 I/O and Storage. Milos Prvulovic

Data Storage - II: Efficient Usage & Errors

How To Improve Performance On A Single Chip Computer

Price/performance Modern Memory Hierarchy

RAID. Storage-centric computing, cloud computing. Benefits:

RAID. Tiffany Yu-Han Chen. # The performance of different RAID levels # read/write/reliability (fault-tolerant)/overhead

RAID. RAID 0 No redundancy ( AID?) Just stripe data over multiple disks But it does improve performance. Chapter 6 Storage and Other I/O Topics 29

RAID Overview

1 Storage Devices Summary

technology brief RAID Levels March 1997 Introduction Characteristics of RAID Levels

Definition of RAID Levels

Solving Data Loss in Massive Storage Systems Jason Resch Cleversafe

File System Design and Implementation

Outline. Database Management and Tuning. Overview. Hardware Tuning. Johann Gamper. Unit 12

Disk Storage & Dependability

RAID HARDWARE. On board SATA RAID controller. RAID drive caddy (hot swappable) SATA RAID controller card. Anne Watson 1

Why disk arrays? CPUs improving faster than disks

Chapter 6 External Memory. Dr. Mohamed H. Al-Meer

Introduction. What is RAID? The Array and RAID Controller Concept. Click here to print this article. Re-Printed From SLCentral

Why disk arrays? CPUs speeds increase faster than disks. - Time won t really help workloads where disk in bottleneck

RAID 5 rebuild performance in ProLiant

an analysis of RAID 5DP

Introduction Disks RAID Tertiary storage. Mass Storage. CMSC 412, University of Maryland. Guest lecturer: David Hovemeyer.

Disks and RAID. Profs. Bracy and Van Renesse. based on slides by Prof. Sirer

California Software Labs

Storage Technologies - 2

PIONEER RESEARCH & DEVELOPMENT GROUP

Filing Systems. Filing Systems

COSC 6374 Parallel Computation. Parallel I/O (I) I/O basics. Concept of a clusters

Storage and File Structure

An Introduction to RAID. Giovanni Stracquadanio

Reliability of Systems and Components

What is RAID? data reliability with performance

Storage node capacity in RAID0 is equal to the sum total capacity of all disks in the storage node.

Guide to SATA Hard Disks Installation and RAID Configuration

Chapter Introduction. Storage and Other I/O Topics. p. 570( 頁 585) Fig I/O devices can be characterized by. I/O bus connections

CS420: Operating Systems

RAID Level Descriptions. RAID 0 (Striping)

CS 61C: Great Ideas in Computer Architecture. Dependability: Parity, RAID, ECC

RAID. Contents. Definition and Use of the Different RAID Levels. The different RAID levels: Definition Cost / Efficiency Reliability Performance

COMP 7970 Storage Systems

Guide to SATA Hard Disks Installation and RAID Configuration

How To Create A Multi Disk Raid

Storing Data: Disks and Files. Disks and Files. Why Not Store Everything in Main Memory? Chapter 7

RAID: Redundant Arrays of Independent Disks

RAID Basics Training Guide

Overview of I/O Performance and RAID in an RDBMS Environment. By: Edward Whalen Performance Tuning Corporation

Storage. The text highlighted in green in these slides contain external hyperlinks. 1 / 14

A Tutorial on RAID Storage Systems

Assessing RAID ADG vs. RAID 5 vs. RAID 1+0

How to choose the right RAID for your Dedicated Server

Physical Data Organization

Block1. Block2. Block3. Block3 Striping

Lecture 23: Multiprocessors

3PAR Fast RAID: High Performance Without Compromise

Striped Set, Advantages and Disadvantages of Using RAID

Chapter 11 I/O Management and Disk Scheduling


File System & Device Drive. Overview of Mass Storage Structure. Moving head Disk Mechanism. HDD Pictures 11/13/2014. CS341: Operating System

TECHNOLOGY BRIEF. Compaq RAID on a Chip Technology EXECUTIVE SUMMARY CONTENTS

Module 6. RAID and Expansion Devices

RAID Implementation for StorSimple Storage Management Appliance

RAID Made Easy By Jon L. Jacobi, PCWorld

A backup is a copy of your files that will be able to reproduce the original, if it is lost, damaged or stolen.

Database Management Systems

What is RAID and how does it work?

Timing of a Disk I/O Transfer

Firebird and RAID. Choosing the right RAID configuration for Firebird. Paul Reeves IBPhoenix. mail:

Redundant Array of Independent Disks (RAID)

Fault Tolerance & Reliability CDA Chapter 3 RAID & Sample Commercial FT Systems

RAID Storage, Network File Systems, and DropBox

RAID 6 with HP Advanced Data Guarding technology:

RAID Performance Analysis

Benefits of Intel Matrix Storage Technology

Dependable Systems. 9. Redundant arrays of. Prof. Dr. Miroslaw Malek. Wintersemester 2004/05

Chapter 2 Data Storage

VERY IMPORTANT NOTE! - RAID

Guide to SATA Hard Disks Installation and RAID Configuration

Chapter 9: Peripheral Devices: Magnetic Disks

Guide to SATA Hard Disks Installation and RAID Configuration

GENERAL INFORMATION COPYRIGHT... 3 NOTICES... 3 XD5 PRECAUTIONS... 3 INTRODUCTION... 4 FEATURES... 4 SYSTEM REQUIREMENT... 4

Transcription:

W4118: RAID Instructor: Junfeng Yang References: Modern Operating Systems (3 rd edition), Operating Systems Concepts (8 th edition), previous W4118, and OS at MIT, Stanford, and UWisc

RAID motivation Performance Disks are slow compared to CPU Disk speed improves slowly compared to CPU Reliability In single disk systems, one disk failure è data loss Cost A single fast, reliable disk is expensive 1

RAID idea RAID idea: use redundancy to improve performance and reliability Redundant array of cheap disks as one storage unit Fast: simultaneous read and write disks in the array Reliable: use parity to detect and correct errors RAID can have different redundancy levels, achieving different performance and reliability Seven different RAID levels (0-6) 2

Evaluating RAID Cost: check disk capacity / total capacity Storage utilization: data capacity / total capacity Reliability Tolerance of disk failures Performance (Large) sequential read, write, read-modify-write (Small) random read, write, read-modify-write Speedup over a single disk 3

Computing cost D = number of data disks in a RAID group C = number of check disks in a RAID group Cost = C/(D+C) 4

Computing reliability N = total number of disks D = number of data disks in a RAID group C = number of check/parity disks in a RAID group MTTF(disk) = mean time to failure for a disk Estimated as MTTF (in years) = 1 / AFR (annual failure rate) Ex) 114 years (1M hours) = 1 / 0.88% Source: "Disk failures in the real world: What does an MTTF of 1,000,000 hours mean to you?", FAST'07 MTTR = mean time to repair for a failed disk MTTF(group) = mean time to two failed disks before first gets repaired in one group MTTF(raid) = mean time to failure over entire array MTTF(raid) = MTTF(group) / Num. groups 5

Computing reliability (cont d) Assume single-error tolerance in one group If another error comes before repair, group fails MTTF(group) = MTTF(1 disk) / Prob[Another failure within MTTR] If Prob[...] 1, MTTF(group) same as MTTF(1 disk) - no benefit of RAID If Prob[...] 0, MTTF(group) approaches. MTTF(1 disk) = MTTF(disk)/(D+C) MTTF(another disk) = MTTF(disk)/(D+C-1) Prob[Another failure within MTTR] = MTTR/(MTTF(disk)/(D+C-1)) MTTF(group) = MTTF(1 disk)/prob[another failure within MTTR] = (MTTF(disk))2/((D+C)*(D+C-1)*MTTR) Num groups G = N / (D+C) MTTF(raid) = MTTF(group) / G = MTTF(group) / (N/(D+C)) Thus: MTTF(raid) = (MTTF(disk))2 / (N * (D+C-1) * MTTR) But: are the assumptions valid? 6

RAID 0: non-redundant striping Structure Data striped across all disks in an array No parity Advantages: Good performance: with N disks, roughly N times speedup Disadvantages: Poor reliability: one disk failure è data loss MTTF(raid)=MTTF(disk)/N 7

RAID 0 performance Large read of 100 blocks. One disk: 100 * t, Raid0: 100/N * t * S S: slowdown. Need to wait for slowest disk to complete before return. Performance: Large read: N/S Large write: N/S Large R-M-W: N/S Small read: N Small write: N Small R-M-W: N 8

Structure RAID 1: mirroring Keep a mirrored (shadow) copy of data Advantages Good reliability: one disk failure OK Good read performance Disadvantage High cost: one data disk requires one parity disk 9

RAID 1 performance Cost = C/(D+C) = 1/(1+1) = 50% MTTF(raid) = MTTF(disk)2/(N*MTTR) Performance Large read: N/S Large write: N/2S Large R-M-W: 2N/3S X sectors, 2X events (X reads, X writes) Speedup (w.r.t. to 1 disk) = 2X / (X/(N/S) + X/(N/2S)) = 2N/3S Small read: N (no S here since only two disks) Small write: N/2 Small R-M-W: 2N/3 10

RAID 2: error-correction parity Structure A data sector striped across data disks Compute error-correcting parity and store in parity disks Advantages Good reliability with higher storage utilization than mirroring Disadvantages Unnecessary cost: disk can already detect failure Poor random performance parity disks 11

RAID 3: bit-interleaved parity Structure Single parity disk (XOR of each stripe of a data sector) Advantages Same reliability with one disk failure as RAID2 since disk controller can determine what disk fails Higher storage utilization Disadvantages Poor random performance 12

RAID 4: block-interleaved parity Structure A set of data sectors (parity group) striped across data disks Advantages Same reliability as RAID3 Good random read performance Disadvantages Poor random write and read-modify-write performance P 13

RAID 4 performance One parity disk (XOR of data sectors) Write data disk + parity disk To update parity, don t have to read all disk sectors Parity = oldparity xor (changed bits) = oldparity xor newdata xor olddata Number of groups: G = N/(D+1) = number of check disks Performance Large read: (N-G)/S Large write: (N-G)/S Large R-M-W: (N-G)/S Small read: N-G Small write: ½*G (for each block, need a read and a write to parity disk) RAID: X sectors. Small R-M-W: 1*G X/((X/1) + (X/1)) = ½ RAID: X sectors. 2X/((X/1) + (X/1)) = 1 14

RAID 5: block-interleaved distributed parity Structure Parity sectors distributed across all disks Advantages Good performance 1 2 3 4 5 6 7 8 9 10 11 12 1 2 3 4 P P 5 6 7 8 12 P 9 10 11 15 16 P 13 14 15

RAID 5 performance Same as RAID4 except no single parity disk Good small write and read-modify-write performance Performance Large read: (N-G)/S Large write: (N-G)/S Large R-M-W: (N-G)/S Small read: N Small write: N/4 One disk: X sectors * t. Raid 5: (X (read original) + X (read parity) + X (write original) + X (write parity)) / N * t Raid5 can do 4X over all N disks Small R-M-W: N/2 Same as small write, except read-original is not wasted. 16

RAID6: P+Q redundancy Structure Same as RAID 5 except using two parity sectors per parity group Advantages Can tolerate two disk failures Q P Q P Q P Q P 17

RAID levels