CS420: Operating Systems

Similar documents
Definition of RAID Levels

Lecture 36: Chapter 6

Operating Systems. RAID Redundant Array of Independent Disks. Submitted by Ankur Niyogi 2003EE20367

Reliability and Fault Tolerance in Storage

RAID HARDWARE. On board SATA RAID controller. RAID drive caddy (hot swappable) SATA RAID controller card. Anne Watson 1

How To Create A Multi Disk Raid

RAID. Storage-centric computing, cloud computing. Benefits:

Filing Systems. Filing Systems

How to choose the right RAID for your Dedicated Server

RAID Overview

RAID Level Descriptions. RAID 0 (Striping)

Why disk arrays? CPUs speeds increase faster than disks. - Time won t really help workloads where disk in bottleneck

CS161: Operating Systems

PIONEER RESEARCH & DEVELOPMENT GROUP

Price/performance Modern Memory Hierarchy

RAID Basics Training Guide

How To Write A Disk Array

An Introduction to RAID. Giovanni Stracquadanio

SSDs and RAID: What s the right strategy. Paul Goodwin VP Product Development Avant Technology

Input / Ouput devices. I/O Chapter 8. Goals & Constraints. Measures of Performance. Anatomy of a Disk Drive. Introduction - 8.1

Chapter Introduction. Storage and Other I/O Topics. p. 570( 頁 585) Fig I/O devices can be characterized by. I/O bus connections

technology brief RAID Levels March 1997 Introduction Characteristics of RAID Levels

Disks and RAID. Profs. Bracy and Van Renesse. based on slides by Prof. Sirer

RAID technology and IBM TotalStorage NAS products

Storing Data: Disks and Files

What is RAID and how does it work?

Introduction. What is RAID? The Array and RAID Controller Concept. Click here to print this article. Re-Printed From SLCentral

Storage Technologies - 2

Striped Set, Advantages and Disadvantages of Using RAID

Using RAID6 for Advanced Data Protection

Copyright 2007 Ramez Elmasri and Shamkant B. Navathe. Slide 13-1

Database Management Systems

Fault Tolerance & Reliability CDA Chapter 3 RAID & Sample Commercial FT Systems

RAID Made Easy By Jon L. Jacobi, PCWorld

Chapter 13 Disk Storage, Basic File Structures, and Hashing.

Dependable Systems. 9. Redundant arrays of. Prof. Dr. Miroslaw Malek. Wintersemester 2004/05

VERY IMPORTANT NOTE! - RAID

RAID Performance Analysis

Block1. Block2. Block3. Block3 Striping

CS 6290 I/O and Storage. Milos Prvulovic

Storage node capacity in RAID0 is equal to the sum total capacity of all disks in the storage node.

Chapter 9: Peripheral Devices: Magnetic Disks

Summer Student Project Report

RAID: Redundant Arrays of Independent Disks

Standard RAID levels - Wikipedia, the free encycl...

RAID Levels and Components Explained Page 1 of 23

Sistemas Operativos: Input/Output Disks

Disk Array Data Organizations and RAID

RAID. RAID 0 No redundancy ( AID?) Just stripe data over multiple disks But it does improve performance. Chapter 6 Storage and Other I/O Topics 29

Chapter 13. Chapter Outline. Disk Storage, Basic File Structures, and Hashing

RAID. Contents. Definition and Use of the Different RAID Levels. The different RAID levels: Definition Cost / Efficiency Reliability Performance

Why disk arrays? CPUs improving faster than disks

RAID Technology Overview

Overview of I/O Performance and RAID in an RDBMS Environment. By: Edward Whalen Performance Tuning Corporation

What is RAID? data reliability with performance

RAID Rebuilding. Objectives CSC 486/586. Imaging RAIDs. Imaging RAIDs. Imaging RAIDs. Multi-RAID levels??? Video Time

Data Storage - II: Efficient Usage & Errors

Module 6. RAID and Expansion Devices

Chapter 6 External Memory. Dr. Mohamed H. Al-Meer

Storage. The text highlighted in green in these slides contain external hyperlinks. 1 / 14

Chapter 13. Disk Storage, Basic File Structures, and Hashing

RAID Overview: Identifying What RAID Levels Best Meet Customer Needs. Diamond Series RAID Storage Array

Hard Disk Drives and RAID

DELL RAID PRIMER DELL PERC RAID CONTROLLERS. Joe H. Trickey III. Dell Storage RAID Product Marketing. John Seward. Dell Storage RAID Engineering

03 infra TI RAID. MTBF; RAID Protection; Mirroring and Parity; RAID levels; write penalty

RAID Technology. RAID Overview

Disk Storage & Dependability

How To Improve Performance On A Single Chip Computer

IncidentMonitor Server Specification Datasheet

CS 61C: Great Ideas in Computer Architecture. Dependability: Parity, RAID, ECC

CS 153 Design of Operating Systems Spring 2015

Surviving Two Disk Failures. Introducing Various RAID 6 Implementations

Lecture 23: Multiprocessors

Outline. Database Management and Tuning. Overview. Hardware Tuning. Johann Gamper. Unit 12

Distributed Storage Networks and Computer Forensics

HARD DRIVE CHARACTERISTICS REFRESHER

Algorithms and Methods for Distributed Storage Networks 5 Raid-6 Encoding Christian Schindelhauer

Assessing RAID ADG vs. RAID 5 vs. RAID 1+0

BrightStor ARCserve Backup for Windows

RAID 6 with HP Advanced Data Guarding technology:

Guide to SATA Hard Disks Installation and RAID Configuration

NVIDIA RAID Installation Guide

New Advanced RAID Level for Today's Larger Storage Capacities: Advanced Data Guarding

Using Multipathing Technology to Achieve a High Availability Solution

Reliability of Systems and Components

William Stallings Computer Organization and Architecture 7 th Edition. Chapter 6 External Memory

NAS 251 Introduction to RAID

Chapter 10: Mass-Storage Systems

1 Storage Devices Summary

Guide to SATA Hard Disks Installation and RAID Configuration

Transcription:

NK YORK COLLEGE OF PENNSYLVANIA HG OK 2 RAID YORK COLLEGE OF PENNSYLVAN James Moscola Department of Physical Sciences York College of Pennsylvania Based on Operating System Concepts, 9th Edition by Silberschatz, Galvin, Gagne

RAID Redundant Array of Independent Disks (RAID) - Combines multiple physical drives into a single logical volume - Originally created as an alternative to large expensive disks Combine small inexpensive disk to achieve the same storage capacity at lower cost (was once Redundant Array of Inexpensive Disks ) - Two main goals of RAID systems: Increase reliability of storage through redundancy Increase I/O performance by distributing the load 2

Improved Reliability Through Redundancy All hard disks will fail eventually -- it s just a question of when With a single disk, if a disk failure occurs data is lost Reliability of data storage can be increased by using multiple disks - If a single disk fails, one or more additional disks contain enough information to reconstruct the data from the lost disk - Multiple disks can be configured for redundancy in a couple of ways Mirroring - one disk mirrors the data from another disk Parity - a parity bit is stored for the bits located in the same location on each of the disks in a RAID 3

Improved Performance Through Striping Mechanical disk drives are S L O W - Approximately 8 ms seek time + some rotational latency Speed of read and write operations can be improved by dividing up the data to be written and writing it to multiple disk -- striping - Writing x bytes to a disk may take some time t - When striping data across n disks, Only need to write x/n bytes to each disk Time to read/write may be reduced by a factor of n Striping may take place on different scales - Write a bit each of file to a different disk; write a byte of each file to a different disk; write a block of each file to a different disk; etc. 4

Mean Time Between Failures Mean Time Between Failures (MTBF) is a prediction of the amount of time between component failures (assumes that the failed component can be repaired/replaced) - A measure of hardware reliability (e.g. disk reliability) - Does NOT mean that your hardware won t fail sooner - Does NOT mean that your disks won t all fail at the same time Example of determining MTBF: - Run 10,000 units for 1,000 hours each, count how many fail (e.g. 20 failures) - MTBF = (10,000 * 1000) / 20 = 500,000 hours ~= 57 years If the MTBF of a single disk is 100,000 hours, then the MTBF of a RAID with 100 disks will be 100,000/100 = 1000 hours ~= 41 days - This can be improved through redundancy 5

RAID Levels RAID comes in a variety of different levels - There are many standard levels of RAID - Different levels of RAID provide different advantages/disadvantages There are also a number of non-standard RAID implementations - Typically developed by a company or research group that feels the standard levels of RAID do not meet the needs of all users 6

RAID Level 0 (Striping) Block-level striping - Simply provides the ability to combine multiple physical disk drives into a single volume - Improves the read and write performance by a factor of n where n is the number of disks in the array - Does NOT provide any redundancy -- data is NOT protected - A single disk failure means the contents of the entire array are lost Array Failure Rate = 1 - (1 - r) n where n is the number of disks in the array, and r is the failure rate for each disk over some period of time Example: 10 disks with 5% failure rate over 1 year 1 - (1 -.05) 10 = 40% chance array will fail by end of first year RAID diagrams from Wikimedia Commons 7

RAID Level 1 (Mirroring) Mirroring (no parity or striping) - Each disk in the array has an exact copy somewhere else in the array (all data is written to n disks) Provides redundancy through data replication - Can improve the read performance by a factor of n since data can be read from any of the n disks No performance increase for writing data - Fault tolerant up to n-1 disk failures Read performance will degrade as disks fail - Can be expensive Array Failure Rate = r n where n is the number of disks in the array, and r is the failure rate for each disk over some period of time Example: 10 disks with 5% failure rate over 1 year.05 10 = very small chance of failure in first year 8

RAID Level 3 Byte-level striping with dedicated parity disk - Data is striped across n-1 disks at the byte level, a single disk is used for parity - Only a single parity disk is required - Can improve the read performance by a factor of n-1 since a portion of the data is read from each of the n-1 disks - Can only tolerate a single disk failure - Since all disks are required for each I/O request, RAID 3 arrays can only service a single I/O request at a time Read/write a block at a time - Parity computation can be expensive Most RAID controllers contain specialized hardware to compute parity Array Failure Rate = n(n -1)r 2 Example: 10 disks with 5% failure rate over 1 year 10(10-1) *.05 2 = 22.5% chance of failure 9

RAID Level 4 Block-level striping with dedicated parity disk - Data is striped across n-1 disks at the block level, a single disk is used for parity - Only a single parity disk is required - Can improve the read performance by a factor of n-1 since data is a portion of the data is read from each of the n-1 disks - Can only tolerate a single disk failure - Can handle multiple reads in parallel when block sized reads are performed - Parity computation can be expensive Most RAID controllers contain specialized hardware to compute parity - Single parity disk can become a bottleneck - Writing data smaller than a full stripe is costly Array Failure Rate = n(n -1)r 2 Example: 10 disks with 5% failure rate over 1 year 10(10-1) *.05 2 = 22.5% chance of failure Requires read/modify/write to compute new parity 10

RAID Level 5 Block-level striping with distributed parity - Similar to RAID Level 4, but distributes parity among all n disks in the array - Avoids overusing a single parity disk Increase longevity of parity disk Eliminates single parity disk bottleneck - Can only tolerate a single disk failure - Most popular RAID implementation since it provides good performance and redundancy is inexpensive - Suffers the same problems as RAID 4 when write is smaller than a stripe Array Failure Rate = n(n -1)r 2 Example: 10 disks with 5% failure rate over 1 year 10(10-1) *.05 2 = 22.5% chance of failure 11

RAID Level 6 Block-level striping with double distributed parity - Similar to RAID 5, but adds a second level of parity - Data and parity blocks are striped across the n disks in the array - Typically uses ECC codes such as Reed- Solomon instead of simple parity - Can tolerate two disk failures before data loss - Parity calculations are even more expensive than in previous levels of RAID Array Failure Rate = n(n -1)(n-2)r 3 Example: 10 disks with 5% failure rate over 1 year 10(10-1)(10-2) *.05 3 = 9% chance of failure 12

Combining RAID Levels (Hybrid RAID) RAID levels can be nested to get benefits from two different levels Examples: - RAID 0+1 - uses a second striped set to mirror the first striped set - RAID 5+0 (a.k.a RAID 50) - combines block level striping of RAID 0 with distributed parity of RAID 5 - Many other possible nested RAID levels, RAID 10, 51, 60, 61, 100, etc. 13

Rebuilding an Array When a disk failure occurs, it is important to replace the disk as soon as possible - Many RAID levels can only sustain a single drive failure - Rebuilding the array can take MANY hours (10+ hours depending on disk sizes) - A secondary failure during array rebuild can cause total data loss :-( - A secondary failure during array rebuild becomes more likely as ALL disk are working continuously to rebuild array RAID 6 provides better protection during array rebuild as it can sustain a second failure and still continue the rebuild the array 14

Nonstandard RAID Implementations Many nonstandard versions of RAID exist - RAID-DP (double parity) - uses two bit parity on dedicated disks - BeyondRAID (Drobo devices) - FlexRAID - unraid :-) - Many others 15