ISTANBUL AYDIN UNIVERSITY

Similar documents
Introduction. What is RAID? The Array and RAID Controller Concept. Click here to print this article. Re-Printed From SLCentral

IncidentMonitor Server Specification Datasheet

How To Create A Multi Disk Raid

RAID HARDWARE. On board SATA RAID controller. RAID drive caddy (hot swappable) SATA RAID controller card. Anne Watson 1

Dependable Systems. 9. Redundant arrays of. Prof. Dr. Miroslaw Malek. Wintersemester 2004/05

RAID 6 with HP Advanced Data Guarding technology:

RAID Made Easy By Jon L. Jacobi, PCWorld

What is RAID and how does it work?

Chapter 6 External Memory. Dr. Mohamed H. Al-Meer

RAID Basics Training Guide

Lecture 36: Chapter 6

RAID Implementation for StorSimple Storage Management Appliance

The idea behind RAID is to have a number of disks co-operate in such a way that it looks like one big disk.

RAID Storage System of Standalone NVR

VERY IMPORTANT NOTE! - RAID

technology brief RAID Levels March 1997 Introduction Characteristics of RAID Levels

RAID Overview: Identifying What RAID Levels Best Meet Customer Needs. Diamond Series RAID Storage Array

Benefits of Intel Matrix Storage Technology

Storage node capacity in RAID0 is equal to the sum total capacity of all disks in the storage node.

Operating Systems. RAID Redundant Array of Independent Disks. Submitted by Ankur Niyogi 2003EE20367

Input / Ouput devices. I/O Chapter 8. Goals & Constraints. Measures of Performance. Anatomy of a Disk Drive. Introduction - 8.1

How to choose the right RAID for your Dedicated Server

Technology Update White Paper. High Speed RAID 6. Powered by Custom ASIC Parity Chips

DELL RAID PRIMER DELL PERC RAID CONTROLLERS. Joe H. Trickey III. Dell Storage RAID Product Marketing. John Seward. Dell Storage RAID Engineering

Data Storage - II: Efficient Usage & Errors

PIONEER RESEARCH & DEVELOPMENT GROUP

Distribution One Server Requirements

How To Use A Raid

Data Integrity: Backups and RAID

SSDs and RAID: What s the right strategy. Paul Goodwin VP Product Development Avant Technology

NVIDIA RAID Installation Guide

Firebird and RAID. Choosing the right RAID configuration for Firebird. Paul Reeves IBPhoenix. mail:

Overview of I/O Performance and RAID in an RDBMS Environment. By: Edward Whalen Performance Tuning Corporation

An Introduction to RAID. Giovanni Stracquadanio

RAID Levels and Components Explained Page 1 of 23

Block1. Block2. Block3. Block3 Striping

CS 6290 I/O and Storage. Milos Prvulovic

Definition of RAID Levels

GENERAL INFORMATION COPYRIGHT... 3 NOTICES... 3 XD5 PRECAUTIONS... 3 INTRODUCTION... 4 FEATURES... 4 SYSTEM REQUIREMENT... 4

Non-Redundant (RAID Level 0)

IBM ^ xseries ServeRAID Technology

Assessing RAID ADG vs. RAID 5 vs. RAID 1+0

CS420: Operating Systems

Storing Data: Disks and Files

An Introduction to RAID 6 ULTAMUS TM RAID

RAID Level Descriptions. RAID 0 (Striping)

Summer Student Project Report

Guide to SATA Hard Disks Installation and RAID Configuration

RAID technology and IBM TotalStorage NAS products

Fault Tolerance & Reliability CDA Chapter 3 RAID & Sample Commercial FT Systems

Hard Disk Drives and RAID

RAID. Storage-centric computing, cloud computing. Benefits:

Getting Started With RAID

RAID Technology Overview

RAID-DP : NETWORK APPLIANCE IMPLEMENTATION OF RAID DOUBLE PARITY FOR DATA PROTECTION A HIGH-SPEED IMPLEMENTATION OF RAID 6

Gladstone Health & Leisure Technical Services

Barracuda Backup Server. Introduction

Intel Rapid Storage Technology

Guide to SATA Hard Disks Installation and RAID Configuration

Serial ATA RAID PCI. User's Manual

Using RAID6 for Advanced Data Protection

Storage. The text highlighted in green in these slides contain external hyperlinks. 1 / 14

In This Lecture. Physical Design. RAID Arrays. RAID Level 0. RAID Level 1. Physical DB Issues, Indexes, Query Optimisation. Physical DB Issues

CHAPTER 4 RAID. Section Goals. Upon completion of this section you should be able to:

The Cloud Hosting Revolution: Learn How to Cut Costs and Eliminate Downtime with GlowHost's Cloud Hosting Services

Technical White paper RAID Protection and Drive Failure Fast Recovery

High Availability Server Clustering Solutions

Deploying and Optimizing SQL Server for Virtual Machines

Physical Data Organization

RAID installation guide for ITE8212F

NetApp RAID-DP : Dual-Parity RAID 6 Protection Without Compromise

How To Set Up A Raid On A Hard Disk Drive On A Sasa S964 (Sasa) (Sasa) (Ios) (Tos) And Sas964 S9 64 (Sata) (

SiS964/SiS180 SATA w/ RAID User s Manual. Quick User s Guide. Version 0.3

ZCP 7.0 (build 41322) Zarafa Collaboration Platform. Zarafa Archiver Deployment Guide

Module 6. RAID and Expansion Devices

PARALLEL I/O FOR HIGH PERFORMANCE COMPUTING

Version : 1.1. SR2760-2S-S2 User Manual. SOHORAID Series

RAID-DP: NetApp Implementation of Double- Parity RAID for Data Protection

What is RAID--BASICS? Mylex RAID Primer. A simple guide to understanding RAID

High Performance Computing. Course Notes High Performance Storage

USER GUIDE. PageScope Enterprise Suite. Backup and Recovery

Version : 1.0. SR3620-2S-SB2 User Manual. SOHORAID Series

How Does the ECASD Network Work?

TECHNOLOGY BRIEF. Compaq RAID on a Chip Technology EXECUTIVE SUMMARY CONTENTS

STAR Watch Statewide Technology Assistance Resources Project A publication of the Western New York Law Center,Inc.

How To Write A Disk Array

RAID Technology White Paper

an analysis of RAID 5DP

CS161: Operating Systems

Storage Technologies - 2

5-Bay Raid Sub-System Smart Removable 3.5" SATA Multiple Bay Data Storage Device User's Manual

BrightStor ARCserve Backup for Windows

Seradex White Paper. Focus on these points for optimizing the performance of a Seradex ERP SQL database:

Onboard-RAID. Onboard-RAID supports striping (RAID 0), mirroring (RAID 1), striping/mirroring (RAID 0+1), or spanning (JBOD) operation, respectively.

Storage Options for Document Management

RAID5 versus RAID10. First let's get on the same page so we're all talking about apples.

Linux Software Raid. Aug Mark A. Davis

Intel RAID Software User s Guide:

SiS 180 S-ATA User s Manual. Quick User s Guide. Version 0.1

RAID Technology. RAID Overview

Transcription:

ISTANBUL AYDIN UNIVERSITY 2013-2014 Academic Year Fall Semester Department of Software Engineering SEN361 COMPUTER ORGANIZATION HOMEWORK REPORT STUDENT S NAME : GÖKHAN TAYMAZ STUDENT S NUMBER : B1105.090068 ADVISER : HASAN HÜSEYİN BALIK

What is RAID? Using Multiple Hard Drives for Performance and Reliability Introduction Back in the late 1980's and early 1990's, computer information servers were encountering a dramatic increase in the amount of data they needed to serve and store. Storage technologies were getting very expensive to place a large number of high capacity hard drives in the servers. A solution was needed and thus RAID was born. So what exactly is RAID? First of all, the acronym stands for Redundant Array of Inexpensive Disks. It was a system developed whereby a large number of low cost hard drives could be linked together to form a single large capacity storage device that offered superior performance, storage capacity and reliability over older storage solutions. It has been widely used and deployed method for storage in the enterprise and server markets, but over the past 5 years has become much more common in end user systems. Advantages of RAID There are three primary reasons that RAID was implemented: Redundancy Increased Performance Lower Costs Redundancy is the most important factor in the development of RAID for server environments. This allowed for a form of backup of the data in the storage array in the event of a failure. If one of the drives in the array failed, it could either be swapped out for a new drive without turning the systems off (referred to as hot swappable) or the redundant drive could be used. The method of redundancy depends on which version of RAID is used. The increased performance is only found when specific versions of the RAID are used. Performance will also be dependent upon the number of drives used in the array and the controller. All managers of IT departments like low costs. When the RAID standards were being developed, cost was also a key issue. The point of a RAID array is to provide the same or greater storage capacity for a system compared to using individual high capacity hard drives. A good example of this can be seen in the price differences between the highest capacity hard drives and lower capacity drives. Three drives of a smaller size could cost less than an individual high-capacity drive but provide more capacity. There are typically three forms of RAID used for desktop computer systems: RAID 0, RAID 1 and RAID 5. In most cases, only the first two of these versions is available and one of the two technically is not a form of RAID.

RAID 0 Following are the key points to remember for RAID level 0. Minimum 2 disks. Drive 1 Drive 2 Excellent performance ( as blocks are striped ). Block 1 1 2 No redundancy ( no mirror, no parity ). Block 2 3 4 Don t use this for any critical system. Block 3 5 6 The lowest designated level of RAID, level 0, is actually not a valid type of RAID. It was given the designation of level 0 because it fails to provide any level of redundancy for the data stored in the array. Thus, if one of the drives fails, all the data is damaged.raid 0 uses a method called striping. Striping takes a single chunk of data like a graphic image, and spreads that data across multiple drives. The advantage that striping has is in improved performance. Twice the amount of data can be written in a given time frame to the two drives compared to that same data being written to a single drive.below is an example of how data is written in a RAID 0 implementation. Each row in the chart represents a physical block on the drive and each column is the individual drive. The numbers in the table represent the data blocks. Duplicate numbers indicate a duplicated data block.thus, if the 6 blocks of data above constitute a single data file, it can be read and written to the drive much faster than if it were on a single drive. Each drive working in parallel could read only 3 physical blocks while it would take a single drive twice as long because it has to read 6 physical blocks. The drawback of course is that if one drive fails, the data is no longer functional. All 6 data blocks are needed for the file, but only three are accessible. Increased storage performance No loss in data capacity Disadvantages: No redundancy of data

RAID 1 Minimum 2 disks. Good performance ( no striping. no parity ). Excellent redundancy ( as blocks are mirrored ). RAID version 1 was the first real implementation of RAID. It provides a simple form of redundancy for data through a process called mirroring. This form typically requires two individual drives of similar capacity. One drive is the active drive and the secondary drive is the mirror. When data is written to the active drive, the same data is written to the mirror drive.the following is an example of how the data is written in a RAID 1 implementation. Each row in the chart represents a physical block on the drive and each column is the individual drive. The numbers in the table represent the data blocks. Duplicate numbers indicate a duplicated data block. Drive 1 Drive 2 Block 1 1 1 Block 2 2 2 Block 3 3 3 This provides a full level of redundancy for the data on the system. If one of the drives fails, the other drive still has all the data that existed in the system. The big drawback of course is that the capacity of the RAID will only be as big as the smallest of the two drives, effectively halving the amount of storage capacity if the two drives were used independently. Provides full redundancy of data Disadvantages Storage capacity is only as large as the smallest drive No performance increases Some downtime to change active drive during a failure

RAID 0+1 This is a hybrid form of RAID that some manufacturers have implemented to try and give the advantages of each of the two versions combined. Typically this can only be done on a system with a minimum of 4 hard drives. It then combines the methods of mirroring and striping to provide the performance and redundancy. The first set of drives will be active and have the data striped across them while the second set of drives will be a mirror of the data on the first two. Below is an example of how data is written in a RAID 0+1 implementation. Each row in the chart represents a physical block on the drive and each column is the individual drive. The numbers in the table represent the data blocks. Duplicate numbers indicate a duplicated data block. Drive 1 Drive 2 Drive 3 Drive 4 Block 1 1 2 1 2 Block 2 3 4 3 4 Block 3 5 6 5 6 In this case, the data blocks will be striped across the drives within each of the two sets while it is mirrors between the sets. This gives the increased performance of RAID 0 because it takes the drive half the time to write the data compared to a single drive and it provides redundancy. The major drawback of course is the cost. This implementation requires a minimum of 4 hard drives. Increased performance Data is fully redundant Disadvantages: Large number of drives required Effective data capacity is halved

RAID 10 or 1+0 Minimum 4 disks. This is also called as stripe of mirrors Excellent redundancy ( as blocks are mirrored ) Excellent performance ( as blocks are striped ) If you can afford the dollar, this is the BEST option for any mission critical applications RAID 10 is effectively a similar version to RAID 0+1. Rather than striping data between the disk sets and then mirroring them, the first two drives in the set are a mirrored together. The second two drives form another set of disks that is are mirror of one another but store striped data with the first pair. This is a form of nested RAID setup. Drives 1 and 2 are a RAID 1 mirror and drives 3 and 4 are also a mirror. These two sets are then setup as stripped array.below is an example of how data is written in a RAID 10 implementation. Each row in the chart represents a physical block on the drive and each column is the individual drive. The numbers in the table represent the data blocks. Duplicate numbers indicate a duplicated data block. Drive 1 Drive 2 Drive 3 Drive 4 Block 1 1 1 2 2 Block 2 3 3 4 4 Block 3 5 5 6 6 Just like the RAID 0+1 setup, RAID 10 requires a minimum of four hard drives to function. Performance is pretty much the same but the data is a bit more protected than the RAID 0+1 setup. Increased performance Data is fully redundant Disadvantages: Large number of drives required Effective data capacity is halved

RAID 5 Minimum 3 disks. Good performance ( as blocks are striped ). Good redundancy ( distributed parity ). Best cost effective option providing both performance and redundancy. Use this for DB that is heavily read oriented. Write operations will be slow. This is the most powerful form of RAID that can be found in a desktop computer system. Typically it requires the form of a hardware controller card to manage the array, but some desktop operating systems can create these via software. This method uses a form of striping with parity to maintain data redundancy. A minimum of three drives is required to build a RAID 5 array and they should be identical drives for the best performance. Parity is essentially a form of binary math that compares two blocks a data and forms a third data block based upon the first two. The easiest way to explain it is even and odd. If the sum of the two data blocks is even, then the parity bit is even. If the sum of the two data blocks is odd, the parity bit is odd. So 0+0 and 1+1 both equal 0 while 0+1 or 1+0 will equal 1. Based on this form of binary math, a failure in one drive in the array will allow the parity bit to reconstruct the data when the drive is replaced. With that information in mind, here is an example of how a RAID 5 array would work. Each row in the chart represents a physical block on the drive and each column is the individual drive. The numbers in the table represent the data blocks. Duplicate numbers indicate a duplicated data block. A "P" indicates a parity bit for two blocks of data. Drive 1 Drive 2 Drive 3 Block 1 1 2 P Block 2 3 P 4 Block 3 P 5 6 The parity bit shifts between the drives to increase the performance and reliability of the data. The drive array will still have increased performance over a single drive because the multiple drives can write the data faster than a single drive. The data is also fully redundant because of the parity bits. In

the case of drive 2 failing, the data can be rebuilt based on the data and parity bits on the two remaining drives. Data capacity is reduced due to the parity data blocks. In practice the capacity of the array is based on the following equation where n is the number of drives and z is the capacity: (n-1)z = Array Capacity In the case of three 500 gigabyte hard drives, the total capacity would be (3-1)x500GB or 1000 gigabytes. Hardware RAID 5 implementations can also have a function called hot swap. This allows for drives to be replaced while the array is still functioning to either increase the drives capacity or to replace a damaged drive. The drive controller then takes time while the array is running to rebuild the data array across the drives. This is a valuable feature for systems that require 24x7 operation. Increased storage array performance Full data redundancy Ability to run 24x7 with hot swap Disadvantages High costs to implement Performance degrades during rebuilding Software vs. Hardware RAID In order for RAID to function, there needs to be software either through the operating system or via dedicated hardware to properly handle the flow of data from the computer system to the drive array. This is particularly important when it comes to RAID 5 due to the large amount of computing required to generate the parity calculations. In the case of software implementations, CPU cycles are taken away from the general computing environment to perform the necessary tasks for the RAID interface. Software implementations are very low cost monetarily because all that is necessary to implement one is the hard drives. The problem with software RAID implementations is the performance drop of the system. In general, this performance hit can be anywhere from 5% or even greater depending upon the processor, memory, drives used and the level of RAID implemented. Most people do not use software RAID anymore due to the decreasing costs of hardware RAID controllers over the years. Hardware RAID has the advantage of dedicated circuitry to handle all the RAID drive array calculations outside of the processor. This provides excellent performance for the storage array. The drawbacks to hardware RAID have been the costs. In the case of RAID 0/1 controllers, those costs have become so low that many chipset and motherboard manufacturers are including these capabilities on the motherboards. The real costs rest with RAID 5 hardware that require more circuitry for added computing ability.

Drive Selection What a lot of people don't realize is that the performance and capacity of a RAID array is heavily dependent upon the hard drives used in the array. For the best results, all hard drives in the array should be the same brand and model. This means that all of the hard drives will have the same capacity and performance levels. It is not a requirement that the drives be matched, but mismatching the drives can actually hurt the RAID array. The capacity of the RAID array will depend upon the method implemented. In the case of RAID 0, the striping can only be done across an equal amount of space on the two drives. As a result, if an 80GB and 100GB drive are used to make the array, the final capacity of the array would only be 160GB. Similarly, in RAID 1 the drives can only mirror data equal to the smallest size. Thus based on the two drives mentioned before, the final data size would only be 80GB. RAID 5 is a bit more complicated because of the formula mentioned before. Once again the smallest capacity would be used. So if a 80GB, 100 GB and 120GB drive were used to make a RAID 5 array, the final capacity would be 160GB of data. Performance of the array is also dependent upon the drives. In order for the array to function properly, it must wait for the data to be written to each of the drives before it can continue. This means that in the example charts for the RAID arrays, the controller must wait until all physical data has been written to block 1 across all the drives in the array before it can continue to the next set of data for the drives. This means an array where one drive has half the performance of the other two will slow down the overall performance of the other drives. Conclusions Overall RAID provides systems with a variety of benefits depending upon the version implemented. Most consumer users will likely opt to use the RAID 0 for increased performance without the loss of storage space. This is primarily because redundancy is not an issue for the average user. In fact, most computer systems will only offer either RAID 0 or 1. The costs of implementing a RAID 0+1 or RAID 5 system generally are too expensive for the average consumer and are only found in high-end workstation or server level systems. References https://wiki.mq.edu.au/display/~mq20111141/09+-+raid+disc+storage http://www.ssbg.com.cn/admin/webedit/uploadfile/what%20is%20raid.pdf http://www.ustudy.in/node/6729