Three-Dimensional Redundancy Codes for Archival Storage

Size: px
Start display at page:

Download "Three-Dimensional Redundancy Codes for Archival Storage"

Transcription

1 Three-Dimensional Redundancy Codes for Archival Storage Jehan-François Pâris Darrell D. E. Long Witold Litwin Department of Computer Science University of Houston Houston, T, USA Department of Computer Science University of California Santa Cruz, CA, USA LAMSADE Université Paris-Dauphine Paris, France Abstract Fault-tolerant disk arrays rely on replication or erasure-coding to reconstruct lost data after a disk failure. As disk capacity increases, so does the risk of encountering irrecoverable read errors that would prevent the full recovery of the lost data. We propose a three-dimensional erasure-coding technique that reduces that risk by guaranteeing full recovery in the presence of all triple and nearly all quadruple disk failures. Our solutioerforms better than existing solutions, such as sets of disk arrays using Reed-Solomon codes against triple failures in each individual array. Given its very high reliability, it is especially suited to the needs of very large data sets that must be preserved over long periods of time. Keywords-RAID arrays I. INTRODUCTION One of the major challenges facing the storage community is finding efficient ways to store enormous amounts of data and preserve its integrity over time periods that can span decades. There are already several archives that exceed ten petabytes. Such a ten-petabyte archive would require ten thousand one-terabyte disks. Suppose that we group these ten thousand disks into one thousand disks arrays each with ten data disks and we add enough redundant storage to each individual array to give it a percent reliability over the lifetime of the archive. As the reliability of the whole archive is the product of the reliabilities of its constituting elements, that reliability would only be 90.5 percent. As a result, traditional solutions that worked well for much smaller data sets become inadequate. Cost is another factor to consider. Mirroring is a popular solution for storing small to medium-size data sets. It is less effective for very large archives as it would double the hardware cost and the power consumption of an already costly storage system. At the same time, solutions that provide higher reliability with a lower space overhead become much more attractive. We present a storage architecture that satisfies these two criteria. Conventional linear RAID arrays consist of several data disks and one parity disk. We extend that approach to a three-dimensional space and organize our data disks in such a way that each data disk participates in three distinct parity groups. As a result, the contents of these disks are protected against all triple and nearly all quadruple failures. Another advantage of our approach is its low space overhead. A three- Supported iart by Grant CCF , by the Department of Energy under Award Number DE-FC02-10ER26017/DE-SC and by the industrial members of the Storage Systems Research Center. D P 2 P 1 P D 1 D 2 D 1 D 2 P D 12 Figure 1. A two-dimensional RAID organization using four parity disks to protect the contents of six data disks against all double disk failures. dimensional array with parity disks carotect the contents n of up to p II. PREVIOUS WORK Two-dimensional RAID arrays, or 2D-Parity arrays, were investigated by Schwarz [1] and Hellerstein et al. [2] who noted that these arrays tolerate all double disk failures but did not investigate how they reacted to triple or quadruple disk failures. More recently, Lee patented a two-dimensional disk array organization with prompt parity updates in one dimension and delayed parity updates in the second dimension []. Pâris et al. [] presented two-dimensional RAID arrays that reorganized themselves after a disk failure and noted that all two-dimensional RAID arrays tolerated most triple failures. More recently, Pâris et al. [5] investigated two-dimensional n RAID arrays consisting of parity disks and p data disks and showed that this architecture protected the contents of the data disks against all double and most triple failures. Uehara presented several MeshRAID organizations [6, 7]. III. OUR TECHNIQUE Before introducing three-dimensional RAID arrays, let us briefly describe the two- dimensional RAID organizations on which they are based. Consider the two-dimensional RAID organization described in Fig. 1. It consists of four parity disks, labeled P 1 to P, and six data disks, labeled D 12 to D. These disks are organized in such a way that [5]: 1. Each parity disk is on a separate parity stripe; 2. All parity stripes intersect with each other;. These intersections contain a single data disk;. All data disks belong to exactly two parity stripes.

2 γ γ γ γ γ γ Figure 2. A three-dimensional RAID organization using four parity disks to protect the contents of four data disks against all triple disk failures. Each data disk is identified by the three parity planes to which it belongs. Adding a fifth parity disk to the array, would allow us to form an additional parity stripe and place four new data disks on this stripe, all located at the intersection of the new parity stripe with one of the extant four parity stripes. More generally, adding an additional parity disk to an array with parity disks would allow us to add one additional data disk to each of n the extant parity stripes disks for a total p + 1 data disks and + 1 parity disks. As all data disks belong to two distinct parity stripes, the array can recover from all double failures without losing any data. [5]. Moving from two to three dimensions is the most natural way to increase the protection afforded by two-dimensional arrays. To keep the duality betweearity stripes representing segments and data disks represented by segment intersections, we introduce the notion of parity plane. A parity plane consists of a certain number of data disks and one parity disk that contains the exclusive or (OR) of the contents of the Our new organization will be defined by the following four properties: 1. Each parity plane contains a distinct parity disk; 2. All parity planes intersect with each other;. The intersections of three parity planes contain a single data disk;. All data disks belong to exactly three parity planes. As all data disks belong to three distinct parity planes, these three-dimensional RAID arrays will be able to recover from all three failures without losing any data. Figure 2 represents a three-dimensional RAID array with four data disks and four parity planes As we can see, the four intersecting planes,, γ, and define a tetrahedron with six edges and four summits, and each of these summits defines a data disk, to which we attach the labels of the three intersecting parity planes. Because each data disk is on three distinct parity planes, the array will be able to recover from all triple disk failures without any data loss. Observing that all parity planes intersect with each other and each intersection of three parity planes consists of a single data disk, we can see an array with parity disks and the n same number of parity planes will hold p Since each data disk is on three separate parity planes, each n p n p 1 parity plane will contain = For n p instance, a three dimensional array with 6 parity disks will contain 20 data disks and each parity plane will contain 10 data disks and one parity disk. 2 Figure. A pair of data disks that have failed and lost the two parity planes γ and they do not share. The data disks cannot recover from their two other parity planes and because they share both planes. γ (failed) γ γ Figure. Three data disks that have failed and lost their shared parity plane. They cannot recover from their three other parity planes, and γ. Each data disk is identified by the three parity planes to which it belongs. γ γ Figure 5. A fatal quadruple failure where all four failed disk occupy the four summits of a tetrahedron formed by four parity planes. Let us now consider how three-dimensional arrays react to quadruple failures. As long as these failures involve unrelated disks, the array will be able to recover without any data loss. The sole exceptions are: 1. The failure of a data disk and its three parity disks: both the data on the data disk and the parity information needed to reconstruct them is lost. 2. The failure of two data disks and the two parity disks they do not share: since the data disks share their two other parity disks, we cannot use them to recover the contents of the two Figure illustrates this case.. The failures of three data disks occupying the three summits of one side of a tetrahedron formed by four parity planes plus the parity disk of the parity plane that contains the three data disks: recovery is prevented because the data disks also share their other parity planes. Figure illustrates this case.. The failure of four data disks such that none of the four failed disks belongs to a parity plane that is not shared with one of the three other disks: recovery is then impossible because each parity plane contains a single parity disk and can only recover from a single disk failure within the parity plane. We have to distinguish here among three possible scenarios: γ

3 γ γε ε (γ is hidden γ Figure 6. A quadruple failure where the four failed disks occupy four summits of a truncated triangular pyramid formed by five parity planes. Node γε will be able to recover as it is the sole failed disk that is iarity plane ε. ε ζ ε ζ γζ γ ( is hidden) Figure 7. A quadruple failure where all four failed disks occupy four summits of a hexahedron formed by six parity planes. Node ζ will be able to recover, as it is the sole node that belongs to the parity plane ζ. Let n d = γε a. The four failed data disks are the summits of a tetrahedron formed by four parity planes: the four data disks will share four parity planes and each of these four planes will contain three failed disks. Figure 5 illustrates this case. b. The four failed data disks are at one of the six summits of a truncated triangular pyramid formed by five parity planes and none of the four failed disks belongs a parity plane that is not shared with one of the three other disks. This is the case whenever three of the four failed disks and the fourth failed disk on the opposite side will be able to recover because that failed disk belongs to a parity plane it does not share with any of the three other disks. As shown in Fig. 6, configurations that place three of the failed disks at either the top or the bottom of the truncated pyramid, and the fourth failed disk on the opposite side will be able to recover because that failed disk belongs to a parity plane it does not share with any of the three other disks. c. The four failed disks are at four of the eight summits of a hexahedron formed by six parity planes and none of the four failed disks belongs to a parity plane that is not shared. As shown on Fig. 7, the array can recover if one of the four failed disks belongs to a parity plane that it does not share with any of the three other failed disks. γε + n d denote the number of data disks in a threedimensional array with parity disks. Out of the possible quadruple failures the array can experience, we can enumerate: TABLE I. SUMMARY OF THE PROPERTIES OF SMALL TO MEDIUM-SIZE THREE- DIMENSIONAL RAID ARRAYS. Parity disks Data disks Total Space Overhead Disks per plane Tolerated faults Coding overhead n d distinct failures of a data disk and its three parity disks; 2. distinct failures of two data disks sharing two parity planes plus their two other parity disks;. distinct failures of three data disks forming the summits of one of the four sides of a tetrahedron plus the parity disk of the parity plane that contains the three disks;. distinct failures of four data disks forming the summits of a tetrahedron; 5. 9 distinct fatal failures of four data disks placed at the summits of a truncated pyramid: there are n p distinct truncated triangular pyramids formed 5 by the intersection of five parity planes and 6 of the 6 possible positions for the four failed disks are safe distinct fatal failures of four data disks placed at the summits of a hexahedron: there are n p distinct truncated triangular pyramids formed by the intersection of five parity planes and 56 of the 8 possible positions for the four failed disks on a giveyramid are safe. As a result, the fraction of quadruple failures that will result in a data loss is n p n p n p n p nd = 5 6. np + nd This ratio quickly decreases with the size of the array: it is already less than percent for an array with 15 disks and becomes less than 0.5 percent for all arrays with disks or more. As a result all three-dimensional arrays with 15 disks or more tolerate nearly all quadruple failures. Table I summarizes the properties of small to medium-size three-dimensional arrays. We define the space overhead as the fraction of space occupied by the parity data and the coding

4 Nλ (N-1)λ (N-2)λ (1-)(N-)λ 1.E+10 0 μ 1 2μ 2 μ (N-)λ μ Data Loss (N-)λ Figure 8. State transitiorobability diagram for a three-dimensional RAID array with N disks. MTTDL (years) 1.E+09 1.E+08 1.E+07 1.E+06 1.E+05 1.E+0 1.E+0 D: 20 data + 6 parity disks D: 5 data + 7 parity disks D: 56 data + 8 parity disks overhead as by dividing the number of parity disks by the average number of disk failures the array can tolerate. As we can see, three-dimensional RAID arrays with 26 to 6 disks offer an attractive combination of relatively low storage and coding overheads. IV. RELIABILITY ANALYSIS Estimating the reliability of a storage system means estimating the probability R(t) that the system will operate correctly over the time interval [0, t] given that it operated correctly at time t = 0. Computing that function requires solving a system of linear differential equations, a task that becomes quickly intractable as the complexity of the system grows. A simpler option is to use instead the mean time to data loss (MTTDL) of the storage system, which is the approach we will take here. Our system model consists of an array of disks with independent failure modes. Whenever a disk fails, a repair process is immediately initiated for that disk. Should several disks fail, the repair process will be performed iarallel on those disks. We assume that disk failures are independent events and times between consecutive failures for a given device are exponentially distributed with mean λ. In addition, we require repairs to be exponentially distributed with mean μ. Both hypotheses are necessary to represent our system by a Markov process with a finite number of states. Figure 8 displays the state transitiorobability diagram for a three-dimensional RAID array with parity disks and n d data disks for a total of N = + nd disks. State <0> is the original state where all N disks are operational. Should one of the disks fail, the system would move to state <1> with an aggregate failure rate Nλ. A second failure would bring the system to state <2>, and a third failure would bring the system to state <>. As we have seen, only a fraction of all possible quadruple failures will result in a data loss. Hence the two failure transitions from state <> are: 1. A transition to the failure state with rate (N )λ 2. A transition to state <> with rate (1 )(N )λ. As long as the total number of nodes in the array remains small, the possibility that the array will experience a quintuple failure during its lifetime will remain very small. We can thus safely neglect the probability that the array will survive a quintuple failure and assume that all quintuple failures will result in a data loss transition from state <> to the failure state. Recovery transitions are more straightforward: they bring the array from state <> to state <>, then from state <> to state <2> and so on until the system returns to state <0>. As we did in our previous studies [5], we compute first the Laplace transforms of the system of differential equations describing our model, solve this system and use the formula 1.E+02 1.E+01 1.E Repair Time (days) Figure 9. Mean Times To Data Loss of three-dimensional arrays with respectively 26, 2 and 6 disks. MTTDL = * p i i= 0 to compute the MTTDL of our system. For instance, the MTTDL of a three-dimensional RAID array with 6 parity disks and 20 data disks is λ μλ μ λ μ λ +1950μ 7800λ (82225λ +178μ) Figure 9 displays on a logarithmic scale the MTTDLs achieved by three-dimensional arrays with respectively 26, 2 and 6 disks. We assumed a disk failure rate λ of one failure every one hundred thousand hours, which is slightly less than one failure every eleven years. This rate is at the high end of the failure rates observed by Pinheiro et al. [8] as well as Schroeder and Gibson [9]. MTTDLs are expressed in years and repair times in days. As we can see, the three arrays exhibit MTTDLs that exceed ten million years when failed disks are promptly replaced and one million years when the replacement process takes several days. While these very large MTTDLs may appear irrelevant, we can use them to compute the probability that the array will lose no data during its lifetime. Assuming an array lifetime of five years, and observing that array longterm failure rates do not significantly differ from their average failure rates over their first five years, we can convert the MTTDLs into reliabilities using the formula d R d = exp( ) MTTDL where d is a five-year interval expressed in the same units as the MTTDL. We can then see that a MTTDL of ten million years corresponds to a probability of no data loss over a five year interval while a MTTDL of one million years correspond to a probability of no data loss over the same time interval. We also compared these MTTDLs with those that could be obtained by two other disk array organizations. The two benchmarks we selected were: 1. A pair of disk arrays with 10 data disks and parity disks each: this organization offers the same storage capacity and the same storage overhead as a threedimensional RAID array with 20 data disks and 6 parity disks. (0)

5 MTTDL (years) 1.E+10 1.E+09 1.E+08 1.E+07 1.E+06 1.E+05 1.E+0 1.E+0 1.E+02 1.E+01 D: 20 data + 6 parity disks Two arrays with 10 data + parity disks 60 disks with triplicated data 2D: 21 data + 7 parity disks 1.E Repair Time (days) Figure 10. Compared Mean Times To Data Loss of (a) a three-dimensional RAID array with 20 data disks and six parity disks, (b) a pair of RAID arrays with 10 data disks and parity disks each, (c) 60 disks containing triplicate data and (d) a two-dimensional RAID array with 21 data disks and 7 parity disks. The first three configurations have the same storage capacity. 2. A single disk array with 60 disks using three-way mirroring to store the equivalent content of 20 data disks.. A two-dimensional RAID array with 21 data disks and 7 parity disks [5]. Figure 10 summarizes our results. As we can see, the three-dimensional RAID organization achieves MTTDLs that are.5 to 7.5 times better than those obtained by the pair of RAID arrays with triple parity. Both organizations provide MTTDLs much higher than those obtained by the three-way mirroring organization and the two-dimensional RAID array. This should not surprise us as three-way mirroring and twodimensional RAID arrays only protect data against double disk failures while the two other organizations can tolerate triple disk failures without data loss. V. IMPLEMENTATION CONSIDERATIONS Achieving the highest possible reliability for a given redundancy level is not the sole factor to consider when selecting a disk array organization. If this were the case, all disk arrays would use Reed-Solomon codes. There are instead other factors to consider such as update rates and the complexity of data recovery operations. Update rates are essentially limited by the complexity of the update operations and by the existence of potential bottlenecks. All storage arrays that protect against triple disk failures require all updates to be recorded on at least four disks. Three-dimensional RAID arrays do not differ in that sense from any other solution including those using Reed- Solomon codes. The main difference is that they comprise distinct data disks and parity disks while most other solutions allow each disk to hold both data and parity information. Since all updates must be propagated to three parity disks, the update rates of three-dimensional RAID arrays cannot exceed one third of the aggregate update bandwidth of their parity disks. As a result, these arrays are best suited to applications with low update rates and high reliability requirements such as archival storage systems. Data recovery offers a different picture. Whenever a conventional RAID array loses a data disk, it needs to read the contents of all the other data disks in the same parity stripe as the failed disk plus one of the parity disks of that stripe. The process is essentially sequential and often very slow. This is not the case for three-dimensional RAID arrays. Since each data disk belongs to three distinct parity planes, we can speed up data recovery by accessing iarallel these three parity planes and dividing recovery tasks among them. Installations with three or more spare disks on hand could achieve an even higher speedup by reconstructing the lost data on three separate disks [10]. VI. CONCLUSIONS We have presented a three-dimensional RAID organization where each data disk belongs to three parity planes and each parity plane contains a single parity disk. As our organization requires all parity planes to intersect and prevents more than three planes from sharing a common intersection, we can place a maximum number of data disks at each intersection of the three parity planes. The outcome is an organization that requires data disks to protect its contents against all triple failures and at least 98 percent of quadruple disk failures, thus performing much better than organizations storing all data in triplicate and significantly better than other triple-parity RAID organizations. Its sole limitation is its relatively low update bandwidth, which restricts its use to applications with low update rates and high reliability requirements such as archival storage systems. More work is still needed to evaluate the impact of correlated failures on three-dimensional array reliability and investigate other three-dimensional organizations. REFERENCES [1] T. J. E. Schwarz, Reliability and Performance of Disk Arrays, Ph.D. Thesis, Dept. of Computer Science and Engineering, University of California, San Diego, 199. [2] L. Hellerstein, G. Gibson, R. M. Karp, R. H. Katz, and D.A. Patterson, Coding techniques for handling failures in large disk arrays, Algorithmica, 12(-): , June 199. [] W. S. Lee, Two-dimensional storage array with prompt parity in one dimension and delayed parity in a second dimension, US Patent # B1, 200 [] J.-F. Pâris, T. J. E. Schwarz and D. D. E. Long, Self-adaptive archival storage systems, Proc. 26 th Int. Performance of Computers and Communication Conf., pp , Apr [5] J.-F. Pâris, A. Amer, and T. J. E. Schwarz, Low-Redundancy Two- Dimensional RAID Arrays, Proc Int. Conf. on Computing, Networking and Communications, Data Storage Technology and Applications Symp., pp , Jan.-Feb [7] M. Uehara, Design and implementation of D MeshRAID in virtual large-scale disks, Proc. rd Int. Conf. on Intelligent Networking and Collaborative Systems, pp , Nov. Dec [8] E. Pinheiro, W.-D. Weber and L. A. Barroso, Failure trends in a large disk drive population, Proc. 5 th USENI Conf. on File and Storage Technologies, pp , Feb [8] M. Uehara, Design and Implementation of Mesh RAID with multiple parities in virtual large-scale disks, Proc. 26 th IEEE Int. Conf. on Advanced Information Networking and Applications, pp , Mar [9] B. Schroeder and G. A. Gibson, Disk failures in the real world: what does an MTTF of 1,000,000 hours mean to you? Proc. 5 th USENI Conf. on File and Storage Technologies, pp. 1 16, Feb [10] I. Corderi, T. J. Schwarz, A. Amer, D. D. E. Long and J.-F. Pâris, Self-Adjusting Two-Failure Tolerant Disk Arrays, Proc. 5 th Int. Workshop on Petascale Data Storage, Nov

Protecting Data Against Early Disk Failures

Protecting Data Against Early Disk Failures Protecting Data gainst Early Disk Failures Jehan-François Pâris Dept. of Computer Science University of Houston Houston, TX 77204-3010 Thomas J. E. Schwarz Dept. of Computer Engineering Santa Clara University

More information

Self-Adaptive Disk Arrays

Self-Adaptive Disk Arrays Proc. 8 th International Symposium on Stabilization, Safety and Securiity of Distributed Systems (SSS 006), Dallas, TX, Nov. 006, to appear Self-Adaptive Disk Arrays Jehan-François Pâris 1*, Thomas J.

More information

Using Shared Parity Disks to Improve the Reliability of RAID Arrays

Using Shared Parity Disks to Improve the Reliability of RAID Arrays Using Shared Parity Disks to Improve the Reliability of RAID Arrays Jehan-François Pâris Department of Computer Science University of Houston Houston, TX 7704-010 paris@cs.uh.edu Ahmed Amer Department

More information

technology brief RAID Levels March 1997 Introduction Characteristics of RAID Levels

technology brief RAID Levels March 1997 Introduction Characteristics of RAID Levels technology brief RAID Levels March 1997 Introduction RAID is an acronym for Redundant Array of Independent Disks (originally Redundant Array of Inexpensive Disks) coined in a 1987 University of California

More information

How To Improve Performance On A Single Chip Computer

How To Improve Performance On A Single Chip Computer : Redundant Arrays of Inexpensive Disks this discussion is based on the paper:» A Case for Redundant Arrays of Inexpensive Disks (),» David A Patterson, Garth Gibson, and Randy H Katz,» In Proceedings

More information

Solving Data Loss in Massive Storage Systems Jason Resch Cleversafe

Solving Data Loss in Massive Storage Systems Jason Resch Cleversafe Solving Data Loss in Massive Storage Systems Jason Resch Cleversafe 2010 Storage Developer Conference. Insert Your Company Name. All Rights Reserved. 1 In the beginning There was replication Long before

More information

PIONEER RESEARCH & DEVELOPMENT GROUP

PIONEER RESEARCH & DEVELOPMENT GROUP SURVEY ON RAID Aishwarya Airen 1, Aarsh Pandit 2, Anshul Sogani 3 1,2,3 A.I.T.R, Indore. Abstract RAID stands for Redundant Array of Independent Disk that is a concept which provides an efficient way for

More information

Why RAID is Dead for Big Data Storage. The business case for why IT Executives are making a strategic shift from RAID to Information Dispersal

Why RAID is Dead for Big Data Storage. The business case for why IT Executives are making a strategic shift from RAID to Information Dispersal Why RAID is Dead for Big Data Storage The business case for why IT Executives are making a strategic shift from RAID to Information Dispersal Executive Summary Data is exploding, growing 10X every five

More information

How To Create A Multi Disk Raid

How To Create A Multi Disk Raid Click on the diagram to see RAID 0 in action RAID Level 0 requires a minimum of 2 drives to implement RAID 0 implements a striped disk array, the data is broken down into blocks and each block is written

More information

an analysis of RAID 5DP

an analysis of RAID 5DP an analysis of RAID 5DP a qualitative and quantitative comparison of RAID levels and data protection hp white paper for information about the va 7000 series and periodic updates to this white paper see

More information

Operating Systems. RAID Redundant Array of Independent Disks. Submitted by Ankur Niyogi 2003EE20367

Operating Systems. RAID Redundant Array of Independent Disks. Submitted by Ankur Niyogi 2003EE20367 Operating Systems RAID Redundant Array of Independent Disks Submitted by Ankur Niyogi 2003EE20367 YOUR DATA IS LOST@#!! Do we have backups of all our data???? - The stuff we cannot afford to lose?? How

More information

Non-Redundant (RAID Level 0)

Non-Redundant (RAID Level 0) There are many types of RAID and some of the important ones are introduced below: Non-Redundant (RAID Level 0) A non-redundant disk array, or RAID level 0, has the lowest cost of any RAID organization

More information

How To Write A Disk Array

How To Write A Disk Array 200 Chapter 7 (This observation is reinforced and elaborated in Exercises 7.5 and 7.6, and the reader is urged to work through them.) 7.2 RAID Disks are potential bottlenecks for system performance and

More information

Reliability and Fault Tolerance in Storage

Reliability and Fault Tolerance in Storage Reliability and Fault Tolerance in Storage Dalit Naor/ Dima Sotnikov IBM Haifa Research Storage Systems 1 Advanced Topics on Storage Systems - Spring 2014, Tel-Aviv University http://www.eng.tau.ac.il/semcom

More information

PROTEUS: A PORTABLE SIMULATION PROGRAM FOR ESTIMATING DATA LOSS RISKS IN DISK ARRAYS. A Thesis. Presented to. University of Houston

PROTEUS: A PORTABLE SIMULATION PROGRAM FOR ESTIMATING DATA LOSS RISKS IN DISK ARRAYS. A Thesis. Presented to. University of Houston PROTEUS: A PORTABLE SIMULATION PROGRAM FOR ESTIMATING DATA LOSS RISKS IN DISK ARRAYS A Thesis Presented to the Faculty of the Department of Computer Science University of Houston In Partial Fulfillment

More information

Combining Low IO-Operations During Data Recovery with Low Parity Overhead in Two-Failure Tolerant Archival Storage Systems

Combining Low IO-Operations During Data Recovery with Low Parity Overhead in Two-Failure Tolerant Archival Storage Systems 2015 IEEE 21st acific Rim International Symposium on Dependable Computing Combining Low IO-Operations During Data Recovery with Low arity Overhead in Two-Failure Tolerant Archival Storage Systems Thomas

More information

A Fault Tolerant Video Server Using Combined Raid 5 and Mirroring

A Fault Tolerant Video Server Using Combined Raid 5 and Mirroring Proceedings of Multimedia Computing and Networking 1997 (MMCN97), San Jose, CA, February 1997 A Fault Tolerant Video Server Using Combined Raid 5 and Mirroring Ernst W. BIERSACK, Christoph BERNHARDT Institut

More information

Theoretical Aspects of Storage Systems Autumn 2009

Theoretical Aspects of Storage Systems Autumn 2009 Theoretical Aspects of Storage Systems Autumn 2009 Chapter 1: RAID André Brinkmann University of Paderborn Personnel Students: ~13.500 students Professors: ~230 Other staff: ~600 scientific, ~630 non-scientific

More information

Lecture 36: Chapter 6

Lecture 36: Chapter 6 Lecture 36: Chapter 6 Today s topic RAID 1 RAID Redundant Array of Inexpensive (Independent) Disks Use multiple smaller disks (c.f. one large disk) Parallelism improves performance Plus extra disk(s) for

More information

RAID Basics Training Guide

RAID Basics Training Guide RAID Basics Training Guide Discover a Higher Level of Performance RAID matters. Rely on Intel RAID. Table of Contents 1. What is RAID? 2. RAID Levels RAID 0 RAID 1 RAID 5 RAID 6 RAID 10 RAID 0+1 RAID 1E

More information

RAID HARDWARE. On board SATA RAID controller. RAID drive caddy (hot swappable) SATA RAID controller card. Anne Watson 1

RAID HARDWARE. On board SATA RAID controller. RAID drive caddy (hot swappable) SATA RAID controller card. Anne Watson 1 RAID HARDWARE On board SATA RAID controller SATA RAID controller card RAID drive caddy (hot swappable) Anne Watson 1 RAID The word redundant means an unnecessary repetition. The word array means a lineup.

More information

RAID-DP: NetApp Implementation of Double- Parity RAID for Data Protection

RAID-DP: NetApp Implementation of Double- Parity RAID for Data Protection Technical Report RAID-DP: NetApp Implementation of Double- Parity RAID for Data Protection Jay White & Chris Lueth, NetApp May 2010 TR-3298 ABSTRACT This document provides an in-depth overview of the NetApp

More information

Data Storage - II: Efficient Usage & Errors

Data Storage - II: Efficient Usage & Errors Data Storage - II: Efficient Usage & Errors Week 10, Spring 2005 Updated by M. Naci Akkøk, 27.02.2004, 03.03.2005 based upon slides by Pål Halvorsen, 12.3.2002. Contains slides from: Hector Garcia-Molina

More information

Distributed Storage Networks and Computer Forensics

Distributed Storage Networks and Computer Forensics Distributed Storage Networks 5 Raid-6 Encoding Technical Faculty Winter Semester 2011/12 RAID Redundant Array of Independent Disks Patterson, Gibson, Katz, A Case for Redundant Array of Inexpensive Disks,

More information

Designing a Cloud Storage System

Designing a Cloud Storage System Designing a Cloud Storage System End to End Cloud Storage When designing a cloud storage system, there is value in decoupling the system s archival capacity (its ability to persistently store large volumes

More information

Algorithms and Methods for Distributed Storage Networks 5 Raid-6 Encoding Christian Schindelhauer

Algorithms and Methods for Distributed Storage Networks 5 Raid-6 Encoding Christian Schindelhauer Algorithms and Methods for Distributed Storage Networks 5 Raid-6 Encoding Institut für Informatik Wintersemester 2007/08 RAID Redundant Array of Independent Disks Patterson, Gibson, Katz, A Case for Redundant

More information

Technical White paper RAID Protection and Drive Failure Fast Recovery

Technical White paper RAID Protection and Drive Failure Fast Recovery Technical White paper RAID Protection and Drive Failure Fast Recovery RAID protection is a key part of all ETERNUS Storage Array products. Choices of the level chosen to meet customer application requirements

More information

CS161: Operating Systems

CS161: Operating Systems CS161: Operating Systems Matt Welsh mdw@eecs.harvard.edu Lecture 18: RAID April 19, 2007 2007 Matt Welsh Harvard University 1 RAID Redundant Arrays of Inexpensive Disks Invented in 1986-1987 by David Patterson

More information

RAID Level Descriptions. RAID 0 (Striping)

RAID Level Descriptions. RAID 0 (Striping) RAID Level Descriptions RAID 0 (Striping) Offers low cost and maximum performance, but offers no fault tolerance; a single disk failure results in TOTAL data loss. Businesses use RAID 0 mainly for tasks

More information

Increased Reliability with SSPiRAL Data Layouts

Increased Reliability with SSPiRAL Data Layouts Increased Reliability with SSPiRAL Data Layouts Ahmed Amer University of Pittsburgh amer@cs.pitt.edu Darrell D. E. Long University of California, Santa Cruz darrell@cs.ucsc.edu Thomas Schwarz Santa Clara

More information

What is RAID and how does it work?

What is RAID and how does it work? What is RAID and how does it work? What is RAID? RAID is the acronym for either redundant array of inexpensive disks or redundant array of independent disks. When first conceived at UC Berkley the former

More information

How to choose the right RAID for your Dedicated Server

How to choose the right RAID for your Dedicated Server Overview of RAID Let's first address, "What is RAID and what does RAID stand for?" RAID, an acronym for "Redundant Array of Independent Disks, is a storage technology that links or combines multiple hard

More information

Storage node capacity in RAID0 is equal to the sum total capacity of all disks in the storage node.

Storage node capacity in RAID0 is equal to the sum total capacity of all disks in the storage node. RAID configurations defined 1/7 Storage Configuration: Disk RAID and Disk Management > RAID configurations defined Next RAID configurations defined The RAID configuration you choose depends upon how you

More information

RAID Technology Overview

RAID Technology Overview RAID Technology Overview HP Smart Array RAID Controllers HP Part Number: J6369-90050 Published: September 2007 Edition: 1 Copyright 2007 Hewlett-Packard Development Company L.P. Legal Notices Copyright

More information

Mean time to meaningless: MTTDL, Markov models, and storage system reliability

Mean time to meaningless: MTTDL, Markov models, and storage system reliability Mean time to meaningless: MTTDL, Markov models, and storage system reliability Kevin M. Greenan ParaScale, Inc. James S. Plank University of Tennessee Jay J. Wylie HP Labs Abstract Mean Time To Data Loss

More information

CS420: Operating Systems

CS420: Operating Systems NK YORK COLLEGE OF PENNSYLVANIA HG OK 2 RAID YORK COLLEGE OF PENNSYLVAN James Moscola Department of Physical Sciences York College of Pennsylvania Based on Operating System Concepts, 9th Edition by Silberschatz,

More information

CSE 120 Principles of Operating Systems

CSE 120 Principles of Operating Systems CSE 120 Principles of Operating Systems Fall 2004 Lecture 13: FFS, LFS, RAID Geoffrey M. Voelker Overview We ve looked at disks and file systems generically Now we re going to look at some example file

More information

Guide to SATA Hard Disks Installation and RAID Configuration

Guide to SATA Hard Disks Installation and RAID Configuration Guide to SATA Hard Disks Installation and RAID Configuration 1. Guide to SATA Hard Disks Installation...2 1.1 Serial ATA (SATA) Hard Disks Installation...2 2. Guide to RAID Configurations...3 2.1 Introduction

More information

Implementation of Reliable Fault Tolerant Data Storage System over Cloud using Raid 60

Implementation of Reliable Fault Tolerant Data Storage System over Cloud using Raid 60 International Journal of Computer Sciences and Engineering Open Access Review Paper Volume-4, Issue-2 E-ISSN: 2347-2693 Implementation of Reliable Fault Tolerant Data Storage System over Cloud using Raid

More information

Energy aware RAID Configuration for Large Storage Systems

Energy aware RAID Configuration for Large Storage Systems Energy aware RAID Configuration for Large Storage Systems Norifumi Nishikawa norifumi@tkl.iis.u-tokyo.ac.jp Miyuki Nakano miyuki@tkl.iis.u-tokyo.ac.jp Masaru Kitsuregawa kitsure@tkl.iis.u-tokyo.ac.jp Abstract

More information

CS 153 Design of Operating Systems Spring 2015

CS 153 Design of Operating Systems Spring 2015 CS 153 Design of Operating Systems Spring 2015 Lecture 22: File system optimizations Physical Disk Structure Disk components Platters Surfaces Tracks Arm Track Sector Surface Sectors Cylinders Arm Heads

More information

Why disk arrays? CPUs speeds increase faster than disks. - Time won t really help workloads where disk in bottleneck

Why disk arrays? CPUs speeds increase faster than disks. - Time won t really help workloads where disk in bottleneck 1/19 Why disk arrays? CPUs speeds increase faster than disks - Time won t really help workloads where disk in bottleneck Some applications (audio/video) require big files Disk arrays - make one logical

More information

Filing Systems. Filing Systems

Filing Systems. Filing Systems Filing Systems At the outset we identified long-term storage as desirable characteristic of an OS. EG: On-line storage for an MIS. Convenience of not having to re-write programs. Sharing of data in an

More information

Definition of RAID Levels

Definition of RAID Levels RAID The basic idea of RAID (Redundant Array of Independent Disks) is to combine multiple inexpensive disk drives into an array of disk drives to obtain performance, capacity and reliability that exceeds

More information

RAID 5 rebuild performance in ProLiant

RAID 5 rebuild performance in ProLiant RAID 5 rebuild performance in ProLiant technology brief Abstract... 2 Overview of the RAID 5 rebuild process... 2 Estimating the mean-time-to-failure (MTTF)... 3 Factors affecting RAID 5 array rebuild

More information

RAID 6 with HP Advanced Data Guarding technology:

RAID 6 with HP Advanced Data Guarding technology: RAID 6 with HP Advanced Data Guarding technology: a cost-effective, fault-tolerant solution technology brief Abstract... 2 Introduction... 2 Functions and limitations of RAID schemes... 3 Fault tolerance

More information

Summer Student Project Report

Summer Student Project Report Summer Student Project Report Dimitris Kalimeris National and Kapodistrian University of Athens June September 2014 Abstract This report will outline two projects that were done as part of a three months

More information

IBM ^ xseries ServeRAID Technology

IBM ^ xseries ServeRAID Technology IBM ^ xseries ServeRAID Technology Reliability through RAID technology Executive Summary: t long ago, business-critical computing on industry-standard platforms was unheard of. Proprietary systems were

More information

RAID Made Easy By Jon L. Jacobi, PCWorld

RAID Made Easy By Jon L. Jacobi, PCWorld 9916 Brooklet Drive Houston, Texas 77099 Phone 832-327-0316 www.safinatechnolgies.com RAID Made Easy By Jon L. Jacobi, PCWorld What is RAID, why do you need it, and what are all those mode numbers that

More information

DELL RAID PRIMER DELL PERC RAID CONTROLLERS. Joe H. Trickey III. Dell Storage RAID Product Marketing. John Seward. Dell Storage RAID Engineering

DELL RAID PRIMER DELL PERC RAID CONTROLLERS. Joe H. Trickey III. Dell Storage RAID Product Marketing. John Seward. Dell Storage RAID Engineering DELL RAID PRIMER DELL PERC RAID CONTROLLERS Joe H. Trickey III Dell Storage RAID Product Marketing John Seward Dell Storage RAID Engineering http://www.dell.com/content/topics/topic.aspx/global/products/pvaul/top

More information

RAID: Redundant Arrays of Independent Disks

RAID: Redundant Arrays of Independent Disks RAID: Redundant Arrays of Independent Disks Dependable Systems Dr.-Ing. Jan Richling Kommunikations- und Betriebssysteme TU Berlin Winter 2012/2013 RAID: Introduction Redundant array of inexpensive disks

More information

A Highly Available Replication Control Protocol Using Volatile Witnesses

A Highly Available Replication Control Protocol Using Volatile Witnesses Highly vailable Replication ontrol Protocol Using Volatile Witnesses Jehan-François Pâris epartment of omputer Science University of Houston Houston, TX 77204-3475 STRT We propose a highly available replication

More information

RAID. RAID 0 No redundancy ( AID?) Just stripe data over multiple disks But it does improve performance. Chapter 6 Storage and Other I/O Topics 29

RAID. RAID 0 No redundancy ( AID?) Just stripe data over multiple disks But it does improve performance. Chapter 6 Storage and Other I/O Topics 29 RAID Redundant Array of Inexpensive (Independent) Disks Use multiple smaller disks (c.f. one large disk) Parallelism improves performance Plus extra disk(s) for redundant data storage Provides fault tolerant

More information

System Availability and Data Protection of Infortrend s ESVA Storage Solution

System Availability and Data Protection of Infortrend s ESVA Storage Solution System Availability and Data Protection of Infortrend s ESVA Storage Solution White paper Abstract This white paper analyzes system availability and data protection on Infortrend s ESVA storage systems.

More information

How To Understand And Understand The Power Of Aird 6 On Clariion

How To Understand And Understand The Power Of Aird 6 On Clariion A Detailed Review Abstract This white paper discusses the EMC CLARiiON RAID 6 implementation available in FLARE 26 and later, including an overview of RAID 6 and the CLARiiON-specific implementation, when

More information

RAID. Storage-centric computing, cloud computing. Benefits:

RAID. Storage-centric computing, cloud computing. Benefits: RAID Storage-centric computing, cloud computing. Benefits: Improved reliability (via error correcting code, redundancy). Improved performance (via redundancy). Independent disks. RAID Level 0 Provides

More information

Data Corruption In Storage Stack - Review

Data Corruption In Storage Stack - Review Theoretical Aspects of Storage Systems Autumn 2009 Chapter 2: Double Disk Failures André Brinkmann Data Corruption in the Storage Stack What are Latent Sector Errors What is Silent Data Corruption Checksum

More information

Using RAID6 for Advanced Data Protection

Using RAID6 for Advanced Data Protection Using RAI6 for Advanced ata Protection 2006 Infortrend Corporation. All rights reserved. Table of Contents The Challenge of Fault Tolerance... 3 A Compelling Technology: RAI6... 3 Parity... 4 Why Use RAI6...

More information

RAID-DP : NETWORK APPLIANCE IMPLEMENTATION OF RAID DOUBLE PARITY FOR DATA PROTECTION A HIGH-SPEED IMPLEMENTATION OF RAID 6

RAID-DP : NETWORK APPLIANCE IMPLEMENTATION OF RAID DOUBLE PARITY FOR DATA PROTECTION A HIGH-SPEED IMPLEMENTATION OF RAID 6 RAID-DP : NETWORK APPLIANCE IMPLEMENTATION OF RAID DOUBLE PARITY FOR DATA PROTECTION A HIGH-SPEED IMPLEMENTATION OF RAID 6 Chris Lueth, Network Appliance, Inc. TR-3298 [12/2006] ABSTRACT To date, RAID

More information

Reliability of Data Storage Systems

Reliability of Data Storage Systems Zurich Research Laboratory Ilias Iliadis April 2, 25 Keynote NexComm 25 www.zurich.ibm.com 25 IBM Corporation Long-term Storage of Increasing Amount of Information An increasing amount of information is

More information

Fault Tolerance & Reliability CDA 5140. Chapter 3 RAID & Sample Commercial FT Systems

Fault Tolerance & Reliability CDA 5140. Chapter 3 RAID & Sample Commercial FT Systems Fault Tolerance & Reliability CDA 5140 Chapter 3 RAID & Sample Commercial FT Systems - basic concept in these, as with codes, is redundancy to allow system to continue operation even if some components

More information

3PAR Fast RAID: High Performance Without Compromise

3PAR Fast RAID: High Performance Without Compromise 3PAR Fast RAID: High Performance Without Compromise Karl L. Swartz Document Abstract: 3PAR Fast RAID allows the 3PAR InServ Storage Server to deliver higher performance with less hardware, reducing storage

More information

RAID. Contents. Definition and Use of the Different RAID Levels. The different RAID levels: Definition Cost / Efficiency Reliability Performance

RAID. Contents. Definition and Use of the Different RAID Levels. The different RAID levels: Definition Cost / Efficiency Reliability Performance RAID Definition and Use of the Different RAID Levels Contents The different RAID levels: Definition Cost / Efficiency Reliability Performance Further High Availability Aspects Performance Optimization

More information

Reliability of Systems and Components

Reliability of Systems and Components Reliability of Systems and Components GNS Systems GmbH Am Gaußberg 2 38114 Braunschweig / Germany Fri 13 Apr 2012, Revision 0.1.1 Please send comments about this document to: norman.krebs@gns-systems.de

More information

How to recover a failed Storage Spaces

How to recover a failed Storage Spaces www.storage-spaces-recovery.com How to recover a failed Storage Spaces ReclaiMe Storage Spaces Recovery User Manual 2013 www.storage-spaces-recovery.com Contents Overview... 4 Storage Spaces concepts and

More information

Guide to SATA Hard Disks Installation and RAID Configuration

Guide to SATA Hard Disks Installation and RAID Configuration Guide to SATA Hard Disks Installation and RAID Configuration 1. Guide to SATA Hard Disks Installation... 2 1.1 Serial ATA (SATA) Hard Disks Installation... 2 2. Guide to RAID Configurations... 3 2.1 Introduction

More information

Best Practices RAID Implementations for Snap Servers and JBOD Expansion

Best Practices RAID Implementations for Snap Servers and JBOD Expansion STORAGE SOLUTIONS WHITE PAPER Best Practices RAID Implementations for Snap Servers and JBOD Expansion Contents Introduction...1 Planning for the End Result...1 Availability Considerations...1 Drive Reliability...2

More information

Review. Lecture 21: Reliable, High Performance Storage. Overview. Basic Disk & File System properties CSC 468 / CSC 2204 11/23/2006

Review. Lecture 21: Reliable, High Performance Storage. Overview. Basic Disk & File System properties CSC 468 / CSC 2204 11/23/2006 S 468 / S 2204 Review Lecture 2: Reliable, High Performance Storage S 469HF Fall 2006 ngela emke rown We ve looked at fault tolerance via server replication ontinue operating with up to f failures Recovery

More information

How To Make A Backup System More Efficient

How To Make A Backup System More Efficient Identifying the Hidden Risk of Data De-duplication: How the HYDRAstor Solution Proactively Solves the Problem October, 2006 Introduction Data de-duplication has recently gained significant industry attention,

More information

December 4, 2013 MATH 171 BASIC LINEAR ALGEBRA B. KITCHENS

December 4, 2013 MATH 171 BASIC LINEAR ALGEBRA B. KITCHENS December 4, 2013 MATH 171 BASIC LINEAR ALGEBRA B KITCHENS The equation 1 Lines in two-dimensional space (1) 2x y = 3 describes a line in two-dimensional space The coefficients of x and y in the equation

More information

CS 6290 I/O and Storage. Milos Prvulovic

CS 6290 I/O and Storage. Milos Prvulovic CS 6290 I/O and Storage Milos Prvulovic Storage Systems I/O performance (bandwidth, latency) Bandwidth improving, but not as fast as CPU Latency improving very slowly Consequently, by Amdahl s Law: fraction

More information

RAID for the 21st Century. A White Paper Prepared for Panasas October 2007

RAID for the 21st Century. A White Paper Prepared for Panasas October 2007 A White Paper Prepared for Panasas October 2007 Table of Contents RAID in the 21 st Century...1 RAID 5 and RAID 6...1 Penalties Associated with RAID 5 and RAID 6...1 How the Vendors Compensate...2 EMA

More information

OceanStor 9000 InfoProtector Technical White Paper. Issue 01. Date 2014-02-13 HUAWEI TECHNOLOGIES CO., LTD.

OceanStor 9000 InfoProtector Technical White Paper. Issue 01. Date 2014-02-13 HUAWEI TECHNOLOGIES CO., LTD. OceanStor 9000 InfoProtector Technical White Paper Issue 01 Date 2014-02-13 HUAWEI TECHNOLOGIES CO., LTD. Copyright Huawei Technologies Co., Ltd. 2014. All rights reserved. No part of this document may

More information

PARALLELS CLOUD STORAGE

PARALLELS CLOUD STORAGE PARALLELS CLOUD STORAGE Performance Benchmark Results 1 Table of Contents Executive Summary... Error! Bookmark not defined. Architecture Overview... 3 Key Features... 5 No Special Hardware Requirements...

More information

CS 61C: Great Ideas in Computer Architecture. Dependability: Parity, RAID, ECC

CS 61C: Great Ideas in Computer Architecture. Dependability: Parity, RAID, ECC CS 61C: Great Ideas in Computer Architecture Dependability: Parity, RAID, ECC Instructor: Justin Hsia 8/08/2013 Summer 2013 Lecture #27 1 Review of Last Lecture MapReduce Data Level Parallelism Framework

More information

OPTIMIZING VIRTUAL TAPE PERFORMANCE: IMPROVING EFFICIENCY WITH DISK STORAGE SYSTEMS

OPTIMIZING VIRTUAL TAPE PERFORMANCE: IMPROVING EFFICIENCY WITH DISK STORAGE SYSTEMS W H I T E P A P E R OPTIMIZING VIRTUAL TAPE PERFORMANCE: IMPROVING EFFICIENCY WITH DISK STORAGE SYSTEMS By: David J. Cuddihy Principal Engineer Embedded Software Group June, 2007 155 CrossPoint Parkway

More information

An Introduction to RAID 6 ULTAMUS TM RAID

An Introduction to RAID 6 ULTAMUS TM RAID An Introduction to RAID 6 ULTAMUS TM RAID The highly attractive cost per GB of SATA storage capacity is causing RAID products based on the technology to grow in popularity. SATA RAID is now being used

More information

Functional-Repair-by-Transfer Regenerating Codes

Functional-Repair-by-Transfer Regenerating Codes Functional-Repair-by-Transfer Regenerating Codes Kenneth W Shum and Yuchong Hu Abstract In a distributed storage system a data file is distributed to several storage nodes such that the original file can

More information

CSE-E5430 Scalable Cloud Computing P Lecture 5

CSE-E5430 Scalable Cloud Computing P Lecture 5 CSE-E5430 Scalable Cloud Computing P Lecture 5 Keijo Heljanko Department of Computer Science School of Science Aalto University keijo.heljanko@aalto.fi 12.10-2015 1/34 Fault Tolerance Strategies for Storage

More information

Cloud Storage. Parallels. Performance Benchmark Results. White Paper. www.parallels.com

Cloud Storage. Parallels. Performance Benchmark Results. White Paper. www.parallels.com Parallels Cloud Storage White Paper Performance Benchmark Results www.parallels.com Table of Contents Executive Summary... 3 Architecture Overview... 3 Key Features... 4 No Special Hardware Requirements...

More information

TECHNOLOGY BRIEF. Compaq RAID on a Chip Technology EXECUTIVE SUMMARY CONTENTS

TECHNOLOGY BRIEF. Compaq RAID on a Chip Technology EXECUTIVE SUMMARY CONTENTS TECHNOLOGY BRIEF August 1999 Compaq Computer Corporation Prepared by ISSD Technology Communications CONTENTS Executive Summary 1 Introduction 3 Subsystem Technology 3 Processor 3 SCSI Chip4 PCI Bridge

More information

StorTrends RAID Considerations

StorTrends RAID Considerations StorTrends RAID Considerations MAN-RAID 04/29/2011 Copyright 1985-2011 American Megatrends, Inc. All rights reserved. American Megatrends, Inc. 5555 Oakbrook Parkway, Building 200 Norcross, GA 30093 Revision

More information

A Solution to the Network Challenges of Data Recovery in Erasure-coded Distributed Storage Systems: A Study on the Facebook Warehouse Cluster

A Solution to the Network Challenges of Data Recovery in Erasure-coded Distributed Storage Systems: A Study on the Facebook Warehouse Cluster A Solution to the Network Challenges of Data Recovery in Erasure-coded Distributed Storage Systems: A Study on the Facebook Warehouse Cluster K. V. Rashmi 1, Nihar B. Shah 1, Dikang Gu 2, Hairong Kuang

More information

NAS 251 Introduction to RAID

NAS 251 Introduction to RAID NAS 251 Introduction to RAID Set up a storage volume with RAID A S U S T O R C O L L E G E COURSE OBJECTIVES Upon completion of this course you should be able to: 1. Have a basic understanding of RAID

More information

Striped Set, Advantages and Disadvantages of Using RAID

Striped Set, Advantages and Disadvantages of Using RAID Algorithms and Methods for Distributed Storage Networks 4: Volume Manager and RAID Institut für Informatik Wintersemester 2007/08 RAID Redundant Array of Independent Disks Patterson, Gibson, Katz, A Case

More information

Why disk arrays? CPUs improving faster than disks

Why disk arrays? CPUs improving faster than disks Why disk arrays? CPUs improving faster than disks - disks will increasingly be bottleneck New applications (audio/video) require big files (motivation for XFS) Disk arrays - make one logical disk out of

More information

Load Balancing in Fault Tolerant Video Server

Load Balancing in Fault Tolerant Video Server Load Balancing in Fault Tolerant Video Server # D. N. Sujatha*, Girish K*, Rashmi B*, Venugopal K. R*, L. M. Patnaik** *Department of Computer Science and Engineering University Visvesvaraya College of

More information

How To Build A Clustered Storage Area Network (Csan) From Power All Networks

How To Build A Clustered Storage Area Network (Csan) From Power All Networks Power-All Networks Clustered Storage Area Network: A scalable, fault-tolerant, high-performance storage system. Power-All Networks Ltd Abstract: Today's network-oriented computing environments require

More information

Energy Efficient MapReduce

Energy Efficient MapReduce Energy Efficient MapReduce Motivation: Energy consumption is an important aspect of datacenters efficiency, the total power consumption in the united states has doubled from 2000 to 2005, representing

More information

Disk Array Data Organizations and RAID

Disk Array Data Organizations and RAID Guest Lecture for 15-440 Disk Array Data Organizations and RAID October 2010, Greg Ganger 1 Plan for today Why have multiple disks? Storage capacity, performance capacity, reliability Load distribution

More information

Module 6. RAID and Expansion Devices

Module 6. RAID and Expansion Devices Module 6 RAID and Expansion Devices Objectives 1. PC Hardware A.1.5 Compare and contrast RAID types B.1.8 Compare expansion devices 2 RAID 3 RAID 1. Redundant Array of Independent (or Inexpensive) Disks

More information

The idea behind RAID is to have a number of disks co-operate in such a way that it looks like one big disk.

The idea behind RAID is to have a number of disks co-operate in such a way that it looks like one big disk. People often ask: Should I RAID my disks? The question is simple, unfortunately the answer is not. So here is a guide to help you decide when a RAID array is advantageous and how to go about it. This guide

More information

RAID0.5: Active Data Replication for Low Cost Disk Array Data Protection

RAID0.5: Active Data Replication for Low Cost Disk Array Data Protection RAID0.5: Active Data Replication for Low Cost Disk Array Data Protection John A. Chandy Department of Electrical and Computer Engineering University of Connecticut Storrs, CT 06269-2157 john.chandy@uconn.edu

More information

Don t Let RAID Raid the Lifetime of Your SSD Array

Don t Let RAID Raid the Lifetime of Your SSD Array Don t Let RAID Raid the Lifetime of Your SSD Array Sangwhan Moon Texas A&M University A. L. Narasimha Reddy Texas A&M University Abstract Parity protection at system level is typically employed to compose

More information

BrightStor ARCserve Backup for Windows

BrightStor ARCserve Backup for Windows BrightStor ARCserve Backup for Windows Tape RAID Option Guide r11.5 D01183-1E This documentation and related computer software program (hereinafter referred to as the "Documentation") is for the end user's

More information

Nutanix Tech Note. Failure Analysis. 2013 All Rights Reserved, Nutanix Corporation

Nutanix Tech Note. Failure Analysis. 2013 All Rights Reserved, Nutanix Corporation Nutanix Tech Note Failure Analysis A Failure Analysis of Storage System Architectures Nutanix Scale-out v. Legacy Designs Types of data to be protected Any examination of storage system failure scenarios

More information

CURTAIL THE EXPENDITURE OF BIG DATA PROCESSING USING MIXED INTEGER NON-LINEAR PROGRAMMING

CURTAIL THE EXPENDITURE OF BIG DATA PROCESSING USING MIXED INTEGER NON-LINEAR PROGRAMMING Journal homepage: http://www.journalijar.com INTERNATIONAL JOURNAL OF ADVANCED RESEARCH RESEARCH ARTICLE CURTAIL THE EXPENDITURE OF BIG DATA PROCESSING USING MIXED INTEGER NON-LINEAR PROGRAMMING R.Kohila

More information

Availability and Disaster Recovery: Basic Principles

Availability and Disaster Recovery: Basic Principles Availability and Disaster Recovery: Basic Principles by Chuck Petch, WVS Senior Technical Writer At first glance availability and recovery may seem like opposites. Availability involves designing computer

More information

Chapter 13. Chapter Outline. Disk Storage, Basic File Structures, and Hashing

Chapter 13. Chapter Outline. Disk Storage, Basic File Structures, and Hashing Chapter 13 Disk Storage, Basic File Structures, and Hashing Copyright 2007 Ramez Elmasri and Shamkant B. Navathe Chapter Outline Disk Storage Devices Files of Records Operations on Files Unordered Files

More information

Revolutionary Methods to Handle Data Durability Challenges for Big Data

Revolutionary Methods to Handle Data Durability Challenges for Big Data WHITE PAPER Data Center Scalability/ Cloud Storage Big Data Revolutionary Methods to Handle Data Durability Challenges for Big Data Intel and Amplidata address the storage challenges presented by the coming

More information