RAID Levels and Components Explained

What's RAID?

The purpose of this document is to explain the many forms of RAID systems, why they are useful, and what their disadvantages are.

RAID (Redundant Array of Inexpensive Disks) is a method of combining several hard drives into one logical unit. It can offer fault tolerance and higher throughput than a single hard drive or a group of independent hard drives. RAID is a mature technology that speeds up data access while at the same time protecting your data from hard disk failure.

RAID is quickly becoming a necessary component in every network, since data loss and downtime can prove both fatal and financially destructive. Most networks are designed to provide instant access to massive amounts of data. More and more employees have to access customer and other databases. Intranets and corporate Web sites provide access to huge databases online.

RAID Components and Concepts

First, let us define Logical Arrays as a split or combination of Physical Arrays, which in turn are made of one or more Physical Drives, the individual hard disks that comprise these arrays. Logical Drives are then made of one or more Logical Arrays.

Mirroring refers to complete redundancy of data on identical disks. The data written to one Logical Array is completely duplicated on a similar array, thereby providing 100% data redundancy. The cost of mirroring is that the amount of available storage is reduced by 50%; writes are slightly slower, although reads are faster in some situations.

Striping refers to a technique that allows the Physical Drives in a Logical Array to be used in parallel in order to gain performance. In this technique, data is broken down into Byte-level or Block-level stripes, where every Byte or Block is written to a separate disk in the array. Byte level can at times be a 512-byte sector, while Block size can be selected from a variety of choices.
The gain in performance is similar between Reads and Writes.

In some RAID levels, striping is combined with a technique called Parity to enhance fault tolerance. Parity, similar to parity in memory, simply adds a Block (Byte) of calculated parity data to several Blocks (Bytes) in such a way that any one of the Blocks (Bytes) can be reconstructed, in case of loss, from the remaining Blocks (Bytes) and the parity Block (Byte). While Parity gains from the performance of striping, its disadvantages are more complexity and the loss of some disk space, which is taken up by parity information.

There are many ways to combine RAID techniques. Some standardized combinations are referred to as RAID Levels, even though Level in this context does not denote any hierarchy or advantage. Levels are independent and different. Some RAID levels combine multiple other levels to achieve certain aims. The RAID Advisory Board (RAB) has been active since 1992 in education and standardization of RAID technology.

The techniques discussed above are used in different levels. Mirroring is used in levels 1, 0+1, and 10 (1+0). Striping without parity is used in levels 0, 0+1, and 10. Striping with Block-level parity is used in levels 5 and 6. While the minimum number of drives required at each level is noted, there is no inherent maximum to the number of drives in an array other than the one imposed by the controller.

RAID-0 -- Striping

Simple striping is used in this level to gain performance. This level does not offer any redundancy. Data is broken into stripes of user-defined size, and each stripe is written to a different drive in the array. A minimum of two disks is required. It uses 100% of the storage capacity since no redundant information is written. This level is recommended when your data changes infrequently, is backed up regularly, and you require high-speed access. Web servers, graphics design, audio and video editing, and online gaming are some example applications that might benefit from this level.

RAID-1 -- Mirroring

This level uses mirroring, and data is duplicated on two drives. If either fails, the other continues to function until the failed drive is replaced. At the cost of 50% of available capacity, this level provides very high availability.
Rebuild of failed drives is relatively fast. Read performance is good and write performance is fair compared to single-drive read and write. A minimum of 2 drives is required. Whenever high availability is needed and vital data is involved, this level is a good candidate for use.
RAID-2, RAID-3, and RAID-4

RAID-4 interleaves stripes like RAID-0, but it requires an additional drive just to store the parity, which is used to provide redundancy. In a RAID-4 system, if any one of the disks fails, the data on the remaining disks can be used to reconstruct the data that was on the failed disk. Even if the parity disk fails, the other disks are still intact. Thus RAID-4 can survive the failure of any one of its disks.

RAID-2 and RAID-3 are seldom used anymore, and have mostly been made obsolete by modern disk technology. RAID-2 stores ECC information instead of parity, but since all modern disk drives incorporate ECC, RAID-2 offers little additional protection. RAID-3 is similar to RAID-4, except that it uses the smallest possible stripe size. As a result, any given read will involve all disks, making overlapping I/O requests difficult or impossible. In order to avoid delay due to rotational latency, RAID-3 requires that all disk drive spindles be synchronized. Most modern disk drives lack spindle-synchronization ability, so RAID-3 is no longer used.

RAID-5 -- Striping with Parity

One of the most popular RAID techniques, RAID-5 uses Block Striping of data along with parity and writes both to all drives. In contrast to the RAID levels that write the parity information to a single drive and use the rest of the drives for data blocks, RAID-5 distributes the parity blocks amongst all drives, keeping each parity block on a different drive from the data blocks that generated it. RAID-5 systems require a minimum of 3 disks. The impact on capacity is equivalent to removing one drive from the array. If any one drive fails, the array is said to be degraded, and the data blocks residing on that drive can be derived from the parity and data on the remainder of the drives.
RAID controllers usually allow a hot spare drive to be configured. The spare is brought in when the array becomes degraded, and the array can then be rebuilt in the background while normal operation continues. RAID-5 combines good performance and good fault tolerance with high efficiency. It is best suited for transaction processing and is often used for general purpose service, as well as for relational database applications, enterprise resource planning, and other business systems.
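The reconstruction of a lost block from parity can be illustrated with a small sketch. It assumes the parity block is the byte-wise XOR of the data blocks in a stripe, which is the usual single-parity scheme; the block contents and the helper name `xor_blocks` are made up for illustration and do not come from any particular controller.

```python
from functools import reduce

def xor_blocks(blocks):
    """XOR equal-length byte blocks together, byte by byte."""
    return bytes(reduce(lambda a, b: a ^ b, column) for column in zip(*blocks))

# Three data blocks of one stripe (hypothetical contents).
data = [b"AAAA", b"BBBB", b"CCCC"]
parity = xor_blocks(data)

# Simulate losing the drive that held data[1], then rebuild its block
# from the surviving data blocks plus the parity block.
survivors = [data[0], data[2], parity]
rebuilt = xor_blocks(survivors)

print(rebuilt == data[1])
```

Because XOR is its own inverse, the same routine serves both to compute parity and to rebuild a missing block, which is why a degraded array can keep serving reads.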
RAID-6 -- Striping with Dual Parity

This level is identical to level 5 except that it calculates an extra parity block and writes it across all drives. While this reduces the usable capacity by one more drive, it considerably reduces the window of vulnerability that exists during RAID-5 rebuilds, because the array can withstand the failure of a second drive during a rebuild. The advantages of RAID-6 become even more pronounced as the capacity of SATA drives goes up and rebuilds take longer to finish. While calculating a second parity has a negative impact on performance in software-based RAID systems, the effect is minimal when hardware RAID engines with built-in circuitry for the parity calculations are used. RAID-6 requires a minimum of four drives, and the usable capacity is always two drives less than the number of disk drives in the RAID set. Applications suited for this level are the same as those of level 5.

RAID-10 -- A Stripe of Mirrors

RAID-10 is an example of combining two RAID levels to achieve more targeted results. It is often confused with its sibling, level 0+1, which is referred to as Mirrored Stripes. In each case drives are mirrored and blocks are striped to these drives. In RAID-10, however, blocks are striped to N/2 sets of mirrored drives (N being the number of drives in the array), while in level 0+1, blocks are striped to 2 mirrored sets each containing N/2 drives. Because of RAID-10's mirroring, the storage efficiency is 50%. This level offers excellent fault tolerance and availability. It is recommended for applications requiring high performance and high reliability whose owners are willing to sacrifice efficiency (twice the number of drives is needed to achieve the capacity). These include enterprise servers and moderate-size database systems.

RAID-50 -- Striping across multiple RAID-5s

Also referred to as level 5+0.
It combines Block Striping with distributed parity with the straight Block Striping of level 0. In other words, it uses a Block Stripe of level 0 on Level 5 elements. The minimum number of drives is 6, and the capacity can be derived by subtracting one drive for each set of Level 5 elements. As an example, a 6-drive array built from two 3-drive Level 5 sets would have the capacity of four drives. Level 50 is recommended when high fault tolerance, large capacity, and random read/writes are required. It is sometimes used for large databases.

RAID-60 -- Striping across multiple RAID-6s

Also referred to as level 6+0, this level combines multiple RAID-6 sets with RAID-0 (striping). Dual parity allows the failure of two disks in each RAID-6 array. Striping helps to increase capacity and performance without adding disks to each RAID-6 array (which would decrease data availability and could impact performance in degraded mode).

Benefits of RAID

RAID provides increased storage capacities, and protects your important data from hard drive failure. There are multiple benefits of using RAID:

- Reliability and scalability
- Real-time data recovery with uninterrupted access when a hard drive fails
- System uptime and network availability and protection from loss
- Protection against data loss
- Multiple drives working together increase system performance

A disk system with RAID capability can protect its data and provide on-line, immediate access to its data, despite a single disk failure (some RAID storage systems can withstand two concurrent disk failures). RAID capability also provides for the on-line reconstruction of the contents of a failed disk to a replacement disk.

RAID offers faster hard drive performance and nearly complete data safety. Storage requirements are expanding as file sizes get bigger and rendering needs get more complex. If you handle very large images or work on audio and video files, faster data throughput means enhanced productivity. RAID can be backed up to tape while the system is in use.

RAID Levels

The most common RAID levels are shown below in a tabular format, complete with pros and cons and uses for the given type of RAID.
RAID-0 (STRIPING)

RAID-0 stripes data across multiple disks without any redundant information. Data being written to the array is broken down into blocks or stripes and is distributed sequentially across the member disks of the array. This type of array provides high I/O performance at low inherent cost but provides no redundancy or fault tolerance. The data is not stored contiguously on a single drive, and can be accessed in parallel; that is to say, the pieces of data are read back from multiple sources nearly simultaneously. Unfortunately, striping reduces the level of data availability, since a disk failure will cause the entire array to be inaccessible. RAID-0 was not defined originally but has become a commonly used term.

Minimum number of drives required: 2

Recommended Applications
- Video production and editing
- Image editing
- Pre-press applications
- Any application requiring high bandwidth

Advantages of RAID-0
- High performance
- Very simple design, easy to implement
- No parity overhead
- No capacity loss: all storage is usable

Disadvantages of RAID-0
- Lack of fault tolerance
- Failure of a single drive will result in loss of all data on the array
- Should never be used in mission critical environments
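The sequential distribution of blocks across member disks described for RAID-0 can be sketched as a simple mapping. This is a minimal illustration that assumes plain round-robin placement with one block per stripe unit; the function name `locate_block` is hypothetical, and real controllers add details such as configurable stripe sizes.

```python
def locate_block(logical_block, num_disks):
    """Map a logical block number to (disk index, block offset on that disk)."""
    return logical_block % num_disks, logical_block // num_disks

# Where the first six logical blocks land on a three-disk RAID-0 array.
layout = [locate_block(n, 3) for n in range(6)]
for block, (disk, offset) in enumerate(layout):
    print(f"logical block {block} -> disk {disk}, offset {offset}")
```

Consecutive logical blocks fall on different disks, which is what lets a large transfer be serviced by all members in parallel, and also why the loss of any one disk leaves holes throughout the data.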
RAID-1 (MIRRORING / DUPLEXING)

RAID-1 provides data redundancy. Data written to one disk drive is simultaneously written to another disk drive, called the mirror. If one disk fails, the other disk can be used to run the system and reconstruct the failed disk. Since the disk is mirrored, it does not matter if one of them fails, because both disks contain the same data at all times. RAID-1 provides high data availability since two complete copies of all information are maintained. In addition, read performance may be enhanced if the array controller allows simultaneous reads from both members of a mirrored pair. Higher availability will be achieved if both disks in a mirror pair are on separate I/O busses, known as duplexing.

Minimum number of drives required: 2

Recommended Applications
- Accounting, payroll, and financial
- Any application requiring very high availability

Advantages of RAID-1
- One Write or two Reads possible per mirrored pair
- Twice the Read transaction rate of single disks, same Write transaction rate as single disks
- Fault tolerant
- Transfer rate per block is equal to that of a single disk
- Easy to recover data in case of drive failure: no rebuild is necessary, just a copy to the replacement disk
- Easy to implement
- Simplest RAID storage subsystem design
Disadvantages of RAID-1
- Inefficient: the 100% redundancy overhead is the highest of all RAID types
- Becomes very costly as the number of disks increases, since it requires twice the desired disk space
- When the RAID function is done by system software, it loads the CPU/server and degrades throughput at high activity levels; hardware RAID is recommended
- May not support hot swap of a failed disk when implemented in "software"

RAID-5 (STRIPING AND PARITY)

RAID-5 stripes data and parity to generate redundancy. However, instead of requiring an entirely separate disk for parity storage, the parity is distributed through the stripes of the disk array. In RAID-5 both parity and data are striped across a set of separate disks. On a write, the old data and old parity are typically read first. Next, the new parity is calculated. Finally, the new data and parity are written to separate disks. Data chunks are much larger than the average I/O size, but are still resizable. Disks are able to satisfy requests independently, which provides high read performance in a request-rate-intensive environment. Since parity information is used, a RAID-5 stripe can withstand a single disk failure without losing data or access to data.

Minimum number of drives required: 3

Recommended Applications
- File and application servers
- Database servers
- WWW and News servers
- Intranet servers
- Most versatile RAID level
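The RAID-5 write sequence described above (calculate the new parity, then write the new data and parity) can be sketched for a single-block update. The sketch assumes XOR parity and the common read-modify-write shortcut, in which the new parity is derived from the old parity, the old data, and the new data without touching the other data disks. The helper names and block contents are hypothetical.

```python
def xor_bytes(a, b):
    """Byte-wise XOR of two equal-length blocks."""
    return bytes(x ^ y for x, y in zip(a, b))

def updated_parity(old_parity, old_data, new_data):
    """New parity after one data block in the stripe is overwritten."""
    return xor_bytes(xor_bytes(old_parity, old_data), new_data)

# A stripe with three data blocks and its parity (made-up contents).
d0, d1, d2 = b"\x0f\x0f", b"\xf0\x01", b"\x33\x44"
parity = xor_bytes(xor_bytes(d0, d1), d2)

# Overwrite d1 and update parity without reading d0 or d2.
new_d1 = b"\xaa\xbb"
parity_after = updated_parity(parity, d1, new_d1)

# The shortcut agrees with recomputing parity from scratch.
print(parity_after == xor_bytes(xor_bytes(d0, new_d1), d2))
```

The shortcut costs two reads and two writes per small update, which is one way to see why write rates for this level are rated lower than read rates.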
Advantages of RAID-5
- High efficiency: highest Read data transaction rate, medium Write data transaction rate
- Good aggregate transfer rate
- Cost effective: only 1 extra disk is required
- Fault tolerant
- Low ratio of ECC (parity) disks to data disks means high efficiency
- The best choice in multi-user environments which are not write performance sensitive

Disadvantages of RAID-5
- Disk failure has a medium impact on throughput
- Most complex controller design
- Difficult to rebuild in the event of a disk failure (as compared to RAID-1)
- Individual block data transfer rate same as single disk

RAID 0+1

RAID-01 is technically a combination of RAID-1 and RAID-0; it includes both mirroring and striping, but no parity. RAID-10 is a stripe across a number of mirrored drives, and is implemented as a striped array whose segments are RAID-1 arrays. RAID-10 has the same fault tolerance as RAID-1, as well as the same overhead for fault tolerance as mirroring alone.

Advantages:
- Very high I/O rates are achieved by striping RAID-1 segments
- Excellent solution for sites that would normally use RAID-1
- Great for Oracle and other databases which need high performance and fault tolerance

Minimum number of drives required: 4
Advantages of RAID 0+1
- Fault tolerant
- Very high I/O rates

Disadvantages of RAID 0+1
- Very expensive, and expensive to maintain
- As with RAID-1, total capacity is equal to half of the total capacity of all disks in the array
- High overhead
- Very limited scalability

RAID-10 -- A STRIPE OF MIRRORS

RAID-10 is not RAID 0+1. RAID-10 uses RAID-1 mirroring and RAID-0 striping, and has both security and sequential performance. RAID-10 is a striped RAID-0 array whose segments are mirrored RAID-1 arrays. It is similar in performance to RAID 0+1, but with better fault tolerance and rebuild performance. It has the same fault tolerance as RAID-1, with the same overhead for fault tolerance as mirroring alone. Typically four or more hard drives are used, because RAID-10 creates two pairs of mirrored arrays and combines these arrays to form one RAID-0 array. RAID-10 is appropriate for redundant storage of large files, and because parity is not calculated, write operations are very fast.

Minimum number of drives required: 4

Recommended Applications
- Database servers requiring high performance and fault tolerance

Advantages of RAID-10
- High fault tolerance
- High I/O rates achieved by striping RAID-1 segments
- Faster rebuild performance than RAID 0+1
- Under certain circumstances, a RAID-10 array can sustain multiple simultaneous drive failures
- Excellent solution for sites that would have otherwise gone with RAID-1 but need some additional performance boost

Disadvantages of RAID-10
- Very expensive
- High overhead
- All drives must move in parallel to the proper track, lowering sustained performance
- Very limited scalability at a very high inherent cost

RAID-50 -- A STRIPE ACROSS A RAID-5 ARRAY

RAID-50 is a striped RAID-0 array which is striped across RAID-5 arrays. Performance is improved compared to RAID-5 because of the addition of the striped array. Fault tolerance is also improved.

Minimum number of drives required: 6

Advantages of RAID-50
- Higher fault tolerance than RAID-5
- Higher efficiency than RAID-10
- Higher I/O rates

Disadvantages of RAID-50
- Very complex and expensive to implement
More on RAID-5

Each entire data block is written on a data disk; parity for blocks in the same rank is generated on Writes, recorded in a distributed location, and checked on Reads. The advantages and disadvantages of RAID-5 are listed below.

RAID-5 Advantages
- Highest Read data transaction rate
- Medium Write data transaction rate
- Low ratio of ECC (parity) disks to data disks means high efficiency
- Good aggregate transfer rate

RAID-5 Disadvantages
- Disk failure has a medium impact on throughput
- Most complex controller design
- Difficult to rebuild in the event of a disk failure (as compared to RAID-1)
- Individual block data transfer rate same as single disk
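The idea of recording parity "in a distributed location" can be pictured with a small layout sketch. It assumes one simple rotation rule, in which the parity block moves back by one disk on each successive stripe; actual controllers and software RAID implementations use various rotation orders, so the rule and the function names here are illustrative assumptions only.

```python
def parity_disk(stripe, num_disks):
    """Disk index holding parity for a stripe, rotating backwards from the last disk."""
    return (num_disks - 1 - stripe) % num_disks

def stripe_layout(stripe, num_disks):
    """Mark each disk in a stripe as holding data ('D') or parity ('P')."""
    p = parity_disk(stripe, num_disks)
    return ["P" if disk == p else "D" for disk in range(num_disks)]

# Print the first four stripes of a four-disk array, one row per stripe.
for stripe in range(4):
    print(stripe, " ".join(stripe_layout(stripe, 4)))
```

Every disk ends up holding parity for one stripe in four, so no single drive becomes the parity bottleneck that a dedicated parity disk would be.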
RAID-6: Dual Parity Stripes

Two independent parity computations must be used in order to provide protection against double disk failure. Two different algorithms are employed to achieve this purpose. RAID-6 requires a minimum of 4 drives to implement.

RAID-6 Characteristics and Advantages
- RAID-6 is essentially an extension of RAID-5 which allows for additional fault tolerance by using a second independent distributed parity scheme (dual parity).
- Data is striped on a block level across a set of drives, as in RAID-5. A second set of parity is calculated and written across all the drives.
- RAID-6 provides extremely high data fault tolerance and can sustain multiple simultaneous drive failures.
- RAID-6 protects against multiple bad block failures while non-degraded.
- RAID-6 protects against a single bad block failure while operating in a degraded mode.
RAID-6 Disadvantages
- More complex controller design.
- Controller overhead to compute parity addresses is extremely high.
- Write performance can be brought on par with RAID-5 by using a custom ASIC for computing Reed-Solomon parity.
- Requires N+2 drives to implement because of the dual parity scheme.

Is RAID-5 Going Away and Being Replaced by RAID-6?

Has RAID-5's time finally come? Is it dead, and if so, why? When you consider RAID-5 against RAID-10, you might wonder why RAID-5 ever won out. Both protect against drive failure and data loss, but RAID-10 is much more straightforward to implement and has much higher performance. RAID-5, however, requires less drive overhead for basically the same level of data protection, as each RAID-5 requires just one drive's worth of storage to protect all the other drives in the array. Thus, if you had a five-drive RAID-5, it would be 80% efficient, whereas an 8-drive RAID-10 is only 50% efficient.

As drives get bigger and cheaper, one would think that companies could afford to just throw away half their storage for the benefits of RAID-10. This might be true for PATA and SATA directly attached drives. But in an enterprise, do we ever have too much network storage in the office? In fact, the IT crew is always trying to get more budget for more drives, switch ports, etc. But more components always means more hardware that will break and a bigger facility just to hold all our storage stuff. Have you ever been to a computer center that was not crowded, with things stashed in every available place? So dropping from RAID-5's 80-90% efficiency to RAID-10's 50% just won't fly. Bigger drives just mean that we will find more ways to fill them. So it looks like the requirement for RAID-5 efficiency is here to stay.

And along comes RAID-6

RAID-6 is like RAID-5, but it uses two different types of parity stripes to support two concurrent drive failures rather than just one.
Would you ever have two drives failing at once? Mean Time Between Failures on currently shipping drives is over a million hours; that's 114 years! What's the chance of two drives failing at once? Probably slim, but Murphy's Law and its insidious Corollary still apply. Murphy's corollary assures that not only will a drive break, but it will break at the worst possible time. For RAID arrays, if a non-failed drive breaks during a rebuild, your storage is truly vulnerable. Again, the chance of this happening is slim, but today's drives are pretty large, and during the time it takes to rebuild the array you're vulnerable to data loss.

Why did that drive fail to begin with? Maybe it wasn't just a random drive failure. Maybe it was system related, such as a fan failing and temperatures rising, or noise on the power cables, or flaky cables. When taking environmental failures into account, it's common to reduce the second drive's MTBF to 1/10th the value of the first drive. Now take into account all the systems you've installed or shipped. What's the chance of just one of those systems experiencing a two-drive failure? The chance of failure for each individual installation is still relatively low, but the chance that at least one of those installations will lose data can be pretty high.

A second way to get a two-drive failure is purely human error. When a drive in a RAID-5 fails, a well-designed system will light a fault LED next to the failed drive. Assuming that the system is in use 24/7, the administrator will remove that failed drive from the live system in order to replace it with a new drive. Hopefully he or she is able to do that without (a) removing the wrong drive, or (b) yanking hard enough on the drive carrier to dislodge an adjacent drive. Of course neither should ever happen, but accidents do happen.

The single biggest reason for using RAID-6 is based on the chance of drive errors during an array rebuild after just a single drive failure. Rebuilding the data on a failed drive requires that all the other data on the other drives be pristine and error free.
If there is a single error in a single sector, then the data for the corresponding sector on the replacement drive cannot be reconstructed. Data is lost. In the drive industry, the measurement of how often this occurs is called the Bit Error Rate (BER). Simple calculations will show that the chance of data loss due to BER is much greater than all the other reasons combined. PATA and SATA drives have historically had much greater bit errors per drive than SCSI and SAS drives, causing some vendors to recommend RAID-6 for SATA drives if they're used for mission critical data.
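The "simple calculations" mentioned above can be sketched as follows. The text gives no BER figure, so the rate of one unrecoverable error per 10^14 bits read is an assumed, illustrative value (a drive's datasheet would give the real one), as are the drive size and array width. The sketch also assumes bit errors are independent, and the function names are hypothetical.

```python
import math

HOURS_PER_YEAR = 24 * 365  # 8,760 hours; leap days ignored

def mtbf_years(mtbf_hours):
    """Convert an MTBF figure in hours to years."""
    return mtbf_hours / HOURS_PER_YEAR

def read_error_probability(bytes_read, bit_error_rate):
    """Chance of at least one unrecoverable bit error while reading bytes_read bytes.

    Assumes independent errors: P = 1 - (1 - rate) ** bits.
    """
    bits = bytes_read * 8
    return -math.expm1(bits * math.log1p(-bit_error_rate))

# Hypothetical rebuild: a five-drive RAID-5 of 1 TB drives must read
# the four surviving drives in full.
ASSUMED_BER = 1e-14            # illustrative value, not taken from the text
surviving_bytes = 4 * 10**12

print(round(mtbf_years(1_000_000)))
print(read_error_probability(surviving_bytes, ASSUMED_BER))
```

Under these assumptions the rebuild has roughly a one-in-four chance of hitting an unreadable sector, far higher than the chance of a second whole-drive failure during the same window, which is the argument the text makes for a second parity.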
A wise man once said, "If it sounds too good to be true, it is too good to be true!" What's RAID-6's downside? In read operations the performance is basically identical to RAID-5, since there is no need to read or manipulate the parity data, assuming that the array contains no failed drives. And on long sequential write operations the overhead of calculating the additional parity is not significant compared to all the other data that is being written. A well designed RAID-6 controller should give 90% of the performance of a similar RAID-5 controller. Significant degradation may occur on short random writes, which are typical in transaction database updates. Most database administrators needing excellent performance choose to run their databases on RAID-10 arrays. The bottom line is that in all the access patterns that matter, RAID-6 performance is close enough to RAID-5 performance to make the issue moot.

Most major RAID vendors, including EMC, have started shipping products incorporating RAID-6. Although different vendors use different algorithms, the results are the same: they can stay up, even with two drive failures. Eventually, all the major vendors will support hardware RAID-6. Once everyone supports RAID-6, there really is no need for RAID-5. The moral of the story? Make sure you use RAID-6.

RAID 0+1 -- A Mirror of Stripe Sets
RAID 0+1 Characteristics and Advantages
- RAID 0+1 is implemented as a mirrored array whose segments are RAID-0 arrays.
- RAID 0+1 has the same fault tolerance as RAID-5, and has the same overhead for fault tolerance as RAID-1, mirroring alone.
- High I/O rates are achieved thanks to multiple stripe segments.
- RAID 0+1 is an excellent solution for sites that need high performance but are not concerned with achieving maximum reliability.

RAID 0+1 Disadvantages
- RAID 0+1 is NOT to be confused with RAID-10. A single drive failure will cause the whole array to become, in essence, a RAID-0 array.
- RAID 0+1 is very expensive, with high overhead.
- All the RAID 0+1 drives must move in parallel to the proper track, which can lower sustained performance.
- RAID 0+1 has very limited scalability at a very high inherent cost.

RAID-10 -- Stripe Sets of Mirrored Drives

RAID-10 Characteristics and Advantages
- RAID-10 is implemented as a striped array whose segments are RAID-1 arrays, and RAID-10 has the same fault tolerance as RAID-1.
- RAID-10 has the same overhead for fault tolerance as mirroring alone.
- High I/O rates are achieved by striping RAID-1 segments.
- Under certain circumstances, a RAID-10 array can sustain multiple simultaneous drive failures.
- RAID-10 provides an excellent solution for sites that would have otherwise gone with RAID-1 but need some additional performance boost.

RAID-10 Disadvantages
- Very expensive, with high overhead.
- All drives must move in parallel to the proper track, lowering sustained performance.
- Very limited scalability at a very high inherent cost.

RAID-50 (5+0) -- Block Striping with Distributed Parity

Simply stated, a RAID-50 array is a RAID-0 array on top of RAID-5 arrays. Thus, RAID-50 forms large arrays by combining the block striping and parity of RAID-5 with the straight block striping of RAID-0. RAID-50 improves upon the performance of RAID-5 through the addition of RAID-0, particularly during writes. RAID-50 also provides better fault tolerance than a single RAID-5 array does. Because of the improved speed and fault tolerance, RAID-50 is excellent for transactional environments. RAID-50 systems require a high-end hardware controller.

RAID-50 Axles

When you create a RAID-50, you must specify the number of axles. An axle refers to a single RAID-5 array that is striped with other RAID-5 arrays to make the RAID-50. The number of drives in the RAID-50 array must be factorable into two integers, one of which (the number of axles) must be 2 or higher and the other (the number of drives per axle) 3 or higher. We can easily deduce the minimum number of drives in a RAID-50 array to be 2x3, or 6. With 12 drives, you could have either a 2x6 or a 3x4 array. With 16 drives, you could have either a 2x8 or a 4x4 array. There are limitations on the number of axles and drives that controllers can support. Some drive configurations and enclosures might yield an unbalanced RAID-50.
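The factoring rule for axles can be sketched as a short enumeration. The function below is a hypothetical helper that applies only the rule as stated (at least 2 axles, at least 3 drives per axle, equal-sized axles) and ignores any controller or enclosure limits; pairs are written as (axles, drives per axle), following the text's "2x6" notation.

```python
def raid50_layouts(total_drives):
    """Return (axles, drives_per_axle) pairs with axles >= 2 and drives_per_axle >= 3."""
    return [
        (axles, total_drives // axles)
        for axles in range(2, total_drives // 3 + 1)
        if total_drives % axles == 0
    ]

for drives in (6, 12, 16):
    print(drives, raid50_layouts(drives))
```

Note that for 12 drives the stated rule also admits a 4x3 arrangement, which the text does not list; whether such a layout is offered depends on the controller.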
In a RAID-5 array, you can keep going with one failed drive. In a RAID-50 array, you can keep going with one failed drive per axle! The diagram below shows a RAID-50 2x4 array with two 4-drive axles.

Our diagram will help us understand how a RAID-50 system would store files in its array. There are four files to be stored in our RAID-50 array, each of varying color and size. First off, our array will use a grey 16 KB stripe, so files will be stored in 16 KB chunks. The tiny red file is 4 KB in size; the blue file is 20 KB; the green file is 100 KB; and the magenta file is 500 KB. The data will be evenly striped between these two RAID-5 arrays using RAID-0. Then, within each RAID-5 array, the data is stored using striping with 16 KB parity blocks.

[Diagram: RAID-50 2x4 array, Axle 1 and Axle 2]

How we store the four files: We assume a top/bottom expansion, and that the 4 KB red file, the 20 KB blue file, the 100 KB green file, and the 500 KB magenta file are to be stored on the array, in order, with the small red file first. Since we are using a 16 KB stripe, all of the 4 KB red file and 12 KB of the blue file were sent to Axle 1; the remaining 8 KB of the blue file and the first 8 KB of the 100 KB green file went to Axle 2. Then 16 KB of the green file went to Axle 1, the next 16 KB went to Axle 2, and so forth, until all of the magenta file is stored.

RAID-50 Array Capacity and Storage Efficiency

To discuss capacity and efficiency, we will use the abbreviation NDA for the number of drives in a RAID-50 axle. The capacity formula is

(Smallest Drive Size) * (NDA - 1) * (# of Axles).

To illustrate, suppose we purchase a 12-bay SAS enclosure for holding 15K RPM, 300 GB SAS drives. There are two possible RAID-50 configurations with 12 drives, the 2x6 and the 3x4 arrays. For the 2x6 array, the capacity is 300 GB * (6 - 1) * 2, or 300 GB * 10, or 3.0 TB. For the 3x4 array, the capacity is 300 GB * (4 - 1) * 3, or 2.7 TB.

The RAID-50 efficiency formula is (NDA - 1) / NDA. The efficiencies for our two 12-drive array configurations are (6 - 1) / 6, or about 83%, for the 2x6 array and (4 - 1) / 4, or 75%, for the 3x4 array. Since both the capacity and efficiency are better for the 2x6 array, why would we choose the 3x4 array? The only answer is that the more drives there are on an axle, the longer it takes to compute the required parities.

If we purchased a 16-bay SAS enclosure to hold the same 300 GB, 15K SAS drives, we would have two choices, a 2x8 array or a 4x4 array. For the 2x8 array, our capacity would be 300 GB * 7 * 2, or 4,200 GB or 4.2 TB, and the efficiency would be (8 - 1) / 8 = 7/8, or 87.5%. The 4x4 array would have a capacity of 300 GB * 3 * 4 = 3,600 GB or 3.6 TB, and an efficiency of (4 - 1) / 4, or 75%.

As a final RAID-50 example, suppose we purchased a 24-bay enclosure to hold 24 1 TB, 7,200 RPM SATA drives. Of course, these spin much slower than the SAS drives, but are much, much less expensive. In November 2007, a median Web price was $800 for the 300 GB, 15K SAS drive and $400 for the 1 TB, 7,200 RPM SATA drive. For a 24-drive array, we could have three configurations: a 2x12, a 3x8, or a 4x6 array. For the 2x12 array, the capacity is 1 TB * (12 - 1) * 2, or 22 TB, and the efficiency is (12 - 1) / 12, or about 91.7%. For the 3x8 array, the capacity is 1 TB * (8 - 1) * 3 = 21 TB and the efficiency is (8 - 1) / 8, or 87.5%. For the 4x6 array, the capacity is 1 TB * 5 * 4, or 20 TB, and the efficiency is (6 - 1) / 6, or about 83%.
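The capacity and efficiency formulas above translate directly into code. The sketch below uses hypothetical function names and reproduces the 300 GB SAS examples from the 12-bay and 16-bay enclosures; it assumes all axles have the same number of drives.

```python
def raid50_capacity_gb(smallest_drive_gb, drives_per_axle, axles):
    """Usable capacity: (smallest drive size) * (NDA - 1) * (number of axles)."""
    return smallest_drive_gb * (drives_per_axle - 1) * axles

def raid50_efficiency(drives_per_axle):
    """Fraction of raw capacity that is usable: (NDA - 1) / NDA."""
    return (drives_per_axle - 1) / drives_per_axle

# (axles, drives per axle) for the 300 GB drive examples.
for axles, nda in [(2, 6), (3, 4), (2, 8), (4, 4)]:
    capacity = raid50_capacity_gb(300, nda, axles)
    print(f"{axles}x{nda}: {capacity} GB usable, {raid50_efficiency(nda):.1%} efficient")
```

Efficiency depends only on the number of drives per axle, so for a fixed drive count, fewer and wider axles always yield more usable space, at the cost of wider parity computations and longer rebuilds.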
Curiously, the drive, enclosure, and controller costs would be about the same for either the 12 SAS drives or the 24 SATA drives. With 15K RPM SAS drives, the maximum storage we get is 3.0 TB, but with the 7.2K RPM SATA drives we get 20 TB or more. Looking at these numbers, you might ask, "Why go with SAS drives?" SAS drives sell because of their extra speed, their inherent drive reliability, and their dual-ported connectors, which give extra redundancy. It is well known that hardware costs are only a small portion of the storage dollar.
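To make the price comparison concrete, here is a rough sketch of drive cost per usable terabyte, using the November 2007 prices and the 3.0 TB and 20 TB capacities quoted above. The helper name is hypothetical, and the figures cover drives only; enclosure and controller costs are left out because the text gives no numbers for them.

```python
def cost_per_usable_tb(drive_count, price_per_drive_usd, usable_tb):
    """Drive cost only (no enclosure or controller) per usable TB."""
    return drive_count * price_per_drive_usd / usable_tb


if __name__ == "__main__":
    # 12 x 300 GB SAS at $800 each, 2x6 layout -> 3.0 TB usable
    sas = cost_per_usable_tb(12, 800, 3.0)
    # 24 x 1 TB SATA at $400 each, 4x6 layout -> 20 TB usable
    sata = cost_per_usable_tb(24, 400, 20.0)
    print(f"SAS:  ${sas:,.0f} per usable TB")
    print(f"SATA: ${sata:,.0f} per usable TB")
```

Both sets of drives cost the same in total (12 * $800 = 24 * $400 = $9,600), so the difference in cost per usable terabyte comes entirely from the capacity gap.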