Revolutionary Methods to Handle Data Durability Challenges for Big Data

Size: px
Start display at page:

Download "Revolutionary Methods to Handle Data Durability Challenges for Big Data"

Transcription

1 WHITE PAPER Data Center Scalability/ Cloud Storage Big Data Revolutionary Methods to Handle Data Durability Challenges for Big Data Intel and Amplidata address the storage challenges presented by the coming tsunami of unstructured data that will stress data center scalability Big Data will earn its place as the next must have competency in 22 as the volume of digital content grows to 2. zettabytes, up 4 percent from 2. Over percent of this information will be unstructured full of rich information, but challenging to understand and analyze. IDC EXECUTIVE SUMMARY The growth in unstructured data is pushing the limits of data center scalability at the same time that disk drive vendors are pushing the limits of data density at tolerable device level bit error rates (BER). For organizations delivering Cloud-hosted services will be a primary concern. The traditional RAID (Redundant Array of Inexpensive Disks) approach in wide use today simply will not provide the levels of data durability and performance required by enterprises dealing with the escalating volume of data. New approaches that go beyond traditional RAID promise to improve rebuild times on high-density disk drives, and reduce susceptibility to disk-error induced corruption, 2 which otherwise would result in crisis if traditional RAID is simply scaled up using current algorithms. In this paper, we will discuss why RAID doesn t scale for Big Data, why erasure code is a better option, and how various erasure code alternatives compare. We will use the long-standing mean-time-to-data-loss (MTTDL) model to compute the risk of data loss over time and show how the Amplidata computationally intense BitSpread algorithm deployed on Intel Xeon processor-based platforms deliver high BitSpread is Amplidata s rateless erasure coding software which is delivered commercially in the AmpliStor Optimized Object Storage system, a Petabyte-scale storage system purpose built for storing massive amounts of big unstructured data. Dr. Sam Siewert, PhD Founder and Senior Partner, Trellis Logic, LLC Cloud Data Scaling Total data produced per year surpassed one zettabyte in 2 and continues to more than double every two years. 3 To put this in perspective, about two zettabytes (ZB) of digital universe will be created (per IDC) in 2 is two million terabytes, or over one million new 3.5" SATA disks with two terabyte (TB) capacities. At the same time, not only is total data stored growing rapidly, but so is storage density. For example, Seagate demonstrated HAMR (Heat Assisted Magnetic Recording) technology with a terabit per square inch hard disk drive that is expected to lead to TB 3.5" hard disk drives within a decade. 4 High capacity drives will challenge traditional RAID with long rebuild times when a single TB drive fails and has to be rebuilt from parity, or mirrored data restored. Huge drives mean RAID systems will have to operate longer in degraded mode with risk of double or triple faults, and data loss will increase. The marketing research community generally agrees that Cloud-hosted data will be somewhere between to 2 percent of all annually generated data, so when total data hits ZB somewhere around 2, then over one ZB will likely be

2 Table of Contents Cloud Data Scaling Scaling the Cloud with RAID Why RAID MTTDL Models are Unrealistic Are There Better Alternatives to Traditional RAID? Amplidata BitSpread Algorithm... 5 Comparing Amplidata BitSpread to Traditional RAID..... A Reference Architecture Proposal Summary stored in the public Cloud. This does not include private and personal Cloud storage Projections from IDC are even more bullish, stating that Finally, Big Data will earn its place as the next must have competency in 22 as the volume of digital content grows to 2. zettabytes, up 4 percent from 2. 5 Over percent of this information will be unstructured (e.g., images, media and web-enabled workloads) full of rich information, but challenging to understand and analyze. Needless to mention, it will be challenging to store all of that data reliably. Users of public Cloud services will expect their data to be safe. But if the same hard disk drive technology is used in the Cloud as is used at home and in small businesses, the only way this will be true is if it is based on enterprise data systems design. When a component storing data fails in an enterprise system, we call that a data erasure. Cloud systems will be expected to not only recover that data, but to do so such that the data remains available with minimal to no service interruption. In some cases a Cloud service provider might use mirrored data, even triple or N-way mirrored, but this is the most costly method to protect against data loss when drives fail. Scaling the Cloud with RAID An enterprise measures the durability of data in terms of how long a system is expected to operate with no data loss. Standard methods include mirroring of data or RAID level, parity RAID or RAID level 5, and more advanced Reed-Solomon (RS) erasure codes which allow for multiple storage component failures before data is lost or becomes inaccessible. Mirroring data is by far the simplest, but also the most costly because it requires full data duplication and twice the resources for RAID, and three times for triple mirroring. RAID level 5 uses simple exclusive OR logic to reduce that duplication to N/N+, or for example, would provide TB of usable capacity. ciency ratio (usable capacity/raw capacity). RAID level extends this parity approach to include double parity, using several different competing encoding algorithms, mathematics for RS encoding of P (XOR In context of the standard MTTDL model (Equation ), RAID breaks down at petascale and beyond (the likely scale for Cloud data centers of the future hosting zettabytes of total data) and lacks the durability needed for safe hosting of user data in the Cloud. MTTDL MTTF (X+) (Number of Data Loss Combinations) * (MTTR) Where, X = simultaneous erasure tolerance X Equation. 2

3 Simply put: Time to Multiple Failures (MTTF) (raised to X+ power). This means, the bigger the MTTF, the bigger is the MTTDL leading to better data durability. to the Number of Data Loss Combinations. This means that the lower the number of combinations we have, the bigger the MTTDL leading to better MTTDL durability. the Mean Time To Recovery (MTTR) (raised to the Xth power). This means, the smaller the MTTR, the bigger is the MTTDL leading to better data durability. In summary, any increase in drive durability has a huge payoff because MTTF is increased, and for the more fault tolerant RAID levels this has a power law, so the any reduction in MTTR improves durability and furthermore, for the more fault tolerant levels, this MTTR term is a power durability for RAID and beyond compared ANNUAL PROBABILITY OF DATA LOSS PER K UNITS, DATA PROTECTION SCALING to single fault tolerant methods. The ideal system maximizes MTTF and minimizes MTTR, but also, perhaps more important, provides fault tolerance to a higher number of drive failures in a protection set. Or, restated, it vastly increases the protection by minimizing the potential for concurrent failures and by shrinking down the exposure window. Figure summarizes the annual probabilities of data loss for RAID5, RAID, mirroring (RAID), three-way mirroring (RAID 3-way), and triple-parity erasure coding. In this probability of one or higher affords weaker data durability, and any scheme that results in lower than one probability affords stronger data durability. As is clear from we resort to triple mirroring or compute intensive and complex triple-erasure coding. The example in Figure shows what might be a common full-rack of storage with several hundred disk drives (at 3 to 4 TB each) and the probability of data loss as a function of useful capacity (not including parity or mirrored data, just unique data) RAID5 RAID RAID RAID 3way EC3 In context of the standard MTTDL model, RAID breaks down at petascale and beyond. Knowing that real systems can suffer greater loss potential than this standard MTTDL model predicts and because even RAID approaches a data loss probability of one at just 35 TB, it is clear that petascale systems demand better protection. With RAID, the only viable work-arounds for better protection than RAID are either 3-way mirroring or RAID+ (mirrored RAID). Why RAID MTTDL Models Are Unrealistic The RAID MTTDL formulations are overly optimistic because they do not consider failure modes such as infant mortality, end-of-life, stress modes like overheating in a data center, or anything beyond what is expected from the manufacturers highly accelerated (perhaps simulated) lifetime testing of a population of the devices. These models do not consider any sort of partial failure of a device (it is a two state Markov model), so devices are considered to either be in a probable working state or less probable total failure state, and not some sort of partially working state. Nor do these formulations estimate the magnitude of data loss which is expected to be at least one more drives-worth of data per RAID set, but loss is considered to be an all or none type of calculation. Perhaps more convincing that the model is optimistic are the large statistical studies such as the one million disk drive study summarized in Figure 2 (see next page).. TOTAL USABLE TERABYTES Figure. Standard MTTDL model comparison of a variety of RAID protection levels along with triple protection erasure coding (EC3). 3

4 NODE LEVEL SYSTEM LEVEL CUMULATIVE PROBABILITY CUMULATIVE PROBABILITY DATA WEIBULL LOGNORMAL GAMMA EXPONENTIAL DATA WEIBULL LOGNORMAL GAMMA EXPONENTIAL INFANT MORTALITY TIME BETWEEN FAILURES (SEC.) 5 The standard MTTDL exponential model is the dashed blue line and the alternative distributions are super-imposed along with the green line, real data. For anyone building a Cloud data center, there are four alarming observations. Viewed one chart at a time we can see: Node Level Infant Mortality (Upper Left): shows that the number of early failures for a single RAID controller can be up to an order of magnitude more frequent than predicted by the standard model. Node Level Mid to End of Life (Upper Right): shows that node failures rates are higher than the standard model. System Level Infant Mortality (Lower Left): shows an even wider rate of disk failures in a large set of drives as compared to node failures. System Level Mid to End-of-Life (Lower Right): is where the standard model predicts best, but is still optimistic. (a) TIME BETWEEN FAILURES (SEC.) Figure 2. Standard model compared to statistics, CUMULATIVE PROBABILITY CUMULATIVE PROBABILITY DATA WEIBULL LOGNORMAL GAMMA EXPONENTIAL MID TO END OF LIFE DATA WEIBULL LOGNORMAL GAMMA EXPONENTIAL TIME BETWEEN FAILURES (SEC.) (b) TIME BETWEEN FAILURES (SEC.) Experience by IT professionals running on-line deep archives is that partial failures and software bugs are a real phenomenon and that RAID level is the minimum protection required but really requires mirrored RAID (RAID+). What necessitates this in deep archives? The need comes from petabytes of data and the need for this data on demand common requirements with public Cloud data services. Triple mirroring is rarely used due to the high cost of 33 percent storage only slightly better since each RAID vol- percent in +2) and with mirroring becomes 4 percent, not much better than triple mirrors, which are simpler to recover when a disk member is lost. RAID level is ok for on-demand but occasional access where the can be tolerated, but for faster access, this will either require hardware acceleration or better durability than RAID+. Are There Better Alternatives to Traditional RAID? The lack of scalability of RAID to petabytes with safe levels of durability has led to re-visiting the fundamentals of data protection using a family of algorithms broadly known as erasure codes. To be accurate, RAID, 5, and are erasure codes, Maximal Distance Separation (MDS) codes. Table summarizes the RAID algorithms in use today. All of these algorithms meet the MDS criteria for encoding, whereby the data protection level matches the number of code or mirror segments in the RAID set. As shown in Table 2, sticking with traditional hierarchical RAID algorithms (not meeting MDS criteria), which combine parity RAID with mirroring, are possible. But these approaches have low storage operation cost) and hence undesirable at Petabyte and higher scale storage systems. Beyond simple XOR parity and mirroring, the mathematics to compute the code segments for MDS codes become high. So, the sheer complexity of these calculations can limit MDS solutions from achieving line rates when this data is accessed or updated. Some proprietary work-arounds oped, such as RAID DP (Double Parity) and the EVEN/ODD algorithm, but general extension of MDS codes beyond double fault protection requires Reed Solomon erasure code. To go beyond double fault protection triple-protection Reed-Solomon erasure code is an option, but innovators have proposed new erasure codes which are simpler, perhaps not meeting MDS criteria in a small set, yet having all of the protection advantages and are designed for scaling huge. 4

5 Algorithm RAID 2-way (mirroring) Data Protection Code/Mirror Segments Unique Data Segments Single fault 5% RAID5 Single fault 4 % RAID Double fault 2 4 % RAID 3-way (triple mirroring) Reed-Solomon Extensions of RAID Table. Hierarchical Algorithm Double fault 2 33% N faults (e.g., 3 faults) Data Protection k=n (3) Code/Mirror Segments m (configurable) (4) Unique Data Segments Storage Efficiency m/(k+m) (5%) RAID5+ Double fault 4 4% RAID+ Quadruple fault 4 33% Table 2. Storage Efficiency BitSpread is a non-mds code, and hence it trades a small amount of extra space overhead compared to MDS codes in exchange for much higher-levels of throughput, faster self-healing, dynamic policy changes, and bit-perfect data integrity assurances as follows: encode/decode times with respect to the number of symbols being encoded. This maximizes throughput, speeds rebuilds and broadens the set of use-cases. BitSpread furthermore builds upon the Intel Xeon SSE 4.2 instruction sets to achieve further performance gains and drive full Gb Ethernet throughput rates on standard Intel Xeon multi-core servers. easy-to-shift spreads from local to distributed multi-site policies on-the-fly and vice-versa. Amplidata BitSpread Algorithm One very interesting example of an innovative algorithm like this is the Amplidata BitSpread algorithm, which is implemented in Amplidata s AmpliStor Optimized Object Storage system. AmpliStor provides storage services over an http/rest interface that enables applications to store virtually unbounded numbers of objects, of any type and any size. BitSpread is hosted on AmpliStor controller nodes, which encodes objects using its rateless (referring to the ability to generate an unlimited number of equations from the input source) erasure-coding algorithm and distributes the encoded data across the AmpliStor storage nodes. Only a subset of the encoded data elements (actually equations, as demonstrated below), are required to retrieve the original data objects, thereby protecting the data against erasures (due to component failures) or bit error induced corruption. The BitSpread rateless erasure code is distinct from other erasure code implementations based on RS style MDS codes or its variants. While RS codes have the ability to protect data against higher numbers of simultaneous failures than RAID5 or RAID and with storage overhead directly proportional to their protection or safety level, they pay a price in several areas: linear with respect to the number of symbols being encoded). This impacts overall system performance. for new spread policies. RS codes must recalculate for new spread and safety levels, forcing data to be unloaded and reloaded to make any changes in policy especially when adding or removing new data center sites. of bit error protection that can be corrected to a few bit errors per object. super-granular bit error protection over bit errors can occur per object while preserving full object protection, availability and integrity. The easiest way to understand the Amplidata BitSpread algorithm is to consider a simple case where the encoding is a set of equations as shown in Figure 3 below. three simple equations and data chunks X and Y derived from a stored object, which become two unknowns in two equations if there is an erasure. This simple approach is not strictly proportional to protection level in terms of bit overhead compared to RS encoding. But it has huge pay-off from the simplicity, which allows the encoding and recovery to run with minimal complexity and achieving better throughput compared to RS. 5

6 Figure 3 Value Efficiency Durability Series of Equations Any 2 out of 3 equations uniquely determine object Description Raw/usable efficiency Relative data loss risk SIMPLIFIED SIMPLIFIED MATHEMATICS MATHEMATICS Original Object Decomposed Object 5 5 X+Y = 2 X Y = 2 2X+Y = BitSpread m=, k=, nodes 5 5 BitSpeed 5 Number nodes=32, drives/node, Cap/Node=3 TB RAID+ m=, k=, 2 nodes RAID5+ m=, k=, 2 nodes RAID+ m=, k=2, 2 nodes 3% 5% 45% 4% 33% RAID 3-way m=, k=2, 3 nodes Table 3. Summary of Performance for RAID strategies compared to Amplidata BitSpread BitSpread trades a small amount of extra space overhead compared to MDS codes in exchange for much higher-levels of throughput, faster self-healing, dynamic policy changes, and bit-perfect data integrity assurances. Traditional erasure coding schemes implemented by competitive storage solutions have limited device-level BER protection (e.g., 4 four bit errors per device). But Amplidata s solution can tolerate up to 5- BER per object stored without loss of data integrity. Comparing Amplidata BitSpread to Traditional RAID Using the standard MTTDL model as an optimistic proxy for reality, which we believe correctly allows for comparison of data durability based on MTTF and MTTR along with the protection set design, we Amplidata BitSpread protection sets to traditional RAID, including 3-way mirrors and RAID+ mirrored parity sets. The following table summarizes this numerical comparison of methods derived from the two-state Markov model for risk of data loss, which if anything, understates the need for new methods of data protection beyond simple RAID. These numbers indicate that the storage Amplidata s BitSpread than RAID+ and RAID 3-way, providing Cloud applications with high data durability storage capacity. We see that there is less than one chance in million of data loss for thousand units shipped using the BitSpread approach. By comparison, there is high likelihood that data will be lost with RAID 3-way. While RAID+ is durable, it is still times more likely that we will see data loss with this method compared to Amplidata BitSpread. Initial Capex is driven by the cost of the controllers and the storage nodes and disks, so it is directly proportional to the reduction in storage overhead. Furthermore, Opex is predominately due to power use and heat generation of the drives along with IT monitoring costs. The reduction in storage overhead provided

7 Component Quantity Per Rack Configuration Controller Node 3 Table 4. Storage Node Up to 3 (in 44U rack) Network Switch 2 by BitSpread means fewer spindles are needed for data protection, and therefore less power consumed per gigabyte stored, along with reductions in management costs due to self-healing. A Reference Architecture Proposal Intel and Amplidata have proposed a reference architecture (Table 4) to stand-up a robust storage system for enterprises, Internet portals and service providers. Deploying Amplidata Bitspread on Intel Xeon processor based platforms affords high performance scalability, optimal power consumption and very high levels of durability for large-scale data storage. For controller nodes, Amplidata BitSpread uniquely builds upon the Intel Xeon instruction sets (SSE 4.2) to provide scalable high-throughput capabilities. With low-power Intel processors (such as the Xeon Processor E3 22L) used in building the storage nodes, Amplidata enables multi-petabyte scale storage with both high-density and high-performance but also the lowest power consumption of any large scale storage system, requiring under 3 Watts per TB idle. BitSpread is highly optimized for Intel Xeon multi-core processors, multithreading capabilities and high memorybandwidth through advanced software algorithms that take advantage of the provides scalable, full line rate GB Ethernet network throughput per controller. Summary The standard MTTDL model for data durability is an imperfect and optimistic predictor, but good enough for comparing afforded between erasure codes including traditional RAID and more advanced erasure codes like Amplidata BitSpread. The model indicates the same negative trend for traditional RAID strategies even with this optimistic model (it only gets worse with more accurate models). We have shown here using the optimistic standard model that Cloud data centers require data protection better than RAID 3-way or RAID+. scalable, and practical alternative to the stop-gap of combined RAID levels like + and N-way mirroring. Furthermore, Amplidata s BitSpread is more practical to implement and affords better operational codes based storage systems. Deploying Amplidata s BitSpread technology based storage systems using Intel tion and operational savings for the data center implementers while achieving very high levels of data durability.

8 For more information on Amplidata, AmpliStor, and BitSpread please visit An Analysis of Latent Sector Errors in Disk Drives, Lakshmi Bairavasundaram, Garth R. Goodson, et al. 2 An Analysis of Data Corruption in the Storage Stack, Lakshmi Bairavasundaram, Garth R. Goodson, et al. 3 World s data will grow by 5x in next decade, IDC study predicts, by Lucas Mearian, Computerworld, June 2. 4 Seagate hits terabit per square inch, TB hard drives on their way, by Sebastian Anthony, March Mean time to meaningless: MTTDL, Markov models, and storage system reliability, Kevin Greenan, James Plank, Jay Wylie, Hot Topics in Storage and File Systems, June 2. Schroeder, Bianca and Gibson, Garth, A large-scale study of failures in high-performance computing systems (CMU-PDL-5-2) (25). Parallel Data Library. Paper 4. /pdl/4 Disk failures in the real world: What does an MTTF of,, hours mean to you?, Bianca Schroeder, Garth A. Gibson. Unbreakable Object Storage for Exabyte-scale Unstructured Data Amplidata Technology Paper, V5.3, Amplidata Inc., 2. BitSpread overhead has been further reduced in the current release. INFORMATION IN THIS DOCUMENT IS PROVIDED IN CONNECTION WITH INTEL PRODUCTS. NO LICENSE, EXPRESS OR IMPLIED, BY ESTOPPEL OR OTHERWISE, TO ANY INTELLECTUAL PROPERTY RIGHTS IS GRANTED BY THIS DOCUMENT. EXCEPT AS PROVIDED IN INTEL S TERMS AND CONDITIONS OF SALE FOR SUCH PRODUCTS, INTEL ASSUMES NO LIABILITY WHATSOEVER, AND INTEL DISCLAIMS ANY EXPRESS OR IMPLIED WARRANTY, RELATING TO SALE AND/OR USE OF INTEL PRODUCTS INCLUDING LIABILITY OR WARRANTIES RELATING TO FITNESS FOR A PARTICULAR PURPOSE, MERCHANTABILITY, OR INFRINGEMENT OF ANY PATENT, COPYRIGHT OR OTHER INTELLECTUAL PROPERTY RIGHT. UNLESS OTHERWISE AGREED IN WRITING BY INTEL, THE INTEL PRODUCTS ARE NOT DESIGNED NOR INTENDED FOR ANY APPLICATION IN WHICH THE FAILURE OF THE INTEL PRODUCT COULD CREATE A SITUATION WHERE PERSONAL INJURY OR DEATH MAY OCCUR. of documents which have an order number and are referenced in this document, or other Intel literature, may be obtained by calling , or by visiting Intel s Web site at Copyright 22 Intel Corporation. All rights reserved. Intel, the Intel logo, and Xeon are trademarks of Intel Corporation in the U.S. and other countries. *Other names and brands may be claimed as the property of others. Printed in USA 2/CSP/HBD/PDF Please Recycle 32-US

Building Storage Clouds for Online Applications A Case for Optimized Object Storage

Building Storage Clouds for Online Applications A Case for Optimized Object Storage Building Storage Clouds for Online Applications A Case for Optimized Object Storage Agenda Introduction: storage facts and trends Call for more online storage! AmpliStor: Optimized Object Storage Cost

More information

RAID Basics Training Guide

RAID Basics Training Guide RAID Basics Training Guide Discover a Higher Level of Performance RAID matters. Rely on Intel RAID. Table of Contents 1. What is RAID? 2. RAID Levels RAID 0 RAID 1 RAID 5 RAID 6 RAID 10 RAID 0+1 RAID 1E

More information

Next Generation Scalable and Efficient Data Protection

Next Generation Scalable and Efficient Data Protection Next Generation Scalable and Efficient Data Protection Dr. Sam Siewert, Software Engineer, Intel Greg Scott, Cloud Storage Manager, Intel STOS004 Agenda What Is Durability and Why Should You Care? Measuring

More information

an analysis of RAID 5DP

an analysis of RAID 5DP an analysis of RAID 5DP a qualitative and quantitative comparison of RAID levels and data protection hp white paper for information about the va 7000 series and periodic updates to this white paper see

More information

Why RAID is Dead for Big Data Storage. The business case for why IT Executives are making a strategic shift from RAID to Information Dispersal

Why RAID is Dead for Big Data Storage. The business case for why IT Executives are making a strategic shift from RAID to Information Dispersal Why RAID is Dead for Big Data Storage The business case for why IT Executives are making a strategic shift from RAID to Information Dispersal Executive Summary Data is exploding, growing 10X every five

More information

Intel and Qihoo 360 Internet Portal Datacenter - Big Data Storage Optimization Case Study

Intel and Qihoo 360 Internet Portal Datacenter - Big Data Storage Optimization Case Study Intel and Qihoo 360 Internet Portal Datacenter - Big Data Storage Optimization Case Study The adoption of cloud computing creates many challenges and opportunities in big data management and storage. To

More information

Reliability and Fault Tolerance in Storage

Reliability and Fault Tolerance in Storage Reliability and Fault Tolerance in Storage Dalit Naor/ Dima Sotnikov IBM Haifa Research Storage Systems 1 Advanced Topics on Storage Systems - Spring 2014, Tel-Aviv University http://www.eng.tau.ac.il/semcom

More information

How To Make A Backup System More Efficient

How To Make A Backup System More Efficient Identifying the Hidden Risk of Data De-duplication: How the HYDRAstor Solution Proactively Solves the Problem October, 2006 Introduction Data de-duplication has recently gained significant industry attention,

More information

Intel Platform and Big Data: Making big data work for you.

Intel Platform and Big Data: Making big data work for you. Intel Platform and Big Data: Making big data work for you. 1 From data comes insight New technologies are enabling enterprises to transform opportunity into reality by turning big data into actionable

More information

Cloud Storage. Parallels. Performance Benchmark Results. White Paper. www.parallels.com

Cloud Storage. Parallels. Performance Benchmark Results. White Paper. www.parallels.com Parallels Cloud Storage White Paper Performance Benchmark Results www.parallels.com Table of Contents Executive Summary... 3 Architecture Overview... 3 Key Features... 4 No Special Hardware Requirements...

More information

Solving Data Loss in Massive Storage Systems Jason Resch Cleversafe

Solving Data Loss in Massive Storage Systems Jason Resch Cleversafe Solving Data Loss in Massive Storage Systems Jason Resch Cleversafe 2010 Storage Developer Conference. Insert Your Company Name. All Rights Reserved. 1 In the beginning There was replication Long before

More information

Best Practices RAID Implementations for Snap Servers and JBOD Expansion

Best Practices RAID Implementations for Snap Servers and JBOD Expansion STORAGE SOLUTIONS WHITE PAPER Best Practices RAID Implementations for Snap Servers and JBOD Expansion Contents Introduction...1 Planning for the End Result...1 Availability Considerations...1 Drive Reliability...2

More information

How To Understand And Understand The Power Of Aird 6 On Clariion

How To Understand And Understand The Power Of Aird 6 On Clariion A Detailed Review Abstract This white paper discusses the EMC CLARiiON RAID 6 implementation available in FLARE 26 and later, including an overview of RAID 6 and the CLARiiON-specific implementation, when

More information

technology brief RAID Levels March 1997 Introduction Characteristics of RAID Levels

technology brief RAID Levels March 1997 Introduction Characteristics of RAID Levels technology brief RAID Levels March 1997 Introduction RAID is an acronym for Redundant Array of Independent Disks (originally Redundant Array of Inexpensive Disks) coined in a 1987 University of California

More information

Identifying the Hidden Risk of Data Deduplication: How the HYDRAstor TM Solution Proactively Solves the Problem

Identifying the Hidden Risk of Data Deduplication: How the HYDRAstor TM Solution Proactively Solves the Problem Identifying the Hidden Risk of Data Deduplication: How the HYDRAstor TM Solution Proactively Solves the Problem Advanced Storage Products Group Table of Contents 1 - Introduction 2 Data Deduplication 3

More information

Intel Cloud Builder Guide: Cloud Design and Deployment on Intel Platforms

Intel Cloud Builder Guide: Cloud Design and Deployment on Intel Platforms EXECUTIVE SUMMARY Intel Cloud Builder Guide Intel Xeon Processor-based Servers Red Hat* Cloud Foundations Intel Cloud Builder Guide: Cloud Design and Deployment on Intel Platforms Red Hat* Cloud Foundations

More information

How To Create A Multi Disk Raid

How To Create A Multi Disk Raid Click on the diagram to see RAID 0 in action RAID Level 0 requires a minimum of 2 drives to implement RAID 0 implements a striped disk array, the data is broken down into blocks and each block is written

More information

HadoopTM Analytics DDN

HadoopTM Analytics DDN DDN Solution Brief Accelerate> HadoopTM Analytics with the SFA Big Data Platform Organizations that need to extract value from all data can leverage the award winning SFA platform to really accelerate

More information

Intel Cloud Builder Guide to Cloud Design and Deployment on Intel Platforms

Intel Cloud Builder Guide to Cloud Design and Deployment on Intel Platforms Intel Cloud Builder Guide to Cloud Design and Deployment on Intel Platforms Ubuntu* Enterprise Cloud Executive Summary Intel Cloud Builder Guide Intel Xeon Processor Ubuntu* Enteprise Cloud Canonical*

More information

Intel RAID SSD Cache Controller RCS25ZB040

Intel RAID SSD Cache Controller RCS25ZB040 SOLUTION Brief Intel RAID SSD Cache Controller RCS25ZB040 When Faster Matters Cost-Effective Intelligent RAID with Embedded High Performance Flash Intel RAID SSD Cache Controller RCS25ZB040 When Faster

More information

RAID-DP: NetApp Implementation of Double- Parity RAID for Data Protection

RAID-DP: NetApp Implementation of Double- Parity RAID for Data Protection Technical Report RAID-DP: NetApp Implementation of Double- Parity RAID for Data Protection Jay White & Chris Lueth, NetApp May 2010 TR-3298 ABSTRACT This document provides an in-depth overview of the NetApp

More information

Data Protection Technologies: What comes after RAID? Vladimir Sapunenko, INFN-CNAF HEPiX Spring 2012 Workshop

Data Protection Technologies: What comes after RAID? Vladimir Sapunenko, INFN-CNAF HEPiX Spring 2012 Workshop Data Protection Technologies: What comes after RAID? Vladimir Sapunenko, INFN-CNAF HEPiX Spring 2012 Workshop Arguments to be discussed Scaling storage for clouds Is RAID dead? Erasure coding as RAID replacement

More information

Three-Dimensional Redundancy Codes for Archival Storage

Three-Dimensional Redundancy Codes for Archival Storage Three-Dimensional Redundancy Codes for Archival Storage Jehan-François Pâris Darrell D. E. Long Witold Litwin Department of Computer Science University of Houston Houston, T, USA jfparis@uh.edu Department

More information

PIONEER RESEARCH & DEVELOPMENT GROUP

PIONEER RESEARCH & DEVELOPMENT GROUP SURVEY ON RAID Aishwarya Airen 1, Aarsh Pandit 2, Anshul Sogani 3 1,2,3 A.I.T.R, Indore. Abstract RAID stands for Redundant Array of Independent Disk that is a concept which provides an efficient way for

More information

RAID 5 rebuild performance in ProLiant

RAID 5 rebuild performance in ProLiant RAID 5 rebuild performance in ProLiant technology brief Abstract... 2 Overview of the RAID 5 rebuild process... 2 Estimating the mean-time-to-failure (MTTF)... 3 Factors affecting RAID 5 array rebuild

More information

Business-centric Storage FUJITSU Hyperscale Storage System ETERNUS CD10000

Business-centric Storage FUJITSU Hyperscale Storage System ETERNUS CD10000 Business-centric Storage FUJITSU Hyperscale Storage System ETERNUS CD10000 Clear the way for new business opportunities. Unlock the power of data. Overcoming storage limitations Unpredictable data growth

More information

Accelerating Business Intelligence with Large-Scale System Memory

Accelerating Business Intelligence with Large-Scale System Memory Accelerating Business Intelligence with Large-Scale System Memory A Proof of Concept by Intel, Samsung, and SAP Executive Summary Real-time business intelligence (BI) plays a vital role in driving competitiveness

More information

WHITE PAPER. QUANTUM LATTUS: Next-Generation Object Storage for Big Data Archives

WHITE PAPER. QUANTUM LATTUS: Next-Generation Object Storage for Big Data Archives WHITE PAPER QUANTUM LATTUS: Next-Generation Object Storage for Big Data Archives CONTENTS Executive Summary....................................................................3 The Limits of Traditional

More information

Reliability of Data Storage Systems

Reliability of Data Storage Systems Zurich Research Laboratory Ilias Iliadis April 2, 25 Keynote NexComm 25 www.zurich.ibm.com 25 IBM Corporation Long-term Storage of Increasing Amount of Information An increasing amount of information is

More information

Reducing Data Center Power Consumption Through Efficient Storage

Reducing Data Center Power Consumption Through Efficient Storage NETAPP WHITE PAPER Reducing Data Center Power Consumption Through Efficient Storage Larry Freeman, NetApp, Inc. July 2009 WP-7010-0709 AN EIGHT-POINT PLAN FOR FIGHTING POWER CONSUMPTION The NetApp eight-point

More information

EMC XTREMIO EXECUTIVE OVERVIEW

EMC XTREMIO EXECUTIVE OVERVIEW EMC XTREMIO EXECUTIVE OVERVIEW COMPANY BACKGROUND XtremIO develops enterprise data storage systems based completely on random access media such as flash solid-state drives (SSDs). By leveraging the underlying

More information

RAID 6 with HP Advanced Data Guarding technology:

RAID 6 with HP Advanced Data Guarding technology: RAID 6 with HP Advanced Data Guarding technology: a cost-effective, fault-tolerant solution technology brief Abstract... 2 Introduction... 2 Functions and limitations of RAID schemes... 3 Fault tolerance

More information

Intel Cloud Builders Guide to Cloud Design and Deployment on Intel Platforms

Intel Cloud Builders Guide to Cloud Design and Deployment on Intel Platforms Intel Cloud Builders Guide Intel Xeon Processor-based Servers RES Virtual Desktop Extender Intel Cloud Builders Guide to Cloud Design and Deployment on Intel Platforms Client Aware Cloud with RES Virtual

More information

An Oracle White Paper November 2010. Backup and Recovery with Oracle s Sun ZFS Storage Appliances and Oracle Recovery Manager

An Oracle White Paper November 2010. Backup and Recovery with Oracle s Sun ZFS Storage Appliances and Oracle Recovery Manager An Oracle White Paper November 2010 Backup and Recovery with Oracle s Sun ZFS Storage Appliances and Oracle Recovery Manager Introduction...2 Oracle Backup and Recovery Solution Overview...3 Oracle Recovery

More information

FLASH 15 MINUTE GUIDE DELIVER MORE VALUE AT LOWER COST WITH XTREMIO ALL- FLASH ARRAY Unparal eled performance with in- line data services al the time

FLASH 15 MINUTE GUIDE DELIVER MORE VALUE AT LOWER COST WITH XTREMIO ALL- FLASH ARRAY Unparal eled performance with in- line data services al the time FLASH 15 MINUTE GUIDE DELIVER MORE VALUE AT LOWER COST WITH XTREMIO ALL- FLASH ARRAY Unparalleled performance with in- line data services all the time OVERVIEW Opportunities to truly innovate are rare.

More information

Accelerating Business Intelligence with Large-Scale System Memory

Accelerating Business Intelligence with Large-Scale System Memory Accelerating Business Intelligence with Large-Scale System Memory A Proof of Concept by Intel, Samsung, and SAP Executive Summary Real-time business intelligence (BI) plays a vital role in driving competitiveness

More information

WHITE PAPER. Reinventing Large-Scale Digital Libraries With Object Storage Technology

WHITE PAPER. Reinventing Large-Scale Digital Libraries With Object Storage Technology WHITE PAPER Reinventing Large-Scale Digital Libraries With Object Storage Technology CONTENTS Introduction..........................................................................3 Hitting The Limits

More information

Comprehending the Tradeoffs between Deploying Oracle Database on RAID 5 and RAID 10 Storage Configurations. Database Solutions Engineering

Comprehending the Tradeoffs between Deploying Oracle Database on RAID 5 and RAID 10 Storage Configurations. Database Solutions Engineering Comprehending the Tradeoffs between Deploying Oracle Database on RAID 5 and RAID 10 Storage Configurations A Dell Technical White Paper Database Solutions Engineering By Sudhansu Sekhar and Raghunatha

More information

Summer Student Project Report

Summer Student Project Report Summer Student Project Report Dimitris Kalimeris National and Kapodistrian University of Athens June September 2014 Abstract This report will outline two projects that were done as part of a three months

More information

SSDs and RAID: What s the right strategy. Paul Goodwin VP Product Development Avant Technology

SSDs and RAID: What s the right strategy. Paul Goodwin VP Product Development Avant Technology SSDs and RAID: What s the right strategy Paul Goodwin VP Product Development Avant Technology SSDs and RAID: What s the right strategy Flash Overview SSD Overview RAID overview Thoughts about Raid Strategies

More information

RAID Technology Overview

RAID Technology Overview RAID Technology Overview HP Smart Array RAID Controllers HP Part Number: J6369-90050 Published: September 2007 Edition: 1 Copyright 2007 Hewlett-Packard Development Company L.P. Legal Notices Copyright

More information

StorTrends RAID Considerations

StorTrends RAID Considerations StorTrends RAID Considerations MAN-RAID 04/29/2011 Copyright 1985-2011 American Megatrends, Inc. All rights reserved. American Megatrends, Inc. 5555 Oakbrook Parkway, Building 200 Norcross, GA 30093 Revision

More information

BrightStor ARCserve Backup for Windows

BrightStor ARCserve Backup for Windows BrightStor ARCserve Backup for Windows Tape RAID Option Guide r11.5 D01183-1E This documentation and related computer software program (hereinafter referred to as the "Documentation") is for the end user's

More information

An Introduction to RAID 6 ULTAMUS TM RAID

An Introduction to RAID 6 ULTAMUS TM RAID An Introduction to RAID 6 ULTAMUS TM RAID The highly attractive cost per GB of SATA storage capacity is causing RAID products based on the technology to grow in popularity. SATA RAID is now being used

More information

Intel RAID Controllers

Intel RAID Controllers Intel RAID Controllers Best Practices White Paper April, 2008 Enterprise Platforms and Services Division - Marketing Revision History Date Revision Number April, 2008 1.0 Initial release. Modifications

More information

IBM ^ xseries ServeRAID Technology

IBM ^ xseries ServeRAID Technology IBM ^ xseries ServeRAID Technology Reliability through RAID technology Executive Summary: t long ago, business-critical computing on industry-standard platforms was unheard of. Proprietary systems were

More information

Three Paths to Faster Simulations Using ANSYS Mechanical 16.0 and Intel Architecture

Three Paths to Faster Simulations Using ANSYS Mechanical 16.0 and Intel Architecture White Paper Intel Xeon processor E5 v3 family Intel Xeon Phi coprocessor family Digital Design and Engineering Three Paths to Faster Simulations Using ANSYS Mechanical 16.0 and Intel Architecture Executive

More information

Upgrading Data Center Network Architecture to 10 Gigabit Ethernet

Upgrading Data Center Network Architecture to 10 Gigabit Ethernet Intel IT IT Best Practices Data Centers January 2011 Upgrading Data Center Network Architecture to 10 Gigabit Ethernet Executive Overview Upgrading our network architecture will optimize our data center

More information

Pivot3 Desktop Virtualization Appliances. vstac VDI Technology Overview

Pivot3 Desktop Virtualization Appliances. vstac VDI Technology Overview Pivot3 Desktop Virtualization Appliances vstac VDI Technology Overview February 2012 Pivot3 Desktop Virtualization Technology Overview Table of Contents Executive Summary... 3 The Pivot3 VDI Appliance...

More information

Using RAID6 for Advanced Data Protection

Using RAID6 for Advanced Data Protection Using RAI6 for Advanced ata Protection 2006 Infortrend Corporation. All rights reserved. Table of Contents The Challenge of Fault Tolerance... 3 A Compelling Technology: RAI6... 3 Parity... 4 Why Use RAI6...

More information

Milestone Solution Partner IT Infrastructure MTP Certification Report Scality RING Software-Defined Storage 11-16-2015

Milestone Solution Partner IT Infrastructure MTP Certification Report Scality RING Software-Defined Storage 11-16-2015 Milestone Solution Partner IT Infrastructure MTP Certification Report Scality RING Software-Defined Storage 11-16-2015 Table of Contents Introduction... 4 Certified Products... 4 Key Findings... 5 Solution

More information

Replication and Erasure Coding Explained

Replication and Erasure Coding Explained Replication and Erasure Coding Explained Caringo Elastic Content Protection Technical Overview Paul Carpentier, CTO, Caringo Introduction: data loss Unlike what most storage vendors will try to make you

More information

Inside Track Research Note. In association with. Data Protection and RAID. Modern business needs new storage protection

Inside Track Research Note. In association with. Data Protection and RAID. Modern business needs new storage protection Research Note In association with Data Protection and RAID Modern business needs new storage protection January 2015 In a nutshell About this The insights presented in this document are derived from independent

More information

Maximize Performance and Scalability of RADIOSS* Structural Analysis Software on Intel Xeon Processor E7 v2 Family-Based Platforms

Maximize Performance and Scalability of RADIOSS* Structural Analysis Software on Intel Xeon Processor E7 v2 Family-Based Platforms Maximize Performance and Scalability of RADIOSS* Structural Analysis Software on Family-Based Platforms Executive Summary Complex simulations of structural and systems performance, such as car crash simulations,

More information

Enterprise-class versus Desktopclass

Enterprise-class versus Desktopclass Enterprise-class versus Desktopclass Hard Drives April, 2008 Enterprise Platforms and Services Division - Marketing Revision History Date Revision Number April, 2008 1.0 Initial Release Modifications Disclaimers

More information

DELL RAID PRIMER DELL PERC RAID CONTROLLERS. Joe H. Trickey III. Dell Storage RAID Product Marketing. John Seward. Dell Storage RAID Engineering

DELL RAID PRIMER DELL PERC RAID CONTROLLERS. Joe H. Trickey III. Dell Storage RAID Product Marketing. John Seward. Dell Storage RAID Engineering DELL RAID PRIMER DELL PERC RAID CONTROLLERS Joe H. Trickey III Dell Storage RAID Product Marketing John Seward Dell Storage RAID Engineering http://www.dell.com/content/topics/topic.aspx/global/products/pvaul/top

More information

January 2013. A Better RAID Strategy for High Capacity Drives in Mainframe Storage

January 2013. A Better RAID Strategy for High Capacity Drives in Mainframe Storage January 2013 A Better RAID Strategy for High Capacity Drives in Mainframe Storage Introduction... 2 RAID6 Data Integrity Now and Then... 2 What Causes This Change... 4 Time to Rethink RAID6... 4 Assigning

More information

3PAR Fast RAID: High Performance Without Compromise

3PAR Fast RAID: High Performance Without Compromise 3PAR Fast RAID: High Performance Without Compromise Karl L. Swartz Document Abstract: 3PAR Fast RAID allows the 3PAR InServ Storage Server to deliver higher performance with less hardware, reducing storage

More information

Technology Update White Paper. High Speed RAID 6. Powered by Custom ASIC Parity Chips

Technology Update White Paper. High Speed RAID 6. Powered by Custom ASIC Parity Chips Technology Update White Paper High Speed RAID 6 Powered by Custom ASIC Parity Chips High Speed RAID 6 Powered by Custom ASIC Parity Chips Why High Speed RAID 6? Winchester Systems has developed High Speed

More information

INCREASING EFFICIENCY WITH EASY AND COMPREHENSIVE STORAGE MANAGEMENT

INCREASING EFFICIENCY WITH EASY AND COMPREHENSIVE STORAGE MANAGEMENT INCREASING EFFICIENCY WITH EASY AND COMPREHENSIVE STORAGE MANAGEMENT UNPRECEDENTED OBSERVABILITY, COST-SAVING PERFORMANCE ACCELERATION, AND SUPERIOR DATA PROTECTION KEY FEATURES Unprecedented observability

More information

PARALLELS CLOUD STORAGE

PARALLELS CLOUD STORAGE PARALLELS CLOUD STORAGE Performance Benchmark Results 1 Table of Contents Executive Summary... Error! Bookmark not defined. Architecture Overview... 3 Key Features... 5 No Special Hardware Requirements...

More information

High Availability Server Clustering Solutions

High Availability Server Clustering Solutions White Paper High vailability Server Clustering Solutions Extending the benefits of technology into the server arena Intel in Communications Contents Executive Summary 3 Extending Protection to Storage

More information

GENERAL INFORMATION COPYRIGHT... 3 NOTICES... 3 XD5 PRECAUTIONS... 3 INTRODUCTION... 4 FEATURES... 4 SYSTEM REQUIREMENT... 4

GENERAL INFORMATION COPYRIGHT... 3 NOTICES... 3 XD5 PRECAUTIONS... 3 INTRODUCTION... 4 FEATURES... 4 SYSTEM REQUIREMENT... 4 1 Table of Contents GENERAL INFORMATION COPYRIGHT... 3 NOTICES... 3 XD5 PRECAUTIONS... 3 INTRODUCTION... 4 FEATURES... 4 SYSTEM REQUIREMENT... 4 XD5 FAMILULARIZATION... 5 PACKAGE CONTENTS... 5 HARDWARE

More information

Intel Cloud Builder Guide to Cloud Design and Deployment on Intel Xeon Processor-based Platforms

Intel Cloud Builder Guide to Cloud Design and Deployment on Intel Xeon Processor-based Platforms Intel Cloud Builder Guide to Cloud Design and Deployment on Intel Xeon Processor-based Platforms Enomaly Elastic Computing Platform, * Service Provider Edition Executive Summary Intel Cloud Builder Guide

More information

REDUCING DATA CENTER POWER CONSUMPTION THROUGH EFFICIENT STORAGE

REDUCING DATA CENTER POWER CONSUMPTION THROUGH EFFICIENT STORAGE NETAPP VISION SERIES REDUCING DATA CENTER POWER CONSUMPTION THROUGH EFFICIENT STORAGE Brett Battles, Cathy Belleville, Susan Grabau, Judith Maurier February 2007 WP-7010-0207 AN EIGHT-POINT PLAN FOR FIGHTING

More information

Leading Virtualization 2.0

Leading Virtualization 2.0 Leading Virtualization 2.0 How Intel is driving virtualization beyond consolidation into a solution for maximizing business agility within the enterprise White Paper Intel Virtualization Technology (Intel

More information

How To Improve Performance On A Single Chip Computer

How To Improve Performance On A Single Chip Computer : Redundant Arrays of Inexpensive Disks this discussion is based on the paper:» A Case for Redundant Arrays of Inexpensive Disks (),» David A Patterson, Garth Gibson, and Randy H Katz,» In Proceedings

More information

How to choose the right RAID for your Dedicated Server

How to choose the right RAID for your Dedicated Server Overview of RAID Let's first address, "What is RAID and what does RAID stand for?" RAID, an acronym for "Redundant Array of Independent Disks, is a storage technology that links or combines multiple hard

More information

Mean time to meaningless: MTTDL, Markov models, and storage system reliability

Mean time to meaningless: MTTDL, Markov models, and storage system reliability Mean time to meaningless: MTTDL, Markov models, and storage system reliability Kevin M. Greenan ParaScale, Inc. James S. Plank University of Tennessee Jay J. Wylie HP Labs Abstract Mean Time To Data Loss

More information

EMC XtremSF: Delivering Next Generation Performance for Oracle Database

EMC XtremSF: Delivering Next Generation Performance for Oracle Database White Paper EMC XtremSF: Delivering Next Generation Performance for Oracle Database Abstract This white paper addresses the challenges currently facing business executives to store and process the growing

More information

RAID Levels and Components Explained Page 1 of 23

RAID Levels and Components Explained Page 1 of 23 RAID Levels and Components Explained Page 1 of 23 What's RAID? The purpose of this document is to explain the many forms or RAID systems, and why they are useful, and their disadvantages. RAID - Redundant

More information

HADOOP ON ORACLE ZFS STORAGE A TECHNICAL OVERVIEW

HADOOP ON ORACLE ZFS STORAGE A TECHNICAL OVERVIEW HADOOP ON ORACLE ZFS STORAGE A TECHNICAL OVERVIEW 757 Maleta Lane, Suite 201 Castle Rock, CO 80108 Brett Weninger, Managing Director brett.weninger@adurant.com Dave Smelker, Managing Principal dave.smelker@adurant.com

More information

RAID Overview: Identifying What RAID Levels Best Meet Customer Needs. Diamond Series RAID Storage Array

RAID Overview: Identifying What RAID Levels Best Meet Customer Needs. Diamond Series RAID Storage Array ATTO Technology, Inc. Corporate Headquarters 155 Crosspoint Parkway Amherst, NY 14068 Phone: 716-691-1999 Fax: 716-691-9353 www.attotech.com sales@attotech.com RAID Overview: Identifying What RAID Levels

More information

Dependable Systems. 9. Redundant arrays of. Prof. Dr. Miroslaw Malek. Wintersemester 2004/05 www.informatik.hu-berlin.de/rok/zs

Dependable Systems. 9. Redundant arrays of. Prof. Dr. Miroslaw Malek. Wintersemester 2004/05 www.informatik.hu-berlin.de/rok/zs Dependable Systems 9. Redundant arrays of inexpensive disks (RAID) Prof. Dr. Miroslaw Malek Wintersemester 2004/05 www.informatik.hu-berlin.de/rok/zs Redundant Arrays of Inexpensive Disks (RAID) RAID is

More information

REDUCING DATA CENTER POWER CONSUMPTION THROUGH EFFICIENT STORAGE

REDUCING DATA CENTER POWER CONSUMPTION THROUGH EFFICIENT STORAGE NETAPP WHITE PAPER REDUCING DATA CENTER POWER CONSUMPTION THROUGH EFFICIENT STORAGE Brett Battles, Cathy Belleville, Susan Grabau, Judith Maurier, Network Appliance, Inc. February 2007 WP-7010-0207 AN

More information

ANY SURVEILLANCE, ANYWHERE, ANYTIME

ANY SURVEILLANCE, ANYWHERE, ANYTIME ANY SURVEILLANCE, ANYWHERE, ANYTIME WHITEPAPER DDN Storage Powers Next Generation Video Surveillance Infrastructure INTRODUCTION Over the past decade, the world has seen tremendous growth in the use of

More information

RAID-DP : NETWORK APPLIANCE IMPLEMENTATION OF RAID DOUBLE PARITY FOR DATA PROTECTION A HIGH-SPEED IMPLEMENTATION OF RAID 6

RAID-DP : NETWORK APPLIANCE IMPLEMENTATION OF RAID DOUBLE PARITY FOR DATA PROTECTION A HIGH-SPEED IMPLEMENTATION OF RAID 6 RAID-DP : NETWORK APPLIANCE IMPLEMENTATION OF RAID DOUBLE PARITY FOR DATA PROTECTION A HIGH-SPEED IMPLEMENTATION OF RAID 6 Chris Lueth, Network Appliance, Inc. TR-3298 [12/2006] ABSTRACT To date, RAID

More information

WHITE PAPER. Dedupe-Centric Storage. Hugo Patterson, Chief Architect, Data Domain. Storage. Deduplication. September 2007

WHITE PAPER. Dedupe-Centric Storage. Hugo Patterson, Chief Architect, Data Domain. Storage. Deduplication. September 2007 WHITE PAPER Dedupe-Centric Storage Hugo Patterson, Chief Architect, Data Domain Deduplication Storage September 2007 w w w. d a t a d o m a i n. c o m - 2 0 0 7 1 DATA DOMAIN I Contents INTRODUCTION................................

More information

Protecting Microsoft SQL Server with an Integrated Dell / CommVault Solution. Database Solutions Engineering

Protecting Microsoft SQL Server with an Integrated Dell / CommVault Solution. Database Solutions Engineering Protecting Microsoft SQL Server with an Integrated Dell / CommVault Solution Database Solutions Engineering By Subhashini Prem and Leena Kushwaha Dell Product Group March 2009 THIS WHITE PAPER IS FOR INFORMATIONAL

More information

Intel Solid-State Drives Increase Productivity of Product Design and Simulation

Intel Solid-State Drives Increase Productivity of Product Design and Simulation WHITE PAPER Intel Solid-State Drives Increase Productivity of Product Design and Simulation Intel Solid-State Drives Increase Productivity of Product Design and Simulation A study of how Intel Solid-State

More information

VERY IMPORTANT NOTE! - RAID

VERY IMPORTANT NOTE! - RAID Disk drives are an integral part of any computing system. Disk drives are usually where the operating system and all of an enterprise or individual s data are stored. They are also one of the weakest links

More information

Real-Time Big Data Analytics SAP HANA with the Intel Distribution for Apache Hadoop software

Real-Time Big Data Analytics SAP HANA with the Intel Distribution for Apache Hadoop software Real-Time Big Data Analytics with the Intel Distribution for Apache Hadoop software Executive Summary is already helping businesses extract value out of Big Data by enabling real-time analysis of diverse

More information

Data Corruption In Storage Stack - Review

Data Corruption In Storage Stack - Review Theoretical Aspects of Storage Systems Autumn 2009 Chapter 2: Double Disk Failures André Brinkmann Data Corruption in the Storage Stack What are Latent Sector Errors What is Silent Data Corruption Checksum

More information

Intel Rapid Storage Technology

Intel Rapid Storage Technology Intel Rapid Storage Technology User Guide August 2011 Revision 1.0 1 Document Number: XXXXXX INFORMATION IN THIS DOCUMENT IS PROVIDED IN CONNECTION WITH INTEL PRODUCTS. NO LICENSE, EXPRESS OR IMPLIED,

More information

The Shortcut Guide to Balancing Storage Costs and Performance with Hybrid Storage

The Shortcut Guide to Balancing Storage Costs and Performance with Hybrid Storage The Shortcut Guide to Balancing Storage Costs and Performance with Hybrid Storage sponsored by Dan Sullivan Chapter 1: Advantages of Hybrid Storage... 1 Overview of Flash Deployment in Hybrid Storage Systems...

More information

Data Deduplication: An Essential Component of your Data Protection Strategy

Data Deduplication: An Essential Component of your Data Protection Strategy WHITE PAPER: THE EVOLUTION OF DATA DEDUPLICATION Data Deduplication: An Essential Component of your Data Protection Strategy JULY 2010 Andy Brewerton CA TECHNOLOGIES RECOVERY MANAGEMENT AND DATA MODELLING

More information

Benefits of Intel Matrix Storage Technology

Benefits of Intel Matrix Storage Technology Benefits of Intel Matrix Storage Technology White Paper December 2005 Document Number: 310855-001 INFORMATION IN THIS DOCUMENT IS PROVIDED IN CONNECTION WITH INTEL PRODUCTS. NO LICENSE, EXPRESS OR IMPLIED,

More information

IT@Intel. Comparing Multi-Core Processors for Server Virtualization

IT@Intel. Comparing Multi-Core Processors for Server Virtualization White Paper Intel Information Technology Computer Manufacturing Server Virtualization Comparing Multi-Core Processors for Server Virtualization Intel IT tested servers based on select Intel multi-core

More information

Why StrongBox Beats Disk for Long-Term Archiving. Here s how to build an accessible, protected long-term storage strategy for $.003 per GB/month.

Why StrongBox Beats Disk for Long-Term Archiving. Here s how to build an accessible, protected long-term storage strategy for $.003 per GB/month. Why StrongBox Beats Disk for Long-Term Archiving Here s how to build an accessible, protected long-term storage strategy for $.003 per GB/month. The True Cost of Archival Data We keep purchasing more disk

More information

ENABLING GLOBAL HADOOP WITH EMC ELASTIC CLOUD STORAGE

ENABLING GLOBAL HADOOP WITH EMC ELASTIC CLOUD STORAGE ENABLING GLOBAL HADOOP WITH EMC ELASTIC CLOUD STORAGE Hadoop Storage-as-a-Service ABSTRACT This White Paper illustrates how EMC Elastic Cloud Storage (ECS ) can be used to streamline the Hadoop data analytics

More information

Taking Virtualization

Taking Virtualization Taking Virtualization to the SMB Market How Lenovo is driving Virtualization solutions for the SMB market by lowering costs and simplifying deployment White Paper Lenovo Virtualization Solutions Introduction

More information

Distribution One Server Requirements

Distribution One Server Requirements Distribution One Server Requirements Introduction Welcome to the Hardware Configuration Guide. The goal of this guide is to provide a practical approach to sizing your Distribution One application and

More information

Disk Array Data Organizations and RAID

Disk Array Data Organizations and RAID Guest Lecture for 15-440 Disk Array Data Organizations and RAID October 2010, Greg Ganger 1 Plan for today Why have multiple disks? Storage capacity, performance capacity, reliability Load distribution

More information

Object Storage: Out of the Shadows and into the Spotlight

Object Storage: Out of the Shadows and into the Spotlight Technology Insight Paper Object Storage: Out of the Shadows and into the Spotlight By John Webster December 12, 2012 Enabling you to make the best technology decisions Object Storage: Out of the Shadows

More information

How To Build A Clustered Storage Area Network (Csan) From Power All Networks

How To Build A Clustered Storage Area Network (Csan) From Power All Networks Power-All Networks Clustered Storage Area Network: A scalable, fault-tolerant, high-performance storage system. Power-All Networks Ltd Abstract: Today's network-oriented computing environments require

More information

Data Sheet Fujitsu ETERNUS CS High End V5.1 Data Protection Appliance

Data Sheet Fujitsu ETERNUS CS High End V5.1 Data Protection Appliance Data Sheet Fujitsu ETERNUS CS High End V5.1 Data Protection Appliance Radically simplifying data protection ETERNUS CS Data Protection Appliances The Fujitsu ETERNUS CS storage solutions, comprising ETERNUS

More information

Cloud based Holdfast Electronic Sports Game Platform

Cloud based Holdfast Electronic Sports Game Platform Case Study Cloud based Holdfast Electronic Sports Game Platform Intel and Holdfast work together to upgrade Holdfast Electronic Sports Game Platform with cloud technology Background Shanghai Holdfast Online

More information

RAID Level Descriptions. RAID 0 (Striping)

RAID Level Descriptions. RAID 0 (Striping) RAID Level Descriptions RAID 0 (Striping) Offers low cost and maximum performance, but offers no fault tolerance; a single disk failure results in TOTAL data loss. Businesses use RAID 0 mainly for tasks

More information

Data Sheet FUJITSU Storage ETERNUS CD10000

Data Sheet FUJITSU Storage ETERNUS CD10000 Data Sheet FUJITSU Storage ETERNUS CD10000 Data Sheet FUJITSU Storage ETERNUS CD10000 The ultimate hyperscale storage system for large enterprises ETERNUS CD10000 The ETERNUS CD10000 provides unlimited,

More information

A Superior Hardware Platform for Server Virtualization

A Superior Hardware Platform for Server Virtualization A Superior Hardware Platform for Server Virtualization Improving Data Center Flexibility, Performance and TCO with Technology Brief Server Virtualization Server virtualization is helping IT organizations

More information