An Analysis of Data Corruption in the Storage Stack

Similar documents
An Analysis of Data Corruption in the Storage Stack

don t blame disks for every storage subsystem failure

Silent data corruption in SATA arrays: A solution

End-to-end Data integrity Protection in Storage Systems

Data Corruption In Storage Stack - Review

Parity Lost and Parity Regained

Redundant Arrays of Inexpensive Disks (RAIDs)

Checksumming RAID. Brian Kroth Suli Yang Abstract. 1 Introduction. 2 Background. 2.

ZFS Administration 1

Data Domain Overview. Jason Schaaf Senior Account Executive. Troy Schuler Systems Engineer. Copyright 2009 EMC Corporation. All rights reserved.

Taking Linux File and Storage Systems into the Future. Ric Wheeler Director Kernel File and Storage Team Red Hat, Incorporated

Improving File System Reliability and Availability with Continuous Checker and Repair

NetApp RAID-DP : Dual-Parity RAID 6 Protection Without Compromise

Ensuring Robust Data Integrity

WHITEPAPER: Understanding Pillar Axiom Data Protection Options

DNF SECURITY. High Definition Data Protection for Reliable Storage Systems

Mean time to meaningless: MTTDL, Markov models, and storage system reliability

Virtual server management: Top tips on managing storage in virtual server environments

RAID HARDWARE. On board SATA RAID controller. RAID drive caddy (hot swappable) SATA RAID controller card. Anne Watson 1

Guide to SATA Hard Disks Installation and RAID Configuration

Configuration Rules and Examples for FAS20xx Systems

an introduction to networked storage

Guide to SATA Hard Disks Installation and RAID Configuration

Difference between Enterprise SATA HDDs and Desktop HDDs. Difference between Enterprise Class HDD & Desktop HDD

HPC Advisory Council

Xangati Storage Solution Brief. Optimizing Virtual Infrastructure Storage Systems with Xangati

An Integrated End-to-End Data Integrity Solution to Protect Against Silent Data Corruption

The IntelliMagic White Paper: Storage Performance Analysis for an IBM Storwize V7000

How To Write A Server On A Flash Memory On A Perforce Server

File System Reliability (part 2)

RAID technology and IBM TotalStorage NAS products

Advanced DataTools Webcast. Webcast on Oct. 20, 2015

How it can benefit your enterprise. Dejan Kocic Netapp

RELIABLE AUDIOVISUAL ARCHIVING USING UNRELIABLE STORAGE TECHNOLOGY AND SERVICES

Using RAID6 for Advanced Data Protection

General Parallel File System (GPFS) Native RAID For 100,000-Disk Petascale Systems

How Does the ECASD Network Work?

HDD Ghosts, Goblins and Failures in RAID Storage Systems

Optimizing Ext4 for Low Memory Environments

Continuous Data Protection in iscsi Storages

How To Back Up A Computer To A Backup On A Hard Drive On A Microsoft Macbook (Or Ipad) With A Backup From A Flash Drive To A Flash Memory (Or A Flash) On A Flash (Or Macbook) On

Best Practices RAID Implementations for Snap Servers and JBOD Expansion

INCREASING EFFICIENCY WITH EASY AND COMPREHENSIVE STORAGE MANAGEMENT

CompTIA Storage+ Powered by SNIA

RAID. Tiffany Yu-Han Chen. # The performance of different RAID levels # read/write/reliability (fault-tolerant)/overhead

Exam Number/Code:C Exam Name:Enterprise Storage. Version: Demo. Sales V3.

Clustered Data ONTAP 8.3

How To Protect Your Data With Netapp Storevault

COS 318: Operating Systems. Storage Devices. Kai Li Computer Science Department Princeton University. (

High Availability and MetroCluster Configuration Guide For 7-Mode

Oracle Acceleration with the SanDisk ION Accelerator Solution

Storage Subsystem Resiliency Guide

OceanStor 9000 InfoProtector Technical White Paper. Issue 01. Date HUAWEI TECHNOLOGIES CO., LTD.

Trends in Enterprise Backup Deduplication

Performance, Reliability, and Operational Issues for High Performance NAS Storage on Cray Platforms. Cray User Group Meeting June 2007

Guide to SATA Hard Disks Installation and RAID Configuration

Xanadu 130. Business Class Storage Solution. 8G FC Host Connectivity and 6G SAS Backplane. 2U 12-Bay 3.5 Form Factor

Choosing Storage Systems

PARALLELS CLOUD STORAGE

THE SUMMARY. ARKSERIES - pg. 3. ULTRASERIES - pg. 5. EXTREMESERIES - pg. 9

SUSE Enterprise Storage Highly Scalable Software Defined Storage. Māris Smilga

A Virtual Tape Library Architecture & Its Benefits

Price/performance Modern Memory Hierarchy

GPFS Storage Server. Concepts and Setup in Lemanicus BG/Q system" Christian Clémençon (EPFL-DIT)" " 4 April 2013"

Data Protection. Senior Director of Product Management Database High Availability Oracle

FlexArray Virtualization

WHITE PAPER Addressing Enterprise Computing Storage Performance Gaps with Enterprise Flash Drives

Optimizing Large Arrays with StoneFly Storage Concentrators

EMC DATA DOMAIN OPERATING SYSTEM

Disk failures in the real world: What does an MTTF of 1,000,000 hours mean to you?

QuickSpecs. HP Smart Array 5312 Controller. Overview

EMC DATA DOMAIN OPERATING SYSTEM

Disk Failures in Storage Arrays

Highly-Available Distributed Storage. UF HPC Center Research Computing University of Florida

Next Generation Scalable and Efficient Data Protection

File System & Device Drive. Overview of Mass Storage Structure. Moving head Disk Mechanism. HDD Pictures 11/13/2014. CS341: Operating System

Lecture 18: Reliable Storage

SSDs and RAID: What s the right strategy. Paul Goodwin VP Product Development Avant Technology

EMC DATA DOMAIN DATA INVULNERABILITY ARCHITECTURE: ENHANCING DATA INTEGRITY AND RECOVERABILITY

Improving Lustre OST Performance with ClusterStor GridRAID. John Fragalla Principal Architect High Performance Computing

Flash Memory Technology in Enterprise Storage

Guide to SATA Hard Disks Installation and RAID Configuration

NetApp Software. SANtricity Storage Manager Concepts for Version NetApp, Inc. 495 East Java Drive Sunnyvale, CA U.S.

Maintenance Best Practices for Adaptec RAID Solutions

THE CLOUD STORAGE ARGUMENT

Models Smart Array 6402A/128 Controller 3X-KZPEC-BF Smart Array 6404A/256 two 2 channel Controllers

White Paper. Low Cost High Availability Clustering for the Enterprise. Jointly published by Winchester Systems Inc. and Red Hat Inc.

Database Configuration: SAN or NAS

NetApp FAS2000 Series

Important Differences Between Consumer and Enterprise Flash Architectures

CS 6290 I/O and Storage. Milos Prvulovic

technology brief RAID Levels March 1997 Introduction Characteristics of RAID Levels

A Deduplication File System & Course Review

An Oracle White Paper November Backup and Recovery with Oracle s Sun ZFS Storage Appliances and Oracle Recovery Manager

Traditionally, a typical SAN topology uses fibre channel switch wiring while a typical NAS topology uses TCP/IP protocol over common networking

RAID: Redundant Arrays of Independent Disks

HOW TRUENAS LEVERAGES OPENZFS. Storage and Servers Driven by Open Source.

Block-level RAID is dead

Dynamic Disk Pools Delivering Worry-Free Storage

Best Practices for Optimizing Storage for Oracle Automatic Storage Management with Oracle FS1 Series Storage ORACLE WHITE PAPER JANUARY 2015

Transcription:

Department of Computer Science, Institute for System Architecture, Operating Systems Group An Analysis of Data Corruption in the Storage Stack Lakshmi N. Bairavasundaram, Garth Goodson, Bianca Schroeder, Andrea C. Arpaci-Dusseau, Remzi H. Arpaci-Dusseau Presented by Carsten Weinhold

About the Study Large scale study: Tens of thousands of production systems 41 months 1.53 million disks 400,000+ checksum mismatches Both nearline and enterprise class disks Focus on silent data corruption (e.g., not about latent sector errors) Slide 2 of 21

Background: NetApp Storage Systems All storage systems by Network ApplianceTM Dedicated network filers: WAFL file system RAID with parity SCSI layer Fibre Channel (FC) loops Fibre Channel disks / SATA disks with adapter Data collected using Autosupport Sent to central database Note: not all disks were in use for the full duration of 41 months Slide 3 of 21

Background: Data Integrity Segments Slide 4 of 21

Corruption & Detection Slide 5 of 21

Summary Statistics Total of 1.53 million disks Total of 400,000+ checksum mismatches Percentage of corrupt disks varies: 0.86% of 358,000 nearline disks 0.065% of 1,170,000 enterprise class disks Observation 1: the probability of developing checksum mismatches is an order of magnitude higher for nearline disks (+SATA/FC adapter) than for enterprise class disks Slide 6 of 21

Factor Disk Age: Nearline Disks Slide 7 of 21

Factor Disk Age: Enterprise Class Disks Slide 8 of 21

Observations Observation 2: probability of developing checksum mismatches varies significantly across disk models in the same class of disks Observation 3: age affects disk models differently with respect to the probability of developing checksum mismatches Slide 9 of 21

Factor Disk Size?? Slide 10 of 21

(Non-)Factors?? Observation 4: there is no clear indication that disk size affects the probability of developing checksum mismatches Observation 5: there is no clear indication that workload affects the probability of developing checksum mismatches... but: the collected data on access patterns was very coarse and likely to be insufficient Slide 11 of 21

Characteristics: Models, Classes Observation 6: the number of checksum mismatches varies greatly across disks Observation 7: on average, corrupt enterprise class disks develop many more checksum mismatches than corrupt nearline disks Slide 12 of 21

Characteristics: Disks and Disk Shelves Observation 8: checksum mismatches within the same disk are not independent Observation 9: the probability of developing a checksum mismatch is not independent of that of other disks in the same storage system Example: One system had 92 disks develop errors Caused by faulty storage controller Slide 13 of 21

Characteristics: Locality Observation 10: checksum mismatches have high spatial locality Observation 11 & 12: there is temporal locality Slide 14 of 21

Characteristics: Error Type Correlation Observations 12: checksum mismatches correlate with system resets Observation 13: weak positive correlation between checksum mismatches and latent sector errors If latent sector errors detected, probability of developing checksum mismatches increases: Nearline disks: 1.4 times Enterprise class disks: 2.2 times Slide 15 of 21

Request Type Analysis Slide 16 of 21

Comparison to Latent Sector Errors Slide 17 of 21

Lessons Learned Silent corruption does happen: up to 4% of drives developed errors in 17 months On average, 8% of checksum mismatches detected during RAID reconstruction Protection against double disk failure required An enterprise class disk is likely to quickly develop more corruption after first occurrance The faulty disk should be replaced soon Some block numbers are more likely to be affected, possibly due to hardware/firmware bugs Staggered striping for RAID should be used Slide 18 of 21

Lessons Learned (II) Corruptions have strong spatial locality Redundant data structures should stored distant from each other Corruptions also have strong temporal locality Same write request? Use multiple write request for important / redundant data? To be leveraged for smarter scrubbing? Correlation of silent corruption and other errors could be used to improve failure prediction (e.g., latent sector errors) Slide 19 of 21

Discussion Points RAID does not (always) help and most file systems don't do checksumming! Is everything lost? Laptops have only one disk. ZFS supports redundancy on same disk. Any experiences? Can checksumming in the disk itself be improved? What would that mean with respect to firmware bugs? Why are enterprise class disks so much more reliable? Is there any hope that consumer disks catch up in the future? What about flash disks? Slide 20 of 21

References Lakshmi N. Bairavasundaram, Garth Goodson, Bianca Schroeder, Andrea C. Arpaci-Dusseau, Remzi H. Arpaci-Dusseau, An Analysis of Data Corruption in the Storage Stack, FAST '08, San Jose Slide 21 of 21