Maintenance Best Practices for Adaptec RAID Solutions



Similar documents
2-Bay Raid Sub-System Smart Removable 3.5" SATA Multiple Bay Data Storage Device User's Manual

PROMISE ARRAY MANAGEMENT (PAM) for

SATA RAID SIL 3112 CONTROLLER USER S MANUAL

Configuring ThinkServer RAID 100 on the TS140 and TS440

SATA RAID Function (Only for chipset Sil3132 used) User s Manual

VIA RAID Installation Guide

5-Bay Raid Sub-System Smart Removable 3.5" SATA Multiple Bay Data Storage Device User's Manual

RAID Utility User s Guide Instructions for setting up RAID volumes on a computer with a MacPro RAID Card or Xserve RAID Card.

Intel Matrix Storage Console

SATARaid Serial ATA RAID Management Software

RAID Basics Training Guide

Configuring ThinkServer RAID 100 on the Lenovo TS130

NVIDIA RAID Installation Guide

HighPoint Technologies, Inc. Web RAID Management Software (WebGUI)

HighPoint RAID Management User s Guide

SATARAID5 Serial ATA RAID5 Management Software

Definition of RAID Levels

ESATA PCI CARD. User s Manual

RAID Utility User Guide. Instructions for setting up RAID volumes on a computer with a Mac Pro RAID Card or Xserve RAID Card

UTC3100 and 3170 POS RAID Information

RAID Technology Overview

Intel Rapid Storage Technology

Mini-EPic System of RAID

Using RAID Admin and Disk Utility

NEC ESMPRO Manager RAID System Management Guide for VMware ESXi 5 or later

RocketRAID User's Manual. Revision: 1.0 Date: Mar HighPoint Technologies, Inc.

RAID Level Descriptions. RAID 0 (Striping)

USB3.0/eSATA/1394b-to-SATA II RAID SUBSYSTEM

BrightStor ARCserve Backup for Windows

Chapter 2 Array Configuration [SATA Setup Utility] This chapter explains array configurations using this array controller.

RAID-DP: NetApp Implementation of Double- Parity RAID for Data Protection

Intel RAID Controllers

esata 6G RAID PCIe 2.0 Controller Card

NetApp RAID-DP : Dual-Parity RAID 6 Protection Without Compromise

RAID-DP : NETWORK APPLIANCE IMPLEMENTATION OF RAID DOUBLE PARITY FOR DATA PROTECTION A HIGH-SPEED IMPLEMENTATION OF RAID 6

NEC Express5800 Series MegaRAID Storage Manager User's Guide - Windows Edition -

High Density RocketRAID Rocket EJ220 Device Board Data RAID Installation Guide

Sonnet Web Management Tool User s Guide. for Fusion Fibre Channel Storage Systems

Dell SAS RAID Storage Manager. User s Guide. support.dell.com

Intel RAID Basic Troubleshooting Guide

SiS 180 S-ATA User s Manual. Quick User s Guide. Version 0.1

This user guide describes features that are common to most models. Some features may not be available on your computer.

RAIDXPERT USER MANUAL. Version 2.1

RAID EzAssist Configuration Utility Quick Configuration Guide

Using Drobo for On-site and Off-site Backup with CrashPlan

SiS964 RAID. User s Manual. Edition. Trademarks V1.0 P/N: U49-M2-0E

NEC Express5800 Series Web-based Promise Array Manager User's Guide (For N /N /N Disk Array Controller)

RAID configuration and driver installation guide

DF to-2 SATA II RAID Box

USER S GUIDE. MegaRAID SAS Software. June 2007 Version , Rev. B

Intel RAID Volume Recovery Procedures

Intel Matrix Storage Manager 8.x

Mini-EPICa SATA II-to-SATA II RAID SUBSYSTEM

SiS S-ATA User s Manual. Quick User s Guide. Version 0.1

Offline Array Recovery Procedures SuperTrak SX6000 and UltraTrak

Remote PC Guide Series - Volume 2b

RAID SYSTEMS SOFTWARE USER MANUAL.

Configuration Tool and Utilities Operation Manual. for Fusion RAID Storage Systems

VIA / JMicron RAID Installation Guide

NSS Volume Data Recovery

RAID OPTION ROM USER MANUAL. Version 1.6

RAID User Guide. Edition. Trademarks V1.0 P/N: C51GME0-00

SATA RAID Function. (Only For Chipset Sil 3112 /Sil 3512 Used) USER S MANUAL 12M E-SI

Intel RAID Software User s Guide:

How To Create A Multi Disk Raid

How To Set Up A Raid On A Hard Disk Drive On A Sasa S964 (Sasa) (Sasa) (Ios) (Tos) And Sas964 S9 64 (Sata) (

SiS964/SiS180 SATA w/ RAID User s Manual. Quick User s Guide. Version 0.3

SATARAID5 Serial ATA RAID5 Management Software. Users Manual

TECHNOLOGY BRIEF. Compaq RAID on a Chip Technology EXECUTIVE SUMMARY CONTENTS

StarPort iscsi and ATA-over-Ethernet Initiator: Using Mirror (RAID1) disk device

FastTrak TX4000, FastTrak S150 TX2plus, FastTrak S150 TX4 PROMISE ARRAY MANAGEMENT ( PAM) User Manual

Nutanix Tech Note. Failure Analysis All Rights Reserved, Nutanix Corporation

Technical White paper RAID Protection and Drive Failure Fast Recovery

User Guide Online Backup

Xserve RAID Using RAID Admin 1.2 and Disk Utility Includes instructions for creating RAID arrays and monitoring Xserve RAID systems

This chapter explains how to update device drivers and apply hotfix.

User Guide - English. Embedded MegaRAID Software

SATA II 4 Port PCI RAID Card RC217 User Manual

SiI3132 SATARAID5 Quick Installation Guide (Windows version)

Serial ATA RAID PCI. User's Manual

This user guide describes features that are common to most models. Some features may not be available on your computer.

AMD SP5100 Dot Hill SATA RAID Users Manual V100

The following is a preview of an Nfina storage system with an embedded LSI RAID controller using MegaRAID Storage Manager software.

Xserve G5 Using the Hardware RAID PCI Card Instructions for using the software provided with the Hardware RAID PCI Card

USER S GUIDE. MegaRAID SAS Software. April , Rev. D

ASM_readme_6_10_18451.txt README.TXT

Adaptec Event Monitor Utility. User s Guide

WebBIOS Configuration Utility Guide

HP Array Configuration Utility User Guide

ReadyRECOVER. Reviewer s Guide. A joint backup solution between NETGEAR ReadyDATA and StorageCraft ShadowProtect

Areas Covered. Chapter 1 Features (Overview/Note) Chapter 2 How to Use WebBIOS. Chapter 3 Installing Global Array Manager (GAM)

Marvell SATA3 RAID Installation Guide

New Advanced RAID Level for Today's Larger Storage Capacities: Advanced Data Guarding

esata 3.0Gbps RAID ExpressCard

Mylex Online RAID Expansion Enhancement for DACCF Utilities 4.4 and Global Array Manager Part Number D01

Hot-Plug Disk Drive Removal and Replacement Guidelines for Compaq Smart Array Controllers

VT8237 SATA RAID User Manual

README.TXT

ITE RAID Controller USER MANUAL

Beckhoff onboard SATA-RAID

Transcription:

Maintenance Best Practices for Adaptec RAID Solutions Note: This document is intended to provide insight into the best practices for routine maintenance of Adaptec RAID systems. These maintenance best practices are recommended to all Adaptec RAID customers to help avoid data loss, maintain data integrity and minimize downtime. The use of these practices will ensure a better customer experience through maintaining the integrity of data and minimizing the costs of downtime. It is important to understand both the benefits of RAID and its limitations. Many RAID users tend to be complacent with maintenance and backups due to the common misconception that their data is foolproof and invulnerable. RAID is the most common method of data protection and most companies rely on the redundancy provided by RAID at various levels to protect them from disk drive failures. RAID s ability to protect data has become increasingly challenging with the exponential increase in drive capacities and the increased use of less reliable drives. RAID cannot protect data against virus attack, human error, data deletion, or natural or unnatural disaster. RAID cannot protect data beyond its advertised disk drive redundancy (for RAID-1, RAID-10, and RAID-5 one drive failure, for RAID-6 two drive failures, for example). Adaptec Technical Support often sees cases where an array is in a degraded state for a longer period of time and data loss then occurs when a further drive finally fails. The best RAID controller cannot help in this situation. In addition to timely maintenance, periodic backup still remains one of the most critical practices in data operations. THE EFFECT OF MODERN LARGER DISK SIZES AND DRIVE QUALITY ISSUES ON RAID Modern disk architectures have continued to evolve, as have other computer-related technologies. Disk drives are orders of magnitude larger than they were when RAID was first introduced. As disk drives have gotten larger, their reliability has not improved, and, more importantly, the bit error likelihood per drive has increased proportionally with the larger media. These three factors larger disks, unimproved reliability, and increased bit errors with larger media all have serious consequences for the ability of RAID to protect data. The risk of data loss is further compounded when lower-cost SATA disks (desktop edition drives) are employed for workloadappropriate applications. Hard drive media defects and other drive quality issues have steadily improved over time, even as drive sizes have grown substantially. However, hard drives are not expected to be totally free of flaws. In addition, normal wear on a drive may result in an increase in media defects, or grown defects, over time. The data block containing the defect becomes unusable and must be remapped to another location on the drive. If a bad block is encountered during a normal write operation, the controller marks that block as bad and the block is added to the grown defects list in the drive s NVRAM. That write operation is not complete until the data is properly written in a remapped location. When a bad block is encountered during a normal read operation, the controller will reconstruct the missing data from parity operations and remap the data to the new location. A condition known as a double fault ( bad stripe ) occurs when a RAID controller encounters a bad block on a drive in a RAID volume and then encounters an additional bad block on another hard drive in the same data stripe. This double fault scenario can also occur while rebuilding a degraded array, leaving the controller with insufficient parity information to reconstruct the data stripe. The end result is a rebuild failure with the loss of any data in that stripe, assuming the stripe is in the user data area. Page 1

In the past when drive capacities were smaller, this type of problem was less likely to occur because small arrays typically contain fewer media defects than large arrays. Medium errors are specified as x number of defects per y number of bits. Therefore, larger drives are prone to more defects because they contain a greater number of bits and the disks are written with a higher density. Today, hard drive capacities have increased remarkably, and the likelihood has grown that one or more media defects will occur over the lifespan of the drive. In addition, large arrays take longer to rebuild than small arrays, thus increasing the amount of time the array is not redundant. WHAT IS THE NEED FOR RAID-6? RAID-5 implementations use single parity to protect against single disk failure. Single disk failure is the issue. Traditional single-parity RAID offers adequate protection against a single failure event. The caveat is that no other disk failure or uncorrectable media error can occur while reconstruction is still in progress. During normal operation, if there is an unrecoverable error and the event is a read error, then recreating data from parity occurs almost instantaneously, and the array remains online. However, if a disk in a RAID-5 group fails, then all its data has to be recreated, and the array remains in a vulnerable degraded mode (without parity protection) until data has been reconstructed onto a spare disk. For a high-density or slow disk, that could be a very long time. RAID-6 with a dual-parity configuration offers a greatly enhanced data protection against any two disk failure events occurring in the same RAID group. OVERVIEW OF STEPS THAT CAN BE TAKEN IN KEEPING WITH RAID BEST PRACTICES Perform all recommended driver, controller firmware, and Storage Management application (Adaptec Storage Manager) updates. Install Adaptec Storage Manager: Adaptec Storage Manager helps you to monitor and maintain Adaptec RAID controllers, enclosures, and disk drives in your storage space from a single location. When Adaptec Storage Manager is installed on a system, the Adaptec Storage Manager Agent is also installed automatically as a service. It s designed to run in the background, without user intervention, and its job is to monitor and manage system health, event notifications, tasks schedules, and other on-going processes on that system. It sends notices when tasks are completed successfully, and sounds an alarm when errors or failures occur on that system. Please see the Adaptec Storage Manager User s Guide for more details. Run regular consistency checks on the system: Verification is designed to proactively detect hard disk media defects while the array is online and redundant. A RAID-5 or RAID-6 array is inconsistent when the data and parity do not match. Likewise, a RAID-1 array is inconsistent when the data and mirror do not match. The verification process issues commands to each drive in the array to test all sectors. When a bad sector is found, the RAID controller instructs the hard drive to reassign the bad sector, and then reconstructs the data using the other drives. The affected hard drive then writes data to the newly assigned good sector. These operations continue so that all sectors of each configured drive are checked, including hot spares. As a result, bad sectors can be remapped before data loss occurs. Page 2

Two run modes are available to help enhance flexibility and data protection: 1. Background Consistency Check (auto mode): In this mode, the tool is always on. Adaptec Storage Manager continually and automatically checks your logical drives once they re in use. Once Background Consistency Check has checked all sectors of the array, it repeats this check indefinitely. As its name indicates, Background Consistency Check is always a background or secondary process. Data I/O remains the highest priority for the RAID subsystem. Note: With this feature enabled, there may be an impact to performance. To enable Background Consistency Check using Adaptec Storage Manager: a. In the Enterprise View, right-click the controller. b. Select Background Consistency Check and then click Enable. Once enabled, the Background Consistency Check period can be adjusted: a. In the Enterprise view, right-click the controller. b. Click on Background Consistency Check, then select Change period. The Change Background Consistency Check period window opens. c. Adjust the slider control from Very Slow (365 days) to Fast (10 days). Alternatively, in the New Period field, use the arrow keys to increase or decrease the setting. d. Click OK. 2. Verify with fix (manual mode): Page 3

This mode is used to perform a single, quick check of the array. After the verification process has checked all sectors of the array, it stops and will not start again until started manually by the administrator. In manual mode, the verification process commands are given a higher priority than in Auto mode so that the check completes significantly faster. Verify with fix is a data-level check and requires more controller resources to read and compare data. Also, because of the additional resources required, verify with fix is not designed to run continuously. Rather, it should be scheduled to run at a regular interval, preferably during periods of low drive activity, or during system maintenance. To verify and fix a logical drive using Adaptec Storage Manager: a. In the Logical Devices View, right-click the logical drive. b. Select Verify with fix and confirm that you want to verify c. To begin the verification immediately, click Yes. To schedule the verification, click Schedule, and then set the date and time. You can also choose to set the verification as a recurring task. While the verification is in progress, the logical drive is shown as an animated icon to indicate that the task is in progress. When the verification is complete, an event notice is generated in the local system s event log. Monitor Storage Manager Event Logs: You can see status information and messages about the activity (or events) occurring on your storage space by checking component properties and looking at the Event Viewer and status icons in Adaptec Storage Manager. To open a full-screen version of the event log, click the Events button in the tool bar: Page 4

The event log lists activity occurring in your storage space, with the most recent event listed at the top. Double-click any event to open the Configuration Event Detail window to see more information in an easier-to-read format. Adaptec Storage Manager can be configured to send email messages (or notifications) about events on a system in your storage space. We recommend doing this if your storage space is not managed by a dedicated person, or if that particular system is off-site or not connected to a monitor. Email notifications can help you monitor activity on your entire storage space from any location, and are especially useful in storage spaces that include multiple systems running the Adaptec Storage Manager Agent only. To set up email notifications: a. In the Configure menu (on the tool bar), select the system you want, and then select Email Notifications. b. The Email Notifications window opens. The SMTP Server Settings window opens if you haven t set up email notifications previously. c. Enter the address of your SMTP server and the From address to appear in email notifications. If an email recipient will be replying to email notifications, be sure that the From address belongs to a system that is actively monitored. d. Click OK to save the settings. Page 5

e. In the Email Notifications window tool bar, click Add email recipient. The Add Email Recipient window opens. f. Enter the recipient s email address, select the level of events for which the recipient will receive an email, and then click Add. Repeat this Step to add more email recipients. Click Cancel to close the window. You can also set Adaptec Storage Manager to send status alerts about a specified system to all users who are logged into your storage space. When you set Adaptec Storage Manager to broadcast event alerts, all logged-in users receive messages about all types of events. In Windows, these alerts appear as pop-up messages; in all other operating systems, these alerts appear as console messages. Adaptec Storage Manager provides several different ways to monitor the status of your storage space. Please see the latest Adaptec Storage Manager User's Guide for details. Replace drives that have either failed completely, or are starting to show signs of failing (medium errors, S.M.A.R.T. errors, etc.) immediately. How to Find More Information For more information search the Adaptec Support Knowledgebase (ASK) for articles, troubleshooting tips, and frequently asked questions for your product. PMC-Sierra, Inc. 1380 Bordeaux Dr. Sunnyvale, CA 94089 USA Sales: 800.442.7274 www.adaptec.com Copyright PMC-Sierra, Inc. 2012. All rights reserved. PMC, PMC-SIERRA and Adaptec are registered trademarks of PMC- Sierra, Inc. Adaptec by PMC is a trademark of PMC-Sierra, Inc. Other product and company names mentioned herein may be trademarks of their respective owners. For a complete list of PMC-Sierra trademarks, see www.pmc-sierra.com/legal. Page 6