Algorithms and Methods for Distributed Storage Networks 2. Hard Disks Christian Schindelhauer



Similar documents
File System & Device Drive. Overview of Mass Storage Structure. Moving head Disk Mechanism. HDD Pictures 11/13/2014. CS341: Operating System

HARD DRIVE CHARACTERISTICS REFRESHER

COS 318: Operating Systems. Storage Devices. Kai Li Computer Science Department Princeton University. (

Chapter 10: Mass-Storage Systems

Floppy Drive & Hard Drive

Introduction to I/O and Disk Management

Algorithms and Methods for Distributed Storage Networks 3. Solid State Disks Christian Schindelhauer

COS 318: Operating Systems. Storage Devices. Kai Li and Andy Bavier Computer Science Department Princeton University

William Stallings Computer Organization and Architecture 7 th Edition. Chapter 6 External Memory

Performance. Storage capacity. Software support. Reliability

William Stallings Computer Organization and Architecture 8 th Edition. External Memory

Chapter 12: Mass-Storage Systems

Sistemas Operativos: Input/Output Disks

Chapter Introduction. Storage and Other I/O Topics. p. 570( 頁 585) Fig I/O devices can be characterized by. I/O bus connections

1 of :01

Price/performance Modern Memory Hierarchy

Mobile HDDs. 1. Introduction. 2. Market trends in 2.5-inch ATA HDDs. (Manuscript received July 19, 2005) V Shigenori Yanagi

Chapter 9: Peripheral Devices: Magnetic Disks

Supporting Hard Drives

Table of Contents HDD/CD/DVD

Disks and RAID. Profs. Bracy and Van Renesse. based on slides by Prof. Sirer

Algorithms and Methods for Distributed Storage Networks 1. Motivation, Organization, Overview Christian Schindelhauer

Lecture 16: Storage Devices

Chapter 7: Storage Devices

Data recovery from a drive with physical defects within the firmware Service Area, unrecoverable using

Solid State Drive Architecture

Hard Disk Drives and RAID

The Future of Data Storage

Disk Storage & Dependability

Operating System Concepts. Operating System 資 訊 工 程 學 系 袁 賢 銘 老 師

DATA RECOVERY BOOK V1.0

Communicating with devices

Lecture 9: Memory and Storage Technologies

Chapter 12: Secondary-Storage Structure

Case Study: Quick data recovery using HOT SWAP trick in Data Compass

CSCA0201 FUNDAMENTALS OF COMPUTING. Chapter 5 Storage Devices

MRT Advanced HDD Repair and Data Recovery Training Course

13.1 Characteristics of the Memory Hierarchy

Outline. CS 245: Database System Principles. Notes 02: Hardware. Hardware DBMS Data Storage

1 Storage Devices Summary

Introduction Disks RAID Tertiary storage. Mass Storage. CMSC 412, University of Maryland. Guest lecturer: David Hovemeyer.

With respect to the way of data access we can classify memories as:

Difference between Enterprise SATA HDDs and Desktop HDDs. Difference between Enterprise Class HDD & Desktop HDD

A+ Guide to Managing and Maintaining Your PC, 7e. Chapter 1 Introducing Hardware

Platter. Track. Index Mark. Disk Storage. PHY 406F - Microprocessor Interfacing Techniques

Lesson Plan. Preparation

SOLID STATE DRIVES AND PARALLEL STORAGE

CSCA0102 IT & Business Applications. Foundation in Business Information Technology School of Engineering & Computing Sciences FTMS College Global

Chapter 2: Computer-System Structures. Computer System Operation Storage Structure Storage Hierarchy Hardware Protection General System Architecture

Nasir Memon Polytechnic Institute of NYU

Agenda. Enterprise Application Performance Factors. Current form of Enterprise Applications. Factors to Application Performance.

EUCIP IT Administrator - Module 1 PC Hardware Syllabus Version 3.0

Disk Geometry: How Data is Organized on a Hard Disk Drive

Chapter 7 Types of Storage. Discovering Computers Your Interactive Guide to the Digital World

Solid State Drive (SSD) FAQ

Input/output (I/O) I/O devices. Performance aspects. CS/COE1541: Intro. to Computer Architecture. Input/output subsystem.

Data Storage and Backup. Sanjay Goel School of Business University at Albany, SUNY

Q & A From Hitachi Data Systems WebTech Presentation:

Enterprise-class versus Desktopclass

IDE/ATA Interface. Objectives. IDE Interface. IDE Interface

CPS104 Computer Organization and Programming Lecture 18: Input-Output. Robert Wagner

How to Start a Data Recovery Business?

Digital Image Formation. Storage Technology

Taurus Super-S3 LCM. Dual-Bay RAID Storage Enclosure for two 3.5-inch Serial ATA Hard Drives. User Manual March 31, 2014 v1.2

This user guide describes features that are common to most models. Some features may not be available on your computer.

The role of the magnetic hard disk drive

Forensic Imaging of Hard Disk Drives

File System Management

HP Notebook Hard Drives & Solid State Drives. Identifying, Preventing, Diagnosing and Recovering from Drive Failures. Care and Maintenance Measures

Chapter 7. Disk subsystem

HP Z Turbo Drive PCIe SSD

2» 10» 18» 26» PD »

Computer Systems Structure Main Memory Organization

Disk Imaging: A Vital Step in Data Recovery

Chapter 11 I/O Management and Disk Scheduling

Distribution One Server Requirements

HDD Ghosts, Goblins and Failures in RAID Storage Systems

Data Storage at IBT. Topics. Storage, Concepts and Guidelines

WHITE PAPER FUJITSU PRIMERGY SERVER BASICS OF DISK I/O PERFORMANCE

HP Smart Array Controllers and basic RAID performance factors

The Bus (PCI and PCI-Express)

Mass Storage Structure

Computer Organization & Architecture Lecture #19

Systems Infrastructure for Data Science. Web Science Group Uni Freiburg WS 2014/15

Update: About Apple RAID Version 1.5 About this update

ESATA PCI CARD. User s Manual

Cool USB 2.0/eSATA to SATA Docking Duplicator Quick Installation Guide

Industrial Flash Storage Trends in Software and Security

Data Storage - I: Memory Hierarchies & Disks

Types Of Storage Device

ibm Product summary Ultrastar 73LZX Ultra 160 SCSI IBM storage products

Chapter 13 Selected Storage Systems and Interface

The Impact of Hard Disk Firmware Steganography on Computer Forensics

Read this before starting!

Computer Peripherals

PCI-to-SATA RAID Adapter AEC-6890M. User s Manual Version:1.0

Data Storage Solutions

How To Improve Performance On A Single Chip Computer

Transcription:

Algorithms and Methods for Distributed Storage Networks 2. Hard Disks Institut für Informatik Wintersemester 2007/08

Hard Disks History Capacity and Access Speed Prices Form factors Construction and Operation Mechanics Storage technology Low-Level Data Structures Encoding, Decoding Tracks, Cylinders LBA Interfaces ATA, SATA SCSI, SAS Fibre-Channel esata Lifetime and Disk Failures Error Management and Recovery Types S.M.A.R.T. Counter methods Special Issues Sound avoidance Data security 2

Hard Disks History 3

Evolution of Hard Disk Capacity 4

History 1956 IBM invents 305 RAMAC (Random Access Method of Accounting and Control) 5 MBytes, 24 in 1961 IBM invents air bearing heads 1970 IBM invents 8 in floppy disk drives 1973 IBM ships 3340 Winchester sealed hard drives 30 MBytes 1980 Seagate introduces 5.25 in hard disk drive 5 MBytes 1981 Sony ships first 3.25 in floppy drive 1983 Rodime produces 3.25 in disk drive 1986 Conner introduces first 3.25 in voice coil actuators 1997 Seagate introduces 7,200 RPM Ultra hard disk 1996 Fujitsu introduse aero dynamic design for lower flighing heads 1999 IBM develops the smallest hard disk of the World 1in (340 MB) 2007 Hitachi introduces 1 TB hard disk 5

History of Hard Disk Prices Figure 5 Storage system volumetric density trend Figure 6 Cost of storage at the disk drive and system level 10 1 RANGE OF POSSIBLE VALUES 1000 SYSTEMS W/LARGE FF DRIVES SYSTEMS W/SMALL FF DRIVES 10 0 100 VOLUMETRIC DENSITY, GBITS/CUBIC INCH 10 1 10 2 10 3 10 4 Technological impact of magnetic SYSTEMS W/LARGE SYSTEMS W/SMALL hard disk drives on FF DRIVES storage systems, FF DRIVES Grochowski, R. D. Halem IBM SYSTEMS JOURNAL, VOL 42, NO 2, 2003 10 5 1980 1985 1990 1995 2000 2005 2010 PRODUCTION YEAR PRICE/MBYTE, DOLLARS 10 1 0.1 0.01 0.001 SMALL FF HDD STORAGE SYSTEM DESKTOP HDD INDUSTRY PROJECTION FROM MULTIPLE SOURCES HIGH PERFOR- MANCE, SERVER HDD 0.0001 1980 1985 1990 1995 2000 2005 2010 PRODUCTION YEAR age had significantly undercut film and paper costs by the late 1990s. It is obvious that this is a principal 6 advantage to the growth and popularity of on-line (i.e., nonarchival) magnetic storage. Concurrently, there has been an accompanying increase in the quantity of DRAM found in most storage subsystems. When first introduced in the mid- Figure 7 1000 100 ABYTE) 10 Cost of storage for disk drive, paper, film, and semiconductor memory SEMICONDUCTOR MEMORY (DRAM AND FLASH)

Hard Disks Construction and Operation 7

Construction of a Hard Disk (c) Western Digital Corporation 8

Construction of a Hard Disk (c) Seagate Technology 9

Physical Components Platters round flat disks with special material to store magnetic patterns stacked onto a spindle rotate at high speed Read/Write Devices usually two per platter Actuator - old: stepper motor mechanic adjusts to discrete positions low track density still used in floppy disks - now: voice coil actuator servo system dynamicall positions the heads directly over the data tracks - Head arms are moved by the actuator to choose the tracks - Head sliders are responsible to keep the heads in a small defined distance above the platter heads fly over the platter on an air cushion - Read/write heads mounted on top of arms 10

Slider Figure 6. Illustration of Albrecht, IBM) suspension and slider. Left: schematic. Right: photograph. (Source: Tom Proceedings of the American Control Conference,Arlington, VA June 25-27, 2001 A Tutorial on Controls for Disk Drives William Messner, Rick Ehrlich MR element 11 coil / inductive head

Magnetization Techniques Longitudinal recording magnetic moments in the direction of rotation problem: super-paramagnetic effect 100-200 Gigabit per square inch Perpendicular magnetic moments are orthogonal to the rotation direction increases the data density 1 Terabit per square inch HAMR (Heat Assisted Magnetic Recording) upcoming technology Laser heats up area to keep the necessary magnetic field as small as possible 12

Electronic Components Magnetized Surface on platter Read/Write-Head Embedded controller Disk buffer (disk cache) store bits going to and from the platter read-ahead/read-behind speed matching write acceleration command queueing Interface 13

Hard Disks Low Level Data Structure 14

Tracks and Cylinders Tracks is a circle with data on a platter Cylinder is the set of tracks on all platters that are simultaneously accessed by the heads Sector basic unit of data storage angular section of a circle (c) Quantum Corporation 15

Addressing CHS (cylinder, head, sector) each logical unit is addressed by the cylinder - set of corresponding tracks on both sides of the platters head sector (angular section) old system LBA (Logical Block Addressing) simpler system all logical blocks are number the translation to CHS is 16

Adapting Sectors Zoned bit recording adapt the sector size to the bit sensity different number of sectors depending from the distance from the center Sector interleaving for cylinder switch when the arm moves then the disk continues spinning to avoid waiting times the numbering of the sectors has an offset 17 http://www.storagereview.com/guide2000/ref/hdd/geom/ trackszbr.html

Sector Format A sector is the atomic data unit of an hard disk No absolute position must be identified from its contents Contents ID Information (number and location) Synchronization fields Data ECC: Error correcting codes Gaps Specific contents varies from hard disk type 18

Formatting Low-level formatting creates the physical structures (tracks, sectors, control information) - starts from empty platter - map out bad sectors Partitioning divides the disk into logical pieces (i.e. hard disk volumes) High-level formatting logical structures for the operating-system level components 19

Encoding Problem Only the difference of orientation can be measured Because of the para-magnetic effect orientation changes need a minimum distance Long sequences of same orientation lead to errors Encoding must have long, but not too long flux reversals 20

MFM R: Flux reversal N: no flux reversal FM (Frequency Modulation) 0 -> RN 1 -> RR MFM (Modified Frequency Modulation) 0 (preceded by 0) -> RN 0 (preceded by 1) -> NN 1 -> NR 21

Run Length Limited (RLL) http://www.storagereview.com/guide2000/ref/hdd/geom/datarll.html 22

Partial Response, Maximum Likelihood (PRML) Peak detection by analog to digital conversion use multiple data samples to determine the peak increase areal density by 30-40% to standard peak detection Extended PRML further improvement of PRML http://www.storagereview.com/guide2000/ref/hdd/geom/dataprml.html 23

Hard Disks Interfaces 24

ATA (AT Attachment) Parallel connection standard, a.k.a. P-ATA evolves since 1994 ATA-1: 1994-99, up to 8.3 MB/s ATA-2: 1996-01 - PCMCIA connector ATA-3: 1997-02 - introduces S.M.A.R.T. ATA-4: 1998-, - supports CD-ROM, tape, etc. - features for solid state drives, - hidden protected area hidden against OS ATA-5: 2000-, up to 66 MB/s ATA-6: 2002-, up to 100 MB/s, automatic acoustic management ATA-7: 2005-, SATA 1.0, up to 150 MB/s ATA-8, in progress 25

SATA Serial Advanced Technology Attachmnet Serial ATA Computer bus designed for transfer of data between motherboard and mass storage device faster transfers than P-ATA, designed as successor allows removing and adding devices while operating (hot swapping) Evolution: SATA 1.5 Gbit/s SATA 3.0 Gbit/s SATA 6.0 Gbit/s 26

SCSI Small Computer System Interface offers higher data rates than SATA hides the complexity of physical format peripheral interface: 8/16 devices can be attached to a single bus buffered interface Evolution of Parallel SCSI SCSI: 1986, 5 MB/s Fast SCSI: 1994, 10 MB/s... Ultra SCSI: 1999, 160 MB/s Ultra-320 SCSI: 2002, 320 MB/s Ultra-640 SCSI: 2003, 640 MB/s Evolution of Serial SCSI SSA: 1990 40 MB/s SAS: Serial Attached SCSI, 300 MB/ s SCSI-Fibre Channel interface FC-AL 1Gb: 1993 Fibre Channel 100 MB/s FC-AL 2Gb: Fibre Channel 200 MB/s FC-AL 4Gb: Fibre Channel 400 MB/s length 500m / 3km 27

Other Interfaces esata (since 2004) variant of SATA for consumer market maximum cable length of 2m USB (Universal Serial Bus) allows hot swapping 12 or 480 MBit/s Firewire (IEEE 1394 interface) serial bus interface standard Firewire 400: ~100/200/400 MBit/s half-duplex Firewire 800: 786 MBit/s full-duplex 28

Hard Disks Lifetime and Disk Failures 29

Disk Failure Rates to be good indicators of somee drive. Filtering for spurious le set size by less than 0.1%. Failure Trends in a Large Disk Drive Population, Pinheiro, Weber, Barroso, Google Inc. FAST 2007 ure behavior of our fleet of disk onitoring data collected over a window. During this time we as well as all the available eny data and most of the SMART es themselves. Failure informanger interval (approximately five from an older repairs database. here were tested for their statisthe appropriate tests. re Rates erage Annualized Failure Rates our study, aged zero to 5 years, Figure 2: Annualized failure rates broken down by age groups ulation. The higher baseline AFR for 3 and 4 year old drives is more strongly influenced by the underlying reliability of the particular models in that vintage than by 30 disk drive aging effects. It is interesting to note that our 3-month, 6-months and 1-year data points do seem to indicate a noticeable influence of infant mortality phe-

Reasons for Failures From: www.datarecorvery.org Physical reasons scratched platter broken arm/slider hard drive motor failed humidity, smoke in the drive manufacturer defect firmware corruption bad sectors overheated hard drive head crash power surge water or fire damage Logical Reasons failed boot sector master boot record failure drive not recognized by BIOS operating system malfunction accidentally deleted data software crash corrupt file system employee sabotage improper shutdown disk repair utilities computer viruses... 31

Reasons for Failure Failure Trends in a Large Disk Drive Population, Pinheiro, Weber, Barroso, Google Inc. FAST 2007 ones. At the end of this section we discuss and reason about the usefulness of SMART p in obtaining predictive models for individual failures. We present results in three forms. First w the AFR of drives with zero and non-zero co given parameter, broken down by the same a as in figures 2 and 3. We also find it useful probability of survival of drives over the nineservation window for different ranges of para ues. Finally, in addition to the graphs, we de gle metric that could relay how relevant the a given SMART parameter are in predicting failures. To that end, for each SMART par Figure 4: Distribution of average temperatures and failures look for thresholds that increased the probabil rates. ure in the next 60 days by at least a factor o Rechnernetze respect to drives und that Telematik have zero counts for th 32 Albert-Ludwigs-Universität ter. We report such Freiburg Critical Thresholds whene able to find them with high confidence (> 95% 3.5.1 Scan Errors

S.M.A.R.T. Self-Monitoring, Analysis and Reporting Technolgoy Relevant Parameters Seek error rate - track was not hit Raw read error rate - problems in the magnetic medium hardware ECC recovered - recovered bits by error correction (not really alarming) Scan error rate - at periodic check non repairable error occurs (problems in the magnetic medium) Throughout performance - spinning rate problem Spin up time - startup time Reallocated sector count - number of used reserve sectors Drive temperature Informative parameters Start/stop count Power on hours count Load/unload cycle count Ultra DMA CRC Error Count 33

Hard Disks Special Issues 34

Landing Zones Problem When a hard disk is switched off the head can damage the platter Power loss Sudden movements - can lead to permanent damage Solution Extra landing zones - in the middle of the disk - outside of the disk at extra park situation 35

Sound Control In desktop computers the working sound can irritate and disturb Solution Extra mode with - slower actuator arm movement - slower rotation time Problem Performance slows down 36

Data Safety Problem Resold or disposed hard disk still carry sensible data Deleting data does not overwrite data Overwriting does not completely erase the information Solution Extra hardware (strong magnets, physical destruction) Cryptographic algorithms for storing Sophisticated overwriting algorithms 37

Algorithms and Methods for Distributed Storage Networks 2. Hard Disks Institut für Informatik Wintersemester 2007/08