Tape Drive Data Compression Q & A



Similar documents
IBM Half High LTO Gen 4 SAS Tape Drive

Quantum Corporation Solution Guide

technology brief RAID Levels March 1997 Introduction Characteristics of RAID Levels

IBM LTO Full High Ultrium 4 Tape Drives, T1600P for OEM customers

FUJIFILM Recording Media U.S.A., Inc. LTO Ultrium Technology

encoding compression encryption

Data Sheet FUJITSU Storage ETERNUS LT260 Tape System

Sony LTO Data Media Storage Tapes

QUICK REFERENCE GUIDE: KEY FEATURES AND BENEFITS

Full Drive Encryption with Samsung Solid State Drives

Storage Backup and Disaster Recovery: Using New Technology to Develop Best Practices

QuickSpecs. HP Ultrium Media Overview

Keys to Successfully Architecting your DSI9000 Virtual Tape Library. By Chris Johnson Dynamic Solutions International

Datasheet Fujitsu ETERNUS LT20 S2

QuickSpecs. HP Ultrium Media Overview

Reference Guide WindSpring Data Management Technology (DMT) Solving Today s Storage Optimization Challenges

Data Sheet FUJITSU Storage ETERNUS LT20 S2 Tape System

Information, Entropy, and Coding

Tape Storage at Sun. Dwayne Edling Manager Archive Development Sun Microsystems, Inc.

STORAGE SOURCE DATA DEDUPLICATION PRODUCTS. Buying Guide: inside

Optimizing LTO Backup Performance

Comparison of Native Fibre Channel Tape and SAS Tape Connected to a Fibre Channel to SAS Bridge. White Paper

Don t Get Duped By Dedupe or Dedupe Vendors

Data Storage Solutions

Real-time Compression: Achieving storage efficiency throughout the data lifecycle

Implementing a Digital Video Archive Using XenData Software and a Spectra Logic Archive

A Comparative TCO Study: VTLs and Physical Tape. With a Focus on Deduplication and LTO-5 Technology

balesio Native Format Optimization Technology (NFO)

Streaming Lossless Data Compression Algorithm (SLDC)

WHITE PAPER WHY ORGANIZATIONS NEED LTO-6 TECHNOLOGY TODAY

Hardware Configuration Guide

XenData Archive Series Software Technical Overview

PowerPoint Presentation to Accompany. Chapter 3. File Management. Copyright 2014 Pearson Educa=on, Inc. Publishing as Pren=ce Hall

Implementing Offline Digital Video Storage using XenData Software

William Stallings Computer Organization and Architecture 8 th Edition. External Memory

Frequently Asked Questions

Implementing an Automated Digital Video Archive Based on the Video Edition of XenData Software

Implementing a Digital Video Archive Based on XenData Software

RAID Basics Training Guide

How To Make A Backup System More Efficient

Price List - Ultrium LTO Data Cartridges Fujifilm, HP, IBM, SONY, Quantum, Maxell, Dell

Solutions for Encrypting Data on Tape: Considerations and Best Practices

BrightStor ARCserve Backup for Windows

LTFS for Microsoft Windows User Guide

Understanding Compression Technologies for HD and Megapixel Surveillance

Enterprise Backup and Restore technology and solutions

Summer Student Project Report

actitape LTO-3 and LTO-4 internal SCSI and SAS Quick Start Guide

IBM ^ xseries ServeRAID Technology

Data Deduplication in Tivoli Storage Manager. Andrzej Bugowski Spała

FAQs. Getting started with the industry s most advanced compression technology. when it counts

Comparison of different image compression formats. ECE 533 Project Report Paula Aguilera

An Authorized Duplicate Check Scheme for Removing Duplicate Copies of Repeating Data in The Cloud Environment to Reduce Amount of Storage Space

Chapter 8. Secondary Storage. McGraw-Hill/Irwin. Copyright 2008 by The McGraw-Hill Companies, Inc. All rights reserved.

Considerations when Choosing a Backup System for AFS

Backup and Disaster Recovery in Schools

How Do We Archive Our Digital Data?

1. Redistributions of documents, or parts of documents, must retain the SWGIT cover page containing the disclaimer.

NetApp Data Compression and Deduplication Deployment and Implementation Guide

ANALYSIS OF THE EFFECTIVENESS IN IMAGE COMPRESSION FOR CLOUD STORAGE FOR VARIOUS IMAGE FORMATS

RAID Storage System of Standalone NVR

Answered Questions regarding techsafe Online Storage Rev Technology Partners

Chapter 8 Memory Units

Subject: Request for Information (RFI) Franchise Tax Board (FTB) Security Information and Event Management (SIEM) Project.

Parallels Virtuozzo Containers

Backup and Recovery: The Benefits of Multiple Deduplication Policies

Backup Software Data Deduplication: What you need to know. Presented by W. Curtis Preston Executive Editor & Independent Backup Expert

William Stallings Computer Organization and Architecture 7 th Edition. Chapter 6 External Memory

2014 Electrical Server Installation Guide

LTO tape technology continues to evolve with LTO 5

A Proposal for OpenEXR Color Management

MDL 100S Product Description

GFI Backup 2010 Business Edition. Getting Started Guide

DELL RAID PRIMER DELL PERC RAID CONTROLLERS. Joe H. Trickey III. Dell Storage RAID Product Marketing. John Seward. Dell Storage RAID Engineering

TECHNOLOGY BRIEF. Compaq RAID on a Chip Technology EXECUTIVE SUMMARY CONTENTS

Ten Reasons Why You Should Consider Enterprise-Class Tape for Open Systems Storage

In addition, a decision should be made about the date range of the documents to be scanned. There are a number of options:

Digital Video: A Practical Guide

Video compression: Performance of available codec software

RAID Levels and Components Explained Page 1 of 23

Tape Devices Common Configuration for Tape Devices

A block based storage model for remote online backups in a trust no one environment

WHITE PAPER Improving Storage Efficiencies with Data Deduplication and Compression

7i Imaging on Demand PACS Solution FAQ s

Demystifying Deduplication for Backup with the Dell DR4000

Transcription:

Tape Drive Data Compression Q & A Question What is data compression and how does compression work? Data compression permits increased storage capacities by using a mathematical algorithm that reduces redundant strings of data. There are many different compression schemes that can be performed by software or hardware. For tape drives, the compression algorithm is typically implemented in hardware and will remove redundancy from data by encoding patterns of input characters more efficiently. The more repeat patterns in the data, the more the data can be compressed. Thus the amount of data compression obtained will vary depending upon the characteristics of the data. Question Can I assume that I will get the advertised 2:1 compressed capacity? No; you may get more or less. A more definitive statement that could be used for advertising capacity, which assumes 2:1 data compression, would be Assuming Your Data Compresses 2:1. Question Why wouldn t my tape drive get the capacity indicated at the advertised compression ratio? Your data is unique not like anyone else s. Therefore when your data is compressed the ratio will also be unique. The amount of compressed data stored on your tape will be different; the capacity is unpredictable. Furthermore, data with random bit strings, encrypted or pre-compressed data are unlikely to show any capacity improvement from the tape drive compression. Capacity can even decrease slightly when highly random data is compressed (see page 5 for more on this subject). Question Do all tape drives feature hardware based data compression? No; check your drive s manual for feature information relating to the exact model-number of your drive or contact the drive manufacturer. Page 1 of 5

Question Some tape drive vendors claim > 2:1 data compression. Why? The effects of data compression vary, because some types of data are more compressible than others. Different mathematical compression algorithms are better at compressing certain data types than others. Some vendors may conclude that their compression scheme is more powerful; however, it may not be more powerful for your mix of data. When comparing technologies, it is best to compare native-to-native capacity figures. Compression claims may or may not be relevant to your application. Question My backup software features data compression and my tape drive also has a compression feature. Should I use software compression and/or the tape drive s hardware based compression? When compression is available in the tape drive hardware/firmware, the compression feature of any software package should be turned off. This reduces the processing burden on your computer. Also, hardware based compression is typically much more effective. Do not double compress; compressed files could even be bigger after compression. This can even cause the tapes to hold less-than their native capacity. Some tape drives prevent lowering capacity due to files that grow larger from compression by writing the uncompressed file when this happens. The drive tags each block (compressed/not compressed) for appropriate handling during playback e.g., LTO Ultrium drives have this feature. See page 5 for more information. Question Does data compression affect the tape drive s data transfer rate? Sustained data transfer rates will increase at the same ratio as data capacity when employing hardware based data compression (up to the speed of the compression hardware). The same may not be true for compression implemented by software running on your computer. Software-based compression typically degrades performance, since the software or separate utility has to compress the data before saving it to tape. Retrieval time is also longer, since the software must decompress. Today s hardware-based compression chips are typically many times faster than the tape drives they work with. Therefore compression does not introduce any overhead into the writing or reading processes. Since compression reduces the number of bits written to or read from the tape, significant performance gains are realized. Page 2 of 5

Question Does the tape media need to store more data bytes with compression? No; the tape may appear to hold twice as much data, but the capacity (density) of bits/bytes written on the tape media will not be increased. Question Can data compressed on my drive be read on other [same type] drives? Yes, if you used the drive s hardware based compression process and the other drive has the compression feature. All drives within a family of drives will employ the same data compression hardware. If another compression capable drive can read uncompressed data from your device, it will be able to decompress your compressed data too. Of course if you are using a software program to compress your data, the same software program must be available to the other drive. Question What is decompression? An inverse procedure of the compression software or the compression hardware that returns compressed data to its original size and condition. Question What is loss-free compression? Loss-free compression schemes allow full recovery of the original data. Only loss-free (lossless) algorithms are used for computer data, insuring that decompressed data output is exactly the same as the uncompressed data input. Question What is lossy compression? Lossy compression is not applicable to tape drives used by computers. Lossy compression allows some of the original data to be lost. Lossy compression can be used where the data is meant to be interpreted by humans (not computers). Employing lossy compression schemes can provide a much better compression ratio. Lossy compression can be used for data such as graphic images, video, pictures, photographic images and music. Question How do I compute my Compression Ratio? Compression ratio is the ratio of a file s original size divided by its compressed size. A compression ratio of 2:1 means the compressed file is ½ as large as the original file. Page 3 of 5

Data Storage Tape Drive Hardware Examples Example Data Compression Features & Issues: Important Considerations When Troubleshooting DLTtape Capacity Issues: Troubleshooting DLTtape Capacity Issues Question Why do some of my DLTtape IV data cartridges hold less than 80 GB when used on my DLT8000 drive [or < 70 GB on a DLT7000, etc.]? There are three possible reasons: First, those values assume 2:1 compression of your data and your data (or some of your data) may not be compressible at the typical 2:1 ratio. See above questions and answers for more information on compression. Second, if the data cartridge was used previously on a DLT4000 drive, and your software isn t set up to change the 20/40 GB format on a write from beginning of tape (BOT), this will cause the DLT8000 drive to write in its 20 GB native (40 GB compressed) backward compatible mode. Not only will this cause capacity to be reduced to the DLT4000 value, but also the drive will write at the slower transfer-rate of the DLT4000, which is only 1.5 MB/second. Likewise for prior use on a DLT7000 drive, if your software isn t set up to change the existing DLT7000 format, this will cause the DLT8000 drive to write data in its DLT7000 backward compatible mode (lower capacity and lower speed). Refer to Fujifilm s Technical Support Document, DLTtape Data Cartridge Initial Calibration for more information concerning density format compatibility between the different DLT drive models. Third, native capacity (without compression turned on) is 20 GB, 35 GB or 40 GB for DLTtape IV on DLT4000, DLT7000 and DLT8000 drives, respectively. If you are writing without the drive s compression turned on, this will limit capacity to the native value. Perhaps you (or the software application) did not specify for the drive to use compression or maybe compression was automatically turned off because the host computer system could not meet the drive s transfer-rate requirements. Consult with your drive hardware manufacturer or your application-software manufacturer concerning these possibilities. Page 4 of 5

Question I get less than the advertised Native capacity on my backup tapes; how could this be possible? There are several common possibilities: 1) Compression could be expanding your data: Data that is already highly compressed or random data such as encrypted data or seismic data (i.e. geoseismic data recorded during exploration for oil) can actually expand if you write with the tape drive's compression turned on. However, some tape drive families have a pass-through feature for incompressible data. Some tape drive technologies (tape drive families), such as 34XX (mainframe tape) and LTO Ultrium, limit expansion of data by automatically switching compression off, if the data cannot be compressed. Data compression automatically switches back on, when data becomes compressible again. Both compressed and not-compressed data can be recorded on a tape; data will be marked accordingly for proper treatment during playback. Such intelligence prevents the expansion of previously compressed or otherwise incompressible data. Perhaps your data is incompressible and you are trying to use the compression feature of a tape drive that does not have this intelligent compression feature. This can cause a 5-10 percent reduction in capacity capability. 2) You could have compression off and have the application software set to limit the data capacity to less than the full capability: Many software applications will have a feature that allows a user to limit the capacity of their data cartridge tapes to less than full capacity. This feature is often employed when the user wants to make one-to-one copies of the recorded tapes. All tapes have an allowable +/ variation in tape media length. A tape recorded to its full capacity using a slightly longer tape, may not copy onto another slightly shorter tape. By limiting capacity for all original recordings to something slightly less than the full capacity capability, copies will always fit on one tape when duplicating. Perhaps you (or someone before you) set this software feature to limit the capacity of your data cartridges to some percentage of full capacity. 3) If the Read/Write head on your tape drive is dirty or worn to near its end-of-life, high error rewrite activity (skipping small sections of tape) can reduce capacity. For More Data Storage Tape Technical Support Documents and Product Information, go to: www.fujifilmusa.com/tapestorage Resource Center Technical Center. For questions or comments, go to Ask the Expert in the Technical Center. Page 5 of 5