Data Loss in a Virtual Environment An Emerging Problem



Similar documents
Kroll Ontrack Data Recovery. Oracle Data Loss: When the best of plans fail

The 9 Ugliest Mistakes Made with Data Backup and How to Avoid Them

Leveraging Virtualization for Disaster Recovery in Your Growing Business

Open Source Mainframe Backup with Hercules

Business Continuity: Choosing the Right Technology Solution

Kroll Ontrack VMware Forum. Survey and Report

MS Design, Optimize and Maintain Database for Microsoft SQL Server 2008

NETWORK SERVICES WITH SOME CREDIT UNIONS PROCESSING 800,000 TRANSACTIONS ANNUALLY AND MOVING OVER 500 MILLION, SYSTEM UPTIME IS CRITICAL.

The Microsoft Large Mailbox Vision

DEFINING THE RIGH DATA PROTECTION STRATEGY

Designing, Optimizing and Maintaining a Database Administrative Solution for Microsoft SQL Server 2008

Whitepaper - Disaster Recovery with StepWise

Recovering Microsoft Exchange Server Data

Virtual Desktop Infrastructure

Four Steps to Disaster Recovery and Business Continuity using iscsi

Disaster Recovery Planning

Complete Storage and Data Protection Architecture for VMware vsphere

(Formerly Double-Take Backup)

The case for cloud-based disaster recovery

Best Practices in Disaster Recovery Planning and Testing

This white paper describes the three reasons why backup is a strategic element of your IT plan and why it is critical to your business that you plan

Expert Reference Series of White Papers. Visions of My Datacenter Virtualized

Management in Today s Regulatory Environment

WHITE PAPER. Storage Savings Analysis: Storage Savings with Deduplication and Acronis Backup & Recovery 10

Protecting enterprise servers with StoreOnce and CommVault Simpana

Evaluation of Enterprise Data Protection using SEP Software

Real-time Protection for Hyper-V

Every organization has critical data that it can t live without. When a disaster strikes, how long can your business survive without access to its

SQL SERVER ADVANCED PROTECTION AND FAST RECOVERY WITH EQUALLOGIC AUTO-SNAPSHOT MANAGER

METALOGIX REPLICATOR FOR SHAREPOINT: Supporting Government and Military Missions Worldwide

Using HP StoreOnce Backup Systems for NDMP backups with Symantec NetBackup

Data recovery Data management Electronic Evidence

Protecting Microsoft Hyper-V 3.0 Environments with CA ARCserve

Using HP StoreOnce D2D systems for Microsoft SQL Server backups

The Benefits of Continuous Data Protection (CDP) for IBM i and AIX Environments

Cloud Backup and Recovery

BACKUP STRATEGY AND DISASTER RECOVERY POLICY STATEMENT

Database as a Service (DaaS) Version 1.02

Disaster Recovery for Small Businesses

SAN Conceptual and Design Basics

DeltaV Virtualization High Availability and Disaster Recovery

Disaster Recovery. Stanley Lopez Premier Field Engineer Premier Field Engineering Southeast Asia Customer Services and Support

DATASHEET. Proactive Management and Quick Recovery for Exchange Storage

WHITE PAPER ONTRACK POWERCONTROLS. Recovering Microsoft Exchange Server Data

Maximizing Deduplication ROI in a NetBackup Environment

VMware vsphere Data Protection 5.8 TECHNICAL OVERVIEW REVISED AUGUST 2014

Availability and Disaster Recovery: Basic Principles

The Difference Between Disaster Recovery and Business Continuance

Protect Microsoft Exchange databases, achieve long-term data retention

Virtualization Essentials

Data Protection and Recovery

Recovering Microsoft Office SharePoint Server Data

Using HP StoreOnce Backup systems for Oracle database backups

One major business challenge is maintaining and improving the efficiency and effectiveness of a company s information technology. Wouldn t it be nice

How To Restore From A Backup In Sharepoint

Combining Onsite and Cloud Backup

Cloud, Appliance, or Software? How to Decide Which Backup Solution Is Best for Your Small or Midsize Organization.

DISASTER RECOVERY AND CONTINGENCY PLANNING CHECKLIST FOR ICT SYSTEMS

WhitePaper. Private Cloud Computing Essentials

BUSINESSES NEED TO MAXIMIZE PRODUCTIVITY, LOWER COSTS AND DECREASE RISKS EVERY DAY.

IBM TSM DISASTER RECOVERY BEST PRACTICES WITH EMC DATA DOMAIN DEDUPLICATION STORAGE

Course Syllabus. Maintaining a Microsoft SQL Server 2005 Database. At Course Completion

Ensuring your DR plan does not Lead to a Disaster

Solution Brief: Creating Avid Project Archives

Veritas Cluster Server by Symantec

Disaster Recovery Strategies: Business Continuity through Remote Backup Replication

Recovering Microsoft Office SharePoint Server Data. Granular search and recovery means time and cost savings.

Introduction. Setup of Exchange in a VM. VMware Infrastructure

Understanding data backups: why SMEs need them

Barracuda Backup Server. Introduction

Why Fails MessageOne Survey of Outages

Planning and Deploying a Disaster Recovery Solution

High Availability and Disaster Recovery Solutions for Perforce

Backup and Recovery for Microsoft Hyper-V Using Best Practices Planning. Brien M. Posey

WHITE PAPER. Building Blocks of the Modern Data Center

BACKUP ESSENTIALS FOR PROTECTING YOUR DATA AND YOUR BUSINESS. Disasters happen. Don t wait until it s too late.

VDI can reduce costs, simplify systems and provide a less frustrating experience for users.

Protecting Citrix XenServer Environments with CA ARCserve

50x Zettabytes*

Affordable Remote Data Replication

EMERGENCY PREPAREDNESS PLAN Business Continuity Plan

EMC Replication Manager and Kroll Ontrack PowerControls for Granular Recovery of SharePoint Items

CA ARCserve Replication and High Availability Deployment Options for Hyper-V

VMware vsphere Data Protection 6.0

Managing business risk

Transcription:

Data Loss in a Virtual Environment An Emerging Problem Solutions to successfully meet the requirements of business continuity. An Altegrity Company

2 3 4 5 Introduction Common Virtual Data Loss Scenarios Recent Virtualization Data Loss Case Studies Data Loss Consequences 1 Copyright 2011 Kroll Ontrack Inc. All Rights Reserved. Kroll Ontrack, Ontrack and other Kroll Ontrack brand and product names referred to herein are trademarks or registered trademarks of Kroll Ontrack Inc. and/or its parent company, Kroll Inc., in the United States and/or other countries.

Introduction The terms business continuity and disaster recovery have been blurred over the years and this has created confusion for businesses who are attempting to protect their business operations. A business continuity plan is a comprehensive policy that ensures all of a business departments can successfully operate with minimal or limited impact during a disruptive event. 1 The disaster recovery plan and emergency response procedures are generally part of a larger business continuity plan. The advent of virtualization technology has enabled business continuity planning and execution for many organizations. However, virtualization technology is complex and requires specialized skill and knowledge from both IT staff and management. In fact, if deployed or managed carelessly, virtualization can itself create business disruptions or data disasters as outlined by the graphs below. Traditional System If deployed or managed carelessly, virtualization can itself create business disruptions or data disasters. 74 % Machine 26 % Virtual System Human Source of Failure: Human vs. Machine Human error Lack of education System and hardware failure Environmental (power outage, over-voltage, etc.) 35 % Machine 65 % Human (figure 1) According to Forrester Research s report on the business state of disaster recovery preparedness, a joint effort with the Disaster Recovery Journal, 2 many organizations have improved their disaster recovery capabilities over the past few years. Despite a slow economy, survey respondents reported an increased confidence in being prepared for a data center disaster or site failure. Seventy-six percent of survey respondents reported no disaster or major disruption in the past five years, yet Forrester Researcher reports that companies should not take comfort in this statistic. Instead, it should serve as a wake-up call because a whopping 25 percent of companies are likely to declare a disaster. Furthermore, 1 For purposes of this article a business disruption is anything that prevents day-to-day work from being done, including power disruption, downed phone lines, and so forth. A data disaster occurs when data is corrupted. Hence, a data disaster is a subset of business disruption. 2 Forrester Research s 2010 report on the business state of disaster recovery preparedness, a joint effort with the Disaster Recovery Journal, http://www.drj.com/images/surveys_pdf/forrester/2011forrester_survey.pdf 2

business disruptions are much more common than declared disasters. Getting an organization to declare a disaster can be a matter of perspective, according to Don Stewart, director of professional services at Ongoing Operations, a non-profit business continuity service provider for U.S. credit unions. In some events, IT is so focused on fixing the problem that they don t inform senior management of the disaster event, Stewart relates. Some organizations have not defined what a business disruption is, therefore senior management will hesitate to declare a disaster if the event is perceived to be minor; for instance in the case of a phone system failure, or delays in e-mail messaging. Fifteen percent of respondents knew the cost of their business downtime; it averaged nearly $145,000 USD per hour. Staying prepared requires more than having a documented business continuity plan; it requires teamwork from all stakeholders. Having a stake in planning at this level ensures that business operations would be maintained in the event of a disruption. Stewart recommends that a good plan starts with a risk impact analysis. Most companies, according to Stewart, will purchase an in-depth risk assessment and then do nothing about it; the report just sits there with no further actions being taken. This is as effective as making a list of essentials to pack in a kit in case of a house fire but never assembling the kit. Common Virtual Data Loss Scenarios When data loss happens within a virtual data center it is usually due to human error. Other virtual data loss events result from hardware failure and are exacerbated by the lack of a disaster recovery plan. Disaster recovery plans that are weak or not regularly tested force IT staff to focus on un-tested repairs based on faulty troubleshooting. Obviously, nobody wants a data loss or business disruption on the systems they are responsible for. Too often, a serious data loss or business disruption results in unemployment for all responsible or thought to be responsible. Other data loss scenarios have been due to overconfidence in a SAN s redundancy. Precious recovery time is lost when it is discovered that important backups are corrupted or not readable during the middle of a disaster. This is the worst possible time to learn that backups have been failing or that backup software has not been reporting media failures during backup sessions. Reference the pie chart provided by Kroll Ontrack that provides a breakdown of 2010 virtual data loss types: Virtual Data Loss Types 3 6 % All Other 13 % Format & Reinstall 28 % Virtual Disk Corruption 28 % 25 % RAID/Hardware Fault Deleted Files on Virtual File System (figure 2)

Recent Virtualization Data Loss Case Studies A virtualization data loss can be catastrophic for an organization. Determining the financial impact of a business disruption is difficult because there are both tangible factors, including productivity loss, missed sales opportunities and staff s hourly time, but also less tangible factors to downtime such as potential non-compliance penalties, damage to corporate image and weakened customer confidence. The previously quoted Forrester-DRJ survey noted that 15 percent of respondents knew the cost of their business downtime; it averaged nearly $145,000 USD per hour. That is an estimate that would make any director or CIO take notice of the readiness of their business continuity plan. Virtualization technology can compound those losses as illustrated by the following cases. The Case of the Reformatted Server A business in Italy recently experienced a business disruption when their 4TB virtual host server lost access to the storage system. The virtual environment contained 40 virtual machines in a mixed operating system environment; some were Linux, a few were legacy UNIX systems and the remainder consisted of Microsoft Windows servers. These supported business application servers, web servers and database servers. The virtual host server was operating a Linux-based hypervisor with two 2TB LUNs attached. At some point, the storage LUNs were reformatted. The reason for the reformat wasn t divulged, but the damage to the existing file system structures was severe and extensive. During the reformat process, the Linux storage manager writes EXT file system metadata in predefined areas throughout the volume. This metadata contains only a couple of thousand bytes of information, yet the impact upon the virtual host server s file system and virtual disk files was devastating. Each virtual machine had four to six virtual disk files totaling 70-90 virtual disk files stored by the host server. Some of the virtualized Microsoft Windows servers employed dynamic disk volume configurations (i.e., logical volume manager in Linux parlance) between multiple virtual disk files, further complicating recovery efforts. After the organization s IT department exhausted all internal resources, a professional data recovery firm was engaged to recover the data. Despite the damage, virtual disk files were found and critical data was restored. It is industry best practices combined with IT management procedures that ensure data protection. The Case of the Disastrous Data Merge A United States business merger suffered a disaster while the two company s IT departments were merging their data. Evidence suggests the disaster was caused by employee sabotage and the cause is still under investigation by computer forensic investigators. The first company s virtual host server held over 400 virtual machines across 20 storage LUNs. During the data merge, someone with administrative access to the virtual host server systematically deleted the 400 virtual machines and their virtual disk files, causing the loss of over 440 virtual disk files and over a thousand snapshot files. 4

The merging company quickly engaged emergency data recovery services and prioritized core servers that provided essential services. In three days, those systems were up and running. For the next two weeks emergency recovery efforts continued on the rest of the storage system. This required extensive recovery engineering efforts to search the unallocated areas of the storage LUN for potential virtual disk files, identifiable only by their file system attributes. Through a combined effort of backup restoration and original volume recovery, data was recovered. Most of the virtual disk files were complete, while other virtual disks required the file contents to be extracted due to file system damage. The Case of the Off-Site SAN Reformat Disaster recovery efforts went from bad to worse for a company in Luxemburg. During routine maintenance on the company s SAN storage that housed its virtual machines, the SAN was presented to a different physical server by accident. When the SAN storage was identified as unknown, the volume was automatically reformatted. Initially, some staff panicked due to the potential for data loss. They were relieved when they were reminded of the identical SAN storage located off-site which employed the SAN equipment s automated site replication technology. It was thought that this would be a minor business disruption. Upon logging into the remote SAN, the IT team discovered that the remote SAN was an identical copy of the primary site; the SAN s automated site replication technology had not been disabled prior to the maintenance. Thus, when the reformat occurred at the primary site, the secondary SAN was reformatted as well. Through the efforts of experienced data recovery engineers, the virtual machines and virtual disk files were successfully recovered. This organization did not have any backups because it was assumed that dual storage architecture and site replication mechanisms provided complete data and system redundancy. This case is especially compelling because storage equipment features provided a false sense of security. In reality, it is industry best practices combined with IT management procedures that ensure data protection. 5

Data Loss Consequences Virtualization technology has revolutionized the IT industry and has delivered on the promise of a reduction in facility expenses and equipment costs. According to IDC s worldwide tracking of external disk storage systems, total disk storage capacity shipped was over 5,100 petabytes a 55.7 percent increase over the previous year. 3 This continued growth requires IT management to maintain disaster recovery documentation and to exercise recovery plans regularly. This will minimize or eliminate business disruptions due to data loss within virtualized environments. As more storage is consumed by virtualization technology, additional attention must be given to the management and protection of virtual assets. Maintaining business continuity by having well planned and tested disaster recovery plans is essential. Successful organizations realize that any disruption within the virtual infrastructure, regardless of how small, will have an amplified impact on the business as a whole. This has led IT leaders and business continuity planners to proactively include data recovery services in their contingency plans. Choosing a data recovery service vendor before a disaster occurs prepares the IT team for a successful survival of a business disruption caused by a data disaster. 3 Worldwide Disk Storage Systems Finishes 2010 with Double-Digit Growth on Strong Fourth Quarter Results, IDC, March, 2011 6

For more information, call or visit us online. 800.872.2599 in the U.S. and Canada +1.952.937.5161 www.krollontrack.com Copyright 2011 Kroll Ontrack Inc. All Rights Reserved. Kroll Ontrack, Ontrack and other Kroll Ontrack brand and product names referred to herein are trademarks or registered trademarks of Kroll Ontrack Inc. and/or its parent company, Kroll Inc., in the United States and/or other countries. All other brand and product names are trademarks or registered trademarks of their respective owners. XXXXX