Technical white paper Protect Microsoft Exchange databases, achieve long-term data retention HP StoreOnce Backup systems, HP StoreOnce Catalyst, and Symantec NetBackup OpenStorage Table of contents Introduction... 2 Technology overview... 2 HP StoreOnce Backup systems Key features and benefits... 2 HP StoreOnce Catalyst and Symantec NetBackup OpenStorage (OST)... 3 HP StoreOnce Catalyst seamless data movement across the enterprise... 3 HP StoreOnce Backup systems in small to large data centers... 4 Advantages of using HP StoreOnce B6200 Backup systems for Microsoft Exchange backups... 4 HP StoreOnce Backup systems for Microsoft Exchange backup using Symantec NetBackup... 4 Symantec NetBackup infrastructure components... 4 Capacity planning... 5 Symantec NetBackup schedule type characteristics with an HP StoreOnce B6200 Backup system... 7 Symantec NetBackup block size effect on Microsoft Exchange backup throughput and deduplication ratio... 8 DR with local HP StoreOnce Backup system and remote replication... 11 Symantec NetBackup replication using HP StoreOnce Catalyst... 11 Recovery scenarios... 11 Recommendations... 12 Conclusion... 13 Useful links... 13 Learn more... 13
Introduction In today s business environment, customers rely on the most efficient, high performing, and reliable backup systems to protect critical business information, such as Microsoft Exchange. Customers need to protect increasing levels of data while keeping costs under control. HP StoreOnce Backup systems provide a disk-based data protection platform while addressing data growth by applying HP StoreOnce deduplication software for efficient, longer term data retention. The HP StoreOnce B6200 Backup system, the latest deduplication appliance in the HP StoreOnce product line, provides a unique combination of features, including industry-leading performance (up to 100 TB/hr), high-availability, and high-capacity making the HP StoreOnce B6200 Backup system the industry leader in the enterprise deduplication sector. HP StoreOnce Catalyst software was developed to dramatically improve the performance, function, and integration of backup applications such as Symantec NetBackup. HP StoreOnce Catalyst delivers deduplication on an appliance server, media server, or dedicated appliance. Since it uses the same deduplication algorithm globally, data can be moved between platforms without rehydration. HP StoreOnce Catalyst allows better utilization of advanced, disk-based storage solutions while increasing efficiency and performance. This document describes the benefits of using HP StoreOnce B6200 Backup systems combined with HP StoreOnce Catalyst software and Symantec NetBackup to backup Microsoft Exchange data. This document also recommends backup and recovery implementations. The following are key recommendations for backing up data to an HP StoreOnce B6200 Backup system utilizing HP StoreOnce Catalyst software: Increased Exchange database backup speed: Use HP StoreOnce Catalyst with Symantec NetBackup OST to deduplicate backup data on the client for improved Exchange database backup throughput performance. Improved backup storage utilization: Backup multiple Exchange database files simultaneously for quickly rising deduplication ratios that are maintained over time. Trade-off between backup impact and ease of recovery: Configure a weekly full with daily incremental or initial full with daily incremental backup schedule to reduce the amount of end-to-end data and decrease the time required to run daily backups. HP StoreOnce Backup systems data deduplication will further reduce the end-to-end data size. Efficient and cost effective movement of backup data offsite: Use the HP StoreOnce remote replication feature to seamlessly replicate Exchange server data to an HP StoreOnce Backup system in a remote facility for simpler recovery in the event of a disaster. HP StoreOnce Backup systems are a disk-based backup system that deliver leading price-performance and deduplicates enterprise backup data. The HP StoreOnce Backup system can be used to automate and consolidate multiple backups onto a single, rack mountable device while improving reliability by reducing errors caused by media handling. For business environments with remote offices, or a disaster recovery (DR) site, the HP StoreOnce Backup system can be used to replicate data to a central data center or remote facility. HP StoreOnce Backup systems are ideal for mission-critical application backup data for small to large data centers running key business applications, such as Microsoft Exchange. Proper backup configurations with a data protection application to the HP StoreOnce Backup system provide the best backup throughput performance and data deduplication ratios. HP StoreOnce Backup systems integrate seamlessly into current IT environments and offer the flexibility of VTL and NAS targets, as well as Catalyst stores. Technology overview HP StoreOnce Backup systems Key features and benefits HP StoreOnce deduplication store more data on disk HP StoreOnce deduplication reduces the disk space required to store backup data sets without impacting backup performance. Retaining more backup data on disk for longer periods of time enables greater data accessibility for rapid restore of lost or corrupt files and reduces downtime. Deduplication ratios are strongly influenced by two factors data change rate and backup data retention periods. Low data change rates and data retained for longer periods of time yield higher deduplication ratios. Deduplication enabled replication HP StoreOnce Deduplication is the technology enabler for HP StoreOnce Deduplication-enabled replication which allows fully automated replication over links to a disaster recovery site, giving Remote Office/Branch Office (ROBO) and small data centers a cost effective DR solution for the first time. 2
Rapid restore of data for dependable, worry-free data protection HP StoreOnce Backup systems offer immediate access to backups for rapid restores. HP StoreOnce deduplication allows more data to be stored closer to the data center for longer periods of time which offers immediate access for rapid restores. Automate, simplify, and improve the backup process HP StoreOnce Backup systems automate the backup processes allowing reduced time spent managing data protection. Implementing hands-free, unattended daily backup is especially valuable for environments with limited IT resources, such as remote or branch offices. HP StoreOnce systems can backup multiple servers via a standard Ethernet or Fibre Channel network simultaneously to a disk-based solution at peak speeds of up to 100 TB/hr instead of sequentially to a tape drive or autoloader, meaning that substantially reduced backup windows are possible. HP StoreOnce systems can be intuitively managed and configured by using the built-in Web browser s administrative interface. For larger deployments of replicating HP StoreOnce appliances, the HP StoreOnce Replication Manager can monitor multiple backup systems across geographies. HP StoreOnce systems are self-managing backup appliances that require little, if any, routine maintenance. Unlike other disk-based storage devices, HP StoreOnce systems do not require virus protection or LUN provisioning. HP StoreOnce Catalyst and Symantec NetBackup OpenStorage (OST) HP StoreOnce Catalyst HP StoreOnce Catalyst is an object-based storage target on an HP StoreOnce Backup system that offers client-side deduplication for use with backup and recovery applications. NetBackup OST OST is a Symantec backup interface that allows intelligent storage devices, like the HP B6200 HP StoreOnce Backup system, to work with Symantec s NetBackup software. OST provides NetBackup administrators with advanced capabilities, such as optimized duplication and DR copy creation. HP OST plug-in for Symantec NetBackup The HP OST plug-in is installed on the NetBackup media servers. It uses an HP StoreOnce Catalyst client-side library interface to interact with the HP StoreOnce Backup systems. HP StoreOnce Catalyst seamless data movement across the enterprise HP StoreOnce Catalyst brings the HP StoreOnce vision of a single, integrated enterprise-wide deduplication algorithm a step closer. It allows the seamless movement of deduplicated data across the enterprise to other HP StoreOnce Catalyst systems without rehydration. This means benefits can be seen from: Simplified management of data movement from a single pane of glass: Tighter integration with the backup application to centrally manage file replication across the enterprise. Seamless control across complex environments: This supports a range of flexible configurations that enable the concurrent movement of data from one site to multiple sites, and the ability to cascade data around the enterprise (sometimes referred to as multi-hop). Enhance performance: Distributed deduplication processing using HP StoreOnce Catalyst stores on the B6200 and on multiple servers can optimize loading and utilization of backup hardware, network links and backup servers for faster deduplication and backup performance. Note Actual performance is dependent upon configuration data set type, compression levels, number of data streams, number of devices emulated and number of concurrent tasks, such as housekeeping or replication. 3
HP StoreOnce Backup systems in small to large data centers Advantages of using HP StoreOnce B6200 Backup systems for Microsoft Exchange backups Easier setup to protect, manage and access data Provides a single portable data deduplication algorithm that can run on an application server, media server, or dedicated appliance Faster backup and restoration of more data across more usage models Reduces workload on IT infrastructure, thus reducing CPU and memory requirements on the HP StoreOnce B6200 Backup system allowing for faster throughput performance Allows more backup data to be retained on disk for longer periods Improved functionality, performance and total cost of ownership (TCO) while migrating data protection environments from disparate small systems into scalable HP StoreOnce Backup systems Improved performance, function, and integration of backup applications such as Symantec NetBackup HP StoreOnce Backup systems for Microsoft Exchange backup using Symantec NetBackup An important part of server administration is maintaining a consistent set of backup data, which should be available for recovery. When data is lost due to user error, system failure, or catastrophic site failure, there is a need for complete server recovery along with application data recovery. Symantec NetBackup provides data protection for multiple, heterogeneous servers and operating systems. The backup data can be consolidated to a single HP StoreOnce Backup system leveraging 10 Gb Ethernet and Fibre Channel speed. HP StoreOnce Backup systems integrated with a well-planned data protection strategy include regular backups to maintain a consistent set of data for recovery purposes. Symantec NetBackup infrastructure components Table 1. Symantec NetBackup architecture Component Master Server Media Server Client SAN Client Description The Master Server manages backups, archives, and restores and is responsible for media and device selection. The Media Server provides additional storage by allowing NetBackup to use the storage devices that are attached to them. A Client is any system that requires protection. A NetBackup Client sends and receives data across the local area network (LAN) to and from a NetBackup Master/Media Server. A Media Server can also be considered a Client. A SAN Client contains critical data that requires high bandwidth for backups. It connects to a NetBackup media server over Fibre Channel. 4
Figure 1 shows an example of Symantec NetBackup architecture connections with an HP StoreOnce Backup system. Figure 1. Symantec NetBackup architecture with a HP StoreOnce Backup system Capacity planning The required backup storage capacity for server backups depends on the following: Size of Exchange databases and number of servers Backup retention policy (recovery points needed) Type of backups (full, incremental, differential) Frequency of backups Data rate of change The deduplication ratio achieved by the HP StoreOnce B6200 Backup system HP StoreOnce B6200 Backup systems do not deduplicate across Catalyst stores. Each Catalyst store is an independent deduplication domain. Note The rate of change of a data refers to the amount of data that would be contained in an incremental backup as a percentage of a full backup. A 100 GB full backup with a subsequent 5 GB incremental backup before the next full backup would be a five percent rate of change. In performing these tests HP used an Exchange server with four database files and four log folders. An administrator may desire to have two weeks of Exchange backup data stored on the HP StoreOnce Backup system for quick recovery access. The HP OpenStorage (OST) plug-in for Symantec NetBackup has two write mode options for transferring data to HP StoreOnce B6200 Backup systems: Low bandwidth (client-side deduplication ON) High bandwidth (client-side deduplication OFF) 5
Figure 2 shows the throughput performance comparison between HP StoreOnce Catalyst and high bandwidth transfer policies. Figure 2. HP StoreOnce Catalyst throughput performance comparisons HP StoreOnce Catalyst throughput 1% rate of change high bandwidth vs. Microsoft Exchange data Backup Throughput Low bandwidth High bandwidth 1 2 3 4 5 6 7 8 9 10 11 12 13 14 Number of backups Figure 3 shows the data rate of change effect on deduplication ratios when backing up Microsoft Exchange data using HP StoreOnce Catalyst. The transfer policy was used and the deduplication is taking place at the media server (client-side deduplication). Figure 3. HP StoreOnce Catalyst rate of change effect on HP StoreOnce deduplication ratios HP StoreOnce Catalyst deduplication ratios relative to the data rate of change Expected Deduplication Ratio 1% 2% 3% 4% 5% Data rate of change between each backup 6
Symantec NetBackup schedule type characteristics with an HP StoreOnce B6200 Backup system Many Exchange backup environments take advantage of Symantec NetBackup s different backup schedule types such as: Weekly full with daily incremental Note Data compaction refers to the removal of redundant information from a backup set prior to storing on a backup device. Incremental backups, deduplication, and compression are all methods for removing redundant data from a backup set. Each backup shows the overall size of the backup data without deduplication vs. the size of the data on the HP StoreOnce B6200 Backup system after deduplication with Catalyst. Figure 4 illustrates the end-to-end data compaction of full and incremental Microsoft Exchange backups, varying streams, and high bandwidth backups to an HP StoreOnce B6200 Backup system over a period of 20 days. Figure 4. Data compaction comparison of different backup types impacting deduplication ratios. HP StoreOnce Catalyst data compaction variations impacting deduplicationratios Terabytes Client backup Size on HP StoreOnce Deduplication single stream Weekly full + daily incremental single stream two streams four streams single stream high bandwidth 7
Figure 5. Magnified view to show different deduplication ratios HP StoreOnce Catalyst data compaction variations impacting deduplication (magnifiedto show different deduplication ratios) Terabytes Client backup Size on HP StoreOnce Deduplication single stream Weekly full + daily incremental single stream two streams four streams single stream high bandwidth Symantec NetBackup block size effect on Microsoft Exchange backup throughput and deduplication ratio Symantec NetBackup supports varying block sizes for backing up data. Larger block sizes usually result in better throughput and an increase in data deduplication ratios. HP recommends keeping the block size set at 256 and above for better throughput. Figure 6 illustrates how Microsoft Exchange backup with HP StoreOnce Catalyst affects throughput by using higher block size (using ). Figure 6. Varying block sizes vs. backup throughput (using ). Microsoft Exchange with HP StoreOnce Catalyst throughput varying block sizes 1% rate of change Backup Throughput 128K Block Size 256K Block Size 512K Block Size 1 2 3 4 5 6 7 8 9 10 11 12 13 14 Number of backups 8
Figure 7 illustrates little deduplication difference between using various block sizes with Microsoft Exchange backups and HP StoreOnce Catalyst. For the chart below, data was updated between each backup until the desired rate of change was reached. Figure 7. Varying block sizes vs. deduplication ratios (using ). Microsoft Exchange with HP StoreOnce Catalyst deduplication ratio Varying block sizes 1% rate of change Expected Deduplication Ratio 128K Block Size 256K Block Size 512K Block Size 1 2 3 4 5 6 7 8 9 10 11 12 13 14 Number of backups Figure 8 shows how backup performance throughput increased when multiple data streams were used with an Exchange configuration with four database files. Figure 8. Backup performance increases with multiple data streams (using ). Microsoft Exchange with HP StoreOnce Catalyst throughput Varying streams 1% rate of change Backup Throughput 1 Stream 2 Streams 4 Streams 1 2 3 4 5 6 7 8 9 10 11 12 13 14 Number of backups 9
Other factors affecting throughput: Data multiplexing: When multiple Exchange database files backup in parallel to a single device, multiplexing results in better utilization of the device which helps in better overall throughput and reduction of the backup window. With the help of multiplexing, it is possible to backup multiple database files simultaneously. Streams and data writers: A data stream can be thought of as a data channel that connects Exchange database files to the storage media. Multiple streams running in parallel through a single channel increases the rate at which data can be written to storage media Figure 9 illustrates how deduplication ratios increase when multiple data streams are used. Figure 9. Deduplication ratios increase with multiple data streams (using ). Microsoft Exchange with HP StoreOnce Catalyst deduplication ratio Varying streams 1% rate of change Expected Deduplication Ratio 1 Stream 2 Streams 4 Streams 1 2 3 4 5 6 7 8 9 10 11 12 13 14 Number of backups Other factors affecting deduplication ratio: Backup policies: Retaining Exchange data for longer periods of time improves the chance that common data will already exist in storage, resulting in greater storage savings and better deduplication ratio. Compression/Encryption: Software compression and encryption prevents optimal deduplication from taking place since data that is already compressed and encrypted cannot be efficiently deduplicated. Data types: To improve deduplication ratio, configure individual Catalyst stores with Symantec NetBackup for individual server/type of operating systems. 10
DR with local HP StoreOnce Backup system and remote replication Most companies recognize the importance of a robust data protection strategy. Exchange customers are likely to invest in local server s recovery, as well as, site DR at a remote site using replication. Many companies, large and small, are protecting servers in remote offices where untrained IT staff are expected to manage a daily backup process generally involving the changing of physical tapes, which is a process prone to more time consumption, resource draining, inefficient and human error. Replicating large volumes of Exchange data backup over a typical WAN is expensive. However, today s products with data deduplication have made it possible to replicate Exchange data over lower bandwidth links for a more cost effective, network-efficient replication solution that provides a practical disaster recovery solution and an ideal solution for centralizing the backup of remote offices. Data deduplication shrinks the amount of Exchange backup data that needs to be replicated from the source HP StoreOnce appliance, and as a result significantly reduces replication bandwidth requirements. Once a replica of the Exchange data backup set has been created on a remote HP StoreOnce target appliance all that is required to keep the replica identical to the source is the automatic, periodic copying and movement of the new data segments which are created during each backup. With such small amounts of data being transmitted asynchronously, lower bandwidth networks offer sufficient performance and a much lower cost solution. Note Replication of data can only occur between devices within the same product family i.e., HP StoreOnce B6200, but not VLS. Symantec NetBackup replication using HP StoreOnce Catalyst One of the key features HP StoreOnce Catalyst stores provide is allowing NetBackup to utilize duplication (or copy) jobs. NetBackup allows duplicate backup jobs to function between Catalyst stores, allowing NetBackup complete control over the backup data lifecycle. This is accomplished by using NetBackup s Storage Lifecycle Policies to control the lifecycle of backup data. Replicating backup data between HP StoreOnce Backup systems is accomplished by properly configuring a Storage Lifecycle Policy to contain a set of sequential jobs, one being the backup job and the other being a duplicate (or copy) job. Therefore, whenever the backup job completes the duplicate (or copy) job is executed to the alternate HP StoreOnce Backup system. There is flexibility in doing data recovery, depending upon the situation or type of failure (see figure 9 below). For instance: An Exchange server can be recovered at the HP StoreOnce source site (original server location). In the event of an Exchange server source site disaster, the target site HP StoreOnce can be shipped to the source site or the Exchange backup data can be replicated back to the source site for complete database recovery. An Exchange database can be recovered at the HP StoreOnce target site (remote location). Recovery scenarios Figure 10 illustrates disaster recovery scenarios that may occur and the recovery path available when replicating between HP StoreOnce Backup systems. 11
Figure 10. Recovery scenarios Client recovery scenarios Scenario 1 Disk failure Scenario 2 Site failure Scenario 3 Site failure Exchange Server Exchange Server Exchange Server Source site 10GbE Source site 10GbE Source site 10GbE Source StoreOnce Source HP StoreOnce Source StoreOnce Source HP StoreOnce Source HP StoreOnce Scenario 1 Recovery Scenario 2 Recovery Scenario 3 Recovery at target site 1 Recover Exchange Server data volume Exchange Server 1 Recover source site hardware Exchange Server 1 Integrate target HP StoreOnce with backup application. Exchange Server 10GbE 10GbE 10GbE Source site Source site Target site Source Source HP StoreOnce Target Source HP StoreOnce Source Target StoreOnce HP StoreOnce 2 Recover Exchange Server database 2 Ship target HP StoreOnce to source site and integrate with backup application 4 3 Recover Exchange Server database 2 Recover Exchange Server database from HP StoreOnce replicated backup image using target site HP StoreOnce Target HP StoreOnce Recommendations Increased backup speed By enabling source-side deduplication (), HP has the advantage of performing deduplication at the data creation point. Source-side deduplication eliminates the need for specialist deduplication hardware at remote and branch office sites. HP StoreOnce Catalyst allows customers to align backup with data protection needs, such as minimizing bandwidth utilization when moving data between sites or data centers. If backup throughput performance is the highest priority, use Symantec NetBackup multiplexing to send multiple streams to the HP StoreOnce Backup system. Using multiple data readers/writers to send multiple Exchange databases simultaneously to the HP StoreOnce Backup system will also improve throughput performance. Do not multiplex if increased deduplication ratios are desirable Backup one server at a time by disabling NetBackup multiplexing and client-side deduplication to achieve the best possible deduplication ratio. Backup block size Use a larger block size for improved backup throughput performance. The backup block size used by Symantec NetBackup may have a small effect on data deduplication ratios on the HP StoreOnce Backup system. or weekly full with daily incremental backups backups deduplicate at a much higher rate than weekly full with daily incremental, but require more server and HP StoreOnce processing resources and SAN/LAN bandwidth. End-to-end data compaction is greater for incremental backup schedules over an extended time period, which means less storage space will be used on the HP StoreOnce Backup system. 12
Disaster recovery HP StoreOnce remote replication offers a replication solution to and from remote sites, which is ideal for server disaster recovery. Backups through Symantec NetBackup to a HP StoreOnce pair configured with remote replication provides recovery for local disk failures, complete server failure or complete site failures by keeping server backup copies at local and remote sites. Conclusion Microsoft Exchange customers demand an efficient, reliable data growth management backup system environment while keeping costs under control. HP provides a variety of reliable data protection storage solutions that address such requirements. HP StoreOnce Catalyst, along with Symantec NetBackup OST, is one such solution. HP StoreOnce Backup systems offer high performance and reliability, while addressing data growth through HP StoreOnce data deduplication technology. In addition, Symantec NetBackup s data protection solution brings together a full generation of traditional and next generation data protection from backup, to disk to replication management, to tape under one platform. In all, HP StoreOnce Backup systems integrate easily with Symantec NetBackup to protect important Microsoft Exchange data and other mission-critical applications. Useful links HP StoreOnce Backup hp.com/go/storeonce HP StoreOnce Backup system user guide http://bizsupport1.austin.hp.com/bc/docs/support/supportmanual/c02295179/c02295179.pdf HP StoreOnce Backup systems Linux and UNIX configuration guide http://h20000.www2.hp.com/bc/docs/support/supportmanual/c02299831/c02299831.pdf HP StoreOnce Backup systems best practices guide http://bizsupport2.austin.hp.com/bc/docs/support/supportmanual/c02511912/c02511912.pdf HP StoreOnce Catalyst Solution Service Technical data sheet http://h20195.www2.hp.com/v2/getpdf.aspx/4aa4-4489enw.pdf Customer brochure http://h20195.www2.hp.com/v2/getpdf.aspx/4aa4-4480enw.pdf Symantec NetBackup backup and recovery documentation http://www.symantec.com/netbackup Learn more Protect your data and other mission-critical applications such as Microsoft Exchange with HP StoreOnce Catalyst and Symantec NetBackup OpenStorage (OST). To learn how, visit hp.com/go/storeonce. Sign up for updates hp.com/go/getupdated Rate this document Copyright 2013 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice. The only warranties for HP products and services are set forth in the express warranty statements accompanying such products and services. Nothing herein should be construed as constituting an additional warranty. HP shall not be liable for technical or editorial errors or omissions contained herein. Microsoft is a U.S. registered trademark of Microsoft Corporation. 4AA4-5278ENW, February 2013