Better business decisions start with better backup Solution Overview: Data Protection Introduction Big Data is an emerging, evolving technology. To some it s that big yellow elephant in the tent holding the promise to help make sense of the terabytes, petabytes, and exabytes of data being generated. In today s 24 hours a day, seven days a week information driven economy, how can they quickly find and extract those key strategic nuggets of data to make their business more agile, make better business decisions, and give them that next competitive advantage? A well-known case in point, where a large credit card company gained that competitive edge with the advanced analytics from Hadoop by reporting they reduced the process time for 73 billion transactions, amounting to 36 TB of data, fell from one month with traditional methods to a mere 13 minutes. 1 Big Data today, as expressed by many, is simply the daily challenge virtually every enterprise IT organization faces when managing the protection of exploding data growth within shrinking backup windows, growing compliance requirements, and working hard to transform their data centers. Whether it s growing multiterabyte databases, data warehouse appliances bursting at the seams, or simply 100s of millions, or billions of files that need both fast protection and recovery, big data is Big Data and in its many forms and applications, needs to be protected. Big Data starts with the Symantec NetBackup Global Enterprise Data Protection Platform Reduce cost, risk, and accelerate time-to-value for Big Data deployments Big Data is not new to the enterprise scalability of the NetBackup Platform. Optimized for the wide range of workloads and backup storage infrastructures that exist in today s data centers, enterprise customers are turning to Symantec first, to better understand how they can architect their new Big Data deployments to leverage the wide range of advanced data protection technologies integrated into the NetBackup Platform. Starting with industry leading enterprise Backup and Recovery software, 2 they are looking to simplify and reduce the costs and risks of architecting and deploying emerging new technologies with a known, proven, and highly scalable data protection platform solution for the globally deployed enterprise. 1. Kajeepeta, Sreedhar, "How Hadoop Tames Enterprises' Big Data," InformationWeek Reports, Feb 2012. 2. Symantec statement. Symantec is named as a #1 leader in Core Storage Management based on 2011 worldwide revenue Market Share: Storage Management Software, Worldwide, 2011, Gartner (May 2012)" 1
Big Data workloads exist in virtually every enterprise today and comes in many shapes and sizes: Large numbers of files: 100s millions billions, 100 TBs to 10 PBs+ (and growing) Large data warehouses/appliances: Oracle Exadata, EMC Greenplum, Teradata, IBM Netezza Large structured databases: Oracle, SAP, IBM, and Microsoft databases Large analytics ala Hadoop: Large volumes and varieties of semi-structured/unstructured files As Big Data evolves into useful and competitive tools, in order to reduce cost, risk, and time to value, current thinking is to leverage many proven and well understood enterprise technologies already in-place and will look very much like the diagram in Figure 1. The relationship between mission-critical, transactional databases and data warehouse applications have been well understood for structured data for many years. However, the desire to apply analytics to the huge volume and variety of data being generated today, has created Hadoop-like technology to both normalize and analyze this data. Using Hadoop to normalize the data and feed it into both DBMS and Data Warehouse applications and appliances for the business analytics, can greatly accelerate deployment and usefulness of Big Data. Managing exponential data growth across these workloads is creating many new challenges for IT professionals. One major challenge, and significant inhibitor, to modernization efforts are the many workload based silos that have emerged within globally dispersed data centers. These silos are driving increased complexity, risk, and redundant CapEx and OpEx costs associated with deploying, learning, and managing disparate, non-integrated point tools trying to protect too much data with too many, different data protection solutions. Whether the workload is virtual, physical, array based, or Big Data, from a single management console, NetBackup can break down these silos and simplify the global deployment, management, and protection of virtually any workload to any storage, anywhere. 2
The NetBackup Platform provides integrated data protection and management for each of the following key components in a Big Data deployment: Comprehensive database support Symantec has a long history of extending comprehensive backup support for many databases including Oracle, SAP, IBM, and Microsoft databases soon after their introductions. SAP HANA, Sybase IQ, and Oracle Exadata also fall in this category. In early 1990s, NetBackup pioneered integration with Oracle Recovery Manager (RMAN) and influenced Oracle in RMAN development. In late 1990s, we added off-host backup feature to the Oracle Agent. Responding to the challenges of managing data growth, we were the first to fully integrate deduplication into backup software and appliances, as well as snapshot management of databases to the list of supported backup methods. For decades, we have proven performance and scalability in the largest and most challenging enterprise database environments. NetBackup agents database optimized: Oracle and Sun, SAP, Microsoft SQL Server, Active Directory, Sharepoint, Sybase, IBM DB2, Informix, Lotus Notes, Domino, and others NetBackup agents Database plus deduplication optimized: Oracle Protecting data warehouse appliances As vendors have introduced database and data warehouse based appliances, they very often have approached Symantec to better integrate with the NetBackup enterprise-class data protection platform with the end result of helping to remove customer concerns around data protection and potential barriers to adoption. This support comes through the tight integration and leverage of the Symantec Technology Enablement Program (STEP) where vendors work with Symantec in a close engineering-to-engineering relationship to produce and qualify agents for their databases that work seamlessly with the NetBackup Platform. These solutions can be found across all industries including the largest financial, manufacturing, and telecommunications companies. NetBackup agents Data warehouse appliances: Oracle Exadata, EMC Greenplum, Teradata Aster, and IBM Netezza appliances Tackling Hadoop and Big Data analytics For data stored in the Hadoop, we work with customers individually to understand the nature of the protection required and to help them architect the most appropriate solution based on the NetBackup platform. Our recommendations will vary depending on how heavily big data problems weigh on the axes of volume, velocity, variety, and variability. If data is heavily redundant, we recommend using NetBackup client side intelligent deduplication in combination with snapshots for data consistency. Examples include history-based archiving, where Hadoop is used as a dumping ground, and documents attached to the DBMS. Since the Hadoop file system is based on additions rather than updates/deletion of data, large storage savings are accomplished when applying deduplication. For most other use cases NetBackup Accelerator combined with the snapshot technology is a good method for achieving fast backup. The benefit is in getting full backup recovery performance at the speed and cost of incremental backups. Archiving into Hadoop as an inexpensive, easily accessed storage is gaining in popularity. We support this capability today for structured and unstructured data. NetBackup can easily be configured to use Hadoop as a target for the initial backup or as a backup archive through the storage lifecycle management. We also provide database archiving into Hadoop through the reselling of the Informatica Data Archive product that has capability to archive application data into Hadoop. 3
NetBackup Accelerator: High speed unstructured file backup Changed tracking plus deduplication NetBackup Snapshot/Replication Director: Zero to low production impact NetBackup Deduplication: Deduplication everywhere flexibility Client side, media server, target deduplication appliance Whether it s vast numbers of files or emerging use cases that involve the combination of a DBMS for storing structured data and Hadoop for storing accompanying non structured data, leveraging the array of capabilities available in the NetBackup Platform today to protect those workloads can simplify data protection requirements. NetBackup Platform options: Better backup for Big Data NetBackup data protection optimization Faster backup, reduce Big Data volume NetBackup Data Protection Optimization allows NetBackup 7.5 customers to use integrated data deduplication and replication features. NetBackup Data Protection Optimization radically transforms traditional deduplication by applying the best approach for a specific backup without the need for computationally intensive processing. Symantec NetBackup 7.5 with V-Ray technology includes the intelligence that gives NetBackup Intelligent Deduplication the unique ability to exactly identify the data formats and object boundaries for optimized deduplication performance. Customers can install and configure deduplication and optimized duplication (via replication available at no additional cost) from the NetBackup console. Some key benefits include: NetBackup client deduplication moves deduplication closest to the source, thereby eliminating the need to send full backup streams across the entire network NetBackup media server deduplication at the target for workloads where deduplication needs to be offloaded from the client Flexibility to deploy on commodity storage or using turnkey NetBackup Appliances Deduplicate across physical and virtual environments Seamless support for NetBackup Accelerator NetBackup Accelerator Protect 100s of millions of files NetBackup Accelerator significantly reduces the amount of resources (client I/O, time, network, and storage) that a traditional full backup takes. By using NetBackup Accelerator, a very large file system with millions or billions of files can be fully backed up in the amount of time required for an incremental backup. Systems which were problematic to back up during the backup window can now be backed up much more quickly, allowing the backup to complete in the allotted time. NetBackup Accelerator employs change tracking to dramatically reduce the file system overhead associated with traversing a large file system identifying and accessing only changed data. This reduced set of data can be deduplicated at the client or media server, further reducing the demand on network and storage resources. An optimized synthetic full backup is created and catalogued inline, providing full restore capabilities and shortened Recovery Time Objective (RTO). The net results include: Much faster backups for a significantly reduced backup window Reduced CPU and I/O overhead on the client Reduced usage of network resources Reduced usage of network resources, and a great way to send backups to cloud storage units All of these results combine to deliver lower operating expenses and reduced capital expenses. 4
Snapshot client Low impact database protection The snapshot client enables low-impact backup and recovery for databases and applications. By allowing the leveraging of a variety of software or hardware snapshot capabilities, this feature can greatly reduce the amount of data being transferred during backup and recovery, as well as dramatically improve backup performance and recovery time for mission-critical databases and applications. Big Data and Data Warehouse appliances: Teradata Teradata Extension for NetBackup integrates Teradata Data Warehouse and NetBackup operations to provide a parallel-stream backup and recovery solution for Teradata Database Warehouse systems. Scalable data protection Across 100s of nodes, 100s of terabytes Quickly move data Between Teradata system and NetBackup infrastructure Simple DBA administration Integration with the Teradata GUI lets DBA easily schedule and manage NetBackup operations Tape vault management Enables parallel copy, inline copy, site-to-site replication for disaster recovery IBM Netezza NetBackup for IBM Netezza, offered in partnership with IBM, provides centralized backup and recovery of IBM Netezza appliance databases using the XBSA API. On-demand and scheduled backups Full, differential, and incremental backups of IBM Netezza databases Optimized backup Multistream database backup and automatic backup compression Simple backup management Easy database and table recovery with GUI based monitoring and reporting Oracle Exadata and other engineered machines NetBackup supports Oracle's engineered machines through the Oracle Agent since 2008 when Oracle introduced Exadata. The NetBackup Oracle Agent is tightly integrated with the Oracle RMAN to deliver high-performance backup and recovery solutions. It supports Oracle Real Application Clusters (RAC), offers continuous data protection, and can clone Oracle databases from backup images. Online/hot t Oracle backups Keep the database online and increase reliability by eliminating manual processes and scripts RMAN integration Tightly integrated with the Oracle RMAN wizard to deliver high-performance backup and recovery Eliminate backup and recovery windows By using the NetBackup Advanced Client, backup process is offloaded to the media server eliminating the need for backup window, and data can be recovered instantly from the retained snapshot Improved Oracle deduplication Alternative backup methods which preserve the physical layout of the Oracle blocks enabling much higher deduplication rates 5
Microsoft t application and database protection: Active Directory For any enterprise, Active Directory is integral in managing network services. Therefore, the comprehensive protection and quick recovery of Active Directory is critical. Recovery of any Active Directory item Granular recovery including the ability to recover all items with full attributes (for example, user, server, printers) to satisfy both daily and disaster recovery (DR) requirements with one backup image Elimination of Active Directory server reboot Minimize downtime and keep the application up and running Ease of use and control Leverage Microsoft Visual SourceSafe (VSS) system state backup to ensure consistency when a backup is being made with Microsoft Exchange Server Symantec recognizes the need for nondisruptive mechanisms to protect Exchange data, along with all data in the enterprise. NetBackup for Exchange helps provide the performance and flexibility required for effective backup and recovery operations within large Exchange-based environments. Online backup Complete, nondisruptive protection of the Exchange database and mailbox components, including mailbox-level backup Flexible restore options Rapid, granular recovery of databases and mailboxes, including support for performing individual message restores Elimination of MAPI backups Granular recovery of the full application and a single email with only one backup pass and one copy stored SharePoint Server Ease of use One console can be used to protect SharePoint and Windows services; includes server farm configuration and single sign-on for databases Granular recovery Save time and storage by recovering SharePoint files, including different versions of a file, sites, subsites, and lists such as calendars and links SQL Server NetBackup for SQL Server delivers comprehensive data protection for SQL Server and SQL Server databases. High-speed recovery Verify-only restores can be used to verify the SQL contents of a backup image without actually restoring the data Point-in-time recovery Recovers SQL databases to the exact point in time or transaction log mark by rolling forward only the transactions that occurred before a user-specified date and time Granular database view Display of database object properties provides backup and recovery flexibility 6
Enterprise application and database protection: Oracle (Big Data/Data Warehouse appliances) The NetBackup Oracle Agent is tightly integrated with the Oracle RMAN to deliver high-performance backup and recovery solutions. It supports Oracle RAC, offers continuous data protection, and can clone Oracle databases from backup images. Online/hot t Oracle backups Keep the database online and increase reliability by eliminating manual processes and scripts RMAN integration Tightly integrated with the Oracle RMAN wizard to deliver high-performance backup and recovery Eliminate backup and recovery windows Using the NetBackup Advanced Client, backup process is off-loaded to the media server eliminating the need for backup window, and data can be recovered instantly from the retained snapshot Improved Oracle deduplication Alternative backup methods which preserve the physical layout of the Oracle blocks SAP enabling much higher deduplication rates The integration of NetBackup for SAP NetWeaver with the SAP DBA administrative interface, along with the BR backup and recovery commands, provides a solid, SAP NetWeaver-centric data protection solution for customer-specific configurations on the UNIX, Windows, and Linux platforms. Oracle RMAN support Leverage Oracle RMAN benefits when protecting SAP NetWeaver No backup window and instant recovery By using the NetBackup RealTime option, continuous data protection data can be recovered instantly and to any point in time Flexible implementation Perform database operations through NetBackup, or use the SAP tools (SAP DBA interface) independently Sybase NetBackup for Sybase offers high-performance backup and recovery for Sybase Adaptive Server Enterprise (ASE) databases on leading UNIX and Windows platforms. Parallel backup and recovery Supports the parallel backup and restore capabilities of the Sybase ASE Backup Server. This permits you to run more than one tape device at a time for a single Sybase ASE backup or restore, helping to reduce the time necessary to complete the operation Track backup history Detailed views of backup history help to simplify restores because backups of databases and transaction logs are easier to track MySQL NetBackup for MySQL, offered in partnership with Zmanda, provides centralized backup and recovery of multiple MySQL databases/servers across Linux, Oracle Solaris, and Windows operating systems. Benefits include: Scheduled full, differential, and incremental backups of MySQL database Easy database recovery to a required point in time or to any particular database event Backup compression and encryption for improved data protection and security Extensive monitoring and reporting 7
Postgre SQL NetBackup for PostgreSQL, offered in partnership with Zmanda, now provides live backup of PostgreSQL. PostgreSQL agent is integrated and certified with NetBackup 7.x using the XBSA API and supports backup of PostgreSQL versions 8.x and 9.x. This solution is fully integrated with the PostgreSQL write ahead log (WAL) to provide point-in-time recovery (PITR). EMC Documentum CYA HOTBackup, offered in partnership with CYA, tightly integrates with NetBackup to deliver a comprehensive hot backup and recovery solution for EMC Documentum. Online, hot t backup of all EMC Documentum components Including the full text index, database, and storage area keep the system online and increase reliability by eliminating manual processes and scripts CYA SmartRecovery Enables granular, hot recovery of full EMC objects including metadata and content Improve Recovery point objectives (RPO) Reduce data loss to within 15 minutes Teradata (Big Data/Data Warehouse appliances) Teradata Extension for NetBackup integrates Teradata Database and NetBackup operations to provide a parallel-stream backup and recovery solution for Teradata Database Warehouse systems. Scalable data protection Across 100s of nodes, 100s of terabytes Quickly move data Between Teradata system and NetBackup infrastructure Simple DBA administration Integration with the Teradata GUI lets DBA easily schedule and manage NetBackup operations Tape vault management Enables parallel copy, inline copy, site-to-site replication for disaster recovery IBM application and database protection: IBM DB2 UDB Universal Database NetBackup works with IBM DB2 native backup and recovery utilities to protect DB2 databases and archive logs. NetBackup offers unprecedented techniques for protecting DB2 databases from integrated block-level backup and recovery to instant recovery via the NetBackup Snapshot Client. Flexible backup options Full, incremental, and block-level backups as well as backup of archive logs via the user exit program Wizard-based configuration Performs step-by-step configuration of DB2 backup and recovery via an intuitive, graphical, wizard-based interface IBM Informix Dynamic Server NetBackup protects IBM Informix Dynamic Server and IBM Informix Extended Parallel Server running on many of today s popular operating platforms. NetBackup, in conjunction with the Informix ON-Bar utility, provides a methodology to help ensure that business-critical Informix applications are safely backed up and can be recovered quickly. 8
Continuous logical log backup Configure automated logical log backup to help prevent logs from filling up and locking up the database server Flexible backup options Backup and restore at the dbspace, blobspace, and logical log file level IBM Netezza (Big Data/Data warehouse appliances) NetBackup for IBM Netezza, offered in partnership with IBM, provides centralized backup and recovery of IBM Netezza appliance databases using the XBSA API. On demand and scheduled backups Full, differential, and incremental backups of IBM Netezza database Optimized backup Multistream database backup and automatic backup compression Simple backup management Easy database and table recovery with GUI based monitoring and reporting IBM Lotus Notes and Lotus Domino server Symantec offers high-performance backup and recovery for Lotus Notes and Lotus Domino server. Administrators can leverage intuitive GUIs to help set consistent backup policies across the enterprise and to quickly identify critical Lotus files or databases when they need to be restored. Alternate restoration techniques With backups not tied to a specific backup server, NetBackup can restore Lotus data either on an alternate system or in an alternate directory Advanced Lotus integration Support for partitioned Lotus servers and Lotus clustering Additional Symantec information manatgement solutions: Archiving and ediscovery: y:the combination of NetBackup, Symantec Enterprise Vault, and Informatica Data Archive provides customers with a robust solution for protecting and archiving large databases and applications. For more information please visit: www.symantec.com/enterprisevault Enterprise Vault white papers Websites: What s New in NetBackup: www.betterbackupforall.com Enterprise Leading Backup & Recovery software: www.symantec.com/netbackup NetBackup Appliances Simplifying Backup and Deduplication: www.symantec.com/backup-appliance NetBackup data sheets and solution briefs: NetBackup 7.5 Clients and Agents (data sheet) NetBackup 7.5 Options (data sheet) OpsCenter Analytics Global NetBackup Management (data sheet) NetBackup 7.5 for VMware (solution brief) Symantec V-Ray for Virtualization and Deduplication (video) 9
More Information Visit our website http://enterprise.symantec.com To speak with a Product Specialist in the U.S. Call toll-free 1 (800) 745 6054 To speak with a Product Specialist outside the U.S. For specific country offices and contact numbers, please visit our website. About Symantec Symantec protects the world s information and is the global leader in security, backup, and availability solutions. Our innovative products and services protect people and information in any environment from the smallest mobile device to the enterprise data center to cloud-based systems. Our industry-leading expertise in protecting data, identities, and interactions gives our customers confidence in a connected world. More information is available at www.symantec.com or by connecting with Symantec at go.symantec.com/socialmedia. Symantec World Headquarters 350 Ellis St. Mountain View, CA 94043 USA +1 (650) 527 8000 1 (800) 721 3934 www.symantec.com Copyright 2012 Symantec Corporation. All rights reserved. Symantec, the Symantec Logo, and the Checkmark Logo are trademarks or registered trademarks of Symantec Corporation or its affiliates in the U.S. and other countries. Other names may be trademarks of their respective owners. Symantec helps organizations secure and manage their information-driven world with data backup and recovery software. 21262886 09/12 10