SEIZE THE DATA. 2015 SEIZE THE DATA. 2015

Similar documents
HP Vertica on Amazon Web Services Backup and Restore Guide

MapGuide Open Source Repository Management Back up, restore, and recover your resource repository.

5054A: Designing a High Availability Messaging Solution Using Microsoft Exchange Server 2007

Perforce Backup Strategy & Disaster Recovery at National Instruments

DB2 9 for LUW Advanced Database Recovery CL492; 4 days, Instructor-led

MICROSOFT EXCHANGE best practices BEST PRACTICES - DATA STORAGE SETUP

EMC DOCUMENTUM xplore 1.1 DISASTER RECOVERY USING EMC NETWORKER

50238: Introduction to SQL Server 2008 Administration

HP PolyServe Software upgrade guide

HP Data Replication Solution Service for 3PAR Virtual Copy

SQL Server Training Course Content

Backup and Restore of CONFIGURATION Object on Windows 2008

Administering a Microsoft SQL Server 2000 Database

Designing, Optimizing and Maintaining a Database Administrative Solution for Microsoft SQL Server 2008

HP LeftHand SAN Solutions

Case Studies Using EMC Legato NetWorker for OpenVMS Backups

Course 2788A: Designing High Availability Database Solutions Using Microsoft SQL Server 2005

How To Fix A Powerline From Disaster To Powerline

MS Design, Optimize and Maintain Database for Microsoft SQL Server 2008

Configuring High Availability for VMware vcenter in RMS Distributed Setup

SQL Server Database Administrator s Guide

Upgrading Horizon Workspace

Leveraging Virtualization for Disaster Recovery in Your Growing Business

HP LeftHand SAN Solutions

Integrating Data Protection Manager with StorTrends itx

inforouter Version 8.0 Administrator s Backup, Restore & Disaster Recovery Guide

BrightStor ARCserve Backup for Windows

Maximum Availability Architecture. Oracle Best Practices For High Availability. Backup and Recovery Scenarios for Oracle WebLogic Server: 10.

ASM and for 3rd Party Snapshot Solutions - for Offhost. Duane Smith Nitin Vengurlekar RACPACK

Backup Strategies for Integrity Virtual Machines

Using HP StoreOnce D2D systems for Microsoft SQL Server backups

Cloud Attached Storage

Innovative technology for big data analytics

HP Data Protector software Zero Downtime Backup and Instant Recovery

Storage node capacity in RAID0 is equal to the sum total capacity of all disks in the storage node.

HP One-Button Disaster Recovery (OBDR) Solution for ProLiant Servers

Implementing a Microsoft SQL Server 2005 Database

MOC 5047B: Intro to Installing & Managing Microsoft Exchange Server 2007 SP1

Symantec NetBackup Blueprints

W H I T E P A P E R. Understanding VMware Consolidated Backup

EMC NETWORKER SNAPSHOT MANAGEMENT

Using Symantec NetBackup with Symantec Security Information Manager 4.5

Zero Downtime Backup solution for Oracle10g

Planning for a Disaster Using Tivoli Storage Manager. Laura G. Buckley Storage Solutions Specialists, Inc.

DB2 backup and recovery

BrightStor ARCserve Backup for Windows

Managing Cisco ISE Backup and Restore Operations

Administering a Microsoft SQL Server 2000 Database

HP External Hard Disk Drive Backup Solution by Seagate User Guide. November 2004 (First Edition) Part Number

Backups and Maintenance

HP StorageWorks EBS Solutions guide for VMware Consolidated Backup

CXS Citrix XenServer 6.0 Administration

HP Data Protector software Zero Downtime Backup and Instant Recovery. Data sheet

A Better Approach to Backup and Bare-Metal Restore: Disk Imaging Technology

Application Brief: Using Titan for MS SQL

Designing and Deploying Messaging Solutions with Microsoft Exchange Server 2010 Service Pack B; 5 days, Instructor-led

HP 3PAR Software Installation and Startup Service

ITCertMaster. Safe, simple and fast. 100% Pass guarantee! IT Certification Guaranteed, The Easy Way!

IBM TSM DISASTER RECOVERY BEST PRACTICES WITH EMC DATA DOMAIN DEDUPLICATION STORAGE

WHITE PAPER: ENTERPRISE SECURITY. Symantec Backup Exec Quick Recovery and Off-Host Backup Solutions

Computer Visions Course Outline

Business Process Desktop: Acronis backup & Recovery 11.5 Deployment Guide

WHITE PAPER PPAPER. Symantec Backup Exec Quick Recovery & Off-Host Backup Solutions. for Microsoft Exchange Server 2003 & Microsoft SQL Server

EMC NetWorker Module for Microsoft for Windows Bare Metal Recovery Solution

HP Data Protector best practices for backing up and restoring Microsoft SharePoint Server 2010

Technical Notes TECHNICAL NOTES. Release number 8.2 Service Pack REV 01. January, 2015

Backing Up the CTERA Portal Using Veeam Backup & Replication. CTERA Portal Datacenter Edition. May 2014 Version 4.0

vcenter Configuration Manager Backup and Disaster Recovery Guide VCM 5.3

Symantec Enterprise Vault 10.x for File System Archiving: Administration

SEP Disaster Recovery and Backup Restore: Best Practices

Designing a Microsoft SQL Server 2005 Infrastructure

Implementing and Managing Windows Server 2008 Clustering

Microsoft Exchange 2003 Disaster Recovery Operations Guide

Using HP StoreOnce Backup systems for Oracle database backups

HP Storage Data Migration Service

Best Practices for HP Vertica OEM Customers

Veeam Backup and Replication Architecture and Deployment. Nelson Simao Systems Engineer

EMC Backup and Recovery for Microsoft SQL Server 2008 Enabled by EMC Celerra Unified Storage

SEP Disaster Recovery and Backup Restore: Best

BrightStor ARCserve Backup for Linux

SAP Note FAQ: SAP HANA Database Backup & Recovery

Managing and Maintaining a Windows Server 2003 Network Environment

Extending the Power of Your Datacenter

How to use Data Protector 6.0 or 6.10 with Exchange Recovery Storage Groups to restore a single mailbox

HP SiteScope. HP Vertica Solution Template Best Practices. For the Windows, Solaris, and Linux operating systems. Software Version: 11.

Oracle Recovery Manager 10g. An Oracle White Paper November 2003

Backup Assistant. User Guide. NEC NEC Unified Solutions, Inc. March 2008 NDA-30282, Revision 6

HP STORAGEWORKS ENTERPRISE BACKUP SOLUTIONS (EBS)

Managing and Maintaining Windows Server 2008 Servers (6430) Course length: 5 days

EMC NetWorker Module for Microsoft Applications Release 2.3. Application Guide P/N REV A02

Microsoft Exchange Server 2007, Upgrade from Exchange 2000/2003 ( /5049/5050) Course KC Days OVERVIEW COURSE OBJECTIVES AUDIENCE

Administering a Microsoft SQL Server 2000 Database

PROTECTING MICROSOFT SQL SERVER TM

Transcription:

1 Copyright 2015 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.

BIG DATA CONFERENCE 2015 Boston August 10-13 Vertica Backup and Restore Ramesh Narayanan, Vertica Professional Services Aug 10, 2015

Module Overview Backup and Restore Copy Vertica Database Online Recovery 3 Copyright 2015 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.

Backup and Restore 4 Copyright 2015 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.

Backup - Overview Backup is the process of copying the actual data files to a specified location Vertica data and backup files are written once Once a file is written Vertica will not update it Number of files increase with each backup Tuple Mover keeps the number of files under control The TM mergeout process consolidates smaller ROS containers into larger ones To backup, copy Vertica files to stable storage Can be direct attached storage, NFS mounts or SAN Those files can then be moved to tape backup or integrated with other tools 5 Copyright 2015 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.

Backup When? Backup is the process of copying the actual data files to a specified location Part of Regular Disaster Recovery Strategy Nightly, weekly, depending on business continuity requirements and resources After loading or altering a large volume of data Before Maintenance Tasks Upgrading to another version of Vertica Dropping a Partition Before and after adding, removing or replacing nodes 6 Copyright 2015 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.

Backup and Restore Options There are several ways to take a Vertica Backup Backup and Restore by Database Most common backup process Backs up the entire database which includes all the schemas and objects within them Backup and Restore by Schema Multi-tenant database with different backup frequency Multi-application cluster with different backup requirements /policies Backup and Restore by Table Can be used to backup some critical tables Restore certain tables for QA / Testing Backup frequency depends on the criticality / tolerance of data loss / recovery 7 Copyright 2015 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.

Vertica Backup Restore VBR vbr.py is a Python script located under /opt/vertica/bin Use vbr.py with various options to take backup and restore data Create a configuration file vbr.py --setupconfig Goes into interactive mode, gathers all parameters and creates the configuration file VBR parameters Database name, schema name, snapshot name, object names Restore points, backup location, node names, temporary directories etc. 8 Copyright 2015 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.

vbr.py setupconfig options 9 Copyright 2015 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.

vbrtest.ini 10 Copyright 2015 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.

Vertica Backup Restore VBR A few parameters explained Snapshot Name stores all the files under that named directory Restore Points number of incremental backups stored in addition to full backup Node Names of nodes in the cluster Data is backed up from each node of the cluster Backup Directory Location where the backup files are stored If it is NFS mount, a separate directory for each node gets created under the backup directory 11 Copyright 2015 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.

VBR preparation Steps and some prerequisites Backup location to be configured on all the nodes Verify database is running Ensure backup hosts are running if data is backed up to those hosts Backup can be done to the same cluster nodes Backup can also be done to a dedicated host which has the SAN storage Backup Directory Permissions / Contents Ensure that the user who starts the backup process has write permissions Backup directory contains sub-directories for each node (if NFS location) Under the Backup directory VBR creates the sub-directory for each snapshot The full backup and each incremental backups are stored in separate directories 12 Copyright 2015 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.

Performing a Backup How to run the vbr.py script vbr.py --task backup --config-file <myconfigfile> Same command is used for full and incremental backups First run does a full backup All data files are copied to the sub-directory with the snapshot name Subsequent runs are incremental Copies files which have changed since last backup Files are only added or deleted, never modified Each incremental backup goes into a separate sub-directory with a timestamp Each incremental backup also adds those files to the full backup 13 Copyright 2015 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.

VBR Process Infographics 14 Copyright 2015 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.

Performing a Restore The same vbr.py script is used for restore vbr.py --task restore --config-file <myconfigfile> The configuration file is the same that is used for the Backup Restore can be specific Entire database, specific schema or table depending on the configuration file used Vertica copies the files from backup location to the data directory location Some key features Vertica does not have the concept of transaction logging There is no roll forward or roll back of transactions Objects can be restored to the timestamp of the last snapshot 15 Copyright 2015 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.

Copy Vertica Database 16 Copyright 2015 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.

Copy Vertica Database This option of VBR copies the entire Database (cluster) to a target cluster When do we need copycluster? Maintain a warm-standby cluster for Disaster Recovery Provide an alternative cluster to a different set of users / applications Prerequisites Source and Target cluster must have same number of nodes Database, node names and dbadmin user have to be the same on both sides Password-less ssh has to be established between all the nodes on both sides Target database has to be shut down before starting the process vbr.py --task copycluster --config-file <cfgfile> The task runs as one continuous transaction 17 Copyright 2015 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.

Online Recovery 18 Copyright 2015 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.

Node Recovery Vertica is highly available MPP architecture, but nodes may go down Node can recover from failure A node can rebuild its data set from other nodes in the cluster if the cluster is K-safe In a full recovery the node rebuilds from scratch Incremental Recovery Node rebuilds from the current persisted state To speed up a full recovery, use a prior backup for the given node and perform incremental recovery RAID 10 is best practice RAID arrays (5,6,10) can be rebuilt without impact to other cluster nodes 19 Copyright 2015 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.

Monitor Recovery Monitor disk space df h SELECT * FROM v_monitor.disk_storage; Monitor Recovery tail vertica.log SELECT * FROM v_monitor.recovery_status; 20 Copyright 2015 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.

QUESTIONS? Please attend our Q&A with HP Big Data experts today Marina Ballroom, Lobby level 10:15 am 10:30 am 12:00 pm 1:00 pm 2:30 pm 3:00 pm 4:30 pm 5:00 pm