SanDisk ION Accelerator High Availability



Similar documents
SAN Conceptual and Design Basics

Veritas Storage Foundation High Availability for Windows by Symantec

Oracle Acceleration with the SanDisk ION Accelerator Solution

Symantec Storage Foundation High Availability for Windows

Improve Business Productivity and User Experience with a SanDisk Powered SQL Server 2014 In-Memory OLTP Database

Getting Started with Endurance FTvirtual Server

A SURVEY OF POPULAR CLUSTERING TECHNOLOGIES

SQL Server Storage Best Practice Discussion Dell EqualLogic

An Oracle White Paper January A Technical Overview of New Features for Automatic Storage Management in Oracle Database 12c

EMC VPLEX FAMILY. Continuous Availability and data Mobility Within and Across Data Centers

Best practices for fully automated disaster recovery of Microsoft SQL Server 2008 using HP Continuous Access EVA with Cluster Extension EVA

The functionality and advantages of a high-availability file server system

High Availability with Windows Server 2012 Release Candidate

EMC VPLEX FAMILY. Continuous Availability and Data Mobility Within and Across Data Centers

Solution Brief Availability and Recovery Options: Microsoft Exchange Solutions on VMware

High Availability Database Solutions. for PostgreSQL & Postgres Plus

Building a Flash Fabric

High Availability and MetroCluster Configuration Guide For 7-Mode

Implementing Storage Concentrator FailOver Clusters

Using EonStor FC-host Storage Systems in VMware Infrastructure 3 and vsphere 4

High Availability & Disaster Recovery Development Project. Concepts, Design and Implementation

Cisco Active Network Abstraction Gateway High Availability Solution

VERITAS Storage Foundation 4.3 for Windows

EMC Virtual Infrastructure for SAP Enabled by EMC Symmetrix with Auto-provisioning Groups, Symmetrix Management Console, and VMware vcenter Converter

VBLOCK SOLUTION FOR SAP: HIGH AVAILABILITY FOR THE PRIVATE CLOUD

Scale and Availability Considerations for Cluster File Systems. David Noy, Symantec Corporation

Creating A Highly Available Database Solution

Implementing disaster recovery solutions with IBM Storwize V7000 and VMware Site Recovery Manager

Availability Digest. MySQL Clusters Go Active/Active. December 2006

Module: Business Continuity

Appendix A Core Concepts in SQL Server High Availability and Replication

HRG Assessment: Stratus everrun Enterprise

Disaster Recovery Solution Achieved by EXPRESSCLUSTER

EMC RECOVERPOINT FAMILY

High Availability with Postgres Plus Advanced Server. An EnterpriseDB White Paper

Neverfail for Windows Applications June 2010

Step-by-Step Guide to Open-E DSS V7 Active-Active Load Balanced iscsi HA Cluster

Contingency Planning and Disaster Recovery

Westek Technology Snapshot and HA iscsi Replication Suite

StarWind Virtual SAN Installation and Configuration of Hyper-Converged 2 Nodes with Hyper-V Cluster

HGST Virident Solutions 2.0

Informatica MDM High Availability Solution

Administrator Guide VMware vcenter Server Heartbeat 6.3 Update 1

Long-Distance Configurations for MSCS with IBM Enterprise Storage Server

A virtual SAN for distributed multi-site environments

Windows Host Utilities Installation and Setup Guide

Microsoft SQL Server Acceleration with SanDisk

Oracle Database Disaster Recovery Using Dell Storage Replication Solutions

Network-based Intelligent Data Protection Yossi Mossel, Product Manager

Whitepaper Continuous Availability Suite: Neverfail Solution Architecture

Using VMWare VAAI for storage integration with Infortrend EonStor DS G7i

Best Practices Guide: Network Convergence with Emulex LP21000 CNA & VMware ESX Server

Rajesh Gupta Best Practices for SAP BusinessObjects Backup & Recovery Including High Availability and Disaster Recovery Session #2747

W H I T E P A P E R. Disaster Recovery Virtualization Protecting Production Systems Using VMware Virtual Infrastructure and Double-Take

VMware Site Recovery Manager with EMC RecoverPoint

Creating SANLess Microsoft SQL Server Failover Cluster Instances with SIOS DataKeeper Cluster Edition and SanDisk Fusion iomemory

High Availability Solutions for the MariaDB and MySQL Database

High Availability for Citrix XenApp

Building Highly Available OpenZFS Storage Appliances Grenville Whelan

Total Disaster Recovery in Clustered Storage Servers

EonStor DS remote replication feature guide

Enterprise Linux Business Continuity Solutions for Critical Applications

Setup for Failover Clustering and Microsoft Cluster Service

Step-by-Step Guide to Open-E DSS V7 Active-Active iscsi Failover

Building Reliable, Scalable AR System Solutions. High-Availability. White Paper

DATA CENTER. Best Practices for High Availability Deployment for the Brocade ADX Switch

SQL SERVER ADVANCED PROTECTION AND FAST RECOVERY WITH EQUALLOGIC AUTO-SNAPSHOT MANAGER

Setup for Failover Clustering and Microsoft Cluster Service

Disaster Recovery for Oracle Database

An Oracle White Paper November Oracle Real Application Clusters One Node: The Always On Single-Instance Database

Table of contents. Matching server virtualization with advanced storage virtualization

Symantec and VMware: Virtualizing Business Critical Applications with Confidence WHITE PAPER

SQL SERVER ADVANCED PROTECTION AND FAST RECOVERY WITH DELL EQUALLOGIC AUTO SNAPSHOT MANAGER

VMware vsphere Data Protection 5.8 TECHNICAL OVERVIEW REVISED AUGUST 2014

Informix Dynamic Server May Availability Solutions with Informix Dynamic Server 11

HIGH AVAILABILITY STRATEGIES

Virtual SAN Design and Deployment Guide

High Availability Storage

Dell High Availability Solutions Guide for Microsoft Hyper-V

MaximumOnTM. Bringing High Availability to a New Level. Introducing the Comm100 Live Chat Patent Pending MaximumOn TM Technology

DISASTER RECOVERY STRATEGIES FOR ORACLE ON EMC STORAGE CUSTOMERS Oracle Data Guard and EMC RecoverPoint Comparison

Deployment Options for Microsoft Hyper-V Server

Contents. SnapComms Data Protection Recommendations

OVERVIEW. CEP Cluster Server is Ideal For: First-time users who want to make applications highly available

HP Serviceguard Cluster Configuration for HP-UX 11i or Linux Partitioned Systems April 2009

Top 10 Reasons why MySQL Experts Switch to SchoonerSQL - Solving the common problems users face with MySQL

High Availability Cluster for RC18015xs+

EMC Virtual Infrastructure for Microsoft Applications Data Center Solution

Infortrend ESVA Family Enterprise Scalable Virtualized Architecture

Cisco UCS and Fusion- io take Big Data workloads to extreme performance in a small footprint: A case study with Oracle NoSQL database

hp ProLiant network adapter teaming

Pervasive PSQL Meets Critical Business Requirements

Integrated Application and Data Protection. NEC ExpressCluster White Paper

Step-by-Step Guide. to configure Open-E DSS V7 Active-Active iscsi Failover on Intel Server Systems R2224GZ4GC4. Software Version: DSS ver. 7.

Microsoft SharePoint 2010 on VMware Availability and Recovery Options. Microsoft SharePoint 2010 on VMware Availability and Recovery Options

HP 3PAR Software Installation and Startup Service

TECHNICAL PAPER. Veeam Backup & Replication with Nimble Storage

Microsoft SMB File Sharing Best Practices Guide

HP 3PAR storage technologies for desktop virtualization

HP Serviceguard Cluster Configuration for Partitioned Systems

Transcription:

WHITE PAPER SanDisk ION Accelerator High Availability 951 SanDisk Drive, Milpitas, CA 95035 www.sandisk.com

Table of Contents Introduction 3 Basics of SanDisk ION Accelerator High Availability 3 ALUA Multipathing with SanDisk ION Accelerator HA 4 End- to- End Availability 4 SanDisk ION Accelerator Management 5 Non- Disruptive Update and FRU Servicing 5 Cluster Management in SanDisk ION Accelerator HA 5 Volume Replication in SanDisk ION Accelerator HA 6 Efficient Synchronization 6 Variable- Rate Synchronization 7 Split Brain Notification and Automatic Recovery 7 Fusion iomemory iodrive Error Handling Strategies 7 Application Acceleration using SanDisk ION Accelerator 7 2

Introduction The SanDisk ION Accelerator High Availability feature enables deployment of SanDisk ION Accelerator units into environments that require the highest levels of data protection and availability goals. Properly implemented high availability architectures must meet the following basic objectives: Protect data from failure of system components Provide data availability even if a system component fails Provide data availability while failed components are replaced Provide non- disruptive software/firmware upgrades Provide non- disruptive Field Replaceable Unit (FRU) servicing The mechanism that implements HA must be fault- tolerant The SanDisk ION Accelerator High Availability storage service meets these objectives with a solution that has low complexity and high performance. Basics of SanDisk ION Accelerator High Availability With the HA feature, every write to a SanDisk ION Accelerator volume is synchronously replicated across a pair of clustered SanDisk ION Accelerator nodes. A write is not acknowledged to the initiator until it has been written to power- cut- safe Fusion iomemory tier on both nodes. As a result, data is protected from component failures or catastrophic failures of a single node. If an application is unable to access data from an HA volume, it will automatically use an alternate path to data on the other node. As a result, data remains available even if access to one SanDisk ION Accelerator node is lost. When a node is being repaired or replaced, the remaining node ensures data availability by staying online. When the repaired node is brought back online, it is automatically resynchronized with the online node before resuming service. Software and firmware upgrades are non- disruptive through rolling cluster updates. One single node is updated and then restarted while the remaining node stays online, providing data availability. The HA feature employs redundancy to maintain performance and availability against data fabric or cluster interconnect failure. 3

Figure 1: SanDisk ION Accelerator High Availability ALUA Multipathing with SanDisk ION Accelerator HA In the dark ages of storage networking, storage array developers had to develop custom drivers for server multipathing. These custom drivers were needed due to an absence of standards for managing multipath I/O in dual controller configurations. Along with SCSI3 standards initiated in 2005 came ALUA (Asymmetric Logical Unit Access), providing a method to implement multipathing without having to create custom drivers. ALUA multipathing has a couple of key mechanisms for controlling access to LUNs and managing failover and failback between asymmetric active:active controllers, called implicit and explicit ALUA. Implicit and explicit ALUA refer to methods for managing which paths are active to an array. Implicit ALUA is a means for a storage cluster to communicate to an initiator which storage path should be used per LUN, and explicit ALUA allows an initiator to switch over to another path in case a storage path between the initiator and a cluster node fails. End- to- End Availability SanDisk ION Accelerator HA works hand- in- hand with server, application, and storage architecture redundancies. For example, clustered servers, whether clustered by the operating system (such as Microsoft Cluster) or the application itself (such as Oracle RAC), provide high availability in the event an application server fails. Redundant interface paths from servers directly to SanDisk ION Accelerator appliances or through redundant fabric switches create fault- tolerant I/O paths. When combined with the SanDisk ION Accelerator HA feature the entire solution provides end- to- end data availability. 4

Figure 2: SanDisk ION Accelerator and Oracle RAC SanDisk ION Accelerator Management SanDisk ION Accelerator HA is administered by the iosphere graphical user interface (GUI) and Command Line Interface (CLI). Both the GUI and CLI provide Single- System- Image, an ease of use simplification that allows multiple SanDisk ION Accelerator nodes to be managed as one. SanDisk ION Accelerator HA configuration, event monitoring, and performance are managed by an integrated, cluster- aware GUI and CLI. Non- Disruptive Update and FRU Servicing SanDisk ION Accelerator HA provides Non- Disruptive Update (NDU) and Field Replaceable Unit (FRU) servicing. Software update packages are applied one node at a time. Each node automatically enters a maintenance mode, performs the update, and then rejoins the HA cluster. A rolling HA update is automated by Fusion iosphere software. Similarly, if a FRU, such as an Fusion iomemory iodrive card, HBA, or Cluster Interconnect Card requires servicing, the node can be taken into maintenance mode, servicing performed, and when appropriate, bought back online. Cluster Management in SanDisk ION Accelerator HA SanDisk ION Accelerator HA relies on industry leading tools (Corosync and Pacemaker) for cluster resource management and messaging for detection and recovery of node- and resource- level failures. These tools are fully integrated into SanDisk ION Accelerator HA. They require no additional user configuration and are maintained as part of the overall software package. This integrated software stack works seamlessly to provide a robust high availability solution. 5

Volume Replication in SanDisk ION Accelerator HA SanDisk ION Accelerator HA includes a volume replication capability that is based on Linbit s DRBD. According to Linbit, Using DRBD in conjunction with Pacemaker is arguably DRBD s most frequently found use case. Pacemaker is also one of the applications that make DRBD extremely powerful in a wide range of usage scenarios. With SanDisk ION Accelerator HA, every volume has a role, which may be primary or secondary. The choice of primary and secondary as terms here is not arbitrary. These roles were deliberately not named Active and Passive. Primary vs. secondary refers to a concept related to availability of storage, whereas active vs. passive refers to the availability of an application. It is usually the case in a high- availability environment that the primary node is also the active one, but this is by no means necessary. A SanDisk ION Accelerator volume in the primary role can be used for read and write operations. A SanDisk ION Accelerator volume in the secondary role receives all updates from the peer node s device, but otherwise disallows access completely. It cannot be used by applications, neither for read nor write access. The reason for disallowing even read- only access to the device is the necessity to maintain consistency, which would be impossible if a secondary resource were made accessible in any way. At any given time, a volume is in the primary role on only one cluster member. SanDisk ION Accelerator HA implements a synchronous replication protocol. Local write operations on the primary volume are considered durable only after both the local and the remote write have been completed. As a result, loss of a single node is guaranteed not to cause data inconsistency. Data loss is, of course, inevitable even with this replication protocol if both nodes are irreversibly destroyed at the same time. The SanDisk ION Accelerator HA replication and synchronization framework leverages commodity high performance Ethernet networking. Efficient Synchronization Re- synchronization is distinct from replication. While replication occurs on any write event to a resource in the primary role, synchronization is decoupled from incoming writes. Rather, it affects the device as a whole. Synchronization is necessary if the replication link has been interrupted for any reason, be it due to failure of the primary node, failure of the secondary node, or interruption of the replication link. Synchronization is efficient in the sense that SanDisk ION Accelerator HA does not synchronize modified blocks in the order they were originally written, but in linear order, which has the following consequences: Synchronization is fast, since blocks in which several successive write operations occurred are only synchronized once. Synchronization is efficient; activity is logged and only hot extents need be synchronized. During synchronization, the data set on the standby node is partly obsolete and partly already updated. This state of data is called inconsistent. Initiator access to storage remains available on the active node, while background synchronization is in progress. 6

Note: A node with inconsistent data cannot be put into operation, thus it is desirable to keep the time period during which a node is inconsistent as short as possible. SanDisk ION Accelerator, more so than any typical replication approach, accomplishes this due to its use of high speed a Fusion iomemory tier as the storage media. Variable- Rate Synchronization In variable- rate synchronization, SanDisk ION Accelerator HA detects the available bandwidth on the synchronization links, compares it to incoming foreground application I/O, and selects an appropriate synchronization rate based on a fully automatic control loop. The administrator can, of course, schedule a time of day for synchronization when performing planned maintenance. Split Brain Notification and Automatic Recovery Split brain is a situation where, due to temporary failure of all network links between cluster nodes, or possibly due to human error, both nodes switch to the primary role while disconnected. This is a potentially harmful state, as it implies that modifications to the data might have been made on either node without having been replicated to the peer. Thus, it is possible in this situation that two diverging sets of data have been created, which cannot be trivially merged. SanDisk ION Accelerator HA resolves detected split brain situations in the following manner: Graceful recovery if there have been no intermediate changes: In this mode, if only one node had made modifications during split brain, DRBD will simply recover gracefully and declare the split brain resolved. Automatic split brain resolution is a fairly likely scenario because initiators have no immediate reason to failover an I/O path; they will remain connected to ALUA preferred nodes. Disconnect: If a divergence was detected, SanDisk ION Accelerator HA will protect data by refusing to reconnect the replica. Maintenance Mode: SanDisk ION Accelerator HA will automatically enter maintenance mode and refuse to accept initiator I/O, if a split brain condition is detected during boot. Fusion iomemory iodrive Error Handling Strategies Customers have the option of creating redundant (mirrored) or non- redundant storage pools, and in so doing, make an explicit choice that trades capacity for availability. Because the SanDisk ION Accelerator appliance is replicating data between nodes, it is not strictly necessary to configure local redundancy to achieve high availability. If a FRU within a non- redundant storage pool goes offline, SanDisk ION Accelerator HA will failover quickly. The initiator s multipath I/O will failover to access replicated storage on the other node. Application Acceleration using SanDisk ION Accelerator The SanDisk ION Accelerator appliance maximizes business critical application performance for data- intensive workloads including databases, virtualization, virtual desktop infrastructure, and Big Data solutions across Microsoft SQL Server, Oracle, MySQL, SAP HANA, VMware vsphere and View, and many others. FOR MORE INFORMATION Contact a SanDisk representative, 1-800- 578-6007 or fusion- sales@sandisk.com The performance results discussed herein are based on testing and use of the above described products. Results and performance may vary according to configurations and systems, including drive capacity, system architecture and applications. 2014 SanDisk Corporation. All rights reserved. SanDisk is a trademark of SanDisk Corporation, registered in the United States and other countries. Fusion iomemory, iodrive, Fusion iosphere, SanDisk ION Accelerator and others are trademarks of SanDisk Enterprise IP LLC. Other brand names mentioned herein are for identification purposes only and may be the trademarks of their respective holder(s). 7