High availability on the Catalyst Cloud



Similar documents
DISASTER RECOVERY WITH AWS

Fully Managed Secure Data Sharing (a cloud service)

Westek Technology Snapshot and HA iscsi Replication Suite

Feature Comparison. Windows Server 2008 R2 Hyper-V and Windows Server 2012 Hyper-V

High Availability with Postgres Plus Advanced Server. An EnterpriseDB White Paper

A G U I D E T O C O - L O C A T I O N

Our Hosting Infrastructure. An introduction to our Platform, Data Centres and Data Security.

Exchange Data Protection: To the DAG and Beyond. Whitepaper by Brien Posey

Autodesk PLM 360 Security Whitepaper

Data Center Content Delivery Network

Telecom Business Continuity Solutions FOR INTERNAL USE ONLY

AMAZON S3: ARCHITECTING FOR RESILIENCY IN THE FACE OF FAILURES Jason McHugh

The Microsoft Large Mailbox Vision

Advanced High. Architecture.

Real-time Protection for Hyper-V

NY-1 DATACENTER AT A GLANCE. NY-1 is a Tier III-rated, SAS SSAE16 and HIPAA-certified data center

Highly Available Mobile Services Infrastructure Using Oracle Berkeley DB

Disaster Recovery Design Ehab Ashary University of Colorado at Colorado Springs

Veritas Storage Foundation High Availability for Windows by Symantec

Frankfurt Data Centre Overview

Design For Availability. October 2013 Stevan Vlaovic

Building Storage Service in a Private Cloud

Perceptive Software Platform Services

Vess A2000 Series HA Surveillance with Milestone XProtect VMS Version 1.0

Windows Server 2008 R2 Hyper-V Server and Windows Server 8 Beta Hyper-V

Complete Storage and Data Protection Architecture for VMware vsphere

The evolution of data connectivity

Blackboard Managed Hosting SM Disaster Recovery Planning Document

Web Drive Limited TERMS AND CONDITIONS FOR THE SUPPLY OF SERVER HOSTING

(Formerly Double-Take Backup)

Availability and Disaster Recovery: Basic Principles

Achta's IBAN Validation API Service Overview (achta.com)

Level I - Public. Technical Portfolio. Revised: July 2015

Clustering ExtremeZ-IP 4.1

Cloud Operations Excellence & Reliability

Datacenter Assessment

DATA CENTER COLOCATION

Service Description Cloud Storage Openstack Swift

Sophisticated Password Policy

BME CLEARING s Business Continuity Policy

FortiBalancer: Global Server Load Balancing WHITE PAPER

Sovereign. The made to measure data centre

BEST PRACTICES FOR IMPROVING EXTERNAL DNS RESILIENCY AND PERFORMANCE

Security Policy JUNE 1, SalesNOW. Security Policy v v

The Anatomy of a. High-Availability Rack. November Online Tech, Inc. 220 E. Huron Ann Arbor, MI (734)

Online Backup by Mozy. Common Questions

DISASTER RECOVERY BUSINESS CONTINUITY DISASTER AVOIDANCE STRATEGIES

The VMware Administrator s Guide to Hyper-V in Windows Server Brien Posey Microsoft

A SURVEY OF POPULAR CLUSTERING TECHNOLOGIES

DriveHQ Security Overview

High Availability with Windows Server 2012 Release Candidate

Hyper-V Replica. Aidan Finn

Stretched Clusters and VMware

High Availability Essentials

Fedora Directory Server FUDCon III London, 2005

Joe Young, Senior Windows Administrator, Hostway

Disaster Recovery Strategies

Cloud Infrastructure Operational Excellence & Reliability

DATA CENTRE DATA CENTRE MAY 2015

SAN Conceptual and Design Basics

GSN Cloud Contact Centre Availability & DR Datasheet

Cloud Optimize Your IT

Security Controls for the Autodesk 360 Managed Services

FAQ: BroadLink Multi-homing Load Balancers

Deploy App Orchestration 2.6 for High Availability and Disaster Recovery

Every business owns precious data and every business should recognise the risk of losing their data. Natural Disaster

Windows Server Failover Clustering April 2010

Restricted Document. Pulsant Technical Specification

Network Enabled Cloud

On- Prem MongoDB- as- a- Service Powered by the CumuLogic DBaaS Platform

Windows Azure and private cloud

Designing a Cloud Storage System

Load Balancing ContentKeeper With RadWare

Traditional v/s CONVRGD

Oracle Maps Cloud Service Enterprise Hosting and Delivery Policies Effective Date: October 1, 2015 Version 1.0

Virtual Infrastructure Security

TÓPICOS AVANÇADOS EM REDES ADVANCED TOPICS IN NETWORKS

FioranoMQ 9. High Availability Guide

TÓPICOS AVANÇADOS EM REDES ADVANCED TOPICS IN NETWORKS

Making the Business and IT Case for Dedicated Hosting

Data Center Presentation

Voice Over IP. MultiFlow IP Phone # 3071 Subnet # Subnet Mask IP address Telephone.

SwiftStack Global Cluster Deployment Guide

Cisco Disaster Recovery: Best Practices White Paper

How To Protect Data On Network Attached Storage (Nas) From Disaster

Apptix Online Backup by Mozy

High-Availability in the Cloud Architectural Best Practices

Transcription:

White paper High availability on the Catalyst Cloud Features and techniques to improve availability, resiliency and business continuity of web applications hosted on the Catalyst Cloud 3 February 2016

Table of Contents 1. Introduction...1 2. Physical Infrastructure...1 3. Catalyst Cloud features that enhance availability and resiliency...2 Commercial in Confidence

1. Introduction This document outlines the physical infrastructure and software features that make the Catalyst Cloud highly available and resilient. It covers built in features that are inherited by every project and services that can be used to enhance the availability of web applications and web sites hosted on the Catalyst Cloud. 2. Physical Infrastructure The Catalyst Cloud is hosted on multiple regions or geographical locations. Regions are completely independent and isolated from each other, providing fault tolerance and geographic diversity. This section describes the physical attributes of our data centres. Security Catalyst Cloud regions have the following security attributes: Comprehensive access control systems with multiple perimeters Closed Circuit TV with cameras covering entrance and data centre Fire suppression mechanisms Power and Cooling Catalyst Cloud regions have the following power supply and cooling attributes: High Capacity UPS Backup Diesel Generators A+B power to UPS and racks (Porirua and Hamilton) N+1 process coolers Network The physical networks backing the Catalyst Cloud have the following properties: Diverse Fibre paths linking cloud regions Diverse Fibre providers for fibre paths Diverse ISPs Diverse International Upstream Providers High capacity intra cloud bandwidth Commercial in Confidence // Page 1 of 1

3. Catalyst Cloud features that enhance availability and resiliency Compute If a physical compute node fails, our monitoring systems will detect the failure and trigger an evacuate process that will restart all affected virtual compute instances on a healthy physical server. This process usually takes between 5 to 20 minutes which allows us to meet our 99.9% availability SLA for individual compute instances. Customers that require more than 99.9% availability can combine multiple compute instances within the same region using anti affinity groups. Anti affinity groups ensure that compute instances belonging to the same group are hosted on different physical servers. This reduces the risk and probability of multiple compute instances failing at the same time due to the loss of a single server. Our documentation provides instructions on how to deploy highly available compute instances using anti-affinity filters and keepalived: http://docs.catalystcloud.io/tutorials/deploying-highly-available-instances-withkeepalived.html Customers that require their applications to survive the loss of an entire data centre can launch compute instances in different regions (assuming your application supports operating in this way). The following tutorial explains how to use the Fastly CDN as a fail-over mechanism between regions: http://docs.catalystcloud.io/tutorials/region-failover-using-the-fastly-cdn.html Block Storage We run a distributed storage system that by default retains three copies of your data on different servers spread across a region (a datacenter). We can afford to lose many disks and multiple storage nodes without losing any data. As soon as a disk or storage node fails, our storage solution begins recovering the data from an existing copy, always ensuring that three replicas are present. The storage solution is self managing and self healing and places your data in optimal locations for data survival and resiliency. It runs automated error checks in the background that can detect and recover a single bit of data wrong, by comparing the three copies of the data and ensuring they are identical. The solution is designed and implemented with very high availability and data resiliency in mind. It has no single points of failure. Object Storage Currently Object Storage is backed by the same technology and physical infrastructure as block storage. Consequently it offers the same protections and guarantees as Block storage does. In the first half of 2016 we will start replicating object storage data across three Commercial in Confidence // Page 2 of 1

regions, providing 99.999999999% durability and 99.99% availability for data over a given year. Virtual Routers In the same way that if a compute instance fails, if a physical network node fails our monitoring systems will detect the failure and trigger the evacuate process that will ensure all affected virtual router instances are restarted on a healthy server. This process usually takes between 5 to 20 minutes. We are working on a new feature that launches two virtual routers on separate network nodes responding on the same IP address. Once this is complete the failover between routers will take milliseconds which will most likely not be noticed. Meanwhile customers requiring higher availability are advised to combine compute instances from multiple regions where possible. Monitoring The catalyst cloud has robust fine grained monitoring systems in place. These systems are monitored 24x7. Roadmap The following features are expected to be available on the Catalyst Cloud in 2016 and will improve resiliency and availability even further: Highly Available redundant routers (using VRRP) Object storage replicated across three regions Load Balance as a Service (LBaaS) Commercial in Confidence // Page 3 of 1