High-Availability in the Cloud Architectural Best Practices



Similar documents
Design for Failure High Availability Architectures using AWS

Scalable Web Application

Deploying Database clusters in the Cloud

ArcGIS for Server in the Amazon Cloud. Michele Lundeen Esri

Storage and Disaster Recovery

Web Application Hosting in the AWS Cloud Best Practices

Web Application Deployment in the Cloud Using Amazon Web Services From Infancy to Maturity

Scalable Architecture on Amazon AWS Cloud

Making Your ColdFusion Apps Highly Available. Brian Klaas Johns Hopkins Bloomberg School of Public Health

Migration Scenario: Migrating Backend Processing Pipeline to the AWS Cloud

Alfresco Enterprise on AWS: Reference Architecture

Fault-Tolerant Computer System Design ECE 695/CS 590. Putting it All Together

TECHNOLOGY WHITE PAPER Jun 2012

TECHNOLOGY WHITE PAPER Jan 2016

Amazon Elastic Beanstalk

Web Application Hosting in the AWS Cloud Best Practices

Cloud Computing Disaster Recovery (DR)

Designing Apps for Amazon Web Services

Software Performance, Scalability, and Availability Specifications V 3.0

On- Prem MongoDB- as- a- Service Powered by the CumuLogic DBaaS Platform

IAN MASSINGHAM. Technical Evangelist Amazon Web Services

Amazon Web Services Student Tutorial

Software- as- a- Service (SaaS) on AWS Business and Architecture Overview

Best Practices for Using MySQL in the Cloud

Postgres Plus Cloud Database!

Building Fault-Tolerant Applications on AWS October 2011

Learning Management Redefined. Acadox Infrastructure & Architecture

How To Choose Between A Relational Database Service From Aws.Com

Reliable Data Tier Architecture for Job Portal using AWS

Object Storage: A Growing Opportunity for Service Providers. White Paper. Prepared for: 2012 Neovise, LLC. All Rights Reserved.

DLT Solutions and Amazon Web Services

NetflixOSS A Cloud Native Architecture

ur skills.com

Cloud Based Application Architectures using Smart Computing

RackWare Solutions Disaster Recovery

DISASTER RECOVERY WITH AWS

Using ArcGIS for Server in the Amazon Cloud

RightScale mycloud with Eucalyptus

Amazon Relational Database Service (RDS)

Design For Availability. October 2013 Stevan Vlaovic

Blackboard Managed Hosting SM Disaster Recovery Planning Document

Cloud Computing with Amazon Web Services and the DevOps Methodology.

Drupal in the Cloud. Scaling with Drupal and Amazon Web Services. Northern Virginia Drupal Meetup

Amazon Web Services Yu Xiao

Application Security Best Practices. Matt Tavis Principal Solutions Architect

Amazon Relational Database Service. User Guide API Version

High Performance MySQL Choices in Amazon Web Services: Beyond RDS. Andrew Shieh, SmugMug Operations smugmug.

Case Study. Highly Available, Fault Tolerant Cloud Solution & AWS Managed Support. Case Study. A Telehealthcare Company

Managed Amazon Web Services

Primex Wireless OneVue Architecture Statement

Cloud Computing: Designing Different System Architecture Depending On Real-World Examples

How To Backup To Cloud

Preparing Your IT for the Holidays. A quick start guide to take your e-commerce to the Cloud

SOFTWARE-DEFINED STORAGE IN ACTION

SQL Server Storage Best Practice Discussion Dell EqualLogic

The Pros and Cons of Erasure Coding & Replication vs. RAID in Next-Gen Storage Platforms. Abhijith Shenoy Engineer, Hedvig Inc.

SysAid Cloud Architecture Including Security and Disaster Recovery Plan

Chapter 9 PUBLIC CLOUD LABORATORY. Sucha Smanchat, PhD. Faculty of Information Technology. King Mongkut s University of Technology North Bangkok

Service Organization Controls 3 Report

Enabling Database-as-a-Service (DBaaS) within Enterprises or Cloud Offerings

Moving to the Cloud. Sam Hornstein Jetline Jason Nokes President, Distributor Central Garrett Ausfeldt Starline

DISTRIBUTED SYSTEMS [COMP9243] Lecture 9a: Cloud Computing WHAT IS CLOUD COMPUTING? 2

Amazon Compute - EC2 and Related Services

Extending your datacenter to the cloud

Best Practices for Siebel on AWS

CLOUD COMPUTING FOR THE ENTERPRISE AND GLOBAL COMPANIES Steve Midgley Head of AWS EMEA

COST-BENEFIT ANALYSIS: HIGH AVAILABILITY IN THE CLOUD AVI FREEDMAN, TECHNICAL ADVISOR. a white paper by

Server and Storage Virtualization with IP Storage. David Dale, NetApp

CONNECTRIA MANAGED AMAZON WEB SERVICES (AWS)

Multi-Datacenter Replication

PHP in the Cloud. Running Business-Critical PHP Applications in the Cloud

Migration and Building of Data Centers in IBM SoftLayer with the RackWare Management Module

Windows Server 2003 Migration Guide: Nutanix Webscale Converged Infrastructure Eases Migration

Designing Private and Hybrid Clouds

BeBanjo Infrastructure and Security Overview

OnApp Cloud. The complete platform for cloud service providers. 114 Cores. 286 Cores / 400 Cores

OTM in the Cloud. Ryan Haney

Amazon EC2 Product Details Page 1 of 5

Achieve Economic Synergies by Managing Your Human Capital In The Cloud

Reference Model for Cloud Applications CONSIDERATIONS FOR SW VENDORS BUILDING A SAAS SOLUTION

Cloud models and compliance requirements which is right for you?

Migration and Disaster Recovery Underground in the NEC / Iron Mountain National Data Center with the RackWare Management Module

EXECUTIVE SUMMARY CONTENTS. 1. Summary 2. Objectives 3. Methodology and Approach 4. Results 5. Next Steps 6. Glossary 7. Appendix. 1.

Amazon Web Services Primer. William Strickland COP 6938 Fall 2012 University of Central Florida

Every Silver Lining Has a Vault in the Cloud

JAVA IN THE CLOUD PAAS PLATFORM IN COMPARISON

WE RUN SEVERAL ON AWS BECAUSE WE CRITICAL APPLICATIONS CAN SCALE AND USE THE INFRASTRUCTURE EFFICIENTLY.

Leveraging Public Clouds to Ensure Data Availability

High Availability for Database Systems in Cloud Computing Environments. Ashraf Aboulnaga University of Waterloo

DNS use guidelines in AWS Jan 5, 2015 Version 1.0

Contents. Foreword. Acknowledgments

Building an AWS-Compatible Hybrid Cloud with OpenStack

Transcription:

1 High-Availability in the Cloud Architectural Best Practices Josh Fraser, VP Business Development, RightScale Brian Adler, Sr. Professional Services Architect

2 # RightScale World s #1 cloud management system Managing cloud deployments for 4+ Years globally More than 40,000 users; launched over 2.5MM servers Powering the largest production deployments on the cloud

3 # Where RightScale fits

4 # High Availability in the Cloud

5 # Summary Solid track record of AWS operational excellence Need design for failure Use architectural best practices Back up and replication need to be taken seriously Unsurpassed availability of resources and options are now available

6 # Designing for Failure (in a good way) Large scale failures in the cloud are rare but do happen Application owners are ultimately responsible for availability and recoverability Need to balance cost and complexity of HA efforts against risk(s) you are willing to bear Cloud infrastructure has made DR and HA remarkably affordable versus past options Multi-server Multi-AZ Multi-Region Multi-Cloud

7 # What do we mean by Cloud? A cloud is a physical datacenter entity behind an API endpoint What does that really mean? Amazon Web Services is not a cloud EC2 is not a cloud Eucalyptus, Cloud.com, OpenStack are not clouds EC2 US-East, my private cloud these are clouds An availability zone is not a cloud (but it is part of one) Think of a cloud as a resource pool accessed via an API

8 # Distributed Platform Managing Multiple Clouds ServerTemplates are a design once, deploy multiple times approach RightScale handles the heavy lifting between different clouds

9 # Multi-cloud Best Practices: ServerTemplates VS. Integrated approach that puts together all the parts needed to architect single & multi-server deployments

10 # ServerTemplates are Server DNA Cloud Agnostic One Asset, Multiple Options Public Cloud Web Server Private Cloud Future Cloud

11 # HA Architectures

12 # HA/DR Checklist for Risk Mitigation Determine who owns the architecture, DR process and testing. Develop expertise in-house and / or get outside help. Conduct a risk assessment for each application. Specify your target Recovery Time Objective and Recovery Point Objective. Design for failure starting with application architecture. This will help drive the infrastructure architecture. Implement HA best practices balancing cost, complexity and risk. Automate infrastructure for consistency and reliability. Document operational processes and automations. Test the failover... then test it again. Release the Chaos Monkey.

13 # General HA Best Practices Avoid single points of failure Always place (at least) one of each component (load balancers, app servers, databases) in at least two AZs Maintain sufficient capacity to absorb AZ / cloud failures Reserved Instances guarantee capacity is available in a separate region/cloud Replicate data across AZs and backup or replicate across clouds/regions for failover Setup monitoring, alerts and operations to identify and automate problem resolution or failover process Design stateless applications for resilience to reboot / relaunch

14 # Regions & Availability Zones Zones within a region share a LAN (high bandwidth, low latency, private IP access) Zones utilize separate power sources, are physically segregated Regions are islands, and share no resources

15 # Application Architecture Deployment Options Component DNS Load Balancing Storage Server Array Database Options/Considerations DNS APIs for dynamic configuration (DynDNS, Route 53, DNS Made Easy) HAProxy, Zeus, aicache, ELB Local storage, EBS, S3, CloudFiles, GlusterFS, etc. Scalable tiers; scale up and down conservatively MySQL, PostgreSQL, SQL Server, RDS, NoSQL

16 # Multi-Cloud Cold / Warm / Hot DR Options No Downtime > 5 Minutes > 1 Hour > Few Hours Multi-Cloud HA (Live/Live Config) Hot DR (Least Common) Warm DR (Recommended) Cold DR (Most Common) $ $$ $$$ $$$$

17 # Multi-Cloud Cold DR Example Staged Server Configuration and generally no staged data Not recommended if rapid recovery is required Slow to replicate data to other cloud Slow to bring database to an operational state

18 # Multi-Cloud Warm DR Example Staged Server Configuration, pre-staged data and running Slave Database Server Generally recommended DR solution Minimal additional cost Allows fairly rapid recovery

19 # Multi-Cloud Hot DR Example Parallel Deployment with all servers running but all traffic going to primary Not recommended Very high additional cost Allows rapid recovery, but not significantly faster than warm configuration

20 # Multi-Cloud Hot HA Example Live/Live configuration. May use Geo-target IP services to direct traffic to regional load balancers. Possible, but not recommended (more to follow ) Maximum additional cost Provides high availability, but complex to implement and manage

21 # Multi-Cloud Hot HA Example Multi-Cloud looks similar to Multi-AZ but there are additional problems to solve as some resources are not shared across clouds You need DNS management or a global load balancer Security is an issue as security groups are Region-specific. Images for cloud servers are specific to the cloud/region. You need to copy or replicate data yourself as EBS snapshots are specific to the source AZ. Getting the data out requires a slave sync or taking LVM snapshots and transferring the data. Consider distributed NoSQL databases with the same distribution considerations. Spread primary and replica nodes across multiple AZs. Place as many as you need for required resiliency.

22 # So What s Best? Design for failure Assume everything will fail, and architect a solution capable of handing each and every failure condition No one size fits all solution Every application has its own architecture Not all infrastructure building blocks fit well with all applications Tradeoffs between levels of resiliency and cost The options available in the cloud today are unprecedented Capabilities for global redundancy Time to access Investment required Follow our High Availability Checklist (or create your own) Multi-AZ configurations with a solid DR plan are generally the most viable and cost-conscious solutions

23 # Q & A

24 # Mention this AWS Event and get: Free High Availability Assessment Free Design/Architecture Recommendations Half Off Onboarding Fee Ready to get started? Contact us at sales@rightscale.com or (866) 720-0208 Learn More RightScale Free Edition White Paper Library Webinar Library RightScale.com/free Rightscale.com/whitepapers RightScale.com/webinars