How To Avoid Snowflakes



Similar documents
DISASTER RECOVERY WITH AWS

Cloud & DevOps Program Vision and Strategy. Jason Snyder, Steve Martino, and Erica Bradshaw

Overview. The Cloud. Characteristics and usage of the cloud Realities and risks of the cloud

Mile Run World Record Progression 1913 to 1999 (Source: Wikipedia) Rapid App Development Fosters Improvement

Week Overview. Installing Linux Linux on your Desktop Virtualization Basic Linux system administration

UDiMan. Introduction. Benefits: Name: UDiMan Identity Management service. Service Type: Software as a Service (SaaS Lot 3)

CLOUD COMPUTING An Overview

Hybrid Cloud for Development and Testing with VMware vcloud Air

Minder. simplifying IT. All-in-one solution to monitor Network, Server, Application & Log Data

Urbancode Deploy Overview

Increased Security, Greater Agility, Lower Costs for AWS DELPHIX FOR AMAZON WEB SERVICES WHITE PAPER

CUMULUX WHICH CLOUD PLATFORM IS RIGHT FOR YOU? COMPARING CLOUD PLATFORMS. Review Business and Technology Series

Server & Cloud Management

Migration and Building of Data Centers in IBM SoftLayer with the RackWare Management Module

Vodafone Private Cloud

Organizations that are standardizing today are enjoying lower management costs, better uptime. INTRODUCTION

Invest in your business with Ubuntu Advantage.

Migration and Building of Data Centers in IBM SoftLayer with the RackWare Management Module

A BETTER SOLUTION FOR MAINTAINING HEALTHCARE DATA SECURITY IN THE CLOUD

Making the Transition. From ISV to SaaS. with Xterity Wholesale Cloud

OpenShift 3.0 in the Sogeti Services Factory

IJRSET 2015 SPL Volume 2, Issue 11 Pages: 29-33

SharePoint 2013 Best Practices

Web Application Deployment in the Cloud Using Amazon Web Services From Infancy to Maturity

Bring the cloud to your datacenter

The Cloud is Not Enough Why Hybrid Infrastructure is Shaping the Future of Cloud Computing

Enterprise GIS Architecture Deployment Options. Andrew Sakowicz

How To Use Windows Small Business Server 2011 Essentials

Partners are Telling Us. Customers are Telling Us. Cloud Managed Services are misunderstood. It would be great if AWS could better define.

Maximizing Configuration Management IT Security Benefits with Puppet

Using SUSE Studio to Build and Deploy Applications on Amazon EC2. Guide. Solution Guide Cloud Computing.

John D. Bonam Disaster Recovery Architecture Session # 2841

All Clouds Are Not Created Equal THE NEED FOR HIGH AVAILABILITY AND UPTIME

Cloud Courses Description

CloudCenter Full Lifecycle Management. An application-defined approach to deploying and managing applications in any datacenter or cloud environment

Cloud computing is a marketing term that means different things to different people. In this presentation, we look at the pros and cons of using

Technology Services Overview Python/Django

Migration and Disaster Recovery Underground in the NEC / Iron Mountain National Data Center with the RackWare Management Module

Media Company Reduces Time-to-Market by 80 Percent with Cloud-Hosting Solution

Enabling Cloud Computing for Enterprise Web Applications:

System Management with RHN Satellite

GET CLOUD EMPOWERED. SEE HOW THE CLOUD CAN TRANSFORM YOUR BUSINESS.

Cloud Computing 101 Dissipating the Fog 2012/Dec/xx Grid-Interop 2012

The 9 Ugliest Mistakes Made with Data Backup and How to Avoid Them

Analytics In the Cloud

Realizing the Value of Standardized and Automated Database Management SOLUTION WHITE PAPER

Making Leaders Successful Every Day

Microsoft SharePoint Architectural Models

Simplified Private Cloud Management

Deployment patterns for Fusion Middleware. a best practice session by Simon Haslam & Jacco H. Landlust

High Availability Essentials

What every small business should know about Cloud storage

Modern App Architecture for the Enterprise Delivering agility, portability and control with Docker Containers as a Service (CaaS)

Leveraging Public Cloud for Affordable VMware Disaster Recovery & Business Continuity

Course 20533: Implementing Microsoft Azure Infrastructure Solutions

Assignment # 1 (Cloud Computing Security)

About Us. 2 Managed Services E: sales@ironcovesolutions.com T: W: Our Mission. What We Do

Cloud & DevOps Program Big Group. March 13, 2015 Friday 2:00-3:00 p.m. Science Center Hall A

The Case for Cloud Computing - A strategic Perspective

Implementing Microsoft Azure Infrastructure Solutions

Develop an intelligent disaster recovery solution with cloud technologies

All the benefits of Public Cloud on Private, Dedicated Infrastructure. Benefits. Enterprise-Level Security. High Performance. Compliant and Audited

Trend Micro. Advanced Security Built for the Cloud

10 steps to the Cloud for SMBs Introduction to Cloud computing. Ask the Experts. Making Business Work Better Online

Agenda About SUNY and ITEC Cloud project Challenges and Use cases for ITEC Cloud EM Solution Business Benefits

BASHO DATA PLATFORM SIMPLIFIES BIG DATA, IOT, AND HYBRID CLOUD APPS

Expand Your Infrastructure with the Elastic Cloud. Mark Ryland Chief Solutions Architect Jenn Steele Product Marketing Manager

Agilisys G-Cloud Service V

Scaling Web Applications in a Cloud Environment. Emil Ong Caucho Technology 8621

Top Ten Reasons to Transition Your IT Sandbox Environments to the Cloud

How To Cloud Compute At The Cloud At The Cyclone Center For Cnc

Building your Server for High Availability and Disaster Recovery. Witt Mathot Danny Krouk

SteelFusion with AWS Hybrid Cloud Storage

A Guide to. Cloud Services for production workloads

Using a Java Platform as a Service to Speed Development and Deployment Cycles

Private Cloud. One solution managed by Applied

Autodesk PLM 360 Security Whitepaper

Testing Cloud Application System Resiliency by Wrecking the System

Cloud Computing: Making the right choices

How To Use Arcgis For Free On A Gdb (For A Gis Server) For A Small Business

PHP in the Cloud. Running Business-Critical PHP Applications in the Cloud

Hadoop in the Hybrid Cloud

Automated Cloud Migration

Transcription:

Don t Get Caught in a Blizzard! Avoiding Snowflakes in Cloud Migrations IT Summit 2015 June 4, 2015 Thursday 1:10-2:00 p.m.

Agenda Chaos Monkeys love snowflakes those one-off, special services and setups that are particularly vulnerable to crashes and problems with application and infrastructure management. This talk gives a behind-the-scenes look at how HUIT s Cloud & DevOps group avoided snowflakes when migrating to the cloud while also making it faster and easier for service teams to get their work done. Introduction (5 min) Why Are We Talking About Snowflakes? (5 min) Snowflakes (10 min) Blizzard in the Cloud (5 min) Patterns and Standard Components Calm the Storm (15 min) Questions (10 min) 2

Who Are We? Joel Fanton Director of DevOps, HUIT Magnus Bjorkman Director of Solution Architecture, HUIT 3

Why Are We Talking About Snowflakes? Introducing the Cloud Migration Project. Per mandate from Harvard CIO Anne Margulies, we will migrate approximately 75% of 600 existing applications and all new applications to the cloud The Cloud Migration Team is taking a DevOps approach to migration, and is currently building an automation engine to provide for: rapid, consistent cloud infrastructure deployment continuous application integration, testing, and delivery built-in fault tolerance and security best practices highly available resources and disaster recovery As part of our DevOps approach to cloud development, infrastructure and resource creation should be repeatable and predictable Once apps have been migrated, operational support must be sustainable Resource snowflakes are a major deterrent to repeatability, predictability, and operational sustainability 4

What Are Snowflakes? Servers uniquely configured for specific projects and applications, often using nonstandard tools and processes Systems that suffer config drift and document rot over time Automation can help rectify some problems, but you can still have snowflakes with automation Application Middleware Middleware Tool/ Agent Tool/ Agent Tool/ Agent Tool/ Agent Configuration Connectivity Application Operating System 5

Snowflakes: Setting Up New Systems Setting up new systems poses a special set of challenges: Unique systems take a longer time to set up (often weeks) usually by multiple people specializing in distinct technologies Testing that components work together as a whole takes additional time, which lengthens the overall timeline Must have specialists available for components that need to be configured; hand-off and scheduling will take additional time 6

Snowflakes: Maintaining Existing Systems There are problems with maintaining existing systems, too: If you have 1,000 unique setups, you ll need to do every update in 1,000 different ways this is the larger and accumulating cost How do you update for a zero-day attack across 1,000 unique setups in a quick, controlled way? it requires a lot of people How do you integrate new tools across 1,000 unique setups? with local resources, over a very long time 7

Snowflakes: When Something Bad Happens What happens when a snowflake system crashes? How do you quickly recreate it? Find people that have knowledge about it Find current documentation How far does the backup get us? Do you mirror the production environment for testing? Do you create another instance to add to a cluster, even if you can t guarantee exactly the same setup? 8

Snowflakes: Managing the Farm Managing a farm of snowflakes poses some problems: How do you know what to monitor? (manual setup and manual track) How do you identify what each server does? (manually document) How do you detect what works and what doesn t? How do you know that you follow best practices, including security? 9

Here Comes the Blizzard! What happens when we move to the cloud? It s even easier to create snowflakes everything you need is at your fingertips It s fast and easy to create new systems so you can get a lot of systems Systems can be set up automatically depending on your traffic and needs 10

Blizzard: Losing Track of Systems Over time, it becomes very easy to lose track of the systems you have. Orphaned instances: How do you keep track of which servers are actually running important workloads? It can become Russian roulette if you don t have a standard and automated way to track usage Security holes: How do you detect config drift and the risk of data theft? Unexpected costs: If you don t manage the scale of your environments (especially non-prod), you ll very likely have higher costs than on-premise?? Port: 80 Add Port: 3389 11

Blizzard: Keeping Tabs on What You Have How do you keep control of rapidly proliferating instances? Use standardized methods for monitoring and tracking instances so you can determine what to run and what to terminate Use standardized ways to detect config drift and security holes Only run instances/environments when needed otherwise, destroy them 12

The Need: Continuous Setup and Teardown Run it when you need it! Tear it down when we don t need it Set it up again when we need it As we keep rebuilding, we need to handle changes to standards and components: A setup process that accounts for change A process to test over time that changes are not disruptive or detrimental v1.0 v1.2 v6.3 13

How Do We Do This? Since everything is infrastructure-as-code, components can be developed once, tested, and then used many times Build a component once, build it well, and make it reusable across the board Make components independent but tested to work together this enables users to choose which ones they want to use Make components integrate with standardized tools to offer visibility and management of the farm Make tools self-service and integrate with cloud providers, so users can take advantage of the cloud s speed and control Java Application Deployment WebLogic Splunk Amazon Linux Code Deploy Drupal/PHP Deployment GUnicorn New Relic RHEL.NET Deployment Apache DR Agent IIS Ansible Configuration Connectivity Tomcat Django Deployment Windows 14

Patterns & Standard Components: New Systems Key points when it comes to setting up new systems: Quick setup using automated self-service components (can be minutes!) Choose from a selection of tested components needed for applications Standardized ways to set application-specific configurations Specialist knowledge is baked into components, reducing the level of expertise required to set up infrastructure Automated Tools ting Tes 15

Patterns & Standard Components: Existing Systems What about maintaining existing systems? Update in one place, then roll out to every user In the case of a zero-day attack, test updates in lower environment and then roll out in production When introducing new tools, one place to swap out tools and test against component contract Automated Tools ting Tes 16

Patterns & Standard Components: Solutions What about solutions when something bad happens? A system crashes? Your automated setup of standardized components can be run at any time by anyone Mirror production environment for testing? Keep running the environment setup as many times as you want Create another instance to add to a cluster? This happens automatically as part of integrating with cloud providers and automated setup using standardized components Automated Tools 17

Patterns & Standard Components: Managing the Farm How do you know what to monitor? Standardized components are automatically discovered and tagged with information that determines usage How do you identify what each server does? Standardized tagging will sync with information in ServiceNow How do you detect what works and what doesn t? Standardized endpoints and configurations allow automatic discovery of what s healthy and what s not How do you know you follow best practices, including security? Standardized practices and policies can be codified and compared with the discoverability of standardized components ServiceNow Logging Monitoring/Alerting Security Audit Change Management 18

Additional Best Practices Using a micro services architecture to manage and deliver patterns and components for frequent releases and scalability Continuous integration and regression testing of all patterns/components Versioning of interfaces and features for backward compatibility and stability Documented, predictable extension points 19

Current Activities Building the micro service infrastructure and model Building an initial set of patterns and components, with a focus on the LAMP stack Preparing to work with application teams to incorporate use cases and requirements Working with the HUIT Software Standardization/Software Engineering Working Group to develop a guide for migrating applications to the cloud Starting work with teams with existing cloud deployments 20

Questions?

Thank you!