Architecting Self-Managing Distributed Systems



Similar documents
Private Cloud Management

Modern App Architecture for the Enterprise Delivering agility, portability and control with Docker Containers as a Service (CaaS)

Journey to the Private Cloud. Key Enabling Technologies

AppStack Technology Overview Model-Driven Application Management for the Cloud

empower your digital transformation with Private Cloud Solutions Khaled M Khondker

Cloud Computing and Open Source: Watching Hype meet Reality

journey to a hybrid cloud

Why Cisco for Cloud? IT Service Delivery, Orchestration and Automation

SolidFire SF3010 All-SSD storage system with Citrix CloudPlatform Reference Architecture

Migration and Disaster Recovery Underground in the NEC / Iron Mountain National Data Center with the RackWare Management Module

Sacha Dubois RED HAT TRENDS AND TECHNOLOGY PATH TO AN OPEN HYBRID CLOUD AND DEVELOPER AGILITY. Solution Architect Infrastructure

Modern Application Architecture for the Enterprise

A Comprehensive Cloud Management Platform with Vblock Systems and Cisco Intelligent Automation for Cloud

The Impact of PaaS on Business Transformation

Migration and Building of Data Centers in IBM SoftLayer with the RackWare Management Module

Global Headquarters: 5 Speen Street Framingham, MA USA P F

Server & Cloud Management

Hybrid Cloud: Overview of Intercloud Fabric. Sutapa Bansal Sr. Product Manager Cloud and Virtualization Group

Implementing Microsoft Azure Infrastructure Solutions

Accelerate Your Enterprise Private Cloud Initiative

Course 20533: Implementing Microsoft Azure Infrastructure Solutions

RED HAT CONTAINER STRATEGY

Building Private & Hybrid Cloud Solutions

Federated Application Centric Infrastructure (ACI) Fabrics for Dual Data Center Deployments

SUSE Cloud 2.0. Pete Chadwick. Douglas Jarvis. Senior Product Manager Product Marketing Manager

TRANSFORMING I.T. WITH AN OPEN HYBRID CLOUD

Migration and Building of Data Centers in IBM SoftLayer with the RackWare Management Module

Boas Betzler. Planet. Globally Distributed IaaS Platform Examples AWS and SoftLayer. November 9, IBM Corporation

TOSCA Interoperability Demonstration

CloudCenter Full Lifecycle Management. An application-defined approach to deploying and managing applications in any datacenter or cloud environment

Building an AWS-Compatible Hybrid Cloud with OpenStack

Cisco Intelligent Automation for Cloud

BRINGING NETWORKS TO THE CLOUD ERA

Microsoft Private Cloud

Why Use OPTIMIS? Build and Run Services in the Most Suitable Cloud Venues. May 2013

Software-Defined Networks Powered by VellOS

Plan for Success with a Hybrid Cloud! Thanks to IBM Power Systems OpenStack SoftLayer and UrbanCode

Use Case Brief BUILDING A PRIVATE CLOUD PROVIDING PUBLIC CLOUD FUNCTIONALITY WITHIN THE SAFETY OF YOUR ORGANIZATION

Intel IT Cloud 2013 and Beyond. Name Title Month, Day 2013

EMERGING CLOUD COMPUTING

Zenoss for Cisco ACI: Application-Centric Operations

DevOps. Josh Preston Solutions Architect Stardate

BMC Software Cloud Service Lifecycle

RackWare Solutions Disaster Recovery

VMware Cloud Automation Design and Deploy IaaS Service

HOW SDN AND (NFV) WILL RADICALLY CHANGE DATA CENTRE ARCHITECTURES AND ENABLE NEXT GENERATION CLOUD SERVICES

PLATFORM-AS-A-SERVICE: ADOPTION, STRATEGY, PLANNING AND IMPLEMENTATION

Copyright 2014, Oracle and/or its affiliates. All rights reserved.

Enabling Database-as-a-Service (DBaaS) within Enterprises or Cloud Offerings

Scale Cloud Across the Enterprise

Cisco Prime Network Services Controller. Sonali Kalje Sr. Product Manager Cloud and Virtualization, Cisco Systems

VALUE PROPOSITION FOR SERVICE PROVIDERS. Helping Service Providers accelerate adoption of the cloud

WHITEPAPER. Beyond Infrastructure Virtualization Platform Virtualization, PaaS and DevOps

Addressing Storage Management Challenges using Open Source SDS Controller

SOLUTION WHITE PAPER. Building a flexible, intelligent cloud

Het is een kleine stap naar een hybrid cloud

VIRTUALIZED SERVICES PLATFORM Software Defined Networking for enterprises and service providers

VMware Software-Defined Storage Vision

I D C T E C H N O L O G Y S P O T L I G H T

Enterprise Cloud Solutions

Empowering Private Cloud with Next Generation Infrastructure. Martin Ip, Head of Advanced Solutions and Services Macroview Telecom

Traditional v/s CONVRGD

Management for the Mobile-Cloud Era

Increased Security, Greater Agility, Lower Costs for AWS DELPHIX FOR AMAZON WEB SERVICES WHITE PAPER

Design Your Cloud Strategy for Long-term Success

Private Clouds. Krishnan Subramanian Analyst & Researcher Krishworld.com. A whitepaper sponsored by Trend Micro Inc.

EMC IT AUTOMATES ENTERPRISE PLATFORM AS A SERVICE

Cloudify and OpenStack Heat

Stackato PaaS Architecture: How it works and why.

APPLICATION DELIVERY IN OPENSTACK WITH AVI NETWORKS

MANAGEMENT AND ORCHESTRATION WORKFLOW AUTOMATION FOR VBLOCK INFRASTRUCTURE PLATFORMS

WHY SERVICE PROVIDERS NEED A CARRIER PaaS SOLUTION cpaas for Network

Intel IT s Cloud Journey. Speaker: [speaker name], Intel IT

Industry Brief. The Epic Migration. to Software Defined Storage. SUSE Enterprise Storage. Featuring

RIDE THE SDN AND CLOUD WAVE WITH CONTRAIL

Implementing Microsoft Azure Infrastructure Solutions 20533B; 5 Days, Instructor-led

STRATEGIC WHITE PAPER. The next step in server virtualization: How containers are changing the cloud and application landscape

CA Automation Suite for Data Centers

Clodoaldo Barrera Chief Technical Strategist IBM System Storage. Making a successful transition to Software Defined Storage

Remote Voting Conference

Building the Internet of Things Jim Green - CTO, Data & Analytics Business Group, Cisco Systems

Course 20533B: Implementing Microsoft Azure Infrastructure Solutions

Disaster Recovery As A Service Storage by CloudGrid and Zerto Virtual Replication Disaster Recovery and Business Continuity Platform

<Insert Picture Here> Infrastructure as a Service (IaaS) Cloud Computing for Enterprises

How To Compare The Two Cloud Computing Models

Easy Deployment of Mission-Critical Applications to the Cloud

Elastic Private Clouds

SUSE Linux Enterprise Server for VMware

Master Hybrid Cloud Management with VMware vrealize Suite. Increase Business Agility, Efficiency, and Choice While Keeping IT in Control

Optimally Manage the Data Center Using Systems Management Tools from Cisco and Microsoft

Transcription:

Architecting Self-Managing Distributed Systems Architecting Self-Managing Distributed Systems Dr. Rao Mikkilineni 2015 1

The DIME Computing Model & the Manageability of Computed and the Computer Application of O-Machine Design to Architect Self-Managing Distributed Databases Dr. Rao Mikkilineni C 3 DNA Inc., Santa Clara, CA, USA Dr. Rao Mikkilineni 2015 2

System Resiliency Manageability (Resiliency) The Complexity of Manageability in Distributed Systems Cloud is a pool of on-demand managed resources Cloud Computing Automation of Administration & point of diminishing returns Complexity cliff Conventional computing Layers of resilient infrastructure management Silos Layers of rigid infrastructure management silos Scale Turing Machines do not include manageability and assume infinite resource pool Dr. Rao Mikkilineni 2015 3

.Computation and Its Limits The key property of general-purpose computer is that they are general purpose. We can use them to deterministically model any physical system, of which they are not themselves a part, to an arbitrary degree of accuracy. Their logical limits arise when we try to get them to model a part of the world that includes themselves. Cockshott P., MacKenzie L. M., and Michaelson, G, (2012) Computation and its Limits, Oxford University Press, Oxford. Manageability is in the architecture A non-functional requirement is a requirement that specifies criteria that can be used to judge the operation of a system, rather than specific behaviors. This should be contrasted with functional requirements that define specific behavior or functions. The plan for implementing functional requirements is detailed in the system design. (The Computed) The plan for implementing non-functional requirements is detailed in the system architecture. These requirements include availability, reliability, performance, security, scalability and efficiency at runtime. (The Computer) Dr. Rao Mikkilineni 2015 4

What is the DIME Network Architecture (DNA) DIME (distributed intelligent managed element) is a form of computing that introduces manageability using Oracle* design, Oracle networks and process control by Oracles It configures the computed with appropriate resources, monitors its vital signs and acts to optimize resources based on a blueprints of descriptions of the computers and the computed It manages the Life-cycle quality of computation, including mobility, self-repair, replication, and security * Following Turing s comments in his thesis borrowing the concept of the Oracle of Delphi Dr. Rao Mikkilineni 2015 5

Oracle Design & Super Recursive Computing DIME is the prime instrument for configuring, monitoring and controlling the computation with appropriate resources Managed Computations of DIMEs are composed to create groups Groups are used to compose higher level computations and a group is managed by a group oracle Down-stream or lower level computations are managed by configuring, monitoring and controlling them using an oracle network with global view of all relevant computations at hand The Oracle Dr. Rao Mikkilineni 2015 6

Real-time Signaling The Oracle Machine Design The Computer & the Computed Every computed is configured, monitored and managed to execute in best execution venue (BEV) using an Oracle Oracle is more able than the group it manages Able to influence the computation based on its blueprint of description or specifcation Provides the ability to infuse cognition (self-* properties) into computing Allows Super Recursive Computing at the group level Java, C++, C etc. Legacy Pre-Cloud Applications (on Physical Servers) Resource Blueprint C 3 DNA Platform Web, Databases, NoSQL etc. New Cloud Native Applications (in Containers, VMs or Bare Metal) End-to-End Service Transaction-Level Visibility and Control Infrastructure Agnostic C 3 DNA Application Cognitive HyperContainer Lifecycle Management Plane HyperContainers Service Blueprint Linux Apache Tomcat MySQL OpenLDAP RabbitMQ MongoDB Docker HAProxy Distributed Infrastructure Provisioning Control Plane Public Private Clouds Dr. Rao Mikkilineni 2015 Cloud 7

What is DIME Network Architecture Self-Awareness: Application is embedded along with parallel resource monitors and configuration Managers to optimize resources Self-Reasoning: Signaling overlay network allows service transaction policy management and distributed reasoning at run-time Self-Control: File/device Read/Write control based on local policies driven by global policies and soft-switch for I/O redirection at run-time App fd1 fd2 OS (A) fd1 fd1 fd2 OS (B) Dr. Rao Mikkilineni 2015 8 fd2

System Resiliency The evolution of cloud computing Unification of the computer and the Computed Cloud Computing Automation of Administration & point of diminishing returns Complexity cliff Beyond Cloud Computing: Cognitive Cloud: Hyper-scale, resiliency Conventional computing Layers of resilient infrastructure management Silos & efficiency of computation Model of part of a world that includes the Computer and the computed seamlessly Layers of rigid infrastructure management silos Scaling with number of compute elements Dr. Rao Mikkilineni 2015 9

DIME Network Blueprint DIME Networked Application Group Example Non-functional Requirements Managed Service Network Policies Non-functional Requirements fulfillment Wide Area Application Managers & Policies DIME Workflow Managers Application Area Network Managers Process Dynamics DIME Runtime Libraries OS native Applications Runtime App DB App Applications App DB App DIME DIME DB Reader IP-1 IP-2 IP N-1 IP-N Distributed Computing Cluster Nodes Distributed Intelligent Infrastructure Control Plane DIME Writer Logical Distributed Computing Cluster Network DIME Distributed Cognitive DIME Platform Deamon Docker LXC/LXD IaaS SoftLayer Infrastructure Virtualized or Not Any Public Cloud OS Physical Hardware Managed Distributed Infrastructure Dr. Rao Mikkilineni 2015 10

Video Replicate an existing App 1 Discover & Migrate 2 Replicate Cisco Cluster 1 Cisco Cluster 1 Dr. Rao Mikkilineni 2015 11

Video DB Migration 3 Live-Migrate & Synchronize DB across Multiple Clouds Cisco Cluster 1 Cisco Cluster 2 AWS Cluster Dr. Rao Mikkilineni 2015 12

Video AWS Cloud OpenStack Cloud Dr. Rao Mikkilineni 13 2015

Dr. Rao Mikkilineni 14 2015

Dr. Rao Mikkilineni 2015 15

Display transactions in Real time (transaction contains application and database IP addresses when th transaction is created wherever they are) Dr. Rao Mikkilineni 2015 16

Dr. Rao Mikkilineni 2015 17

MySQL IP address change MySQL IP with address MySQL IP address no transaction change MySQL IP with address loss no change with transaction no change with loss no transaction losstransaction loss Dr. Rao Mikkilineni 18 2015

MySQL IP address change with no transaction loss Dr. Rao Mikkilineni 19 2015

Potential Impact of our Solution on Stakeholders Line-of-Business Assurance of Service Levels, QoS End-to-end visibility and control Precise resource accounting Regulatory Compliance Enterprise IT Policy-driven visibility and control for developer workloads Security and Regulatory compliance No Cloud vendor lock-in Eliminate VM Taxes! Developers Ability to use the latest, coolest development tools Service Composability with QoS Service scalability, reliability and security without worrying about architecture or infrastructure Converged DevOps for dev/test/deploy agility End-to-end visibility across the stack for debugging Dr. Rao Mikkilineni 2015 20

Application 1 Requirement: Easy onboarding of existing and new applications High degree of Reliability and Resilience Application availability & service level assurance across clouds Intercloud Beta in progress at two of their data centers in Texas and Virginia DIME network Solution One-Click onboarding of legacy and new applications Hardening of Intercloud OpenStack infrastructure Apply policies dynamically to change application deployment behavior No dependence on underlying infrastructure. 21 Dr. Rao Mikkilineni 2015 21

Application 2 Requirements: Service Orchestration for high availability, performance optimization and security DIME Network Solution Beta in progress at their data centers with Site to site VPN between CenturyLink cloud and other datacenters One-Click infrastructure provisioning via real time interface. One-Click onboarding of legacy and new applications Apply policies dynamically to change application deployment behavior One-Click inter/intra cloud service orchestration Offer DIME Network service orchestration to CenturyLink cloud customers through their Blueprint offering using their self-service portal On-board applications to CenturyLink Cloud from other clouds and datacenters without disruption to their services Provide inter-cloud and intra-cloud application live-migration, self-repair, autoscaling and self-protection Multi-datacenter/Cloud HA/DR without infrastructure based backup and recovery tools multi-database synchronization, data consistency and live-migration based on RPO & RTO Interoperable shared services between private and public Datacenters 22 Dr. Rao Mikkilineni 2015 22

Co-design Partner - Sabre Holdings Product capability demonstrated by successfully migrating their TripCase app. Requirement Modernizing legacy applications for cloud scaling and resilience Enable proactive monitoring and control of web-scale applications and consistent Dev-Ops process from build and test to operate Tomcat, MySQL, Apache, Passenger etc. DIME Network Solution: Bring Cloud scaling and resilience to legacy applications Policy-based Multi-master HA/DR without re-architecting of MySQL or change database Enterprise-wide standardization of application-level monitoring and control 23 Dr. Rao Mikkilineni 2015 23

"It's very likely that on the basis of philosophy that every error has to be caught, explained, and corrected, a system of the complexity of the living organism would not run for a millisecond. ----- von Neumann, Papers of John von Neumann on Computing and Computing Theory, Hixon Symposium, September 20, 1948, Pasadena, CA, The MIT Press, 1987, p474 The DIME Network Architecture puts safety and survival of application/service first allows the application to self-scale, self-migrate and allows sectionalizing, isolation, diagnosis and fixing of errors at leisure.