One click Hadoop clusters - anywhere



Similar documents
Deploy Your First CF App on Azure with Template and Service Broker. Thomas Shao, Rita Zhang, Bin Xia Microsoft Azure Team

CloudCenter Full Lifecycle Management. An application-defined approach to deploying and managing applications in any datacenter or cloud environment

Managing Hybrid deployments using Cloud Foundry on Azure

Sistemi Operativi e Reti. Cloud Computing

Cloud Computing and Open Source: Watching Hype meet Reality

Deploying Your Application On Public Cloud

HOPS: Hadoop Open Platform-as-a-Service

Savanna Hadoop on. OpenStack. Savanna Technical Lead

HADOOP BIG DATA DEVELOPER TRAINING AGENDA

HP OpenStack & Automation

How Bigtop Leveraged Docker for Build Automation and One-Click Hadoop Provisioning

OpenStack. Orgad Kimchi. Principal Software Engineer. Oracle ISV Engineering. 1 Copyright 2013, Oracle and/or its affiliates. All rights reserved.

VMware on VMware: Private Cloud Case Study Customer Presentation

Cloudify and OpenStack Heat

Get Ship Done! Microservices Cloud Development Made Easy Charles Eckel and David Tootill Cisco Systems

Cisco Prime Network Services Controller. Sonali Kalje Sr. Product Manager Cloud and Virtualization, Cisco Systems

STeP-IN SUMMIT June 18 21, 2013 at Bangalore, INDIA. Performance Testing of an IAAS Cloud Software (A CloudStack Use Case)

Cloud enablement with Flexiant. Cloud Orchestrator

<Insert Picture Here> Private Cloud with Fusion Middleware

Considerations for Adopting PaaS (Platform as a Service)

Powering the Next Generation Cloud with Azure Stack, Nano Server & Windows Server 2016! Jeff Woolsey Principal Program Manager Cloud & Enterprise

Modern App Architecture for the Enterprise Delivering agility, portability and control with Docker Containers as a Service (CaaS)

OpenShift Enterprise PaaS by Red Hat. Andrey Markelov RHCA Red Hat, Presales Solution Architect

CoIP (Cloud over IP): The Future of Hybrid Networking

Hadoop on OpenStack Cloud. Dmitry Mescheryakov Software

Private Cloud Management

Availability Digest. HPE Helion Private Cloud and Cloud Broker Services February 2016

CYCLOPS The Ultimate Billing Framework for Cloud Services Dr. Piyush Harsh, ICCLab InIT Zurich University of Applied Sciences Zürcher Fachhochschule

APP DEVELOPMENT ON THE CLOUD MADE EASY WITH PAAS

Linux A first-class citizen in Windows Azure. Bruno Terkaly bterkaly@microsoft.com Principal Software Engineer Mobile/Cloud/Startup/Enterprise

Beyond Cloud: Implications for the Industrialization of IT and the 3rd Era of Computing. Jonathan

FI-WARE Cloud Overview

OpenFlow/SDN for IaaS Providers

An enterprise- grade cloud management platform that enables on- demand, self- service IT operating models for Global 2000 enterprises

OnApp Cloud. The complete platform for cloud service providers. 114 Cores. 286 Cores / 400 Cores

Windows Azure and private cloud

The Virtualization Practice

Experiences with Transformation to Hybrid Cloud: A Case Study for a Large Financial Enterprise

ArcGIS for Server: In the Cloud

Why the Datacenter needs an Operating System. Dr. Bernd Mathiske Senior Software Architect Mesosphere

AppStack Technology Overview Model-Driven Application Management for the Cloud

A Gentle Introduction to Cloud Computing

WHY SERVICE PROVIDERS NEED A CARRIER PaaS SOLUTION cpaas for Network

IN DETAIL. Smart & Dedicated Servers

Monitor Open stack environments from the bottom up and front to back. Roger Ruttimann VP Engineering, GroundWork OpenSource November 17, 2015

How To Understand Cloud Computing

RED HAT CONTAINER STRATEGY

Use Case Brief CLOUD MANAGEMENT SOFTWARE AUTOMATION

Hadoop-based Open Source ediscovery: FreeEed. (Easy as popcorn)

1 What is Cloud Computing? Cloud Infrastructures OpenStack Amazon EC CAMF Cloud Application Management

Kubernetes-Murano Integration in Mirantis OpenStack 7.0

Why Cisco for Cloud? IT Service Delivery, Orchestration and Automation

Cloud Optimize Your IT

Overview. The OnApp Cloud Platform. Dashboard APPLIANCES. Used Total Used Total. Virtual Servers. Blueprint Servers. Load Balancers.

Aligning Applications and Connectivity to Enable Fast And Safe Cloud Computing Derrick Loi, May 2015

How an Open Source Cloud Will Help Keep Your Cloud Strategy Options Open

'Run any app on any cloud' is CliQr's bold claim

DevOps. Josh Preston Solutions Architect Stardate

OpenNebula Open Souce Solution for DC Virtualization. C12G Labs. Online Webinar

Architecting Self-Managing Distributed Systems

Virtualizing Apache Hadoop. June, 2012

CUMULUX WHICH CLOUD PLATFORM IS RIGHT FOR YOU? COMPARING CLOUD PLATFORMS. Review Business and Technology Series

Sacha Dubois RED HAT TRENDS AND TECHNOLOGY PATH TO AN OPEN HYBRID CLOUD AND DEVELOPER AGILITY. Solution Architect Infrastructure

Business transformation with Hybrid Cloud

Data Sheet Netrounds Control Center

Cloud computing - Architecting in the cloud

Use Case Brief BUILDING A PRIVATE CLOUD PROVIDING PUBLIC CLOUD FUNCTIONALITY WITHIN THE SAFETY OF YOUR ORGANIZATION

Drupal in the Cloud. Scaling with Drupal and Amazon Web Services. Northern Virginia Drupal Meetup

Modern Web development and operations practices. Grig Gheorghiu VP Tech Operations Nasty Gal

CloudStack and Big Data. Sebastien May 22nd 2013 LinuxTag, Berlin

Sun Cloud API: A RESTful Open API for Cloud Computing

Cloud Essentials for Architects using OpenStack

Software Defined Everything

Journey to the Cloud and Application Release Automation Shane Pearson VP, Portfolio & Product Management

SUSE Cloud 2.0. Pete Chadwick. Douglas Jarvis. Senior Product Manager Product Marketing Manager

Apache Ignite TM (Incubating) - In- Memory Data Fabric Fast Data Meets Open Source

VMware ESXi in a Cloud-based Lab David Davis, VCP, VCAP, and vexpert

Hadoop. MPDL-Frühstück 9. Dezember 2013 MPDL INTERN

Amazon EC2 Product Details Page 1 of 5

HP Cloud OS. Платформа OpenStack корпоративного уровня. Иван Кровяков Архитектор облачных решений HP Центральная и Восточная Европа

An Introduction to Private Cloud

Assignment # 1 (Cloud Computing Security)

Hybrid Cloud Management with Red Hat CloudForms

Hadoop in the Hybrid Cloud

KT ucloud storage. Two Years of Life with OpenStack Swift / Jaesuk Ahn, Cloud OS Dev. Team, Korea Telecom

Modern Application Architecture for the Enterprise

OpenNebula Open Souce Solution for DC Virtualization

Transcription:

One click Hadoop clusters - anywhere Janos Matyas, Senior Director of Engineering October, Page 1 2015

Overview Page 2 Introduction Goals and motivations Technology stack How it works Results/achievements/future plans Demo and Q&A

Goals and motivations Full Hadoop stack provisioning everywhere Automate and unify the process Zero-configuration approach Same process through a cluster lifecycle (Dev, QA, UAT, Prod) Provide tooling - UI, REST API and CLI/shell Secure and multi-tenant SLA policy based autoscaling Page 3

Technology stack Docker Swarm Consul Apache Ambari Page 4

Docker Container based virtualization Lightweight and portable Build once, run anywhere Ease of packaging applications Automated and scripted Isolated Page 5

Docker How it works Containers are isolated, but share OS and bins/ libraries No need to emulate hardware Page 6

Swarm Native clustering for Docker Distributed container orchestration Same API as Docker Page 7

Swarm How it works Swarm managers/agents Discovery services Advanced scheduling Page 8

Consul Service discovery/registry Health checking Key/Value store DNS Multi datacenter aware Page 9

Consul How it works Consul servers/agents Consistency through a quorum (RAFT) Scalability due to gossip based protocol (SWIM) Decentralized and fault tolerant Highly available Consistency over availability (CP) Multiple interfaces - HTTP and DNS Support for watches Page 10

Apache Ambari Easy Hadoop cluster provisioning Management and monitoring Key feature - Blueprints REST API, CLI shell Extensible Stacks Services Views Page 11

Apache Ambari How it works Ambari server/agents Define a blueprint (blueprint.json) Define a host mapping (hostmapping.json) Post the cluster create Page 12

Cloudbreak Cloudbreak is a cloud-agnostic Hadoop as a Service API. Abstracts the provisioning and ease Cloudbreak is a powerful left surf that breaks over a coral reef, a mile off southwest the island of Tavarua, Fiji. management and monitoring of on-demand clusters. Page 13

Cloudbreak Benefits Zero configuration Elastic Secure Infrastructure agnostic Heterogenous clusters Auto-scaling Main REST resources /template specify an instance group infrastructure /stack creates an infrastructure based on a template /blueprint describes a Hadoop cluster /cluster creates a Hadoop cluster Page 14

Cloudbreak How it works Start VMs - with a running Docker daemon Cloudbreak Bootstrap Start Consul Cluster Start Swarm Cluster (Consul for discovery) Start Ambari servers/agents - Swarm API Ambari services registered in Consul (Registrator) Post Blueprint Page 15

Cloudbreak - Features Extensible easy to implement Service Provider Interface Cloudbreak recipes Automate host configuration Pre/post Ambari lifecycle hooks Services reconfiguration Automate/execute custom actions Side effects Ambari CLI/shell and Groovy based client Cloud Foundry s UAA Dockerized Munchausen bootstrap Swarm with Consul Dockerized full Hadoop stack (Apache Hadoop 60K+, Ambari 12K+, Spark 10K+ downloads) Page 16

Cloudbreak - Hadoop as a Service API Public tech preview Microsoft Azure Amazon AWS Google Cloud Platform OpenStack Private tech preview R&D Bare metal Rackspace Managed Cloud HP Helion Public Cloud *integration SPI is available Page 17

Cloudbreak SPI Cloud providers have very different API, though model is very similar Non invasive implementation One interface to implement - CloudPlatformConnector Network Security Group Image Subnet Rules Rules UserData Instance IP Volume Address Instance IP Volume Address Instance IP Volume Address Volumes Volumes Volumes Page 18

Periscope Periscope is a powerful, fast, thick and topto-bottom right-hander, eastward from Sumbawa's famous west-coast. Periscope is a heuristic Hadoop scheduler associated with a QoS profile. Built on YARN schedulers, cloud and VM resource management API's it allows to associate SLA's to applications and customers. Page 19

Periscope Benefits Zero configuration Metric and time based alarms SLA policy based autoscaling Secure Hostgroup specific Main REST resources /clusters specify a cluster to be monitored /alerts time and metric based /policies specify an SLA policy for a cluster based on an alarm /applications specify an SLA policy for an application (under development) Page 20

Periscope How it works Configures/monitors alarms in Ambari Setup alarm, cooldown periods Manages cluster sizes Allow to associate SLA scaling policies to alarms Orchestrates Cloudbreak to up/downscale the cluster Page 21

Demo and Q&A Page 22