Bursting to a Hybrid Cloud for Services OFC 2015



Similar documents
Microsoft Big Data Solutions. Anar Taghiyev P-TSP

GTC Presentation March 19, Copyright 2012 Penguin Computing, Inc. All rights reserved

Chapter 19 Cloud Computing for Multimedia Services

Analytics in the Cloud. Peter Sirota, GM Elastic MapReduce

Cloud Courses Description

THE EMC ISILON STORY. Big Data In The Enterprise. Copyright 2012 EMC Corporation. All rights reserved.

Cloud Courses Description

Alternative Deployment Models for Cloud Computing in HPC Applications. Society of HPC Professionals November 9, 2011 Steve Hebert, Nimbix

Elastic Private Clouds

CliQr Technologies CliQr CloudCenter Private Cloud - Page 1. CliQr CloudCenter Simplify Private Cloud Management

The future of Big Data A United Hitachi View

Historians and Production Management as Cloud Applications

Cloud-Based Big Data Analytics in Bioinformatics

Maginatics Cloud Storage Platform for Elastic NAS Workloads

COMPUTER MEASUREMENT GROUP - India Hyderabad Chapter. Strategies to Optimize Cloud Costs By Cloud Performance Monitoring

How To Compare The Two Cloud Computing Models

Introduction to Software Defined Networking (SDN) and how it will change the inside of your DataCentre

LONDON. 2015, Amazon Web Services, Inc. or its affiliates. All rights reserved

Personalized Medicine and IT

Big Data Use Case: Business Analytics

Unified Communications and the Cloud

Using GPUs in the Cloud for Scalable HPC in Engineering and Manufacturing March 26, 2014

Cloud-based Analytics and Map Reduce

Software defined networking. Your path to an agile hybrid cloud network

MagFS: The Ideal File System for the Cloud

Impact of Big Data in Oil & Gas Industry. Pranaya Sangvai Reliance Industries Limited 04 Feb 15, DEJ, Mumbai, India.

Communications in the Cloud: Why It Makes Sense for Today s Business

New solutions for Big Data Analysis and Visualization

SCALABILITY IN THE CLOUD

What Is Big Data? Craig C. Douglas University of Wyoming

How In-Memory Data Grids Can Analyze Fast-Changing Data in Real Time

Big Data & Its Bigger Possibilities In The Cloud

Industry Trends & Challenges in Oil & Gas

Delivering Cloud Services Transformation : Plan > Build> Assure> Secure. Stephen Miles Vice President, Solution Sales, APJ

IEEE International Conference on Computing, Analytics and Security Trends CAST-2016 (19 21 December, 2016) Call for Paper

Advanced Big Data Analytics with R and Hadoop

Analyzing Big Data with AWS

Big Data on the Open Cloud

Cloud 101. Mike Gangl, Caltech/JPL, 2015 California Institute of Technology. Government sponsorship acknowledged

Virtualization: The entire suite of communication services can be deployed in a virtualized environment 2.

Simplifying Data Data Center Center Network Management Leveraging SDN SDN

Software Defined Hybrid IT. Execute your 2020 plan

What is Analytic Infrastructure and Why Should You Care?

An Mformation Whitepaper ENTERPRISE MOBILITY SOLUTIONS FROM THE CLOUD REMOVE THE BARRIERS 1

Software Defined Networking and Network Virtualization

Increased Security, Greater Agility, Lower Costs for AWS DELPHIX FOR AMAZON WEB SERVICES WHITE PAPER

SAP HANA An In-Memory Data Platform for Real-Time Business

HadoopTM Analytics DDN

Scientific and Technical Applications as a Service in the Cloud

Amazon Elastic MapReduce. Jinesh Varia Peter Sirota Richard Cole

Service offerings Emerging Technologies - Cloud Computing

BIG DATA: FIVE TACTICS TO MODERNIZE YOUR DATA WAREHOUSE

Communications in the Cloud Why It Makes Sense for Today s Business

An Overview of the Open Cloud Consortium

Accenture Cloud Platform at v3 - the Airbnb or Uber of cloud?

HOW SDN AND (NFV) WILL RADICALLY CHANGE DATA CENTRE ARCHITECTURES AND ENABLE NEXT GENERATION CLOUD SERVICES

Nokia Networks. Nokia Networks. telco cloud is on the brink of live deployment

DeIC Watson Agreement - hvad betyder den for DeIC medlemmerne

Enterprise Mobility Space

Trends driving software-defined storage

Bruhati Technologies. About us. ISO 9001:2008 certified. Technology fit for Business

Cloud Based Solutions for Media and Entertainment

Parallel Computing: Strategies and Implications. Dori Exterman CTO IncrediBuild.

Global Headquarters: 5 Speen Street Framingham, MA USA P F

Cloud for Large Enterprise Where to Start. Terry Wise Director, Business Development Amazon Web Services

OpenStack in the Enterprise: From Strategy to Real Life. Radhesh Balakrishnan General Manager OpenStack

Big Data on AWS. Services Overview. Bernie Nallamotu Principle Solutions Architect

Transformation to a ITaaS Model & the Cloud

Cloud Computing: Making the right choices

Scalability in the Cloud HPC Convergence with Big Data in Design, Engineering, Manufacturing

Big Data Analytics: Today's Gold Rush November 20, 2013

Hadoop in the Hybrid Cloud

Cloud, where are we? Mark Potts, HP Fellow, CTO Cloud November 2014

Transcription:

Bursting to a Hybrid Cloud for Services OFC 2015

Big Data applications Big Compute in the cloud Why burst to the cloud? Opportunities 2

Big Data Apps Need Big Compute Life Sciences Bioinformatics Next Gen Sequence Analysis Video in Media & Entertainment Video Transcoding Rendering Engineering Design and Simulation Computer Aided Engineering Fluid Dynamics Simulation Analytics Hadoop Predictive Algorithms Data mining Oil & Gas Seismic Processing Reservoir Simulation Characteristics Computational intense Large datasets Increasingly distributed Processing speed matters 3

Big Compute in Life Science CardioDX performs genomic research. One of their major initiatives over the past several years was developing a predictive test that could identify coronary artery disease in its most nascent stages. To do so, researchers at the company analyzed over 100 million gene samples to ultimately identify the 23 primary predictive genes for coronary artery disease. The resulting test, known as the Corus CAD Test, was recognized as on of the Top Ten Medical Breakthroughs of 2010 by TIME Magazine. Single human genome is approximately 3 GBytes of information. (http://sandwalk.blogspot.com/2011/03/how-big-is-human-genome.html) Approximately 20,000 genes / genome = 150Kb / gene 100 Million gene samples = 15,000 TB of information 1 per 1TB internal hard drive = 1250 feet. 4

Big Compute in Life Science Burrows-Wheeler Alignment Part of genomic sequencing Prelude to pattern discernment Application Highly parallel compute Huge data set Numerous iterations Cloud based solutions Single workstation 1x Cloud cluster 324x Accelerated hybrid 2268x # Jobs per Hour 35.0 30.0 25.0 20.0 15.0 10.0 5.0 0.0 Genome Mapping Application (BWA) Per Iteration Amazon AWS Hybrid Cloud 5

Big Compute in Analytics Real-time analytics Application: Real time tracking of paid TV media and the related earned digital activity across social, search & video Big Data problem: Proprietary audio and video fingerprinting identifies content 67.4k TV spots, 38.7MM airings across 103 broadcast/tv networks All in real-time Big Compute problem: Need results in minutes, not hours/days/weeks Difficult to forecast compute demand, or immediate needs of clients 6

Big Compute in Analytics Top 10 Digital Share of Voice (SOV) of the week (3/16/15) Android: Friends Furever 1.9% Digital SOV (share of voice) 1,099,154 Online Views 19,689 Social Actions $235,641 Est. TV Spend 7

Big Compute in Engineering Simulation Fluid modeling Application: Fluid dynamics simulation Big Data problem: Fine-point analysis provides more accurate and complete simulation results Big Compute problem: Fine-point analysis multiplies the complexity of the simulation exponentially. Simulation cycle time is real money to clients Solution: Cloud-based processing, network and storage for purpose-built application 8

Big Compute in Engineering Simulation Fluid modeling Yesterday Application on workstation Dataset in workstation Low utilization Compute time: 5 hours / iteration $35k on workstation Today Burst to application in cloud Dataset in private cloud On-demand compute Compute time: 30 minutes/iteration $19k in cloud 9

Bursting Why? Batch and/or lumpy application demand Short-lived projects Capex and Opex Cost Don t want to wait for IT How? Application orchestration Compute (Big Compute PaaS) Network (SDN, Openflow) Storage (Openstack, Hadoop) Big Data + Big Compute requires data movement Inside datacenter (easiest) Datacenter to datacenter (easier) Public network (cost and availability) 10

Opportunities Data movement Technically solved, operationally cumbersome Costs are high, prohibitive or highly variable (help!) Standards moving very quickly (help!) Hybrid architectures Two fundamental types of storage for big data Three fundamental types of compute Applications need optimization and abstraction (help!) Application burst orchestration (read: simple, simple, simple) Private clouds (homogenous case) Virtual private clouds (turnkey by integrator) Private / public clouds (help!) Applications drive value, not the network. Vertically integrated MSPs usually lead the way (help!) 11