2012 AIRI Petabyte Challenge Chris Dagdigian

Size: px
Start display at page:

Download "2012 AIRI Petabyte Challenge Chris Dagdigian chris@bioteam.net"

Transcription

1 Mapping Informatics To the Cloud 2012 AIRI Petabyte Challenge Chris Dagdigian

2 I m Chris. I m an infrastructure geek. I work for the BioTeam.

3 The C Word.

4 When I say cloud I m talking IaaS.

5 Amazon AWS Is the IaaS cloud. Most others are fooling themselves. (Has-beens, also-rans & delusional marketing zombies)

6 A message for the pretenders

7 No APIs? Not a cloud.

8 No self-service? Not a cloud.

9 I have to a human? Not a cloud.

10 ~50% failure rate when provisioning new servers? Stupid cloud.

11 Block storage and virtual servers only? (barely) a cloud;

12 Private Clouds: My $.02

13 Private Clouds in 2012: Hype vs. Reality ratio still wacky Sensible only for certain shops Have you seen what you have to do to your networks & gear? There are easier ways

14 Private Clouds: My Advice for 12 Remain cynical (test vendor claims) Due Diligence still essential I personally would not deploy/buy anything that does not explicitly provide Amazon API compatibility

15 Private Clouds: My Advice for 12 Most people are better off: Adding VM platforms to existing HPC clusters & environments Extending enterprise VM platforms to allow user self-service & server catalogs

16 Enough Bloviating. Advice time.

17 Tip #1

18 HPC & Clouds: Whole New World

19 We have spent decades learning to tune research HPC systems for shared access & many users. The cloud upends this model

20 Far more common to see Dedicated cloud resources spun up for each app or use case Each system gets individually tuned & optimized

21 Tip #2

22 Hybrid Clouds & Cloud Bursting

23 Lots of aggressive marketing Lots of carefully constructed case studies and prototypes The truth? Less usable than you ve been told Possible? Heck yeah. Practical? Only sometimes.

24 Advice Be cynical Demand proof Test carefully

25 Still want to do it? Buy it, don t build it Cycle Computing Univa BrightComputing

26 Follow the crowd In the real world we see: Separation between local and cloud HPC resources Send your work to the system most suitable

27 Tip #3

28 You can t rewrite EVERYTHING.

29 Salesfolk will just glibly tell you to rewrite your apps so you can use whatever big data analysis framework they happen to be selling today

30 They have no clue.

31 In life science informatics we have hundreds of codes that will never be rewritten. We ll be needing them for years to come.

32 Advice: MapReduceish methods are the future for big-data informatics It will take years to get there We still have to deal with legacy algorithms and codes

33 You will need: A process for figuring out when it s worthwhile to rewrite/re-architect Tested cloud strategies for handling three use cases

34 You need 3 cloud architectures: 1. Legacy HPC 2. Cloudy HPC 3. Big Data HPC (Hadoop)

35 Legacy HPC on the cloud MIT StarCluster This is your baseline Extend as needed

36 Cloudy HPC Use this method when It makes sense to rewrite or rearchitect an HPC workflow to better leverage modern cloud capabilities

37 Cloudy HPC, continued Ditch the legacy compute farm model Leverage elastic scale-out tools (***) Spot Instances for elastic & cheap compute SimpleDB for job statekeeping SQS for job queues & workrflow glue SNS for message passing & monitoring S3 for input & output data Etc.

38 Big Data HPC It s gonna be a MapReduce world Little need to roll your own Ecosystem already healthy Multiple providers today Often a slam-dunk cloud use case

39 Tip #4

40 The Cloud was not designed for us

41 HPC is an edge case for the hyperscale IaaS clouds We need to deal with this and engineer around it.

42 Many examples Eventual consistency Networking & subnets Latency Node placement

43 Advice Manage expectations Benchmark & test Evangelize (pester the cloud sales reps )

44 Tip #5

45 Data Movement Is Still Hard

46 Consistently getting easier Amazon is not a bottleneck AWS Import/Export AWS Direct Connect Aspera has some amazing stuff out right now

47 Advice AWS Import/Export works well Size of pipe is not everything Sweat the small stuff Tracking, checksums, disk speed Dedicated workstations Secure media storage

48 Dedicated data movement station

49 naked Terabyte-scale data movement

50 Don t overlook media storage

51 Advice for 2012 BioTeam is dialing down our advocacy of physical data ingestion into the cloud Why? Operationally hard, expensive and no longer strictly needed

52 Real world cross-country internet-based data movement March 2012

53 700Mb/sec into Amazon, stress-free & zero tuning March 2012

54 People trying to move data via physical media quickly realize the operational difficulties Bandwidth is cheaper than hiring another body to manage physical data ingestion & movement In 2012 we strongly recommend network-based data movement when at all possible

55 u r doing it wrong

56 cool data movement, bro!

57 Tips #6 & 7

58 Cloud storage. Still slow.

59 Big shared storage. Still hard.

60 Not much we can do except engineer around it AWS compute cluster instances are a huge step forward AWS competitors take note

61 We are not database nerds We care about more than just random IO performance We need it all Random I/O Long sequential read/write

62 Faster Storage Options Software RAID on EBS Various GlusterFS options Even if you optimize everything, the virtual NICs are still a bottleneck

63 Big Shared Storage 10GbE nodes and NFS Software RAID sets GlusterFS or similar 2012: pnfs finally?

64 Tip #8

65 Things fail differently in the cloud.

66 Stuff breaks It breaks in weird ways Transient/temporary issues more common than what we see at home

67 Advice Pessimism is good Design for failure Think hard about How will you detect? How will you respond?

68 Advice Remove humans from loop Automate recovery Automate your backups

69 Tip #9

70 Serial/batch computing at-scale

71 Loosely coupled workflows are ideal Break the pipeline into discrete components Components should be able to scale up down independently

72 Component = Opportunity to: Make a scaling decision (# nodes in use) Make sizing decision (instance type in use)

73 Nirvana is

74 independent loosely connected components that can self-scale and communicate asynchronously

75 Advice: Many people already doing this Best practices are well known Steal from the best: RightScale, Opscode & Cycle Computing

76 Phew. Think I m done now.

77 Questions? Slides available at

78 End;

79 Backup Slides

80 Private Clouds: Pick Your Poison OpenStack - Pro: Super smart developers; significant mindshare; True Open Source Con: Commitment to AWS API compatibility (?) & stability

81 Private Clouds: Pick Your Poison CloudStack- Pro: Explicit AWS API support; very recent move away from open-core model; usability Con: Developer mindshare? Sudden switch to Apache

82 Private Clouds: Pick Your Poison Eucalyptus- Pro: Direct AWS API compatibility; lots of hypervisor support Con: Open-core model; mindshare; Recent ressurection

Cloud Sobriety. Technical challenges in mapping Informatics to the cloud. Chris Dagdigian 2010 NHGRI Cloud Workshop

Cloud Sobriety. Technical challenges in mapping Informatics to the cloud. Chris Dagdigian 2010 NHGRI Cloud Workshop Cloud Sobriety Technical challenges in mapping Informatics to the cloud Chris Dagdigian 2010 NHGRI Cloud Workshop Welcome To Day 2! Excellent talk lineup for today! Focus on implementation, architecture

More information

Utility Computing For Cynics

Utility Computing For Cynics Utility Computing For Cynics 2009 Amazon NYC Start-Up Tour Chris Dagdigian BioTeam Inc. Topics for Today Who we are: Why we agreed to speak Why we use AWS: How we came to drink the Kool-aid Pharma Example:

More information

Grid Engine & Amazon EC2

Grid Engine & Amazon EC2 Grid Engine & Amazon EC2 2009 Sun HPC Workshop Chris Dagdigian BioTeam Inc. Utility Computing For Cynics Doubt and cynicism are totally appropriate Personally burned by 90 s era OMG!! GRID Computing Hype

More information

Storage Architectures for Big Data in the Cloud

Storage Architectures for Big Data in the Cloud Storage Architectures for Big Data in the Cloud Sam Fineberg HP Storage CT Office/ May 2013 Overview Introduction What is big data? Big Data I/O Hadoop/HDFS SAN Distributed FS Cloud Summary Research Areas

More information

Scalable Architecture on Amazon AWS Cloud

Scalable Architecture on Amazon AWS Cloud Scalable Architecture on Amazon AWS Cloud Kalpak Shah Founder & CEO, Clogeny Technologies kalpak@clogeny.com 1 * http://www.rightscale.com/products/cloud-computing-uses/scalable-website.php 2 Architect

More information

Data Management & Storage for NGS

Data Management & Storage for NGS Data Management & Storage for NGS 2009 Pre-Conference Workshop Chris Dagdigian BioTeam Inc. Independent Consulting Shop: Vendor/technology agnostic Staffed by: Scientists forced to learn High Performance

More information

STeP-IN SUMMIT 2013. June 18 21, 2013 at Bangalore, INDIA. Performance Testing of an IAAS Cloud Software (A CloudStack Use Case)

STeP-IN SUMMIT 2013. June 18 21, 2013 at Bangalore, INDIA. Performance Testing of an IAAS Cloud Software (A CloudStack Use Case) 10 th International Conference on Software Testing June 18 21, 2013 at Bangalore, INDIA by Sowmya Krishnan, Senior Software QA Engineer, Citrix Copyright: STeP-IN Forum and Quality Solutions for Information

More information

Migration Scenario: Migrating Batch Processes to the AWS Cloud

Migration Scenario: Migrating Batch Processes to the AWS Cloud Migration Scenario: Migrating Batch Processes to the AWS Cloud Produce Ingest Process Store Manage Distribute Asset Creation Data Ingestor Metadata Ingestor (Manual) Transcoder Encoder Asset Store Catalog

More information

SGE & Amazon EC2. Chris Dagdigian 2008 OSGC Conference

SGE & Amazon EC2. Chris Dagdigian 2008 OSGC Conference SGE & Amazon EC2 Chris Dagdigian 2008 OSGC Conference Putting my $$ where my mouth is I m about to pay $.30 for the privilege of showing you this demo by firing up a 3-node cluster within EC2 Using cheap

More information

Cloud for Large Enterprise Where to Start. Terry Wise Director, Business Development Amazon Web Services

Cloud for Large Enterprise Where to Start. Terry Wise Director, Business Development Amazon Web Services Cloud for Large Enterprise Where to Start Terry Wise Director, Business Development Amazon Web Services Amazon Retail Business Tens of millions of active customer accounts Seven countries: US, UK, Germany,

More information

Scalable Application. Mikalai Alimenkou http://xpinjection.com 11.05.2012

Scalable Application. Mikalai Alimenkou http://xpinjection.com 11.05.2012 Scalable Application Development on AWS Mikalai Alimenkou http://xpinjection.com 11.05.2012 Background Java Technical Lead/Scrum Master at Zoral Labs 7+ years in software development 5+ years of working

More information

Cloud Sensibility. Hype aside, what can the cloud do for life sciences today? Chris Dagdigian 2010 Bio-IT-World Cloud Workshop

Cloud Sensibility. Hype aside, what can the cloud do for life sciences today? Chris Dagdigian 2010 Bio-IT-World Cloud Workshop Cloud Sensibility Hype aside, what can the cloud do for life sciences today? Chris Dagdigian 2010 Bio-IT-World Cloud Workshop Cloud Sensibility Hype aside, what can the cloud do for life sciences today?

More information

Introduction to Red Hat Storage. January, 2012

Introduction to Red Hat Storage. January, 2012 Introduction to Red Hat Storage January, 2012 1 Today s Speakers 2 Heather Wellington Tom Trainer Storage Program Marketing Manager Storage Product Marketing Manager Red Hat Acquisition of Gluster What

More information

Eucalyptus: An Open-source Infrastructure for Cloud Computing. Rich Wolski Eucalyptus Systems Inc. www.eucalyptus.com

Eucalyptus: An Open-source Infrastructure for Cloud Computing. Rich Wolski Eucalyptus Systems Inc. www.eucalyptus.com Eucalyptus: An Open-source Infrastructure for Cloud Computing Rich Wolski Eucalyptus Systems Inc. www.eucalyptus.com Exciting Weather Forecasts Commercial Cloud Formation Eucalyptus - Confidential What

More information

Cloud Computing: Making the right choices

Cloud Computing: Making the right choices Cloud Computing: Making the right choices Kalpak Shah Clogeny Technologies Pvt Ltd 1 About Me Kalpak Shah Founder & CEO, Clogeny Technologies Passionate about economics and technology evolving through

More information

Are you ready for your Journey to the cloud? Maybe some of you are already using some cloud- based services?

Are you ready for your Journey to the cloud? Maybe some of you are already using some cloud- based services? 1 2 Are you ready for your Journey to the cloud? Maybe some of you are already using some cloud- based services? 3 Anyway, you ve finally decided to take the big step forward in the unknown, and to start

More information

Cloud Computing For Bioinformatics

Cloud Computing For Bioinformatics Cloud Computing For Bioinformatics Cloud Computing: what is it? Cloud Computing is a distributed infrastructure where resources, software, and data are provided in an on-demand fashion. Cloud Computing

More information

Understanding Virtualization and Cloud in the Enterprise

Understanding Virtualization and Cloud in the Enterprise Understanding Virtualization and Cloud in the Enterprise James Staten Vice President, Principal Analyst Forrester Research Virtualization is evolving toward cloud but won t be subsumed by it 2 What s different

More information

Building an AWS-Compatible Hybrid Cloud with OpenStack

Building an AWS-Compatible Hybrid Cloud with OpenStack Building an AWS-Compatible Hybrid Cloud with OpenStack AWS is Transforming IT Amazon Web Services (AWS) commands a significant lead in the public cloud services market, with revenue estimated to grow from

More information

Product Brochure. Hedvig Distributed Storage Platform Modern Storage for Modern Business. Elastic. Accelerate data to value. Simple.

Product Brochure. Hedvig Distributed Storage Platform Modern Storage for Modern Business. Elastic. Accelerate data to value. Simple. Product Brochure Elastic Scales to petabytes of data Start with as few as two nodes and scale to thousands. Add capacity if and when needed. Embrace the economics of commodity x86 infrastructure to build

More information

Amazon Web Services 100 Success Secrets

Amazon Web Services 100 Success Secrets Amazon Web Services Amazon Web Services Made Simple: Learn how Amazon EC2, S3, SimpleDB and SQS Web Services enables you to reach business goals faster Copyright 2008 Amazon Web Services 100 Success Secrets

More information

Automation and DevOps Best Practices. Rob Hirschfeld, Dell Matt Ray, Opscode

Automation and DevOps Best Practices. Rob Hirschfeld, Dell Matt Ray, Opscode Automation and DevOps Best Practices Rob Hirschfeld, Dell Matt Ray, Opscode Deploying & Managing a Cloud is not simple. Deploying to physical gear on layered networks Multiple interlocking projects Hundreds

More information

Data Centers and Cloud Computing

Data Centers and Cloud Computing Data Centers and Cloud Computing CS377 Guest Lecture Tian Guo 1 Data Centers and Cloud Computing Intro. to Data centers Virtualization Basics Intro. to Cloud Computing Case Study: Amazon EC2 2 Data Centers

More information

Data Centers and Cloud Computing. Data Centers

Data Centers and Cloud Computing. Data Centers Data Centers and Cloud Computing Slides courtesy of Tim Wood 1 Data Centers Large server and storage farms 1000s of servers Many TBs or PBs of data Used by Enterprises for server applications Internet

More information

Amazon Web Services. Elastic Compute Cloud (EC2) and more...

Amazon Web Services. Elastic Compute Cloud (EC2) and more... Amazon Web Services Elastic Compute Cloud (EC2) and more... I don t work for Amazon I do however, have a small research grant from Amazon (in AWS$) Portions of this presentation are reproduced from slides

More information

Cloud Computing and Amazon Web Services. CJUG March, 2009 Tom Malaher

Cloud Computing and Amazon Web Services. CJUG March, 2009 Tom Malaher Cloud Computing and Amazon Web Services CJUG March, 2009 Tom Malaher Agenda What is Cloud Computing? Amazon Web Services (AWS) Other Offerings Composing AWS Services Use Cases Ecosystem Reality Check Pros&Cons

More information

Eucalyptus: An Open-source Infrastructure for Cloud Computing. Rich Wolski Eucalyptus Systems Inc. www.eucalyptus.com

Eucalyptus: An Open-source Infrastructure for Cloud Computing. Rich Wolski Eucalyptus Systems Inc. www.eucalyptus.com Eucalyptus: An Open-source Infrastructure for Cloud Computing Rich Wolski Eucalyptus Systems Inc. www.eucalyptus.com Exciting Weather Forecasts Commercial Cloud Formation Eucalyptus - Confidential What

More information

Boas Betzler. Planet. Globally Distributed IaaS Platform Examples AWS and SoftLayer. November 9, 2015. 20014 IBM Corporation

Boas Betzler. Planet. Globally Distributed IaaS Platform Examples AWS and SoftLayer. November 9, 2015. 20014 IBM Corporation Boas Betzler Cloud IBM Distinguished Computing Engineer for a Smarter Planet Globally Distributed IaaS Platform Examples AWS and SoftLayer November 9, 2015 20014 IBM Corporation Building Data Centers The

More information

Emerging Technology for the Next Decade

Emerging Technology for the Next Decade Emerging Technology for the Next Decade Cloud Computing Keynote Presented by Charles Liang, President & CEO Super Micro Computer, Inc. What is Cloud Computing? Cloud computing is Internet-based computing,

More information

Cloud Computing. Lecture 24 Cloud Platform Comparison 2014-2015

Cloud Computing. Lecture 24 Cloud Platform Comparison 2014-2015 Cloud Computing Lecture 24 Cloud Platform Comparison 2014-2015 1 Up until now Introduction, Definition of Cloud Computing Pre-Cloud Large Scale Computing: Grid Computing Content Distribution Networks Cycle-Sharing

More information

Object Storage: A Growing Opportunity for Service Providers. White Paper. Prepared for: 2012 Neovise, LLC. All Rights Reserved.

Object Storage: A Growing Opportunity for Service Providers. White Paper. Prepared for: 2012 Neovise, LLC. All Rights Reserved. Object Storage: A Growing Opportunity for Service Providers Prepared for: White Paper 2012 Neovise, LLC. All Rights Reserved. Introduction For service providers, the rise of cloud computing is both a threat

More information

On- Prem MongoDB- as- a- Service Powered by the CumuLogic DBaaS Platform

On- Prem MongoDB- as- a- Service Powered by the CumuLogic DBaaS Platform On- Prem MongoDB- as- a- Service Powered by the CumuLogic DBaaS Platform Page 1 of 16 Table of Contents Table of Contents... 2 Introduction... 3 NoSQL Databases... 3 CumuLogic NoSQL Database Service...

More information

Migration Scenario: Migrating Backend Processing Pipeline to the AWS Cloud

Migration Scenario: Migrating Backend Processing Pipeline to the AWS Cloud Migration Scenario: Migrating Backend Processing Pipeline to the AWS Cloud Use case Figure 1: Company C Architecture (Before Migration) Company C is an automobile insurance claim processing company with

More information

Cloud Computing and Open Source: Watching Hype meet Reality

Cloud Computing and Open Source: Watching Hype meet Reality Cloud Computing and Open Source: Watching Hype meet Reality Rich Wolski UCSB Computer Science Eucalyptus Systems Inc. May 26, 2011 Exciting Weather Forecasts 99 M 167 M 6.5 M What is a cloud? SLAs Web

More information

Introduction to Cloud : Cloud and Cloud Storage. Lecture 2. Dr. Dalit Naor IBM Haifa Research Storage Systems. Dalit Naor, IBM Haifa Research

Introduction to Cloud : Cloud and Cloud Storage. Lecture 2. Dr. Dalit Naor IBM Haifa Research Storage Systems. Dalit Naor, IBM Haifa Research Introduction to Cloud : Cloud and Cloud Storage Lecture 2 Dr. Dalit Naor IBM Haifa Research Storage Systems 1 Advanced Topics in Storage Systems for Big Data - Spring 2014, Tel-Aviv University http://www.eng.tau.ac.il/semcom

More information

The Incremental Advantage:

The Incremental Advantage: The Incremental Advantage: MIGRATE TRADITIONAL APPLICATIONS FROM YOUR ON-PREMISES VMWARE ENVIRONMENT TO THE HYBRID CLOUD IN FIVE STEPS CONTENTS Introduction..................... 2 Five Steps to the Hybrid

More information

Integrating Remote Cloud and Local HPC Resources

Integrating Remote Cloud and Local HPC Resources Integrating Remote Cloud and Local HPC Resources Angel Pizarro Institute for Translational Medicine and Therapuetics Perelman School of Medicine University of Pennsylvania Acknowledgements Los Jefes: Garret

More information

Solving I/O Bottlenecks to Enable Superior Cloud Efficiency

Solving I/O Bottlenecks to Enable Superior Cloud Efficiency WHITE PAPER Solving I/O Bottlenecks to Enable Superior Cloud Efficiency Overview...1 Mellanox I/O Virtualization Features and Benefits...2 Summary...6 Overview We already have 8 or even 16 cores on one

More information

Which is Better: Virtualization or Cloud IaaS?

Which is Better: Virtualization or Cloud IaaS? A White Paper Which is Better: Virtualization or Cloud IaaS? Which is Better: Virtualization or Cloud IaaS? Why should I consider Virtualization or Cloud IaaS at all? Simply put, they can save you money.

More information

Availability Digest. www.availabilitydigest.com. @availabilitydig. HPE Helion Private Cloud and Cloud Broker Services February 2016

Availability Digest. www.availabilitydigest.com. @availabilitydig. HPE Helion Private Cloud and Cloud Broker Services February 2016 the Availability Digest @availabilitydig HPE Helion Private Cloud and Cloud Broker Services February 2016 HPE Helion is a complete portfolio of cloud products and services that offers enterprise security,

More information

House of Cards. IaaS without storage performance testing. Howard Marks, Deep Storage Len Rosenthal, Load DynamiX

House of Cards. IaaS without storage performance testing. Howard Marks, Deep Storage Len Rosenthal, Load DynamiX House of Cards IaaS without storage performance testing Howard Marks, Deep Storage Len Rosenthal, Load DynamiX 1 Your NOT So Humble Speaker 25+ years of consulting and writing for trade press Columnist/blogger

More information

Data Centers and Cloud Computing. Data Centers

Data Centers and Cloud Computing. Data Centers Data Centers and Cloud Computing Intro. to Data centers Virtualization Basics Intro. to Cloud Computing 1 Data Centers Large server and storage farms 1000s of servers Many TBs or PBs of data Used by Enterprises

More information

Lustre * Filesystem for Cloud and Hadoop *

Lustre * Filesystem for Cloud and Hadoop * OpenFabrics Software User Group Workshop Lustre * Filesystem for Cloud and Hadoop * Robert Read, Intel Lustre * for Cloud and Hadoop * Brief Lustre History and Overview Using Lustre with Hadoop Intel Cloud

More information

Cloud computing - Architecting in the cloud

Cloud computing - Architecting in the cloud Cloud computing - Architecting in the cloud anna.ruokonen@tut.fi 1 Outline Cloud computing What is? Levels of cloud computing: IaaS, PaaS, SaaS Moving to the cloud? Architecting in the cloud Best practices

More information

Big Data - Infrastructure Considerations

Big Data - Infrastructure Considerations April 2014, HAPPIEST MINDS TECHNOLOGIES Big Data - Infrastructure Considerations Author Anand Veeramani / Deepak Shivamurthy SHARING. MINDFUL. INTEGRITY. LEARNING. EXCELLENCE. SOCIAL RESPONSIBILITY. Copyright

More information

Financial Services Grid Computing on Amazon Web Services January 2013 Ian Meyers

Financial Services Grid Computing on Amazon Web Services January 2013 Ian Meyers Financial Services Grid Computing on Amazon Web Services January 2013 Ian Meyers (Please consult http://aws.amazon.com/whitepapers for the latest version of this paper) Page 1 of 15 Contents Abstract...

More information

Infrastructure for Cloud Computing

Infrastructure for Cloud Computing Eucalyptus: an Open-Source Infrastructure for Cloud Computing Rich Wolski Chris Grzegorczyk, Dan Nurmi, Graziano Obertelli, Woody Rollins, Sunil Soman, Lamia Youseff, Dmitrii Zagorodnov The Eucalyptus

More information

I D C A N A L Y S T C O N N E C T I O N

I D C A N A L Y S T C O N N E C T I O N I D C A N A L Y S T C O N N E C T I O N Gary Chen Research Manager, Enterprise Virtualization Software C a p i t a l iz i n g on the Cloud Revo l u t i o n w ith OpenStack March 2014 OpenStack was a project

More information

Building a Private Cloud with Eucalyptus

Building a Private Cloud with Eucalyptus Building a Private Cloud with Eucalyptus 5th IEEE International Conference on e-science Oxford December 9th 2009 Christian Baun, Marcel Kunze KIT The cooperation of Forschungszentrum Karlsruhe GmbH und

More information

Big Data on AWS. Services Overview. Bernie Nallamotu Principle Solutions Architect

Big Data on AWS. Services Overview. Bernie Nallamotu Principle Solutions Architect on AWS Services Overview Bernie Nallamotu Principle Solutions Architect \ So what is it? When your data sets become so large that you have to start innovating around how to collect, store, organize, analyze

More information

Cloud-based Analytics and Map Reduce

Cloud-based Analytics and Map Reduce 1 Cloud-based Analytics and Map Reduce Datasets Many technologies converging around Big Data theme Cloud Computing, NoSQL, Graph Analytics Biology is becoming increasingly data intensive Sequencing, imaging,

More information

Intro to AWS: Storage Services

Intro to AWS: Storage Services Intro to AWS: Storage Services Matt McClean, AWS Solutions Architect 2015, Amazon Web Services, Inc. or its affiliates. All rights reserved AWS storage options Scalable object storage Inexpensive archive

More information

CloudStack and Big Data. Sebastien Goasguen @sebgoa May 22nd 2013 LinuxTag, Berlin

CloudStack and Big Data. Sebastien Goasguen @sebgoa May 22nd 2013 LinuxTag, Berlin CloudStack and Big Data Sebastien Goasguen @sebgoa May 22nd 2013 LinuxTag, Berlin Google trends Start of Clouds Cloud computing trending down, while Big Data is booming. Virtualization BigData on the Trigger

More information

Benchmarking Sahara-based Big-Data-as-a-Service Solutions. Zhidong Yu, Weiting Chen (Intel) Matthew Farrellee (Red Hat) May 2015

Benchmarking Sahara-based Big-Data-as-a-Service Solutions. Zhidong Yu, Weiting Chen (Intel) Matthew Farrellee (Red Hat) May 2015 Benchmarking Sahara-based Big-Data-as-a-Service Solutions Zhidong Yu, Weiting Chen (Intel) Matthew Farrellee (Red Hat) May 2015 Agenda o Why Sahara o Sahara introduction o Deployment considerations o Performance

More information

Who moved my cloud? Part I: Introduction to Private, Public and Hybrid clouds and smooth migration

Who moved my cloud? Part I: Introduction to Private, Public and Hybrid clouds and smooth migration Who moved my cloud? Part I: Introduction to Private, Public and Hybrid clouds and smooth migration Part I of an ebook series of cloud infrastructure and platform fundamentals not to be avoided when preparing

More information

Expand Your Infrastructure with the Elastic Cloud. Mark Ryland Chief Solutions Architect Jenn Steele Product Marketing Manager

Expand Your Infrastructure with the Elastic Cloud. Mark Ryland Chief Solutions Architect Jenn Steele Product Marketing Manager Expand Your Infrastructure with the Elastic Cloud Mark Ryland Chief Solutions Architect Jenn Steele Product Marketing Manager Today we re going to talk about The Cloud Scenarios Questions You Probably

More information

How To Cloud Compute At The Cloud At The Cyclone Center For Cnc

How To Cloud Compute At The Cloud At The Cyclone Center For Cnc Cloud Computing at CDC Current Status and Future Plans Earl Baum March, 2014 1 Background Current Activities Agenda Use Cases, Shared Services and Other Considerations What s Next 2 Background Cloud Definition

More information

DevOps with Containers. for Microservices

DevOps with Containers. for Microservices DevOps with Containers for Microservices DevOps is a Software Development Method Keywords Communication, collaboration, integration, automation, measurement Goals improved deployment frequency faster time

More information

Evolving Datacenter and Cloud Connectivity Services

Evolving Datacenter and Cloud Connectivity Services Evolving Datacenter and Cloud Connectivity Services May 2015 KVH Co., Ltd. www.kvhasia.com Public Cloud is NOT for Everything There are certain type of systems that are not ideal to be operated on public

More information

Sistemi Operativi e Reti. Cloud Computing

Sistemi Operativi e Reti. Cloud Computing 1 Sistemi Operativi e Reti Cloud Computing Facoltà di Scienze Matematiche Fisiche e Naturali Corso di Laurea Magistrale in Informatica Osvaldo Gervasi ogervasi@computer.org 2 Introduction Technologies

More information

Part V Applications. What is cloud computing? SaaS has been around for awhile. Cloud Computing: General concepts

Part V Applications. What is cloud computing? SaaS has been around for awhile. Cloud Computing: General concepts Part V Applications Cloud Computing: General concepts Copyright K.Goseva 2010 CS 736 Software Performance Engineering Slide 1 What is cloud computing? SaaS: Software as a Service Cloud: Datacenters hardware

More information

Cloud Computing Now and the Future Development of the IaaS

Cloud Computing Now and the Future Development of the IaaS 2010 Cloud Computing Now and the Future Development of the IaaS Quanta Computer Division: CCASD Title: Project Manager Name: Chad Lin Agenda: What is Cloud Computing? Public, Private and Hybrid Cloud.

More information

AMAZON S3: ARCHITECTING FOR RESILIENCY IN THE FACE OF FAILURES Jason McHugh

AMAZON S3: ARCHITECTING FOR RESILIENCY IN THE FACE OF FAILURES Jason McHugh AMAZON S3: ARCHITECTING FOR RESILIENCY IN THE FACE OF FAILURES Jason McHugh CAN YOUR S ERVICE S URVIVE? CAN YOUR S ERVICE S URVIVE? CAN YOUR SERVICE SURVIVE? Datacenter loss of connectivity Flood Tornado

More information

Data Centers and Cloud Computing. Data Centers. MGHPCC Data Center. Inside a Data Center

Data Centers and Cloud Computing. Data Centers. MGHPCC Data Center. Inside a Data Center Data Centers and Cloud Computing Intro. to Data centers Virtualization Basics Intro. to Cloud Computing Data Centers Large server and storage farms 1000s of servers Many TBs or PBs of data Used by Enterprises

More information

Amazon Web Services Workshop

Amazon Web Services Workshop Amazon Web Services Workshop Block 1: 10:45-11:10 Overview 2010 ISMB Cloud Workshop, Boston MA - Who am I? I m from the BioTeam Independent consulting shop Staffed by scientists forced to learn IT to get

More information

CLOUD APPLICATION INTEGRATION AND DEPLOYMENT MADE SIMPLE

CLOUD APPLICATION INTEGRATION AND DEPLOYMENT MADE SIMPLE E-Guide CLOUD APPLICATION INTEGRATION AND DEPLOYMENT MADE SIMPLE SearchCloud Applications C loud application integration and continue to be some of the top for software developers. In this e-guide, learn

More information

Comparing Open Source Private Cloud (IaaS) Platforms

Comparing Open Source Private Cloud (IaaS) Platforms Comparing Open Source Private Cloud (IaaS) Platforms Lance Albertson OSU Open Source Lab Associate Director of Operations lance@osuosl.org / @ramereth About me OSU Open Source Lab Server hosting for Open

More information

Big Data Storage Options for Hadoop Sam Fineberg, HP Storage

Big Data Storage Options for Hadoop Sam Fineberg, HP Storage Sam Fineberg, HP Storage SNIA Legal Notice The material contained in this tutorial is copyrighted by the SNIA unless otherwise noted. Member companies and individual members may use this material in presentations

More information

Cloud Based Architectures in Ground Systems of Space Missions

Cloud Based Architectures in Ground Systems of Space Missions Cloud Based Architectures in Ground Systems of Space Missions M. Sarkarati, M. Merri, M. Spada HSO-GDA ESA/ESOC ESA unclassified for public use Copyright European Space Agency, 2013 Published by The Aerospace

More information

DISTRIBUTED SYSTEMS [COMP9243] Lecture 9a: Cloud Computing WHAT IS CLOUD COMPUTING? 2

DISTRIBUTED SYSTEMS [COMP9243] Lecture 9a: Cloud Computing WHAT IS CLOUD COMPUTING? 2 DISTRIBUTED SYSTEMS [COMP9243] Lecture 9a: Cloud Computing Slide 1 Slide 3 A style of computing in which dynamically scalable and often virtualized resources are provided as a service over the Internet.

More information

Utilizing the SDSC Cloud Storage Service

Utilizing the SDSC Cloud Storage Service Utilizing the SDSC Cloud Storage Service PASIG Conference January 13, 2012 Richard L. Moore rlm@sdsc.edu San Diego Supercomputer Center University of California San Diego Traditional supercomputer center

More information

Scaling up to Production

Scaling up to Production 1 Scaling up to Production Overview Productionize then Scale Building Production Systems Scaling Production Systems Use Case: Scaling a Production Galaxy Instance Infrastructure Advice 2 PRODUCTIONIZE

More information

ENABLING GLOBAL HADOOP WITH EMC ELASTIC CLOUD STORAGE

ENABLING GLOBAL HADOOP WITH EMC ELASTIC CLOUD STORAGE ENABLING GLOBAL HADOOP WITH EMC ELASTIC CLOUD STORAGE Hadoop Storage-as-a-Service ABSTRACT This White Paper illustrates how EMC Elastic Cloud Storage (ECS ) can be used to streamline the Hadoop data analytics

More information

WHITE PAPER. Software Defined Storage Hydrates the Cloud

WHITE PAPER. Software Defined Storage Hydrates the Cloud WHITE PAPER Software Defined Storage Hydrates the Cloud Table of Contents Overview... 2 NexentaStor (Block & File Storage)... 4 Software Defined Data Centers (SDDC)... 5 OpenStack... 5 CloudStack... 6

More information

The Scenario: Priority Matrix for Cloud Computing

The Scenario: Priority Matrix for Cloud Computing Cloud is here The Scenario: Priority Matrix for Cloud Computing Benefit / Yrs to mainstream adoption Transformational High Moderate Low Less than 2 yrs 2 to 5 yrs 5 to 10 yrs Virtualization Cloud APaaS

More information

Scalability in the Cloud HPC Convergence with Big Data in Design, Engineering, Manufacturing

Scalability in the Cloud HPC Convergence with Big Data in Design, Engineering, Manufacturing Scalability in the Cloud HPC Convergence with Big Data in Design, Engineering, Manufacturing July 7, 2014 David Pellerin, Business Development Principal Amazon Web Services What Do We Hear From Customers?

More information

Cloud Computing In Reality: Experience sharing in cloud solution developments and evaluations

Cloud Computing In Reality: Experience sharing in cloud solution developments and evaluations Cloud Computing In Reality: Experience sharing in cloud solution developments and evaluations Speaker Profile Dr. Chokchai Box Leangsuksun SWECO Endowned Professor Louisiana Tech University CTO, PB Tech

More information

Will They Blend?: Exploring Big Data Computation atop Traditional HPC NAS Storage

Will They Blend?: Exploring Big Data Computation atop Traditional HPC NAS Storage Will They Blend?: Exploring Big Data Computation atop Traditional HPC NAS Storage Ellis H. Wilson III 1,2 Mahmut Kandemir 1 Garth Gibson 2,3 1 Department of Computer Science and Engineering, The Pennsylvania

More information

A Comparison of Clouds: Amazon Web Services, Windows Azure, Google Cloud Platform, VMWare and Others (Fall 2012)

A Comparison of Clouds: Amazon Web Services, Windows Azure, Google Cloud Platform, VMWare and Others (Fall 2012) 1. Computation Amazon Web Services Amazon Elastic Compute Cloud (Amazon EC2) provides basic computation service in AWS. It presents a virtual computing environment and enables resizable compute capacity.

More information

Hybrid Cloud Mini Roundtable. April 17, 2013. Expect Excellence. www.divihn.com

Hybrid Cloud Mini Roundtable. April 17, 2013. Expect Excellence. www.divihn.com Hybrid Cloud Mini Roundtable April 17, 2013 Expect Excellence www.divihn.com Today s Agenda What to expect today Introductions Plus, why are you here this evening? What is Hybrid Cloud? Why Hybrid Cloud?

More information

Red Hat Storage Server

Red Hat Storage Server Red Hat Storage Server Marcel Hergaarden Solution Architect, Red Hat marcel.hergaarden@redhat.com May 23, 2013 Unstoppable, OpenSource Software-based Storage Solution The Foundation for the Modern Hybrid

More information

OpenStack Alberto Molina Coballes

OpenStack Alberto Molina Coballes OpenStack Alberto Molina Coballes Teacher at IES Gonzalo Nazareno alb.molina@gmail.com @alberto_molina Table of Contents From public to private clouds Open Source Cloud Platforms Why OpenStack? OpenStack

More information

HYPER-CONVERGED INFRASTRUCTURE STRATEGIES

HYPER-CONVERGED INFRASTRUCTURE STRATEGIES 1 HYPER-CONVERGED INFRASTRUCTURE STRATEGIES MYTH BUSTING & THE FUTURE OF WEB SCALE IT 2 ROADMAP INFORMATION DISCLAIMER EMC makes no representation and undertakes no obligations with regard to product planning

More information

Dominion Enterprises and Amazon Web Services. Going Hybrid Joe Fuller, VP/CIO December 10, 2013

Dominion Enterprises and Amazon Web Services. Going Hybrid Joe Fuller, VP/CIO December 10, 2013 Dominion Enterprises and Amazon Web Services Going Hybrid Joe Fuller, VP/CIO December 10, 2013 ...is a marketing services company focusing on classifieds advertising. Products include: web portals, classifieds

More information

Design and Evolution of the Apache Hadoop File System(HDFS)

Design and Evolution of the Apache Hadoop File System(HDFS) Design and Evolution of the Apache Hadoop File System(HDFS) Dhruba Borthakur Engineer@Facebook Committer@Apache HDFS SDC, Sept 19 2011 Outline Introduction Yet another file-system, why? Goals of Hadoop

More information

Cloud Computing Paradigm

Cloud Computing Paradigm Cloud Computing Paradigm Julio Guijarro Automated Infrastructure Lab HP Labs Bristol, UK 2008 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice

More information

Managing Traditional Workloads Together with Cloud Computing Workloads

Managing Traditional Workloads Together with Cloud Computing Workloads Managing Traditional Workloads Together with Cloud Computing Workloads Table of Contents Introduction... 3 Cloud Management Challenges... 3 Re-thinking of Cloud Management Solution... 4 Teraproc Cloud

More information

CLOUD COMPUTING THOMAS BOLTZE CTO SKY & SAND THOMAS.BOLTZE@SKY-SAND.COM

CLOUD COMPUTING THOMAS BOLTZE CTO SKY & SAND THOMAS.BOLTZE@SKY-SAND.COM Sky&Sand CLOUD COMPUTING THOMAS BOLTZE CTO SKY & SAND THOMAS.BOLTZE@SKY-SAND.COM What is Cloud Computing? A new way to bill for resources: Hourly (virtual server, RDS) Data volume (S3, Glacier, SendGrid)

More information

Building Blocks of the Private Cloud

Building Blocks of the Private Cloud www.cloudtp.com Building Blocks of the Private Cloud Private clouds are exactly what they sound like. Your own instance of SaaS, PaaS, or IaaS that exists in your own data center, all tucked away, protected

More information

Reference Architecture and Best Practices for Virtualizing Hadoop Workloads Justin Murray VMware

Reference Architecture and Best Practices for Virtualizing Hadoop Workloads Justin Murray VMware Reference Architecture and Best Practices for Virtualizing Hadoop Workloads Justin Murray ware 2 Agenda The Hadoop Journey Why Virtualize Hadoop? Elasticity and Scalability Performance Tests Storage Reference

More information

Building a Cloud Computing Platform based on Open Source Software. 10. 18. 2011. Donghoon Kim ( donghoon.kim@kt.com ) Yoonbum Huh ( huhbum@kt.

Building a Cloud Computing Platform based on Open Source Software. 10. 18. 2011. Donghoon Kim ( donghoon.kim@kt.com ) Yoonbum Huh ( huhbum@kt. Building a Cloud Computing Platform based on Open Source Software 10. 18. 2011. Donghoon Kim ( donghoon.kim@kt.com ) Yoonbum Huh ( huhbum@kt.com) Topics I.Open Source SW and Cloud Computing II. About OpenStack

More information

Cloud Computing and Amazon Web Services

Cloud Computing and Amazon Web Services Cloud Computing and Amazon Web Services Gary A. McGilvary edinburgh data.intensive research 1 OUTLINE 1. An Overview of Cloud Computing 2. Amazon Web Services 3. Amazon EC2 Tutorial 4. Conclusions 2 CLOUD

More information

How Server And Network Virtualization Make Data Centers More Dynamic

How Server And Network Virtualization Make Data Centers More Dynamic January 2009 How Server And Network Virtualization Make Data Centers More Dynamic A commissioned study conducted by Forrester Consulting on behalf of Cisco Systems Table Of Contents Executive Summary...3

More information

How To Scale A Server Farm

How To Scale A Server Farm Basics of Cloud Computing Lecture 3 Scaling Applications on the Cloud Satish Srirama Outline Scaling Information Systems Scaling Enterprise Applications in the Cloud Auto Scaling 25/02/2014 Satish Srirama

More information

THE REALITIES OF NOSQL BACKUPS

THE REALITIES OF NOSQL BACKUPS THE REALITIES OF NOSQL BACKUPS White Paper Trilio Data, Inc. March 2015 1 THE REALITIES OF NOSQL BACKUPS TABLE OF CONTENTS INTRODUCTION... 2 NOSQL DATABASES... 2 PROBLEM: LACK OF COMPREHENSIVE BACKUP AND

More information

Next Generation Data Centers: Hyperconverged Architectures Impact On Storage. PRESENTATION TITLE GOES HERE Mark O Connell Distinguished Engineer EMC

Next Generation Data Centers: Hyperconverged Architectures Impact On Storage. PRESENTATION TITLE GOES HERE Mark O Connell Distinguished Engineer EMC Next Generation Data Centers: Hyperconverged Architectures Impact On Storage PRESENTATION TITLE GOES HERE Mark O Connell Distinguished Engineer EMC Abstract A modern data center typically contains a number

More information

Cloud Computing. Up until now

Cloud Computing. Up until now Cloud Computing Lecture 20 Cloud Platform Comparison & Load 2011-2012 Up until now Introduction, Definition of Cloud Computing Pre-Cloud Large Scale Computing: Grid Computing Content Distribution Networks

More information

Cloud Computing. Up until now

Cloud Computing. Up until now Cloud Computing Lecture 20 Cloud Platform Comparison & Load Balancing 2010-2011 Up until now Introduction, Definition of Cloud Computing Pre-Cloud Large Scale Computing: Grid Computing Content Distribution

More information

Hedvig Distributed Storage Platform with Cisco UCS

Hedvig Distributed Storage Platform with Cisco UCS REFERENCE ARCHITECTURE 1 Hedvig Distributed Storage Platform with Cisco UCS Solution Overview and Reference Architecture Elastic Scale to petabytes of data Start with as few as two nodes and scale to thousands.

More information

Amazon AWS in.net. Presented by: Scott Reed scottr@develop.com

Amazon AWS in.net. Presented by: Scott Reed scottr@develop.com Amazon AWS in.net Presented by: Scott Reed scottr@develop.com Objectives Cloud Computing What Amazon provides Why Amazon Web Services? Q&A Instances Interacting with Instances Management Console Command

More information