Building low cost disk storage with Ceph and OpenStack Swift




Paweł Woszuk, Maciej Brzeźniak
TERENA TF-Storage meeting in Zurich, Feb 10-11th, 2014
Background photo from: http://edelomahony.com/2011/07/25/loving-money-doesnt-bring-you-more/

Low-cost storage motivations (1)
- Pressure for high-capacity, low-cost storage
  - Data volumes are growing rapidly (data deluge, big data)
  - Budgets do not grow as quickly as storage needs
- The storage market follows the cloud market
- Virtualisation causes an explosion of storage usage (deduplication does not always offset the growing number of disk images)

Low-cost storage motivations (2)
- NRENs are under pressure from industry
  - Pricing (see the S3 price list)
  - Features compared with Dropbox, Google Drive
  - Scale-out capability (can we have it?)
  - Integration with IaaS services (VM + storage)
- Issues when building storage on disk arrays
  - Relatively high investment and maintenance costs
  - Vendor lock-in
  - Closed architecture, limited scalability
  - Slow adoption of new technologies

Topics covered
- Strategy
- Technology
- Pricing / costs
- Collaboration opportunity

PSNC strategy / approach
- Build a private storage cloud, i.e. build rather than buy
  - Public cloud adoption is still problematic
- Use an object storage architecture
  - Scalable, no centralisation, open architecture
  - HA thanks to component redundancy
- Run a pilot system using:
  - Open source software
  - A cost-efficient server platform
- Test the solutions:
  - Various software / hardware mixtures
  - Various workloads: plain storage, sync & share, VMs, video

Software: open source platforms considered
- OpenStack Swift [diagram]: user apps upload and download through a load balancer and a set of proxy nodes, which sit in front of the storage nodes
- Ceph [diagram]: an app, host/VM or client reaches the cluster through RadosGW, RBD, CephFS or LibRados, all on top of RADOS; the cluster runs MDS (MDS.1 ... MDS.n), MON (MON.1 ... MON.n) and OSD (OSD.1 ... OSD.n) daemons

Software: OpenStack Swift

Software: Ceph
[Diagram: an app, host/VM or client accesses the cluster through RadosGW (S3 and Swift APIs), RBD, CephFS or LibRados, all on top of RADOS. MDS daemons (MDS.1 ... MDS.n) and monitors (MON.1 ... MON.n) run alongside. Pools (Pool 1 ... Pool n) are divided into placement groups (PG 1 ... PG n), which the CRUSH map assigns to the OSDs on the cluster nodes.]

Ceph OSD selection

Ceph OSD selection + write to replicas
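The two selection slides can be approximated in a few lines. The following is a minimal sketch and not Ceph's actual algorithm: it maps an object name to a placement group by hashing, then ranks OSDs for that group with rendezvous hashing as a simplified stand-in for CRUSH (real CRUSH also takes the cluster hierarchy and device weights into account). The function names, the pool and object names, and the PG count of 128 are all hypothetical.

```python
import hashlib


def stable_hash(*parts: str) -> int:
    """Deterministic 64-bit hash of the given strings (illustrative only)."""
    data = "/".join(parts).encode("utf-8")
    return int.from_bytes(hashlib.sha256(data).digest()[:8], "big")


def object_to_pg(object_name: str, pg_count: int) -> int:
    """Step 1: hash the object name into one of the pool's placement groups."""
    return stable_hash(object_name) % pg_count


def pg_to_osds(pool: str, pg_id: int, osds: list[str], replicas: int = 3) -> list[str]:
    """Step 2: pick `replicas` distinct OSDs for a placement group.

    Rendezvous hashing is used here as a stand-in for CRUSH: every client
    computes the same ranking without asking a central lookup service.
    """
    ranked = sorted(osds, key=lambda osd: stable_hash(pool, str(pg_id), osd), reverse=True)
    return ranked[:replicas]


def locate(pool: str, object_name: str, osds: list[str],
           pg_count: int = 128, replicas: int = 3) -> tuple[int, str, list[str]]:
    """Return (pg_id, primary_osd, replica_osds) for an object.

    The client writes to the primary OSD, which forwards the write
    to the remaining OSDs in the set.
    """
    pg_id = object_to_pg(object_name, pg_count)
    acting = pg_to_osds(pool, pg_id, osds, replicas)
    return pg_id, acting[0], acting[1:]


if __name__ == "__main__":
    cluster = [f"osd.{i}" for i in range(12)]  # hypothetical 12-OSD cluster
    pg, primary, others = locate("pool1", "vm-image-001", cluster)
    print(f"PG {pg}: primary {primary}, replicas {others}")
```

Because placement is computed rather than looked up, any client holding the same cluster map arrives at the same OSD set, which is what lets the architecture avoid a central metadata bottleneck for object data.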

Software: OpenStack Swift vs Ceph
- Scalability
- Architecture/features, e.g. load balancing: external for Swift, built into the architecture for Ceph
- Implementation: Swift in Python, Ceph in C/C++
- Maturity
- User base
- Available know-how

Hardware
- Different people use different back-ends:
  - Pancakes (1U, 12 drives) vs fat nodes (4U, 36+ drives)
  - HDDs vs SSDs
  - 1 Gbit vs 10 Gbit connectivity
- PSNC:
  - 1st stage: regular servers from an HPC cluster
    - 1 HDD (data) + 1 SSD (meta-data, FS journal)
    - 1 Gbit for clients, InfiniBand within the cluster
  - 2nd stage: pilot installation of 16 servers
    - 12 HDDs: data + meta-data
    - 10 HDDs (data) + 2 SSDs (meta-data + FS journal, possibly caching)
    - 10 Gbit connectivity
    - Software and hardware comparison tests

Pancake stack storage rack: Quanta Stratos S100-L11SL

Pancake photos
Photos by PSNC; photo from: http://www.quantaqct.com/en/01_product/02_detail.php?mid=27&sid=158&id=159&qs=100=

Pancake in action
- The diagnostic panel on the server front shows the status of each disk drive (useful when dealing with hundreds of drives)
- Server read performance in throughput mode reaches 1.5 GB/s (dstat output under stress test)
Photos by PSNC

Costs (investment / TCO vs capacity)
Assumptions:
- The analysis covers a 5-year server lifecycle
- Investment cost includes a 5-year warranty
- Total cost includes:
  - Investment costs
  - Power & cooling, room cost
  - Personnel costs

Monthly TCO / TB
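The chart values are not reproduced in this transcript, but the metric itself follows from the cost assumptions. Below is a minimal sketch of how a monthly TCO per usable TB could be computed, assuming the 5-year lifecycle and the 3x replication used in the analysis. Every number in the example call is a made-up placeholder, not a PSNC figure, and the function and parameter names are hypothetical.

```python
def monthly_tco_per_tb(investment: float,
                       yearly_power_cooling_room: float,
                       yearly_personnel: float,
                       raw_capacity_tb: float,
                       replication: int = 3,
                       lifecycle_years: int = 5) -> float:
    """Monthly total cost of ownership per usable TB.

    Total cost = investment (incl. warranty) + running costs over the lifecycle.
    Usable capacity = raw capacity divided by the number of replicas kept.
    """
    running = (yearly_power_cooling_room + yearly_personnel) * lifecycle_years
    total_cost = investment + running
    usable_tb = raw_capacity_tb / replication
    months = lifecycle_years * 12
    return total_cost / months / usable_tb


if __name__ == "__main__":
    # Placeholder inputs for illustration only (currency units are arbitrary).
    cost = monthly_tco_per_tb(
        investment=100_000,
        yearly_power_cooling_room=6_000,
        yearly_personnel=4_000,
        raw_capacity_tb=600,
    )
    print(f"{cost:.2f} per usable TB per month")
```

Dividing by usable rather than raw capacity matters: with 3x replication, the same hardware bill is spread over a third of the disk space.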

Pricing by Amazon

Conclusions (1)
- NRENs can compete on pricing with industry
  - In the end we may use similar hardware and software components
  - Can we compete with our SLAs?
  - Can we scale out? How do we do it?
- Cheap storage is not that cheap
  - Hardware:
    - The analysis does not use extremely cheap components
    - We could use even cheaper hardware, but:
      - Do we want it? (operational costs, know-how cost)
      - Are we really able to provide SLAs on top of it?
  - Software:
    - We need RAID-like mechanisms, e.g. erasure coding, to increase storage efficiency (the analysis assumed 3x replication)
- There is definitely room to collaborate
  - Exchange of know-how and experience
  - Exchange of storage capacity/services? Technically possible, but politics are always difficult
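The storage-efficiency point can be made concrete with a little arithmetic. This sketch compares the 3x replication assumed in the analysis with an erasure-coded layout of k data chunks plus m coding chunks. The 4+2 profile is a hypothetical example chosen because, like 3x replication, it survives the loss of two devices; it is not a configuration taken from the slides.

```python
def replication_efficiency(copies: int) -> float:
    """Fraction of raw capacity that holds unique data with N full copies."""
    return 1 / copies


def erasure_coding_efficiency(k: int, m: int) -> float:
    """Fraction of raw capacity that holds unique data with k data + m coding chunks."""
    return k / (k + m)


def raw_needed_tb(usable_tb: float, efficiency: float) -> float:
    """Raw disk capacity required to store `usable_tb` of user data."""
    return usable_tb / efficiency


if __name__ == "__main__":
    usable = 100  # TB of user data, arbitrary example size
    rep = replication_efficiency(3)
    ec = erasure_coding_efficiency(4, 2)  # hypothetical 4+2 profile
    print(f"3x replication: {rep:.0%} efficient, {raw_needed_tb(usable, rep):.0f} TB raw")
    print(f"EC 4+2:         {ec:.0%} efficient, {raw_needed_tb(usable, ec):.0f} TB raw")
```

Under these assumptions the erasure-coded layout needs half the raw capacity of 3x replication for the same data, at the price of extra CPU and network work during writes and recovery.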

Conclusions (2)
- We should examine the possibility of using different hardware solutions:
  - Backblaze's Storage Pod: http://en.wikipedia.org/wiki/file:storagepod.jpg
  - Open Vault storage array by the Open Compute Project
  - Servers based on off-the-shelf components

Conclusions (3)
Storage row in PSNC's data center in 2 years (see: blog.backblaze.org)