Storage Trends File and Object Based Storage and how NFS-Ganesha can play

Similar documents
Why is it a better NFS server for Enterprise NAS?

Open Source, Scale-out clustered NAS using nfs-ganesha and GlusterFS

Introduction to Highly Available NFS Server on scale out storage systems based on GlusterFS

Four Reasons To Start Working With NFSv4.1 Now

Distributed File System Choices: Red Hat Storage, GFS2 & pnfs

Linux Powered Storage:

NFS Ganesha and Clustered NAS on Distributed Storage System, GlusterFS. Soumya Koduri Meghana Madhusudhan Red Hat

Sep 23, OSBCONF 2014 Cloud backup with Bareos

Building Storage as a Service with OpenStack. Greg Elkinbard Senior Technical Director

pnfs State of the Union FAST-11 BoF Sorin Faibish- EMC, Peter Honeyman - CITI

Storage Architectures for Big Data in the Cloud

Introduction to NetApp Infinite Volume

Getting performance & scalability on standard platforms, the Object vs Block storage debate. Copyright 2013 MPSTOR LTD. All rights reserved.

Federated Cloud File System Framework

Scala Storage Scale-Out Clustered Storage White Paper

The Panasas Parallel Storage Cluster. Acknowledgement: Some of the material presented is under copyright by Panasas Inc.

10th TF-Storage Meeting

Building low cost disk storage with Ceph and OpenStack Swift

Product Spotlight. A Look at the Future of Storage. Featuring SUSE Enterprise Storage. Where IT perceptions are reality

Storage Virtualization in Cloud

High Performance Computing OpenStack Options. September 22, 2015

SUSE Enterprise Storage Highly Scalable Software Defined Storage. Gábor Nyers Sales

SMB in the Cloud David Disseldorp

Acronis Storage Gateway

Copyright 2011, Storage Strategies Now, Inc. All Rights Reserved.

Big data management with IBM General Parallel File System

Introduction to Gluster. Versions 3.0.x

How to Choose your Red Hat Enterprise Linux Filesystem

Object Storage A Dell Point of View

Understanding Enterprise NAS

Scale and Availability Considerations for Cluster File Systems. David Noy, Symantec Corporation

Data Storage in Clouds

IBM Infrastructure for Long Term Digital Archiving

Enterprise Storage Solution for Hyper-V Private Cloud and VDI Deployments using Sanbolic s Melio Cloud Software Suite April 2011

Red Hat Storage Server

Big Data Storage Options for Hadoop Sam Fineberg, HP Storage

WHITE PAPER. Software Defined Storage Hydrates the Cloud

Testing of several distributed file-system (HadoopFS, CEPH and GlusterFS) for supporting the HEP experiments analisys. Giacinto DONVITO INFN-Bari

Rapid Data Backup and Restore Using NFS on IBM ProtecTIER TS7620 Deduplication Appliance Express IBM Redbooks Solution Guide

EMC ISILON AND ELEMENTAL SERVER

Lessons learned from parallel file system operation

VIDEO SURVEILLANCE WITH SURVEILLUS VMS AND EMC ISILON STORAGE ARRAYS

<Insert Picture Here> Managing Storage in Private Clouds with Oracle Cloud File System OOW 2011 presentation

Panasas at the RCF. Fall 2005 Robert Petkus RHIC/USATLAS Computing Facility Brookhaven National Laboratory. Robert Petkus Panasas at the RCF

Whitepaper. NexentaConnect for VMware Virtual SAN. Full Featured File services for Virtual SAN

SUSE Linux uutuudet - kuulumiset SUSECon:sta

Improving Scalability Of Storage System:Object Storage Using Open Stack Swift

Actifio Big Data Director. Virtual Data Pipeline for Unstructured Data

ZADARA STORAGE. Managed, hybrid storage EXECUTIVE SUMMARY. Research Brief

Technology Insight Series

UEFI on Dell BizClient Platforms

Caringo Swarm 7: beyond the limits of traditional storage. A new private cloud foundation for storage needs at scale

Large Scale Storage. Orlando Richards, Information Services LCFG Users Day, University of Edinburgh 18 th January 2013

Storage Management for the Oracle Database on Red Hat Enterprise Linux 6: Using ASM With or Without ASMLib

An Oracle White Paper July Oracle ACFS

Network Attached Storage. Jinfeng Yang Oct/19/2015

Database Hardware Selection Guidelines

GPFS-OpenStack Integration. Dinesh Subhraveti IBM Research

BIG DATA-AS-A-SERVICE

Integrated and reliable the heart of your iseries system. i5/os the next generation iseries operating system

VDI Optimization Real World Learnings. Russ Fellows, Evaluator Group

RED HAT STORAGE SERVER TECHNICAL OVERVIEW

StorPool Distributed Storage Software Technical Overview

Accelerating and Simplifying Apache

Cloud File System Gateway & Cloud Data Management Interface (CDMI)

SECURE, ENTERPRISE FILE SYNC AND SHARE WITH EMC SYNCPLICITY UTILIZING EMC ISILON, EMC ATMOS, AND EMC VNX

SWIFT. Page:1. Openstack Swift. Object Store Cloud built from the grounds up. David Hadas Swift ATC. HRL 2012 IBM Corporation

September 2009 Cloud Storage for Cloud Computing

Chapter 11 Distributed File Systems. Distributed File Systems

New Features in PSP2 for SANsymphony -V10 Software-defined Storage Platform and DataCore Virtual SAN

RED HAT STORAGE PORTFOLIO OVERVIEW

IBM Global Technology Services September NAS systems scale out to meet growing storage demand.

Ceph. A complete introduction.

Diagram 1: Islands of storage across a digital broadcast workflow

RED HAT ENTERPRISE VIRTUALIZATION FOR SERVERS: COMPETITIVE FEATURES

High Availability Solutions for the MariaDB and MySQL Database

Object storage in Cloud Computing and Embedded Processing

NFSv4.1 Server Protocol Compliance, Security, Performance and Scalability Testing - Implement RFC, Going Beyond POSIX Interop!

RED HAT STORAGE SERVER An introduction to Red Hat Storage Server architecture

EMC NetWorker Module for Microsoft Exchange Server Release 5.1

POWER ALL GLOBAL FILE SYSTEM (PGFS)

Ceph Distributed Storage for the Cloud An update of enterprise use-cases at BMW

Introduction to PCI Express Positioning Information

Cloud and Big Data initiatives. Mark O Connell, EMC

High Availability Storage

IBM ELASTIC STORAGE SEAN LEE

Interoperable Cloud Storage with the CDMI Standard

VMware ESX 2.5 Server Software Backup and Restore Guide on Dell PowerEdge Servers and PowerVault Storage

Maxta Storage Platform Enterprise Storage Re-defined

IBM Storwize Rapid Application Storage solutions

TECHNICAL WHITE PAPER: ELASTIC CLOUD STORAGE SOFTWARE ARCHITECTURE

Engineering a NAS box

Maginatics Cloud Storage Platform Feature Primer

OPTIMIZING PRIMARY STORAGE WHITE PAPER FILE ARCHIVING SOLUTIONS FROM QSTAR AND CLOUDIAN

Ceph. A file system a little bit different. Udo Seidel

Transcription:

Storage Trends File and Object Based Storage and how NFS-Ganesha can play Venkateswararao Jujjuri (JV) File systems and Storage Architect IBM Linux Technology center jvrao@us.ibm.com jujjuri@gmail.com 2014

2/28 Outline Data is Exploding Storage Trends Unstructured Data Need for new solution Object Store File vs Object and Object details Big question and answer FOBS File and Object Based Storage Object Storage details and variations NFS Evolution and pnfs and future NFS-Ganesha Conclusions

Data is Exploding We create IDC Says Growth Data will will reach grow from 4.4ZB today to 44 ZB by 2020 3/28 Source: http://www-01.ibm.com/software/data/bigdata/what-is-big-data.html

Storage Trends Data growth is around 70%/ year, most of it is unstructured. Scale-out rather than scale-up. Object is gaining lot of traction but file is not going away; NAS will stay as significant player. Analysts predict NAS grow at a CAGR of 25.44% over 2013-2018. (http://cti.tmcnet.com/news/2014/04/04/7762020.htm) Unified Storage NAS, SAN, and Object Growth mantra: FOBS IDC Projections * Structured Data Will grow At a 21.8% CAGR * Unstructured Data Will grow At a 61.7% CAGR Market Needs and Adoption 2000 Direct Attached Storage SAN 2010 Network Attached Storage NFS,CIFS 2020 File and Object Based Storage 4/28

5/28 Unstructured Data Basically non-database data Usually generated on an event (Cheese...Click) Typically no access or read access (photos, xrays, dental recs) Tough to interpret the content (jpeg can be a silly pic or blueprint) Emails, Instant Messages, Documents, Spread Sheets, Graphics, Images, Videos, Social Media, Medical Records, wearable. on.. and.. on... Explosive growth in search for cost effectiveness and manageability. Why not continue file/nas model? Simple access model No need for heavy POSIX interface. Scale-Out: Hierarchical model is more of an overhead Context: Difficult to build context of an individual file. (need entire path) Metadata is distributed hence complex/inefficient policies. Loose/Eventual consistency is often good enough.

6/28 Need for a new Solution Requirements Simple interface Easy access, no need to traverse through dirs/subdirs Context of the contents Scale-Out capabilities Massive and Cheap Easy policies for ILM

File vs Object Penguin.jpg File Object FileName: Penguin.jpg Times: atime, mtime, ctime etc Owner:Group Permissions: Unix style, ACLs etc ObjectId FileTyle Times Camera Info: Resolution: Owner Name: Location: Copyright: Orientation YcbCr positioning Compression Exposure Time X-Resolution Y-Resolution Focal Aperture Flash Focal Length Color Space Angle Orientation Preferred Display Category Importance Tags Version Notes Voice/comment 7/28 Object: Simply an abstract container where data and metadata are co-located

8/28 Objects Rich meta-data that co-exists with data; easy policies Addressed by a 128 bit id Flat Access Checksum is part of metadata Multiple file types can be in one object (a wave and jpeg) Cost effective because of eventual consistency and the lack of POSIX complexity. Scales well with off-the-shelf hardware Simple access protocol, RESTful API. Suited for the digital world generated unstructured data.

9/28 Big Question So... File and NAS are DEAD?

10/28.and the Answer is.. NO File and NAS will continue to grow File and Object joins hands together to keep the party on! FOBS File and Object Based Storage

11/28 FOBS File and Object Based Storage Object storage works best for WORM workloads and not all data fits that tab. Object is meant for low cost mass storage which is not actively shared. Traditional applications and file systems use continues File fills part of the spectrum where the need for rich set of security and consistency guarantees. Object Storage fills the space where file/nas is week. After-all, most of the object stores and structured data stores (databases) are created on file-systems

12/28 FOBS File and Object Based Storage WORM/Cold Cost Effective High Volume Scalability Manageability Secure Consistency General Purpose Performance Legacy Volume Based Market share Object Store Access/Update Frequency FOBS File

Object Storage Objects are broadly divided into two categories. Storage Devices * Move Smarts into the device layer NASD, OSD-1, OSD-2, OSD Layer on FS etc * Access command set Ex: SCSI model command set for OSD. * Custom OSD mode: Lustre, Ceph * T10 OSD model: EXOFS, PanFS * pnfs support. * PBs of storage on 1000s of disks, 1000s of clients Web Services * Objects created on Filesystems and accessed through web. * REST Model HTTP protocol Operation:Get, Put, Post, Delete * Highly Available * Loosely consistent. * SWIFT, S3, Azure etc * Gaining tremendous popularity. 13/28

Object Based Filesystem (Ex: Ceph) * Provides Posix-Compliant FS on top of Object-Based Ceph Storage Cluster * Files gets mapped to Objects and MDS below librados * MDS stores all Filesystem Metadata (Directories, Owners, Access info etc) * Data directly stored on OSDs * Out of band IO: Metadata provides data location, and IO is directly to OSDs * Offers kernel mount or FUSE interface 14/28 Source:http://ceph.com/docs/master/architecture/

Web Services Object Store - SWIFT * Storage Nodes consists Objects, stored as binary files on the filesystem with metadata stored in the file s extended attributes (xattrs). * Proxy Nodes receive and process Incoming request and determine the correct storage server for the request. * All objects stored in Swift have a URL * All objects stored are replicated 3x in as-unique-as-possible zones. * Object data can be located any where in the cluster * Nodes/Disks can be added without downtime. 15/28 Source:http://docs.openstack.org/training-guides/content/module003-ch007-cluster-architecture.html

NFS Evolution NFS is extremely popular and widely used. Stateless NFSv3, very successful and de-facto for 'NFS'. Stateful NFSv4 came out in 2003 Adaptation is slow but gaining momentum since NFSv4.1 came out Became a stepping stone to move towards NFSv4.1 NFSv4.1 introduced in 2010, added enhancements and addressed NFSv4 deficiencies. Improved performance - pnfs, Directory Delegations, Trunking Robustness - Exactly Once Semantics Security Windows native ACL support, Kerberized Back Channel For time-to-market reasons, few players skipping NFSv4.0 and directly moving to NFSv4.1. Ex: Vmware, Microsoft. 16/28

17/28 Parallel NFS - pnfs * Allows direct client access to the storage devices * Clients can do parallel IO across storage * Layouts can be leased, re-callable, and revokable. * Removes IO bottlenecks and improves large file performance. * Load balancing * Scale-out model * Control Protocol is not standardized, vendor value-add. POSIX Interface File/Object Layout Driver Control ExoFS

18/28 Parallel NFS - pnfs * Supported layout types are open-ended. * Supports three types of layouts - File (RFC 5661) - Block (RFC 5663) - Object (RFC 5664) - Future: - Flexible File Layout (proposal) and others * File Layout - Files, NFS protocol - Default layout and many implementations * Block Layout - SCSI blocks, iscsi, FCP etc * Object Layout - OSD SCSI object protocol, OSD2 - Few implementations, PanFS, Exofs(OSDFS) User Interface NFSv4.1 PNFS Layouts File Obj Block Future Network / IO stack

NAS and Object to become FOBS Many traditional applications written for POSIX access Object storage is different and foreign to the traditional applications. One of the solutions is to create a Filesystem layer on top of object store. Ex: Maldivica storage connector creates filesystem interface on top of SWIFT object store which can be exported via NFS/CIFS (NAS) Provide Object Interface on NAS. Ex: Calsoft Integrates NAS with modified openstack SWIFT and provides SWIFT interface on NAS. 19/28 Source: http://www.calsoftinc.com/openstack-object-storage-swift.aspx#, http://maldivica.com/

Swift-on-File Ability to access the back-end using both object interface and file interface. Swift-on-File stores objects following the same path hierarchy as that object's URL. Object URL: https://swift.example.com/v1/acc/cont/obj Swift:/mnt/sdb1/2/node/sdb2/objects/981/f79/f566bd022b9285 b05e665fd7b843bf79/1401254393.89313.data SoF: /mnt/gluster-vol/acc/cont/obj Enables objects created using the Swift API to be accessed as files on a Posix filesystem. This opens up enormous possibilities including NAS and RESTful interface to create and access the same data Use Case: Create video files using SWIFT, use file access to trans-code it, and let it use by SWIFT to access in different codec. 20/28 Source:https://github.com/swiftonfile/swiftonfile/blob/master/README.md

21/28 NFS for future NFS Pathless Objects - filesystem objects which can be created, queried for and destroyed without being associated with a pathname. (http://tools.ietf.org/html/draft-dipankar-nfsv4-pathless-objects-02) Metastripe - RFCs are being proposed to stripe/scale metadata servers (http://tools.ietf.org/html/draft-mbenjamin-nfsv4-pnfs-metastripe-01) Ceph providing access to back-end RADOS object store through LIBRADOS API, S3/Swift compatible API, Block, CEPHFS - which can be nfs exported, including pnfs. (http://ceph.com/docs/master/architecture/) pnfs over CEPH CohortFS with metastripe PNFS over Lustre. - CEA, French Defense organization. Possible to offer selectable consistency with nfs backed object store vrs web based. OpenStack Manila project (https://wiki.openstack.org/wiki/manila/)

22/28 NFSv4.2 Added Advanced features takes NFS into advanced file sharing category. Performance: Server Side Copy: Removes one leg of copy operation IO_ADVISE: Client advise Server on Application access pattern. Application Data Blocks (ADB): ex: VM image file type. Sparce file support. Security:. Labeled NFS: Mandatory Access Control based on system wide policy Scalability and QoS Space Reservation: Reserve Storage useful in thin provisioning Hole Punching: Return unused parts of the file back to the pool.

23/28 NFS-Ganesha One of the mainstream NFS Server. User-level NFS server suitable for enterprise applicatoins Manageability and debug-ability http://tinyurl.com/kka8czz Can manage huge meta-data and data caches Provision to exploit FS specific features. Can serve multiple types of File Systems at the same time. Can serve multiple protocols at the same time. Can act as a Proxy server and export a remote NFSv4 server. Cluster friendly and Cluster Manager agnostic. Easy recovery, failover and failback implementation. Multi-protocol support with common DLM (planned) Small but growing community. Active participants IBM, Panasas, Redhat, CohortFS(LinuxBox), CES, Bull, + few more.

24/28 NFS-Ganesha Supports many Filesystems through FSAL layer VFS, GPFS, PanFS, Gluster, CEPH, Lustre, XFS, FUSE, Proxy, PT etc NFS v3, NFSv4.0, NFSv4.1, pnfs support. Minimal NFSv4.2 IBM, Redhat, LinuxBox, Panasas released/releasing products based on NFS-Ganesha. Released 1.5, 2.0, 2.1 releases, 2.2 is set to be GA'd by end of October 2014.. Delegation, Statistics, Dynamic exports, LTTng support. Supports file and object layouts of pnfs Cluster Manager Abstraction Layer (CMAL) Clustered DRC, DLM, multi-protocol support.

25/28 Conclusions Object storage is expanding and file remains to be very important part of the equation and expected play together FOBS is the future. Unified storage - NAS, Object, SAN NFS is progressing as a protocol, NFSv4.1 and pnfs support is a must to be competitive in the market space. pnfs has major advantages - Scale-out meta-data, data; parallel IO/ performance improvement. NFSv4.2 is taking NFS as a preferred filesystem/access protocol for future storage needs.

NFS-Ganesha links NFS-Ganesha is available under the terms of the LGPLv3 license. NFS-Ganesha Project homepage on github https://github.com/nfs-ganesha/nfs-ganesha/wiki Github: https://github.com/nfs-ganesha/nfs-ganesh Download page http://sourceforge.net/projects/nfs-ganesha/files Mailing lists nfs-ganesha-devel@lists.sourceforge.net nfs-ganesha-support@lists.sourceforge.net nfs-ganesha-announce@lists.sourceforge.net 26/28

27/28 Legal Statements This work represents the view of the author and does not necessarily represent the view of IBM. IBM is a registered trademark of International Business Machines Corporation in the United States and/or other countries. UNIX is a registered trademark of The Open Group in the United States and other countries. Linux is a registered trademark of Linus Torvalds in the United States, other countries, or both. Other company, product, and service names may be trademarks or service marks of others CONTENTS are "AS IS" WITHOUT WARRANTY OF ANY KIND, EITHER EXPRESS OR IMPLIED, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF NON- INFRINGEMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Some states do not allow disclaimer of express or implied warranties in certain transactions, therefore, this statement may not apply to you. This information could include technical inaccuracies or typographical errors. Changes are periodically made to the information herein; these changes will be incorporated in new editions of the publication. Author/IBM may make improvements and/or changes in the product(s) and/or the program(s) described in this publication at any time without notice.

28/28 Q & A THANK YOU!