LTFS and CDMI - Tape for the Cloud David Slik NetApp, Inc.

Similar documents
Applications of LTFS for Cloud Storage Use Cases

Using CDMI to Manage Swift, S3, and Ceph Object Repositories David Slik NetApp, Inc.

Archiving On-Premise and in the Cloud. March 2015

PASIG Four-Tier Computing: Linking On-Premise and Cloud Infrastructures

Growth of Unstructured Data & Object Storage. Marcel Laforce Sr. Director, Object Storage

Save Time and Money with Quantum s Integrated Archiving Solution

Strategies and New Technology for Long Term Preservation of Big Data

Backing up to the Cloud

IBM Spectrum Protect in the Cloud

How Does the Cloud Fit into Active Archiving. Active Archive Alliance Panel

SWIFT. Page:1. Openstack Swift. Object Store Cloud built from the grounds up. David Hadas Swift ATC. HRL 2012 IBM Corporation

Well-Grounded Approach to the Cloud:

Amazon Cloud Storage Options

Feet On The Ground: A Practical Approach To The Cloud Nine Things To Consider When Assessing Cloud Storage

Riverbed Whitewater/Amazon Glacier ROI for Backup and Archiving

LONG TERM RETENTION OF BIG DATA

Object Oriented Storage and the End of File-Level Restores

Long term retention and archiving the challenges and the solution

in the Cloud - What To Do and What Not To Do Chad Thibodeau / Cleversafe Sebastian Zangaro / HP

Intro to AWS: Storage Services

Cloud Data Management Interface (CDMI) The Cloud Storage Standard. Mark Carlson, SNIA TC and Oracle Chair, SNIA Cloud Storage TWG

High Performance Computing OpenStack Options. September 22, 2015

Future Proofing your Cloud Infrastructure. Greg Kleiman Director of Cloud Solutions, NetApp Co-Chair SNIA CSI Education Committee

<Insert Picture Here> Cloud Archive Trends and Challenges PASIG Winter 2012

Cloud Storage Standards Overview and Research Ideas Brainstorm

Cloud Storage Services. A Total Cost of Ownership Comparison. Sponsored by Fujifilm Copyright 2015 Brad Johns Consulting L.L.C.

SNIA Cloud Storage PRESENTATION TITLE GOES HERE

NetApp, Standards & Open Source

NetApp Data Fabric: Secured Backup to Public Cloud. Sonny Afen Senior Technical Consultant NetApp Indonesia

Federated Cloud File System Framework

WHITE PAPER. Reinventing Large-Scale Digital Libraries With Object Storage Technology

Leveraging Cloud Object Storage for Cost Savings, Scale and Reliability. Avere Systems Adds Cloud Storage as a NAS Tier

Long-term data storage in the media and entertainment industry: StrongBox LTFS NAS archive delivers 84% reduction in TCO

Introduction to NetApp Infinite Volume

The Evolution of Cloud Storage - From "Disk Drive in the Sky" to "Storage Array in the Sky" Allen Samuels Co-founder & Chief Architect

Design and Evolution of the Apache Hadoop File System(HDFS)

A 5 Year Total Cost of Ownership Study on the Economics of Cloud Storage

EMC DATA DOMAIN OPERATING SYSTEM

Vodacom Managed Hosted Backups

Key Considerations for Managing Big Data in the Life Science Industry

Simplify Your Data Protection Strategies: Best Practices for Online Backup & Recovery

TCO Case Study. Enterprise Mass Storage: Less Than A Penny Per GB Per Year. Featured Products

Hybrid Cloud Storage: Making Your Move to the Cloud

The Design and Implementation of the Zetta Storage Service. October 27, 2009

Hybrid Enterpriseto-Cloud. Services. Francisco Salinas, Cloud Services Engineering Adam O Dwyer, Services Marketing. June 24, 2015

EMC AVAMAR. a reason for Cloud. Deduplication backup software Replication for Disaster Recovery

Media for Long-Term Archiving. June 2014

Why long time storage does not equate to archive

Current and Future Economics of Deep Storage: Public vs. Private Cloud

Interoperable Cloud Storage with the CDMI Standard

The BIG Data Era has. your storage! Bratislava, Slovakia, 21st March 2013

Cloud Archiving. Paul Field Consultant

Storage Clouds. Karthik Ramarao. Director of Strategy and Technology and CTO Asia Pacific, NetApp Board Director SNIA South Asia

How To Share Data With The Cloud On Dcache

What is the LTO Program?

TCO Case Study Enterprise Mass Storage: Less Than A Penny Per GB Per Year

Sibling Rivalries: Integrating Data Management Technologies

Storage and Data Management in a post-filesystem

Building Storage Clouds for Online Applications A Case for Optimized Object Storage

Cloud OS Vision. Modern platform for the world s apps

New Features in Neuron ESB 2.6

Archive and Preservation in the Cloud - Business Case, Challenges and Best Practices. Chad Thibodeau, Cleversafe, Inc. Sebastian Zangaro, HP

DEFINING THE ROI FOR MEDICAL IMAGE ARCHIVING

dcache, Software for Big Data

Solution Brief: Archiving Avid Interplay Projects using NLT and XenData

Optimize Your SharePoint 2010 Content Using Microsoft s New Storage Guidance

PARALLELS CLOUD STORAGE

Digital Archive Getting Started Guide

ENABLING GLOBAL HADOOP WITH EMC ELASTIC CLOUD STORAGE

White Paper. Managing MapR Clusters on Google Compute Engine

XenData Product Brief: SX-550 Series Servers for LTO Archives

EMC DATA DOMAIN OPERATING SYSTEM

Understanding Enterprise NAS

Storage CloudSim: A Simulation Environment for Cloud Object Storage Infrastructures

Cloud Computing. Lecture 24 Cloud Platform Comparison

Cloud Archive & Long Term Preservation Challenges and Best Practices

Data Domain Overview. Jason Schaaf Senior Account Executive. Troy Schuler Systems Engineer. Copyright 2009 EMC Corporation. All rights reserved.

EonStor DS remote replication feature guide

Best Practices for Data Archiving...1 Prudent Policies...2 Data Vaulting Policies...2 Defining the Data Life Cycle...3 Keeping Data Current...

BlueArc unified network storage systems 7th TF-Storage Meeting. Scale Bigger, Store Smarter, Accelerate Everything

Hitachi Content Platform. Andrej Gursky, Solutions Consultant May 2015

ATLAS Tier 3

Will SSD replace HDD?

Storage Virtualisation in the Cloud

Power-Managed Storage:

15 Best Practices For Comparing Data Protection With Cloud Media Server

Reference Design: Scalable Object Storage with Seagate Kinetic, Supermicro, and SwiftStack

Accounts Payable Capture and Approval

ASG-DIGITAL ARCHIVE FOR MEDIA AND ENTERTAINMENT

Unleash the IaaS Cloud About VMware vcloud Director and more VMUG.BE June 1 st 2012

XenData Product Brief: SX-250 Archive Server for LTO

DSS. High performance storage pools for LHC. Data & Storage Services. Łukasz Janyst. on behalf of the CERN IT-DSS group

CLOUD COMPUTING. When it's smarter to rent than to buy.. Presented by Anand Tirumani

Cloud Storage. Deep Dive. Extending storage infrastructure into the cloud SPECIAL REPORT JUNE 2011

SAP HANA - Main Memory Technology: A Challenge for Development of Business Applications. Jürgen Primsch, SAP AG July 2011

TABLE OF CONTENTS THE SHAREPOINT MVP GUIDE TO ACHIEVING HIGH AVAILABILITY FOR SHAREPOINT DATA. Introduction. Examining Third-Party Replication Models

The Business Case for the Cloud. Presenter: Alex McDonald, Industry Standards, CTO Office, NetApp Author: Marty Stogsdill, Oracle

Oracle Data Protection Concepts

OpenStack Benelux. Dan Chester. Seagate Cloud Systems & Solutions

Lesson Plans Microsoft s Managing and Maintaining a Microsoft Windows Server 2003 Environment

Transcription:

LTFS and CDMI - Tape for the Cloud David Slik NetApp, Inc. 2013 Storage Developer Conference. Insert Your Company Name. All Rights Reserved.

Session Agenda Why Tape? Tape as a Cloud Protection Backing Store Tape as a Cloud Cold Storage Tier Tape for Bulk Transport Object Storage for LTFS Demo 2

Why Tape? Isn t Tape Dead? 3

Why Tape? Isn t Tape Dead? Not where it costs less then the alternatives! For PB Scale Archives, tape has significant economic savings compared to disk: Lower capital cost ($/GB), Lower power & cooling costs, and a Longer amortization period 4

Why Tape? In 2013, 2 PB crossover: Disk CapEx > Tape http://snia.org/sites/default/files/cloudtapeusecases_v1.0.pdf Tape OpEx is almost 1/3 that of Disk = > 5

Why Tape? Tape has flourished in several vertical market niches: Oil and Gas: Tape is environmentally robust and easy to safely transport in harsh environments Media & Entertainment: Tape simplifies workflows and data exchange Archiving & Preservation: Tape provides low bit error rates and long shelf life for large scale archives 6

Why Tape? So, to summarize: If you have many PB s of archival data, tape makes sense If you need to physically move data around, tape makes sense If you need a long shelf life, tape makes sense 7

Tape and cloud If you are a cloud provider, you have several challenges: 1. How do you protect against data loss? 2. How do you provide lower cost offerings? 3. How do you bulk transfer data between/into/out of the cloud? Tape can help! 8

Cloud Protection Tier Scenario 1 Reducing cost for data protection Assume you are a cloud provider and you want to reduce the probability of customer data loss. Your options are: 1. Deploy more disks and mirror 2. Deploy tape and archive Google chose #2, and it saved more than money For more details, search for: gmail outage 2011 tape 9

Cloud Protection Tier As tape has higher latency, a cloud provider must still have disk storage To handle common failures and maintenance activities, at least two disk locations are required However, two disk locations are insufficient to provide sufficient survivability and fault isolation Tape reduces the cost of additional copies Tape reduces the probability of cascading failures that corrupt/destroy all copies 10

Cloud Archival SLO Scenario 2 Reduced Cost Storage If data is stored directly to tape (or through a small staging area), savings can be passed on to the customer This allows a cloud service provider to offer a lower-cost differentiated service, similar to what Amazon has done with their Glacier offering 11

Cloud Protection Tier A couple of important restrictions: This only works for infrequently accessed data. If data is randomly access at a frequent enough rate, tape wear will increase costs due to media replacement rates This only works for data where high latencies can be tolerated by the customer This requires different software interfaces in older to handle the higher latency, typically involving notifications of data availability 12

Cloud Protection Tier LTFS standardization reduces complexity, simplifies development, and enables new service offerings: For example, if customer data is stored on standard tapes, in a standard format, that opens the option for a customer to request that the tapes (or copies of the tapes) be sent to them Which leads us into our third and final scenario 13

Bulk Cloud Transfer Scenario 3 Bulk Cloud transfer Q. How do you get large amounts of data in and out of the cloud? A. Slowly and expensively! This is a significant problem for organizations that generate more data then they have bandwidth to send, and when they need to retrieve large amounts of data quickly. 14

Bulk Cloud Transfer Transferring 2 PB over an OC-12 Link Save time! http://snia.org/sites/default/files/cloudtapeusecases_v1.0.pdf 15

Bulk Cloud Transfer Transferring 10 TB over an 10 Mbit/sec Link http://snia.org/sites/default/files/cloudtapeusecases_v1.0.pdf Save time AND money! 16

Bulk Cloud Transfer The LTFS TWG is working on a standard way to transfer collections of data: An XML manifest that describes: Which tapes are used to store the data Which files, directories and objects are being transferred Fixity and integrity verification information Instructions on how to merge data into an existing namespace A standard workflow for bulk data transfer 17

Demonstration Ruby Cloud -> LTFS Transfer Demonstration 18

Object Storage for LTFS LTFS provides: Standardized POSIX-style directory and files Standard file metadata and ACL storage Standard tape spanning for large files This reduces the complexity of using tape as a backing store, and simplifies development The LTFS TWG has begun an effort to standardize how objects are stored on tape 19

Object Storage for LTFS Storing objects on LTFS adds: Support for rich metadata ID-based namespaces for object access Support for composite objects (Queues, etc) Support for object versioning This allows objects from object storage systems using Azure, CDMI, S3, and Swift to be stored on LTFS and accessed in a standard way 20

Object Storage for LTFS CDMI <-> LTFS Mapping Examples: CDMI Named Data Object LTFS.pdf CDMI Unnamed Data Object 00007ED90 CDMI Container SDC 2013 CDMI Queue Messages S3 & Swift mappings in the works Standard Header Metadata mapping 21

Object Storage for LTFS CDMI Named Data Object LTFS.pdf Metadata Author : LTFS TWG LTFS Layout: / LTFS Root /LTFS.pdf ltfs.vendor.cdmi.objectid ltfs.vendor.cdmi.mimetype ltfs.vendor.cdmi.metadata ltfs.vendor.cdmi.valuetransferencoding /cdmi_objectid/ /cdmi_objectid/00007ed90010f0e4fa0 LTFS file with object name as file name 00007ED90010F0E4FA063BCEB659D6ED application/pdf { Author : LTFS TWG } Base64 Object ID Container Symlink to /LTFS.pdf 22

Object Storage for LTFS CDMI Unnamed Data Object 00007ED90 Metadata Conference : SDC LTFS Layout: / LTFS Root /cdmi_objectid/ /cdmi_objectid/00007ed90010a49f2a0 ltfs.vendor.cdmi.objectid ltfs.vendor.cdmi.mimetype ltfs.vendor.cdmi.metadata ltfs.vendor.cdmi.valuetransferencoding Object ID Container LTFS file with object ID as file name 00007ED90010F0E4FA063BCEB659D6ED application/pdf { Conference : SDC } Base64 23

Object Storage for LTFS CDMI Container SDC2013 Metadata cdmi_latency : 1000000 LTFS Layout: / LTFS Root /SDC2013/ LTFS directory ltfs.vendor.cdmi.objectid 00007ED900105E38846F7EAA6C061CA7 ltfs.vendor.cdmi.metadata { cdmi_latency : 1000000 } /cdmi_objectid/ Object ID Container /cdmi_objectid/00007ed900105e38846f Symlink to /SDC2013/ 24

Object Storage for LTFS CDMI Queue Messages LTFS Layout: / LTFS Root /Messages ltfs.vendor.cdmi.objectid /Messages.cdmi_queue/ /Messages.cdmi_queue/0 ltfs.vendor.cdmi.mimetype ltfs.vendor.cdmi.valuetransferencoding /Messages.cdmi_queue/1 ltfs.vendor.cdmi.mimetype ltfs.vendor.cdmi.valuetransferencoding /cdmi_objectid/ /cdmi_objectid/00007ed90010a49f2a0 LTFS file with queue name as file name 00007ED90010A49F2A0F1F996095A626 LTFS directory for queue values LTFS file corresponding to first queue value text/plain UTF8 LTFS file corresponding to next queue value text/plain UTF8 Object ID Container Symlink to /Messages 25

Next Steps Read the SNIA Cloud Tape Use Cases document: http://snia.org/sites/default/files/cloudtapeusecases_v1.0.pdf Join the SNIA Joint Cloud/LTFS Technical Working Group Active projects include: Cloud Data Transfer Workflow & XML Object storage for LTFS Tape 26

Thank You! Questions and Answers Contact Info: dslik@netapp.com 27