Object Oriented Storage and the End of File-Level Restores

Similar documents
Hitachi Content Platform. Andrej Gursky, Solutions Consultant May 2015

<Insert Picture Here> Refreshing Your Data Protection Environment with Next-Generation Architectures

Vodacom Managed Hosted Backups

Long term retention and archiving the challenges and the solution

Growth of Unstructured Data & Object Storage. Marcel Laforce Sr. Director, Object Storage

BlueArc unified network storage systems 7th TF-Storage Meeting. Scale Bigger, Store Smarter, Accelerate Everything

ntier Verde Simply Affordable File Storage

Alternatives to Big Backup

(Scale Out NAS System)

ETERNUS CS High End Unified Data Protection

IBM Tivoli Storage Manager Version Introduction to Data Protection Solutions IBM

EMC BACKUP MEETS BIG DATA

Save Time and Money with Quantum s Integrated Archiving Solution

Get Success in Passing Your Certification Exam at first attempt!

Protect Data... in the Cloud

The Archival Upheaval Petabyte Pandemonium Developing Your Game Plan Fred Moore President

Backup and Recovery 1

SOLUTION BRIEF KEY CONSIDERATIONS FOR BACKUP AND RECOVERY

Introduction to Data Protection: Backup to Tape, Disk and Beyond. Michael Fishman, EMC Corporation

EMC DATA DOMAIN OPERATING SYSTEM

XenData Product Brief: SX-550 Series Servers for LTO Archives

Object Storage: Out of the Shadows and into the Spotlight

Object Storage, Cloud Storage, and High Capacity File Systems

Building Storage Clouds for Online Applications A Case for Optimized Object Storage

Barracuda Backup Server. Introduction

Introduction to Data Protection: Backup to Tape, Disk and Beyond. Michael Fishman, EMC Corporation

Storage Backup and Disaster Recovery: Using New Technology to Develop Best Practices

ClearPath Storage Update Data Domain on ClearPath MCP

The Economics of File-based Storage

EMC DATA DOMAIN EXTENDED RETENTION SOFTWARE: MEETING NEEDS FOR LONG-TERM RETENTION OF BACKUP DATA ON EMC DATA DOMAIN SYSTEMS

Cost Effective Backup with Deduplication. Copyright 2009 EMC Corporation. All rights reserved.

EMC DATA DOMAIN OPERATING SYSTEM

Diagram 1: Islands of storage across a digital broadcast workflow

Questions and Answers for Request for Proposal # Q1) What applications are running on each of the physical servers?

Introduction to Optical Archiving Library Solution for Long-term Data Retention

UNDERSTANDING DATA DEDUPLICATION. Thomas Rivera SEPATON

IBM TSM DISASTER RECOVERY BEST PRACTICES WITH EMC DATA DOMAIN DEDUPLICATION STORAGE

Introduction to NetApp Infinite Volume

Cloud OS Vision. Modern platform for the world s apps

Efficient Backup with Data Deduplication Which Strategy is Right for You?

White Paper. What is IP SAN?

WHITE PAPER. QUANTUM LATTUS: Next-Generation Object Storage for Big Data Archives

Hitachi NAS Platform and Hitachi Content Platform with ESRI Image

Disaster Recovery Strategies: Business Continuity through Remote Backup Replication

Protect Microsoft Exchange databases, achieve long-term data retention

Storage Switzerland White Paper Storage Infrastructures for Big Data Workflows

EMC DATA DOMAIN OVERVIEW. Copyright 2011 EMC Corporation. All rights reserved.

How To Improve Storage Efficiency With Ibm Data Protection And Retention

Hitachi Cloud Service for Content Archiving. Delivered by Hitachi Data Systems

CONFIGURATION GUIDELINES: EMC STORAGE FOR PHYSICAL SECURITY

Checklist and Tips to Choosing the Right Backup Strategy

Data Protection. the data. short retention. event of a disaster. - Different mechanisms, products for backup and restore based on retention and age of

Business Benefits of Data Footprint Reduction

Every organization has critical data that it can t live without. When a disaster strikes, how long can your business survive without access to its

REDUCE COSTS AND COMPLEXITY WITH BACKUP-FREE STORAGE NICK JARVIS, DIRECTOR, FILE, CONTENT AND CLOUD SOLUTIONS VERTICALS AMERICAS

Symantec NetBackup Appliances

Key Messages of Enterprise Cluster NAS Huawei OceanStor N8500

Redefining Oracle Database Management

GIVE YOUR ORACLE DBAs THE BACKUPS THEY REALLY WANT

Why StrongBox Beats Disk for Long-Term Archiving. Here s how to build an accessible, protected long-term storage strategy for $.003 per GB/month.

A value-added service and solutions company focused on media-optimized, data storage-intensive workflows. SAN Scale-Out NAS Data Protection

Introduction. Silverton Consulting, Inc. StorInt Briefing

Five Fundamentals for Modern Data Center Availability

DATASHEET FUJITSU ETERNUS CS800 DATA PROTECTION APPLIANCE

XenData Video Edition. Product Brief:

Implementing Offline Digital Video Storage using XenData Software

Protecting the Microsoft Data Center with NetBackup 7.6

Understanding Enterprise NAS

Object Storage A Dell Point of View

Tier 2 Nearline. As archives grow, Echo grows. Dynamically, cost-effectively and massively. What is nearline? Transfer to Tape

NetApp Big Content Solutions: Agile Infrastructure for Big Data

Oracle Data Protection Concepts

THE EMC ISILON STORY. Big Data In The Enterprise. Copyright 2012 EMC Corporation. All rights reserved.

Using the NDMP File Service for DMA- Driven Replication for Disaster Recovery. Hugo Patterson

IBM Storage Management within the Infrastructure Laura Guio Director, WW Storage Software Sales October 20, 2008

IBM Spectrum Protect in the Cloud

June Blade.org 2009 ALL RIGHTS RESERVED

Service Overview CloudCare Online Backup

EMC Data de-duplication not ONLY for IBM i

Protecting enterprise servers with StoreOnce and CommVault Simpana

Backup and Recovery Solutions for Exadata. Ľubomír Vaňo Principal Sales Consultant

Quantum DXi6500 Family of Network-Attached Disk Backup Appliances with Deduplication

Enterprise-class Backup Performance with Dell DR6000 Date: May 2014 Author: Kerry Dolan, Lab Analyst and Vinny Choinski, Senior Lab Analyst

DEFINING THE RIGH DATA PROTECTION STRATEGY

How to Manage Critical Data Stored in Microsoft Exchange Server By Hitachi Data Systems

Virtualize Without Compromise. Protecting and Storing Virtualized Data

Actifio Big Data Director. Virtual Data Pipeline for Unstructured Data

Deduplication has been around for several

Designing a Cloud Storage System

Archive Data Retention & Compliance. Solutions Integrated Storage Appliances. Management Optimized Storage & Migration

Transcription:

Object Oriented Storage and the End of File-Level Restores Stacy Schwarz-Gardner Spectra Logic

Agenda Data Management Challenges Data Protection Data Recovery Data Archive Why Object Based Storage? The Best of All Worlds 2

Data Management Challenges Data Chaos/Data Explosion Unstructured Data, Big Data, Expansive Data Backup, Data Protection, Data Recovery Storage Heterogeneity / Utilization Efficiency Data Center Footprint and Power Data Preservation Indefinite Retention Compliance Managed Retention & Access, WORM, Audit Want to build a service or cloud for Data Management Backup Want to build a Global Name Space for data accessibility Want Standardization and Policies Flexibility Cost = Everything comes down to Cost / GB 3

What is Big Data? Applications Requiring Intensive Data Mining and Analytics Financial Institutions Tic Data Analysis Trend Analysis Risk Assessment Cross Domain Correlations Health Care Drug Efficacy Disease Pattern recognition Broadcast, Media, & progression Entertainment Fraud detection Claim automation 4x resolution of HDTV Government (FBI,CIA,DOD,DOE,HLS,IRS) Internet threat detection Deeper in Color Pattern recognition Online Streaming Image analysis Fraud and waste mitigation Consumer / Commercial Space Social Media and Product Sentiment correlation Consumer Analysis Advertising Affinity correlation Telemetry and Quality analysis Mapping and Satellite Mortgage Data Check Images Brokerage Data Medical Records Digital Imaging Genomic Increase frame rate per second 2 4X Video Surveillance Biometrics Research Data Digital Media Videos, Music, Books, Photos Seismic/Oil & Gas

Data Protection BACKUP Data Protection BIG DATA = EXPANSIVE DATA 5

Data Protection Traditional Backup: Designed for Point In Time Recovery Designed for Disaster Recovery Full Backups Incrementals Differentials Tape primary media Tape primary offsite strategy 6

Data Protection Backup Issues Dataset Size (GB s TB s PB s) Backup Duration (Minutes Hours Days) Backup Reliability % of Stale/Inactive Data Full backups backing up higher % of the same data over and over again Windows of Exposure Time to get tapes created and offsite Restore Complexity and Duration Proprietary 7

Data Protection Backup Adaptability Enter VTL / Disk based backups Somewhat Faster Backups Virtual Tape Libraries / seem-less integration Introduced the concept of deduplication Introduced the concept of backup image replication Limited Retention on Disk due to Cost and Scalability Longer retention still required tape use of backup tapes for archive became the norm. Introduction of WORM and Encryption on Tape 8

Data Protection It s All About Data Access: Faster Restores but, Still Proprietary Required Rehydration Requires IT Intervention It s Still a File-Level Restore 9

Data Recovery BACKUP Data Protection BIG DATA = EXPANSIVE DATA MIRRORS SNAPSHOTS REPLICATION Data Recovery 10

Data Recovery Enter Storage-based Mirrors, Snapshots, Replication Block Based One or More Times Per Day Point in Time Recovery (more Aggressive) Close to Instant Recovery Offsite Protection 11

Data Recovery Storage Based Data Recovery Issues Storage Vendor Dependent Dependent on Primary Volume Integrity Short Term Retention Only Still need Traditional Backup Double or Triple the Storage Cost of Storage Bandwidth Considerations Spinning Disk Durability: how many copies is enough? 12

Data Recovery Storage Adaptability Introduce Storage Tiering Introduce Deduplication Introduce Off-Array based Snapshots Longer retention still requires backup tapes for use as a long term archive 13

Data Recovery It s All About Data Access: Snapshots Eliminated Restores Fast and Easy but, Primarily short retention only Storage vendor specific Mirrors and Access to Replicated Copies Complicated and required IT intervention Deduplication still requires rehydration overhead 14

Data Archive BACKUP Data Protection BIG DATA= EXPANSIVE DATA MIRRORS SNAPSHOTS REPLICATION Data Archive HSM ILM ACTIVE Data Recovery 15

Data Archive Let s try Hierarchical Storage Management (HSM) on Open Systems Let s call it Information Life Cycle Management (ILM) Let s address inactive/stale data and compliance challenges We ll move the data and manage the archive We ll include a combination of disk and tape to address everyone s needs 16

Data Archive We ll introduce immutability models Write Once, Read Many (WORM) Write Once, Read None (WORN) Write Once, Read Seldom if Ever (WORSE) We ll introduce a Storage Platform designed for Compliance Introduction of Content Addressable Storage And that s exactly what the industry tried to do 17

Data Archive Data Archive Issues One ILM Archive Technology did not address all types of data and applications Most ILM Archive Platforms were oriented around disk no native access to tape Cost and Complexity No Technology Refresh Strategies Long Term Preservation No Self-Healing Limited Scalability Archives were vendor proprietary Data Mover and Archive had to be the same vendor How do you protect the Archive? 18

Data Archive It s All About Data Access: Somewhat Limited or Redirection to files had to be via Pointers or Stubs left behind on the primary storage tier Access had to be via a Proprietary Archive Application GUI or Client/Application Plug-In 19

Active Archive Concept An Active Archive contains native file format data transparently accessible to end users through a file system interface (CIFS, NFS) Active Archiving is not a single product. It s a collaborative solution offered by software and multiple hardware vendors, and in the proposed scenario, also takes advantage of existing equipment Vendor Agnostic- Consisting of data management software, disk, and tape options 20

Active Archive Out of Band Conceptual Design Primary Storage Z: DRIVE NFS / CIFS NFS / CIFS Data Mover, Copy Technologies Data Management Software Tape Secondary Heterogeneous Storage Data Management Framework Remote Data Center/ and or 2012 Cloud Storage Developer Conference. Spectra Logic Inc.. All Rights Reserved. Remote Data Center And/or Cloud

Data Archive It s All About Data Access: An Active Archive provides native access to disk or tape without File-Level restores. An Active Archive presents an NFS and/or CIFS Gateway for transparent Access An Active Archive process is heterogeneous, working with multiple data movers, data management software, and storage/tape vendors Designed for Long Term Data Preservation 22

Data Durability BACKUP Data Protection OBJECT BASED STORAGE BIG DATA = EXPANSIVE DATA MIRRORS SNAPSHOTS REPLICATION Data Archive ARCHIVE HSM ILM ACTIVE Data Recovery 23

Why Object Based Storage? Looming, Spinning Disk Challenges: RAID Limitations Larger Disk Capacities Larger RAID Sets Increased number of RAID Sets per array due to higher capacity Introduce Longer Rebuild Times (days vs. hours) Higher Potential of Failures Unrecoverable Bit Errors ( bit rot ) Data Loss or Corruption during rebuild process 24

Why Object Based Storage? Replicated Copy Limitations Size of Data Sets Number of Copies needed for redundancy Rebuild Time for Mirrors Storage Capacity Required = Cost File System Limitations # s of files, directories, volumes Scalability Metadata Management Indexing, Search Ability, File Management 25

Why Object Based Storage? Four Major Benefits Levels of Protection/Data Durability Separation of File Intelligence from Physical Data Scalability Accessibility 26

Why Object Based Storage? Protection and Durability Replaces RAID much higher availability and reliability (Multiple 9 s) Erasure Coding Methodologies (i.e. Reed- Solomon, Fountain-Codes) Files are managed as data objects separation of file intelligence from physical data Data objects are transformed into a serious of equations which are redundant and distributed across a storage pool Self Healing Capabilities 27

Why Object Based Storage? Protection and Durability Replica s and Reliability Policies Define the Failure Tolerance Level of specific objects Define how many disks the objects should be spread over Define how many simultaneous failures that should be tolerated Example: 16/4 = Indicates that data will be spread across 16 drives in a manner that can tolerate 4 simultaneous failures 28

Why Object Based Storage? 29

Why Object Based Storage? Separation of Intelligence from Physical Data Scalable Metadata store Enables search, mining, and analytics of billions of objects without touching physical media Standard Metadata Object ID s, object size, creation dates, location Custom Metadata User Definable 30

Why Object-Based Storage? Scalability 100TB 100 s PB Space is Allocated across Storage Pools On-the fly capacity upgrades No file system or RAID rebuild limitations Automatic Restriping Capabilities Automated Replica Management No Volume Configuration Multi-Site Support Metadata and Data Store can scale with no impact on performance 31

Why Object-Based Storage? Accessibility Object Stores are accessible via Rest and HTTP Protocols Cloud and As A Service Enabling They require applications to be Object Storage Aware 32

Whey Object Based Storage? It s All About Data Access: Data stored within the Object Store can be natively accessed by an Object Store aware application Eliminates the Need for File-Level restores Data can be stored or copied within an Object Store thus eliminating the need to continually back it up Overcomes the limitations with spinning disk and capacity considerations 33

Best of All Worlds OBJECT BASED STORAGE Data Durability BIG DATA = EXPANSIVE DATA Data Protection ACTIVE ARCHIVE Data Recovery Data Archive 34

Best of All Worlds Combine Active Archive with Object Based Storage Remove Spinning Disk Concerns and Limitations Durability (Replica s, Reliability Policies) Capacity Management Scalability Add Enhanced Metadata Management Add Content Policy Management Retention, Immutability Deletion/Purge 35

Best of All Worlds NFS/CIFS (NAS Gateway) Capabilities Expanded Accessibility Transparent Application/User Access Ability to Leverage Tape and Disk transparently as a drive letter, share, mount point Ability to replace Traditional Backup Ability to Leverage Tape as NAS Ability to Leverage Tape for Open Portability Why Tape? Cost vs. Capacity (higher areal density) Capacity vs. Footprint 36

Best of All Worlds Primary Storage Z: DRIVE NFS / CIFS NFS / CIFS Data Mover, Copy Technologies Active Archive Management Software REST/HTTP Tape Object Storage Aware Applications Data Management Framework Remote Data Center and/or 2012 Cloud Storage Developer Conference. Spectra Logic Inc.. All Rights Reserved. Remote Data Center and/or Cloud

Best of All Worlds Ultimately, It s All About Data Access: Active Archive + Object Based Storage Cloud Ready, Scalable, Cost Effective, Long Term Data Storage And the End of File Level Restores!

Questions? stacys@spectralogic.com