Will Object Stores Dominate Storage of Unstructured Data?

Similar documents
STORAGE E NUVOLE Juku-unplugged v3.0

ENVIRONMENTAL PRESSURES DRIVING AN EVOLUTION IN FILE STORAGE

Object Storage A Dell Point of View

How To Store Data In A Cloud Environment

Hitachi Content Platform. Andrej Gursky, Solutions Consultant May 2015

REDUCE COSTS AND COMPLEXITY WITH BACKUP-FREE STORAGE NICK JARVIS, DIRECTOR, FILE, CONTENT AND CLOUD SOLUTIONS VERTICALS AMERICAS

Hitachi NAS Platform and Hitachi Content Platform with ESRI Image

Long term retention and archiving the challenges and the solution

The Economics of File-based Storage

Archiving A Dell Point of View

NetApp Big Content Solutions: Agile Infrastructure for Big Data

Time Value of Data. Creating an active archive strategy to address both archive and backup in the midst of data explosion.

Hitachi Cloud Service for Content Archiving. Delivered by Hitachi Data Systems

ARCHIVING. What it is, what it isn t, and how it can improve your business operations

Unstructured data in the enterprise

Growth of Unstructured Data & Object Storage. Marcel Laforce Sr. Director, Object Storage

Symantec Enterprise Vault and Symantec Enterprise Vault.cloud

Diagram 1: Islands of storage across a digital broadcast workflow

Data Sheet: Archiving Symantec Enterprise Vault Store, Manage, and Discover Critical Business Information

Hitachi Content Platform as a Continuous Integration Build Artifact Storage System

巨 量 資 料 分 層 儲 存 解 決 方 案

Backup and Recovery 1

Object Storage A Fresh Approach to Long-Term File Storage

Managing the Unmanageable: A Better Way to Manage Storage

Object Oriented Storage and the End of File-Level Restores

Taming Big Data Storage with Crossroads Systems StrongBox

WHITE PAPER. QUANTUM LATTUS: Next-Generation Object Storage for Big Data Archives

IBM General Parallel File System (GPFS ) 3.5 File Placement Optimizer (FPO)

Solution Overview: Data Protection Archiving, Backup, and Recovery Unified Information Management for Complex Windows Environments

Storage Switzerland White Paper Storage Infrastructures for Big Data Workflows

How to Manage Critical Data Stored in Microsoft Exchange Server By Hitachi Data Systems

Geospatial Imaging Cloud Storage Capturing the World at Scale with WOS TM. ddn.com. DDN Whitepaper DataDirect Networks. All Rights Reserved.

DataCentred Cloud Storage

Understanding Object Storage and How to Use It

S O L U T I O N S B R I E F. Active Archive. A Blueprint for Long-term Preservation of Business-critical Digital Data. Hitachi Data Systems

SOLUTION BRIEF KEY CONSIDERATIONS FOR BACKUP AND RECOVERY

A New Strategy for Data Management in a Time of Hyper Change

Got Files? Get Cloud!

2. TEMPLATE TITLE SLIDE WITH PHOTO

Discover A New Path For Your Healthcare Data and Storage

IBM Spectrum Protect in the Cloud

Riverbed Whitewater/Amazon Glacier ROI for Backup and Archiving

Data Management in an International Data Grid Project. Timur Chabuk 04/09/2007

Hitachi Data Systems. Klemen Bertoncelj Senior Solutions Consultant. 2 December Copyright 2010 Hitachi Data Systems. All rights reserved.

HGST Object Storage for a New Generation of IT

ARCHIVING FOR DATA PROTECTION IN THE MODERN DATA CENTER. Tony Walker, Dell, Inc. Molly Rector, Spectra Logic

Archiving, Backup, and Recovery for Complete the Promise of Virtualization

Case Studies. Data Sheets : White Papers : Boost your storage buying power... use ours!

Clodoaldo Barrera Chief Technical Strategist IBM System Storage. Making a successful transition to Software Defined Storage

Joe Young, Senior Windows Administrator, Hostway

Microsoft Exchange Online Archiving and Symantec Enterprise Vault.cloud

Lab Validation Report

LEVERAGING EMC SOURCEONE AND EMC DATA DOMAIN FOR ENTERPRISE ARCHIVING AUGUST 2011

Infrequent Tape Retention

White. Paper. Extracting the Value of Big Data with HP StoreAll Storage and Autonomy. December 2012

2011 FileTek, Inc. All rights reserved. 1 QUESTION

ATA DRIVEN GLOBAL VISION CLOUD PLATFORM STRATEG N POWERFUL RELEVANT PERFORMANCE SOLUTION CLO IRTUAL BIG DATA SOLUTION ROI FLEXIBLE DATA DRIVEN V

ATA DRIVEN GLOBAL VISION CLOUD PLATFORM STRATEG N POWERFUL RELEVANT PERFORMANCE SOLUTION CLO IRTUAL BIG DATA SOLUTION ROI FLEXIBLE DATA DRIVEN V

SUSE Storage. FUT7537 Software Defined Storage Introduction and Roadmap: Getting your tentacles around data growth. Larry Morris

USGS Guidelines for the Preservation of Digital Scientific Data

Partner Playbook. Archiving with Hitachi Content Platform and Symantec Enterprise Vault Solution

How To Use The Hitachi Content Archive Platform

SCALABLE FILE SHARING AND DATA MANAGEMENT FOR INTERNET OF THINGS

WaterfordTechnologies.com Frequently Asked Questions

Symantec Enterprise Vault.cloud Overview

Introduction to NetApp Infinite Volume

EMC SourceOne Management and ediscovery Overview

TO BE OR NOT TO BE (Archiving), That is the question!

Intro to AWS: Storage Services

Datamanagement in the hybrid datacenter. RealDolmen March 25, 2014

VENDOR-AGNOSTIC EXPLANATIONS AND ADVICE FOR THE INFORMATION TECHNOLOGY BUYER. Evaluating Archiving Solutions

WOS OBJECT STORAGE PRODUCT BROCHURE DDN.COM Full Spectrum Object Storage

Global Headquarters: 5 Speen Street Framingham, MA USA P F

<Insert Picture Here> Solution Direction for Long-Term Archive

Understanding Enterprise NAS

WHITE PAPER. Reinventing Large-Scale Digital Libraries With Object Storage Technology

Migration from SharePoint 2007 to SharePoint 2010

The evolution of data archiving

Backup Software? Article on things to consider when looking for a backup solution. 11/09/2015 Backup Appliance or

Symantec Data Center Solutions for Microsoft Exchange

ILM: Tiered Services & The Need For Classification

A Best Practice Guide to Archiving Persistent Data: How archiving is a vital tool as part of a data center cost savings exercise

EMC DATA DOMAIN EXTENDED RETENTION SOFTWARE: MEETING NEEDS FOR LONG-TERM RETENTION OF BACKUP DATA ON EMC DATA DOMAIN SYSTEMS

DEFINING THE RIGH DATA PROTECTION STRATEGY

<Insert Picture Here> Refreshing Your Data Protection Environment with Next-Generation Architectures

Introduction to Red Hat Storage. January, 2012

IBM System Storage DR550

Scala Storage Scale-Out Clustered Storage White Paper

ATA DRIVEN GLOBAL VISION CLOUD PLATFORM STRATEG N POWERFUL RELEVANT PERFORMANCE SOLUTION CLO IRTUAL BIG DATA SOLUTION ROI FLEXIBLE DATA DRIVEN V

Balboa Park Online Collaborative Deploys the Exablox OneBlox Solution: Achieves Cost and Management Savings

Veritas Enterprise Vault.cloud for Microsoft Office 365

Save Time and Money with Quantum s Integrated Archiving Solution

Bruce Allison. Steve Moran

Data Backup and Restore (DBR) Overview Detailed Description Pricing... 5 SLAs... 5 Service Matrix Service Description

Storage Virtualization

Enterprise Vault 10 Feature Briefing

Data Integrity Means and Practices

Benefits of upgrading to Enterprise Vault

Best Practices for Managing Storage in the Most Challenging Environments

Information Archiving

Transcription:

Will Object Stores Dominate Storage of Unstructured Data? Robert Primmer Sr. Technologist & Sr. Director Content Services Hitachi Data Systems

We ll Cover 5 Things Today 1. What is an Object, Object Store and how does it work? 2. Sea Change is Underway in Data Storage. 3. Comparing Structured to Unstructured Data. 4. Comparing Block, File System and Object Stores. 5. Real-World Examples of Object Stores in Action.

Object Store Fundamentals

Why Object Stores? One quick photo grows into more and more, and is quickly unmanageable

Why Object Stores? So we use software tools to do all sorts of useful things beyond just storing them where we created them

What Makes a File into Object? Custom Metadata Photo Example System Metadata File = Image File System Metadata Filename: CA57228.JPG Created: June 2, 2010 Last modified: July 4, 2010 File + Metadata = Object Custom Metadata Subject: Lisa Sanders Category: Family Retention: Do not delete Place Taken: Tempe, Arizona Allow sharing: Yes

What is an Object Store? Client Example: iphoto

Cloud Object Storage Example: Google Picasa

Industrial Strength Object Store Scale: Billions of Discrete Objects Time Horizon is Decades Flat (Global) Namespace Expands Dynamically Not Thousands of object, but Billions No Complexity Easy to configure, Easy to Maintain Self-Sustaining (Heal thyself) Non-disruptive Upgrades Storage Software Systems

Solving File System Limitations Custom Metadata (CMD) Self Describing Means of Layering Relationships Data Independence App is no longer King, Your Data is Allows Data to Outlive the App, Storage Hardware and Storage Software Makes it possible to monetize data outside the original creating application

Sea Change

Exabytes 75%-90% of data is unstructured Challenge: Unstructured Data Growth Unstructured content growing faster than traditional information or structured content Much of IT built for structured data Unstructured data is growing at 10X the rate of structured data How to limit growth of data and associated backup/restore burdens How to scale and support different workloads without more/larger silos <5% of unstructured data is proactively managed How to leverage tiered storage NAS/File systems not content aware Regulations force new behavior It is time for storage technology and practices designed for the unique challenges of unstructured data growth

U.S. National Archive and Records Association (NARA) "We do not know the exact number of files yet, but we need to grow that up to be able to store around 7PB. -- Dyung Le, director of systems engineering for the ERA project at NARA

Prior Market Shifting Event Minicomputer Industry

Next Seismic Shift?

Structured vs. Unstructured Data

Structured Data

Rich Data = Unstructured Data

Unstructured Data (UD) Characteristics UD brings with it a much richer experience. How much better is it to hear a song than simply read the lyrics? To see a full motion HD film versus reading the script? Final Form Data therefore few or no changes Tends to be large and growing larger vis-à-vis rows in a table Not searchable (can't search a song, a picture or a movie) Lives forever (songs last forever e.g. Beethoven's 5th)

Problems with Unstructured Data Becomes the Junk Drawer of Storage No relations between objects established a priori

Problems with Unstructured Data Need to be able to establish relationship amongst objects post ingest That is: add a relational layer on top of the object data

Comparing Block, File System and Object Stores

Contrasting Block to Object Bucket of bits No meaning attached One bit string no different from the next No data intelligence in storage Operations are against gross collections of bits, without knowledge of what the bits represent to the customer

Contrasting Block to Object Objects take form Not just a string of bits From storage subsystem to software system called an object store (OS) OS itself has knowledge about data Now able to perform complex functions against the data

What about File Systems (FS)? FS do a good job of adding order

File System Limitations Lack Flat (Global) Namespace Size Limitations Management Complexity Need to decide structure a priori Don t provide a means for us to later layer on Data Relations

Data World Today Application Data Application is the King Your Data becomes the Subject of this King

Limitations of Present Model It therefore becomes difficult to monetize data outside of the creating application

Real-World Examples

Klinikum Wels Information Management (File Data) Challenges Avoid data overkill and a data graveyard and create revision proof long term archive with high searching and analyzing possibilities Solution Capabilities and Components Easy Compliance integration with of meetings their medical legal applications retention periods for data Scalability Accessing & data open in interfaces different context for future cases projects for patient Works treatment, with USP-V science and education Search Data Independence capabilities avoid migrations, yet easy to access Primary Site: 8 Nodes 2 Search Nodes 2 HDDS Nodes USP-V Replication Secondary Site: 4 Nodes 1 Search Node USP-V

Company Information Management (Email Archiving) Real estate developer in U.A.E. Challenges: Retain email within company for corporate compliance Reduce Exchange server and user.pst capacities Increase the availability, performance and uptime of Exchange Ensure business and IT continuity Solution Capabilities and Components Archiving older mail, improving operations and backup/recovery times Improving backup performance and reliability Two USP V systems being used for Tier1 and Tier2 storage, plus virtualized AMS/WMS boxes for Tier3, HUR replicates the whole environment 1. Object is archived to the Content Platform by Enterprise Vault leaving a safety copy 2. HCP creates objects to meet DPL 3. Object is replicated to remote HCP 4. Remote HCP creates objects to meet DPL 5. Adapter verifies object meets DPL setting on primary and replica 6. Safety Copy is replaced with placeholder Primary Email Enterprise Vault 9 HTTP/HTTPS Replica

NASA Data Protection US National Aeronautics and Space Administration Challenges: The Time/expense/media data is considered issues a national of tape asset so it s extremely important to Separate safeguard. islands A hundred of archives years to store from different now, someone project data is going to want the Provide raw data on-demand or a processed data retrieval file. capabilities, indefinitely Curt Tilmes, OMI computer scientist at NASA. Solution Capabilities and Components Faster, more reliable, less complex and less expensive than tape knowing that we give it data and it gives the data back when and how we Generates two sets of parity data so data is always secure need it is a really great value to the organization because scientists can stay Provides focused an on online the health repository of our to planet, preserve and actively manage digital content so that data remains authentic and instantly accessible Curt Tilmes, OMI computer scientist at NASA With the Object Store, we are able to manage our data with no additional backup tasks. Ben Kobler, Computer Scientist, NASA

Summary 1. What is an Object, Object Store and how does it work? 2. Sea Change is Underway in Data Storage. 3. Comparing Structured to Unstructured Data. 4. Comparing Block, File System and Object Stores. 5. Real-World Examples of Object Stores in Action.

Thank You!