WHITE PAPER PANZURA CLOUD STORAGE SYSTEM

Similar documents
A Virtual Filer for VMware s Virtual SAN A Maginatics and VMware Joint Partner Brief

Quantum DXi6500 Family of Network-Attached Disk Backup Appliances with Deduplication

White Paper: Nasuni Cloud NAS. Nasuni Cloud NAS. Combining the Best of Cloud and On-premises Storage

UniFS A True Global File System

Cloud OS Vision. Modern platform for the world s apps

We look beyond IT. Cloud Offerings

OPTIMIZING PRIMARY STORAGE WHITE PAPER FILE ARCHIVING SOLUTIONS FROM QSTAR AND CLOUDIAN

Riverbed Whitewater/Amazon Glacier ROI for Backup and Archiving

Technical Brief: Global File Locking

Introduction to NetApp Infinite Volume

Disaster Recovery with the Public Cloud and Whitewater Cloud Storage Gateways

WHITE PAPER Improving Storage Efficiencies with Data Deduplication and Compression

IBM Global Technology Services September NAS systems scale out to meet growing storage demand.

Red Hat Storage Server

Why StrongBox Beats Disk for Long-Term Archiving. Here s how to build an accessible, protected long-term storage strategy for $.003 per GB/month.

BlueArc unified network storage systems 7th TF-Storage Meeting. Scale Bigger, Store Smarter, Accelerate Everything

CTERA Cloud Storage Platform Architecture

VMware VDR and Cloud Storage: A Winning Backup/DR Combination

How To Protect Data On Network Attached Storage (Nas) From Disaster

StoneFly SCVM TM for ESXi

EMC CLOUDARRAY PRODUCT DESCRIPTION GUIDE

EMC DATA DOMAIN OPERATING SYSTEM

12 Key File Sync and Share Advantages of Transporter Over Box for Enterprise

<Insert Picture Here> Refreshing Your Data Protection Environment with Next-Generation Architectures

The Evolution of Cloud Storage - From "Disk Drive in the Sky" to "Storage Array in the Sky" Allen Samuels Co-founder & Chief Architect

June Blade.org 2009 ALL RIGHTS RESERVED

Actifio Big Data Director. Virtual Data Pipeline for Unstructured Data

EMC BACKUP MEETS BIG DATA

Archiving, Backup, and Recovery for Complete the Promise of Virtualization

Disaster Recovery Strategies: Business Continuity through Remote Backup Replication

TABLE OF CONTENTS THE SHAREPOINT MVP GUIDE TO ACHIEVING HIGH AVAILABILITY FOR SHAREPOINT DATA. Introduction. Examining Third-Party Replication Models

EMC DATA DOMAIN OPERATING SYSTEM

Whitepaper. NexentaConnect for VMware Virtual SAN. Full Featured File services for Virtual SAN

WHITE PAPER. Dedupe-Centric Storage. Hugo Patterson, Chief Architect, Data Domain. Storage. Deduplication. September 2007

Big data management with IBM General Parallel File System

Every organization has critical data that it can t live without. When a disaster strikes, how long can your business survive without access to its

Selling Compellent NAS: File & Block Level in the Same System Chad Thibodeau

Cloud-integrated Storage What & Why

Protect Data... in the Cloud

Big data Devices Apps

Data Protection with IBM TotalStorage NAS and NSI Double- Take Data Replication Software

Building Backup-to-Disk and Disaster Recovery Solutions with the ReadyDATA 5200

Hypervisor-based Replication

StorReduce Technical White Paper Cloud-based Data Deduplication

Optimizing Storage for Better TCO in Oracle Environments. Part 1: Management INFOSTOR. Executive Brief

nwstor Storage Security Solution 1. Executive Summary 2. Need for Data Security 3. Solution: nwstor isav Storage Security Appliances 4.

CTERA Enterprise File Services Platform Architecture for HP Helion Content Depot

Got Files? Get Cloud!

Cloud-integrated Enterprise Storage. Cloud-integrated Storage What & Why. Marc Farley

EMC DATA DOMAIN EXTENDED RETENTION SOFTWARE: MEETING NEEDS FOR LONG-TERM RETENTION OF BACKUP DATA ON EMC DATA DOMAIN SYSTEMS

ENTERPRISE STORAGE WITH THE FUTURE BUILT IN

Using object storage as a target for backup, disaster recovery, archiving

S O L U T I O N P R O F I L E. Riverbed and EMC Deliver Capacity-Optimized Cloud Storage for Backup, Recovery, Archiving, and DR

HyperQ Storage Tiering White Paper

Archive Data Retention & Compliance. Solutions Integrated Storage Appliances. Management Optimized Storage & Migration

How To Use An Npm On A Network Device

Symantec Enterprise Vault And NetApp Better Together

The Design and Implementation of the Zetta Storage Service. October 27, 2009

Universal Backup Device with

Barracuda Backup Deduplication. White Paper

Introduction to Data Protection: Backup to Tape, Disk and Beyond. Michael Fishman, EMC Corporation

EMC AVAMAR. a reason for Cloud. Deduplication backup software Replication for Disaster Recovery

Solution Overview. Business Continuity with ReadyNAS

Software Defined Microsoft. PRESENTATION TITLE GOES HERE Siddhartha Roy Cloud + Enterprise Division Microsoft Corporation

Implementing Multi-Tenanted Storage for Service Providers with Cloudian HyperStore. The Challenge SOLUTION GUIDE

White. Paper. Addressing NAS Backup and Recovery Challenges. February 2012

Protecting Big Data Data Protection Solutions for the Business Data Lake

Managing the Unmanageable: A Better Way to Manage Storage

Storage Switzerland White Paper Storage Infrastructures for Big Data Workflows

The safer, easier way to help you pass any IT exams. Exam : Storage Sales V2. Title : Version : Demo 1 / 5

CISCO WIDE AREA APPLICATION SERVICES (WAAS) OPTIMIZATIONS FOR EMC AVAMAR

Growth of Unstructured Data & Object Storage. Marcel Laforce Sr. Director, Object Storage

SolidFire and NetApp All-Flash FAS Architectural Comparison

Maxta Storage Platform Enterprise Storage Re-defined

Storage Infrastructure as a Service

Unitrends Recovery-Series: Addressing Enterprise-Class Data Protection

WHITE PAPER RUN VDI IN THE CLOUD WITH PANZURA SKYBRIDGE

Data Protection with NETGEAR Storage. Smart Solutions for Disk-to-Disk Backup and Disaster Recovery for SMB s and Multi-Office Environments

Overcoming Backup & Recovery Challenges in Enterprise VMware Environments

5 Essential Benefits of Hybrid Cloud Backup

an introduction to networked storage

Object Oriented Storage and the End of File-Level Restores

Object Storage: A Growing Opportunity for Service Providers. White Paper. Prepared for: 2012 Neovise, LLC. All Rights Reserved.

Protezione dei dati. Luca Bin. EMEA Sales Engineer Version 6.1 July 2015

Dell Converged Infrastructure

A STORAGE SYSTEM JUST LIKE THE ONE YOU HAVE TODAY A STORAGE SYSTEM NOTHING LIKE THE ONE YOU HAVE TODAY.

Hitachi NAS Platform and Hitachi Content Platform with ESRI Image

Universal Backup Device The Essential Facts of UBD

Breaking the Storage Array Lifecycle with Cloud Storage

Diagram 1: Islands of storage across a digital broadcast workflow

Enterprise Private Cloud Storage

Efficient Backup with Data Deduplication Which Strategy is Right for You?

Dell PowerVault DL Backup to Disk Appliance Powered by CommVault. Centralized data management for remote and branch office (Robo) environments

OmniCube. SimpliVity OmniCube and Multi Federation ROBO Reference Architecture. White Paper. Authors: Bob Gropman

ATA DRIVEN GLOBAL VISION CLOUD PLATFORM STRATEG N POWERFUL RELEVANT PERFORMANCE SOLUTION CLO IRTUAL BIG DATA SOLUTION ROI FLEXIBLE DATA DRIVEN V

Transcription:

WHITE PAPER Panzura s game-changing Global Cloud Storage System technology finally brings the full power and benefits of cloud storage to enterprise customers, helping to break the unending onsite storage expansion cycle while eliminating islands of storage that inhibit cross-site user interaction and productivity and real-time data protection. Panzura makes deploying cloud storage and a global file system easy and transparent to users..

Executive Summary Today, enterprise IT executives struggle with storage growth, storage capacity balancing, and data mobility. Traditional technologies such as tape, SAN, and NAS were more than sufficient to meet the needs of the past but an increasingly distributed workforce generating massive amounts of data from multiple platforms forced enterprises to consider new storage paradigms, in particular cloud storage. But adopting the cloud as a storage tier can be terribly problematic. Integrating with existing IT environments, ensuring data security, and managing data across sites plus the cloud is not a trivial exercise. Panzura created a solution based on a global file system and unified namespace that makes adding and using cloud storage seamless and secure while enabling global file sharing, cloud-integrated NAS, and data protection, including archiving, disaster recovery, and backup. Panzura Quicksilver TM Cloud Storage Controllers deploy quickly and easily without changes to existing infrastructures, all while securing data with snapshots and military-grade encryption. Panzura Global Cloud Storage Systems make cloud storage a seamless, viable storage tier for enterprises of all size Ongoing Storage Challenges As unstructured data and metadata continue to grow upwards of 62% on average per year (IDC, 2011), enterprise IT managers face an unwinnable struggle to maintain or reduce costs while ensuring user needs (i.e., SLAs) are met. Traditional means of storing data, primarily local NAS, and protecting data, primarily tape or disk-to-disk solutions, do not scale quickly or economically enough to accommodate the growth in demand from massive expansion in data generated by individuals and organizations. In addition, constantly connected individuals demand that data be available anytime, from anywhere, without reduced performance. Limited budgets, limited technology alternatives, and strict user demand have put IT on an unsustainable trajectory that must be addressed without negatively impacting organizational performance. Data storage challenge. The primary technology for storing unstructured enterprise data today is high-performance, high-cost network-attached storage (NAS). With this technology, the standard method for addressing data growth is to add additional expensive spindles and/or new disk systems. In addition to the high hardware costs, more disks mean more datacenter space, power and cooling requirements, and added offsite capacity for replication. And because capacity expansion takes time, IT managers must try to forecast storage needs and forward provision to accommodate these potentially incorrect forecasts. Unanticipated spikes in storage demand send IT managers scrambling for added capacity and overly aggressive forecasts result in investment in idle capacity. When multiple sites are taken into consideration, using standard NAS can often result in overprovisioning at some sites and underprovisioning at others, as well as in storage islands where data at one site is not visible to users at other sites. The only way for users at one site to view files created or edited at another site is to save copies of files stored offsite to their own location, resulting in potentially significant duplication of files and commensurate overspending on expensive storage, not to mention significant version control challenges. This problem is compounded when backup and archiving also occurs locally, since duplicate copies on islands of storage means greater storage needed for data protection. (Figure 1) Data Time Hardware cost Power, cooling, space Forecasting demand Cross-site capacity balancing As an alternative to customer-owned NAS, some vendors offer what they call Cloud NAS. Almost all of these solutions suffer from limitations in performance, scale, or both, and none are yet enterprise-class. What about the nature of the data itself? On-site NAS today supports both structured and unstructured data storage. High-performance applications like databases and Exchange that utilize structured data require block storage in order to provide the response times and synchronous replication 2

speeds needed to avoid applications timing out. This storage often uses high-performance drives or, with growing frequency, SSD storage. iscsi is a common interface used for block storage to provide direct disk access to these applications. But applications using unstructured data (which represents 80% of data under management, on average) store it in file systems (which provide the structure), not as blocks, and usually use interfaces like CIFS or NFS unless the apps are rewritten for iscsi. For the most part, block storage interfaces like iscsi do not lend themselves to applications using unstructured data. As unstructured data and metadata continue to grow upwards of 62% on average per year (IDC, 2011), enterprise IT managers face an unwinnable struggle to maintain or reduce costs while ensuring user needs (i.e., SLAs) are met. In addition to compatibility, block storage interfaces suffer other shortcomings relative to file-based protocols. Because block-based applications are primarily limited to single-node storage targets, their ability to scale can be quite limited compared to file-based storage systems spanning multiple servers. And unlike with unstructured data, replication of structured data requires that the disks at the replication site be identical to those at the source site. Since applications using unstructured data are disk agnostic and can address multiple servers, they are particularly suited to solutions with CIFS or NFS interfaces and that target scalable storage, particularly object storage. Thus for optimal storage performance and cost control, administrators devote special attention to tier storage according to the specific requirements of each class of user or application. (Figure 2) Block Example Application Inteface iscsi Standard CFS/NFS Application Types Databases, Exchange Home Directories Share of Data <20% >80% Scalability Limited Massive Replication Storage Type Identical Mixed OK File Figure 2: Comparing aspects of block- and file-based storage DATA PROTECTION CHALLENGE. Data protection comprises archiving, backup, and disaster recovery (DR). The primary purpose is to maintain access to data that is no longer regularly used or to be able to recover current data if it is lost. For archiving, the authoritative copy of the data is stored offsite and recalled when needed. With backup, the data remains in use, or at least kept in primary storage, and a copy is made and stored for retrieval if the original data is lost. The most common method for protecting data is using a software application to direct data to disk or tape. Magnetic tape has been used for data storage for over 60 years and the technology has not changed all that much during that time. It is still in use due primarily to inertia (the devil you know ) and its perception as being cheap on a $/GB basis. Using tape, however, is very cumbersome, time consuming, and prone to error, making it a much derided medium for data backups and archiving. With steep reductions in the cost of disk and the development of deduplication over the last decade, disk-to-disk archive and backup has gained more and more share from tape. Disk targets range from removable (very slow) optical disk and commodity magnetic disk to specialized backup and archiving appliances. But all disk-to-disk backup still suffers from one or more of the following major drawbacks: high cost, limited functionality, vendor lock-in, limited scalability, and cumbersome deployment and management. Sometimes disk-to-disk-to-tape methodologies are also deployed. More recently, disk-to-disk-to-cloud data protection solutions have appeared, offering the scalability, availability, and economy of the cloud as a storage target. While theoretically, using the cloud is quite appealing, in practice, integrating cloud storage into an established IT infrastructure can be incredibly problematic due to issues like latency, communication protocols, and data security. DR can leverage both backup and archiving and centers around bringing operations back up when a site either partially or fully fails. DR involves rebuilding site functionality as quickly as possible so as to minimize the impact on overall IT operations. This rebuilding can occur either in the same location or offsite at another location. Traditional reliance on tape, with its complex offsite logistics and slow data search/access, has made rapid tape-based DR unachievable. Replication (mirroring or storing backup data of one location at another) is a common but potentially expensive way to implement a DR strategy if it involves full hardware duplication, which doubles datacenter capital costs. DR planning is challenging, time consuming, and difficult to get right. For these reasons, DR is often put off or avoided as long as possible, with organizations adopting a hope for the best strategy. 3

The Panzura Global Cloud Storage System To address the storage tiering and data protection challenges outlined above, Panzura went back to the drawing board to examine how data is created, stored, and consumed within an enterprise-class organization. By developing a proprietary cloud-integrated file system to accommodate both global file access and cloud storage, Panzura was able to overcome the main limitations of today s storage systems when used either as tiered NAS or for data protection. The Panzura Global Cloud Storage System comprises three components: Panzura CloudFS file system, Quicksilver Cloud Storage Controller, and Panzura OS. (Figure 3) Together, they provide a cloud-ready, high-performance and secure storage platform that enables tiered NAS, seamless global file sharing, active archiving, backup, and DR across all enterprise locations, facilitating consolidation and eliminating islands of storage. Key features of the Panzura solution are described in this section. Unified namespace File locking Access control VM, 1U, 2U CIFS/NFS, Cloud API Expandable to >200TB Encryption Deduplication Pinning Figure 3: Panzura Global Cloud Storage System components FILE-BASED STORAGE: As discussed above, storage optimized for block data will be very different from that optimized for file data, due to different requirements. Panzura developed a high-performance file-based global storage platform for the cloud to address the 80% of current data that is unstructured. By supporting CIFS and NFS transfer protocols commonly used by most applications, Quicksilver Cloud Storage Controllers can plug into existing IT infrastructures without any changes while connecting to all major cloud storage platforms, simplifying deployment and minimizing impact on operations. All data is managed under a global file system, simplifying user interaction and system administration while tying into enterprise applications and targeting both local disk and the cloud. CLOUD STORAGE: Object storage, the typical storage system used in the cloud, breaks data up and stores it as flexibly-sized containers or chunks that can be individually addressed and manipulated and stored in many locations, not tied to any particular disk. Each object usually has some associated metadata. Object storage can scale to billions of objects and exabytes of capacity while protecting data with greater effectiveness than RAID. In addition, drive failure allows for very rapid recovery vs. weeks for large capacity RAID systems. This combination of scale and robustness make object storage an ideal target for warehousing enterprise data. Panzura Global Cloud Storage Systems interface directly with all major cloud object storage APIs (http://www.panzura.com/partners/cloud-storage-partners/), avoiding vendor lock-in, and leverage object-based cloud storage as a data warehouse to provide massive scale and availability at a very compelling cost structure. GLOBAL FILE SYSTEM: The heart of any storage system for unstructured data is the file system. Key file systems that have shaped the market include VxFS (Veritas), NTFS (Microsoft), WAFL (NetApp), and ZFS (Sun). A successful file system must be highly scalable, high performing, flexible, and manageable. NetApp built much of its success around WAFL and its ONTAP OS. WAFL combined RAID and the disk device manager with the file system plus replication and snapshots (limited per volume). Its primary target is HDD. ZFS put all these elements plus encryption and deduplication in one stack, is massively scalable, and targets HDD and SSD natively but has no native cloud integration. The Panzura CloudFS file system was engineered to closely manage how files are managed and stored to provide seamless, high-performance, and robust data management. It improves on WAFL and ZFS while integrating cloud storage as a native capability. (Figure 4) 1992 2004 2008 WAFL RAID Disk Device Manager File System Replication Snapshots ZFS + Encryption + Deduplication + HDD/SSD Fiigure 4: Comparison of modern file systems + Cloud + Advanced Deduplication 4

White Paper Any user at any location can view and access files created by anyone, anywhere, at any time. The file system dynamically coordinates where files get stored, what gets sent to the cloud, who has edit and access rights, what files get locally cached for improved performance, and how data, metadata, and snapshots are managed. The structure of the file system allows for nearly unlimited scale with up to 10,000 user-managed snapshots per volume. Panzura s innovative use of metadata and snapshots for file system updates, combined with unique caching and pinning capabilities in the Quicksilver Cloud Storage Controllers, allows customers to view data and interact through an enterprise-wide file system that is continually updated in real time. Support for extended file system access control lists (ACLs) empowers administrators to set policies that determine what access and management functions per file will be available on a per user basis. Because the file system is global and shared across all controllers and because all data is also stored in the cloud, all data is always available to anyone, even if network connections are temporarily lost to one site for some reason. 1 (Figure 5) Hong Kong Boston Frankfurt Figure 5: Global file system with cloud as storage tier UNIFIED NAMESPACE: Ideally, a global storage solution would present a consistent view of all files to a user regardless of where the files are stored in the system and regardless of from which location a user is accessing the file system. Panzura has merged a global files system with a unified namespace to present a seamless view of file across the network. The Panzura unified namespace presents users with a single, unified and consistent view of all files across all locations and consists of the sum of all local Panzura file systems that are known on the network. The Panzura CloudFS file system intelligence tracks all copies of all files, including different versions of the same file, so users always have an up-to-date directory available to them. Because Panzura has interlaced its unified namespace and global file system, the resulting file intelligence makes file storage and tracking transparent to the user and ensures that the file is available whenever and wherever it is needed. Nodes can be added as necessary and all have rapid access to all files on the network, for truly seamless scale-out capability. (Figure 6) New Delhi (Sydney) Figure 6: Adding a node seamlessly updates global file system 1) Any data not yet uploaded to the cloud from a controller will not be available to other sites if network access to that controller is lost. 5

DISTRIBUTED FILE LOCKING: Panzura Global Cloud Storage Systems are designed with unique global file locking that manages write access across the entire network, preventing data collisions and version corruption. It is impossible for more than one person on the network to edit a given version of a file at any given time and cause a data collision. When one user is editing a file, that file is read-only locked to everyone else, and anyone accessing that version of the file will receive a message that it is read-only locked by user X (they can always save their edits under a different version of the file). This will continue until the first user has completed their file revisions. At that point, when another user requests write access, the file lock is transferred to that user s Quicksilver Cloud Storage Controller and all other users are locked out of write access to that file until this next user completes the edits. (Figure 7) This robust, elegant, fast system allows global file access and rapid file sharing while ensuring version integrity, avoiding the need to store multiple copies of the same file at multiple locations and the associated unnecessary network and storage capacity consumption from these duplicate files. Rome File A: File A: File A: Locked by User 2 Figure 7: Dynamic file locking preserves file integrity and prevents data collision GLOBAL DEDUPLICATION: Unlike other deduplication solutions, which were designed to offset inherent data duplication in localized, inefficient file systems, Panzura designed an interconnected, global file system that stops file-level duplication before data gets stored. Since only unique copies of files across all sites are preserved by the file system, data is deduplicated before it is ever stored. Capacity is optimized further by running advanced, inline block-level deduplication on any data that gets stored on the network to remove blocks common across different files. Unlike any other deduplication provider, Panzura embeds the deduplication reference table in metadata, which is instantly shared among all Quicksilver Cloud Storage Controllers. This inline deduplication method removes data redundancy across controllers, rather than just based on data seen by a single controller. Thus each controller in the network benefits from data seen by all other controllers, ensuring even greater capacity reduction, guaranteeing all data in the cloud is unique, and driving down cloud storage and network capacity (and cost) consumed by the enterprise. (Figure 8) Rome A D F C A D C A B E F B E B A B C D E F User 2 Figure 8: Data is deduplicated within and across sites for maximum data reduction MILITARY-GRADE ENCRYPTION: One of the top concerns most frequently expressed by IT professionals about cloud storage is data security. Because data is being transmitted to and stored by a 3rd-party cloud storage provider outside the corporate firewall, some worry that their data will be exposed and at risk for theft. The perception is that keeping data inside the firewall is inherently safer. This concern must be overcome by any cloud storage solution before it can become mainstream within an enterprise. 6

Panzura addresses data security concerns directly by applying military-grade encryption to all data stored in the cloud. Each controller applies AES- 256-CBC encryption for all data at rest in the cloud. In addition, all data transmitted to or from the cloud is encrypted with SSL v3.1 to prevent access via interception. Encryption keys are managed by the enterprise, never stored in the cloud. This complete, robust two-tier encryption solution is in addition to the typical multi-layer security provided by mainstream cloud storage providers. (Figure 9) In some cases, customers find that the combined security of a Panzura+cloud solution is greater than they can reasonably achieve within their own infrastructure, making cloud storage safer than some private cloud deployments JACK AND JILL WENT UP THE HILL... File encrypted before transmission 1@3Fa$% As%$#m :45^&GFJbfg SSL v3.1 vc!@dsff 34%hfdgGH& <tghf$*fx AES-256-CBC WAN Figure 9: Military-grade encryption protects data in transit and at rest in the cloud. Summary The cloud offers tremendous potential for enterprises to reduce storage costs, improve productivity, and reduce data availability risk. Tapping that potential fully and effectively can provide significant competitive advantage while reducing both business and technological risk. To date, enterprises attempting to fully integrate the cloud as a storage tier have been faced with building their own limited-capability solution by kludging together different technologies from various vendors, many of which were never designed to be used with cloud storage. This Frankenstein cloud storage implementation fails to realize the full benefits of cloud storage while consuming precious IT resources in implementation and management. Panzura s Global Cloud Storage System breaks this cycle with a fully cloud-integrated enterprise storage solution to handle NAS, active archiving, DR, and backup. By designing a global file system and namespace with cloud integration at a fundamental level, the Panzura solution brings the cloud as a seamless storage tier for the first time while enabling global file sharing and full access to all files in the system from any location at any time. This game-changing technology finally brings the full power and benefits of cloud storage to enterprise customers, helping to break the unending onsite storage expansion cycle while eliminating islands of storage that inhibit cross-site user interaction and productivity and real-time data protection. Panzura makes deploying cloud storage and a global file system easy and transparent to users. 695 Campbell Technology Parkway #225 Campbell, CA 95008 +1 (408) 578-8888 For more information: info@panzura.com For sales: sales@panzura.com 7