The Design and Implementation of the Zetta Storage Service. October 27, 2009

Similar documents
How To Store Data On A Server Or Hard Drive (For A Cloud)

OPTIMIZING PRIMARY STORAGE WHITE PAPER FILE ARCHIVING SOLUTIONS FROM QSTAR AND CLOUDIAN

Getting performance & scalability on standard platforms, the Object vs Block storage debate. Copyright 2013 MPSTOR LTD. All rights reserved.

Designing a Cloud Storage System

The last 18 months. AutoScale. IaaS. BizTalk Services Hyper-V Disaster Recovery Support. Multi-Factor Auth. Hyper-V Recovery.

Scale out NAS on the outside, Object storage on the inside

Software Defined Microsoft. PRESENTATION TITLE GOES HERE Siddhartha Roy Cloud + Enterprise Division Microsoft Corporation

<Insert Picture Here> Oracle Cloud Storage. Morana Kobal Butković Principal Sales Consultant Oracle Hrvatska

Introduction to Cloud : Cloud and Cloud Storage. Lecture 2. Dr. Dalit Naor IBM Haifa Research Storage Systems. Dalit Naor, IBM Haifa Research

ENABLING GLOBAL HADOOP WITH EMC ELASTIC CLOUD STORAGE

Building Storage Clouds for Online Applications A Case for Optimized Object Storage

Direct NFS - Design considerations for next-gen NAS appliances optimized for database workloads Akshay Shah Gurmeet Goindi Oracle

Software-defined Storage Architecture for Analytics Computing

Diagram 1: Islands of storage across a digital broadcast workflow

Implementing Multi-Tenanted Storage for Service Providers with Cloudian HyperStore. The Challenge SOLUTION GUIDE

Cloudian delivers object storage for next generation infrastructures

Workspace & Storage Infrastructure for Service Providers

Long term retention and archiving the challenges and the solution

Object Storage: A Growing Opportunity for Service Providers. White Paper. Prepared for: 2012 Neovise, LLC. All Rights Reserved.

Scality RING High performance Storage So7ware for pla:orms, StaaS and Cloud ApplicaAons

Neil Stobart Cloudian Inc. CLOUDIAN HYPERSTORE Smart Data Storage

HadoopTM Analytics DDN

THE SUMMARY. ARKSERIES - pg. 3. ULTRASERIES - pg. 5. EXTREMESERIES - pg. 9

The next step in Software-Defined Storage with Virtual SAN

Network Attached Storage. Jinfeng Yang Oct/19/2015

How To Manage A Single Volume Of Data On A Single Disk (Isilon)

Business-centric Storage for small and medium-sized enterprises. How ETERNUS DX powered by Intel Xeon processors improves data management

THE FIRST LOCAL ENTERPRISE CLOUD STORAGE FEATURES. Enterprise iscsi (Block) & NFS/ CIFS (File) Storage-as-a-Service

EMC IRODS RESOURCE DRIVERS

IBM General Parallel File System (GPFS ) 3.5 File Placement Optimizer (FPO)

IBM PureData System for Transactions. Technical Deep Dive. Jonathan Rossi, PureSystems Specialist

High Availability Databases based on Oracle 10g RAC on Linux

A Virtual Filer for VMware s Virtual SAN A Maginatics and VMware Joint Partner Brief

Business-centric Storage for small and medium-sized enterprises. How ETERNUS DX powered by Intel Xeon processors improves data management

VMware Software-Defined Storage Vision

OPTIMIZING EXCHANGE SERVER IN A TIERED STORAGE ENVIRONMENT WHITE PAPER NOVEMBER 2006

StoneFly SCVM TM for ESXi

Business Continuity with the. Concerto 7000 All Flash Array. Layers of Protection for Here, Near and Anywhere Data Availability

Introduction to Gluster. Versions 3.0.x

(Scale Out NAS System)

SUSE Storage. FUT7537 Software Defined Storage Introduction and Roadmap: Getting your tentacles around data growth. Larry Morris

Archive Data Retention & Compliance. Solutions Integrated Storage Appliances. Management Optimized Storage & Migration

A virtual SAN for distributed multi-site environments

Building Storage Service in a Private Cloud

MaxDeploy Ready. Hyper- Converged Virtualization Solution. With SanDisk Fusion iomemory products

Virtualizing Apache Hadoop. June, 2012

Microsoft Azure Cloud oplossing als een extensie op mijn datacenter? Frederik Baert Solution Advisor

PARALLELS CLOUD STORAGE

Scala Storage Scale-Out Clustered Storage White Paper

Oracle s Cloud Computing Strategy

Boas Betzler. Planet. Globally Distributed IaaS Platform Examples AWS and SoftLayer. November 9, IBM Corporation

VMware Software-Defined Storage & Virtual SAN 5.5.1

Whitepaper. NexentaConnect for VMware Virtual SAN. Full Featured File services for Virtual SAN

Product Spotlight. A Look at the Future of Storage. Featuring SUSE Enterprise Storage. Where IT perceptions are reality

A STORAGE SYSTEM JUST LIKE THE ONE YOU HAVE TODAY A STORAGE SYSTEM NOTHING LIKE THE ONE YOU HAVE TODAY.

Data Domain Overview. Jason Schaaf Senior Account Executive. Troy Schuler Systems Engineer. Copyright 2009 EMC Corporation. All rights reserved.

XpoLog Competitive Comparison Sheet

Data Storage. Vendor Neutral Data Archiving. May 2015 Sue Montagna. Imagination at work. GE Proprietary Information

Nimble Storage + OpenStack 打 造 最 佳 企 業 專 屬 雲 端 平 台. Nimble Storage Brian Chen, Solution Architect Jay Wang, Principal Software Engineer

The Economics of File-based Storage

CompTIA Cloud+ 9318; 5 Days, Instructor-led

SOFTWAREDEFINED-STORAGE

The OpenStack TM Object Storage system

Storage Design for High Capacity and Long Term Storage. DLF Spring Forum, Raleigh, NC May 6, Balancing Cost, Complexity, and Fault Tolerance

EMC ISILON OneFS OPERATING SYSTEM Powering scale-out storage for the new world of Big Data in the enterprise

CompTIA Cloud+ Course Content. Length: 5 Days. Who Should Attend:

EMC AVAMAR. a reason for Cloud. Deduplication backup software Replication for Disaster Recovery

EMC DATA DOMAIN OPERATING SYSTEM

VirtualclientTechnology 2011 July

The Modern Virtualized Data Center

EMC DATA DOMAIN OPERATING SYSTEM

ntier Verde Simply Affordable File Storage


Get Success in Passing Your Certification Exam at first attempt!

EMC BACKUP AND RECOVERY SOLUTIONS

VMware Virtual Machine File System: Technical Overview and Best Practices

The Evolution of Cloud Storage - From "Disk Drive in the Sky" to "Storage Array in the Sky" Allen Samuels Co-founder & Chief Architect

Zetta Builds Primary Cloud Storage to Enterprise Specifications. Analyst: Anne MacFarland

StorSimple Solution for File Share Deployments

CERN Cloud Storage Evaluation Geoffray Adde, Dirk Duellmann, Maitane Zotes CERN IT

Ultimate Guide to Oracle Storage

Powerful Management of Financial Big Data

The IntelliMagic White Paper: Storage Performance Analysis for an IBM Storwize V7000

Cloud Optimize Your IT

Cloud OS Vision. Modern platform for the world s apps

IBM TSM DISASTER RECOVERY BEST PRACTICES WITH EMC DATA DOMAIN DEDUPLICATION STORAGE

MaxDeploy Hyper- Converged Reference Architecture Solution Brief

Cloud Storage. Parallels. Performance Benchmark Results. White Paper.

Milestone Solution Partner IT Infrastructure Components Certification Summary

What s New in VMware Virtual SAN TECHNICAL WHITE PAPER

POWER ALL GLOBAL FILE SYSTEM (PGFS)

Remote/Branch Office IT Consolidation with Lenovo S2200 SAN and Microsoft Hyper-V

Object Oriented Storage and the End of File-Level Restores

WOS Cloud. ddn.com. Personal Storage for the Enterprise. DDN Solution Brief

Transcription:

The Design and Implementation of the Zetta Storage Service October 27, 2009

Zetta s Mission Simplify Enterprise Storage Zetta delivers enterprise-grade storage as a service for IT professionals needing primary storage solutions. 2

Market Landscape Tier 0/1 storage consumers (correctly) risk adverse existing enterprises are not going to take mission critical transactional database and plug it into the cloud Network latency / speed of light an issue for many (not all) use cases Unstructured (file) data is majority of growth in terms of data footprint Lots of concerns about security, reliability, data integrity, etc, but examples of complete outsourcing of mission critical sensitive data (salesforce, email) also common CapEx, OpEx, and administration challenges colliding 3

Zetta Design Objectives Data Integrity Continuous Availability (failures, releases, scale out, moves, always consistent on disk) Strong consistency (respect sync() ), be POSIX Compatible Multi Tenant (IO performance as well as footprint) Tiered Design, with independent horizontal scalability Economically Viable (commodity components) 4

A Tale of Two Boxes for just 40TB Less internal redundancy, less hot swap, lower quality Internal Software Raid Less expensive purchase price False economy for most IT environments Redundanty PSU, hot swap fans, higher quality Internal Software Raid More expensive purchase price Good choice for most IT environments 5

Beyond the Single Box Approach Positives Negatives Two cheap boxes, mirrored more available & often less capital cost than one expensive box What does the mirroring? Conflict resolution? Performance? Higher OpEx. Application Partitioning Great solution Not applicable for many enterprise apps. Availability decreases as nodes increase (or cost increases for replication) Distributed File Replication Gets you past the one box Complexity of the management layer, higher OpEx Buy the Big SAN Mature Expensive, complex 6

Distributed File Replication Hadoop Data protection assumed to be at lower level (ie, raid card in every node) OR replicate every piece of data Great for analytics, not a general purpose file system MogileFS Lustre Open source, small install base, perl file-location-tracker Replicates data for protection No data integrity checking (hash/crc) Data protection assumed to be at lower level (ie, raid card in every node) Parascale Commercial could we really run it so much better, that people would use us rather than run it themselves? (no.) Data protection through replication Overall Software layer is complex, generally has single points of failure Replication less space / opex / capital efficient than erasure coding None designed for multi-tenancy 7

Our Conclusion In order to meet service objectives, we can t use off the shelf technology and have to build something new. 8

The Implementation ZettaFS Distributed File System All elements implemented as network services Centralized Metadata, holds inode equivalents (on SSD) 10Gbps low latency ethernet Basic unit of storage is a chunk, striped and protected across discrete nodes 9

The Implementation Protocol Translator == NAS Head Xen VM-ZettaFS appears as local file system Pulls config and authentication creds from LDAP QoS management Caching Reference Synchronization 10

The Implementation Zstack == RAID Controller Reed-solomon chunk encoding / recovery Write cache (local SSD & consensus quorum protocol) Metadata management Lock Manager Geo-Replication Chunk placement rebalancing/optimization 11

The Implementation Metadata DB N+3 protection Volume -> file maps File -> chunk maps Raid stripe maps Scalable / partitioned 12

The Implementation Chunk Stores == Disks Caching Layer Encryption / Decryption 100% on-disk encryption Background hash validation Foreground read verification 13

Other Key Features Most NFS/CIFS requests are handled as metadata operations, which don t require accessing the spindle layer Clustered mount capabilities Variable performance per volume virtualize IO capacity, not just space Federated Authentication (LDAP) Service Practices as important as technology 14

System Management Portal Intuitive Interface Powerful, yet simple to use and manage, no training needed Easy setup rapidly provision, configure, and mount storage volumes through UI Full control framework User-controlled snapshot management Transparency Automated alerting, reporting and real-time detailed status views Actionable and self-healing Full notification framework with auto-corrective actions Delegated administration User and responsibility delegation for storage and service management 15

Use Cases for Zetta Storage DB/Exchange DataMart Real-time commit requirements Business Continuity Data Warehouse Primary File Server Compliance Storage Bursting HSM/Roll Off File system required Strong consistency required 7200 RPM performance acceptable Active Archive Offline DR Backup File system brings marginal benefit Minimal performance requirements 16

Customer Proof Points Consumer/SMB IT/Media Services Provider 4TB/day ingest Expected Volume Size: 750T 1 PB Connected via 10Gbps Dedicated Circuit Large Silicon Valley Law Firm Security, Data Integrity, SLA requirements Need Snapshots, File System Connected via Cross Connect Large Public University Connected via Internet 17

Zetta Business Model Integrated Offering All robust features included in base offering Future protocols, APIs, performance improvement included Customer service and support included One Simple Price Starts @ $.25 per GB per month for 1 TB Discounts for footprint volume, term Connectivity options for lowest network % of TCO 18

Geo Expansion Live 2011 2010 2010 2010 2011 See more @ www.zetta.net 19