Utilizing the SDSC Cloud Storage Service

Similar documents
U"lizing the SDSC Cloud Storage Service

Data Services for Campus Researchers

Building a Cloud Computing Platform based on Open Source Software Donghoon Kim ( donghoon.kim@kt.com ) Yoonbum Huh ( huhbum@kt.

David Minor. Chronopolis Program Manager Director, Digital Preserva7on Ini7a7ves UCSD Library San Diego Supercomputer Center

Long term retention and archiving the challenges and the solution

Seagate Cloud Systems & Solutions

FLOSSK: FLOSSTalk OpenStack 22 nd February, Arturo Suarez: Founder, COO&BizDev StackOps 21/02/12 1

Adrian Otto,

StorReduce Technical White Paper Cloud-based Data Deduplication

New Storage System Solutions

Clodoaldo Barrera Chief Technical Strategist IBM System Storage. Making a successful transition to Software Defined Storage

RELEASING CLOUDS OF DATA

THE FIRST LOCAL ENTERPRISE CLOUD STORAGE FEATURES. Enterprise iscsi (Block) & NFS/ CIFS (File) Storage-as-a-Service

EMC BACKUP MEETS BIG DATA

Hadoop on the Gordon Data Intensive Cluster

Industry Brief. The Epic Migration. to Software Defined Storage. SUSE Enterprise Storage. Featuring

SWIFT. Page:1. Openstack Swift. Object Store Cloud built from the grounds up. David Hadas Swift ATC. HRL 2012 IBM Corporation

SMART SCALE YOUR STORAGE - Object "Forever Live" Storage - Roberto Castelli EVP Sales & Marketing BCLOUD

Zadara Storage Cloud A

SoftLayer Fundamentals. Storage and Backup. August, 2014

Hitachi Content Platform. Andrej Gursky, Solutions Consultant May 2015

Protecting Information in a Smarter Data Center with the Performance of Flash

Mit Soft- & Hardware zum Erfolg. Giuseppe Paletta

PACE Predictive Analytics Center of San Diego Supercomputer Center, UCSD. Natasha Balac, Ph.D.

Intro to AWS: Storage Services

Cloud Storage and Backup

How To Store Data On A Server Or Hard Drive (For A Cloud)

MAKING THE BUSINESS CASE

cloud functionality: advantages and Disadvantages

Mobile Cloud Computing T Open Source IaaS

FAQ RIVERBED WHITEWATER FREQUENTLY ASKED QUESTIONS

SwiftStack Filesystem Gateway Architecture

Archiving On-Premise and in the Cloud. March 2015

IBM TSM DISASTER RECOVERY BEST PRACTICES WITH EMC DATA DOMAIN DEDUPLICATION STORAGE

Hitachi NAS Platform and Hitachi Content Platform with ESRI Image

How To Speed Up A Flash Flash Storage System With The Hyperq Memory Router

Investigating Private Cloud Storage Deployment using Cumulus, Walrus, and OpenStack/Swift

Data Storage. Vendor Neutral Data Archiving. May 2015 Sue Montagna. Imagination at work. GE Proprietary Information

CERN Cloud Storage Evaluation Geoffray Adde, Dirk Duellmann, Maitane Zotes CERN IT

Wrangler: A New Generation of Data-intensive Supercomputing. Christopher Jordan, Siva Kulasekaran, Niall Gaffney

(Scale Out NAS System)

Prospectus for the Proposed IDRE Cloud Archival Storage Program

Cloudian delivers object storage for next generation infrastructures

ENABLING GLOBAL HADOOP WITH EMC ELASTIC CLOUD STORAGE

The Convergence of Software Defined Storage and Physical Appliances Hybrid Cloud Storage

Amazon Cloud Storage Options

Using object storage as a target for backup, disaster recovery, archiving

The Zadara Storage Cloud A Validation of its Use Cases and Economic Benefits

SMB Direct for SQL Server and Private Cloud

Infrastructure as a Service (IaaS)

POWER ALL GLOBAL FILE SYSTEM (PGFS)

Product Spotlight. A Look at the Future of Storage. Featuring SUSE Enterprise Storage. Where IT perceptions are reality

IBM Spectrum Protect in the Cloud

How To Protect Data On Network Attached Storage (Nas) From Disaster

Quantum DXi6500 Family of Network-Attached Disk Backup Appliances with Deduplication

Protect Data... in the Cloud

Boas Betzler. Planet. Globally Distributed IaaS Platform Examples AWS and SoftLayer. November 9, IBM Corporation

NetApp Data Fabric: Secured Backup to Public Cloud. Sonny Afen Senior Technical Consultant NetApp Indonesia

Cloud Storage. Deep Dive. Extending storage infrastructure into the cloud SPECIAL REPORT JUNE 2011

Data Storage Options for Research

11/13/2013. Research IT Office. Data Storage Options for Research. By Ashok Mudgapalli Director of Research IT. Agenda

Appro Supercomputer Solutions Best Practices Appro 2012 Deployment Successes. Anthony Kenisky, VP of North America Sales

IT Survey Frank Dwyer Senior Director, Information Technology The Salk Institute, La Jolla, CA

DDN updates object storage platform as it aims to break out of HPC niche

Cloud Platform Comparison: CloudStack, Eucalyptus, vcloud Director and OpenStack

The safer, easier way to help you pass any IT exams. Exam : Storage Sales V2. Title : Version : Demo 1 / 5

Service Description Cloud Storage Openstack Swift

The Design and Implementation of the Zetta Storage Service. October 27, 2009

Data management challenges in todays Healthcare and Life Sciences ecosystems

ioscale: The Holy Grail for Hyperscale

How To Build A Cloud Stack For A University Project

Red Hat Storage Server

Workspace & Storage Infrastructure for Service Providers

Bringing Much Needed Automation to OpenStack Infrastructure

ETERNUS CS High End Unified Data Protection

CONVERGED DATA STORAGE SOLUTIONS. SAN Scale-Out NAS Archive

QUICK REFERENCE GUIDE: KEY FEATURES AND BENEFITS

Cloud for Your Business

Understanding AWS Storage Options

GTC Presentation March 19, Copyright 2012 Penguin Computing, Inc. All rights reserved

2) Xen Hypervisor 3) UEC

We look beyond IT. Cloud Offerings

WHY DO I NEED FALCONSTOR OPTIMIZED BACKUP & DEDUPLICATION?

Building Storage-as-a-Service Businesses

The OpenStack TM Object Storage system

Understanding Object Storage and How to Use It

Building Storage as a Service with OpenStack. Greg Elkinbard Senior Technical Director

Milestone Solution Partner IT Infrastructure MTP Certification Report Scality RING Software-Defined Storage

Deploying ArcGIS for Server using Managed Services

CLOUD BLOCK STORAGE CONSISTENT AND RELIABLE STORAGE PERFORMANCE IN THE CLOUD

Gladinet Cloud Access Solution Simple, Secure Access to Online Storage

Storage for Different Compute Clouds

STORIANT TECHNOLOGY DEEP DIVE. Data Storage for a New Generation

Performance, Reliability, and Operational Issues for High Performance NAS Storage on Cray Platforms. Cray User Group Meeting June 2007

SGI Solutions for RDSI/CAUDIT 2013 SGI

IBM Smart Business Storage Cloud

RED HAT OPENSTACK PLATFORM A COST-EFFECTIVE PRIVATE CLOUD FOR YOUR BUSINESS

VMware vsphere Data Protection 6.0

I/O Considerations in Big Data Analytics

Transcription:

Utilizing the SDSC Cloud Storage Service PASIG Conference January 13, 2012 Richard L. Moore rlm@sdsc.edu San Diego Supercomputer Center University of California San Diego

Traditional supercomputer center Functional Systems storage systems Tape-based archival system Built for capacity We ve extended the archive beyond HPC simulation data to experimental data and other digital assets - and as a node in geographically-distributed digital preservation systems (e.g. Chronopolis) High-bandwidth parallel file system Built for speed Transient data, single-copy reliability Home directory system (e.g. NFS) Built for robustness and reliability Regular backups Limitations Archival data is difficult to access - high latency, lower bandwidth, user interfaces Difficult to share archival data by multiple users All too often archived data, particularly HPC simulations, is write-once-read-never Not sustainable and no incentives for users to retain only high-value data

Adapting to emerging requirements and changing technologies Exponential data growth - and analysis of that data - are increasingly important to the research enterprise Requires ready access to data, w/ low latency & high bandwidth Collaborative team science demands easy data sharing Consumer product development drives prices Disk capacities increasing quickly Flash memory becoming more affordable Gordon compute system just now being deployed with 0.25 PB of flash - to fill the latency gap between DRAM and spinning disk For HPC systems with historical byte/flop ratios, storage would be an increasingly significant fraction of total system cost Can t afford open-ended archival storage must develop methods to place value on data, especially for long-term high-reliability storage

SDSC is deploying a new repertoire of storage systems SDSC Cloud Storage of Digital Data for Ubiquitous Access and High-Durability Access: Multi-platform web interface, S3 interfaces, backup SW Data Oasis (PFS) High-Performance Transient Parallel File System for HPC Access: Lustre on HPC Systems (Gordon, Trestles, Triton) Project Storage Purpose: Typical Project / User File Server Storage Needs Access: NFS/CIFS, isci

A Paradigm Shift for Long-Term Storage: Access, Sharing and Collaboration SDSC Cloud http://cloud.sdsc.edu Launched September 2011 Largest, highest-performance known academic cloud 5.5 Petabytes (raw), 8 GB/sec System can upload 500GB in ~1 min Automatic dual-copy and verification Capacity and performance scale linearly to 100 s of petabytes Open source platform based on NASA and RackSpace software 5

Key Features of SDSC Cloud Always-there disk-based availability of data Tape latency and multi-user issues addressed High reliability Disk RAID; automatic dual-copy; continuous background checksum verification/ restoration; offsite replication soon Simple data owner user interfaces to data, its management, its access and setting permissions for sharing data Easy access to shared data for any users with permission under range of mechanisms (http, APIs, portals, gateways ) Encryption readily incorporated and addresses issues of storing HIPAA/proprietary data Transaction history is logged track usage, assess utility, support provenance Scalable system in both capacity and bandwidth Interfaces to commercial and open-source products

Applications of SDSC Cloud Shared/published/curated data collections HPC simulation data storage and sharing Web/portal applications and site hosting Application integration using supported APIs Serving images/videos Backup services

Why Openstack Swift Cloud Software? Evaluated Software OpenStack Swift Open Source Community Support Highly Configurable Eucalyptus Highly Flexible Compute Focused Caringo Castor Commercial Software Long Development Cycle Industry Standard More than 100 leading companies from over a dozen countries are participating in OpenStack, including Cisco, Citrix, Dell, Intel and Microsoft. Highly Compatible Compatibility w/ public OpenStack clouds means it s easy to migrate data and apps to public clouds when desired based on security policies, economics, and other key business criteria. Proven Software Running the OpenStack cloud operating system is same software that powers many large public and private clouds, including RackSpace Cloud Storage. Control & Flexibility Open source platform means not locked to a proprietary vendor, and modular design can integrate with legacy or 3rd-party technologies. OpenStack project provided under Apache 2.0 license.

SDSC Cloud Interfaces Data Owners Traditional Clients GUI Applications Command Line SDSC Web I/F Load Balanced Proxy Servers External Users Web Services API Amazon S3 Rackspace CloudFiles / Openstack API Swift Object Storage Cluster Commercial Products Commvault Amanda Backup Tools Crashplan User- Developed Web Portals/ Gateways

SDSC Cloud Explorer

Rates and Funding Mechanisms See https://cloud.sdsc.edu/hp/pricing.php for current pricing; HW costs subject to market volatility; contact services@sdsc.edu if interested in service On Demand Cloud Storage Pay monthly per GB used (water-mark) U California users: $X/TB-Year dual-copy + applicable indirect costs + 50% premium for additional off-site copy (when available) Users external to UC: 2*$X/TB-year dual-copy, 3*X for dual-copy + 1 off-site copy Condo Cloud Storage Recipient buys HW that is integrated into the storage service and pays annual operating costs for maintenance and system administration Purchase condo HW at $Y market price (pre-configured head node and disk array - currently 2TB drives with 8.5 TB usable dual-copy; space will increase over time) Annual operating cost: $Z/year/condo + applicable indirect costs & UC-external factors User has right to use condo for 5 years; TCO/condo = $Y + 5*Z over 5 years *Encryption and HIPAA Compliant Storage is available with both options

Questions? Get a trial account with an.edu email address cloud.sdsc.edu (no charges first 30 days)