David Minor. Chronopolis Program Manager Director, Digital Preserva7on Ini7a7ves UCSD Library San Diego Supercomputer Center
|
|
|
- Eric Clarence Bailey
- 10 years ago
- Views:
Transcription
1 David Minor Chronopolis Program Manager Director, Digital Preserva7on Ini7a7ves UCSD Library San Diego Supercomputer Center
2 SDSC Cloud now in produc7on UCSD Library DAMS use of Cloud DuraCloud + SDSC Cloud DuraCloud + Chronopolis Digital Preserva7on Network
3 SDSC Cloud Storage Built on OpenStack swih object storage sohware. Supports Rackspace/ SwiH API and a subset of Amazon s S3. Ini7al 5.5PB Raw Storage, scalable to over 100PB with equal performance scaling. Supported by redundant Arista Networks 7508 switches, providing 768 total 10 gigabit (Gb) Ethernet ports for more than 10Tbit/s of non- blocking, IP- based connec7vity. Peak transfer rates of up to 8GB/sec. Direct connec7ons to CENIC, ESNet, and XSEDE networks provides high- bandwidth wide area connec7vity. Managed with Rocks cluster toolkit. SwiH Roll available soon. AES 256- bit server side encryp7on for at rest data is available and deployed in a local HIPAA environment. h`p://cloud.sdsc.edu
4 Service Architecture Object Storage Services Storage Customers TradiConal Clients GUI Applica7ons Command Line Tools Dropbox style website Web Services API Amazon S3, Rackspace Cloud Services Commercial Vendors Commvault Backups Or most Rackspace API Supported Apps.. Instantly Available to External Users Via HTTP and Other Cloud Users Load Balanced Proxy Servers Swi> Object Storage Cluster
5 Goals for SDSC Cloud Support NSF Data Management Plans Required Plan to describe how research results are shared. 99.5% system availability File replica7on automated Default 2 copies, able to keep addi7onal offsite replica7ons. Automated checksum verifica7on and error correc7on Scalable Performance and capacity grows by incremental bricks. Mul7faceted accessibility Web, API, Graphical and Command Line Clients h`p://cloud.sdsc.edu Cost compe77ve Operated as a recharge service On par with current tape- based dual- copy costs of $0.0325/GB/Mo.
6 UCSD Library DAMS use of Cloud (slides provided by Declan Fleming)
7 DAMS Storage Story Started with 4T on SRB 8 years ago Eventually moved everything to local SAMBA Grown to 16T now UCSD research data pilots required 100T+
8 DAMS Implementa7on RackSpace CloudFiles Java Library RESTian 13% faster for small files 17% faster for large files SAMBA vs. cloud upload and download speeds quite similar Colocated server at SDSC produced download speeds some7mes 50% faster with OpenStack
9 Large Files with OpenStack Files larger than 10GB revealed a bug OpenStack requires breaking files larger than 5GB into segments returns those segments in lexical sort order, rather than in crea7on order So if the segments are named 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, they will be returned in the order 1, 10, 2, 3, 4, 5, 6, 7, 8, 9 Padding the numbers "0001" to make sure they sort in numerical order fixed this problem Once dealt with, performance was independent of file size
10 OpenStack Storage Organiza7on Containers vs Subdirectories How many containers should we use? One per object? One container for all objects? Tested with TIFFS, 185G, from 1 to containers Sweet spot was around 100 containers Learned later that every container is a SQLite instance Also learned that some other services will limit the number of containers per user
11 Victory declared. Now star7ng process of moving dozens of terabytes of research data into cloud- hosted DAMS. Declan Fleming - [email protected]
12 DuraCloud and SDSC Cloud
13 DuraCloud + SDSC Cloud SDSC Cloud now an op7on in DuraCloud Available in similar fashion as Amazon and Rackspace DuraCloud customers can now have a single service agreement and interface to access commercial and academic cloud providers
14 DuraCloud + SDSC Cloud "SDSC is an important strategic partner for us in that they are the first academic- based produc7on cloud offering and we envision together building an academic- based open cloud infrastructure with a layer of services provided by DuraCloud. We see this as the first step in this direc7on and are thrilled. - Michele Kimpton, DuraSpace CEO
15 DuraCloud + Chronopolis integra7on
16 Chronopolis + DuraCloud Chronopolis will be a preserva7on op7on within DuraCloud offerings Rolling into produc7on this fall On the backend, enabled through SDSC Cloud and related services
17 Process Data movement between systems happens in SDSC cloud Mediated by set of RESTful services Data handed off from DuraCloud with manifest, checksums, etc.
18 SDSC Cloud RESTful server SDSC Chronopolis storage NCAR UMD
19
20 Outcome Successful integra7on of two NDIIPP- supported preserva7on projects Chronopolis offers a TRAC- cer7fied archival backend to DuraCloud Chronopolis offers a low- cost snapshot of DuraCloud holdings DuraCloud offers a seamless op7on for orgs wan7ng data services and preserva7on Demonstra7on of uni7ng two systems via channels of communica7on and services
21 For more informa7on Andrew Woods, DuraCloud: David Minor, Mike Smorul, Chronopolis: Also will be presented as a poster at ipres
U"lizing the SDSC Cloud Storage Service
U"lizing the SDSC Cloud Storage Service PASIG Conference January 13, 2012 Richard L. Moore [email protected] San Diego Supercomputer Center University of California San Diego SAN DIEGO SUPERCOMPUTER CENTER
Perspec'ves on SDN. Roadmap to SDN Workshop, LBL
Perspec'ves on SDN Roadmap to SDN Workshop, LBL Philip Papadopoulos San Diego Supercomputer Center California Ins8tute for Telecommunica8ons and Informa8on Technology University of California, San Diego
Cost Effective Backup with Deduplication. Copyright 2009 EMC Corporation. All rights reserved.
Cost Effective Backup with Deduplication Agenda Today s Backup Challenges Benefits of Deduplication Source and Target Deduplication Introduction to EMC Backup Solutions Avamar, Disk Library, and NetWorker
Data Management in the Cloud: Limitations and Opportunities. Annies Ductan
Data Management in the Cloud: Limitations and Opportunities Annies Ductan Discussion Outline: Introduc)on Overview Vision of Cloud Compu8ng Managing Data in The Cloud Cloud Characteris8cs Data Management
Hunk & Elas=c MapReduce: Big Data Analy=cs on AWS
Copyright 2014 Splunk Inc. Hunk & Elas=c MapReduce: Big Data Analy=cs on AWS Dritan Bi=ncka BD Solu=ons Architecture Disclaimer During the course of this presenta=on, we may make forward looking statements
Redefining Oracle Database Management
Redefining Oracle Database Management Actifio PAS Specification A Single Solution for Backup, Recovery, Disaster Recovery, Business Continuity and Rapid Application Development for Oracle. MAY, 2013 Contents
www.basho.com Technical Overview Simple, Scalable, Object Storage Software
www.basho.com Technical Overview Simple, Scalable, Object Storage Software Table of Contents Table of Contents... 1 Introduction & Overview... 1 Architecture... 2 How it Works... 2 APIs and Interfaces...
Cloud Gateway. Agenda. Cloud concepts Gateway concepts My work. Monica Stebbins
Approved for Public Release; Distribution Unlimited. Case Number 15 0196 Cloud Gateway Monica Stebbins Agenda 2 Cloud concepts Gateway concepts My work 3 Cloud concepts What is Cloud 4 Similar to hosted
StorReduce Technical White Paper Cloud-based Data Deduplication
StorReduce Technical White Paper Cloud-based Data Deduplication See also at storreduce.com/docs StorReduce Quick Start Guide StorReduce FAQ StorReduce Solution Brief, and StorReduce Blog at storreduce.com/blog
HIGH-SPEED BRIDGE TO CLOUD STORAGE
HIGH-SPEED BRIDGE TO CLOUD STORAGE Addressing throughput bottlenecks with Signiant s SkyDrop 2 The heart of the Internet is a pulsing movement of data circulating among billions of devices worldwide between
Protect Data... in the Cloud
QUASICOM Private Cloud Backups with ExaGrid Deduplication Disk Arrays Martin Lui Senior Solution Consultant Quasicom Systems Limited Protect Data...... in the Cloud 1 Mobile Computing Users work with their
Keys to Successfully Architecting your DSI9000 Virtual Tape Library. By Chris Johnson Dynamic Solutions International
Keys to Successfully Architecting your DSI9000 Virtual Tape Library By Chris Johnson Dynamic Solutions International July 2009 Section 1 Executive Summary Over the last twenty years the problem of data
<Insert Picture Here> Refreshing Your Data Protection Environment with Next-Generation Architectures
1 Refreshing Your Data Protection Environment with Next-Generation Architectures Dale Rhine, Principal Sales Consultant Kelly Boeckman, Product Marketing Analyst Program Agenda Storage
Phone Systems Buyer s Guide
Phone Systems Buyer s Guide Contents How Cri(cal is Communica(on to Your Business? 3 Fundamental Issues 4 Phone Systems Basic Features 6 Features for Users with Advanced Needs 10 Key Ques(ons for All Buyers
ZEN LOAD BALANCER EE v3.04 DATASHEET The Load Balancing made easy
ZEN LOAD BALANCER EE v3.04 DATASHEET The Load Balancing made easy OVERVIEW The global communication and the continuous growth of services provided through the Internet or local infrastructure require to
Backing up to the Cloud
Backing up to the Cloud Practical Considerations Chander Kant CEO Zmanda, Inc. 1 Zmanda Worldwide Leader in Open Source Backup 500,000+ Protected Systems Open Source. Open APIs. Open Formats. Smashes traditional
Archiving On-Premise and in the Cloud. March 2015
Archiving On-Premise and in the Cloud March 2015 Cloud Storage Storage accessed over a network via web services APIs. http://swift.example.com/v1/account/container/object Source: http://docs.openstack.org/admin-guide-cloud/content/objectstorage_characteristics.html
How To Image A Single Vm For Forensic Analysis On Vmwarehouse.Com
MCP+I, MCSE, CCSA, CCSE, CISSP-ISSAP, CISM, CISA, CIFI, CCE, ACE, GCFE, GCFA, GSEC, VCP4/5, vexpert Senior SANS Instructor - [email protected] 1 A Lot To Cover In ½ An Hour We simply can not cover all cloud
GPFS Cloud ILM. IBM Research - Zurich. Storage Research Technology Outlook
IBM Research - Zurich GPFS Cloud ILM Storage Research Technology Outlook Dr. Thomas Weigold ([email protected]) Manager Cloud Storage & Security IBM Research Zurich Why Cloud Storage? Economics! Lower
Data Centers and Cloud Computing. Data Centers
Data Centers and Cloud Computing Slides courtesy of Tim Wood 1 Data Centers Large server and storage farms 1000s of servers Many TBs or PBs of data Used by Enterprises for server applications Internet
Using Red Hat Network Satellite Server to Manage Dell PowerEdge Servers
Using Red Hat Network Satellite Server to Manage Dell PowerEdge Servers Enterprise Product Group (EPG) Dell White Paper By Todd Muirhead and Peter Lillian July 2004 Contents Executive Summary... 3 Introduction...
Microsoft Azure Cloud on your terms. Start your cloud journey.
Microsoft Azure Cloud on your terms. Start your cloud journey. Subscribe, Deploy, Migrate and Get Finance and Support for your Hybrid and/or Cloud Data Center. Never pay huge upfront Cost. How can Azure
Data Storage Options for Research
Research IT Office Data Storage Options for Research By Ashok Mudgapalli Director of Research IT Agenda Current Research Data Storage Current Data Backup Strategies Available Storage Solution: Enterprise
HOSTWAY. FlexCloudTM. Servers
PERFECT FOR INSTANTLY DEPLOYING SERVERS AND RESOURCES ON DEMAND Designed to be one of the most flexible, reliable, and easy-to-use cloud hosting solutions on the market, FlexCloud are a scalable, on-demand
Deploying ArcGIS for Server using Managed Services
2013 Esri International User Conference July 8 12, 2013 San Diego, California Technical Workshop Deploying ArcGIS for Server using Managed Services Erin Ross Andrew Sakowicz Esri UC2013. Technical cal
Research Data Storage, Sharing, and Transfer Options
Research Data Storage, Sharing, and Transfer Options Principal investigators should establish a research data management system for their projects including procedures for storing working data collected
Cloud 101. Mike Gangl, Caltech/JPL, [email protected] 2015 California Institute of Technology. Government sponsorship acknowledged
Cloud 101 Mike Gangl, Caltech/JPL, [email protected] 2015 California Institute of Technology. Government sponsorship acknowledged Outline What is cloud computing? Cloud service models Deployment
Hewlett Packard - NBU partnership : SAN (Storage Area Network) или какво стои зад облаците
Hewlett Packard - NBU partnership : SAN (Storage Area Network) или какво стои зад облаците Why SAN? Business demands have created the following challenges for storage solutions: Highly available and easily
Building Storage as a Service with OpenStack. Greg Elkinbard Senior Technical Director
Building Storage as a Service with OpenStack Greg Elkinbard Senior Technical Director MIRANTIS 2012 PAGE 1 About the Presenter Greg Elkinbard Senior Technical Director at Mirantis Builds on demand IaaS
Wireless Networks: Network Protocols/Mobile IP
Wireless Networks: Network Protocols/Mobile IP Mo$va$on Data transfer Encapsula$on Security IPv6 Problems DHCP Adapted from J. Schiller, Mobile Communications 1 Mo$va$on for Mobile IP Rou$ng based on IP
VoIP Security How to prevent eavesdropping on VoIP conversa8ons. Dmitry Dessiatnikov
VoIP Security How to prevent eavesdropping on VoIP conversa8ons Dmitry Dessiatnikov DISCLAIMER All informa8on in this presenta8on is provided for informa8on purposes only and in no event shall Security
Big Data. The Big Picture. Our flexible and efficient Big Data solu9ons open the door to new opportuni9es and new business areas
Big Data The Big Picture Our flexible and efficient Big Data solu9ons open the door to new opportuni9es and new business areas What is Big Data? Big Data gets its name because that s what it is data that
Navigating The World of Cloud Computing
Navigating The World of Cloud Computing Mike Klein President, Online Tech Cloud Computing Instead of having 20 servers Pool them together into a gigantic super-server Split up super-server into 100 virtual
Ultra-Scalable Storage Provides Low Cost Virtualization Solutions
Ultra-Scalable Storage Provides Low Cost Virtualization Solutions Flexible IP NAS/iSCSI System Addresses Current Storage Needs While Offering Future Expansion According to Whatis.com, storage virtualization
Service Description Cloud Storage Openstack Swift
Service Description Cloud Storage Openstack Swift Table of Contents Overview iomart Cloud Storage... 3 iomart Cloud Storage Features... 3 Technical Features... 3 Proxy... 3 Storage Servers... 4 Consistency
Arista 7060X and 7260X series: Q&A
Arista 7060X and 7260X series: Q&A Product Overview What are the 7060X & 7260X series? The Arista 7060X and 7260X series are purpose-built 40GbE and 100GbE data center switches in compact and energy efficient
Introduction to OpenStack Swift CloudOpen Japan 2014
Introduction to OpenStack Swift CloudOpen Japan 2014 Yuji Hagiwara [email protected] Platform Engineer, NTT DATA Corp. Copyright 2014 NTT DATA Corporation 2 Agenda 1.What is Swift? 2.Swift s Latest
ZEN LOAD BALANCER EE v3.02 DATASHEET The Load Balancing made easy
ZEN LOAD BALANCER EE v3.02 DATASHEET The Load Balancing made easy OVERVIEW The global communication and the continuous growth of services provided through the Internet or local infrastructure require to
RECOVERY SCALABLE STORAGE
RETENTION RETRIEVAL RECOVERY SCALABLE STORAGE IMATION SCALABLE STORAGE RETENTION RECOVERY RETRIEVAL We work with small and medium-sized businesses that are caught between a rock and hard spot: they are
Auspex. NAS/SAN Integration
Storage for Business NAS/SAN Integration Eighth NASA/Goddard Space Flight Center Conference on Mass Storage Systems and Technology March 30, 2000 1 Agenda Introduction The types and roles of storage Integrating
Web Drive Limited TERMS AND CONDITIONS FOR THE SUPPLY OF SERVER HOSTING
Web Drive Limited TERMS AND CONDITIONS FOR THE SUPPLY OF SERVER HOSTING Application of Terms Agreement to these terms requires agreement to Web Drive s Standard Terms & Conditions located online at the
Archiving, Indexing and Accessing Web Materials: Solutions for large amounts of data
Archiving, Indexing and Accessing Web Materials: Solutions for large amounts of data David Minor 1, Reagan Moore 2, Bing Zhu, Charles Cowart 4 1. (88)4-104 [email protected] San Diego Supercomputer Center
Protecting enterprise servers with StoreOnce and CommVault Simpana
Technical white paper Protecting enterprise servers with StoreOnce and CommVault Simpana HP StoreOnce Backup systems Table of contents Introduction 2 Technology overview 2 HP StoreOnce Backup systems key
12 Key File Sync and Share Advantages of Transporter Over Box for Enterprise
WHITE PAPER 12 Key File Sync and Share Advantages of Transporter Over Box for Enterprise Cloud storage companies invented a better way to manage information that allows files to be automatically synced
Cloud Storage and Backup
Cloud Storage and Backup Cloud Storage and Backup Cloud Storage and Backup services from iomartcloud have been designed to deliver the performance, capacity, security and flexibility needed to address
owncloud Architecture Overview
owncloud Architecture Overview Time to get control back Employees are using cloud-based services to share sensitive company data with vendors, customers, partners and each other. They are syncing data
IBM TSM DISASTER RECOVERY BEST PRACTICES WITH EMC DATA DOMAIN DEDUPLICATION STORAGE
White Paper IBM TSM DISASTER RECOVERY BEST PRACTICES WITH EMC DATA DOMAIN DEDUPLICATION STORAGE Abstract This white paper focuses on recovery of an IBM Tivoli Storage Manager (TSM) server and explores
How To Create A Large Enterprise Cloud Storage System From A Large Server (Cisco Mds 9000) Family 2 (Cio) 2 (Mds) 2) (Cisa) 2-Year-Old (Cica) 2.5
Cisco MDS 9000 Family Solution for Cloud Storage All enterprises are experiencing data growth. IDC reports that enterprise data stores will grow an average of 40 to 60 percent annually over the next 5
Understanding AWS Storage Options
Understanding AWS Storage Options Ian Massingham, Technical Evangelist @IanMmmm [email protected] 30 April 2014 2014 Amazon.com, Inc. and its affiliates. All rights reserved. May not be copied, modified,
Solving today's integra@on challenges with Oracle SOA Suite, and Oracle Coherence
Solving today's integra@on challenges with Oracle SOA Suite, and Oracle Coherence Asaf Lev Sales Consul@ng [email protected] Agenda Industry Trends Oracle SOA Suite Oracle Coherence Oracle Service Bus
Data Loss Prevention (DLP) & Recovery Methodologies
Data Loss Prevention (DLP) & Recovery Methodologies Topics to be Discussed Overview of Types of Storage Devices Prevention Methodologies on storage devices Creating a Backup Plan Testing your Recovery
Cisco Hybrid Cloud Solution: Deploy an E-Business Application with Cisco Intercloud Fabric for Business Reference Architecture
Reference Architecture Cisco Hybrid Cloud Solution: Deploy an E-Business Application with Cisco Intercloud Fabric for Business Reference Architecture 2015 Cisco and/or its affiliates. All rights reserved.
WHY DO I NEED FALCONSTOR OPTIMIZED BACKUP & DEDUPLICATION?
WHAT IS FALCONSTOR? FalconStor Optimized Backup and Deduplication is the industry s market-leading virtual tape and LAN-based deduplication solution, unmatched in performance and scalability. With virtual
(Scale Out NAS System)
For Unlimited Capacity & Performance Clustered NAS System (Scale Out NAS System) Copyright 2010 by Netclips, Ltd. All rights reserved -0- 1 2 3 4 5 NAS Storage Trend Scale-Out NAS Solution Scaleway Advantages
Restoration Technologies. Mike Fishman / EMC Corp.
Trends PRESENTATION in Data TITLE Protection GOES HERE and Restoration Technologies Mike Fishman / EMC Corp. SNIA Legal Notice The material contained in this tutorial is copyrighted by the SNIA unless
Increased Security, Greater Agility, Lower Costs for AWS DELPHIX FOR AMAZON WEB SERVICES WHITE PAPER
Increased Security, Greater Agility, Lower Costs for AWS DELPHIX FOR AMAZON WEB SERVICES TABLE OF CONTENTS Introduction... 3 Overview: Delphix Virtual Data Platform... 4 Delphix for AWS... 5 Decrease the
How To Speed Up A Flash Flash Storage System With The Hyperq Memory Router
HyperQ Hybrid Flash Storage Made Easy White Paper Parsec Labs, LLC. 7101 Northland Circle North, Suite 105 Brooklyn Park, MN 55428 USA 1-763-219-8811 www.parseclabs.com [email protected] [email protected]
Backup and Disaster Recovery Planning On a Budget. Presented by: Najam Saeed Lisa Ulrich
Backup and Disaster Recovery Planning On a Budget Presented by: Najam Saeed Lisa Ulrich Aging Backup System Symantec Backup Exec 11 Hardware Dell PowerEdge2950 Overland REO9000 7.4TB Overland REO4000 4TB
Computer Networks. Examples of network applica3ons. Applica3on Layer
Computer Networks Applica3on Layer 1 Examples of network applica3ons e- mail web instant messaging remote login P2P file sharing mul3- user network games streaming stored video clips social networks voice
An Introduction to Cloud Computing Concepts
Software Engineering Competence Center TUTORIAL An Introduction to Cloud Computing Concepts Practical Steps for Using Amazon EC2 IaaS Technology Ahmed Mohamed Gamaleldin Senior R&D Engineer-SECC [email protected]
Using HP StoreOnce Backup Systems for NDMP backups with Symantec NetBackup
Technical white paper Using HP StoreOnce Backup Systems for NDMP backups with Symantec NetBackup Table of contents Executive summary... 2 Introduction... 2 What is NDMP?... 2 Technology overview... 3 HP
WHITEPAPER: Understanding Pillar Axiom Data Protection Options
WHITEPAPER: Understanding Pillar Axiom Data Protection Options Introduction This document gives an overview of the Pillar Data System Axiom RAID protection schemas. It does not delve into corner cases
White Paper. What is IP SAN?
White Paper What is IP SAN? Introduction Internet Protocol, or IP, has grown to become the most widely used telecommunications standard worldwide. The technology is well understood, easy to implement and
Deploying a distributed data storage system on the UK National Grid Service using federated SRB
Deploying a distributed data storage system on the UK National Grid Service using federated SRB Manandhar A.S., Kleese K., Berrisford P., Brown G.D. CCLRC e-science Center Abstract As Grid enabled applications
Amazon Cloud Storage Options
Amazon Cloud Storage Options Table of Contents 1. Overview of AWS Storage Options 02 2. Why you should use the AWS Storage 02 3. How to get Data into the AWS.03 4. Types of AWS Storage Options.03 5. Object
STORAGE CENTER. The Industry s Only SAN with Automated Tiered Storage STORAGE CENTER
STORAGE CENTER DATASHEET STORAGE CENTER Go Beyond the Boundaries of Traditional Storage Systems Today s storage vendors promise to reduce the amount of time and money companies spend on storage but instead
High Availability Databases based on Oracle 10g RAC on Linux
High Availability Databases based on Oracle 10g RAC on Linux WLCG Tier2 Tutorials, CERN, June 2006 Luca Canali, CERN IT Outline Goals Architecture of an HA DB Service Deployment at the CERN Physics Database
Offensive & Defensive & Forensic Techniques for Determining Web User Iden<ty
Offensive & Defensive & Forensic Techniques for Determining Web User Iden
Building a Scalable Big Data Infrastructure for Dynamic Workflows
Building a Scalable Big Data Infrastructure for Dynamic Workflows INTRODUCTION Organizations of all types and sizes are looking to big data to help them make faster, more intelligent decisions. Many efforts
