UW-IT Backups & Archives



Similar documents
The Microsoft Large Mailbox Vision

Archiving On-Premise and in the Cloud. March 2015

High Availability Databases based on Oracle 10g RAC on Linux

Intro to AWS: Storage Services

Cloud Storage and Backup

Call: Disaster Recovery/Business Continuity (DR/BC) Services From VirtuousIT

Cloud OS Vision. Modern platform for the world s apps

Disaster Recovery 101. Sudarshan Ranganath & Matthew Phillips Ellucian

Brian LaGoe, Systems Administrator Benjamin Jellema, Systems Administrator Eastern Michigan University

Backup and Recovery 1

Advancements in Storage QoS Management in National Data Storage

Making a Smooth Transition to a Hybrid Cloud with Microsoft Cloud OS

Media for Long-Term Archiving. June 2014

Backing up to the Cloud

Increasing Storage Performance

Big data Devices Apps

PADS GPFS Filesystem: Crash Root Cause Analysis. Computation Institute

Appendix A Core Concepts in SQL Server High Availability and Replication

Oracle Database 10g: Backup and Recovery 1-2

NETGEAR SMB Storage Line Update and ReadyNAS 2100 Introduction

Amazon Cloud Storage Options

Realizing the Benefits of Hybrid Cloud. Anand MS Cloud Solutions Architect Microsoft Asia Pacific

ETERNUS CS High End Unified Data Protection

VMware vsphere Data Protection

HYBRID ARCHITECTURE IN THE CLOUD

IBM System x SAP HANA

Disaster Recovery Strategies: Business Continuity through Remote Backup Replication

Take An Internal Look at Hadoop. Hairong Kuang Grid Team, Yahoo! Inc

MySQL és Hadoop mint Big Data platform (SQL + NoSQL = MySQL Cluster?!)

Storage and Disaster Recovery

Every organization has critical data that it can t live without. When a disaster strikes, how long can your business survive without access to its

Symantec NetBackup Appliances

Storage Design for High Capacity and Long Term Storage. DLF Spring Forum, Raleigh, NC May 6, Balancing Cost, Complexity, and Fault Tolerance

Veeam ONE What s New in v9?

Virtual Server and Storage Provisioning Service. Service Description

CASE STUDY: Oracle TimesTen In-Memory Database and Shared Disk HA Implementation at Instance level. -ORACLE TIMESTEN 11gR1

Backup to the Future. Hugo Patterson, Ph.D. Backup Recovery Systems, EMC

NETAPP SYNCSORT INTEGRATED BACKUP. Technical Overview. Peter Eicher Syncsort Product Management

NetApp Data Fabric: Secured Backup to Public Cloud. Sonny Afen Senior Technical Consultant NetApp Indonesia

Barracuda Backup Server. Introduction

Protecting your SQL database with Hybrid Cloud Backup and Recovery. Session Code CL02

Learn. Connect. Explore.

EMC arhiviranje. Lilijana Pelko Primož Golob. Sarajevo, Copyright 2008 EMC Corporation. All rights reserved.

Backup of NAS devices with Avamar

<Insert Picture Here> Refreshing Your Data Protection Environment with Next-Generation Architectures

Cybernetics isan D Series Affordable, Self-Protecting iscsi SAN Storage

Talk With Someone Live Now: (760) One Stop Data & Networking Solutions PREVENT DATA LOSS WITH REMOTE ONLINE BACKUP SERVICE

The safer, easier way to help you pass any IT exams. Exam : E Backup Recovery - Avamar Expert Exam for Implementation Engineers.

EMC DISK LIBRARY FOR MAINFRAME

Software-defined Storage Architecture for Analytics Computing

Enhancements of ETERNUS DX / SF

REDCENTRIC MANAGED BACKUP SERVICE SERVICE DEFINITION

IBM Spectrum Protect in the Cloud

Implementing Multi-Tenanted Storage for Service Providers with Cloudian HyperStore. The Challenge SOLUTION GUIDE

Direct NFS - Design considerations for next-gen NAS appliances optimized for database workloads Akshay Shah Gurmeet Goindi Oracle

White Paper. Mimosa NearPoint for Microsoft Exchange Server. Next Generation Archiving for Exchange Server By Bob Spurzem and Martin Tuip

Cloud Hosting for PostgreSQL

Deploying Flash- Accelerated Hadoop with InfiniFlash from SanDisk

Hardware Configuration Guide

Windchill ProjectLink Curriculum Guide

Deploying Exchange Server 2007 SP1 on Windows Server 2008

Introducing the New Hitachi Storage Virtualization Operating System and Hitachi Virtual Storage Platform G1000

巨 量 資 料 分 層 儲 存 解 決 方 案

Nutanix Tech Note. Data Protection and Disaster Recovery

Data Protection Report 2008 Best Practices in Data Backup & Recovery

Enterprise Vault 10 Feature Briefing

Accelerating Real Time Big Data Applications. PRESENTATION TITLE GOES HERE Bob Hansen

Tier0 plans and security and backup policy proposals

Ultra-Scalable Storage Provides Low Cost Virtualization Solutions

IS IN-MEMORY COMPUTING MAKING THE MOVE TO PRIME TIME?

SHAREPOINT 2010 REMOTE BLOB STORES WITH EMC ISILON NAS AND METALOGIX STORAGEPOINT

StorReduce Technical White Paper Cloud-based Data Deduplication

Avamar Backup and Data De-duplication Exam

REMOTE BACKUP-WHY SO VITAL?

ManageEngine EventLog Analyzer. Best Practices Document

White paper 200 Camera Surveillance Video Vault Solution Powered by Fujitsu

Restoration Technologies. Mike Fishman / EMC Corp.

EonStor DS remote replication feature guide

System Administration of Windchill 10.2

ManageEngine EventLog Analyzer. Best Practices Document

ArcGIS for Server in the Amazon Cloud. Michele Lundeen Esri

Vodacom Managed Hosted Backups

Windchill Service Information Manager Curriculum Guide

Transcription:

UW-IT Backups & Archives Powerful, Flexible, Affordable UW-IT TechTalk February 19, 2015

Agenda Definitions Yesterday Today Tomorrow Your thoughts

Backups Defined Data is hot Primary data copy is on first-tier storage Changes automatically preserved daily Bias toward small files Relatively expensive (many small files problem)

Archives Defined Data is cold All/only copies reside in the Archives Additions/Modifications/Removals all manual Bias toward very large files Relatively inexpensive (manage few large objects)

Backups + Archives Most (~90%?) of all data is cold Move cold data to Archives, reduce first tier storage demand by ~90% ~90% reduction in first tier storage = ~90% reduction in backups Potential for dramatic cost savings Potential for dramatic improvements in data management practices.

Archives Today One node from the 4-node lolo cluster Between 2TB and 20TB each night Typically < 2,000 files each night 770TB subscribed, 370TB stored ~350 tapes in use across two sites 37 customers, Hyak by far the largest

Archive Service Design Research Internet 1. Customers upload blobs to lolo via ssh 2. Uploaded files are cached on 20TB disk 3. Files are backed up to Tierpoint tape 4. Files are migrated to Seattle tape 1 3 Campus lolo0 FCoIP FCoIP Tierpoint Tape 2 4 Disk Cache Seattle Tape

Backups Yesterday >?? nodes (lots, but difficult to determine #) > 5 hosts/servers (2 OSes/HW architectures, nonstandard) > 5 tape technologies >?TB, >?M files each night (not great reporting) > 200TB, > 200M files stored at each site > 5,000 tapes in use across two sites

Backups Today > 1,000 nodes (improved reporting) > 2 hosts/servers (2 OSes/HW architectures, nonstandard) 1 tape technology (5 x reduction) > 10TB, >2M files each night (improved reporting) > 500TB, > 500M files stored at each site (2 x increase) > 500 tapes in use across two sites (10 x reduction)

Backups Today Design Disk Cache 2 1. Nodes backup files to servers via TSM 2. Backed-up files are cached on disk 3. Files are migrated to Seattle tape 4. Files are copied to Tierpoint tape Nebula Huntay 1 4 Seattle Tape 3 FCoIP FCoIP Tierpoint Tape Campus Shed 1 2 Disk Cache

Backups Tomorrow: Goals Bring the service into the enterprise design fold 1 x OS, 1 x HW Architecture, All standard Standard toolset, layered design Reduce systems management effort Improve resilience and performance Improve scalability

Backups Tomorrow: Features High Availability Improved GR/DR Prepared for Archive integration Prepared for enhanced features

Backups Tomorrow Design 1 1. Nodes backup files to servers via TSM 2. Backed-up files are cached on disk 3. Files are replicated to Tierpoint servers 4. Files are migrated to tape Nodes 3 2 Disk Cache 4 Seattle Tape Disk Cache 4 Tierpoint Tape Nodes 1 3

Backups Tomorrow HA 1. Nodes backup files to servers via TSM 2. Backed-up files are cached on disk 3. Files are replicated to Tierpoint servers 4. Files are migrated to tape 2 4 Disk Cache Seattle Tape Disk Cache 4 Tierpoint Tape Nodes 1 3 Nodes

Backups Tomorrow GR/DR Nodes 0. Backup operations are suspended 1. Client nodes fail over to DR servers 2. Files are restored from GR replica tape 2 Disk Cache Seattle Tape Disk Cache 2 Tierpoint Tape Nodes 2

Your Thoughts? Better support for VMWARE, databases, etc.? What sort of data would you like to see? Details about your node s backups? Details about the overall service? Wish List?

Service Catalog lolo Archives http://depts.washington.edu/uwtscat/archivestorage Backup Service http://depts.washington.edu/uwtscat/databackuparchive

Supplemental Slides File size and bytes distributions Transfer rates for lolo Archives Prices

File Size Distribution Nebula in 2008 20 18 16 14 % Files % Bytes 35 30 % Files % Bytes Percentage 12 10 8 25 6 Percentage 20 15 4 2 0 0 5 10 15 20 25 30 35 10 log(2) filesize 5 0 0 5 10 15 20 25 30 35 log(2) filesize An Astrophysics Group in 2006

lolo Archive Recall Rates MB Tape Load/Seek (Sec) Tape Read (MBs) Recall Time (sec) Recall Speed (MBs) 1 90 125 90.01 0.01 10 90 125 90.08 0.11 100 90 125 90.80 1.10 1,000 90 125 98.00 10.20 10,000 90 125 170.00 58.82 100,000 90 125 890.00 112.36 1,000,000 90 125 8,090.00 123.61

Some Prices UW- IT Backup /GB/Month $/TB/Year @50% Use 6.0 720 720 lolo Archive 0.9 103 206 AWS Glacier 1.0 120 120