Archive Storage Infrastructure At the Library of Congress



Similar documents
Planning for Digital Preservation and Acquisitions at the Library of Congress National Audio-Visual Conservation Center

Archive Data Retention & Compliance. Solutions Integrated Storage Appliances. Management Optimized Storage & Migration

Let s Talk Digital An Approach to Managing, Storing, and Preserving Time-Based Media Art Works. DAMS Branch Manager Smithsonian Institution, OCIO

Tier 2 Nearline. As archives grow, Echo grows. Dynamically, cost-effectively and massively. What is nearline? Transfer to Tape

Storage Switzerland White Paper Storage Infrastructures for Big Data Workflows

Isabel Meyer : DAMS Branch Manager. Ken Rahaim: DAMS Analyst

Solution Brief: Creating Avid Project Archives

Next Gen Data Center. KwaiSeng Consulting Systems Engineer

JPEG2000 in Moving Image Archiving

Enhancing the Dell iscsi SAN with Dell PowerVault TM Tape Libraries and Chelsio Unified Storage Router iscsi Appliance

Maurice Askinazi Ofer Rind Tony Wong. Cornell Nov. 2, 2010 Storage at BNL

White paper 200 Camera Surveillance Video Vault Solution Powered by Fujitsu

XenData Video Edition. Product Brief:

With DDN Big Data Storage

Comparison of Native Fibre Channel Tape and SAS Tape Connected to a Fibre Channel to SAS Bridge. White Paper

Business-centric Storage FUJITSU Hyperscale Storage System ETERNUS CD10000

Quantum StorNext. Product Brief: Distributed LAN Client

Energy Efficient Storage - Multi- Tier Strategies For Retaining Data

Reference Architectures for Repositories and Preservation Archiving

XenData Product Brief: SX-550 Series Servers for LTO Archives

Using HP StoreOnce Backup Systems for NDMP backups with Symantec NetBackup

Archiving and Managing Remote Sensing Data Using State of the Art Storage Technologies

Server Bandwidth Scenarios Signposts for 40G/100G Server Connections

Flow 3 Media Asset Management incorporating AirFlow and Automation

University of Arizona Increases Flexibility While Reducing Total Cost of Ownership with Cisco Nexus 5000 Series Switches

Solution Brief: Archiving Avid Interplay Projects using NLT and XenData

STORNEXT PRO SOLUTIONS. StorNext Pro Solutions

19 LCD / 8 CHANNEL DVR COMBO WITH 160GB HDD & 4 CAMERAS

HP 3PAR StoreServ 8000 Storage - what s new

Hadoop: Embracing future hardware

POWER ALL GLOBAL FILE SYSTEM (PGFS)

Storage Design for High Capacity and Long Term Storage. DLF Spring Forum, Raleigh, NC May 6, Balancing Cost, Complexity, and Fault Tolerance

Virtual Desktop Infrastructure (VDI) Overview

Object storage in Cloud Computing and Embedded Processing

Archival Storage At LANL Past, Present and Future

ANY SURVEILLANCE, ANYWHERE, ANYTIME

Data Sheet FUJITSU Storage ETERNUS LT260 Tape System

Desktop Consolidation. Stéphane Verdy, CTO Devon IT

XenData Product Brief: SX-250 Archive Server for LTO

The safer, easier way to help you pass any IT exams. Exam : Storage Sales V2. Title : Version : Demo 1 / 5

<Insert Picture Here> Refreshing Your Data Protection Environment with Next-Generation Architectures

Cost Effective Backup with Deduplication. Copyright 2009 EMC Corporation. All rights reserved.

EonStor DS 3000 Series

Scala Storage Scale-Out Clustered Storage White Paper

Data Sheet FUJITSU Storage ETERNUS LT20 S2 Tape System

Implementing a Digital Video Archive Based on XenData Software

Protect Microsoft Exchange databases, achieve long-term data retention

IBM Scale Out Network Attached Storage

irods in complying with Public Research Policy

Cisco Data Center 3.0: Aligning IT to the 21 st Century Business

Post Production Video Editing Solution Guide with Apple Xsan File System AssuredSAN 4000

Cisco Data Center Optimization Services

STORNEXT PRO SOLUTIONS. StorNext Pro Solutions

The Key Elements of Digital Asset Management

Primary Data Center. Remote Data Center Plans (COOP), Business Continuity (BC), Disaster Recovery (DR), and data

Digital Asset Management 数 字 媒 体 资 源 管 理 任 课 老 师 : 张 宏 鑫

XenData Product Brief: SX-250 Archive Server for LTO

Oracle Data Protection Concepts

ADDENDUM 1 September 22, 2015 Request for Proposals: Data Center Implementation

Long-term data storage in the media and entertainment industry: StrongBox LTFS NAS archive delivers 84% reduction in TCO

SAN Conceptual and Design Basics

Reference Architecture - Microsoft Exchange 2013 on Dell PowerEdge R730xd

3Gen Data Deduplication Technical

Datasheet Fujitsu ETERNUS LT20 S2

Server and Storage Virtualization with IP Storage. David Dale, NetApp

Broadcast Automation Without Boundaries

BlueArc unified network storage systems 7th TF-Storage Meeting. Scale Bigger, Store Smarter, Accelerate Everything

10th TF-Storage Meeting

Determine dates with you telecom suppliers so that the new office is online before your move for both Phones and Data connections.

Multiple Digital Content Types in a Single Collection. Dina Sokolova and Jane Gorjevsky, Columbia University

Get Success in Passing Your Certification Exam at first attempt!

Emulex OneConnect 10GbE NICs The Right Solution for NAS Deployments

Media Cloud Building Practicalities

Scalable. Reliable. Flexible. High Performance Architecture. Fault Tolerant System Design. Expansion Options for Unique Business Needs

SOLUTION BRIEF KEY CONSIDERATIONS FOR LONG-TERM, BULK STORAGE

HP StorageWorks D2D Backup Systems and StoreOnce

VPMS - Advanced Media Management

Scalable. Reliable. Flexible. High Performance Architecture. Fault Tolerant System Design. Expansion Options for Unique Business Needs

Implementing an Automated Digital Video Archive Based on the Video Edition of XenData Software

IFI Irish Film Archive Digital Preservation & Access Strategy

SUN ORACLE DATABASE MACHINE

Solution Brief: XenData Digital Video Archives in a Dalet Environment

Top of Rack: An Analysis of a Cabling Architecture in the Data Center

IsumaTV. Media Player Setup Manual COOP Cable System. Media Player

EMC Invista: The Easy to Use Storage Manager

Unitrends Recovery-Series: Addressing Enterprise-Class Data Protection

Keys to Successfully Architecting your DSI9000 Virtual Tape Library. By Chris Johnson Dynamic Solutions International

Transcription:

Infrastructure At the Library of Congress Scott Rife Digital Conference September 30, 2011 srif at loc dot gov

The Packard Campus Mission The National Audiovisual Conservation Center develops, preserves and provides broad access to a comprehensive and valued collection of the world s audiovisual heritage for the benefit of Congress and the nation s citizens. Goals Collect, Preserve, Provide Access to Knowledge The National Audiovisual Conservation Center (NAVCC) of the Library of Congress will be the first centralized facility in America especially planned and designed for the acquisition, cataloging, storage and preservation of the nation s collection of moving images and recorded sounds. This collaborative initiative is the result of a unique partnership between the Packard Humanities Institute, the United States Congress, the Library of Congress and the Architect of the Capitol. The NAVCC will consolidate collections now stored in four states and the District of Columbia. Once complete, the facility will boast more than 1 million film and video items and 3 million sound recordings, providing endless opportunities to peruse the sights and sounds of American creativity. 2

The Packard Campus Many Formats 3

The Packard Campus Past, Present and Future 38 digitization stations (PODs): 31 Solo, 6 Pyramix, 1 Quadriga Daily each POD generates: 2GB-150GB for audio and 50GB-1,200GB for video Additional PODs coming in the future include 2K and 4K scan for film, digital submission for Copyright and other (Live capture-264 DVRs, PBS, NBC Universal, Vanderbilt TV News, SCOLA, etc) Growth since production February 2009: 10 TB / month February 2010: 45 TB / month February 2011: 91 TB / month Peak in May 2011: 134 TB / month Current: 1.7 PB and 180,000 files in 2 locations The Challenge Projected: 300 TB / day or 7.5 PB / month at least 10 years off Counting on doubling of tape density and computing power to keep us in our current 3000 Sq feet of computer room. 4

The Packard Campus Graph Packard Campus Capacity and Growth (Estimated) Petabytes Qtr Q1 Q2 Q3 Q4 Q1 Q2 Q3 Q4 Q1 Q2 Q3 Q4 Q1 Q2 Q3 Q4 Q1 Q2 Q3 Q4 Q1 Q2 Q3 Q4 Q1 Q2 Q3 Q4 Year 2009 2009 2009 2009 2010 2010 2010 2010 2011 2011 2011 2011 2012 2012 2012 2012 2013 2013 2013 2013 2014 2014 2014 2014 2015 2015 2015 2015 Projected Quarterly (PB) Projected Total (PB) Actual Capacity(PB) Projected Maximum Capacity (PB) 5

The Packard Campus Physical Space Sun Servers & UPS-NET1 UPS-LIBB2 UPS_LIBA3 UPS-LIBA1 UPS-LIBB1 UPS-LIBB3 UPS-LIBA2 UPS-NET2 Tape Library: Lt green is future 38,344 CRAC 27 Ton Future COOP Switch Future COOP Racks 84 inches Hot Aisle Hot Aisle I J K H G F E D C B A 1448 3456 ==== 4904 6632 3456 6632 3456 1448 5184 ==== 6632 1448 1728 ==== 3176 3456 R1 R2 R3R4R5R6R7R6 R8 M9000 R1,2,3 DDN R4,5,6 HPSS R7,8 Future Servers/ Future Servers/ CRAC 27 Ton D2 Data Telecom UPS-BENCH D1 D3 D4 Voice Telecom Desk Potential Future Telecom Movaz/Firewall/UPS 6

Functional Architecture Data Movement Infrastructure Server Data Mover Web Server Proxy Server XML Server Database Server (fast) PODs PODs PODs PODs PODs PODs generate data Workflow software copies data via signiant/samba to the Data Mover / Data Mover verifies files with SHA-1 Server reads from storage and writes to tape Server reads from storage and writes to remote Server Every 1GB of incoming data requires 4GB of total throughput: 1 write/3 reads (SHA1, local, remote) 7

Functional Architecture User Interface Infrastructure Server Data Mover Web Server Proxy Server XML Server Database Server PODs PODs PODs PODs PODs Web Server hosts JAVA/JBoss workflow application Proxy Server (formerly Derivative) streams the content to Reading Rooms and desktops XML Server is an application specific intermediary to proprietary MAVIS database. 8

Functional Architecture - Scaling Infrastructure Distributed Server Server Distributed Data Mover Server Distributed Distributed Distributed Web Server Server Proxy Server Server XML Server Server Database Distributed Server Server 1 1 2 PODs PODs PODs PODs PODs Some replication must happen as a set: Server/Data Mover/ storage Proxy Server/ storage The Web Servers would need to connect to all and storage with load balancing switches in front of them Workflow software would need to understand the data split and distribute requests 9

Functional Architecture Current Infrastructure Server Data Mover Web Server Proxy Server XML Server Database Server PODs PODs PODs PODs PODs Server Web Server fulfills functions of Data Mover and Proxy Server. Workflow software runs only on this server. 10

Physical Implementation V1: 2 GB/s throughput PODs PODs PODs PODs PODs PCs 1Gbe 6509 2x10 Gbe 2X 7606 10 Gbe Data Mover X4600 16XFC4 X4600 16XFC4 9506 DWDM 9513 16XFC4 4600 6XFC4 b Tek 6540 12XFC4 b Tek FLX 380 HSM: SAM 4.65 LUNS 16X1.5TB large 16X.5TB small 4X50GB Metadata 11

Physical Implementation V2: 6.5 GB/s throughput PODs PODs PODs PODs PODs PCs 1Gbe 6509 2x10 Gbe 2X 7606 Data Mover M9000-3IOU 10 Gbe 16XFC8 4470 16XFC4 9506 DWDM 9513 16XFC8 4600 4XFC4 b Tek 6540 10XFC4 b DDN SFA 10000 HSM: SAM 5.2 LUNS 12X4TB large 5X4TB small 2X300GB Metadata 12

Physical Implementation V2+: 6.5 GB/s throughput Infrastructure Upgrades PODs PODs PODs PODs PODs PCs 1Gbe 6509 2x10 Gbe 2X 7010 Data Mover M9000-3IOU 10 Gbe 16XFC8 X4470 16XFC4 9506 DWDM 9513 16XFC8 4600 6XFC4 c Tek 6540 12XFC4 c DDN SFA 10000 HSM:SAM 5.3 LUNS 12X4TB large 5X4TB small 2X300GB Metadata 13

Physical Implementation V3: 6.5 GB/s throughput What s Next? PODs PODs PODs PODs PODs PCs 1Gbe 6509 2x10 Gbe 2X 7010 Data Mover M9000-3IOU 10 Gbe 16XFC8 X4470 16XFC4 9506 DWDM 9513 FCOE with Nexus 7010? 16XFC8 4600 6XFC4 c Tek 6540 12XFC4 c DDN SFA 10000 HSM:SAM 5.3 LUNS 12X4TB large 5X4TB small 2X300GB Metadata 14

Discussion Market Survey of HSM solutions Fibre Channel over Ethernet Monolithic versus distributed Sandy Bridge / Ivy Bridge We can probably ride the monolithic bandwidth curve Cost: compare software development and maintenance to hardware costs Effective 2-5 year planning of ingest magnitude Contact: Scott Rife: srif at loc dot gov 15