High Performance Computing Modernization Program Mass Storage



Similar documents
Big Data in Test and Evaluation by Udaya Ranawake (HPCMP PETTT/Engility Corporation)

HPCMP New Users Guide Who Are We? Distribution A: Approved for Public release; distribution is unlimited.

<Insert Picture Here> Solution Direction for Long-Term Archive

The Maui High Performance Computing Center Department of Defense Supercomputing Resource Center (MHPCC DSRC) Hadoop Implementation on Riptide - -

UNCLASSIFIED R-1 ITEM NOMENCLATURE FY 2013 OCO

Backup and Recovery of SAP Systems on Windows / SQL Server

巨 量 資 料 分 層 儲 存 解 決 方 案

The Key Elements of Digital Asset Management

Reference Architectures for Repositories and Preservation Archiving

LSKA 2010 Survey Report Job Scheduler

irods at CC-IN2P3: managing petabytes of data

Data Masking Secure Sensitive Data Improve Application Quality. Becky Albin Chief IT Architect

USGS Guidelines for the Preservation of Digital Scientific Data

A Big Data Storage Architecture for the Second Wave David Sunny Sundstrom Principle Product Director, Storage Oracle

Benefits of upgrading to Enterprise Vault

Grid Scheduling Dictionary of Terms and Keywords

QStar White Paper. Tiered Storage

STORAGE. Buying Guide: TARGET DATA DEDUPLICATION BACKUP SYSTEMS. inside

Work Environment. David Tur HPC Expert. HPC Users Training September, 18th 2015

The Archival Upheaval Petabyte Pandemonium Developing Your Game Plan Fred Moore President

PACE Predictive Analytics Center of San Diego Supercomputer Center, UCSD. Natasha Balac, Ph.D.

Dr. Raju Namburu Computational Sciences Campaign U.S. Army Research Laboratory. The Nation s Premier Laboratory for Land Forces UNCLASSIFIED

Optimizing IT Deployment Issues

Hadoop Cluster Applications

THE EMC ISILON STORY. Big Data In The Enterprise. Copyright 2012 EMC Corporation. All rights reserved.

Exhibit to Data Center Services Service Component Provider Master Services Agreement

Condor Supercomputer

Data Deduplication in Tivoli Storage Manager. Andrzej Bugowski Spała

The Economics of File-based Storage

Lessons from the field: Implementing Information Governance and Records Management with Microsoft SharePoint

ETERNUS CS High End Unified Data Protection

Long term retention and archiving the challenges and the solution

Applications of LTFS for Cloud Storage Use Cases

Growth of Unstructured Data & Object Storage. Marcel Laforce Sr. Director, Object Storage

Make the Most of Big Data to Drive Innovation Through Reseach

Introduction to NetApp Infinite Volume

SOLUTION BRIEF KEY CONSIDERATIONS FOR BACKUP AND RECOVERY

Cluster Scalability of ANSYS FLUENT 12 for a Large Aerodynamics Case on the Darwin Supercomputer

Big Data Services at DKRZ

HPC technology and future architecture

3Gen Data Deduplication Technical

Accelerating Innovation with Self- Service HPC

<Insert Picture Here> Refreshing Your Data Protection Environment with Next-Generation Architectures

SGI High Performance Computing

HPC and Big Data. EPCC The University of Edinburgh. Adrian Jackson Technical Architect

European Data Infrastructure - EUDAT Data Services & Tools

AFS Usage and Backups using TiBS at Fermilab. Presented by Kevin Hill

Maui High Performance Computing Center Overview

Pentaho Enterprise and Community Editions Feature Comparison

UNCLASSIFIED. R-1 Program Element (Number/Name) PE A / High Performance Computing Modernization Program. Prior Years FY 2013 FY 2014 FY 2015

Hitachi NAS Platform and Hitachi Content Platform with ESRI Image

Running Agilent GeneSpring MPP on the Cloud

MEEC Webinar Daly and EMC BRS - EMC Backup & Archive Solutions

EMC DATA DOMAIN EXTENDED RETENTION SOFTWARE: MEETING NEEDS FOR LONG-TERM RETENTION OF BACKUP DATA ON EMC DATA DOMAIN SYSTEMS

Government Technology Trends to Watch in 2014: Big Data

A SURVEY OF POPULAR CLUSTERING TECHNOLOGIES

Document Management and Records Management in SharePoint Scott Jamison

SCALABLE FILE SHARING AND DATA MANAGEMENT FOR INTERNET OF THINGS

Architectural patterns for building real time applications with Apache HBase. Andrew Purtell Committer and PMC, Apache HBase

WHITE PAPER. DATA DEDUPLICATION BACKGROUND: A Technical White Paper

Energy Efficient Storage - Multi- Tier Strategies For Retaining Data

Milestone Solution Partner IT Infrastructure Components Certification Summary

Archiving, Indexing and Accessing Web Materials: Solutions for large amounts of data

Data Deduplication Background: A Technical White Paper

DoD High Performance Computing Modernization Program: a Distributed Computing Program January Larry Davis

Data storage services at CC-IN2P3

Big Data and Cloud Computing for GHRSST

Managing Complexity in Distributed Data Life Cycles Enhancing Scientific Discovery

Versity All rights reserved.

SURFsara HPC Cloud Workshop

Introduction to Optical Archiving Library Solution for Long-term Data Retention

Archiving and Managing Remote Sensing Data Using State of the Art Storage Technologies

Research Technologies Data Storage for HPC

Server & Cloud Management

Transcription:

High Performance Computing Modernization Program Mass Storage September 21, 2012 Designing Storage Architectures for Digital Collections Library of Congress Gene McGill, mcgill@arsc.edu Arctic Region Supercomputing Center/HTL

What is the HPCMP? Five large DoD Supercomputing Resource Centers (DSRCs) and various smaller facilities Air Force Research Lab (AFRL DSRC), Dayton, OH Army Research Lab (ARL DSRC), Aberdeen Proving Ground, MD US Army Engineer Research and Development Center (ERDC DSRC), Vicksburg, MS Navy DoD Supercomputing Resource Center (Navy DSRC), Stennis Space Center, MS Maui High Performance Computing Center (MHPCC DSRC), Kihei, Maui, HI ARSC HEUE Test Lab (ARSC HTL), Fairbanks, AK Other smaller, more focused centers

What is the HPCMP? (cont) ~1,500 users Diverse scientific & engineering applications, systems are general-purpose, no single usage profile Ten large, unclassified, HPC clusters Total TFLOPs: 1421; 6.1 PBs local, high speed disk; Mass Storage servers SAM-QFS, SL8500, SL3000, T10000[A B C] + Remote DR. 25 PB 1 st copy, nearly all files get DR copy Less than 2% of the users own 50% of the PBs Growth is accelerating

Legacy Architecture Batch jobs run on HPC clusters Jobs use fast, expensive, local disk Insufficient space on HPCs to store restart files between jobs Users store files on mass storage servers that shouldn t live long-term Restart file, intermediate results, and users rarely go back and remove what they don t need Users tell us: Tools to manage the millions of files on mass storage servers are lacking HPC clusters not good for pre- & post-processing, this discourages postprocessing and saving only high-value PP d files Users don t think all files need a DR copy Program management wants to live within current tape slots

The HPCMP Enhanced User Environment We don t want to be an Archive like this group thinks of them! We are implementing tools that could be used for an Archive for the purpose of reducing rate of growth of data. The solution: Add Utility Server (US) Interactive use for pre-, post-processing Install PB-scale center-wide file system (CWFS) 30 day residency commitment vs. ~10 days on HPC disk Add Storage Lifecycle Management tool (SLM) to manage the archive Enhance metadata for users Implement storage management policy

The HPCMP Enhanced User Environment Nirvana Storage Resource Broker (SRB): Commercial offering, w/ vendor support Application runs on top of a database Automated Storage Lifecycle Management using metadata-driven policy SRB presents a global namespace of potentially many storage resources, some of which may be geographically distributed. HPCMP is layering SRB on top of existing SAM-QFS file systems

The HPCMP Enhanced User Environment SRB (cont): Policies we re implementing in SRB ARCHIVE: Local tape copy is automatic DR: Remote copy by user request per object, default is no DR copy EXPIRE: Automatically removes objects when they reach the end of their retention period Enhanced metadata Retention period Default is 30 days, Users can specify decades but it cannot be infinite Users must confirm retention every 3 years

The HPCMP Enhanced User Environment SRB (cont): Enhanced User metadata: Dublin core Name Value scheme Multi-level security features added in Up to 16 site-defined flags per object Opportunity for doing science in the metadata Automated processing during ingest can mine data to populate metadata needed for scientific visualization later Avoids accessing files in favor of using metadata in SRB/DB Metadata is query-able.

The HPCMP Enhanced User Environment Currently in Pilot phases at two DSRCs

Acknowlegements Many people working on this: Tom Kendall, Executive Agent, Army Research Lab DSRC Stephen Scherr, HPCMP Robert Hunter, HPCMP Constantin Scheder, Nirvana SRB Chief Architect Reid Bingham, Nirvana And many others.