NOAA National Data Center. + CLASS Overview




An Overview of the NOAA National Data Center and CLASS Landscape + CLASS Overview
Kenneth S. Casey, NODC, on behalf of the CLASS Operations Working Group (COWG)
Kern Witcher, CLASS Program Manager
Presentation to the DAARWG, 27 June 2012

First, recall

The NOAA National Data Centers
- National Oceanographic Data Center: Understanding our Oceans and Coasts
- National Geophysical Data Center: Understanding our World
- National Climatic Data Center: Understanding our Climate

The NOAA National Data Centers
Based on NODC's Levels of Stewardship
Across the three Data Centers the words vary a little, but all focus on stewarding environmental data now and for the future.

Comprehensive Large Array-data Stewardship System (CLASS)
- Designed originally for large-volume satellite data sets
- IT infrastructure supporting the lowest level of stewardship
- NESDIS has mandated its use by all three Data Centers
Even the lowest levels of stewardship require non-IT domain knowledge and expertise!

and the landscape in which we are operating

Consolipalooza, Consolimaggedon, Consolitopia
Consolidation has even hit popular culture!

Many Forces At Play

Consolidation Free Body Diagram
- Federal Data Center Consolidation Initiative (FDCCI): initiated by the Administration/OMB in 2010. Reduce hardware, software, real estate, and cooling costs.
- CLASS Integration into Data Center Operations: initiated by NESDIS in 2011. Support common archival storage needs of the Data Centers.
- NESDIS Data Center Consolidation: initiated by NESDIS in 2012. Explore options to reduce admin and IT costs.
- Other Initiatives: initiated by individual Data Centers. Respond to various pressures ($).

Consolidation Landscape
[Diagram labels: NESDIS Data Center Consolidation; NODC, NCDC, and NGDC, each with FDCCI, CLASS Integration, and Other initiatives; CLASS]

So, what are we doing about it?

Activities Underway
- FDCCI: server virtualization; reducing IT footprints; data calls/surveys
- NESDIS Data Center Consolidation: Deputies meeting to form options; will report to NESDIS Management
- CLASS Integration

Aerospace Corp. Review
Notional look at the way forward: Containing Costs, Enhancing Mission
[Diagram layers: Science Mission Data Providers and Users; Data Stewardship/Management; Software Development and IT Data Services; Information Technology (IT Infrastructure, O&M)]

Evolution of the NOAA Archive Architecture
DRAFT: Details under discussion
[Timeline diagram, FY2012 to FY2016, in Phases I to III. Labels: NODC, NCDC, NGDC; MOB; Cloud; NPP, GCOM-W, Jason; Pilot; iRODS; Data Center Migration; Concurrent CLASS Initiatives; JPSS, GOES-R; Metadata; Access Path; Data Net; HPSS; Archive Path; On Hold Programs; Staging; Access; Dissemination; Stewardship; M2M; Archive Storage Service]

Data Centers Data Migration Plan
Approaches and milestones for integrating CLASS into NOAA Data Center operations by FY15
Three Phases:
- Archival Storage: use CLASS for safe, long-term storage
- Access Services: expand CLASS to include access capabilities expected by Consumers and functions needed for Data Center stewardship
- Operations: compare levels of service and decommission local Data Center services when appropriate

A metaphor, if you will
Working to integrate CLASS into our archive operations is a bit like our day job. You get up, pull on your boots, and make the best of it you can. But you are not really very happy in your day job, so you start exploring alternatives. Maybe you take some online classes at night, learn some new skills, invest in a startup. You do something to change the game, live the dream, or expand your horizons. The CLASS Cloud Access Pilot is just that sort of thing.
http://www.dreamstime.com/royalty freestock photo muddy boots image14440895

CLASS Cloud Access Pilot
- Why: To test the cost effectiveness, scalability, performance, and agility of a Cloud solution for access to archived data
- Who: Three NOAA Data Centers and CLASS, reported on by the CLASS Operations Working Group
- What: At least one data collection from each Data Center, including most/all of NODC's data

CLASS Cloud Access Pilot
- When: This FY, with some overlap into next
- Where: A commercial provider of Cloud IaaS: Amazon Web Services (S3/EC2). Government Cloud also being examined.
- How: Three parallel activities
  - Populate Cloud storage with DC-held data
  - Load Data Center access services to the Cloud
  - Populate Cloud storage with CLASS-held data
- Test the Cloud-based data access services with a select group of real-world users

CLASS Cloud Access Pilot
Common Storage Services using Cloud Technologies

CLASS Cloud Access Pilot
[Diagram comparing a traditional Data Center with cloud service models. Labels: Traditional Data Center, NOAA, Google, UMS, Managed by User, Managed by Vendor]
Source: http://venturebeat.com/2011/11/14/cloud iaas paas saas/

CLASS Cloud Access Pilot
[Diagram: Happy Users find and access data through Discovery services and Access services (FTP, TDS, LAS, DAP). In the Cloud IaaS layer, data are virtually organized in logical directories (e.g., symbolic links) and physically organized in Accessions. Arrows lead into the cloud from CLASS and from the Data Centers (NODC, NGDC, NCDC); DC local holdings are sent to CLASS under the Data Centers Data Migration Plan.]
1. The three arrows pointing into the cloud from CLASS and the DCs can develop at different rates.
2. Eventually, the DC-to-Cloud arrow could go away, and CLASS could manage the synchronization of data to the cloud access layer.
3. For now, discovery services like Geoportal could run locally at DCs, but could also eventually move into the cloud. Discovery services could point to the cloud access layer, the existing DC-hosted access mechanisms, or both.
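The split between data "physically organized in Accessions" and "virtually organized in logical directories (e.g., symbolic links)" can be sketched as follows. This is only an illustration on a local POSIX filesystem, not the pilot's actual implementation: the function name `build_logical_view`, the catalog layout, and the accession and file names are all hypothetical.

```python
import os
import tempfile
from pathlib import Path


def build_logical_view(accession_root, view_root, catalog):
    """Create symbolic links under view_root that point at accession files.

    catalog maps a logical directory (hypothetical, e.g. "sst/2012") to a list
    of file paths relative to accession_root.
    """
    links = []
    for logical_dir, members in catalog.items():
        target_dir = Path(view_root) / logical_dir
        target_dir.mkdir(parents=True, exist_ok=True)
        for relative_path in members:
            source = Path(accession_root) / relative_path
            link = target_dir / source.name
            if not link.is_symlink():
                os.symlink(source, link)  # the data stay in the accession
            links.append(link)
    return links


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        accessions = Path(tmp) / "accessions"
        (accessions / "0000001").mkdir(parents=True)
        (accessions / "0000001" / "sst_20120627.nc").write_text("placeholder")
        view = Path(tmp) / "view"
        for link in build_logical_view(
            accessions, view, {"sst/2012": ["0000001/sst_20120627.nc"]}
        ):
            print(link.relative_to(view), "->", os.readlink(link))
```

The point of the design is that access services such as FTP can browse the logical tree while each file is stored once, under its accession.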

Status in Brief
- Complicated landscape with lots of forces at play
- Progress on Archival Storage Phase
- Next few months of CLASS Cloud Access Pilot critical to refine Services Phase

Brief Highlights from each NOAA National Data Center

NODC
- Facing 26% proposed FY13 budget cut; examining centralizing IT in MS, admin in MD
- Good progress on Archival Storage phase; on track to have our data in CLASS by end of calendar year
- Excited about CLASS Cloud Access Pilot and running a cloud computing pilot as well

NGDC
- Working with CLASS on generic Ingest strategies for efficient data migration
- Working with CLASS on Machine-to-Machine Search & Access implementations: Data Center interfaces need to talk to CLASS for querying and placing orders
- Working with CLASS on defining data stewardship needs related to the archive holdings

NCDC
- Internal archival storage migration/consolidation plan under way
- Taking the opportunity to clean up the 700+ individual datasets in NCDC's legacy archive
- NCDC to take lead for GOES-R Access function, to lead Access PDR in Spring 2013
- New NCDC website/portals due Summer 2012
- Implementing configuration management of CDRs and other NCDC data products

Now CLASS Overview

Comprehensive Large Array-data Stewardship System Overview
Presented to DAARWG
Kern Witcher, CLASS Program Manager
June 27, 2012

Background Refresher

CLASS Level I Requirements (preliminary)
As an enterprise solution, CLASS will reduce anticipated cost growth associated with storing environmental datasets by:
- Providing common services for acquisition, security, and project management for the IT system supporting NOAA Archives
- Consolidating stove-pipe, legacy archival storage systems (see Note 3 below), thereby reducing the number of archival storage-related IT projects for NOAA to manage and data systems for customers to access
- Relieving data owners of archival storage-related system development and operations issues
Note 3: Archival storage provides the services and functions for the storage, maintenance and retrieval of archival information packets. Archival storage functions include receiving archival information packets from ingest and adding them to permanent storage, managing the storage hierarchy, refreshing the media in which archive holdings are stored, performing routine and special error checking, providing disaster recovery capabilities, and providing archival information packets to access to fulfill orders.
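Note 3 lists routine error checking among the archival storage functions. One common way to do this is a checksum (fixity) audit, sketched below. The slides do not say how CLASS performs its checks, so the SHA-256 choice, the manifest format, and the names `sha256_of` and `find_fixity_failures` are assumptions made for illustration.

```python
import hashlib
import tempfile
from pathlib import Path


def sha256_of(path, chunk_size=1 << 20):
    """Return the SHA-256 hex digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with Path(path).open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def find_fixity_failures(manifest, root):
    """Return sorted relative paths that are missing or whose digest changed.

    manifest maps a relative path to the digest recorded at ingest
    (a hypothetical format, not one defined by CLASS).
    """
    failures = []
    for relative_path, expected in manifest.items():
        path = Path(root) / relative_path
        if not path.is_file() or sha256_of(path) != expected:
            failures.append(relative_path)
    return sorted(failures)


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        root = Path(tmp)
        (root / "granule.dat").write_bytes(b"archived bytes")
        manifest = {"granule.dat": sha256_of(root / "granule.dat")}
        print(find_fixity_failures(manifest, root))  # [] while the file is intact
        (root / "granule.dat").write_bytes(b"corrupted bytes")
        print(find_fixity_failures(manifest, root))  # ['granule.dat']
```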

Current CLASS Architecture
[Architecture diagram]
- Direct connectivity to: ESPC (NOAA Environmental Satellite Processing Center), National Ice Center, NOAA Coastal Watch, JPSS Interface Data Processing Segment (IDPS)
- Sites: Fairmont, WV (Development Team; Test and Integration Environment; Development Environment); CLASS NCDC, Asheville, NC; CLASS NSOF, Suitland, MD; CLASS NGDC, Boulder, CO
- Functions: Operations, Ingest, Storage (Disk & Tape), Public Access, Satellite Landing Zone
- Replication via NOAA Science Network (N-WAVE)
- Users

Recent Accomplishments and Current Efforts

Accomplishments Since Last Briefing
2.0 Core
- Release 6.0 Linux Migration in final Test and Integration. Release is scheduled for July 8, 2012
- Completed Server Virtualization Study
3.0 Data Center Migration
- Aerospace completed Phase I ("as is" analysis) of Data Center Con Ops
- Completed Data Center Requirements Documents
- Completed Data Center Interface Control Documents
4.0 JPSS
- Successfully supported the launch of NPP
- Have ingested 803 TB of NPP data (1.606 PB across both nodes)
5.0 GOES-R
- Completed Archive and Access PDR
- Completed Interface CDR
- Completed Receipt Node Design
- Machine to Machine (M2M) API in final design
- In final stages of HPSS design
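The two NPP figures on this slide are consistent if the ingested volume is held in full at each of two nodes. A quick check, assuming decimal units (1 PB = 1000 TB); the function name is illustrative:

```python
TB_PER_PB = 1000  # decimal units assumed


def total_stored_pb(ingested_tb, nodes):
    """Total volume in PB when every node holds a full copy of the ingest."""
    return ingested_tb * nodes / TB_PER_PB


total_pb = total_stored_pb(803, 2)
print(f"{total_pb:.3f} PB")  # 1.606 PB
```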

CLASS Suomi-NPP Data Flow
[Data flow diagram. Nodes: Svalbard, Norway; C3S; IDPS; SDS; GRAVITE; STAR; CLASS NSOF; CLASS NGDC; CLASS NCDC; Public Subscriptions; Public Ad hoc orders]
Flows and approximate daily volumes:
- IDPS to SDS: RDRs (~1.3 TB/day)
- IDPS to GRAVITE: RDRs, VIIRS M Band EDR, OMPS Auxiliary Files (~1.4 TB/day)
- IDPS to CLASS: RDRs, SDRs, TDRs, EDRs, ARP, IP, Ancillary Data, Signature Files, Release Packages (~3.2 TB/day)
- CLASS to SDS: SDRs, TDRs, EDRs, ARP, IP, Ancillary Data, Signature Files, Release Packages (~3.3 TB/day)
- CLASS to GRAVITE: SDRs, EDRs, Ancillary Data, Auxiliary Files, Mission Notices (~2.5 TB/day)
- GRAVITE to CLASS: RIPs (~1.5 TB/day)
- CLASS to STAR: SDRs, TDRs, EDRs, ARP, IP, Ancillary Data, Signature File (~3.3 TB/day)
- NWAVE 10 Gb/s replication (~4.7 TB/day)
NPP Sensors:
- ATMS: Advanced Technology Microwave Sounder
- CrIS: Cross-track Infrared Sounder
- OMPS: Ozone Mapping and Profiler Suite
- VIIRS: Visible and Infrared Imaging Radiometer Suite
NPP Segments:
- C3S: Command, Control & Communications Segment
- GRAVITE: Government Resource for Algorithm Verification, Independent Testing & Evaluation
- IDPS: Interface Data Processing Segment
- SDS: Science Data Segment
- STAR: NOAA Center for Satellite Applications & Research
Data products:
- ARP: Application Related Product
- EDR: Environmental Data Record
- IP: Intermediate Product
- RDR: Raw Data Record
- RIP: Retained Intermediate Product
- SDR: Sensor Data Record
- TDR: Temperature Data Record
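To put the replication figures in context, the sketch below converts the ~4.7 TB/day replication volume into a sustained rate and compares it with the 10 Gb/s link. It assumes decimal units (1 TB = 10^12 bytes) and ignores protocol overhead and burstiness, so it is a rough bound rather than a measured utilization; the function names are illustrative.

```python
SECONDS_PER_DAY = 86_400


def sustained_rate_gbps(tb_per_day):
    """Average rate in Gb/s needed to move tb_per_day terabytes in one day."""
    bits_per_day = tb_per_day * 1e12 * 8
    return bits_per_day / SECONDS_PER_DAY / 1e9


def transfer_minutes(tb, link_gbps):
    """Minutes needed to move tb terabytes at the full link rate."""
    return tb * 1e12 * 8 / (link_gbps * 1e9) / 60


rate = sustained_rate_gbps(4.7)
print(f"average rate: {rate:.2f} Gb/s")                                 # about 0.44 Gb/s
print(f"share of a 10 Gb/s link: {rate / 10:.1%}")                      # about 4.4%
print(f"time at full line rate: {transfer_minutes(4.7, 10):.0f} min")   # about 63 min
```

Under these assumptions a day of replication traffic fits in roughly an hour of link time, which leaves headroom for catch-up after an outage.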

Current Program Projections

CLASS Program Milestones
[Gantt chart, FY09 to FY17 by quarter. Rows: GOES-R Campaign (SRR, Archive & Interface PDR, Access PDR, Interface CDR, Archive & Access CDR, ORR, Launch 10/15); NPP Campaign; JPSS-1 (ORR 6/15, Launch 10/16); Climate Model Data; Nexrad Campaign; NDE Campaign; Jason-3 Campaign (SRR 7/11, SDR 10/11, PDR 2/12, CDR 6/12, ORR 9/12, Launch 4/13); Data Center Migration; CLASS Releases (R5.2 to R6.0). Legend: Completed Milestone, Current Milestone; Development, Construction, Integration/Test, Operations]
Notes:
- CDR: Critical Design Review
- FOR: Flight Operations Review
- PDR: Preliminary Design Review
- SDR: Systems Definition Review
- SRR: System Requirements Review
- ORR: Operational Readiness Review
- NCT: NPP Connectivity Test
- ICD: Interface Control Document

CLASS Projected Ingest and Delivery

NOAA Enterprise Archive System (CLASS): Cumulative Total Volume by Data Type
[Two charts of cumulative archive volume in petabytes by fiscal year, actual and projected, broken out by NEXRAD, Model, In-Situ/Misc., and Satellite. The first spans FY08 to FY30 on a 0 to 140 PB axis. The second spans FY08 to FY16 on a 0 to 16 PB axis and marks "Est. FY15 Legacy System Migration Completed". Share labels shown: 44.7%, 27.9%, 19.8%, 7.7%.]

Near and Long Term Plans

Near-Term Plans
2.0 Core
- Initiate Server Virtualization. Expect at least a 30% reduction of servers.
3.0 Data Center Migration
- Initiate Aerospace Phase II ("to be" state) of Data Center Con Ops
- Complete Cloud Pilot Project
- Complete NexRad Pilot Project
4.0 JPSS
- Update the CLASS IRD and ICD to support the Systems Requirements Review for the IDPS Block 1.5 and 2.0 releases
5.0 GOES-R
- Complete Archive and Access CDR
- Procure and deploy the GOES-R receipt node into the test and integration environment
- Update HPSS documentation and begin hardware procurement based on IBM recommendations
- Finalize the M2M design and begin implementation

CY12 Software Releases
[Release timeline, 4/1 to 12/1, with lanes for DEV, I & T, PAL, and OPS. Builds shown: 5.5.3, 6.0, 6.0.1, 6.1, 6.2. Dates shown: 4/30, 6/25, 7/24, 11/5. Annotations: "Test Begins 2/3/13"; "Linux Migration: PaL unavailable for external testing"]
- 5.5.3: NDE, GCOM-W and various NPP fixes
- 6.0: Linux Transition
- 6.0.1: NexRad Pilot
- 6.1: DB Performance enhancements: changes to schema, updates to M2M
- 6.2: Data Center Migration
Note: Please refer to REMEDY for complete release details

CLASS: Looking Forward
[Diagram contrasting Present CLASS and Future CLASS. Labels: Producer; Ingest; Preservation Planning; Data Management; Archival Storage; Administration; Access (requests, results); Consumer; Access Services (Climate.gov, Ordering, Discovery); Satellite, Model, and In-situ inputs; Pre-ingest Services; Data Gateways; Ingest Interface; Access Interface; Admin. Interface; Dissemination Interface; NOAA Enterprise Archival and Storage System; Dissemination Services (Commercial Cloud Services, Private Cloud Services, Direct Delivery, Web Accessible Folders); Administration and Preservation]

Final Thoughts
- CLASS does an outstanding job of meeting its original intended purpose (Large Satellite Data)
- The program recognizes that for CLASS to become a true NOAA enterprise solution, it must evolve
- The challenge is how to evolve in a manner that will not disrupt our current operational missions and accomplish this under a constrained budget environment
- DAARWG could be a valuable resource for providing an IV&V function for data and algorithm integrity