Globus for Data Management



Similar documents
globus online Globus Online for Research Data Management Rachana Ananthakrishnan Great Plains Network Annual Meeting 2013

Globus Research Data Management: Introduction and Service Overview

How To Write A Blog Post On Globus

Practical Solutions for Big Data Analytics

Large-scale Research Data Management and Analysis Using Globus Services. Ravi Madduri Argonne National Lab University of

globus online The Research Cloud Steve Tuecke Computation Institute University of Chicago and Argonne National Laboratory

Globus Auth. Steve Tuecke. The University of Chicago

GridFTP: A Data Transfer Protocol for the Grid

SURFsara Data Services

Level 1: Asigra Cloud Backup Foundation Training

XSEDE Service Provider Software and Services Baseline. September 24, 2015 Version 1.2

Going Google... with Gaggle!!!!

WOS Cloud. ddn.com. Personal Storage for the Enterprise. DDN Solution Brief

Globus Platform-as-a-Service for Collaborative Science Applications

NAS 259 Protecting Your Data with Remote Sync (Rsync)

Hitachi Data Migrator to Cloud Best Practices Guide

Cloud Storage and Backup

White Paper for Data Protection with Synology Snapshot Technology. Based on Btrfs File System

How APIs Turned Cloud on Security on Its Head

SapphireIMS 4.0 BSM Feature Specification

Using and Contributing Virtual Machines to VM Depot

Flight Workflow User's Guide. Release

How To Run A Cloud At Cornell Cac

Axway API Portal. Putting APIs first for your developer ecosystem

ESnet Support for WAN Data Movement

Camilyo APS package by Techno Mango Service Provide Deployment Guide Version 1.0

UIT USpace Flexible and Secure File Manager for Cloud Storage

Technical. Overview. ~ a ~ irods version 4.x

Challenges in Deploying Public Clouds

globus online Integrating with Globus Online Steve Tuecke Computation Institute University of Chicago and Argonne National Laboratory

Installation and Setup: Setup Wizard Account Information

Microsoft Exchange Online Archiving and Symantec Enterprise Vault.cloud

Every Silver Lining Has a Vault in the Cloud

Easily integrate Mac into Microsoft System Center

Business and enterprise cloud sync, backup and sharing solutions

Sophos Endpoint Security and Control How to deploy through Citrix Receiver 2.0

NIH Commons Overview, Framework & Pilots - Version 1. The NIH Commons

CloudBerry Dedup Server

Mobile device management

KASEYA CLOUD SOLUTION CATALOG 2016 Q1. UPDATED & EFFECTIVE AS OF: February 1, Kaseya Catalog Kaseya Copyright All rights reserved.

Next-Generation Networking for Science

CA API Management SaaS

owncloud Configuration and Usage Guide

Features of AnyShare

IBM Campaign Version-independent Integration with IBM Engage Version 1 Release 3 April 8, Integration Guide IBM

Secure file sharing and collaborative working solution

High Speed Transfers Using the Aspera Node API! Aspera Live Webinars November 6, 2012!

How to Use JCWHosting Reseller Cloud Storage Solution

Release Notes. CTERA Portal May CTERA Portal Release Notes 1

Workshop on Cloud Services for File Synchronisation and Sharing NOV 2014

RingStor User Manual. Version 2.1 Last Update on September 17th, RingStor, Inc. 197 Route 18 South, Ste 3000 East Brunswick, NJ

Egnyte Cloud File Server. White Paper

AstroCompute. AWS101 - using the cloud for Science. Brendan Bouffler ( boof ) Scientific Computing AWS. ska-astrocompute@amazon.

Project Documentation

Installing and Using the vnios Trial

Amazon Web Services. Lawrence Berkeley LabTech Conference 9/10/15. Jamie Baker Federal Scientific Account Manager AWS WWPS

How To Manage The Sas Metadata Server With Ibm Director Multiplatform

Veritas AdvisorMail. archiving, compliance, and ediscovery solution designed specifically for U.S. financial services companies

Deploying ArcGIS for Server Using Esri Managed Services

MANAGED SERVICE PROVIDERS SOLUTION BRIEF

IBM Cloud Security Draft for Discussion September 12, IBM Corporation

Authentication Integration

U"lizing the SDSC Cloud Storage Service

MATLAB on EC2 Instructions Guide

Windows Server 2012 R2 The Essentials Experience

Application Development

SAP NetWeaver AS Java

Adobe Summit 2015 Lab 718: Managing Mobile Apps: A PhoneGap Enterprise Introduction for Marketers

PI Cloud Connect Overview

T a c k l i ng Big Data w i th High-Performance

Course Overview. What You Will Learn

Access Instructions for United Stationers ECDB (ecommerce Database) 2.0

File Sharing & Collaboration

Frequently asked questions

Only LDAP-synchronized users can access SAML SSO-enabled web applications. Local end users and applications users cannot access them.

User Guide. Version R91. English

A Service for Data-Intensive Computations on Virtual Clusters

FilesAnywhere Feature List

Transcription:

Globus for Data Management Computation Institute Rachana Ananthakrishnan (ranantha@uchicago.edu)

Data Management Challenges Transfers often take longer than expected based on available network capacities Lack of an easy to use interface to some of the high-performance tools Tools [are] too difficult to install and use Time and interruption to other work required to supervise large data transfers Need data transfer tools that are easy to use, wellsupported, and permitted by site and facility cybersecurity organizations No easy mechanism to share selected data with collaborators Credit: Some data points from ESNet reports

We envisage a world where data flows rapidly, reliably, and securely among: experimental facilities, online and archival storage, computing facilities, and remote institutions

We envisage a world where data is easily and securely shared with collaborators and partners at other institutions

We envisage a world where data is readily discoverable and accessible to collaborators, regardless of their and the data s location

Reliable, secure, high-performance file transfer and synchronization Fire-and-forget transfers Automatic fault recovery Data Source 2 Globus moves and syncs files Data Destination Seamless security integration Powerful GUI and APIs 1 User initiates transfer request 3 Globus notifies user

Transfer Files

Transfer Options

Command line and scripting Interactive login to command line interface: $ ssh tuecke@cli.globusonline.org Running commands remotely: $ ssh tuecke@cli.globusonline.org <command> $ ssh tuecke@cli.globusonline.org scp r s 3 -D \ nersc#dtn:~/myfile* mylaptop:~/projects/p1 Task ID: 4a3c471e-edef-11df-aa30-1231350018b1 $ _ Using CLI with gsissh: $ gsissh tuecke@cli.globusonline.org <command>

Simple, secure sharing off existing storage systems Easily share large data with any user or group No cloud storage required 2 Globus tracks shared files; no need to move files to cloud storage! Data Source 1 User A selects file(s) to share, selects user or group, and sets permissions 3 User B logs in to Globus and accesses shared file

Share files/folders

Manage permissions

Endpoint Administrator Controls Restricted paths Files and folders that can be used to transfer Enable or disable ability to share from an endpoint Sharing restricted paths Files and folders that can be used to share Whitelist of users who are allowed to share from an endpoint Only allow read permissions on shared endpoints created by users

Amazon S3 Endpoints

Deploying Globus

8,000 active endpoints (in the past year)

Ins$tu$on Endpoints Features ALCF JGI alcf#* jgi#* LANL Globus Connect Personal UpdaCng to DTNs LBNL lbnl#* Sharing enabled NERSC nersc#* Custom site OLCF PNNL olcf#* pic#* SDSC sdsc#* Sharing enabled XSEDE xsede#* In process of enabling Sharing

85 U.S. campuses

Globus Connect Personal 19

Globus increasingly used to better utilize network Enable compucng facilices to beler uclize high performance network infrastructure Source: University of Nebraska Holland Compu;ng Center

Using Globus

Ann Syrowski (Illinois) moves data across XSEDE, NCSA MSS and PSC, to leverage HPC facilities across the country Weather Research and ForecasCng Model Source: UCAR

Dan Kozak (Caltech) replicates 1 PB LIGO astronomy data for resilience

Erin Miller (PNNL) collects data at Advanced Photon Source, renders at PNNL, and views at ANL Credit: KersCn Kleese- van Dam 24

Users upload data to their archives at University of Exeter: Globus API used for integration into their services 25

Earth System Grid Federation leverages Globus for data download: including the user interfaces 26

Using Globus web pages 27

Globus is moving beyond transfer and sharing to data publication and discovery

Curated publishing and rich discovery of research data. Describe and publish data in hosted collections Data Source 2 Globus stores the data in collection s external storage; no need to move files to cloud storage! Leverage institutional storage Collection Data Storage Customizable publication and curation workflows 1 User A chooses a collection to publish into, selects file(s) to publish, and associates descriptive metadata User B uses Globus to discover and download published datasets 3

Globus Data Publication SaaS for publishing large research data Bring your own storage Extensible metadata Publication and curation workflows Public and restricted collections Rich discovery model

We are a non-profit, delivering a production-grade service to the non-profit research community Our challenge: Sustainability

Globus Provider Subscriptions Managed Endpoints Priority support Management console Usage reports Mass Storage System optimization Host shared endpoints Integration support Branded Web Site Alternate Identity Provider (InCommon is standard) globus.org/provider-plans

Thank you to our sponsors! U.S. DEPARTMENT OF ENERGY