Globus Research Data Management: Introduction and Service Overview

Similar documents
Globus Auth. Steve Tuecke. The University of Chicago

How To Write A Blog Post On Globus

Large-scale Research Data Management and Analysis Using Globus Services. Ravi Madduri Argonne National Lab University of

Practical Solutions for Big Data Analytics

Globus for Data Management

itds OAuth Integration Paterva itds OAuth Integration Building and re-using OAuth providers within Maltego 2014/09/22

Flexible Identity Federation

globus online Globus Online for Research Data Management Rachana Ananthakrishnan Great Plains Network Annual Meeting 2013

Globus Research Data Management: Endpoint Configuration and Deployment. Steve Tuecke Vas Vasiliadis

globus online The Research Cloud Steve Tuecke Computation Institute University of Chicago and Argonne National Laboratory

Axway API Portal. Putting APIs first for your developer ecosystem

VMware ESXi in a Cloud-based Lab David Davis, VCP, VCAP, and vexpert

Using RD Gateway with Azure Multifactor Authentication

Canadian National Research Data Repository Service. CC and CARL Partnership for a national platform for Research Data Management

Configuration Guide - OneDesk to SalesForce Connector

Egnyte Single Sign-On (SSO) Installation for OneLogin

OCLC CONTENTdm. Geri Ingram Community Manager. Overview. Spring 2015 CONTENTdm User Conference Goucher College Baltimore MD May 27, 2015

Okta/Dropbox Active Directory Integration Guide

Copyright Pivotal Software Inc, of 10

HarePoint Workflow Extensions for Office 365. Quick Start Guide

When your users take devices outside the corporate environment, these web security policies and defenses within your network no longer work.

SOA Software API Gateway Appliance 7.1.x Administration Guide

KBase and Globus Online Nexus. Shreyas Cholia NERSC/LBL

Quick Start Guide. IT Management On-Demand

Tenrox. Single Sign-On (SSO) Setup Guide. January, Tenrox. All rights reserved.

Research Data Management Guide

Installation and Administration Guide

Data Lab System Architecture

This chapter describes how to use the Junos Pulse Secure Access Service in a SAML single sign-on deployment. It includes the following sections:

Data Management Plan. Name of Contractor. Name of project. Project Duration Start date : End: DMP Version. Date Amended, if any

McAfee Cloud Single Sign On

PingFederate. Windows Live Cloud Identity Connector. User Guide. Version 1.0

Connected Data. Connected Data requirements for SSO

Tableau Online. Understanding Data Updates

Integration Guide. SafeNet Authentication Service. Using SAS as an Identity Provider for Salesforce

DocuSign Single Sign On Implementation Guide Published: March 17, 2016

Administering Jive for Outlook

HP Operations Orchestration Software

Discovering Devices CHAPTER

AdRadionet to IBM Bluemix Connectivity Quickstart User Guide

Virtual Cabinet Document Portal User Guide

Cloud Elements! Events Management BETA! API Version 2.0

PI Cloud Connect Overview

Advanced Configuration Steps

nexus Hybrid Access Gateway

Lytecube Technologies. EnCircle Automation. User Guide

Globus Genomics Tutorial GlobusWorld 2014

STK Terrain Server Installation Guide

ElephantDrive Cloud Backup Module Contents

Helpdesk Ticketing User Guide

User Guide. Version R91. English

Salesforce Files Connect Implementation Guide

IMPLEMENTING SINGLE SIGN- ON USING SAML 2.0 ON JUNIPER NETWORKS MAG SERIES JUNOS PULSE GATEWAYS

PrinterOn Mobile Applications for ios and Android

IT Security & Compliance. On Time. On Budget. On Demand.

HOW TO SILENTLY INSTALL CLOUD LINK REMOTELY WITHOUT SUPERVISION

Building Secure Applications. James Tedrick

Configuring. Moodle. Chapter 82

AAdobe Systems Incorporated. Controlling Access to Adobe Creative Cloud Services INTRODUCTION SYNOPSIS

Apigee Gateway Specifications

STABLE & SECURE BANK lab writeup. Page 1 of 21

How to configure the TopCloudXL WHMCS plugin (version 2+) Update: Version: 2.2

How To Useuk Data Service

FortyCloud Installation Guide. Installing FortyCloud Gateways Using AMIs (AWS Billing)

Contents Overview of RD Web Access What is RD Web Access?... 2 What are the benefits of RD Web Access versus thin client?...

PrinterOn Mobile Print Application Overview and User Guide

Chapter 9 PUBLIC CLOUD LABORATORY. Sucha Smanchat, PhD. Faculty of Information Technology. King Mongkut s University of Technology North Bangkok

Coillte IT has recently upgraded the Remote Access Solution to a new platform.

OAuth 2.0 Developers Guide. Ping Identity, Inc th Street, Suite 100, Denver, CO

JumpCloud is your Directory-as-a-Service. A fully managed directory to rule your infrastructure whether on-premise or in the cloud.

High Speed Transfers Using the Aspera Node API! Aspera Live Webinars November 6, 2012!

MaaS360 Cloud Extender

Nexus Professional Whitepaper. Repository Management: Stages of Adoption

Cloudera Manager Training: Hands-On Exercises

CA Nimsoft Service Desk

Discovering Devices. The Cisco Prime Collaboration Manager discovery process involves three phases: Access-level discovery Prime CM:

USER GUIDE for Salesforce

MassTransit vs. FTP Comparison

Web Content Management (Web CMS) for Internal or External Sites Request for Proposal (RFP) Template

Windows Clients and GoPrint Print Queues

The Java EE 7 Platform and Support for the PaaS Model

Agenda. How to configure

MaaS360 On-Premises Cloud Extender

How To Use Salesforce Identity Features

Sharepoint server SSO

globus online Integrating with Globus Online Steve Tuecke Computation Institute University of Chicago and Argonne National Laboratory

CloudBerry Dedup Server

Central Administration QuickStart Guide

Discovering Devices. The Cisco Prime Collaboration Manager discovery process involves three phases: Access-level discovery Cisco Prime CM:

AWS CodePipeline. User Guide API Version

Contents Release Notes System Requirements Administering Jive for Office

Transcription:

Globus Research Data Management: Introduction and Service Overview Kyle Chard chard@uchicago.edu Ben Blaiszik blaiszik@uchicago.edu

Thank you to our sponsors! U. S. D E P A R T M E N T OF ENERGY 2

Agenda Research data management challenges Globus: a high-level flyover File Transfer and Sharing: Accelerating and streamlining collaboration Data Publication: Enhancing reproducibility and discoverability New features: Globus Auth and HTTP Leveraging the Globus platform 3

Research data management scenarios and challenges 4

I need to easily, quickly, & reliably move portions of my data to other locations. Research Computing HPC Cluster Campus Home Filesystem Lab Server Personal Laptop Desktop Workstation XSEDE Resource Public Cloud 5

I need to get data from a scientific instrument to my analysis system. MRI Advanced Light Source Next Gen Sequencer Light Sheet Microscope 6

I need to easily and securely share my data with my colleagues at other institutions. 7

I need to publish my data so others can find/use/validate/reproduce it. Reference Dataset Scholarly Publication Research Community Collaboration 8

Research data management today Index?

Globus and the research data lifecycle Instrument 1 Researcher initiates transfer request; or requested automatically by script, science gateway Globus transfers files reliably, securely 2 Transfer Only a Web browser required Use storage system of your choice Access using your campus credentials Compute Facility 3 Share Researcher selects files to share, selects user or group, and sets access permissions Collaborator logs in to Globus and accesses shared files; no local account required; download via Globus 5 Personal Computer 4 Globus controls access to shared files on existing storage; no need to move files to cloud storage! 6 Researcher assembles data set; describes it using metadata (Dublin core and domainspecific) Publish 6 7 Curator reviews and approves; data set published on campus or other system Peers, collaborators search and discover datasets; transfer and share using Globus Discover Publication Repository 8 10

Globus delivers Big data transfer, sharing, publication, and discovery directly from your own storage systems...via software-as-a-service 11

Globus is SaaS Easy to access via Web browser Command line, REST interfaces for flexible automation and integration New features automatically available Reduced IT operational costs Small local footprint (Globus Connect) Consolidated support and troubleshooting 12

Our focus: User Experience for your photos for your office docs for your entertainment for your research data 13

Accessing Globus and Moving Data 14

Sign up & transfer files 1. Go to: www.globus.org/signup 2. Create your Globus account 3. Validate e-mail address 4. Optional: Login with your campus/incommon identity 5. Install Globus Connect Personal 6. Move files from kyle#ncsa-tutorial endpoint to your laptop 15

Sharing Data 16

Lowering collaboration overhead Grant collaborators access to data on systems without requiring local accounts No need to replicate or move data to separate system/cloud just for sharing Researchers manage virtual ACLs Respect local system access controls

Share files 1. Join the Tutorial Users groups Go to Groups, search for tutorial Select group from list, click Join Group 2. Create a shared endpoint on your laptop 3. Grant your neighbor permissions on your shared endpoint 4. Access your neighbor s shared endpoint 18

Group Management 19

Group Management User-managed group creation and management Flexible control over roles, policies, workflows, and visibility Groups can be used in authorization decisions Example: kbase Every kbase user added to kbase_users Subgroups also created Groups used for access control 20

Exercise 3: Create/configure group 1. Create a group Go to globus.org/groups Click Create New Group Enter the group name and a short description Set visibility to all Globus members 2. Configure your group policies Select your group and click the Settings tab Set requests to a logged in Globus user Set approvals to automatically if all policies are met 3. Ask your neighbor to join your group 4. Grant permissions to the group on your shared endpoint 5. Confirm your neighbor can access your shared endpoint 21

Enhancing reproducibility and discoverability 22

Globus data publication framework Identifier URL Handle DOI Description None Standard Domain-specific Custom Curation None Acceptance Human-validated Machine-validated Access Anonymous Public Embargoed Collaborators Preservation Transient Project Lifetime Archive forever 23

24

Yet another publication system? Software as a Service Self service management Identifiers, policies, submission and curation workflows, storage, metadata, access control Remote storage Supports arbitrarily large datasets Powerful search 25

Exercise 4: Publish a dataset 1. Go to trial.publish.globus.org 2. Log in, click Submit a New Dataset 3. Select either of the Open Trial collections and continue 1. Accept the license terms 2. Enter required metadata to describe the dataset 3. Assemble data set from the kye#ncsa-tutorial endpoint (or your own laptop if you installed Globus Connect Personal) 4. Complete the workflow and submit 4. Curators (a.k.a. presenters) will review your submission and publish 5. Search for your published dataset and browse the data 26

Globus today 4 major services 122 PB transferred 20 billion files processed 31,000 registered users 13 national labs use Globus 10,000 active endpoints ~350 active daily users 99.95% uptime 35+ institutional subscribers 1 PB largest single transfer to date 3 months longest continuously managed transfer 130 federated campus identities

New Features: Globus Auth and HTTP 28

New features: Globus Auth 29

New features: HTTP 30

Leveraging the Globus Platform 31

Globus APIs Globus Connect Globus Platform-as-a-Service Data Publication & Discovery File Sharing File Transfer & Replication Identity, Group, and Profile Management Globus Toolkit 32

APIs and Clients Globus Nexus (identity, profile, groups) API: http://globusonline.github.io/nexus-docs/api.html Clients: o https://github.com/globusonline/python-nexus-client o https://github.com/globusonline/java-nexus-client Globus transfer API: https://docs.globus.org/api/transfer/ Clients: o https://github.com/globusonline/transfer-api-client-python o https://github.com/globusonline/transfer-api-client-java Globus publication (coming soon) 33

Developing a 3 rd party application Register client (URL and client username/password) Implement an OAuth client (or reuse an existing one) Store the resulting access token and make calls to Globus APIs on behalf ot the authenticated user 34

Globus OAuth workflow 35

Developing a 3 rd party application 36

Branded sites 37

Resouces Signup: globus.org/signup Enable your resource: globus.org/globusconnect-server Need help? support.globus.org Follow us: @globusonline 38

Thank you! 39