irods Technologies at UNC

Similar documents
irods Overview Introduction to Data Grids, Policy-Driven Data Management, and Enterprise irods

irods Overview Intro to Data Grids and Policy-Driven Data Management!!Leesa Brieger, RENCI! Reagan Moore, DICE & RENCI!

OSG PUBLIC STORAGE. Tanya Levshina

Technical. Overview. ~ a ~ irods version 4.x

irods Policy-Driven Data Preservation Integrating Cloud Storage and Institutional Repositories

Pluggable Rule Engine

Jenkins World Tour 2015 Santa Clara, CA, September 2-3

CLOUD API DOCUMENTATION v2.0. Get list of cloud servers in account

Software configuration management

Jenkins: The Definitive Guide

GECKO Software. Introducing FACTORY SCHEMES. Adaptable software factory Patterns

irods for Big Data Management in Research Driven Organizations Charles Schmitt CTO & Director of Informatics RENCI

Meister Going Beyond Maven

Conceptualizing Policy-Driven Repository Interoperability (PoDRI) Using irods and Fedora

Data Management using irods

Policy Policy--driven Distributed driven Distributed Data Management (irods) Richard M arciano Marciano marciano@un

DevOps Course Content

irods and Metadata survey Version 0.1 Date March Abhijeet Kodgire 25th

Jenkins Continuous Build System. Jesse Bowes CSCI-5828 Spring 2012

SOFTWARE DEVELOPMENT BASICS SED

RELATED WORK DATANET FEDERATION CONSORTIUM, IRODS,

Content. Development Tools 2(63)

Jenkins User Conference Herzelia, July #jenkinsconf. Testing a Large Support Matrix Using Jenkins. Amir Kibbar HP

Automated and Scalable Data Management System for Genome Sequencing Data

Mastering Continuous Integration with Jenkins

The National Consortium for Data Science (NCDS)

INTEGRATED RULE ORIENTED DATA SYSTEM (IRODS)

Continuous Integration and Automatic Testing for the FLUKA release using Jenkins (and Docker)

DocDokuPLM Innovative PLM solution

Beginners guide to continuous integration. Gilles QUERRET Riverside Software

November 12 th 13 th London: Mastering Continuous Integration with Jenkins

CloudCenter Full Lifecycle Management. An application-defined approach to deploying and managing applications in any datacenter or cloud environment

Continuous Integration: Put it at the heart of your development

integrated Rule-Oriented Data System Reference

Continuous Integration and Delivery at NSIDC

Nexus Professional Whitepaper. Repository Management: Stages of Adoption

Content Distribution Management

Managing your Red Hat Enterprise Linux guests with RHN Satellite

w w w. u l t i m u m t e c h n o l o g i e s. c o m Infrastructure-as-a-Service on the OpenStack platform

Cloud Storage in a PaaS World

Sonatype Nexus Professional

STRATEGIC WHITE PAPER. The next step in server virtualization: How containers are changing the cloud and application landscape

Introduction to Gluster. Versions 3.0.x

Continuous Integration in the Cloud with Hudson

Servers. Servers. NAT Public Subnet: /20. Internet Gateway. VPC Gateway VPC: /16

Server Monitoring. AppDynamics Pro Documentation. Version Page 1

ORACLE OPS CENTER: PROVISIONING AND PATCH AUTOMATION PACK

ETICS - A Clean Approach to Open Source Cloud Software

Scaling Web Applications in a Cloud Environment using Resin 4.0

Chapter 1 - Web Server Management and Cluster Topology

THE EUCALYPTUS OPEN-SOURCE PRIVATE CLOUD

irods at CC-IN2P3: managing petabytes of data

Acronis Backup & Recovery 10 Server for Linux. Update 5. Installation Guide

Seamless integration Connect our solution to your business applications thanks to our open architecture

The future of middleware: enterprise application integration and Fuse

Integrated Rule-based Data Management System for Genome Sequencing Data

Testing Automation for Distributed Applications By Isabel Drost-Fromm, Software Engineer, Elastic

Best Practices Guide. Symantec NetBackup with ExaGrid Disk Backup with Deduplication ExaGrid Systems, Inc. All rights reserved.

JAMF Software Server Installation and Configuration Guide for Linux. Version 9.2

What s New in VMware vcenter 5.0

Linux Distributions. What they are, how they work, which one to choose Avi Alkalay

Cloud Optimize Your IT

Upping the game. Improving your software development process

Acronis Backup & Recovery 10 Server for Linux. Installation Guide

Practicing Continuous Delivery using Hudson. Winston Prakash Oracle Corporation

Build Automation for Mobile. or How to Deliver Quality Apps Continuously. Angelo Rüggeberg

Continuous integration with Jenkins CI

XpoLog Center Suite Log Management & Analysis platform

DRUPAL CONTINUOUS INTEGRATION. Part I - Introduction

Asigra Cloud Backup V13 Delivers Enhanced Protection for Your Critical Enterprise Data

@jenkinsconf. Maintaining huge Jenkins clusters - Have we reached the limit of Jenkins?

insync Installation Guide

Integration in Practice

EMC E Exam Name: Virtualized Data Center and Cloud Infrastructure Design Specialist

Reliable DNS and DHCP for Microsoft Active Directory Protecting and Extending Active Directory Infrastructure with Infoblox Appliances

Cloud Storage and Backup

Continuous Integration and Deployment Modern Technique's

Application Release Automation (ARA) Vs. Continuous Delivery

VERITAS Cluster Server v2.0 Technical Overview

Automatic promotion and versioning with Oracle Data Integrator 12c

Object storage in Cloud Computing and Embedded Processing

Reliable DNS and DHCP for Microsoft Active Directory

Continuous Delivery for Alfresco Solutions. Satisfied customers and happy developers with!! Continuous Delivery!

Paul Barham Program Manager - Java. David Staheli (dastahel@microsoft.com) Software Development Manager - Java

Our Systems Experience, Specifically:

Continuous Integration Multi-Stage Builds for Quality Assurance

Integrating Data Life Cycle into Mission Life Cycle. Arcot Rajasekar

Deployment Topologies

DELIVERABLE. Europeana Cloud: Unlocking Europe s Research via The Cloud. Deliverable D2.1 Development Environment

Cultural Heritage Institutions, Metadata Aggregators and The Cloud Aleksandra Nowak, Marcin Werla Poznań Supercomputing and Networking Center

Managing Next Generation Sequencing Data with irods

Cloud Optimize Your IT

SAP Crystal Reports & SAP HANA: Integration & Roadmap Kenneth Li SAP SESSION CODE: 0401

Requirements Specifications for: The Management Action Record System (MARS) for the African Development Bank

next generation architecture created to safeguard in virtual & physical environments to deliver comprehensive UNIFIED DATA PROTECTION SOLUTION BRIEF

Nuxeo, an open source platform for content-centric business applications. Stéfane Fermigier, Nuxeo Laurent Doguin, Nuxeo

Brian LaGoe, Systems Administrator Benjamin Jellema, Systems Administrator Eastern Michigan University

Service Description Cloud Backup

Archive Data Retention & Compliance. Solutions Integrated Storage Appliances. Management Optimized Storage & Migration

Project Convergence: Integrating Data Grids and Compute Grids. Eugene Steinberg, CTO Grid Dynamics May, 2008

Transcription:

irods Technologies at UNC E-iRODS: Enterprise irods at RENCI Presenter: Leesa Brieger leesa@renci.org SC12 irods Informational Reception 1!

UNC Chapel Hill Investment in irods DICE and RENCI: research units of UNC CH DICE Developers of irods Headed by Reagan Moore Development and support team Arcot Rajasekar Wayne Schroeder Mike Wan Sheau-Yen Chen Hao Xu Antoine DeTorcy Mike Conway (DICE-UNC and DICE-UCSD) RENCI Developers of E-iRODS Director of RENCI: Stan Ahalt Development and support team Jason Coposky H. Johnson Howard Lander Terrell Russell Michael Stealey Lisa Stillwell Leesa Brieger Charles Schmitt SC12 irods Informational Reception 2!

irods Technologies irods Research-oriented/community code Funded by research organizations E-iRODS Represents RENCI s (and UNC s) long-term support for irods Targeting new funding models for sustainability moving beyond traditional public research funds First release based on irods 3.0 second beta came out in June 2012 www.e-irods.org Inspired by the RedHat/Fedora open source model E-iRODS Consortium to fund Enterprise support infrastructure SC12 irods Informational Reception 3!

E-iRODS Goals Production-ready Supports deployment by sys admin without IT staff for development and customization Greater modularity: microservices, authentication, resources, Dynamic extensibility: (hot) plug-ins Greater security - hardening, signed plug-ins User-friendly - installation and use Admin-friendly - upgrades and administration High availability - scalable, fail-over capabilities Dependency inversion Back-porting to the irods code SC12 irods Informational Reception 4!

E-iRODS Processes Refactoring - simple modular architecture - object-oriented Issue Tracking - prioritizing issues - issue ownership - every commit is tagged Testing - unit, functional, regression - cppcheck for static analysis - Valgrind for dynamic memory analysis - gcov for coverage metrics - variety of topologies - federated grid tests Automate Everything - Continuous Integration (CI) Packaging - EPM metapackager generates RPM, DEB, DMG across multiple OS versions - dependency management inherited from OS tools: apt (DEB), yum(rpm) Documentation - Doxygen - administration manuals - compiled from source during packaging - user manuals SC12 irods Informational Reception 5!

Collaborative Development and Test Environment at RENCI Git distributed revision control system GForge project management system hosting & version control bug-tracking messaging https://code.renci.org/gf/project/ Hudson/Jenkins Continuous Integration environment: incremental quality control Nexus Maven repository that tracks dependencies and bundles for check-out (Java) SC12 irods Informational Reception 6!

Continuous Integration Automated via Hudson (moving to Jenkins) A risk reduction technique Push code frequently to the repository Build & test for each new commit in order to catch defects as early as possible Periodic testing as well Automated CI removes a level of burden from developers and provides constant insight to the state of the project SC12 irods Informational Reception 7!

Continuous Integration (CI) Automated system testing Frequent code pushes to repository Continuous Build & Test Reduces risk by detecting defects early Reduces testing staff time Presentation title goes here 8!

Testing Framework Predefined grid bundles Automated generation of distributed E-iRODS zone NSF Exo-genie infrastructure for dynamically configured virtual networks IP and hostname stitching Scripted tests and expected outcomes Presentation title goes here 9!

Testing 100% test coverage of server-side APIs across selected platforms and topologies: n-way testing across all combinations Packages released as of June 2012 DEB (Debian, Ubuntu) DMG (MacOSX) Unix client (icommands) RPM (RHEL, CentOS, Fedora, SUSE) Topologies Single zone: icat server + 2 non-icat servers Federation: two single zones Consortium Certification Planned: Solaris Windows (MSI) MacOSX (servers) SC12 irods Informational Reception 10!

E-iRODS View services. E-iRODS 3.0 will include pluggable microservices and SC12 irods pluggable Informational resources. Reception 11!

Pluggable Microservices To be included in E-iRODS 3.0 Planned for next release of irods (3.3) Hot pluggability - no need to shut down data grid to update functionality - data grid can activate some plug-ins (done by data admin, not sys admin) Signed microservices provides greater security Plug-in dependency model facilitates upgrade of the data grid

E-iRODS Resources Data Systems E- irods E-iRODS: production management of distributed data assets E- irods Data Systems E- irods Data Grid E- irods Data Systems unified view and management of disparate storage systems policy-driven data management private/community data cloud provisioning extensible and open-source SC12 irods Informational Reception 13!

Policy-Managed Pluggable Resources PEPs irods Rule Engine User develops pluggable resource Resource Wrapper Pluggable Resource Resource- specific rules Code inspection allows for autogenerated Policy Enforcement Points (PEPs) Grid admin can then develop standard irods policy-enforcing rules specific to the resource Local resource Remote resource Use Case Example Pluggable resource replicates off-site to ensure high availability, with an exception irods rule turns off remote replication for files tagged with Protected Health Information metadata

E-iRODS Composable Resources Tree-based approach - horizontal and vertical extensibility E- irods Resource Parent-child relationships between resources Resource 1 (high performance storage) Resource 2 (NFS)... Resource N (archive) Resource A (array of disk) Resource B (tape) Generalizes irods resource groups Operations introduce policy at the storage level... To be included in E-iRODS 3.0 with smart resources for HSM, round robin, replication, load balancing

Anticipated Timeline for Future Development 3.0 18 months... Micro- Service Plug-in Resource Plug-in Inter-dependent plug-ins: databases objects (workflows) authentication RPC messaging Consortiumdriven future development SC12 irods Informational Reception 16!

E-iRODS Consortium Summary Membership dues will fund basic E-iRODS development; research funding will keep irods development going E-iRODS remains completely open source (binary and source code) The Consortium will gather together organizations to Protect the longevity of the irods technology Allow input and coordination on the roadmap Unite organizations with common (data) goals Membership levels General Access to release roadmap Privileged access to (paid) consulting and technical support Professional Voting rights to release roadmap Non-voting seat on governing board Sustaining Voting seat on governing board SC12 irods Informational Reception 17!

E-iRODS Consortium Contacts: Charles Schmitt (cschmitt@renci.org) and Leesa Brieger (leesa@renci.org) Draft charter document available on request We are looking for discussions with potential partners Suggestions and ideas welcome To reach consensus before formal establishment of the consortium SC12 irods Informational Reception 18!