INTEGRATED RULE ORIENTED DATA SYSTEM (IRODS)

Size: px
Start display at page:

Download "INTEGRATED RULE ORIENTED DATA SYSTEM (IRODS)"

Transcription

1 INTEGRATED RULE ORIENTED DATA SYSTEM (IRODS) Todd BenDor Associate Professor Dept. of City and Regional Planning UNC-Chapel Hill SESYNC Model Integration Workshop

2 Important note: Most of this is blatantly ripped off from irods official documentation irods executive summary/introduction Reagan Moore, UNC SILS Adil Hasan, Liverpool

3 Punchline Model integration and flexibility needs sophisticated tools for collecting, curating, and updating databases. What happens when our model integration dreams come true? Maintaining data provenance and facilitating loose/tight coupling is key.

4 Overview What is irods? Who Uses irods? What Can irods Do?

5 What is irods? integrated Rule Oriented Data Management System. irods is open source data grid middleware that implements Data Virtualization Automation of Data Operations A Robust Metadata Catalog Data Management Policy Enforcement and Compliance Verification

6 irods? Based on considerable experience from Storage Resource Broker (SRB) developed by Data Intensive Cyber Environment (DICE) group at UNC, UCSD, SD Super Computer Center. Found many groups used SRB to store large quantities of data. A lot of server-side post-processing of the data is required (e.g. replicate files, convert to different format, checksum etc). Almost all management is policy driven.

7 Emergence of irods SRB experience motivated requirements for a new data management system Contained all SRB functionality Add work-flow to manage server-side post-processing Model coupling potential grows Configurable only include the 'services' you need Open-source SRB license imposed sever restrictions on the academic community

8 Claims to fame An infinitely configurable data janitor irods is the kind of technology you need to host everyone s unstructured data. A powerful data migration tool A data preservation technology A tool for providing fine-grained privacy and security controls Can be seen as a basis for a Digital Repository/Archive Digital Repository/Archive is a Policy Driven System

9 but wait, there s more! Extensible: irods has command-line clients, APIs for numerous programming languages, and web clients Data is accessed using familiar APIs Supports new plug-ins for storage resources, authentication mechanisms, microservices, and network protection irods lets system administrators roll out an extensible data grid without changing their infrastructure. OPEN SOURCE

10 How will models interface with large amounts of heterogeneous data? Microservices Authentication Mechanisms Integrated Custom Client Command Line Client Familiar APIs Storage Resources Network Services Web Client Coordinating Resources Databases

11 irods is Middleware mid dle ware `midl,we(ə)r noun software that acts as a bridge between an operating system or database and applications, especially on a network Applications/Users irods Data Grid Operating System Filesystem irods Middleware Layer - abstracts out the low-level I/O - provides a uniform interface to heterogeneous storage systems (Heterogeneous) Storage Systems

12 Data Virtualization across Grids Administrators control how the grid is presented to users Implement replication, load-distribution, and archiving policies that are completely transparent to the user. Independent grids can be federated with one another to allow controlled access to remote grids or grids operated by separate workgroups. irods Data Grid Federation irods Data Grid Boston Chicago RTP System 1 RTP System 2 London

13 Automation of Data Operations With irods, any agent can initiate any action upon any trigger. This powerful capability allows administrators to automate policies such as: Validating checksums every time a new file is placed in a folder. Backing up a set of files every second Thursday. Archiving data that hasn t been accessed in over 1 month. Logging each time a file is replicated or destroyed. Permitting a file to be accessed by multiple independently defined user groups. These operations can be distributed to the storage resource or client.

14 Policy Enforcement and Compliance Verifications Metadata + automation = infrastructure to enforce mandated data management policies Records retention and privacy protection requirements Audit trails generated by irods can be used to verify compliance with policy.

15 Who Uses irods? 15 iplant: 15,000 users on an irods data grid with 100 million files IN2P3 (French Nat. Inst. Nuclear and Particle Physics): over 6 PB of data managed by irods Sanger Institute (Genome Research): 20+ PB of irods data NASA Center for Climate Simulations: 300 million metadata attributes CineGRID (exchanging digital media): sites distributed across Japan- US-Europe

16 What Can irods Do? irods simplifies data grid management BnF (2008) Data on different storage devices at different locations can be centrally managed. In situ migration to new hardware can be managed by replicating the legacy resource before repurposing or decommissioning it. Backup and archiving are transparent and highly configurable using the automation and metadata capabilities in irods.

17 What Can irods Do? irods simplifies data discovery, data validation, and data processing. attribute: library attribute: total_reads attribute: attribute: library type attribute: attribute: total_reads lane attribute: attribute: type is_paired_read attribute: attribute: lane study_accession_number attribute: ute: library_id libra attribute: is_paired_read attribute: attribute: study_accession_number sample_accession_number attribute: ribute: sample_public_name total_ attribute: library_id attribute: attribute: sample_accession_number manual_qc attribute: ribute: tag type attribute: sample_public_name attribute: attribute: manual_qc sample_common_name attribute: ribute: md5lane attribute: tag attribute: attribute: sample_common_name tag_index attribute: ribute: study_title is_paire attribute: md5 attribute: attribute: tag_index study_id attribute: ribute: reference study_ attribute: study_title attribute: attribute: study_id sample attribute: attribute: ute: reference target li attribute: attribute: sample sample_id attribute: attribute: target id_run attribute: attribute: sample_id study attribute: attribute: id_run alignment attribute: study attribute: alignment attribute: library attribute: total_reads attribute: type attribute: lane attribute: is_paired_read attribute: study_accession_number attribute: library_id attribute: sample_accession_number attribute: sample_public_name attribute: manual_qc attribute: tag attribute: sample_common_name attribute: md5 attribute: tag_index attribute: study_title attribute: study_id attribute: reference attribute: sample attribute: target attribute: sample_id attribute: id_run attribute: study attribute: alignment User-defined and intrinsic metadata make stored data searchable. Validation and analytical tools can be automated to process incoming data. The results and process steps can be stored in the icat metadata catalog.

18 What Can irods Do? Distributed Data Management Data Preservation Digital Archives Data at scale Data Maintenance Automated Data Processing Data Curation Digital Libraries Large number of users Complex management tasks Critical policy enforcement Data Virtualization Data Protection and Security Data Sharing and Access Policy Enforcement

19 Where irods fits? Client interacts with digital repository to access data Access Digital Repository Storage IRODS Spans Digital Repository and Storage Domains

20 Rules Policies are implemented in irods as rules. Rule is a series of logically connected steps. Each step realised as a micro-service. IRODS rules fully featured: Contain loops and branches. Can have rules contained within rules. IRODS rules read from a rule-file (called core.irb by default).

21

22 Questions/ideas? Todd BenDor Department of City and Regional Planning

Managing Next Generation Sequencing Data with irods

Managing Next Generation Sequencing Data with irods Managing Next Generation Sequencing Data with irods Presented by Dan Bedard // danb@renci.org at the 9 th International Conference on Genomics Shenzhen, China September 12, 2014 Managing NGS Data with

More information

Data Management using irods

Data Management using irods Data Management using irods Fundamentals of Data Management September 2014 Albert Heyrovsky Applications Developer, EPCC a.heyrovsky@epcc.ed.ac.uk 2 Course outline Why talk about irods? What is irods?

More information

Policy Policy--driven Distributed driven Distributed Data Management (irods) Richard M arciano Marciano marciano@un marciano @un.

Policy Policy--driven Distributed driven Distributed Data Management (irods) Richard M arciano Marciano marciano@un marciano @un. Policy-driven Distributed Data Management (irods) Richard Marciano marciano@unc.edu Professor @ SILS / Chief Scientist for Persistent Archives and Digital Preservation @ RENCI Director of the Sustainable

More information

Data grid storage for digital libraries and archives using irods

Data grid storage for digital libraries and archives using irods Data grid storage for digital libraries and archives using irods Mark Hedges, Centre for e-research, King s College London eresearch Australasia, Melbourne, 30 th Sept. 2008 Background: Project History

More information

irods and Metadata survey Version 0.1 Date March Abhijeet Kodgire akodgire@indiana.edu 25th

irods and Metadata survey Version 0.1 Date March Abhijeet Kodgire akodgire@indiana.edu 25th irods and Metadata survey Version 0.1 Date 25th March Purpose Survey of Status Complete Author Abhijeet Kodgire akodgire@indiana.edu Table of Contents 1 Abstract... 3 2 Categories and Subject Descriptors...

More information

Technical. Overview. ~ a ~ irods version 4.x

Technical. Overview. ~ a ~ irods version 4.x Technical Overview ~ a ~ irods version 4.x The integrated Ru e-oriented DATA System irods is open-source, data management software that lets users: access, manage, and share data across any type or number

More information

Automated and Scalable Data Management System for Genome Sequencing Data

Automated and Scalable Data Management System for Genome Sequencing Data Automated and Scalable Data Management System for Genome Sequencing Data Michael Mueller NIHR Imperial BRC Informatics Facility Faculty of Medicine Hammersmith Hospital Campus Continuously falling costs

More information

The National Consortium for Data Science (NCDS)

The National Consortium for Data Science (NCDS) The National Consortium for Data Science (NCDS) A Public-Private Partnership to Advance Data Science Ashok Krishnamurthy PhD Deputy Director, RENCI University of North Carolina, Chapel Hill What is NCDS?

More information

RELATED WORK DATANET FEDERATION CONSORTIUM, HTTP://WWW.DATAFED.ORG IRODS, HTTP://IRODS.DICERESEARCH.ORG

RELATED WORK DATANET FEDERATION CONSORTIUM, HTTP://WWW.DATAFED.ORG IRODS, HTTP://IRODS.DICERESEARCH.ORG REAGAN W. MOORE DIRECTOR DATA INTENSIVE CYBER ENVIRONMENTS CENTER UNIVERSITY OF NORTH CAROLINA AT CHAPEL HILL RWMOORE@RENCI.ORG PRIMARY RESEARCH OR PRACTICE AREA(S): POLICY-BASED DATA MANAGEMENT PREVIOUS

More information

DataGrids 2.0 irods - A Second Generation Data Cyberinfrastructure. Arcot (RAJA) Rajasekar DICE/SDSC/UCSD

DataGrids 2.0 irods - A Second Generation Data Cyberinfrastructure. Arcot (RAJA) Rajasekar DICE/SDSC/UCSD DataGrids 2.0 irods - A Second Generation Data Cyberinfrastructure Arcot (RAJA) Rajasekar DICE/SDSC/UCSD What is SRB? First Generation Data Grid middleware developed at the San Diego Supercomputer Center

More information

Technology solutions for managing and computing on largescale biomedical data

Technology solutions for managing and computing on largescale biomedical data Technology solutions for managing and computing on largescale biomedical data Charles Schmitt CTO & Director of Informatics RENCI Brand Fortner Executive Director, irods Consortium Jason Coposky Chief

More information

Digital Preservation Lifecycle Management

Digital Preservation Lifecycle Management Digital Preservation Lifecycle Management Building a demonstration prototype for the preservation of large-scale multi-media collections Arcot Rajasekar San Diego Supercomputer Center, University of California,

More information

irods Overview Intro to Data Grids and Policy-Driven Data Management!!Leesa Brieger, RENCI! Reagan Moore, DICE & RENCI!

irods Overview Intro to Data Grids and Policy-Driven Data Management!!Leesa Brieger, RENCI! Reagan Moore, DICE & RENCI! irods Overview Intro to Data Grids and Policy-Driven Data Management!!Leesa Brieger, RENCI! Reagan Moore, DICE & RENCI! Renaissance Computing Institute (RENCI) A research unit of UNC Chapel Hill Current

More information

Data Management in an International Data Grid Project. Timur Chabuk 04/09/2007

Data Management in an International Data Grid Project. Timur Chabuk 04/09/2007 Data Management in an International Data Grid Project Timur Chabuk 04/09/2007 Intro LHC opened in 2005 several Petabytes of data per year data created at CERN distributed to Regional Centers all over the

More information

Conceptualizing Policy-Driven Repository Interoperability (PoDRI) Using irods and Fedora

Conceptualizing Policy-Driven Repository Interoperability (PoDRI) Using irods and Fedora Conceptualizing Policy-Driven Repository Interoperability (PoDRI) Using irods and Fedora David Pcolar Carolina Digital Repository (CDR) david_pcolar@unc.edu Alexandra Chassanoff School of Information &

More information

Intro to Data Management. Chris Jordan Data Management and Collections Group Texas Advanced Computing Center

Intro to Data Management. Chris Jordan Data Management and Collections Group Texas Advanced Computing Center Intro to Data Management Chris Jordan Data Management and Collections Group Texas Advanced Computing Center Why Data Management? Digital research, above all, creates files Lots of files Without a plan,

More information

The Data Grid: Towards an Architecture for Distributed Management and Analysis of Large Scientific Datasets

The Data Grid: Towards an Architecture for Distributed Management and Analysis of Large Scientific Datasets The Data Grid: Towards an Architecture for Distributed Management and Analysis of Large Scientific Datasets!! Large data collections appear in many scientific domains like climate studies.!! Users and

More information

Integrating Data Life Cycle into Mission Life Cycle. Arcot Rajasekar rajasekar@unc.edu sekar@diceresearch.org

Integrating Data Life Cycle into Mission Life Cycle. Arcot Rajasekar rajasekar@unc.edu sekar@diceresearch.org Integrating Data Life Cycle into Mission Life Cycle Arcot Rajasekar rajasekar@unc.edu sekar@diceresearch.org 1 Technology of Interest Provide an end-to-end capability for Exa-scale data orchestration From

More information

Assessment of RLG Trusted Digital Repository Requirements

Assessment of RLG Trusted Digital Repository Requirements Assessment of RLG Trusted Digital Repository Requirements Reagan W. Moore San Diego Supercomputer Center 9500 Gilman Drive La Jolla, CA 92093-0505 01 858 534 5073 moore@sdsc.edu ABSTRACT The RLG/NARA trusted

More information

irods Policy-Driven Data Preservation Integrating Cloud Storage and Institutional Repositories

irods Policy-Driven Data Preservation Integrating Cloud Storage and Institutional Repositories irods Policy-Driven Data Preservation Integrating Cloud Storage and Institutional Repositories Reagan W. Moore Arcot Rajasekar Mike Wan {moore,sekar,mwan}@diceresearch.org h;p://irods.diceresearch.org

More information

Integrated Rule-based Data Management System for Genome Sequencing Data

Integrated Rule-based Data Management System for Genome Sequencing Data Integrated Rule-based Data Management System for Genome Sequencing Data A Research Data Management (RDM) Green Shoots Pilots Project Report by Michael Mueller, Simon Burbidge, Steven Lawlor and Jorge Ferrer

More information

irods Overview Introduction to Data Grids, Policy-Driven Data Management, and Enterprise irods

irods Overview Introduction to Data Grids, Policy-Driven Data Management, and Enterprise irods irods Overview Introduction to Data Grids, Policy-Driven Data Management, and Enterprise irods Renaissance Computing Institute (RENCI) A research unit of UNC Chapel Hill Directed by Stan Ahalt, formerly

More information

integrated Rule-Oriented Data System Reference

integrated Rule-Oriented Data System Reference i integrated Rule-Oriented Data System Reference Arcot Rajasekar 1 Michael Wan 2 Reagan Moore 1 Wayne Schroeder 2 Sheau-Yen Chen 2 Lucas Gilbert 2 Chien-Yi Hou Richard Marciano 1 Paul Tooby 2 Antoine de

More information

Figure 1: MQSeries enabled TCL application in a heterogamous enterprise environment

Figure 1: MQSeries enabled TCL application in a heterogamous enterprise environment MQSeries Enabled Tcl Application Ping Tong, Senior Consultant at Intelliclaim Inc., ptong@intelliclaim.com Daniel Lyakovetsky, CIO at Intelliclaim Inc., dlyakove@intelliclaim.com Sergey Polyakov, VP Development

More information

9 ways to revolutionize HR with paperless productivity

9 ways to revolutionize HR with paperless productivity Human Resources Management 9 ways to revolutionize HR with paperless productivity A guided tour of paperless Human Resources software using the Document Locator document management system. Human Resources

More information

WebDat: Bridging the Gap between Unstructured and Structured Data

WebDat: Bridging the Gap between Unstructured and Structured Data FERMILAB-CONF-08-581-TD WebDat: Bridging the Gap between Unstructured and Structured Data 1 Fermi National Accelerator Laboratory Batavia, IL 60510, USA E-mail: nogiec@fnal.gov Kelley Trombly-Freytag Fermi

More information

Implementing an Electronic Document and Records Management System. Key Considerations

Implementing an Electronic Document and Records Management System. Key Considerations Implementing an Electronic Document and Records Management System Key Considerations Commonwealth of Australia 2011 This work is copyright. Apart from any use as permitted under the Copyright Act 1968,

More information

PoS(ISGC 2013)021. SCALA: A Framework for Graphical Operations for irods. Wataru Takase KEK E-mail: wataru.takase@kek.jp

PoS(ISGC 2013)021. SCALA: A Framework for Graphical Operations for irods. Wataru Takase KEK E-mail: wataru.takase@kek.jp SCALA: A Framework for Graphical Operations for irods KEK E-mail: wataru.takase@kek.jp Adil Hasan University of Liverpool E-mail: adilhasan2@gmail.com Yoshimi Iida KEK E-mail: yoshimi.iida@kek.jp Francesca

More information

State of Michigan Records Management Services. Guide to E mail Storage Options

State of Michigan Records Management Services. Guide to E mail Storage Options State of Michigan Records Management Services Guide to E mail Storage Options E mail is a fast, efficient and cost effective means for communicating and sharing information. However, e mail software is

More information

Using Databases to Manage State Information for. Globally Distributed Data

Using Databases to Manage State Information for. Globally Distributed Data Storage Resource Broker Using Databases to Manage State Information for Globally Distributed Data Reagan W. Moore San Diego Supercomputer Center moore@sdsc.edu http://www.sdsc sdsc.edu/srb Abstract The

More information

Abstract. 1. Introduction. irods White Paper 1

Abstract. 1. Introduction. irods White Paper 1 irods: integrated Rule Oriented Data System White Paper Data Intensive Cyber Environments Group University of North Carolina at Chapel Hill University of California at San Diego September 2008 Abstract

More information

Cloud Archive & Long Term Preservation Challenges and Best Practices

Cloud Archive & Long Term Preservation Challenges and Best Practices Cloud Archive & Long Term Preservation Challenges and Best Practices Chad Thibodeau, Cleversafe, Inc. Sebastian Zangaro, HP Author: Chad Thibodeau, Cleversafe, Inc. Author: Sebastian Zangaro, HP SNIA Legal

More information

Symantec Enterprise Vault.cloud Overview

Symantec Enterprise Vault.cloud Overview Fact Sheet: Archiving and ediscovery Introduction The data explosion that has burdened corporations and governments across the globe for the past decade has become increasingly expensive and difficult

More information

ADDENDUM 5 TO APPENDIX 6 TO SCHEDULE 3.3

ADDENDUM 5 TO APPENDIX 6 TO SCHEDULE 3.3 ADDENDUM 5 TO APPENDIX 6 TO SCHEDULE 3.3 TO THE Statement of Technical Approach for Messaging Services Northrop Grumman s approach will improve available messaging services, collaboration and workflow

More information

ECM Migration Without Disrupting Your Business: Seven Steps to Effectively Move Your Documents

ECM Migration Without Disrupting Your Business: Seven Steps to Effectively Move Your Documents ECM Migration Without Disrupting Your Business: Seven Steps to Effectively Move Your Documents A White Paper by Zia Consulting, Inc. Planning your ECM migration is just as important as selecting and implementing

More information

Shibbolized irods (and why it matters)

Shibbolized irods (and why it matters) Shibbolized irods (and why it matters) 3 rd TERENA Storage Meeting, Dublin, February 12 th -13 th 2009 David Corney, for Jens Jensen, e-science centre, Rutherford Appleton Lab, UK Overview Introduction

More information

Data Grids. Lidan Wang April 5, 2007

Data Grids. Lidan Wang April 5, 2007 Data Grids Lidan Wang April 5, 2007 Outline Data-intensive applications Challenges in data access, integration and management in Grid setting Grid services for these data-intensive application Architectural

More information

Transition Guidelines: Managing legacy data and information. November 2013 v.1.0

Transition Guidelines: Managing legacy data and information. November 2013 v.1.0 Transition Guidelines: Managing legacy data and information November 2013 v.1.0 Document Control Document history Date Version No. Description Author October 2013 November 2013 0.1 Draft Department of

More information

CAREER TRACKS PHASE 1 UCSD Information Technology Family Function and Job Function Summary

CAREER TRACKS PHASE 1 UCSD Information Technology Family Function and Job Function Summary UCSD Applications Programming Involved in the development of server / OS / desktop / mobile applications and services including researching, designing, developing specifications for designing, writing,

More information

irods Technologies at UNC

irods Technologies at UNC irods Technologies at UNC E-iRODS: Enterprise irods at RENCI Presenter: Leesa Brieger leesa@renci.org SC12 irods Informational Reception 1! UNC Chapel Hill Investment in irods DICE and RENCI: research

More information

Capture Your Assets (CYA) Data Retention Policies in E&P

Capture Your Assets (CYA) Data Retention Policies in E&P 1 Capture Your Assets (CYA) Data Retention Policies in E&P By Suzi Hutchinson, Sr. Product Manager, Information Management, and Janet Hicks, Sr. Manager, Information Management Presented at Petroleum Network

More information

PASIG May 12, 2012. Jacob Farmer, CTO Cambridge Computer

PASIG May 12, 2012. Jacob Farmer, CTO Cambridge Computer Adding Intelligence to Conventional NAS and File Systems: Metadata, Backups, and Data Life Cycle Management PASIG May 12, 2012 Presented by: Jacob Farmer, CTO Cambridge Computer Copyright 2009-2011, Cambridge

More information

irods for Big Data Management in Research Driven Organizations Charles Schmitt CTO & Director of Informatics RENCI

irods for Big Data Management in Research Driven Organizations Charles Schmitt CTO & Director of Informatics RENCI irods for Big Data Management in Research Driven Organizations Charles Schmitt CTO & Director of Informatics RENCI Acknowledgements Presented work funded in part by grants from NIH, NSF, NARA, DHS, as

More information

Data Grid Landscape And Searching

Data Grid Landscape And Searching Or What is SRB Matrix? Data Grid Automation Arun Jagatheesan et al., University of California, San Diego VLDB Workshop on Data Management in Grids Trondheim, Norway, 2-3 September 2005 SDSC Storage Resource

More information

Concepts in Distributed Data Management or History of the DICE Group

Concepts in Distributed Data Management or History of the DICE Group Concepts in Distributed Data Management or History of the DICE Group Reagan W. Moore 1, Arcot Rajasekar 1, Michael Wan 3, Wayne Schroeder 2, Antoine de Torcy 1, Sheau- Yen Chen 2, Mike Conway 1, Hao Xu

More information

Managing and Maintaining Windows Server 2008 Servers

Managing and Maintaining Windows Server 2008 Servers Managing and Maintaining Windows Server 2008 Servers Course Number: 6430A Length: 5 Day(s) Certification Exam There are no exams associated with this course. Course Overview This five day instructor led

More information

DATA ARCHIVING. The first Step toward Managing the Information Lifecycle. Best practices for SAP ILM to improve performance, compliance and cost

DATA ARCHIVING. The first Step toward Managing the Information Lifecycle. Best practices for SAP ILM to improve performance, compliance and cost DATA ARCHIVING The first Step toward Managing the Information Lifecycle Best practices for SAP ILM to improve performance, compliance and cost 2010 Dolphin. West Chester, PA All rights are reserved, including

More information

TERRITORY RECORDS OFFICE BUSINESS SYSTEMS AND DIGITAL RECORDKEEPING FUNCTIONALITY ASSESSMENT TOOL

TERRITORY RECORDS OFFICE BUSINESS SYSTEMS AND DIGITAL RECORDKEEPING FUNCTIONALITY ASSESSMENT TOOL TERRITORY RECORDS OFFICE BUSINESS SYSTEMS AND DIGITAL RECORDKEEPING FUNCTIONALITY ASSESSMENT TOOL INTRODUCTION WHAT IS A RECORD? AS ISO 15489-2002 Records Management defines a record as information created,

More information

EMC NETWORKER AND DATADOMAIN

EMC NETWORKER AND DATADOMAIN EMC NETWORKER AND DATADOMAIN Capabilities, options and news Madis Pärn Senior Technology Consultant EMC madis.parn@emc.com 1 IT Pressures 2009 0.8 Zettabytes 2020 35.2 Zettabytes DATA DELUGE BUDGET DILEMMA

More information

Beyond the Data Lake

Beyond the Data Lake WHITE PAPER Beyond the Data Lake Managing Big Data for Value Creation In this white paper 1 The Data Lake Fallacy 2 Moving Beyond Data Lakes 3 A Big Data Warehouse Supports Strategy, Value Creation Beyond

More information

Archiving, Indexing and Accessing Web Materials: Solutions for large amounts of data

Archiving, Indexing and Accessing Web Materials: Solutions for large amounts of data Archiving, Indexing and Accessing Web Materials: Solutions for large amounts of data David Minor 1, Reagan Moore 2, Bing Zhu, Charles Cowart 4 1. (88)4-104 minor@sdsc.edu San Diego Supercomputer Center

More information

Geospatial Data and Storage Resource Broker Online GIS Integration in ESRI Environments with SRB MapServer and Centera.

Geospatial Data and Storage Resource Broker Online GIS Integration in ESRI Environments with SRB MapServer and Centera. Geospatial Data and Storage Resource Broker Online GIS Integration in ESRI Environments with SRB MapServer and Centera White Paper 2 Geospatial Data Access and Management, The SRB MapServer Table of Contents

More information

Cisco Discovery 3: Introducing Routing and Switching in the Enterprise 157.8 hours teaching time

Cisco Discovery 3: Introducing Routing and Switching in the Enterprise 157.8 hours teaching time Essential Curriculum Computer Networking II Cisco Discovery 3: Introducing Routing and Switching in the Enterprise 157.8 hours teaching time Chapter 1 Networking in the Enterprise-------------------------------------------------

More information

Nexus Professional Whitepaper. Repository Management: Stages of Adoption

Nexus Professional Whitepaper. Repository Management: Stages of Adoption Sonatype Nexus Professional Whitepaper Repository Management: Stages of Adoption Adopting Repository Management Best Practices SONATYPE www.sonatype.com sales@sonatype.com +1 301-684-8080 12501 Prosperity

More information

Enterprise Information Management Services Managing Your Company Data Along Its Lifecycle

Enterprise Information Management Services Managing Your Company Data Along Its Lifecycle SAP Solution in Detail SAP Services Enterprise Information Management Enterprise Information Management Services Managing Your Company Data Along Its Lifecycle Table of Contents 3 Quick Facts 4 Key Services

More information

7Seven Things You Need to Know About Long-Term Document Storage and Compliance

7Seven Things You Need to Know About Long-Term Document Storage and Compliance 7Seven Things You Need to Know About Long-Term Document Storage and Compliance Who Is Westbrook? Westbrook Technologies, based in Branford on the Connecticut coastline, is an innovative software company

More information

How To Use The Hitachi Content Archive Platform

How To Use The Hitachi Content Archive Platform O V E R V I E W Hitachi Content Archive Platform An Active Archive Solution Hitachi Data Systems Hitachi Content Archive Platform An Active Archive Solution As companies strive to better manage information

More information

CIP s Open Data & Data Management Guidelines and Procedures

CIP s Open Data & Data Management Guidelines and Procedures CIP s Open Data & Data Management Guidelines and Procedures 1.1 Scope The CIP Data Management Guidelines and Procedures aim to provide guidance and support throughout the Data Management Cycle to facilitate

More information

Collaborative SRB Data Federations

Collaborative SRB Data Federations WHITE PAPER Collaborative SRB Data Federations A Unified View for Heterogeneous High-Performance Computing INTRODUCTION This paper describes Storage Resource Broker (SRB): its architecture and capabilities

More information

Cisco Process Orchestrator Adapter for Cisco UCS Manager: Automate Enterprise IT Workflows

Cisco Process Orchestrator Adapter for Cisco UCS Manager: Automate Enterprise IT Workflows Solution Overview Cisco Process Orchestrator Adapter for Cisco UCS Manager: Automate Enterprise IT Workflows Cisco Unified Computing System and Cisco UCS Manager The Cisco Unified Computing System (UCS)

More information

Why enterprise data archiving is critical in a changing landscape

Why enterprise data archiving is critical in a changing landscape Why enterprise data archiving is critical in a changing landscape Ovum white paper for Informatica SUMMARY Catalyst Ovum view The most successful enterprises manage data as strategic asset. They have complete

More information

Pluggable Rule Engine

Pluggable Rule Engine Pluggable Rule Engine CurateGear2016 Terrell Russell, Ph.D. @terrellrussell Senior Data Scientist, irods Consortium Renaissance Computing Institute (RENCI), UNC-Chapel Hill 1 2 irods Consortium The irods

More information

What we do? Our services include:

What we do? Our services include: What we do? The next revolution in information technology is migration to what has been labeled as The Third Platform. Following the revolution that was brought about by the introduction of mainframe technology

More information

SOLUTION BRIEF KEY CONSIDERATIONS FOR BACKUP AND RECOVERY

SOLUTION BRIEF KEY CONSIDERATIONS FOR BACKUP AND RECOVERY SOLUTION BRIEF KEY CONSIDERATIONS FOR BACKUP AND RECOVERY Among the priorities for efficient storage management is an appropriate protection architecture. This paper will examine how to architect storage

More information

InstaFile. Complete Document management System

InstaFile. Complete Document management System InstaFile Complete Document management System Index : About InstaFile 1.1 What is InstaFile 1.2 How does it work 1.3 Where you can use InstaFile 1.4 Why only InstaFile InstaFile features and benefits Start

More information

Common Questions and Concerns About Documentum at NEF

Common Questions and Concerns About Documentum at NEF LES/NEF 220 W Broadway Suite B Hobbs, NM 88240 Documentum FAQ Common Questions and Concerns About Documentum at NEF Introduction...2 What is Documentum?...2 How does Documentum work?...2 How do I access

More information

Distributed File Systems An Overview. Nürnberg, 30.04.2014 Dr. Christian Boehme, GWDG

Distributed File Systems An Overview. Nürnberg, 30.04.2014 Dr. Christian Boehme, GWDG Distributed File Systems An Overview Nürnberg, 30.04.2014 Dr. Christian Boehme, GWDG Introduction A distributed file system allows shared, file based access without sharing disks History starts in 1960s

More information

Symantec Enterprise Vault.cloud Overview

Symantec Enterprise Vault.cloud Overview Fact Sheet: Archiving and ediscovery Introduction The data explosion that has burdened corporations and governments across the globe for the past decade has become increasingly expensive and difficult

More information

A Service for Data-Intensive Computations on Virtual Clusters

A Service for Data-Intensive Computations on Virtual Clusters A Service for Data-Intensive Computations on Virtual Clusters Executing Preservation Strategies at Scale Rainer Schmidt, Christian Sadilek, and Ross King rainer.schmidt@arcs.ac.at Planets Project Permanent

More information

Course Overview. What You Will Learn

Course Overview. What You Will Learn CA EDUCATION COURSE DESCRIPTION CA AppLogic r3.5: Maintain Cloud Apps for Operators and Build Cloud Apps for Architects Bundle 300 PRODUCT RELEASE CA AppLogic r3.5 COURSE TYPE, DURATION & COURSE CODE Instructor-led

More information

Data Management Resources at UNC: The Carolina Digital Repository and Dataverse Network

Data Management Resources at UNC: The Carolina Digital Repository and Dataverse Network Data Management Resources at UNC: The Carolina Digital Repository and Dataverse Network November 16, 2010 Data Management Short Course Series Sponsored by the Odum Institute and the UNC Libraries Campus

More information

Archival of Digital Assets.

Archival of Digital Assets. Archival of Digital Assets. John Burns, Archive Analytics Summary: We discuss the principles of archiving, best practice in both preserving the raw bits and the utility of those bits, and assert that bit-

More information

Data Services for Campus Researchers

Data Services for Campus Researchers Data Services for Campus Researchers Research Data Management Implementations Workshop March 13, 2013 Richard Moore SDSC Deputy Director & UCSD RCI Project Manager rlm@sdsc.edu SDSC Cloud: A Storage Paradigm

More information

Hitachi Cloud Service for Content Archiving. Delivered by Hitachi Data Systems

Hitachi Cloud Service for Content Archiving. Delivered by Hitachi Data Systems SOLUTION PROFILE Hitachi Cloud Service for Content Archiving, Delivered by Hitachi Data Systems Improve Efficiencies in Archiving of File and Content in the Enterprise Bridging enterprise IT infrastructure

More information

TRANSFORMING DATA PROTECTION

TRANSFORMING DATA PROTECTION TRANSFORMING DATA PROTECTION Moving from Reactive to Proactive Mark Galpin 1 Our Protection Strategy: Best Of Breed Performance LEADER HIGH-END STORAGE VMAX Low Service Level LEADER SCALE-OUT NAS STORAGE

More information

How To Manage Security On A Networked Computer System

How To Manage Security On A Networked Computer System Unified Security Reduce the Cost of Compliance Introduction In an effort to achieve a consistent and reliable security program, many organizations have adopted the standard as a key compliance strategy

More information

Benefits of upgrading to Enterprise Vault 11.0.1

Benefits of upgrading to Enterprise Vault 11.0.1 Benefits of upgrading to Enterprise Vault 11.0.1 Benefits of upgrading to Enterprise Vault 11.0.1 With the release of Symantec Enterprise Vault 11.0.1, there are several features that will allow your business

More information

A 15-Minute Guide to 15-MINUTE GUIDE

A 15-Minute Guide to 15-MINUTE GUIDE A 15-Minute Guide to Retention Management 15-MINUTE GUIDE Foreword For you as a business professional, time is a precious commodity. You spend much of your day distilling concepts, evaluating options,

More information

Release & Deployment Management

Release & Deployment Management 1. Does the tool facilitate the management of the full lifecycle of Release and Deployment Management? For example, planning, building, testing, quality assurance, scheduling and deployment? Comments:

More information

Protecting Official Records as Evidence in the Cloud Environment. Anne Thurston

Protecting Official Records as Evidence in the Cloud Environment. Anne Thurston Protecting Official Records as Evidence in the Cloud Environment Anne Thurston Introduction In a cloud computing environment, government records are held in virtual storage. A service provider looks after

More information

Harvard Library Preparing for a Trustworthy Repository Certification of Harvard Library s DRS.

Harvard Library Preparing for a Trustworthy Repository Certification of Harvard Library s DRS. National Digital Stewardship Residency - Boston Project Summaries 2015-16 Residency Harvard Library Preparing for a Trustworthy Repository Certification of Harvard Library s DRS. Harvard Library s Digital

More information

web archives & research collections

web archives & research collections forging the future, preserving the past: web archives & research collections Geoff Harder, Archive-it Partner Meetings BPE 2012, The task is to extract information out of noisy data, Knox says. It is a

More information

SSM6437 DESIGNING A WINDOWS SERVER 2008 APPLICATIONS INFRASTRUCTURE

SSM6437 DESIGNING A WINDOWS SERVER 2008 APPLICATIONS INFRASTRUCTURE SSM6437 DESIGNING A WINDOWS SERVER 2008 APPLICATIONS INFRASTRUCTURE Duration 5 Days Course Outline Module 1: Designing IIS Web Farms The students will learn the process of designing IIS Web Farms with

More information

ILM et Archivage Les solutions IBM

ILM et Archivage Les solutions IBM Information Management ILM et Archivage Les solutions IBM Dr. Christian ARNOUX Consultant Information Management IBM Suisse, Software Group 2007 IBM Corporation IBM Strategy for Enterprise Content Compliance

More information

Data Sheet: Archiving Symantec Enterprise Vault Discovery Accelerator Accelerate e-discovery and simplify review

Data Sheet: Archiving Symantec Enterprise Vault Discovery Accelerator Accelerate e-discovery and simplify review Accelerate e-discovery and simplify review Overview provides IT/Legal liaisons, investigators, lawyers, paralegals and HR professionals the ability to search, preserve and review information across the

More information

OSG PUBLIC STORAGE. Tanya Levshina

OSG PUBLIC STORAGE. Tanya Levshina PUBLIC STORAGE Tanya Levshina Motivations for Public Storage 2 data to use sites more easily LHC VOs have solved this problem (FTS, Phedex, LFC) Smaller VOs are still struggling with large data in a distributed

More information

Storage Virtualisation in the Cloud

Storage Virtualisation in the Cloud PRESENTATION TITLE GOES HERE Storage Virtualisation in the Cloud Bob Plumridge SNIA Europe Chair HDS Overview What is Storage Virtualisation? Cloud Storage Virtualisation Use Cases CDMI CDMI and CIMI 2

More information

irods at CC-IN2P3: managing petabytes of data

irods at CC-IN2P3: managing petabytes of data Centre de Calcul de l Institut National de Physique Nucléaire et de Physique des Particules irods at CC-IN2P3: managing petabytes of data Jean-Yves Nief Pascal Calvat Yonny Cardenas Quentin Le Boulc h

More information

5 CMDB GOOD PRACTICES

5 CMDB GOOD PRACTICES 5 CMDB GOOD PRACTICES - Preparing for Service Asset and Configuration Management Wade Palmer, Director of IT Services ii TABLE OF CONTENTS INTRODUCTION... 1 1. KEY CMDB ELEMENTS... 1 2. IT CHANGE MANAGEMENT

More information

MOVING TO THE NEXT-GENERATION MEDICAL INFORMATION CALL CENTER

MOVING TO THE NEXT-GENERATION MEDICAL INFORMATION CALL CENTER MOVING TO THE NEXT-GENERATION MEDICAL INFORMATION CALL CENTER Pharma companies are improving personalized relationships across more channels while cutting cost, complexity, and risk Increased competition

More information

Streamline Enterprise Records Management. Laserfiche Records Management Edition

Streamline Enterprise Records Management. Laserfiche Records Management Edition Laserfiche Records Management Edition Streamline Enterprise Records Management Controlling your organization s proliferating paper and electronic records can be demanding. How do you adhere to records

More information

MOVING THE CLINICAL ANALYTICAL ENVIRONMENT INTO THE CLOUD

MOVING THE CLINICAL ANALYTICAL ENVIRONMENT INTO THE CLOUD MOVING THE CLINICAL ANALYTICAL ENVIRONMENT INTO THE CLOUD STIJN ROGIERS, SENIOR INDUSTRY CONSULTANT, LIFE SCIENCES/HEALTH CARE (EMEA/AP) SANDEEP JUNEJA CONSULTING MANAGER (SSOD) AGENDA Move towards cloud

More information

Recordkeeping for Good Governance Toolkit. GUIDELINE 14: Digital Recordkeeping Choosing the Best Strategy

Recordkeeping for Good Governance Toolkit. GUIDELINE 14: Digital Recordkeeping Choosing the Best Strategy Recordkeeping for Good Governance Toolkit GUIDELINE 14: Digital Recordkeeping Choosing the Best Strategy i The original version of this guideline was prepared by the Pacific Regional Branch of the International

More information

HP Records Manager. A single solution for enterprise-scalable document and records management

HP Records Manager. A single solution for enterprise-scalable document and records management Data sheet HP Records Manager A single solution for enterprise-scalable document and records management HP Records Manager is a scalable electronic document and records management solution designed to

More information

Symantec Backup Exec 12.5 for Windows Servers. Quick Installation Guide

Symantec Backup Exec 12.5 for Windows Servers. Quick Installation Guide Symantec Backup Exec 12.5 for Windows Servers Quick Installation Guide 13897290 Installing Backup Exec This document includes the following topics: System requirements Before you install About the Backup

More information

Migrate from Exchange Public Folders to Business Productivity Online Standard Suite

Migrate from Exchange Public Folders to Business Productivity Online Standard Suite Migrate from Exchange Public Folders to Business Productivity Online Standard Suite White Paper Microsoft Corporation Published: July 2009 Information in this document, including URL and other Internet

More information

The Benefits of Archiving and Seven Questions You Should Always Ask

The Benefits of Archiving and Seven Questions You Should Always Ask ArkivumLimited R21 Langley Park Way Chippenham Wiltshire SN15 1GE UK +44 1249 405060 info@arkivum.com @Arkivum arkivum.com The Benefits of Archiving and Seven Questions You Should Whitepaper 1 / 6 Introduction

More information

Provide access control with innovative solutions from IBM.

Provide access control with innovative solutions from IBM. Security solutions To support your IT objectives Provide access control with innovative solutions from IBM. Highlights Help protect assets and information from unauthorized access and improve business

More information

Redefining Oracle Database Management

Redefining Oracle Database Management Redefining Oracle Database Management Actifio PAS Specification A Single Solution for Backup, Recovery, Disaster Recovery, Business Continuity and Rapid Application Development for Oracle. MAY, 2013 Contents

More information

BEA AquaLogic Integrator Agile integration for the Enterprise Build, Connect, Re-use

BEA AquaLogic Integrator Agile integration for the Enterprise Build, Connect, Re-use Product Data Sheet BEA AquaLogic Integrator Agile integration for the Enterprise Build, Connect, Re-use BEA AquaLogic Integrator delivers the best way for IT to integrate, deploy, connect and manage process-driven

More information