Copying Archives. Ngoni Munyaradzi (MNYNGO001)

Size: px
Start display at page:

Download "Copying Archives. Ngoni Munyaradzi (MNYNGO001) Email: mnyngo001@uct.ac.za"

Transcription

1 Copying Archives Ngoni Munyaradzi (MNYNGO001) Abstract This paper focuses on the problem of trying to define a common exchange interface. That will be used to implement repository-to-repository transfers. The significance of addressing the issue is that it provides flexibility of manipulating the digital content stored in repositories in the manner archivists choose. Repository-to-repository transfers can then be used to implement preservation and migration of digital content. The paper discusses the current technologies and related work that has been done so far. Key Words: metadata, interoperability, Digital Libraries Introduction and Motivation The Copying Archives project is about copying digital content to and from Digital Libraries Systems[2]. Digital Library Systems store digital content of various forms e.g. journals or video content. To achieve replication of digital content between digital libraries, a common exchange interface has to be defined. The common exchange interface will allow for the heterogeneous software repositories to be interoperable. The project will focus mostly on implementing interoperability between the LOCKSS [13, 14], DSpace [15] and EPrints [5] software repositories. There are other popular software repositories in use in the Digital Libraries community, like Greenstone [18] and Fedora, to name a few. For the purposes of this project, LOCKSS, DSpace and EPrints will be implemented. In the next section, current technologies in Digital Libraries that are used in the Copying Archives project will be discussed. Summary of Technologies LOCKSS (Lots of Copies Keep Stuff Safe) [13, 14] is a digital preservation system that collects content from a target Web site using a crawler and keeps this data in the format provided by the publisher. The system continually compares the content it has stored with other LOCKSS boxes, and repairs any differences; this is achieved by a voting system [8] it implements. The system acts as a Web proxy, providing browsers with access to the publisher s content. The system also supports content migration in the case that the format has become obsolete. LOCKSS is an OAIS compliant system [7]. In a LOCKSS network, there are multiple caches and these are decentralised; each cache contains the same content as the other caches on the network. The ability for the same content to be stored in multiple machines ensures that preservation of digital content is achieved, in the case that a cache fails. Page 1

2 DSpace [15] is an open source software repository that adheres to OAI and OpenURL compliancy. It allows users to capture and describe digital works using a custom workflow process. DSpace ensures long-term preserving of data and distribution of institution s digital works over the Web. The DSpace system provides a way to manage these research materials and publications [15]. It is mostly used in institutions to store their scholarly articles, but its use can be extended to multiple disciplines. Dspace archives a large variety of types of content such as articles, datasets, images, audio and video files, working papers and drawings. Dspace supports the Dublin Core metadata format [17]. Eprints [3] is a tool that is used to manage the archiving or research in the form of books, posters, or scholarly articles. It purpose is not to provide a long-term archiving solution that ensures that material will be readable and accessible through technology changes. Instead it gives institutions a means to collect, store and provide Web access to material. It is a free open source package, that is Open Archives Initiative (OAI)-compliant, which makes it accessible to cross-archive searching. Eprints comes with a user interface that can be customized. It also supports any metadata format. Preservation Digital preservation is the management of information over time. The set of processes and activities that ensure continued access to information and all kinds of records, scientific and cultural heritage. There has been little effort put into addressing the need for digital preservation [16]. Some people are arguing that this is an issue that should not be prioritised at the moment. The argument is that any system that has OAI compliance implements preservation. Digital preservation is important because software systems are susceptible to failure and becoming obsolete, hence this needs to be addressed. The Open Archival Information System [7] reference model was defined as a standard for digital preservation. The reference model provides a framework for understanding and increased awareness of archival concepts. The LOCKSS system was designed to ensure preservation of digital content. The Victorian Electronic Record Strategy (VERS) [16], has been developed to practically solve the problems of ensuring digital preservation. VERS designs a digital format, that preserves information indefinitely. The project has identified what it views might be keys to long term preservation of digital information. Encapsulation storing the information in a single location, having been enclosed with descriptive metadata. Self documentation being able to understand the information without needing references to other documentation. Self sufficiency minimising dependency of information. Content documentation descriptive information stated to allow other users to understand usability of content. Standardisation the digital information should be stored in standard formats e.g. METS [3], DIDL [1], MPEG [1]. Page 2

3 It is important to note that the LOCKSS system addresses the issue of preservation comprehensively, through the voting and polling system [9] it implements across the LOCKSS network. Hence, integrating the functionality of the former, DSpace and EPrints, will hopefully add to the current preservation techniques [16]. The next section discusses the role of preservation metadata. Metadata PREMIS[19] has done research into the use of metadata for preservation, Preservation metadata is defined as the information a repository uses to support the digital preservation process. PREMIS focuses on metadata supporting the functions of maintaining viability, renderability, understandability and authenticity. The adoption rate of PREMIS is proved to be generally low [19]. PREMIS has a comprehensive metadata framework [19] description in their data dictionary. PREMIS addresses the issues of implementation, sustainability (through identifying the core metadata elements), metadata creation and capture (human agency vs. automatic capture) and interoperability (support reuse of existing metadata). Interoperability In the context of the Copying Archives project, interoperability is defined as being able to export and ingest digital content with LOCKSS, DSpace and EPrints. The exchanged information must remain in a valid format once ingested in one of the software repositories. Validity of data is important because this ensures authenticity, and that all data has been copied across. Efforts have been made to make heterogeneous repositories interoperable, and one such project is the Open Archives Initiative - Protocol for Metadata Harvesting (OAI-PMH)[6]. This provides an application-independent interoperability framework based on metadata harvesting. A possible way to achieving transfer of digital content in heterogeneous repositories would be through the OAI-PMH framework. It requires use of the Dublin Core [17] metadata standard, which allows flexibility in defining the required fields in metadata content. The next section discusses possible issues of data integrity in wanting to design the common exchange interface. Issues in Data Integration In wanting to develop a common exchange interface between heterogeneous repositories, particular issues need to be considered and ways of solving these established: Heterogeneous infrastructure [12] there are a number of heterogeneous repositories and most of these store data in different formats. To extract data from the repositories, there is a need to manipulate export tool functionality of each repository. Insufficient data quality [12] - every repository is designed for a specific purpose, and this requires a certain level of detail of data stored and accuracy. Consistency in the Page 3

4 data is hard to obtain as the various platforms require different levels of metadata detail. Semantic gaps [12] contradictory fields need to be resolved after data integration of separate repositories has be done. Semantic mismatch is the cause of contradictory entries, as the fields for data are used for different purposes in each individual repository. Language support [11] the language supported in searching or retrieving data from the repository must be the same for all Digital Library Systems. Copyright / rights management [11] - particular information may be protected under copyright law and permission has to be obtained to transfer it from publishers. Naming and identifier [11] - Digital Library Systems name objects in different ways and unique identifiers are important to avoid naming conflicts. Related Research Efforts have been made in the digital libraries community to achieve and interoperability amongst heterogeneous repositories, for example a transfer from Greenstone [18] to DSpace [14] and vice versa. The two repositories share some similarity [18] and have differences [18]- the differences arise because the intended goal of both systems differ. Both these architectures implement the OAI-PMH [6] protocol; this would give an initial perspective that transfer of digital library content could be achieved through this common interoperability framework. However, through testing it has been shown that there are challenges associated with using the framework [6]; the level of integration would only be broad and not deep. Transfer of metadata would be achieved successfully but not the actual object itself due the object identifier element. This is because the OAI-PMH protocol does not support digital object migration. Integration can possibly be done through the METS [3] standard. METS would offer a deeper level of integration via its combined metadata and document container approach. Both Greenstone and DSpace have the capability to export through this format. The METS standard is flexible, but has a disadvantage that different systems might implement structures that might not map to one another. DSpace supports a hierarchical form of metadata while Greenstone metadata is flat, and to achieve integration these differences need to be reconciled. StoneD [18] implements transfer of objects and metadata through import/export, which allows greater level of integration than the previous two methods mentioned. This allows Greenstone to access documents in DSpace at the database level. Another possible alternative to accessing data at the database level is to do it at a service level. Page 4

5 The Towards Interoperable Preservation Repositories (TIPR) [8] projects aims to do testing, refining of transfer mechanisms, and repository to repository transfer. TIRP uses 3 basic Information Packages defined in the OAIS standard to implement repository-torepository transfer. The 3 information packages [8] are, Archival Information Package (AIP), Submission Information Package (SIP)and Dissemination Information Package (DIP). The details of the four projects on the TIPR experiments were published in a special edition of D-Lib Magazine in Results obtained showed that there are many different ways in which the exchange of information packages could fail [8]. TIPR noted that a key requirement for any transfer format is that it carries rich technical and historical information supplied by the source repository. Not all repositories will be able to interpret this information, but the originating repository should communicate the data. METS is the primary schema, and other schemas are used within METS to extend it. Although METS was used in the four projects, no repository could take a package produced by another project and ingest it without substantial transformation. TIPR have defined a Repository exchange Package (RXP) [8], for exchanging information. The RXP includes both package-level and representation-level PREMIS information. A DIP[8] from a repository is transformed into a RXP, which can be ingested by any other repositories. The research done by the two projects StoneD and TIPR serve as a starting point of identifying which standard and techniques are best in implementing repository-torepository transfers. Summary of competing methods/critical Comparison In every digital library system there is an overarching framework within which everything is held. Two standards have emerged, which aim to fulfil this function: the Metadata Encoding and Transmission Standard (METS) [3] and Digital Item Declaration Language (DIDL) [1]. The two standards have frameworks within which descriptive, administrative and structural metadata are defined. Both have mechanisms for recording data of the individual files that define the digital library object, plus methods of recording information on how the data should be rendered when the user receives them e.g. detailing information about what software should be used on the object. Both provide mechanisms for recording the internal structure of a digital object, in a nested hierarchy form, so that it makes sense to the user. The way in which these two standards implement the functions is different. METS is arranged according to the types of its constituent metadata (descriptive, administrative and structural), each of which is located in different sections of its overall structure. DIDL collates all types for a given component together. For example, an image file will have its descriptive and administrative metadata located together. METS employs a separate structural map to act as the centre for links to the metadata, while DIDL embeds everything into a single hierarchy, representing the structure of the digital object. The METS standard proves to be the most flexible to use, hence its wide adoption. The above-mentioned content packaging and preservation techniques, are all defined in XML, schemas and DTDs. Page 5

6 Conclusion A number of the papers discussed mostly point out that research into repository-torepository transfer is still ongoing. As mentioned in the discussed papers, there has been a growing acknowledgment for the need to design repositories that support preservation, to avoid data loss. Progress has been made in identifying some of the important transmission packages that can be used e.g. VERS, METS and PREMIS are being used together. References 1. Bekaert J, Hochstenbach P, and Van de Sompel H. Using MPEG-21 DIDL to represent complex digital objects in the los alamos national laboratory digital library. D-Lib Magazine 2003; 9: Borgman CL. What are digital libraries? competing visions. Information processing and management 1999; 35: Cundiff MV. An introduction to the metadata encoding and transmission standard (METS). Library Hi Tech 2004; 22: Gartner R. Metadata for digital libraries: State of the art and future directions. JISC TechWatch Report 2008;. 5. Gutteridge C. GNU EPrints 2 overview. 2002;. 6. Lagoze C, Van de Sompel H. The open archives initiative: Building a low-barrier interoperability framework. 2001; Lavoie BF. The open archival information system reference model: Introductory guide. Microform & imaging review 2004; 33: Magazine DL. Repository to repository transfer of enriched archival information packages. D-Lib Magazine 2008; 14: Maniatis P, Rosenthal DSH, Roussopoulos M, Baker M, Giuli T, and Muliadi Y. Preserving peer replicas by rate-limited sampled voting. ACM SIGOPS Operating Systems Review 2003; 37: Paepcke A, Chang CCK, Winograd T, and García-Molina H. Interoperability for digital libraries worldwide. Commun ACM 1998; 41: Pinfield S, James H. The digital preservation of e-prints. D-Lib Magazine 2003; 9: Ramler R, Wolfmaier K. Issues and effort in integrating data from heterogeneous software repositories and corporate databases. 2008; Reich V, Rosenthal DSH. LOCKSS: A permanent web publishing and access system. Sun Microsystems Laboratories The First Ten Years 2001;. 14. Reich V, Rosenthal DSH. Lockss (lots of copies keep stuff safe). New Review of Academic Librarianship 2000; 6: Smith MK, Barton M, Bass M, Branschofsky M, McClellan G, Stuve D, Tansley R, and Walker JH. An open source dynamic digital repository. D-Lib Magazine 2003; 9: Page 6

7 16. Waugh A, Wilkinson R, Hills B, and Dell'Oro J. Preserving digital information forever. 2000; Weibel S. The dublin core: A simple content description model for electronic resources. Bulletin of the American Society for Information Science and Technology 2005; 24: Witten IH, Bainbridge D, Tansley R, Huang CY, Don K, and Hamilton NZ. A bridge between greenstone and DSpace. D-Lib Magazine 2005; 11: Woodyard-Robinson D, Ltd WRH. Implementing the PREMIS data dictionary: A survey of approaches. 2007;. Page 7

Implementing an Institutional Repository for Digital Archive Communities: Experiences from National Taiwan University

Implementing an Institutional Repository for Digital Archive Communities: Experiences from National Taiwan University Implementing an Institutional Repository for Digital Archive Communities: Experiences from National Taiwan University Chiung-min Tsai Department of Library and Information Science, National Taiwan University

More information

Ex Libris Rosetta: A Digital Preservation System Product Description

Ex Libris Rosetta: A Digital Preservation System Product Description Ex Libris Rosetta: A Digital Preservation System Product Description CONFIDENTIAL INFORMATION The information herein is the property of Ex Libris Ltd. or its affiliates and any misuse or abuse will result

More information

Archiving Systems. Uwe M. Borghoff Universität der Bundeswehr München Fakultät für Informatik Institut für Softwaretechnologie. uwe.borghoff@unibw.

Archiving Systems. Uwe M. Borghoff Universität der Bundeswehr München Fakultät für Informatik Institut für Softwaretechnologie. uwe.borghoff@unibw. Archiving Systems Uwe M. Borghoff Universität der Bundeswehr München Fakultät für Informatik Institut für Softwaretechnologie uwe.borghoff@unibw.de Decision Process Reference Models Technologies Use Cases

More information

DAR: A Digital Assets Repository for Library Collections

DAR: A Digital Assets Repository for Library Collections DAR: A Digital Assets Repository for Library Collections Iman Saleh 1, Noha Adly 1,2, Magdy Nagi 1,2 1 Bibliotheca Alexandrina, El Shatby 21526, Alexandria, Egypt {iman.saleh, noha.adly, magdy.nagi}@bibalex.org

More information

Notes about possible technical criteria for evaluating institutional repository (IR) software

Notes about possible technical criteria for evaluating institutional repository (IR) software Notes about possible technical criteria for evaluating institutional repository (IR) software Introduction Andy Powell UKOLN, University of Bath December 2005 This document attempts to identify some of

More information

DIGITAL STEWARDSHIP SUPPLEMENTARY INFORMATION FORM

DIGITAL STEWARDSHIP SUPPLEMENTARY INFORMATION FORM DIGITAL STEWARDSHIP SUPPLEMENTARY INFORMATION FORM Introduction The Institute of Museum and Library Services (IMLS) is committed to expanding public access to federally funded research, data, software,

More information

B SVF - Bavaria Long Term Preservation

B SVF - Bavaria Long Term Preservation Klaus Kempf Long Term Preservation: Needs and Activities at the Bavarian State Library (BSB) Agenda BSB s Institutional Profile Munich Digitization Center (MDZ) Current Responsibilities, Milestones, Activities

More information

Analysing log files. Yue Mao (mxxyue002@uct.ac.za) Supervisor: Dr Hussein Suleman, Kyle Williams, Gina Paihama. University of Cape Town

Analysing log files. Yue Mao (mxxyue002@uct.ac.za) Supervisor: Dr Hussein Suleman, Kyle Williams, Gina Paihama. University of Cape Town Analysing log files Yue Mao (mxxyue002@uct.ac.za) Supervisor: Dr Hussein Suleman, Kyle Williams, Gina Paihama University of Cape Town ABSTRACT A digital repository stores a collection of digital objects

More information

Col*Fusion: Not Just Jet Another Data Repository

Col*Fusion: Not Just Jet Another Data Repository Col*Fusion: Not Just Jet Another Data Repository Evgeny Karataev 1 and Vladimir Zadorozhny 1 1 School of Information Sciences, University of Pittsburgh Abstract In this poster we introduce Col*Fusion a

More information

Applying the OAIS standard to CCLRC s British Atmospheric Data Centre and the Atlas Petabyte Storage Service

Applying the OAIS standard to CCLRC s British Atmospheric Data Centre and the Atlas Petabyte Storage Service Applying the OAIS standard to CCLRC s British Atmospheric Centre and the Atlas Petabyte Storage Service Corney, D.R., De Vere, M., Folkes, T., Giaretta, D., Kleese van Dam, K., Lawrence, B. N., Pepler,

More information

Building integration environment based on OAI-PMH protocol. Novytskyi Oleksandr Institute of Software Systems NAS Ukraine Alex@zu.edu.

Building integration environment based on OAI-PMH protocol. Novytskyi Oleksandr Institute of Software Systems NAS Ukraine Alex@zu.edu. Building integration environment based on OAI-PMH protocol Novytskyi Oleksandr Institute of Software Systems NAS Ukraine Alex@zu.edu.ua Roadmap What is OAI-PMH? Requirements for infrastructure Step by

More information

METADATA STANDARDS AND GUIDELINES RELEVANT TO DIGITAL AUDIO

METADATA STANDARDS AND GUIDELINES RELEVANT TO DIGITAL AUDIO This chart provides a quick overview of metadata standards and guidelines that are in use with digital audio, including metadata used to describe the content of the files; metadata used to describe properties

More information

Response to Invitation to Tender: requirements and feasibility study on preservation of e-prints

Response to Invitation to Tender: requirements and feasibility study on preservation of e-prints Response to Invitation to Tender: requirements and feasibility study on preservation of e-prints A proposal to the JISC from the Arts and Humanities Data Service and the University of Nottingham, Project

More information

DSpace: An Institutional Repository from the MIT Libraries and Hewlett Packard Laboratories

DSpace: An Institutional Repository from the MIT Libraries and Hewlett Packard Laboratories DSpace: An Institutional Repository from the MIT Libraries and Hewlett Packard Laboratories MacKenzie Smith, Associate Director for Technology Massachusetts Institute of Technology Libraries, Cambridge,

More information

Job description. Purpose. Key Tasks. Job Title Branch Business Group Reporting to Location Duration Salary Range

Job description. Purpose. Key Tasks. Job Title Branch Business Group Reporting to Location Duration Salary Range Job description Job Title Branch Business Group Reporting to Location Duration Salary Range Digital Preservation Technical Specialist National Library of New Zealand Information, Knowledge and Systems

More information

Towards an architecture for open archive networks in Agricultural Sciences and Technology

Towards an architecture for open archive networks in Agricultural Sciences and Technology Towards an architecture for open archive networks in Agricultural Sciences and Technology Imma Subirats, Irene Onyancha, Gauri Salokhe, Johannes Keizer Food and Agriculture Organization of the United Nations,

More information

ADRI. Digital Record Export Standard. ADRI-2007-01-v1.0. ADRI Submission Information Package (ASIP)

ADRI. Digital Record Export Standard. ADRI-2007-01-v1.0. ADRI Submission Information Package (ASIP) ADRI Digital Record Export Standard ADRI Submission Information Package (ASIP) ADRI-2007-01-v1.0 Version 1.0 31 July 2007 Digital Record Export Standard 2 Copyright 2007, Further copies of this document

More information

DIGITAL ARCHIVES & PRESERVATION SYSTEMS

DIGITAL ARCHIVES & PRESERVATION SYSTEMS DIGITAL ARCHIVES & PRESERVATION SYSTEMS Part 1 Overview (part 1 of 7) Kari R. Smith, MIT Institute Archives Session Overview Digital archives and digital preservation systems. These open source tools are

More information

The challenges of becoming a Trusted Digital Repository

The challenges of becoming a Trusted Digital Repository The challenges of becoming a Trusted Digital Repository Annemieke de Jong is Preservation Officer at the Netherlands Institute for Sound and Vision (NISV) in Hilversum. She is responsible for setting out

More information

Citebase Search: Autonomous Citation Database for e-print Archives

Citebase Search: Autonomous Citation Database for e-print Archives Citebase Search: Autonomous Citation Database for e-print Archives Tim Brody Intelligence, Agents, Multimedia Group University of Southampton Abstract Citebase is a culmination

More information

DIGITAL PRESERVATION AT THE U.S. GOVERNMENT PRINTING OFFICE: WHITE PAPER. Version 2.0. 9 July 2008 UNITED STATES GOVERNMENT PRINTING OFFICE

DIGITAL PRESERVATION AT THE U.S. GOVERNMENT PRINTING OFFICE: WHITE PAPER. Version 2.0. 9 July 2008 UNITED STATES GOVERNMENT PRINTING OFFICE DIGITAL PRESERVATION AT THE U.S. GOVERNMENT PRINTING OFFICE: WHITE PAPER Version 2.0 9 July 2008 Record of Changes Version Description Of Change Revision Date Author Number 1.0 Baseline Document July 12,

More information

MSc Proposal DIGITAL LIBRARIES LABORATORY DEPARTMENT OF COMPUTER SCIENCE. STUDENT:... signed on... Stefano Rivera <stefano@rivera.co.

MSc Proposal DIGITAL LIBRARIES LABORATORY DEPARTMENT OF COMPUTER SCIENCE. STUDENT:... signed on... Stefano Rivera <stefano@rivera.co. DIGITAL LIBRARIES LABORATORY DEPARTMENT OF COMPUTER SCIENCE MSc Proposal STUDENT:.............................. signed on............... Stefano Rivera SUPERVISOR:............................

More information

AN INNOVATIVE INTEGRATED SYSTEM FOR EDITORIAL PROCESSES MANAGEMENT: THE CASE OF FIRENZE UNIVERSITY PRESS

AN INNOVATIVE INTEGRATED SYSTEM FOR EDITORIAL PROCESSES MANAGEMENT: THE CASE OF FIRENZE UNIVERSITY PRESS AN INNOVATIVE INTEGRATED SYSTEM FOR EDITORIAL PROCESSES MANAGEMENT: THE CASE OF FIRENZE UNIVERSITY PRESS BOLLINI, ANDREA 1 ; COTONESCHI, PATRIZIA 2 ; FARSETTI, ANTONELLA 2 ; MINORE, SEBASTIANA 2 ; MORNATI,

More information

Invenio: A Modern Digital Library for Grey Literature

Invenio: A Modern Digital Library for Grey Literature Invenio: A Modern Digital Library for Grey Literature Jérôme Caffaro, CERN Samuele Kaplun, CERN November 25, 2010 Abstract Grey literature has historically played a key role for researchers in the field

More information

How To Manage Your Digital Assets On A Computer Or Tablet Device

How To Manage Your Digital Assets On A Computer Or Tablet Device In This Presentation: What are DAMS? Terms Why use DAMS? DAMS vs. CMS How do DAMS work? Key functions of DAMS DAMS and records management DAMS and DIRKS Examples of DAMS Questions Resources What are DAMS?

More information

DA-NRW: a distributed architecture for long-term preservation

DA-NRW: a distributed architecture for long-term preservation DA-NRW: a distributed architecture for long-term preservation Manfred Thaller manfred.thaller@uni-koeln.de, Sebastian Cuy sebastian.cuy@uni-koeln.de, Jens Peters jens.peters@uni-koeln.de, Daniel de Oliveira

More information

SHared Access Research Ecosystem (SHARE)

SHared Access Research Ecosystem (SHARE) SHared Access Research Ecosystem (SHARE) June 7, 2013 DRAFT Association of American Universities (AAU) Association of Public and Land-grant Universities (APLU) Association of Research Libraries (ARL) This

More information

2009 ikeep Ltd, Morgenstrasse 129, CH-3018 Bern, Switzerland (www.ikeep.com, info@ikeep.com)

2009 ikeep Ltd, Morgenstrasse 129, CH-3018 Bern, Switzerland (www.ikeep.com, info@ikeep.com) CSP CHRONOS Compliance statement for ISO 14721:2003 (Open Archival Information System Reference Model) 2009 ikeep Ltd, Morgenstrasse 129, CH-3018 Bern, Switzerland (www.ikeep.com, info@ikeep.com) The international

More information

NTU-IR: An Institutional Repository for Nanyang Technological University using DSpace

NTU-IR: An Institutional Repository for Nanyang Technological University using DSpace Abrizah Abdullah, et al. (Eds.): ICOLIS 2007, Kuala Lumpur: LISU, FCSIT, 2007: pp 103-108 NTU-IR: An Institutional Repository for Nanyang Technological University using DSpace Jayan C Kurian 1, Dion Hoe-Lian

More information

The Rutgers Workflow Management System. Workflow Management System Defined. The New Jersey Digital Highway

The Rutgers Workflow Management System. Workflow Management System Defined. The New Jersey Digital Highway The Rutgers Workflow Management System Mary Beth Weber Cataloging and Metadata Services Rutgers University Libraries Presented at the 2007 LITA National Forum Denver, Colorado Workflow Management System

More information

Interagency Science Working Group. National Archives and Records Administration

Interagency Science Working Group. National Archives and Records Administration Interagency Science Working Group 1 National Archives and Records Administration Establishing Trustworthy Digital Repositories: A Discussion Guide Based on the ISO Open Archival Information System (OAIS)

More information

How To Manage A Digital Library

How To Manage A Digital Library A Study on the Open Source Digital Library Software s: Special Reference to DSpace, EPrints and Greenstone Shahkar Tramboo Department of Library and Information Science University of Kashmir Srinagar Humma

More information

Efficient, Automatic Web Resource Harvesting

Efficient, Automatic Web Resource Harvesting Efficient, Automatic Web Resource Harvesting Michael L. Nelson, Joan A. Smith and Ignacio Garcia del Campo Old Dominion University Computer Science Dept Norfolk VA 23529 USA {mln, jsmit, dgarcia}@cs.odu.edu

More information

CERN Document Server

CERN Document Server CERN Document Server Document Management System for Grey Literature in Networked Environment Martin Vesely CERN Geneva, Switzerland GL5, December 4-5, 2003 Amsterdam, The Netherlands Overview Searching

More information

ELAG 2006 Paper: Long-term electronic archiving as part of a digital library solution an overview of products

ELAG 2006 Paper: Long-term electronic archiving as part of a digital library solution an overview of products ELAG 2006 Paper: Long-term electronic archiving as part of a digital library solution an overview of products Johan van Halm, Library Consultant, Amersfoort (NL) 1. 1. Introduction Long-term electronic

More information

infokit JISC infonet is a JISC Advance Service

infokit JISC infonet is a JISC Advance Service This infokit reflects the increasing use of repositories using the documentation, guidance and expertise built up during the Repositories Support Project (RSP). This has been augmented by Lou McGill. infokit

More information

Preservation Action: What, how and when? Hilde van Wijngaarden Head, Digital Preservation Department National Library of the Netherlands

Preservation Action: What, how and when? Hilde van Wijngaarden Head, Digital Preservation Department National Library of the Netherlands : What, how and when? Hilde van Wijngaarden Head, Digital Preservation Department National Library of the Netherlands What is preservation action? Execution of a strategy to regain or improve access to

More information

Building An Institutional Repository With DSpace

Building An Institutional Repository With DSpace 102 PLANNER - 2008 Building An Institutional Repository With DSpace Juli Thakuria Abstract Paper deals with open source institutional repository software specially DSpace. After defining the terms, it

More information

Database Preservation Toolkit: a flexible tool to normalize and give access to databases

Database Preservation Toolkit: a flexible tool to normalize and give access to databases Database Preservation Toolkit: a flexible tool to normalize and give access to databases José Carlos Ramalho University of Minho jcr@di.uminho.pt Luis Faria KEEP SOLUTIONS Lda lfaria@keep.pt Miguel Coutada

More information

Long-term preservation activities of the Bavarian State Library

Long-term preservation activities of the Bavarian State Library Long-term preservation activities of the Bavarian State Library Latest challenges and developments aêk=qüçã~ë=tçäñjhäçëíéêã~åå=== aáöáí~ä=iáäê~êó=aéé~êíãéåí g~åì~êó OSI=OMNM The Bavarian State Library

More information

Contrasting metadata quality processes and coverage in agriculture-related repositories: an experience report

Contrasting metadata quality processes and coverage in agriculture-related repositories: an experience report Contrasting metadata quality processes and coverage in agriculture-related repositories: an experience report P. Šimek 1, G. Adamides 2, D. Le Henaff 3, I. A. Rasmussen 4, M. A. Sicilia 5 and G. Waksman

More information

James Hardiman Library. Digital Scholarship Enablement Strategy

James Hardiman Library. Digital Scholarship Enablement Strategy James Hardiman Library Digital Scholarship Enablement Strategy This document outlines the James Hardiman Library s strategy to enable digital scholarship at NUI Galway. The strategy envisages the development

More information

Archives Ready To the AIPs Transmission. PREMIS Implementation Fair. Reminding the ipres2010 Presentation

Archives Ready To the AIPs Transmission. PREMIS Implementation Fair. Reminding the ipres2010 Presentation FONDAZIONE RINASCIMENTO DIGITALE Foundation promoted by Ente Cassa di Risparmio of Florence 7th International Conference on Preservation of Digital Objects (ipres2010) September 19-24, 2010, Vienna, Austria

More information

Long-term archiving and preservation planning

Long-term archiving and preservation planning Long-term archiving and preservation planning Workflow in digital preservation Hilde van Wijngaarden Head, Digital Preservation Department National Library of the Netherlands The Challenge: Long-term Preservation

More information

System Requirements for Archiving Electronic Records PROS 99/007 Specification 1. Public Record Office Victoria

System Requirements for Archiving Electronic Records PROS 99/007 Specification 1. Public Record Office Victoria System Requirements for Archiving Electronic Records PROS 99/007 Specification 1 Public Record Office Victoria Version 1.0 April 2000 PROS 99/007 Specification 1: System Requirements for Archiving Electronic

More information

Bradford Scholars Digital Preservation Policy

Bradford Scholars Digital Preservation Policy DIGITAL PRESERVATION The value of the research outputs produced by staff and research students at the University of Bradford cannot be over emphasised in demonstrating the scientific, societal and economic

More information

Data Publication and Paradigm Mapping Solutions

Data Publication and Paradigm Mapping Solutions British Library Difficult Data Meeting December 3 2012 Mapping the data publication paradigm onto the operations of the British Oceanographic Data Centre Roy Lowry British Oceanographic Data Centre Summary

More information

AComparativeStudyofPlatformsforResearch Data Management: Interoperability, Metadata Capabilities and Integration Potential

AComparativeStudyofPlatformsforResearch Data Management: Interoperability, Metadata Capabilities and Integration Potential AComparativeStudyofPlatformsforResearch Data Management: Interoperability, Metadata Capabilities and Integration Potential Ricardo Carvalho Amorim 1,JoãoAguiarCastro 1, João Rocha da Silva 1,andCristinaRibeiro

More information

MultiMimsy database extractions and OAI repositories at the Museum of London

MultiMimsy database extractions and OAI repositories at the Museum of London MultiMimsy database extractions and OAI repositories at the Museum of London Mia Ridge Museum Systems Team Museum of London mridge@museumoflondon.org.uk Scope Extractions from the MultiMimsy 2000/MultiMimsy

More information

Federating DSpace. Jim Rutherford Media and Information Systems Group. HP Labs Bristol, UK

Federating DSpace. Jim Rutherford Media and Information Systems Group. HP Labs Bristol, UK Federating DSpace Jim Rutherford Media and Information Systems Group HP Labs Bristol, UK 2006 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice

More information

Dynamic Web File Format Transformations with Grace

Dynamic Web File Format Transformations with Grace Dynamic Web File Format Transformations with Grace Daniel S. Swaney, Frank McCown, and Michael L. Nelson Old Dominion University Computer Science Department Norfolk, VA 23529 USA {dswaney,fmccown,mln}@cs.odu.edu

More information

LOCKSS Audit Report November 2007

LOCKSS Audit Report November 2007 Center for Research Libraries Auditing and Certification of Digital Archives Project LOCKSS Audit Report November 2007 Report prepared by Robin Dale, with contributions by Bernard Reilly and Marie Waltz.

More information

The Data Management Plan with. Dataverse. Mercè Crosas, Ph.D. Director of Product Development

The Data Management Plan with. Dataverse. Mercè Crosas, Ph.D. Director of Product Development The Data Management Plan with Dataverse Mercè Crosas, Ph.D. Director of Product Development The Dataverse The Data Management Plan The Data Management Plan with Dataverse The Dataverse The Data Management

More information

Carl Lagoze Senior Research Associate Cornell Computing and Information Science 301 College Ave. Ithaca, NY USA lagoze@cs.cornell.

Carl Lagoze Senior Research Associate Cornell Computing and Information Science 301 College Ave. Ithaca, NY USA lagoze@cs.cornell. A Proposal to the Andrew W. Mellon Foundations Title Repositories Interoperability Framework: Augmenting Interoperability across Scholarly Repositories Period October 1, 2006 September 30, 2008 Amount

More information

Digital Preservation: the need for an open source digital archival and preservation system for small to medium sized collections,

Digital Preservation: the need for an open source digital archival and preservation system for small to medium sized collections, Digital Preservation: the need for an open source digital archival and preservation system for small to medium sized collections, Kevin Bradley ABSTRACT: Though the solution to all of the problems of digital

More information

Digital Asset Management Developing your Institutional Repository

Digital Asset Management Developing your Institutional Repository Digital Asset Management Developing your Institutional Repository Manny Bekier Director, Biomedical Communications Clinical Instructor, School of Public Health SUNY Downstate Medical Center Why DAM? We

More information

Second EUDAT Conference, October 2013 Workshop: Digital Preservation of Cultural Data Scalability in preservation of cultural heritage data

Second EUDAT Conference, October 2013 Workshop: Digital Preservation of Cultural Data Scalability in preservation of cultural heritage data Second EUDAT Conference, October 2013 Workshop: Digital Preservation of Cultural Data Scalability in preservation of cultural heritage data Simon Lambert Scientific Computing Department STFC UK Types of

More information

The Czech Digital Library and Tools for the Management of Complex Digitization Processes

The Czech Digital Library and Tools for the Management of Complex Digitization Processes The Czech Digital Library and Tools for the Management of Complex Digitization Processes Martin LHOTÁK Library of the Academy of Sciences of the Czech Republic lhotak@knav.cz INFORUM 2012: 18th Conference

More information

Conceptualizing Policy-Driven Repository Interoperability (PoDRI) Using irods and Fedora

Conceptualizing Policy-Driven Repository Interoperability (PoDRI) Using irods and Fedora Conceptualizing Policy-Driven Repository Interoperability (PoDRI) Using irods and Fedora David Pcolar Carolina Digital Repository (CDR) david_pcolar@unc.edu Alexandra Chassanoff School of Information &

More information

Assessment of RLG Trusted Digital Repository Requirements

Assessment of RLG Trusted Digital Repository Requirements Assessment of RLG Trusted Digital Repository Requirements Reagan W. Moore San Diego Supercomputer Center 9500 Gilman Drive La Jolla, CA 92093-0505 01 858 534 5073 moore@sdsc.edu ABSTRACT The RLG/NARA trusted

More information

Vilas Wuwongse, Thiti Vacharasintopchai, Neelawat Intaraksa Asian Institute of Technology www.ait.asia

Vilas Wuwongse, Thiti Vacharasintopchai, Neelawat Intaraksa Asian Institute of Technology www.ait.asia A Common Infrastructure for Digital Contents Vilas Wuwongse, Thiti Vacharasintopchai, Neelawat Intaraksa Asian Institute of Technology www.ait.asia Outline Introduction Issues Proposed Approach A Common

More information

Best Archiving Practice Guidance

Best Archiving Practice Guidance Best Archiving Practice Guidance This document has been published under the auspices of the EU Telematics Implementation Group - electronic submissions (TIGes) Please note that this document has been published

More information

Evaluating File Formats for Long-term Preservation

Evaluating File Formats for Long-term Preservation Evaluating File Formats for Long-term Preservation Abstract Judith Rog, Caroline van Wijk National Library of the Netherlands; The Hague, The Netherlands judith.rog@kb.nl, caroline.vanwijk@kb.nl National

More information

Documenting the research life cycle: one data model, many products

Documenting the research life cycle: one data model, many products Documenting the research life cycle: one data model, many products Mary Vardigan, 1 Peter Granda, 2 Sue Ellen Hansen, 3 Sanda Ionescu 4 and Felicia LeClere 5 Introduction Technical documentation for social

More information

9 th ETD Conference - 2006

9 th ETD Conference - 2006 9 th ETD Conference - 2006 A Prototype for Preservation and Harvesting of International ETDs using LOCKSS and OAI-PMH Kamini Santhanagopalan Department of Computer Science, Virginia Tech ksanthan@vt.edu

More information

DAR: A Digital Assets Repository for Library Collections An Extended Overview

DAR: A Digital Assets Repository for Library Collections An Extended Overview DAR: A Digital Assets Repository for Library Collections An Extended Overview Iman Saleh Noha Adly * Magdy Nagi * Bibliotheca Alexandrina El Shatby 21526 Alexandria, Egypt {iman.saleh, noha.adly, magdy.nagi}

More information

Bibliothèque numérique de l enssib

Bibliothèque numérique de l enssib Bibliothèque numérique de l enssib Extending the network: libraries and their partners, 17 au 20 juin 2003 32 e congrès LIBER Open archive solutions to traditional archive/library cooperation Castelli,

More information

Open Journal Systems and Dataverse Integration-- Helping Journals to Upgrade Data Publication for Reusable Research

Open Journal Systems and Dataverse Integration-- Helping Journals to Upgrade Data Publication for Reusable Research Open Journal Systems and Dataverse Integration-- Helping Journals to Upgrade Data Publication for Reusable Research Micah Altman Director of Research, MIT Libraries http://informatics.mit.edu

More information

Oxford Digital Asset Management System (DAMS) Update

Oxford Digital Asset Management System (DAMS) Update Oxford Digital Asset Management System (DAMS) Update Neil Jefferies R&D Project Manager Systems & eresearch Services (SERS) Oxford University Library Services (OULS) Agenda Overview Fedora-Commons Honeycomb/ST5800

More information

Current Developments and Future Trends for the OAI Protocol for Metadata Harvesting

Current Developments and Future Trends for the OAI Protocol for Metadata Harvesting Current Developments and Future Trends for the OAI Protocol for Metadata Harvesting Sarah L. Shreeves, Thomas G. Habing, Kat Hagedorn, and Jeffrey A. Young Abstract The Open Archives Initiative Protocol

More information

Digital Preservation Lifecycle Management

Digital Preservation Lifecycle Management Digital Preservation Lifecycle Management Building a demonstration prototype for the preservation of large-scale multi-media collections Arcot Rajasekar San Diego Supercomputer Center, University of California,

More information

The Open Archives Initiative: Building a low-barrier interoperability framework

The Open Archives Initiative: Building a low-barrier interoperability framework The Open Archives Initiative: Building a low-barrier interoperability framework Carl Lagoze Digital Library Research Group Cornell University Ithaca, NY +1-607-255-6046 lagoze@cs.cornell.edu Herbert Van

More information

Repository Replication Using NNTP and SMTP

Repository Replication Using NNTP and SMTP Repository Replication Using NNTP and SMTP Joan A. Smith, Martin Klein, and Michael L. Nelson Old Dominion University, Department of Computer Science Norfolk, VA 23529 USA {jsmit, mklein, mln}@cs.odu.edu

More information

WRANGLING DIGITAL CHAOS: CHARACTERIZATION & INGEST

WRANGLING DIGITAL CHAOS: CHARACTERIZATION & INGEST Dr. Helen R. Tibbo School of Information and Library Science University of North Carolina at Chapel Hill tibbo@ils.unc.edu WRANGLING DIGITAL CHAOS: CHARACTERIZATION & INGEST Some Streams of Activity

More information

CASE STUDY: DIGITAL PRESERVATION AT THE NATIONAL LIBRARY OF NEW ZEALAND

CASE STUDY: DIGITAL PRESERVATION AT THE NATIONAL LIBRARY OF NEW ZEALAND CASE STUDY: DIGITAL PRESERVATION AT THE NATIONAL LIBRARY OF NEW ZEALAND Preservation: A Forward-Looking Mission The problem of preserving digital information for the future is not only, or even primarily,

More information

Digital libraries of the future and the role of libraries

Digital libraries of the future and the role of libraries Digital libraries of the future and the role of libraries Donatella Castelli ISTI-CNR, Pisa, Italy Abstract Purpose: To introduce the digital libraries of the future, their enabling technologies and their

More information

Factors in Selecting a Digital Asset Management System:

Factors in Selecting a Digital Asset Management System: Factors in Selecting a Digital Asset Management System: Deborah Holmes-Wong, Project Manager University of Southern California Information Services Division Digital Library Federation Spring Forum 2003

More information

Implementing an Integrated Digital Asset Management System: FEDORA and OAIS in Context

Implementing an Integrated Digital Asset Management System: FEDORA and OAIS in Context Implementing an Integrated Digital Asset Management System: FEDORA and OAIS in Context Paul Bevan DAMS Implementation Manager paul.bevan@llgc.org.uk Structure! Background and overview! OAIS Model! Why

More information

Florida Digital Archive (FDA) Policy and Procedures Guide

Florida Digital Archive (FDA) Policy and Procedures Guide Florida Digital Archive (FDA) Policy and Procedures Guide Version 3.1, July 1, 2012 Last reviewed May, 2010 without updates superseded versions: version 3.0, May, 2011 version 2.5, April 2009 version 2.4,

More information

The use of file validation tools in the University of St Andrews digital archive for research data

The use of file validation tools in the University of St Andrews digital archive for research data The use of file validation tools in the University of St Andrews digital archive for research data Swithun Crowe Application Developer (Arts and Humanities Computing Projects) University of St Andrews

More information

Appendix A. MSU Digital Preservation Proposal April 2009. Project: Preserving MSU s Digital Assets

Appendix A. MSU Digital Preservation Proposal April 2009. Project: Preserving MSU s Digital Assets Appendix A MSU Digital Preservation Proposal April 2009 Project: Preserving MSU s Digital Assets I. Project Overview Like other research universities, Michigan State University has amassed a growing body

More information

A Selection of Questions from the. Stewardship of Digital Assets Workshop Questionnaire

A Selection of Questions from the. Stewardship of Digital Assets Workshop Questionnaire A Selection of Questions from the Stewardship of Digital Assets Workshop Questionnaire SECTION A: Institution Information What year did your institution begin creating digital resources? What year did

More information

The Data Grid: Towards an Architecture for Distributed Management and Analysis of Large Scientific Datasets

The Data Grid: Towards an Architecture for Distributed Management and Analysis of Large Scientific Datasets The Data Grid: Towards an Architecture for Distributed Management and Analysis of Large Scientific Datasets!! Large data collections appear in many scientific domains like climate studies.!! Users and

More information

Representing digital assets using MPEG-21 Digital Item Declaration

Representing digital assets using MPEG-21 Digital Item Declaration International Journal on Digital Libraries (2006) 6(2): 159 173 DOI 10.1007/s00799-005-0133-0 REGULAR PAPER Jeroen Bekaert Emiel De Kooning Herbert van de Sompel Representing digital assets using MPEG-21

More information

and ensure validation; documents are saved in standard METS format.

and ensure validation; documents are saved in standard METS format. METS-Based Cataloging Toolkit for Digital Library Management System Li Dong, Bei Zhang Library of Tsinghua University, Beijing, China {dongli, zhangbei}@lib.tsinghua.edu.cn Chunxiao Xing, Lizhu Zhou Computer

More information

Understanding Metadata Needs when Migrating DAMS

Understanding Metadata Needs when Migrating DAMS Understanding Metadata Needs when Migrating DAMS Ayla Stein University of Illinois at Urbana-Champaign, USA astein@illinois.edu Santi Thompson University of Houston, USA sathompson3@uh.edu Abstract This

More information

Shigeo Sugimoto. Tsukuba University, Japan

Shigeo Sugimoto. Tsukuba University, Japan Constructing a Records Archiving System Using Off- the-shelf Tools - A Lightweight Approach IWAW 2007 Jan Askhoj,, Mitsuharu Nagamori, Shigeo Sugimoto Tsukuba University, Japan Outline Problems with corporate

More information

Integration of Distributed Healthcare Records: Publishing Legacy Data as XML Documents Compliant with CEN/TC251 ENV13606

Integration of Distributed Healthcare Records: Publishing Legacy Data as XML Documents Compliant with CEN/TC251 ENV13606 Integration of Distributed Healthcare Records: Publishing Legacy Data as XML Documents Compliant with CEN/TC251 ENV13606 J.A. Maldonado, M. Robles, P. Crespo Bioengineering, Electronics and Telemedicine

More information

Chapter 5: The DAITSS Archiving Process

Chapter 5: The DAITSS Archiving Process Chapter 5: The DAITSS Archiving Process Topics covered in this chapter: A brief glossary of terms relevant to this chapter Specifications for Submission Information Packages (SIPs) DAITSS archiving workflow

More information

Why long time storage does not equate to archive

Why long time storage does not equate to archive Why long time storage does not equate to archive Jos van Wezel HUF Toronto 2015 STEINBUCH CENTRE FOR COMPUTING - SCC KIT University of the State of Baden-Württemberg and National Laboratory of the Helmholtz

More information

Florida Digital Archive (FDA) Policy and Procedures Guide

Florida Digital Archive (FDA) Policy and Procedures Guide Florida Digital Archive (FDA) Policy and Procedures Guide Version 3.0, May, 2011 Last reviewed May, 2010 without updates superseded versions: version 2.5, April 2009 version 2.4, August 2007 version 2.3,

More information

Knowledge Management using Open Source Repository

Knowledge Management using Open Source Repository Knowledge Management using Open Source Repository GIULIO CONCAS, FILIPPO EROS PANI, MARIA ILARIA LUNESU Department of Electric and Electronic Engineering, Agile Group University of Cagliari Piazza d Armi,

More information

Summary Report of the PREMIS Implementation Fair

Summary Report of the PREMIS Implementation Fair Summary Report of the PREMIS Implementation Fair The PREMIS Implementation Fair, sponsored by the Library of Congress, was held on October 7, 2009 in San Francisco. There were over 40 attendees and registrants

More information

The FAO Open Archive: Enhancing Access to FAO Publications Using International Standards and Exchange Protocols

The FAO Open Archive: Enhancing Access to FAO Publications Using International Standards and Exchange Protocols The FAO Open Archive: Enhancing Access to FAO Publications Using International Standards and Exchange Protocols Claudia Nicolai; Imma Subirats; Stephen Katz Food and Agriculture Organization of the United

More information

Personal & SOHO Archiving

Personal & SOHO Archiving Personal & SOHO Archiving Stephan Strodl, Florian Motlik, Kevin Stadler, Andreas Rauber Vienna University of Technology Vienna, Austria www.ifs.tuwien.ac.at/dp ABSTRACT Digital objects require appropriate

More information

Adding Robust Digital Asset Management to Oracle s Storage Archive Manager (SAM)

Adding Robust Digital Asset Management to Oracle s Storage Archive Manager (SAM) Adding Robust Digital Asset Management to Oracle s Storage Archive Manager (SAM) Oracle's Sun Storage Archive Manager (SAM) self-protecting file system software reduces operating costs by providing data

More information

METS and the CIDOC CRM a Comparison

METS and the CIDOC CRM a Comparison METS and the CIDOC CRM a Comparison Martin Doerr February 2011 Acknowledgments The work was commissioned and financed by Cultural Heritage Imaging (http://www.c-h-i.org) with majority funding from the

More information

National Library of Australia IT Architecture Project Report. March 2007

National Library of Australia IT Architecture Project Report. March 2007 National Library of Australia IT Architecture Project Report March 2007 IT Architecture Project Report. March 2007 TABLE OF CONTENTS Table of Contents... i Overview... 1 Purpose... 1 Scope... 1 Benefits...

More information

Using Dublin Core for DISCOVER: a New Zealand visual art and music resource for schools

Using Dublin Core for DISCOVER: a New Zealand visual art and music resource for schools Proc. Int. Conf. on Dublin Core and Metadata for e-communities 2002: 251-255 Firenze University Press Using Dublin Core for DISCOVER: a New Zealand visual art and music resource for schools Karen Rollitt,

More information