Building integration environment based on OAI-PMH protocol. Novytskyi Oleksandr Institute of Software Systems NAS Ukraine Alex@zu.edu.



Similar documents
Notes about possible technical criteria for evaluating institutional repository (IR) software

The Czech Digital Library and Tools for the Management of Complex Digitization Processes

Open Access Repositories Technical Considerations. Introduction. Approaches to Setting up Repositories

OCLC CONTENTdm. Geri Ingram Community Manager. Overview. Spring 2015 CONTENTdm User Conference Goucher College Baltimore MD May 27, 2015

DAR: A Digital Assets Repository for Library Collections

Service Cloud for information retrieval from multiple origins

CERN Document Server

Institutional Repositories: Staff and Skills requirements

PAPER Data retrieval in the PURE CRIS project at 9 universities

MultiMimsy database extractions and OAI repositories at the Museum of London

Institutional Repositories: Staff and Skills Set

How To Harvest For The Agnic Portal

Course 1: Digital Libraries Introduction Unit 4: Digital libraries functional components software

All You Wanted To Know About the Management of Digital Resources in Alma

OCLC CONTENTdm and the WorldCat Digital Collection Gateway Overview

Taking Control of Library Metadata and Websites using the extensible Catalog

Eprints and the evaluation of scientific production: a useful tool but it is not enough

Vilas Wuwongse, Thiti Vacharasintopchai, Neelawat Intaraksa Asian Institute of Technology

DIGITAL OBJECT an item or resource in digital format. May be the result of digitization or may be born digital.

DataNet Flexible Metadata Overlay over File Resources

Metadata Quality Control for Content Migration: The Metadata Migration Project at the University of Houston Libraries

Ex Libris Rosetta: A Digital Preservation System Product Description

Analysis of e-learning repository systems and frameworks with prepositions for improvements

Copying Archives. Ngoni Munyaradzi (MNYNGO001)

Building An Institutional Repository With DSpace

AgriDrupal support slides

Functional Requirements for Digital Asset Management Project version /30/2006

Deduplication of metadata harvested from Open Archives Initiative repositories

Free web-based solution to manage photographs that could be used to manage collection items online if there is a photo of every item.

DAR: A Digital Assets Repository for Library Collections An Extended Overview

How To Build A Connector On A Website (For A Nonprogrammer)

A Virtual Exhibition of Open Source Software for Libraries

RCAAP: Building and maintaining a national repository network

EPrints Preservation Update

Archiving Scientific Literature: An Experience with E-prints Archive Software

Implementing an Institutional Repository for Digital Archive Communities: Experiences from National Taiwan University

Using the Grid for the interactive workflow management in biomedicine. Andrea Schenone BIOLAB DIST University of Genova

The FAO Open Archive: Enhancing Access to FAO Publications Using International Standards and Exchange Protocols

User Guide for Students. Instructions for Building a Website in Omeka.net for a Class Project

ETD Repository: Drupal, Solr, Islandora, and Fedora Commons. Aaron Collie, Devin Higgins, Lucas Mak, Shawn Nicholson

Factors in Selecting a Digital Asset Management System:

OPEN SOURCE SOFTWARE TOOLS FOR CREATING DIGITAL REPOSITORIES

Analysing log files. Yue Mao Supervisor: Dr Hussein Suleman, Kyle Williams, Gina Paihama. University of Cape Town

How To Manage Your Digital Assets On A Computer Or Tablet Device

Metadata for Big Data: A preliminary investigation of metadata quality issues in research data repositories

OPENGREY: HOW IT WORKS AND HOW IT IS USED

METADATA STANDARDS AND GUIDELINES RELEVANT TO DIGITAL AUDIO

Smart Storage and Preservation: How Digital Repositories Can Participate

8 open access in Turkey

Installation and Customization Experience of Metadata Harvester System: Case from University of Ruhuna

Building Library Website using Drupal

Laserfiche. and SharePoint Integration. Your potential, realized. Learn More Inside. Include imaged documents in collaborative processes.

dati.culturaitalia.it a Pilot Project of CulturaItalia dedicated to Linked Open Data

Biblioteca Digital del Patrimonio Iberoamericano: open source technology in the service of a major cooperative project.

Implementing an Integrated Digital Asset Management System: FEDORA and OAIS in Context

Extension with Digital Library Technology of UNESCO s J-ISIS Database Software

Data Publishing Workflows with Dataverse

Adding Robust Digital Asset Management to Oracle s Storage Archive Manager (SAM)

DFG form /15 page 1 of 8. for the Purchase of Licences funded by the DFG

Citebase Search: Autonomous Citation Database for e-print Archives

Proposed Protocol to Solve Discovering Hidden Web Hosts Problem

How To Manage A Digital Library

Contents. 1.Running the Installer Activating PowerForms SharePoint SharePoint 2007 / WSS

A DICOM-based Software Infrastructure for Data Archiving

James Hardiman Library. Digital Scholarship Enablement Strategy

Using Dataverse Virtual Archive Technology for Research Data Management. Jonathan Crabtree Thu-Mai Christian Amanda Gooch

Columbia University Libraries / Information Services

A Selection of Questions from the. Stewardship of Digital Assets Workshop Questionnaire

Local Loading. The OCUL, Scholars Portal, and Publisher Relationship

Strategy and Cooperation on Long-term Preservation in the Czech Republic

- a Humanities Asset Management System. Georg Vogeler & Martina Semlak

METS and the CIDOC CRM a Comparison

OAISistema verso un portale OAI per gli studi sul Mediterraneo Antico

Transcription:

Building integration environment based on OAI-PMH protocol Novytskyi Oleksandr Institute of Software Systems NAS Ukraine Alex@zu.edu.ua

Roadmap What is OAI-PMH? Requirements for infrastructure Step by step building integration environment Additional services for end-users Future directions for integration Some issues Questions

What is OAI-PMH? OAI-PMH (Open Archives Initiative Protocol for Metadata Harvesting) is a protocol developed by the Open Archives Initiative. It is used to harvest (or collect) the metadata descriptions of the records in an archive so that services can be built using metadata from many archives. An implementation of OAI-PMH must support representations metadata in Dublin Core, but may also support additional representations.

What is OAI-PMH? Protocol Requests and Responses Requests GetRecord Identify ListIdentifiers Responses Metadata item Information about repository, schema, protocol version and other information Returns record header it contains: unique identifier -- the unique identifier of an item in a repository; datestamp -- the date of creation, modification or deletion of the item for the purpose of selective harvesting; zero or more setspec elements -- the set elements belong to item for the purpose of selective harvesting. ListMetadataFormats ListRecords ListSets is used to retrieve the metadata formats available in a repository is used to harvest records from a repository is used to retrieve the set structure of a repository, useful for selective harvesting.

Requirements for infrastructure Data provider Eprints (http://www.eprints.org ) Dspace (http://www.dspace.org ) Fedora (http://fedora-commons.org ) Other software applications which support uploading data in to OAI-PMH Service provider Open Harvester Systems (http://pkp.sfu.ca/?q=harvester ) Omeka (http://omeka.org )

Step by step for building integration environment 1. Select hosting for installation Service provider aggregations many big repositories need a lot of RAM (~3 GB) for PHP execution Crontab Long time execution script Needs dedicated server or VPS 1. Installation and configuration Service provider 1. Installation Service provider 2. Customization user interface 3. Connection digital libraries 4. Configuration Reading Tools

Step by step guide for building integration environment PKP Open Harvester Systems Easy to install Long term support Easy to upgrade Many plugins Supports ETD-MS, MARC, Dublin Core, MODS metadata formats Good performance

search in public archives of Ukraine oai.org.ua The project http://oai.org.ua started in 2008 and its main goal is to integrate open archives/repositories of Ukraine. Currently, it integrates 35 open archives of Ukraine. And it contains more than 126,000 records. The system was developed and is maintained by Institute of Software Systems of the NAS of Ukraine.

Addition services for end-users PKP Harvester offers a range of tools that you can use in scientific discovery. For example, the Reading Tools. This tool allows to carry out a quick search of related resources.

Reading Tools

Reading Tools

Reading Tools Linked to Scientific Digital Library periodicals NAS of Ukraine http://dspace.nbuv.gov.ua

Future directions for INFORMATION integration. Protocol OAI-PMH provides a structured data integration, however leaves aside the semantics. We plan to use the model of semantic description for each resource by using RDFa. This will allow search engines to search for more information.

Future directions INFORMATION integration Support distributed full text search in the archives. We have already developed for EPrints a set of web services that enable searching for metadata fields as well as full-text search. In the near future we plan to introduce this extension to our system http://oai.org.ua. The integration of archives based on ontologies This direction is being studied by us, and we plan the use of ontologies to integrate heterogeneous resources that harvester collects.

Some issues Some users install DSpace without supporting OAI-PMH Some EPrints installations do not have unique OAI Repository Identifier Repositories can change their URL. It gives additional difficulties in supporting the list of repositories (keeping it up to date).

Questions Thank you for your attention!