CERN Document Server




CERN Document Server
Document Management System for Grey Literature in Networked Environment
Martin Vesely, CERN, Geneva, Switzerland
GL5, December 4-5, 2003, Amsterdam, The Netherlands

Overview
- Searching Scholarly Publications: why not use Google?
- Institutional Repositories: a natural way of managing documents at their place of origin
- Open Archives initiative (OAi): develops and promotes interoperability standards that aim to facilitate the efficient dissemination of content; enhances access to e-print archives as a means of increasing the availability of scholarly communication
- Protocol for Metadata Harvesting (PMH): an application-independent interoperability framework
- CERN Document Server: implementation of an institutional repository and information services with searching and harvesting capabilities

Searching Scholarly Publications
"Electronic capabilities should be used to provide wide access to scholarship, encourage interdisciplinary research, and enhance interoperability and searchability. Development of common standards will be particularly important in the electronic environment."
Principles for Emerging Systems of Scholarly Publishing, Tempe, Arizona, March 2-4, 2000

Institutional Repositories
"Digital collections capturing, preserving and disseminating the intellectual output of a single or multi-university community"
SPARC, The Scholarly Publishing & Academic Resources Coalition
http://www.arl.org/sparc/

Open Archives Initiative
Milestones of OAi:
- Oct 1999: Santa Fe Convention
- Nov 2000: OAi TC meeting at CERN
- Jun 2002: OAi-PMH v.2.0 released
Next: CERN 3rd Workshop on Innovations in Scholarly Communication: Implementing the benefits of OAi, 12-14th February 2004, CERN, Geneva, Switzerland
http://info.web.cern.ch/info/oaip/

Protocol for Metadata Harvesting
[Diagram: services across institutional repositories, shown as a layered stack between Institutional Repositories and Information Services]
- Application: e.g. search engine
- Metadata harvesting: OAi XML
- Transfer: HTTP, other options possible (+ HTTP widely deployed)
- Transport: communication subsystem, TCP/IP (internet)

Protocol for Metadata Harvesting
Data provider <-> Service provider: XML over HTTP, Web Services
Unified:
- XML Schema (structure)
- HTTP transfer
- Data encoding
- Data flow control
- Common transfer metadata format
Independent:
- Storage technology
- Local metadata format
- Communication subsystem
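The service-provider side of the harvesting exchange described above can be illustrated with a minimal Python sketch. It is not taken from CDSware; the function names and the sample response are hypothetical. It only assumes the OAI-PMH v2.0 conventions for the ListRecords verb, the oai_dc metadata prefix, and the resumptionToken element used for flow control. No network access is performed: the response is parsed from a string.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urlencode

# XML namespaces defined by OAI-PMH v2.0 and Dublin Core.
OAI_NS = "http://www.openarchives.org/OAI/2.0/"
DC_NS = "http://purl.org/dc/elements/1.1/"


def build_list_records_url(base_url, metadata_prefix="oai_dc", resumption_token=None):
    """Build a ListRecords request URL (hypothetical helper)."""
    if resumption_token:
        # resumptionToken is an exclusive argument: it replaces all others.
        params = {"verb": "ListRecords", "resumptionToken": resumption_token}
    else:
        params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    return base_url + "?" + urlencode(params)


def parse_list_records(xml_text):
    """Return (records, resumption_token) from a ListRecords response."""
    root = ET.fromstring(xml_text)
    records = []
    for record in root.iter("{%s}record" % OAI_NS):
        identifier = record.findtext("{%s}header/{%s}identifier" % (OAI_NS, OAI_NS))
        titles = [t.text for t in record.iter("{%s}title" % DC_NS)]
        records.append({"identifier": identifier, "titles": titles})
    token = root.findtext("{%s}ListRecords/{%s}resumptionToken" % (OAI_NS, OAI_NS))
    # An empty or missing token means the list is complete.
    return records, (token or None)


# Invented sample response, for illustration only.
SAMPLE_RESPONSE = """<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListRecords>
    <record>
      <header><identifier>oai:example.org:1</identifier></header>
      <metadata>
        <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
                   xmlns:dc="http://purl.org/dc/elements/1.1/">
          <dc:title>Example preprint</dc:title>
        </oai_dc:dc>
      </metadata>
    </record>
    <resumptionToken>token-2</resumptionToken>
  </ListRecords>
</OAI-PMH>"""

records, token = parse_list_records(SAMPLE_RESPONSE)
next_url = build_list_records_url("http://example.org/oai", resumption_token=token)
```

A harvester would repeat the request with each returned token until none is left, which is how the protocol keeps the data provider in control of the data flow.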

CERN Document Server
- CDS: digital library for the HEP community
- CDSware: in-house developed system
- MySQL RDBMS, Apache, Python, PHP
- MARC21 metadata format: http://www.loc.gov/
- Document submission (with flow control)
- Multilingual: UNICODE
- CDSware is available under the GPL: http://cdsware.cern.ch/
- CVS repository access
- Free download and usage

CDSware Search Engine
- Metadata organized into navigable collections
- In-house indexing technique to provide fast user-seen search times (a fraction of a second for a typical query on a database of up to 10^6 records)
- User friendliness, Google-like guidance
- Personalization: alert engine, user baskets
- Combined metadata/reference/fulltext searching
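The slide does not describe the in-house indexing technique itself. As a generic illustration of why a precomputed index keeps query times short, the following sketch builds a plain inverted index (word to record ids) in Python. This is a textbook technique swapped in for illustration, not the CDSware implementation; the record texts and function names are invented.

```python
from collections import defaultdict


def build_index(records):
    """Map each lowercased word to the set of record ids containing it."""
    index = defaultdict(set)
    for record_id, text in records.items():
        for word in text.lower().split():
            index[word].add(record_id)
    return index


def search(index, query):
    """Return ids of records that contain every word of the query."""
    words = query.lower().split()
    if not words:
        return set()
    # Start from the first word's postings and intersect with the rest.
    result = set(index.get(words[0], set()))
    for word in words[1:]:
        result &= index.get(word, set())
    return result


# Invented sample metadata, for illustration only.
sample_records = {
    1: "Higgs boson search",
    2: "Search for supersymmetry",
    3: "Detector alignment",
}
sample_index = build_index(sample_records)
hits = search(sample_index, "higgs search")
```

The index is built once at upload time, so each query reduces to a few set lookups and intersections instead of a scan over all records.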

CDSware overview
[Diagram: CDSware modules and the roles that use them. Modules: WebSubmit, WebSearch, WebBasket, WebPerso, WebAccess, BibHarvest, BibConvert, BibUpload, BibSched, BibFormat, BibIndex, BibData. Roles: author, user, admin, system librarian. External systems: OAI/non-OAI data providers, OAI services. Central store: CDSware metadata + data.]

CDSware OAi compliancy
[Diagram components: OAi request and OAi response over HTTP; request parsing; flow control; database query on CDS metadata; cache; output as MARC XML / DC XML wrapped in OAi XML.]
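The request-parsing step named in the diagram can be sketched as follows. This is a hypothetical Python illustration of what a data provider has to check before querying its database, not CDSware's own code. The verb names, argument names and the badVerb/badArgument error codes are those of OAI-PMH v2.0; the function name and return shape are assumptions.

```python
from urllib.parse import parse_qs

# Required and optional arguments per verb in OAI-PMH v2.0.
VERBS = {
    "Identify": {"required": set(), "optional": set()},
    "ListMetadataFormats": {"required": set(), "optional": {"identifier"}},
    "ListSets": {"required": set(), "optional": set()},
    "GetRecord": {"required": {"identifier", "metadataPrefix"}, "optional": set()},
    "ListIdentifiers": {"required": {"metadataPrefix"}, "optional": {"from", "until", "set"}},
    "ListRecords": {"required": {"metadataPrefix"}, "optional": {"from", "until", "set"}},
}
# Verbs whose responses may be split into chunks (flow control).
RESUMABLE = {"ListSets", "ListIdentifiers", "ListRecords"}


def parse_oai_request(query_string):
    """Return ("ok", request) or (error_code, None). Hypothetical helper."""
    params = parse_qs(query_string, keep_blank_values=True)
    verbs = params.pop("verb", [])
    # A missing, repeated or unknown verb is a badVerb error.
    if len(verbs) != 1 or verbs[0] not in VERBS:
        return "badVerb", None
    verb = verbs[0]
    # Repeated arguments are not allowed.
    if any(len(values) > 1 for values in params.values()):
        return "badArgument", None
    args = {name: values[0] for name, values in params.items()}
    if "resumptionToken" in args:
        # resumptionToken is exclusive and only valid for list verbs.
        if verb in RESUMABLE and len(args) == 1:
            return "ok", {"verb": verb, **args}
        return "badArgument", None
    spec = VERBS[verb]
    names = set(args)
    allowed = spec["required"] | spec["optional"]
    if not (spec["required"] <= names) or not (names <= allowed):
        return "badArgument", None
    return "ok", {"verb": verb, **args}


status, request = parse_oai_request("verb=ListRecords&metadataPrefix=oai_dc")
```

Only a request that passes these checks would go on to the database query; the error codes are returned to the harvester inside the OAi XML response.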

CDSware References

CERN Document Server

Documents at CERN
[Chart: documents by type. Articles, preprints, theses: 500 000. Other categories shown: archived items, books, talks (slides, videos), conferences, multimedia items (photos, clips, press cuttings), journals, with counts of 50 000, 50 000, 20 000, 15 000, 14 000 and 2 500.]

CDS at CERN
- 650 000 records (Grey Literature > 80%)
- 220 000 full texts
- 350 different collections
- 1000 new preprints per week:
  - 70 % from arXiv
  - 5 % from CERN
  - 25 % from 80 other sources

Interoperability Issues
- Standardization efforts
  - XML Schemata and XSLT stylesheets have been specified (e.g. OAi-PMH)
  - Common metadata formats are defined (e.g. Dublin Core, MARC21)
- Semantic interoperability research
  - Structural approaches (e.g. RDF/XML)
  - Ontological interoperability
  - Subject of research in DL

Conclusions
- Search engines for grey literature are being widely deployed and represent a central information service in scholarly communication
- Institutional repositories are gaining momentum and becoming dominant over disciplinary repositories
- Standardized frameworks for distributed and federated document processing have been established
- Information interoperability has been achieved on the syntactic and structural/schematic level, whereas semantic interoperability remains a research issue
- CDSware implements OAi-PMH and is freely available (GNU/GPL)

Contact
- CERN Document Server: http://cds.cern.ch/ http://cdsweb.cern.ch/
- CDSware sources and demo: http://cdsware.cern.ch/ http://cdsware.cern.ch:8000/demoplus/
- Contact: cds.support@cern.ch, martin.vesely@cern.ch