data.bris: collecting and organising repository metadata, an institutional case study
|
|
|
- Michael Garrison
- 10 years ago
- Views:
Transcription
1 Describe, disseminate, discover: metadata for effective data citation. DataCite workshop, no.2.. data.bris: collecting and organising repository metadata, an institutional case study David Boyd data.bris project
2 data.bris context Funded by JISC, one of several RDM infrastructure projects, October 2011 March Central aim to establish a research data repository service Service to be piloted in the Faculty of Arts (School of Arts; School of Humanities; School of Modern Languages) Develop a business plan to extend the model across the University Building on the investment already made in research data storage through the Research Data Storage Facility
3 Repository service some key requirements Must be fit for purpose and meet the needs of the researcher Must fit within the wider University-wide support service structure, i.e., successfully interface with existing systems and processes, e.g., Research Data Storage Facility, and new RIS Must be able to be embedded in future systems, without conflicting with other University systems
4 6 July 2012 Research Data Storage Facility Bristol has the infrastructure for research data storage, though requires some way to publish it! Employees of the University complete an application form to use the Facility. Application form asks for details about the project and the data being created. Data Steward is held responsible for the proper stewardship of the data stored in the Facility. Existing service offers an upload facility by simply mapping storage to individual machines around the campus. Provision of metadata is not mandatory.
5 6 July 2012 data.bris architecture Some of the things we want: A deposit facility that will allow a user to select a set of files from the RDSF for deposit and ultimately wider publication. The ability to document that deposit in some detail. The metadata will be manually entered, but also gathered from other institutional systems and mechanically derived from the dataset itself. A process of assigning a DOI to the dataset and making it publicly available. To create and manage records in the University RIS
6 Metadata - key requirements Generate metadata that is of good quality, and enables discovery, retrieval, and sharing of data Establish a process for collecting metadata that is lightweight, efficient, and where possible automated Provide depositors with a standard citation for their dataset - identify a minimum set of metadata for the purposes of data citation, including a persistent identifier Generate metadata that is interoperable and can be shared across different systems, i.e., standards based
7 Deposit process - collecting metadata manually In-house system that directly interacts with users RDSF file share Lightweight approach - encourage the creators of data to deposit by developing an intuitive, simple browser-based user interface Develop efficient, streamlined procedures for manual data entry and metadata creation Engage researchers with the repository deposit process in an attempt to embed the processes involved within their working cycle to further encourage deposit
8 6 July 2012 Deposit facility: project area
9 6 July 2012 Deposit facility: project area with data contents
10 6 July 2012 Deposit facility: new deposit (initial information)
11 6 July 2012 Deposit facility: record created
12 6 July 2012 Deposit facility: choosing content
13 6 July 2012 Deposit facility: data deposited
14 6 July 2012 Deposit facility: data published
15 Generating and capturing metadata automatically Import, and integrate metadata held in other University systems into the data deposit process where relevant, e.g., Title of data; names of data creators; name of research project (extracted and re-used from original RDSF registration form) Staff profile information from central administrative systems Automatically generate metadata where relevant, e.g., Identifier (DOI), Publisher name Apache Tika - automatically extract metadata from data files where relevant, e.g., when created, who created it, when last updated, file size, file extension etc.
16 Data citation Provide researchers with a sample citation (and persistent identifier) for their dataset, one that can be used in academic publications. Enables impact of data to be tracked Implement a minimum (mandatory) set of metadata whose elements can be used for citation purposes DataCite recommended format with mapping to Dublin Core qualified terms is being used as a point of reference, i.e., Name of creator/s (PublicationYear): Title. Publisher. Identifier (DOI). Aim to use DataCite Metadata Store service for minting DOIs and register associated metadata.
17 Metadata interoperability and implementation Use Dublin Core (DC) element set as a basis for defining and structuring metadata. DC is well understood, widely used, and highly interoperable Use of DC terms follows the Resource Description Framework (RDF) model Implementation - provide serialisation in RDF/XML, turtle, JSON and XML. May also provide RDF embedded in html Integration with University RIS...
18 Research Information System (RIS): University is currently implementing a new research information system (RIS) that aims to provide a complete and definitive record of researchers, research outputs, grants and projects RIS will also serve as the full-text institutional repository though it is assumed that (for the present at least) it won t store large datasets Need to avoid duplication of metadata and ensure that users do not re-enter metadata Need to adopt the same identifiers for users, outputs, and datasets
19 Research Information System (RIS) proposed interaction: Aim to push data.bris dataset records (summary metadata records not the data itself) to the RIS, to provide a complete picture of research. Requirement for a metadata crosswalk Aim to minimise the amount of data entry and duplication in data.bris system by using metadata that originates in the RIS Longer term - as RIS's ability to describe research data evolves it may be possible for the RIS to take over from data.bris as the definitive holder of metadata about research data.
20 Ongoing investigation related resources Appropriate use of metadata properties and attributes to describe the relationships between resources, e.g., the relationship between an article and the dataset on which the research is based the relationship between different versions (revisions, updates, additions) of a dataset the relationship between the various parts (supplements, subsets) of a dataset What constitutes a complete (citable) set of data?
21 6 July 2012 Ongoing investigations stub deposits Provide for stub deposits, i.e., those consisting of metadata only and no associated research data, for when the data is held in an external repository or domain specific data centre Use of SWORD deposit protocol to enable this
22 Ongoing investigation domain/subject specific metadata Collecting descriptive information about data, e.g., annotations, notes, etc. plugin support for a variety of tools Subject classification, e.g., JACS coding, or LoC subject headings
23 Thank you. David Boyd. University of Bristol, Library Services/IT Services R&D/ILRT.
Research Data Management Guide
Research Data Management Guide Research Data Management at Imperial WHAT IS RESEARCH DATA MANAGEMENT (RDM)? Research data management is the planning, organisation and preservation of the evidence that
Working with the British Library and DataCite A guide for Higher Education Institutions in the UK
Working with the British Library and DataCite A guide for Higher Education Institutions in the UK Contents About this guide This booklet is intended as an introduction to the DataCite service that UK organisations
Cite My Data M2M Service Technical Description
Cite My Data M2M Service Technical Description 1 Introduction... 2 2 How Does it Work?... 2 2.1 Integration with the Global DOI System... 2 2.2 Minting DOIs... 2 2.3 DOI Resolution... 3 3 Cite My Data
LIBER Case Study: University of Oxford Research Data Management Infrastructure
LIBER Case Study: University of Oxford Research Data Management Infrastructure AuthorS: Dr James A. J. Wilson, University of Oxford, [email protected] Keywords: generic, institutional, software
University of Bristol. Research Data Storage Facility (the Facility) Policy Procedures and FAQs
University of Bristol Research Data Storage Facility (the Facility) Policy Procedures and FAQs This FAQs should be read in conjunction with the RDSF usage FAQs - https://www.acrc.bris.ac.uk/acrc/rdsf-faqs.html
EPSRC Research Data Management Compliance Report
EPSRC Research Data Management Compliance Report Contents Introduction... 2 Approval Process... 2 Review Schedule... 2 Acknowledgement... 2 EPSRC Expectations... 3 1. Awareness of EPSRC principles and
The overall aim for this project is To improve the way that the University currently manages its research publications data
Project Plan Overview of Project 1. Background The I-WIRE project will develop a workflow and toolset, integrated into a portal environment, for the submission, indexing, and re-purposing of research outputs
Towards research data cataloguing at Southampton using Microsoft SharePoint and EPrints: a progress report
Towards research data cataloguing at Southampton using Microsoft SharePoint and EPrints: a progress report Steve Hitchcock and Wendy White* JISC DataPool Project Faculty of Physical and Applied Sciences,
OpenAIRE Research Data Management Briefing paper
OpenAIRE Research Data Management Briefing paper Understanding Research Data Management February 2016 H2020-EINFRA-2014-1 Topic: e-infrastructure for Open Access Research & Innovation action Grant Agreement
SHared Access Research Ecosystem (SHARE)
SHared Access Research Ecosystem (SHARE) June 7, 2013 DRAFT Association of American Universities (AAU) Association of Public and Land-grant Universities (APLU) Association of Research Libraries (ARL) This
DATA CITATION. what you need to know
DATA CITATION what you need to know The current state of practice of the citation of datasets is seriously lacking. Acknowledgement of intellectual debts should not be limited to only certain formats of
RESEARCH DATA MANAGEMENT POLICY
Document Title Version 1.1 Document Review Date March 2016 Document Owner Revision Timetable / Process RESEARCH DATA MANAGEMENT POLICY RESEARCH DATA MANAGEMENT POLICY Director of the Research Office Regular
Stewarding Big Data: Perspectives on Public Access to Federally Funded Scientific Research Data
Stewarding Big Data: Perspectives on Public Access to Federally Funded Scientific Research Data Big Data and Big Challenges for Law and Legal Information Georgetown Law Library January 30, 2013 William
How To Write A Blog Post On Globus
Globus Software as a Service data publication and discovery Kyle Chard, University of Chicago Computation Institute, [email protected] Jim Pruyne, University of Chicago Computation Institute, [email protected]
THE BRITISH LIBRARY. Unlocking The Value. The British Library s Collection Metadata Strategy 2015-2018. Page 1 of 8
THE BRITISH LIBRARY Unlocking The Value The British Library s Collection Metadata Strategy 2015-2018 Page 1 of 8 Summary Our vision is that by 2020 the Library s collection metadata assets will be comprehensive,
Environment Canada Data Management Program. Paul Paciorek Corporate Services Branch May 7, 2014
Environment Canada Data Management Program Paul Paciorek Corporate Services Branch May 7, 2014 EC Data Management Program (ECDMP) consists of 5 foundational, incremental projects which will implement
Data Management Resources at UNC: The Carolina Digital Repository and Dataverse Network
Data Management Resources at UNC: The Carolina Digital Repository and Dataverse Network November 16, 2010 Data Management Short Course Series Sponsored by the Odum Institute and the UNC Libraries Campus
Jochen Schirrwagen, Najko Jahn. Bielefeld University Library, Germany. Research in Context
Jochen Schirrwagen, Najko Jahn Bielefeld University Library, Germany Research in Context In the light of recent results from OpenAIREplus and from the Library perspective Seminar to Access of Grey Literature
Research Data Storage and the University of Bristol
Introduction: Policy for the use of the Research Data Storage Facility The University s High Performance Computing (HPC) facility went live to users in May 2007. Access to this world-class HPC facility
Notes about possible technical criteria for evaluating institutional repository (IR) software
Notes about possible technical criteria for evaluating institutional repository (IR) software Introduction Andy Powell UKOLN, University of Bath December 2005 This document attempts to identify some of
DRIVER Providing value-added services on top of Open Access institutional repositories
DRIVER Providing value-added services on top of Open Access institutional repositories Dr Dale Peters Scientific Technical Manager : DRIVER SUB Goettingen Germany Gaining the momentum: Open Access and
K@ A collaborative platform for knowledge management
White Paper K@ A collaborative platform for knowledge management Quinary SpA www.quinary.com via Pietrasanta 14 20141 Milano Italia t +39 02 3090 1500 f +39 02 3090 1501 Copyright 2004 Quinary SpA Index
ESRC Research Data Policy
ESRC Research Data Policy Introduction... 2 Definitions... 2 ESRC Research Data Policy Principles... 3 Principle 1... 3 Principle 2... 3 Principle 3... 3 Principle 4... 3 Principle 5... 3 Principle 6...
A Guide to the Research Data Service
A Guide to the Research Data Service DMP online ONLINE DATASHARE MY RESEARCH DATA PURE DATA SYNC DATA VAULT DATA STORE This booklet was produced in April 2016 by the Research Data Service Team, Information
JISC Project Plan. Leeds RoaDMaP (Leeds Research Data Management Pilot) #leedsrdm. University of Leeds
Date: 09/03/2012 JISC Project Plan Project Information Project Identifier To be completed by JISC Project Title Leeds RoaDMaP (Leeds Research Data Management Pilot) Project Hashtag #leedsrdm Start Date
Introduction to the Research Data Center PsychData
Introduction to the Research Data Center PsychData INA DEHNHARD ERICH WEICHSELGARTNER Leibniz- Institute for Psychology Information (ZPID) Trier, Germany DATA SHARING DATA MANAGEMENT RESEARCH DATA CENTER
infokit JISC infonet is a JISC Advance Service
This infokit reflects the increasing use of repositories using the documentation, guidance and expertise built up during the Repositories Support Project (RSP). This has been augmented by Lou McGill. infokit
- a Humanities Asset Management System. Georg Vogeler & Martina Semlak
- a Humanities Asset Management System Georg Vogeler & Martina Semlak Infrastructure to store and publish digital data from the humanities (e.g. digital scholarly editions): Technically: FEDORA repository
Expanding Metadata Reuse with an Islandora Metadata Extraction Utility
Expanding Metadata Reuse with an Islandora Metadata Extraction Utility Serhiy Polyakov and William E. Moen University of North Texas International conference Open Repositories 2013 Charlottetown, Prince
Data Publishing Workflows with Dataverse
Data Publishing Workflows with Dataverse Mercè Crosas, Ph.D. Twitter: @mercecrosas Director of Data Science Institute for Quantitative Social Science, Harvard University MIT, May 6, 2014 Intro to our Data
Project Acronym: CRM ACCORD Version: 2 Contact: Joanne Child, Doncaster College Date: 30 April 2010. JISC Final Report CRM ACCORD
Project Acronym: CRM ACCORD JISC Final Report CRM ACCORD Page 1 of 22 Document title: JISC Final Report Last updated: April 2007 Table of Contents Acknowledgements... 3 Executive Summary... 4 Background...
RE: OSTP RFI: Public Access to Peer-Reviewed Scholarly Publications Resulting From Federally Funded Research
Attn: Office of Science and Technology Policy 725 17 th Street, Washington, DC 20501 RE: OSTP RFI: Public Access to Peer-Reviewed Scholarly Publications Resulting From Federally Funded Research Massachusetts
Building Semantic Content Management Framework
Building Semantic Content Management Framework Eric Yen Computing Centre, Academia Sinica Outline What is CMS Related Work CMS Evaluation, Selection, and Metrics CMS Applications in Academia Sinica Concluding
DDI Lifecycle: Moving Forward Status of the Development of DDI 4. Joachim Wackerow Technical Committee, DDI Alliance
DDI Lifecycle: Moving Forward Status of the Development of DDI 4 Joachim Wackerow Technical Committee, DDI Alliance Should I Wait for DDI 4? No! DDI Lifecycle 4 is a long development process DDI Lifecycle
Image Galleries: How to Post and Display Images in Digital Commons
bepress Digital Commons Digital Commons Reference Material and User Guides 12-2014 Image Galleries: How to Post and Display Images in Digital Commons bepress Follow this and additional works at: http://digitalcommons.bepress.com/reference
Checklist for a Data Management Plan draft
Checklist for a Data Management Plan draft The Consortium Partners involved in data creation and analysis are kindly asked to fill out the form in order to provide information for each datasets that will
Affiliation: University of Massachusetts Amherst, University Libraries
Date: January 12, 2012 Name: Marilyn S Billings Email: [email protected] Affiliation: University of Massachusetts Amherst, University Libraries City, State: Amherst, MA Summary: Thank you for
Metadata Repositories in Health Care. Discussion Paper
Health Care and Informatics Review Online, 2008, 12(3), pp 37-44, Published online at www.hinz.org.nz ISSN 1174-3379 Metadata Repositories in Health Care Discussion Paper Dr Karolyn Kerr [email protected]
Functional Requirements for Digital Asset Management Project version 3.0 11/30/2006
/30/2006 2 3 4 5 6 7 8 9 0 2 3 4 5 6 7 8 9 20 2 22 23 24 25 26 27 28 29 30 3 32 33 34 35 36 37 38 39 = required; 2 = optional; 3 = not required functional requirements Discovery tools available to end-users:
Research Data Management Policy
Research Data Management Policy Version Number: 1.0 Effective from 06 January 2016 Author: Research Data Manager The Library Document Control Information Status and reason for development New as no previous
PAPER Data retrieval in the PURE CRIS project at 9 universities
PAPER Data retrieval in the PURE CRIS project at 9 universities A practical approach Paper for the IWIRCRIS workshop in Copenhagen 2007, version 1.0 Author Atira A/S Bo Alrø Product Manager [email protected]
Supporting Change-Aware Semantic Web Services
Supporting Change-Aware Semantic Web Services Annika Hinze Department of Computer Science, University of Waikato, New Zealand [email protected] Abstract. The Semantic Web is not only evolving into
Exploring the roles and responsibilities of data centres and institutions in curating research data a preliminary briefing.
Exploring the roles and responsibilities of data centres and institutions in curating research data a preliminary briefing. Dr Liz Lyon, UKOLN, University of Bath Introduction and Objectives UKOLN is undertaking
Project Information Project Acronym Kaptur
Project Information Project Acronym Kaptur Project Title Kaptur Start Date 3 rd October 2011 End Date 29 th March 2013 Lead Institution University for the Creative Arts Project Director Leigh Garrett,
The Institutional Repository at West Virginia University Libraries: Resources for Effective Promotion
The Institutional Repository at West Virginia University Libraries: Resources for Effective Promotion John Hagen Manager, Electronic Institutional Document Repository Programs West Virginia University
BYODs & FAIR Data Stewardship
BYODs & FAIR Data Stewardship Luiz Olavo Bonino [email protected] www.elixir-europe.org Summary FAIR Data stewardship Approach in NL BYOD FAIR Data tooling ecosystem Way of working (FAIR) Data Stewardship
Statement of Work (SOW) for Web Harvesting U.S. Government Printing Office Office of Information Dissemination
Statement of Work (SOW) for Web Harvesting U.S. Government Printing Office Office of Information Dissemination Scope The U.S. Government Printing Office (GPO) requires the services of a vendor that can
How To Useuk Data Service
Publishing and citing research data Research Data Management Support Services UK Data Service University of Essex April 2014 Overview While research data is often exchanged in informal ways with collaborators
Research Data Management (RDM) Roadmap August 2012 January 2014
Research Data Management (RDM) Roadmap August 2012 January 2014 Information Services RDM Policy Implementation Committee University of Edinburgh November 2012: Version 1.0 Document Status This is a living
REACCH PNA Data Management Plan
REACCH PNA Data Management Plan Regional Approaches to Climate Change (REACCH) For Pacific Northwest Agriculture 875 Perimeter Drive MS 2339 Moscow, ID 83844-2339 http://www.reacchpna.org [email protected]
STORRE: Stirling Online Research Repository Policy for etheses
STORRE: Stirling Online Research Repository Policy for etheses Contents Content and Collection Policies Definition of Repository Structure Content Guidelines Submission Process Copyright and Licenses Metadata
How To Build A Connector On A Website (For A Nonprogrammer)
Index Data's MasterKey Connect Product Description MasterKey Connect is an innovative technology that makes it easy to automate access to services on the web. It allows nonprogrammers to create 'connectors'
OPENGREY: HOW IT WORKS AND HOW IT IS USED
OPENGREY: HOW IT WORKS AND HOW IT IS USED CHRISTIANE STOCK [email protected] INIST-CNRS, France Abstract OpenGrey is a unique repository providing open access to European grey literature references,
Best Practices for Data Management. RMACC HPC Symposium, 8/13/2014
Best Practices for Data Management RMACC HPC Symposium, 8/13/2014 Presenters Andrew Johnson Research Data Librarian CU-Boulder Libraries Shelley Knuth Research Data Specialist CU-Boulder Research Computing
Canadian National Research Data Repository Service. CC and CARL Partnership for a national platform for Research Data Management
Research Data Management Canadian National Research Data Repository Service Progress Report, June 2016 As their digital datasets grow, researchers across all fields of inquiry are struggling to manage
Check Your Data Freedom: A Taxonomy to Assess Life Science Database Openness
Check Your Data Freedom: A Taxonomy to Assess Life Science Database Openness Melanie Dulong de Rosnay Fellow, Science Commons and Berkman Center for Internet & Society at Harvard University This article
ELPUB Digital Library v2.0. Application of semantic web technologies
ELPUB Digital Library v2.0 Application of semantic web technologies Anand BHATT a, and Bob MARTENS b a ABA-NET/Architexturez Imprints, New Delhi, India b Vienna University of Technology, Vienna, Austria
DIGITAL STEWARDSHIP SUPPLEMENTARY INFORMATION FORM
DIGITAL STEWARDSHIP SUPPLEMENTARY INFORMATION FORM Introduction The Institute of Museum and Library Services (IMLS) is committed to expanding public access to federally funded research, data, software,
Open Data and Information Sharing Michael Simcock, Chief Data Architect OCIO/EDMO
Open Data and Information Sharing Michael Simcock, Chief Data Architect OCIO/EDMO MARCH 2014 Open Data Exec Order & Policy: DHS Agency Actions 1. Create and maintain an Enterprise Data Inventory 2. Publish
Checklist and guidance for a Data Management Plan
Checklist and guidance for a Data Management Plan Please cite as: DMPTuuli-project. (2016). Checklist and guidance for a Data Management Plan. v.1.0. Available online: https://wiki.helsinki.fi/x/dzeacw
The PIRUS Code of Practice for recording and reporting usage at the individual article level
PIRUS A COUNTER Standard The PIRUS Code of Practice for recording and reporting usage at the individual article level (Note: this draft is for public consultation only and is not yet ready for full implementation)
2013 Research Publications Repository Survey Report September 2013
2013 Research Publications Repository Survey Report September 2013 Page 1 Table of contents Introduction... 4 Key Findings... 4 Percentage of open access records... 4 Additional repositories... 4 Long-term
Master Class 4 - Introduction to Intellectual Property and Copyright Wilna Macmillan, Director of Client Services, Library
IP and Research Data Managing your research data Consider early and review often Master Class 4 - Introduction to Intellectual Property and Copyright Wilna Macmillan, Director of Client Services, Library
Collecting and archiving tweets: a DataPool case study
Collecting and archiving tweets: a DataPool case study Steve Hitchcock, JISC DataPool Project, Faculty of Physical and Applied Sciences, Electronics and Computer Science, Web and Internet Science, University
Cambridge University Library. Working together: a strategic framework 2010 2013
1 Cambridge University Library Working together: a strategic framework 2010 2013 2 W o r k i n g to g e t h e r : a s t r at e g i c f r a m e w o r k 2010 2013 Vision Cambridge University Library will
Report of the DTL focus meeting on Life Science Data Repositories
Report of the DTL focus meeting on Life Science Data Repositories Goal The goal of the meeting was to inform and discuss research data repositories for life sciences. The big data era adds to the complexity
Service Guidelines. This document describes the key services and core policies underlying California Digital Library (CDL) s EZID Service.
http://ezid.cdlib.org Service Guidelines 1 Purpose This document describes the key services and core policies underlying (CDL) s EZID Service. 2 Introduction to EZID EZID (easy eye dee) is a service providing
Metadata for Data Discovery: The NERC Data Catalogue Service. Steve Donegan
Metadata for Data Discovery: The NERC Data Catalogue Service Steve Donegan Introduction NERC, Science and Data Centres NERC Discovery Metadata The Data Catalogue Service NERC Data Services Case study:
