DASISH. WP4 Data Archiving



Similar documents
DASISH. Workshop Trust and Certification

Data Service Infrastructure for the Social Sciences and Humanities

User Requirements for PID Service Providers: Survey Results from DASISH WP 5.2

Programmes and Effects of the CESSDA Trust Project

A federated data infrastructure: the Dutch way forward

The challenges of becoming a Trusted Digital Repository

Data Seal of Approval. Certification for sustainable and trusted data repositories

Second EUDAT Conference, October 2013 Data Management Plans and Certification Motivation: increasing importance of Data Management Planning

Assessing a Scientific Data Center as a Trustworthy Digital Repository

Interagency Science Working Group. National Archives and Records Administration

Preservation and Dissemination Policy of the LISS Data Archive

Cost and Value analysis of digital data archiving ANNA PALAIOLOGK

EUROPEAN COMMISSION Directorate-General for Research & Innovation. Guidelines on Data Management in Horizon 2020

RESEARCH DATA MANAGEMENT POLICY

PTAB Test Audit Report for. National Space Science Data Center (NSSDC) Prepared by

Checklist for a Data Management Plan draft

Research Data Management: The library s role

Data Management Plans - How to Treat Digital Sources

Response to Invitation to Tender: requirements and feasibility study on preservation of e-prints

"Best practices in digital language archiving of language and music data" Sep. 6 7, University of Cologne. Abstracts

ODUM INSTITUTE ARCHIVE SERVICES OVERVIEW IASSIST 2015

Preservation and Dissemination Policy of the LISS Data Archive

OPEN ACCESSAND DATA MANAGEMENT SUPPORTAT THE UNIVERSITY OF HELSINKI

HERON (No: ): Deliverable D.2.6 DATA MANAGEMENT PLAN AUGUST Partners: Oxford Brookes University and Università Commerciale Luigi Bocconi

EPSRC Research Data Management Compliance Report

February 22, 2013 MEMORANDUM FOR THE HEADS OF EXECUTIVE DEPARTMENTS AND AGENCIES

Managing and Sharing research Data

Writing a Wellcome Trust Data Management & Sharing Plan

Harvard Library Preparing for a Trustworthy Repository Certification of Harvard Library s DRS.

THE UNIVERSITY OF LEEDS. Vice Chancellor s Executive Group Funding for Research Data Management: Interim

Research Data Management Guide

UT Research data policy

SURFsara Data Services

Data Management Plan in Slovenia

Cost and benefits of digital preservation in a federated data infrastructure: Who pays what and who gets what in The Netherlands?

The Open Access Strategy of the Max Planck Society

Data Management Plans & the DMPTool. IAP: January 26, 2016

Open Access to scientific data. SwissCore Annual Event Brussels, 14 May 2014

The PEER Project: Investigating the Effects of Green Open Access

Implementation of the Data Seal of Approval

Questionnaire on Digital Preservation in Local Authority Archive Services

Introduction to Research Data Management for Social Scientists

Open Data in Archaeology. Prof Julian Richards Director, Archaeology Data Service University of York

The International Journal of Digital Curation Issue 1, Volume

The ISPS Data Archive: Mission, Work, and Some Reflections

Data management plan

Subject Work flows - Data Discovery and Dissemination: User Perspective ( ) DDIBestPractices_Workflows-DiscoveryAndDissemination.doc.

European Landscape Study of Research Data Management

Johns Hopkins University Data Management Services

Development of a Very Flexible Web based Database System for Environmental Research

Research Data Management Support Service Jan June 2015 End Stage Report

CITY UNIVERSITY. An evaluation of SOAS Research Online, the Institutional Repository of the School of Oriental and African Studies.

Large Scale Repository Auditing to ISO José Carvalho

CLARIN: Common Language Resources and Technology Infrastructure

Enhancing Discoverability of Public Health and Epidemiology Research Data: Summary

Project Plan DATA MANAGEMENT PLANNING FOR ESRC RESEARCH DATA-RICH INVESTMENTS

D1.3: 1 st Data Management Plan WP1 Project Management

Open Access to publications and research data in Horizon 2020

NARCIS - more than a gateway to Οpen Αccess publications in The Netherlands

DRIVER Providing value-added services on top of Open Access institutional repositories

Cost Model for Digital Preservation. Ulla Bøgvad Kejser, Preservation Specialist, PhD The Royal Library, Denmark

ENHANCED PUBLICATIONS IN THE CZECH REPUBLIC

Second EUDAT Conference, October 2013 Workshop: Digital Preservation of Cultural Data Scalability in preservation of cultural heritage data

ESRC Research Data Policy

Open Access and Open Research Data in Horizon 2020

Research Data Management Policy

IASSIST Quarterly VOLUME 34 Number 3 & / VOLUME 35 - Number 1 &

Achieving a Step Change in Digital Preservation Capability

Quality Assurance of Research Data:

Elements of Data Management Plans: A Gap Analysis and Recommendations

NWO-DANS Data Contracts

WRANGLING DIGITAL CHAOS: CHARACTERIZATION & INGEST

KNOWLEDGENT REPORT Big Data Survey: Current Implementation Challenges

nestor - Network of Expertise in Long-Term Storage and Long-Term availability of Digital Resources in Germany

Long-term preservation activities of the Bavarian State Library

A Maturity Model for Information Governance

A sustainable archiving software solution for The Language Archive

ETIP Wind Steering Committee meeting Monday 7th March :00 16:45 EWEA office, Rue d Arlon 80 6th floor Bruxelles AGENDA

SHARING RESEARCH DATA POLICY, INFRASTRUCTURE, PEOPLE

Integrating Research Information: Requirements of Science Research

Undergraduate Psychology Major Learning Goals and Outcomes i

Local Loading. The OCUL, Scholars Portal, and Publisher Relationship

Research Data Management Services. Katherine McNeill Social Sciences Librarians Boot Camp June 1, 2012

NERC Data Policy Guidance Notes

Sponsored Programs Guidance Cradle to Grave

Research Data Management - The Essentials

OpenAIRE Research Data Management Briefing paper

Trends in Data Archiving

NSF Data Management Plan Template Duke University Libraries Data and GIS Services

Research Data Management in Horizon 2020

Translation Service Provider according to ISO 17100

Benefits of conducting a Project Management Maturity Assessment with PM Academy:

Checklist and guidance for a Data Management Plan

EXECUTIVE AGENCY HORIZON 2020 PROGRAMME

D5.5 Initial EDSA Data Management Plan

Keeping Research Data Safe JISC Research Data Digital Preservation Costs Study. MPG Workshop Gottingen June 2008

ERA-CAPS Data Sharing Policy ERA-CAPS. Data Sharing Policy

Data Management Plan (DMP) Deliverable 11.5

Implementing a Metrics Program MOUSE will help you

Enabling the re-use of research data: organising stakeholders and infrastructure in the Netherlands

Transcription:

DASISH Digital Services Infrastructure for Social Sciences and Humanities WP4 Data Archiving Vigdis Kvalheim Norwegian Social Science Data Services (NSD) IASSIST Toronto 2014

DASISH PM Distribution and Partners PM CESSDA NSD, Norwegian Social Science Data Services ( 15 PM) FSD, Finish Social Science Data Archive (2 PM) 199.5 171 SND, Swedish National Data Services (5 PM) GESIS - Leibniz Institute for the Social Sciences, (6 PM) 44 83 67 68 56 34 CLARIN MPG, Max Planck Institute for Psycholinguistics (6 PM) UiB, University of Bergen (7 PM) DARIAH OEAW, Austrian Academy of Sciences (5 PM) DANS, Data Archiving and networked services (5 PM) UGOE, Goettingen University (6 PM) ESS CITY, City University, London (2 PM) SHARE CentERdata, The Netherlands (7 PM)

Archiving and Curation - Access and Sharing DASISH will rely on common data services offered by a network of strong data centres with national backing Purpose: Assess and discuss the state of data and deposit services in the SSH domain and identify gaps, bottlenecks and requirements Develop and recommend a requirements for deposit services which handle various types of data Work out and suggest policy rules and guidelines for proper data management, that can be taken up by data infrastructures providing long term preservation and curation services

WP4 Sub-tasks Task 4.1: State-of-the-art of data preservation and curation Task 4.2: Assessment of deposit services Task 4.3: Deposit service convergence Task 4.4: Recommendation of a set of policy rules Current state of data preservation and curation Analyze and describe Investigate existing deposit offers. Assess the scope of policy rules and their requirements Policies and guidelines Recommendations Service Level Agreements Establish policy rules Requirements specification Service level agreements PR and training material Implement and test the policy framework NSD 2012

D4.1 and D4.2: Fact Sheets First Year http://dasish.eu/publications/projectreports/d4.1_-_roadmap_for_preservation_and_curation_in_the_ssh.pdf http://dasish.eu/publications/projectreports/d4.2_-_report_about_preservation_service_offers.pdf

Five Level Trust Maturity Model (D4.1) Trust Maturity Level Key Guideline Guideline Source 1. OAIS Core Conformance Support OAIS Information Model. Acknowledge OAIS Archive responsibilities. OAIS Information Model: Section 2.2 of CCSDS 650.0-M-2 / ISO 14721:2012. OAIS Archive Responsibilities: Section 3.1 of CCSDS 650.0-M-2 / ISO 14721:2012. Self-assessment through PLATTER and PLATTER Key Self-assessment questions. DRAMBORA. 2. Initial self-assessment, PLATTER/DRAMBORA DRAMBORA Key Self-assessment questions. 3. Peer-reviewed self-assessment I, DSA Peer-reviewed self-assessment I, DSA. Data Seal of Approval Guidelines. 4. Peer-reviewed self-assessment II, ISO 16363/DIN 31644 Conformance to the OAIS Detailed Functional Model. Self-audit with the ISO 16363. Alternatively, self-audit with DIN 31644. Support: NESTOR criteria OAIS Detailed Functional Model: Section 4.1 of CCSDS 650.0-M-2 / ISO 14721:2012. CCSDS 652.0-M-1 / ISO 16363:2012. DIN 31644 5. Certification and Optimization External review and formal certification in conformance with the ISO 16363. Alternatively, with DIN 31644. CCSDS 652.0-M-1 / ISO 16363:2012. DIN 31644.

Nr DASISH Data Archive Description Sheet Functionality Nr Functionality Administrative context 1 Funding 2 Depositor Agreements 3 Usage Agreements, Code of Conduct to be signed 4 Policies in place 5 Rights on data claimed by the archive 6 Data Curation strategy Pre-Ingest 7 Primary community in focus for deposits 8 Secondary communities accepted for deposits Archival storage and preservation 13 Size of current archive in TB 14 Size of current archive in other means (collections, files, etc.) 15 Maximal deposit size in TB 16 Long term guarantees / standards of trust 17 Checks on quality / quality control Dissemination 18 Costs / Conditions for Access 19 Tools / Interfaces used for Access Ingest 9 Formats accepted and curated 10 Formats accepted and not curated 11 Metadata formats accepted 12 User-based ingest

Survey on data deposit service arrangements The questionnaire; based on the results and recommendations of D4.1, D4.2 and the DADS The purpose; to gain broader and more detailed insights about the organization, the state of and the degree to which data archive solutions exists across Europe and across scientific fields. Point of departure for the next steps: having in-depth interviews with selected data archive services

Survey key findings Background 45 Archive service level 40 35 30 25 20 15 10 5 0 None/No plans Plan to launch Functioning data archive

Survey key findings - Organizational context Key requirement compliance indicators: Documentation on deposit agreements, usage agreements and preservation policies..data Seal of Approval (DSA), Service Provider requirements among others.. Overall, 75 % of the services do have a licence or depositor agreement North-Western Europe the percentage of respondents confirming the existence of deposit/license agreement is somewhat higher (85 %) than South and East (53 %) Code of conduct / usage agreements are in place among 82 % of the North- Western Europe respondents; 41 % among South and East Preservation policy are in place among 62 % of the North-Western Europe respondents; for South and East it is 29 %

Survey key findings - Level of Trust 25 of 46 respondents indicate that their services have undertaken activities to determine their trustworthiness 15 respondents from existing data archive services indicate that these services have not undertaken any action in this respect yet Among the respondents from North Western Europe, 65 % mentioned certification activities (half of them on the level of peer-reviewed DSAassessment or higher); 27 % from Southern and Eastern Europe

Survey findings - Self-reported maturity level of Data Archive Services We asked the respondents if they are satisfied with the maturity level of several aspects of their data archive service. We split this item into 5 sub-items (related to the OAIS reference model) 30 25 20 15 10 5 0 No measures needed Archival storage and preservation Data archive administration Ingest facilities Dissemination facilities

The way ahead some suggestions Further steps; the selection and recommendation of appropriate data service are dependent on further analyses of survey results The next step is to complete the DADS for all or the most promising data services, except those already included, based on the competed survey and with the help of the data infrastructure/deposit service itself. D4.3: List of recommended data services (trusted centres), will be a based on the completed and verified DADS First step feed into world wide registry Updated version of the Survey Report including information on the less mature, emerging/aspiring data archives with institutional/national backing, that to various extent meet requirements recommended in 4.1, 4.2 and 4.3.

Policy Rules for Data Management Deliverable in Month 33: A Comprehensive Set of Policy Rules for Data Management Partners: NSD, UGOE, FSD, MPG, UiB, GESIS Procedure: Data Policy Description Sheet (DPDS) Assess the scope of policy rules and their requirements in collaboration with initiatives in Europe and the US Establish policy rules in close collaboration with experts and emerging collaborative data services infrastructure

IFDO Survey on Research Funders Data Policies Country-by-country information on current institutional research data policies Main focus on formal data policies Existence, contents and quality of data sharing requirements Type of linkage to funding

IFDO Data Policy Description Sheet Topic Nr. Topic Item Background information 1 Name of funder Background information 2 Homepage General policy 3 General conditions General policy 4 Data Management Plan (DMP) for Proposal General policy 5 Data Timeframe General policy 6 Guidance General policy 7 Compliance/Monitoring General policy 8 Funding / Costs General policy 9 Scope of policy Standards/Documentation 10 Documentation Requirements Standards/Documentation 11 Data Standards Standards/Documentation 12 Metadata Standards Access and preservation 13 Data Preservation Access and preservation 14 Scope of preservation provisons Access and preservation 15 Data Access / Sharing Access and preservation 16 Data Access / Sharing incentives Access and preservation 17 Data Sharing Rights (IPR) Access and preservation 18 Data Embargo / Data Retention Access and preservation 19 Data Sharing requirements / timeframe Access and preservation 20 Designated Data Repository Access and preservation 21 Data Repository Supported Access and preservation 22 Institutional (data repository) Requirements Publications 23 Open Access to Publications Publications 24 Publication Repository Specified Publications 25 Publication Repository Supported Resources/References 26 Date of policy Resources/References 27 Policy link Resources/References 28 Policy link

Well described / Required - 2009, 2012 Data Policy Description Sheet - example Direct quotes / paraphrased information from policy Links to documents containing quote(s) / paraphrase(s) Input, short. Input, free text (elaborate from previous column) - Research Council of Norway - http://www.forskningsradet.no/en/ Applies to all projects funded totally or partly by the Well described Norwegian Research Council "With regard to the use of research infrastructure for research involving the processing of large amounts of data (time series, registries, scientific collections, etc.), the progress report shall also show how the data generated are safeguarded through large-scale storage resources, data handling tools and dedicated point-topoint network connections for particularly demanding Suggested / Refers to 'progress report', not data management plan. applications." R&D Project Agreement Document "As a general rule, the formal applicant to the Research Council is to be a Norwegian institution/enterprise with a specific individual designated as the project Suggested Applies to all research data administrator". General application requirements "Unless otherwise agreed with the Research Council, copies of all research-generated data, including requisite documentation, shall be transferred from the Project Owner to the Norwegian Social Science Data Services. This shall be carried out as soon as possible All data and documentation to be deposited at designated data and at the latest two years following the conclusion of Suggested / Recommended centre the project period. R&D Project Agreement Document Suggested Applies to all research data See quote in input nr 13 /Suggested Indirectly and externally, through NSD licence/deposit form. All research-generated data; as soon as possible, max. two Required years. See quote in input nr 13 Required Norwegian Social Science Data Services (NSD) See quote in input nr 13 /Suggested Indirectly, NSD (financial support) "Scientific publications based on R&D projects funded wholly or partially by the Research Council must be made openly accessible to all interested parties". The Research Council's Principles for Open Access to Scientific Publications

Common Challenges and needs Looking at the overall picture: In many countries high-level policy recommendations has not yet led to specified national policies by key research funders. If SSH funders has formulated open access policies, they are likely to be soft recommendations without well defined requirements and guidance to follow-up and implementation of recommendations.

Common Challenges and needs Looking at the overall picture: it is still unusual to enforce projects to open their data - we need to move form policy statements to policy enforcements and monitoring too many countries lack sufficient data sharing (trusted centers) infrastructures we need to move from short-term funding to long-term funding and business models that build trust, confidence and incentives to contribute to the data infrastructure. Moving towards policy based data archiving!

Thank you for listening!