Cost and Value analysis of digital data archiving ANNA PALAIOLOGK

Similar documents
Interagency Science Working Group. National Archives and Records Administration

ENHANCED PUBLICATIONS IN THE CZECH REPUBLIC

Modelling the Costs of Preserving Digital Assets

The challenges of becoming a Trusted Digital Repository

A federated data infrastructure: the Dutch way forward

Second EUDAT Conference, October 2013 Data Management Plans and Certification Motivation: increasing importance of Data Management Planning

Harnessing Media Micro-Services for Stewardship of Digital Assets at CUNY TV. NDSR Project:

AHDS Digital Preservation Glossary

EUROPEAN COMMISSION Directorate-General for Research & Innovation. Guidelines on Data Management in Horizon 2020

Master Class 4 - Introduction to Intellectual Property and Copyright Wilna Macmillan, Director of Client Services, Library

Enabling the re-use of research data: organising stakeholders and infrastructure in the Netherlands

From Stored Knowledge to Smart Knowledge

Research Data Management Guide

SURFsara Data Services

CIP s Open Data & Data Management Guidelines and Procedures

NWO-DANS Data Contracts

Data Management Resources at UNC: The Carolina Digital Repository and Dataverse Network

Management Principles and the RIM Program

Digital preservation policy

Corporate Performance Management. Framework, Approach and Challenges Observed

Assessing a Scientific Data Center as a Trustworthy Digital Repository

NERC Data Policy Guidance Notes

ARCHIVING ARCHAEOLOGY: INTRODUCING THE GUIDES TO GOOD PRACTICE

Digital Records Preservation Procedure No.: 6701 PR2

January Brand and Campaigns Executive: Information for Candidates

DATA LIFE CYCLE & DATA MANAGEMENT PLANNING

Operations Business Administration/Support

Data Service Infrastructure for the Social Sciences and Humanities

Service Road Map for ANDS Core Infrastructure and Applications Programs

Research Data Management

Response to Invitation to Tender: requirements and feasibility study on preservation of e-prints

Feet On The Ground: A Practical Approach To The Cloud Nine Things To Consider When Assessing Cloud Storage

NSW Data & Information Custodianship Policy. June 2013 v1.0

THE BRITISH LIBRARY BOARD BLB 12/29

Data Management Best Practices for Landscape Conservation Cooperatives Part 1: LCC Funded Science

CHAPTER 3: MANAGING IMPLEMENTATION PROJECTS

Quick Guide: Meeting ISO Requirements for Asset Management

Corporate Social Responsibility and Corporate Governance in the United Arab Emirates

Whitepaper. The ABC of Private Clouds. A viable option or another cloud gimmick?

Data Seal of Approval. Certification for sustainable and trusted data repositories

Report of the DTL focus meeting on Life Science Data Repositories

Executive Diploma in Big Data Management & Analytics

Recruitment Process: Why Outsource?

JOB TITLE: Asset Management Officer CLASS: ASO3 POSITION NO.: SA0017. This Position Reports to: Project Manager, Asset Services

Data Management A key enabler to Open Data and. Phil Dana, VP DAMA-Ottawa, Partner, BMB Data Consulting

METER DATA MANAGEMENT LESSONS LEARNED LIFE AFTER GO LIVE Insight on Leveraging Integration While Preserving Technical Investment

Enterprise Content Management (ECM)

E-Content Service Group Virtual Meeting. Digital Preservation: How to Get Started

Information Technology Strategic Plan

TAX MANAGEMENT CONSULTING. How can you be more efficient at managing tax?

Sample marketing plan template

Research Data Management Framework: Capability Maturity Guide

PRINCIPLES OF RESEARCH DATA MANAGEMENT AT THE RECTOR S DECISION, 9 SEPTEMBER 2014

Databases & Data Infrastructure. Kerstin Lehnert

Local Loading. The OCUL, Scholars Portal, and Publisher Relationship

Questionnaire on Digital Preservation in Local Authority Archive Services

Summary of Critical Success Factors, Action Items and Performance Measures

Governance, Risk and Compliance (GRC) software Business needs and market trends

Faut-il des cyberarchivistes, et quel doit être leur profil professionnel?

IFI Irish Film Archive Digital Preservation & Access Strategy

North Carolina Digital Preservation Policy. April 2014

How To Write A Blog Post On Globus

North Carolina Department of Cultural Resources Digital Preservation Plan. State Archives of North Carolina State Library of North Carolina

Transcription:

Cost and Value analysis of digital data archiving ANNA PALAIOLOGK

Motivation Detailed and meaningful cost information allows: more accurate planning better forecasting and control more accountability and transparency prioritise/control the level of ambition - realistic strategy (e.g. collection levels and preservation aims, quality-quantity balance, etc) 2/19

Challenges + Terminology funding does not grow in line with information growth curation vs. storing of the data acquisition and ingest guidelines vs. regulation on preferred formats legal requirements and grant terms access - most variable area of costs 3/19

DANS case study Data Archiving & Networked Services (DANS) is an institute of the Dutch Royal Academy of Arts and Sciences (KNAW) an independent digital archive collection: 14.000 datasets (1,5 TB) available to public and 10 datasets (20 TB) not available to the public 51 employees work processes based on Open Archival Information System (OAIS) - ISO 14721:2003 mixed budget of approximately 3,8 million euro/ year costs measured in Euros per dataset next slide depicts processes and vision of DANS 4/19

SUPPORTERS INTERNAL PROCESSES CUSTOMERS INNOVATION & GROWTH MISSION IMPROVE RESEARCH DATA INFRASTRUCTURE IN SOCIAL SCIENCES & HUMANITIES Big institutional collectors make their data available for free through DANS Leading partner in standards of infrastructure (provide innovative solutions) Increased use & re-use of data stored in EASY Complete coverage of fields we are active in Users, clients and partners satisfaction Increase awareness amongst research community, students and partners Increased demand for consultancy Increased number of datasets available to end user Dataset access aligned to national and international law All datasets available from one portal Effective administrative management Compliance to international standards Efficiency of archiving process Sustainable sources of revenue Growth in the number of supporters 5/19

Budget distribution Staff is the major resource pool in digital archiving, up to 65 70% of total expenses Data Acquisition Office IT services and equipment Staff Total 14,2% 14,3% 7% 64,5% 100% Staff needs to do tasks bringing the most value. Rest needs to be automated. 6/19

18% 17,02% 16% 14% 14,03% 12% 10,42% 10% 8% 7,31% 8,20% 7,07% 6% 4% 2% 0% % of money spent on each activity 7

Workload allocation per discipline 50% 45% 40% 35% 30% History 25% 20% 15% Social Sciences Archaeology 10% 5% 0% Ingest Archival Storage Data Management Access Preservation Archival Administration

Key findings In long term data archiving: 1. Up-front costs of acquisition and ingest of data (70-90% of total) dominate the long-term costs of storage and preservation. 2. Up-front costs dominated by staff time rather than hardware or other technology costs. 3. Long-term costs scale weakly, if at all, with the size of an archive. Preserving 10 TB is not that much more expensive than preserving 10 GB. 9/19

ABC methodology resource cost drivers activity cost drivers Resources Activities Cost Objects

T O T A L C O S T S O F T H E O R G A N I S A T I O N SALARY RESOURCE POOL NON-SALARY RESOURCE POOL Staff Data Acquisition Office IT Equipment and Services SALARY RESOURCE DRIVERS NON-SALARY RESOURCE DRIVERS Archivists General ICTa ICTb (detailed analysis out of scope of this paper) ACTIVITIES Project Acquisition Preparation Projects Dissemination Interorganisational Assistance and Liaison Indirect acquisition Direct Acquisition Submission Negotiation Maintenance of archival system Functional management of the technical infrastructure Development of Archival System Improvement of dataset presentation/ access Ingest Archival Storage Access Data Management Preservation Archival Administration Administrative Support General Management Project/ Functional team Management External Relations & Networking Marketing and PR Trainings and Education ACTIVITY COST DRIVERS # of partners # of domains Attitude of researcher # of functions Completeness of metadata # of files # of employees # of trainings Duration of project Complexity # of privacy protected files # of projects COST OBJECTS Dataset of Archaeology Dataset of Humanities Dataset of Social Sciences 11

Practicalities of ABC methodology ABC data collection How-To Dedicating a person to be responsible for collecting the cost information Do not overwhelm staff with information Do not expect all staff to be on the same page from the beginning Run a trial for a day or a week Ask staff to report separately on activities outside the Model Allow for a general comments field Leave, sickness or absence should be specified separately 12/19

Value + Economic Impact Analysis Methods being applied to: - report published - in progress - in progress 13/19

Benefits data collection Desk-research sources: Organisation and infrastructure evaluation reports Documentation on data usage and users Internal (management) reports Annual and mid-term reports Interviews with: Organisation management and staff Policy makers and practitioners Government institutions Non-academic and private sector representatives Online-survey addressed to: Depositors and users 14/19

Economic measures of value Investment value: annual operational funding & the costs that depositors face in preparing data for deposit and in making that deposit Use value: average user access costs x number of users Contingent value: the amount users are "willing to pay or willing to accept in return for giving up access Efficiency gain: user estimates of time saved by using the Data Service resources Return on investment: estimated return with time (30yrs) 15/19

INVESTMENT & USE VALUE (Direct) CONTIGENT VALUE (Stated) EFFICIENCY IMPACT (Estimates) RETURN ON INVESTMENT (Scenarios) WIDER IMPACTS (Not Measured) Survey User Community (registered users) Wider User Community Wider Research Community Society Investment Value Amount spent on producing the good or service Use Value Amount spent by users to obtain the good or service Willingness to Pay Maximum amount user would be willing to pay Consumer Surplus Total willingness to pay minus the cost of obtaining Net Economic Value Consumer surplus minus the cost of supplying Willingness to Accept Minimum amount user would be willing to accept to forego good or service Survey User Community Estimated value of efficiency gains due to using service Wider User Community Estimated value of efficiency gains due to using service Increased Return on Investment in Data Creation Estimated increase in return on investment in data creation arising from the additional use facilitated by service? 16/19

Next steps Costs Refine cost drivers Allocate the other-than-staff costs to activities Experiment with other cost objects Develop the matrix of dataset complexity Apply economic adjustments Test reliability and accuracy Develop/Customise software to make ABC easy to use Value/Benefits Develop the benefits framework further Collect more diverse/detailed data Verify results

References 1. Rob Baxter, EUDAT Sustainability Plan, WP2-D2.1.1-v.1.0, October 2012. Available from: http://www.eudat.eu/deliverables/d211-sustainability-plan 2. Neil Beagrie, Julia Chruszcz, Brian Lavoie and Matthew Woollard, Keeping Research Data Safe 1 and 2, 2008 and 2010. Available from: http://www.beagrie.com/krds.php 3. Neil Beagrie, John Houghton, Anna Palaiologk and Peter Williams, Economic Impact Evaluation of the Economic and Social Data Service, 2012. Available from: http://www.esrc.ac.uk/_images/esds_economic_impact_evaluation_tcm8-22229.pdf 4. Anna Palaiologk, Anastasios Economides, Heiko Tjalsma and Laurents Sesink, An activity-based costing model for long-term preservation and dissemination of digital research data: the case of DANS, International Journal on Digital Libraries, 2012. Available from: http://link.springer.com/content/pdf/10.1007%2fs00799-012-0092-1 19/19