Research Data Management 1
Why to we need to Manage Data? 2
Data Management Planning Typically covers: - What data will be created (format, types) and how? - How will the data be documented and described? - How will you manage ethics and intellectual property? - What are the plans for data sharing and access? - What is the strategy for long-term preservation? 3
What is data curation? (so many boxes) the active management and appraisal of data over the lifecycle of scholarly and scientific interest Data management is part of good research practice 4
External Drivers: Australian Code for the Practice for the responsible conduct of Research (2007) The central aim is that sufficient materials and data are retained to justify the outcomes of the research and to defend them if they are challenged. The potential value of the material for the further research should also be considered, particular where the research would be difficult or impossible to repeat. ARC Funding Rules The ARC considers data management planning an important part of the responsible conduct of research and strongly encourages the depositing of data arising from a Project in an appropriate publically accessible subject and/or institutional repository. 5
And it s not just Australia International Funding Agencies National Science Foundation (NSF) National Science Board (NSB) Inter-University Consortium for Political and Social Research (ICPSR) National Institute of Health (NIH) Publishers with open data policies: Nature Publishing Science Cancer Research Cell PLoS one 6
What is UoN Doing? Rationale: Researchers need to be supported to manage their data well and comply with legal and funder policies Trend towards funders wanting to ensure public money spent on research is maximised this means ensuring research data is preserved for reuse. Provide institutional support to meet the needs of our researchers. Aim: To produce a technical solution and review the governance model based on researchers requirements to meet needs across multiple research groups/disciplines, taking into account the institutional landscape, its policies and compliance to The Australian code for the responsible conduct of research or The Code 7
Library DM Services Creating citable profiles describing research data to be published to Research Data Australia and the web Advice on data management planning (DMP) DMP templates DMP checklists Support for publishing supplementary datasets underpinning journal publication Information about data archiving and storage for current and completed projects Lots of help here: http://libguides.newcastle.edu.au/datamanagement 8
UoN Research Data Catalogue Under the Seeding the Commons and Metadata Stores ANDS funded projects, the University Library in conjunction with Research Services and IT Services Developed a UON Research Data Catalogue which: Enabled rich descriptions about UoN research datasets and researchers to be published to the national Research Data Australia portal and; Established the infrastructure to enable these datasets to be discoverable on the web, Developed a range of services to support researchers with data management planning at UoN. 9
Current Storage Options UoN Research Data Store (at full capacity) Portable drives Sharepoint Wikis Dropbox Research Data Storage Infrastructure (RDSI) National eresearch Collaboration Tools and Resources (NeCTAR) Australian Research Collaboration Services (ARCS) Data Fabric 10
Grant Proposal Data Management Plan Data preservation tools: Crate-IT Upload to repository for re-use eresearch Tools Repository Solution Infrastructure 11
OwnCloud 12
Researcher Benefits I can access my data, through one application I can easily share and find data within my research group I can publish my data, under my control, with the wider community I m freed up from some of my data management duties to concentrate on my research Researchers spending less time managing data, getting more value for their efforts and freeing more time for research activities. 13
Researcher Benefits Locally managed data in a repository (aligns with new ARC funding rules) Data becomes discoverable globally (fostering new research collaborations and potential reuse of the data) Persistently identified data (enabling data to be a first class research output) Persistent identification enables data to be cited 14
Institutional Benefits Researcher time spent on research Linking the researcher with core infrastructure services, IT Services, Research Services, Library Achieve Compliance with funder & legal obligations Tie in to central infrastructure services (that is backed up and 24/7 service) rather than dispersed and dysfunctional approaches to storage of data (USB hard drives.) Provenance - Complete history of research project stored Close a gap in the research life cycle and researcher support 15
Thank you Ton Dijkgraaf and Kai Chen 16