Project Number: Project Title: Human Brain Project. HBP_SP13_EPFL_ _D13.3.2_Final.docx
|
|
|
- Kimberly Parrish
- 9 years ago
- Views:
Transcription
1 Project Number: Project Title: Human Brain Project Document Title: Document Filename (1) : Deliverable Number: Deliverable Type: HBP Data Management Plan HBP_SP13_EPFL_ _D13.3.2_Final.docx D Planning Document Work Package(s): WP 13.3 Dissemination Level: PP Planned Delivery Date: M6 / 31 March 2014 Actual Delivery Date: M8 / 02 May 2014 Author: Nenad BUNCIC, EPFL (P1) Author Author: Felix SCHÜRMANN, EPFL (P1) Author Editor: Richard WALKER, EPFL P01 Reviewer & Editor Abstract: Keywords: This document describes the management life cycle for the data that will be collected, deposited, shared, processed, generated or derived by the Human Brain Project (HBP). data, data management, data management life cycle, data storage, data preservation, metadata HBP_SP13_EPFL_ _D13.3.2_Final 2-May-2014 Page 1 of 19
2 Table of Contents 1. Executive Summary Introduction Background Purpose Scope General Principles Data Description Data set Identification Card Formats and Quality Assurance Access and Sharing Access Procedures Technical Access Methods Access Charges Restrictions on Access Data Storage and Storage, Replication, and Versioning Provenance Security Intellectual Property Rights Software Licensing Model Data and Software Release Process Ethics Adherence... 9 Annex A: Data Classes in the HBP Strategic Mouse Brain Data for the HBP Mouse Brain Atlas Strategic Human Brain Data for the HBP Human Brain Atlas Theoretical Neuroscience Data Algorithms, Cognitive Models and Computing Principles for Implementation in HPC, Neuromorphic and Neurorobotic Systems Atlases and Brainpedia Brain Simulation Data High Performance Computing Data Medical Informatics Data Neuromorphic Data Neurorobotics Data Applications Data Ethics and Society Data Software and Technical Documentation Administrative and Management Documentation Annex B: Data Set Identification Card Annex C: Initial Tasks and Annex D: References HBP_SP13_EPFL_ _D13.3.2_Final 2-May-2014 Page 2 of 19
3 1. Executive Summary This document describes the management life cycle for the data that will be collected, deposited, shared, processed, generated or derived by the Human Brain Project (HBP). The HBP, a ten-year research project funded by the European Commission (EC), is laying the foundations for a new approach to brain research. The fields of neuroscience, medicine and information technology each have important roles to play in addressing this challenge, but the knowledge and data that each is generating are currently very fragmented. The HBP is driving integration of these different contributions and catalysing a community effort to achieve a new understanding of the brain, new treatments for brain disease, and new brain-like computing technologies. To support this effort, the HBP is creating an integrated system of Information and Communication Technology (ICT) Platforms, offering services to neuroscientists, clinical researchers, information technology developers and roboticists. The total amount of data (experimental data, model data, simulation artefacts, analysis data, software, documentation) integrated and generated by the HBP will be in the petabyte to exabyte scale range. All data and software will be accessible via the HBP Unified Portal to all registered and active HBP users. The data will be provided in the original file format. To describe the data, metadata will be either embedded or generated in XML format and stored alongside the data. The data and software accessible via the HBP Unified Portal will be used in the format and context in which it is provided. We propose that the data and the HBP Software be licensed under a "dual licensing" model. Under this model, users may choose to use the data and/or the HBP Software under the free software/open source or under the commercial license terms. We believe a dual licensing model can sustain both innovation and growth, and that by adopting this licensing model, we can encourage freedom of development and contribution. HBP_SP13_EPFL_ _D13.3.2_Final 2-May-2014 Page 3 of 19
4 2. Introduction 2.1 Background The goal of the Human Brain Project (HBP) is to build a completely new information and communication technology (ICT) infrastructure for neuroscience, and for brain-related research in medicine and computing. This ICT infrastructure will catalyse a global collaborative effort to understand the human brain and its diseases, and ultimately to emulate its computational capabilities. To achieve the goal, The HBP is developing six ICT platforms dedicated to Neuroinformatics, Brain Simulation, High-Performance Computing (HPC), Medical Informatics, Neuromorphic Computing and Neurorobotics. The HBP Platforms will make it possible to federate neuroscience data from all over the world, to integrate the data into unified models and simulations of the brain, to validate the results against empirical data from biology and medicine, and to make the data available to the global scientific community. Gathering all existing knowledge about the human brain, and reconstructing it in multiscale models and supercomputer-based simulations of those models, will offer a fundamentally new understanding of the human brain and its diseases. It will also spur the development of novel, brain-like computing technologies. The HBP is committed to having its data and software accessible via the HBP Unified Portal to all registered and active HBP users. 2.2 Purpose The purpose of this plan is to describe the data management life cycle for the data that will be collected, deposited, shared, processed, generated or derived by the HBP. 2.3 Scope The document outlines the data management policy that the HBP Platforms will use to organise and maintain all data generated by the project. This document will evolve over the lifetime of the HBP as the project expands, and as the strategy for HBP data management develops. HBP_SP13_EPFL_ _D13.3.2_Final 2-May-2014 Page 4 of 19
5 3. General Principles 3.1 Data Description The HBP will generate and integrate large amounts of experimental data, model data, simulation artefacts, analysis data, software, and documentation see Annex A for a list of HBP data classes. The total amount of data integrated and generated by the HBP will be in the petabyte to exabyte scale range Data set Identification Card HBP data providers are required to produce a Data Set Identification Card (DIC) see Annex B - for each data set that details the appropriate storage, backup, replication, versioning, archiving and preservation policies. DICs cover the points below on a data setby-data set basis, and reflect the status of data production in the project as a whole. Data set reference and name: Identifier for the data set. Data set description: Description of the data generated or collected, its origin (in case it is collected), and its nature and scale. metadata: Reference to existing standards. If these do not exist, there should be an outline on how and what metadata will be created. Data sharing: Description of how data will be shared, including access procedures and embargo periods (if any). preservation: preservation (including storage and backup) policies Formats and Data will be provided in the original file format. To describe the data, metadata will be either For XMLstored metadata, the format must be compliant with the XML standards. XML compliance checking will ensure that the metadata use appropriate XML schemas. The data providers will be responsible for ensuring that the metadata content is correct and maintained appropriately. Also, the data providers will implement systematic procedures for the periodic checking of metadata. From the start of the project, the Dublin Core Standard 1 will be applied during metadata creation. Other metadata standards will be defined and applied accordingly Quality Assurance The HBP will use internationally accepted ontologies and will develop and maintain collaborative workflows for data curation. A process for developing and curating neuroscience ontologies, defined by the Neuroscience Information Framework (NIF) 2, will drive the development, curation and dissemination of the ontologies needed to annotate and share neuroscience data and models. The ontologies will be distributed via the HBP Unified Portal. HBP_SP13_EPFL_ _D13.3.2_Final 2-May-2014 Page 5 of 19
6 3.2 Access and Sharing Access Procedures The HBP Platforms will provide access to all the data, software, workflows, and documents for all registered and active HBP users. The HBP users will access the data via a single point of access (the HBP Unified Portal) shared among all HBP Platforms. A review committee will vet the HBP user account applications to ensure that rare and valuable resources (i.e. supercomputing resources, high performance storage resources, etc.) are used in the most efficient and optimised way. SP6 will develop the HBP Unified Portal, and SP13 will provide the shared infrastructure that it will run on Technical Access Methods SP5 (T Shared Data Space) will provide infrastructure, software, and interfaces that will allow users to share and access data distributed to sites across the world. The Shared Data Space will allow neuroscientists to upload, access and analyse experimental data and computational models. The data will be discoverable and identifiable by means of a unique Digital Object Identifier (DOI). A software ecosystem will be developed by SP6, with workflows and algorithms for model reconstructions (T6.1.1). Functionality will be implemented to query the HBP atlases during model reconstruction, and to import the data into the formats required for specific reconstruction tasks (T6.1.2). SP7 (WP7.4) will design and develop technology to manage very large volumes of experimental and simulation data for analysis, model building and model validation. Novel spatial indexes, implemented by SP7, will allow scalable querying of massive neuroscience data sets located in external data centres, cloud systems and supercomputer storage (T7.4.1). The SP8 (WP8.1) will build the tools necessary to federate clinical databases around the world, to effectively anonymise patient data, to make the data available to clinical researchers, and to mine the data for clinically relevant information. A software framework will be developed to allow researchers to query clinical data stored on hospital and laboratory servers, without moving the data from the servers where it resides (in situ querying) and without compromising patient privacy (T , T8.1.4) Access Charges For a non-commercial and academic use, there will be no charge for the use of the HBP Unified Portal. For a commercial use, a commercial license will be required Restrictions on Access To access the HBP Unified Portal all users will be required to register and to have a valid HBP account. The registration process will be open to all partners of the HBP. However, a review committee will vet the HBP user account applications to ensure that rare and valuable resources (i.e. supercomputing resources, high performance storage resources, etc.) are used in the most efficient and optimised way. Access policies to the HBP Unified Portal for users external to the HBP will be defined in subsequent versions of this data management plan in time for the general release of the HBP Platforms (Month 30). HBP_SP13_EPFL_ _D13.3.2_Final 2-May-2014 Page 6 of 19
7 3.3 Data Storage and Storage, Replication, and Versioning Data providers, such as institutes and laboratories, and resource providers, such as HPC data centres, will be responsible for providing storage and backup resources for the raw data sets. SP13 (T13.3.1) will manage metadata storage, backup and replication. The metadata catalogue will be replicated across several zones to facilitate queries and to provide redundancy and resilience. The data and resource providers are responsible for creating and implementing maintenance plans and policies, in accordance with the relevant industry standards and best practices, to manage the data and metadata backup and recovery process. Specific storage, backup, replication and versioning policies will be described in more detail in the DICs Provenance The HBP is committed to describing the generated and derived data in sufficient detail to facilitate reproduction and validation of results. Whenever possible, the chain of ownership will be recorded, as will the transformations the data have undergone The HBP Platforms will ensure that data are archived and migrated to new formats, platforms, and storage media in accordance with the good practices of the digital preservation community. The resource providers will address succession planning for digital assets and will designate a successor in the unlikely event that such a need arises. Specific archiving and preservation policies will be described in more details in the DICs. 3.4 Security Access to the data and software will be managed via the HBP Unified Portal. All users will have a project-wide HBP account. The Access Control Lists (ACL) will specify which users or system processes are granted access to objects, as well as what operations are allowed on given objects. The HBP data providers are required to provide the DIC for each data set, detailing sharing and security policies. The DIC will be added to this Data Management Plan. 3.5 Intellectual Property Rights Data owners and their institutions hold the copyright for the data they generate in accordance to the HBP Consortium Agreement. Access to these data for implementation and use by other HBP partners is similarly regulated in the HBP Consortium Agreement. Usage of the data by third parties may be subject to additional agreements Software Licensing Model We propose that the HBP Software (software entirely generated within the Project) be licensed under a "dual licensing" model. Under this model, users may choose to use the HBP Software under the free software/open source or under the commercial license terms. HBP_SP13_EPFL_ _D13.3.2_Final 2-May-2014 Page 7 of 19
8 Under open source license, the HBP Software will be available free of charge, as long as the use adheres to the terms of the license. Commercial use of the HBP Software will be possible under the commercial license terms. We believe a dual licensing model can sustain both innovation and growth, and that by adopting this licensing model, the HBP Software can remain open-source and encourage freedom of development and contribution. Where appropriate, preference will be given to the use of HBP software through the HBP Unified Portal. Different licensing models might be applicable for software generated outside the project, software existing prior to the project, or software generated partially within the project Data and Software Release Process All data and software will be accessible via the HBP Unified Portal to all registered and active HBP users. The data and software accessible via the HBP Unified Portal will be used in the format and in the context in which it is provided. The data and software will follow five lifecycle phases: 1) Creation: The initial phase and the data or software origin point. 2) Platform Distribution: The phase and the process of managing the data or software once it has been created or received. 3) Internal Use: This takes place after data or software is distributed internally. 4) IP Protection and Licensing: This phase refers to the recognition of exclusive rights to data or software. 5) Release: The last phase when data or software will be released outside of the HBP, in accordance with the licensing model. Exceptional Requests (requests prior to Release) for raw data and software binary or source code can be made, and will be considered under the following conditions: 1) Requested data or code does not include confidential data. 2) Purposes of the study/analysis/investigation are approved by the HBP. 3) Results and format of the study/analysis/investigation are presented to the HBP for further discussion, modification and acceptance prior to public release. 4) An agreement exists or is made between the HBP and the entity requesting data in terms of partnership and resource sharing to conduct data analyses. 3.6 Ethics Privacy issues arise when data (e.g. patient records, hospital information records, biological traits and genetic material) are collected and stored. The handling of digital personal data is a major concern because of the processing possibilities and the potential to link vast amounts of personal data. Privacy concerns exist with any data that either alone or when linked relate to an identifiable individual or individuals. The HBP data providers will be responsible for rendering the data anonymous (when obvious individual identifiers are removed) and non-identifiable (when an individual cannot be identified either immediately from the data or when the data are combined with other data). HBP_SP13_EPFL_ _D13.3.2_Final 2-May-2014 Page 8 of 19
9 Where required, human subjects contributing data to the project (e.g. in SP2 and SP3) will be asked to explicitly consent to the sharing of their anonymised data within the HBP and within the research community. 3.7 Adherence for implementation of individual Tasks are set out in Annex C. Adherence to this data management plan will be controlled by SP13 specifically T (Central Support IT Services). HBP_SP13_EPFL_ _D13.3.2_Final 2-May-2014 Page 9 of 19
10 Annex A: Data Classes in the HBP Strategic Mouse Brain Data for the HBP Mouse Brain Atlas SP1 (Strategic Mouse Brain Data) is responsible for generating strategic mouse brain data, and complementing or completing existing data on the structure of the mouse brain. T1.3.1 (University of Edinburgh) is responsible for the data aggregation and depositing into the HBP Mouse Brain Atlas. SP5 (Neuroinformatics Platform) is responsible for the data integration, federation, management and administration. Mouse brain transcriptomics and proteomics data; and mouse brain macrostructure, vasculature, cells and synapses data. The data will be provided in the original file format. will be either SP5 (Neuroinformatics Platform) is responsible for making the data available and accessible in the Shared Data Space. SP13 (Management) T will provide the infrastructure for accessing the data via HBP Unified Portal. Access and sharing of the data will be granted in accordance with the HBP user account rights. The data will be stored in the Shared Data Space provided, managed, and administered by SP5 (Neuroinformatics Platform) T SP5 will implement, within the scope of the Shared Data Space (T5.1.1), a digital preservation process to ensure integrity and authenticity of the data. Strategic Human Brain Data for the HBP Human Brain Atlas SP2 (Strategic Human Brain Data) is responsible for generating strategic human multi-level data that parallels the data collected for mouse to facilitate the use of mouse data to predict human data. T2.2.2 (Jülich) is responsible for the data aggregation and depositing into the HBP Human Brain Atlas. SP5 (Neuroinformatics Platform) T5.6.1 is responsible for the data integration, federation, management and administration. Structural and functional human brain data, neuron morphologies, numbers and distributions of neurons and glia, distribution of receptors in the human cerebral cortex, and connection maps. The data will be provided in the original file format. will be either SP5 (Neuroinformatics Platform) is responsible for making the data available and accessible in the Shared Data Space (T5.1.1). SP13 (Management) T will provide the infrastructure for accessing the data via HBP Unified Portal. The data will be stored in the Shared Data Space provided, managed, and administered by SP5 (Neuroinformatics Platform) T SP5 will implement, within the scope of the Shared Data Space (T5.1.1), a digital preservation process to ensure integrity and authenticity of the data. HBP_SP13_EPFL_ _D13.3.2_Final 2-May-2014 Page 10 of 19
11 Theoretical Neuroscience Data SP4 (Theoretical Neuroscience) is responsible for producing simplified models of complex brain structures and dynamics; rules linking learning and memory to synaptic plasticity; large-scale models creating a bridge between high-level behavioural and imaging data; and mathematical descriptions of neural computation at different levels of brain organisation. SP6 (Brain Simulation Platform) and SP13 (Management) T will share responsibility for the data integration, federation, management and administration. Simplified models and large-scale models. The data will be provided in the original file format. will be either SP5 (Neuroinformatics Platform) is responsible for making the data available and accessible in the Shared Data Space (T5.1.1). SP13 (Management) T will provide the infrastructure for accessing the data via HBP Unified Portal. The data will be stored in the Shared Data Space provided, managed, and administered by SP5 (Neuroinformatics Platform) T SP5 will implement, within the scope of the Shared Data Space (T5.1.1), a digital preservation process to ensure integrity and authenticity of the data. Algorithms, Cognitive Models and Computing Principles for Implementation in HPC, Neuromorphic and Neurorobotic Systems SP3 (Cognitive Architectures) is responsible for providing functional mapping data, cognitive architectures and models for the HBP Human Brain Atlas. SP4 (Theoretical Neuroscience) is responsible for providing algorithms, cognitive models and computing principles for the HBP Human Brain Atlas. SP5 (Neuroinformatics Platform) T5.6.1 is responsible for the data integration, federation, management and administration. The algorithms and computing principles will be used in HPC, neuromorphic and neurorobotic systems. Functional mapping data, cognitive architectures and models, and algorithms and computing principles. The data will be provided in the original file format. will be either SP5 (Neuroinformatics Platform) is responsible for making the data available and accessible in the Shared Data Space. SP13 (Management) T will provide the infrastructure for accessing the data via HBP Unified Portal. Access and sharing of the data will be granted in accordance with the HBP user The data will be stored in the Shared Data Space provided, managed, and administered by SP5 (Neuroinformatics Platform) T SP5 will implement, within the scope of the Shared Data Space (T5.1.1), a digital preservation process to ensure integrity and authenticity of the data. HBP_SP13_EPFL_ _D13.3.2_Final 2-May-2014 Page 11 of 19
12 Atlases and Brainpedia SP5 (Neuroinformatics Platform) is responsible for building the atlases, data integration, federation, management and administration. Mouse Brain Atlas, Human Brain Atlas, and Brainpedia. The data will be provided in the original file format. will be either SP5 (Neuroinformatics Platform) T5.6.1 is responsible for making the data available and accessible in the Shared Data Space. SP13 (Management) T will provide the infrastructure for accessing the data via HBP Unified Portal. The data will be stored in the Shared Data Space provided, managed, and administered by SP5 (Neuroinformatics Platform) T SP5 will implement, within the scope of the Shared Data Space (T5.1.1), a digital preservation process to ensure integrity and authenticity of the data. Brain Simulation Data SP6 (Brain Simulation Platform) is responsible for delivering an Internetaccessible collaborative platform for data-driven predictive reconstruction and simulation of brain models. Brain models, simulation results, and validation tests. The data will be provided in the original file format. will be either HPC Data Centres (Jülich, CINECA, CSCS) will provide the storage infrastructure. The data will be stored in the respective HPC Data Centres (Jülich, CINECA, CSCS). HBP_SP13_EPFL_ _D13.3.2_Final 2-May-2014 Page 12 of 19
13 High Performance Computing Data SP7 (High Performance Computing Platform) is responsible for providing supercomputing, Big Data and Cloud capabilities at the exascale level, as well as the system software, middleware, interactive computational steering and visualisation support necessary to create and simulate multi-scale brain models and to address the hard-scaling challenges of whole brain modelling. Simulation artefacts. The data will be provided in the original file format. will be either HPC Data Centres (Jülich, CINECA, CSCS) will provide the storage infrastructure. The data will be stored in the respective HPC Data Centres (Jülich, CINECA, CSCS). Medical Informatics Data SP8 (Medical Informatics Platform) is responsible for building the tools to federate clinical data, recruiting hospitals to use the system, and developing tools to extract unique biological signatures of disease making it possible to develop a new, comprehensive classification of brain diseases. Brain scans; electrophysiology data; electro- encephalography and genotyping data; metabolic, biochemical and haematological profiles; and clinical instruments data. The data will be provided in the original file format. will be either SP8 (Medical Informatics Platform) T8.4.2 is responsible for building a federated data space for clinical data. The data will be federated and anonymised (D8.6.1). In-situ distributed database querying (T8.1.1) will be developed to facilitate search and discovery of the data. HBP_SP13_EPFL_ _D13.3.2_Final 2-May-2014 Page 13 of 19
14 Neuromorphic Data SP9 (Neuromorphic Computing Platform) is responsible for allowing non-expert neuroscientists and engineers to perform experiments with configurable Neuromorphic Computing Systems (NCS) implementing simplified versions of brain models. Simplified brain models, neuromorphic circuits, configuration files, experimental environments, experiments and experimental results, and simulation results. The data will be provided in the original file format. will be either SP9 (Neuromorphic Computing Platform) is responsible for making the data available and accessible in the Shared Data Space. SP13 (Management) T will provide the infrastructure for accessing the data via HBP Unified Portal. The data will be stored in the Shared Data Space provided, managed, and administered by SP5 (Neuroinformatics Platform) T SP5 will implement, within the scope of the Shared Data Space (T5.1.1), a digital preservation process to ensure integrity and authenticity of the data. Neurorobotics Data SP10 (Neurorobotics Platform) is responsible for allowing researchers to design and run simple experiments in cognitive neuroscience using simulated robots and simulated environments linked to simplified versions of HBP brain models. Simplified brain models, configuration files, experimental environments, experiments and experimental results, and simulation results. The data will be provided in the original file format. will be either SP10 (Neurorobotics Platform) is responsible for making the data available and accessible in the Shared Data Space. SP13 (Management) T will provide the infrastructure for accessing the data via HBP Unified Portal. The data will be stored in the Shared Data Space provided, managed, and administered by SP5 (Neuroinformatics Platform) T SP5 will implement, within the scope of the Shared Data Space (T5.1.1), a digital preservation process to ensure integrity and authenticity of the data. HBP_SP13_EPFL_ _D13.3.2_Final 2-May-2014 Page 14 of 19
15 Applications Data SP11 (Applications) is responsible for testing and refining the pre-release versions of the ICT platforms, providing early, small-scale demonstrations of their potential for research in neuroscience, medicine and computing and preparing for more ambitious research in the operational phase. Configuration files, experimental environments, and experiments and experimental results. The data will be provided in the original file format. will be either The respective platform on which the data are run will be responsible for making the data available and accessible in the Shared Data Space. SP13 (Management) T will provide the infrastructure for accessing the data via HBP Unified Portal. The data will be stored in the Shared Data Space provided, managed, and administered by SP5 (Neuroinformatics Platform) T SP5 will implement, within the scope of the Shared Data Space (T5.1.1), a digital preservation process to ensure integrity and authenticity of the data. Ethics and Society Data SP12 (Ethics And Society) is responsible for exploring the project s social, ethical and philosophical implications, promoting engagement with decisionmakers and the general public, fostering responsible research and innovation by raising social and ethical awareness among project participants, and ensuring that the project is governed in a way that ensures full compliance with relevant legal and ethical norms. Data showing all ethics applications by partners across the project. The data will be in the database. SP13 (Management) T will provide the infrastructure for accessing the data via HBP Unified Portal. The data will be stored in the database. SP13 (T13.3.1) will implement a process to ensure integrity and of the data. HBP_SP13_EPFL_ _D13.3.2_Final 2-May-2014 Page 15 of 19
16 Software and Technical Documentation Archiving & SP5 (Neuroinformatics Platform), SP6 (Brain Simulation Platform), SP7 (High Performance Computing Platform), SP8 (Medical Informatics Platform), SP9 (Neuromorphic Computing Platform) and SP10 (Neurorobotics Platform) are responsible for delivering Internet-accessible collaborative platforms for reconstruction and simulation of brain models. Software, configuration files and validation tests. The data will be provided in the original file format All Platforms are responsible for making the software available and accessible in the Central Software Repository (Git). SP13 (Management) T will provide the infrastructure for running a Git Server. The data will be stored in the Central Software Repository provided, managed, and administered by SP13 (Management) T Administrative and Management Documentation SP13 (Management) is responsible for supporting HBP decision-making, operating the management structure and European Research Programme, ensuring transparency and accountability toward funders and stakeholders, and maintaining standards of quality and performance. Agenda and minutes of meetings, deliverables, and quarterly reports. The data will be provided in the original file format. SP13 (Management) T will provide the infrastructure for accessing the data via EMDESK Research Project Management Platform. The data will be stored in the EMDESK Research Project Management Platform database provided, managed, and administered by SP13 (Management) T HBP_SP13_EPFL_ _D13.3.2_Final 2-May-2014 Page 16 of 19
17 Annex B: Data Set Identification Card DATA SET IDENTIFICATION CARD Name SPID TID Description metadata Access, sharing, and privacy Storage and backup preservation Unique name of the data set. The name should be in the following form: Subproject ID Task ID SPnnWPnnTnnDSnnnn_human_readable_name Description of the data that will be generated or collected, format, its origin, nature and scale. Reference to existing data and metadata standards. If these do not exist, an outline on how and what metadata will be created. Description of how data will be shared, including access procedures, embargo periods (if any), outlines of technical mechanisms for dissemination and necessary software and other tools for enabling re-use, and definition of whether access will be widely open or restricted to specific groups. Identification of the repository where data will be stored, if already existing and identified, indicating in particular the type of repository (institutional, standard repository for the discipline, etc.). In case the data set cannot be shared, the reasons for this should be mentioned (e.g. ethical, rules of personal data, intellectual property, commercial, privacy-related, security-related). Description of where the primary data resides as well as the data backup plan including backup frequency and backup retention (e.g. incremental and transaction log backups hourly, so that in the event of a catastrophic failure, researchers will lose no more than 1 hour of work. Retention period of 10 years to allow researchers to be able to restore the data set to any point in time within the last 10 years). Description of the procedures that will be put in place for long-term preservation of the data. Indication of how long the data should be preserved, its approximated end volume, and the associated costs and how they will be covered. Notes HBP_SP13_EPFL_ _D13.3.2_Final 2-May-2014 Page 17 of 19
18 Annex C: Initial Tasks and Task Description Leader Institution T1.3.1 Deposit data in the HBP mouse brain atlas Douglas ARMSTRONG University of Edinburgh T2.2.2 Deposition of Human Brain data in the HBP Brain Atlas and Brainpedia Katrin AMUNTS Forschungszentrum Jülich T5.1.1 Shared data space Sten GRILLNER Karolinska Institutet T5.6.1 Neuroinformatics Platform integration, website construction, maintenance and administration Sean HILL EPFL T6.5.1 Brain Simulation Platform integration, website construction, maintenance and administration Henry MARKRAM EPFL T7.4.3 Data provenance and preservation Peter BUNEMAN University of Edinburgh T7.5.7 High Performance Computing Platform website construction and maintenance Thomas LIPPERT Forschungszentrum Jülich T8.1.2 Data provenance, preservation and integration Vasilis VASSALOS Athens University of Economics and Business T8.4.1 Medical Informatics Platform integration, website construction, maintenance and administration Richard FRACKOWIAK Centre Hospitalier Universitaire Vaudois T8.4.2 Setting up federated infrastructure Richard FRACKOWIAK Centre Hospitalier Universitaire Vaudois T9.5.5 Neuromorphic Computing Platform website construction and maintenance Karlheinz MEIER Ruprecht-Karls-Universität Heidelberg T Setting up and administration of the Neurorobotics Platform Alois KNOLL Technische Universität München T IT Services Nenad BUNCIC EPFL HBP_SP13_EPFL_ _D13.3.2_Final 2-May-2014 Page 18 of 19
19 Annex D: References HBP_SP13_EPFL_ _D13.3.2_Final 2-May-2014 Page 19 of 19
Human Brain Project -
Human Brain Project - Scientific goals, Organization, Our role Wissenswerte, Bremen 26. Nov 2013 Prof. Sonja Grün Insitute of Neuroscience and Medicine (INM-6) & Institute for Advanced Simulations (IAS-6)
EUROPEAN COMMISSION Directorate-General for Research & Innovation. Guidelines on Data Management in Horizon 2020
EUROPEAN COMMISSION Directorate-General for Research & Innovation Guidelines on Data Management in Horizon 2020 Version 2.0 30 October 2015 1 Introduction In Horizon 2020 a limited and flexible pilot action
LJMU Research Data Policy: information and guidance
LJMU Research Data Policy: information and guidance Prof. Director of Research April 2013 Aims This document outlines the University policy and provides advice on the treatment, storage and sharing of
A grant number provides unique identification for the grant.
Data Management Plan template Name of student/researcher(s) Name of group/project Description of your research Briefly summarise the type of your research to help others understand the purposes for which
EXECUTIVE AGENCY HORIZON 2020 PROGRAMME
EUROPEAN COMMISSION INNOVATION and NETWORKS EXECUTIVE AGENCY HORIZON 2020 PROGRAMME for RESEARCH and INNOVATION Reducing impacts and costs of freight and service trips in urban areas (Topic: MG-5.2-2014)
Research Data Management Policy
Research Data Management Policy Version Number: 1.0 Effective from 06 January 2016 Author: Research Data Manager The Library Document Control Information Status and reason for development New as no previous
RESEARCH DATA MANAGEMENT POLICY
Document Title Version 1.1 Document Review Date March 2016 Document Owner Revision Timetable / Process RESEARCH DATA MANAGEMENT POLICY RESEARCH DATA MANAGEMENT POLICY Director of the Research Office Regular
Future and Emerging Technologies (FET) in H2020. Ales Fiala Future and Emerging Technologies DG CONNECT European Commission
Future and Emerging Technologies (FET) in H2020 51214 Ales Fiala Future and Emerging Technologies DG CONNECT European Commission H2020, three pillars Societal challenges Excellent science FET Industrial
HERON (No: 649690): Deliverable D.2.6 DATA MANAGEMENT PLAN AUGUST 2015. Partners: Oxford Brookes University and Università Commerciale Luigi Bocconi
HERON (No: 649690): Deliverable D.2.6 DATA MANAGEMENT PLAN AUGUST 2015 Partners: Oxford Brookes University and Università Commerciale Luigi Bocconi Institutions: Low Carbon Building Group, Oxford Brookes
Data Management Planning
DIY Research Data Management Training Kit for Librarians Data Management Planning Kerry Miller Digital Curation Centre University of Edinburgh [email protected] Running Order I. What is Research Data
Benefits of managing and sharing your data
Benefits of managing and sharing your data Research Data Management Support Services UK Data Service University of Essex April 2014 Overview Introduction to the UK Data Archive What is data management?
Action full title: Universal, mobile-centric and opportunistic communications architecture. Action acronym: UMOBILE
Action full title: Universal, mobile-centric and opportunistic communications architecture Action acronym: UMOBILE Deliverable: D.6.10 - Data Management Plan Project Information: Project Full Title Project
NIH Commons Overview, Framework & Pilots - Version 1. The NIH Commons
The NIH Commons Summary The Commons is a shared virtual space where scientists can work with the digital objects of biomedical research, i.e. it is a system that will allow investigators to find, manage,
An Introduction to Managing Research Data
An Introduction to Managing Research Data Author University of Bristol Research Data Service Date 1 August 2013 Version 3 Notes URI IPR data.bris.ac.uk Copyright 2013 University of Bristol Within the Research
The UK INCF Node and the CARMEN project. Presenter: Leslie Smith [email protected]
The UK INCF Node and the CARMEN project Presenter: Leslie Smith [email protected] Content Events and activities at UK INCF Node Recent Forthcoming The CARMEN project History, current status, ways
EUDAT. Towards a pan-european Collaborative Data Infrastructure. Willem Elbers
EUDAT Towards a pan-european Collaborative Data Infrastructure Willem Elbers EUDAT / MPI-TLA Focus meeting: Data repositories SURF, Utrecht March 3, 2014 Outline EUDAT project EUDAT services Summary and
REACCH PNA Data Management Plan
REACCH PNA Data Management Plan Regional Approaches to Climate Change (REACCH) For Pacific Northwest Agriculture 875 Perimeter Drive MS 2339 Moscow, ID 83844-2339 http://www.reacchpna.org [email protected]
1.2: DATA SHARING POLICY. PART OF THE OBI GOVERNANCE POLICY Available at: http://www.braininstitute.ca/brain-code-governance. 1.2.
1.2: DATA SHARING POLICY PART OF THE OBI GOVERNANCE POLICY Available at: http://www.braininstitute.ca/brain-code-governance 1.2.1 Introduction Consistent with its international counterparts, OBI recognizes
LabArchives Electronic Lab Notebook:
Electronic Lab Notebook: Cloud platform to manage research workflow & data Support Data Management Plans Annotate and prove discovery Secure compliance Improve compliance with your data management plans,
UNINETT Sigma2 AS: architecture and functionality of the future national data infrastructure
UNINETT Sigma2 AS: architecture and functionality of the future national data infrastructure Authors: A O Jaunsen, G S Dahiya, H A Eide, E Midttun Date: Dec 15, 2015 Summary Uninett Sigma2 provides High
W H I T E P A P E R E X E C U T I V E S U M M AR Y S I T U AT I O N O V E R V I E W. Sponsored by: EMC Corporation. Laura DuBois May 2010
W H I T E P A P E R E n a b l i n g S h a r e P o i n t O p e r a t i o n a l E f f i c i e n c y a n d I n f o r m a t i o n G o v e r n a n c e w i t h E M C S o u r c e O n e Sponsored by: EMC Corporation
MS Health Informatics (2014 2015)
MS Health Informatics (2014 2015) Program Information Point of Contact Girmay Berhie ([email protected]) Support for University and College Missions Marshall University is a multi campus public university
Management of Research Data Procedure
Management of Research Data Procedure Related Policy Management of Research Data Policy Responsible Officer Deputy Vice Chancellor (Research) Approved by Deputy Vice Chancellor (Research) Approved and
Digital preservation a European perspective
Digital preservation a European perspective Pat Manson Head of Unit European Commission DG Information Society and Media Cultural Heritage and Technology Enhanced Learning Outline The digital preservation
Data Management in NeuroMat and the Neuroscience Experiments System (NES)
Data Management in NeuroMat and the Neuroscience Experiments System (NES) Kelly Rosa Braghetto Department of Computer Science Institute of Mathematics and Statistics - University of São Paulo 1st NeuroMat
OpenAIRE Research Data Management Briefing paper
OpenAIRE Research Data Management Briefing paper Understanding Research Data Management February 2016 H2020-EINFRA-2014-1 Topic: e-infrastructure for Open Access Research & Innovation action Grant Agreement
Data Management Plan Template Guidelines
Data Management Plan Template Guidelines This sample plan is provided to assist grant applicants in creating a data management plan, if required by the agency receiving the proposal. A data management
Doctor of Philosophy in Computer Science
Doctor of Philosophy in Computer Science Background/Rationale The program aims to develop computer scientists who are armed with methods, tools and techniques from both theoretical and systems aspects
Data Management using irods
Data Management using irods Fundamentals of Data Management September 2014 Albert Heyrovsky Applications Developer, EPCC [email protected] 2 Course outline Why talk about irods? What is irods?
Checklist for a Data Management Plan draft
Checklist for a Data Management Plan draft The Consortium Partners involved in data creation and analysis are kindly asked to fill out the form in order to provide information for each datasets that will
European Data Infrastructure - EUDAT Data Services & Tools
European Data Infrastructure - EUDAT Data Services & Tools Dr. Ing. Morris Riedel Research Group Leader, Juelich Supercomputing Centre Adjunct Associated Professor, University of iceland BDEC2015, 2015-01-28
Data Management Plan. Name of Contractor. Name of project. Project Duration Start date : End: DMP Version. Date Amended, if any
Data Management Plan Name of Contractor Name of project Project Duration Start date : End: DMP Version Date Amended, if any Name of all authors, and ORCID number for each author WYDOT Project Number Any
ESRC Research Data Policy
ESRC Research Data Policy Introduction... 2 Definitions... 2 ESRC Research Data Policy Principles... 3 Principle 1... 3 Principle 2... 3 Principle 3... 3 Principle 4... 3 Principle 5... 3 Principle 6...
Checklist and guidance for a Data Management Plan
Checklist and guidance for a Data Management Plan Please cite as: DMPTuuli-project. (2016). Checklist and guidance for a Data Management Plan. v.1.0. Available online: https://wiki.helsinki.fi/x/dzeacw
Digital libraries of the future and the role of libraries
Digital libraries of the future and the role of libraries Donatella Castelli ISTI-CNR, Pisa, Italy Abstract Purpose: To introduce the digital libraries of the future, their enabling technologies and their
Deliverable Name. D1.1.3 IPR Management Plan v1. DuraArk DURAARK. Date: 201x-MM-DD. Date: 2013-03-31. Document id. id. : duraark/2013/d.1.1.3/v1.
Deliverable Name D1.1.3 IPR Management Plan v1 DuraArk FP7 FP7 ICT ICT Digital Digital Preservation Preservation Grant Grant agreement No.: No.: 600908 600908 Date: 201x-MM-DD Date: 2013-03-31 Version
The Key Elements of Digital Asset Management
The Key Elements of Digital Asset Management The last decade has seen an enormous growth in the amount of digital content, stored on both public and private computer systems. This content ranges from professionally
APPENDIX to http://dx.doi.org/10.4338/aci-2014-09-ra-0083 CAHIIM 2012 Curriculum Requirements Health Informatics Master s Degree
APPENDIX to http://dx.doi.org/10.4338/aci-2014-09-ra-0083 CAHIIM 2012 Curriculum Requirements Health Informatics Master s Degree Column 1 - Health Informatics Facet I. Information Systems concerned with
PROPOSAL HUMAN BRAIN PROJECT FRAMEWORK PARTNERSHIP AGREEMENT
PROPOSAL HUMAN BRAIN PROJECT FRAMEWORK PARTNERSHIP AGREEMENT UNIFYING OUR UNDERSTANDING OF THE HUMAN BRAIN COVER PAGE Title of Proposal: The Human Brain Project - Framework Partnership Agreement (HBP-FPA)
Proposal template (technical annex) Health, demographic change and wellbeing Two-stage Research and Innovation actions Innovation actions
Proposal template (technical annex) Health, demographic change and wellbeing Two-stage Research and Innovation actions Innovation actions Note: This is for information only. The definitive templates to
Project Number: 284941 Project Title: Human Brain Project. Neurorobotics Platform v1 preliminary release for internal Consortium use
Project Number: 284941 Project Title: Human Brain Project Document Title: Document Filename: Deliverable Number: Deliverable Type: Neurorobotics Platform v1 preliminary release for internal Consortium
NSF Data Management Plan Template Duke University Libraries Data and GIS Services
NSF Data Management Plan Template Duke University Libraries Data and GIS Services NSF Data Management Plan Requirement Overview The Data Management Plan (DMP) should be a supplementary document of no more
The Big Data methodology in computer vision systems
The Big Data methodology in computer vision systems Popov S.B. Samara State Aerospace University, Image Processing Systems Institute, Russian Academy of Sciences Abstract. I consider the advantages of
LIBER Case Study: University of Oxford Research Data Management Infrastructure
LIBER Case Study: University of Oxford Research Data Management Infrastructure AuthorS: Dr James A. J. Wilson, University of Oxford, [email protected] Keywords: generic, institutional, software
Data Management Plan Deliverable 5.4
Data Management Plan Deliverable 5.4 This project has received funding from the European Union s Horizon 2020 Research and Innovation Programme under Grant Agreement No 675191 Page 1 About this document
DATA LIFE CYCLE & DATA MANAGEMENT PLANNING
DATA LIFE CYCLE & DATA MANAGEMENT PLANNING......... VEERLE VAN DEN EYNDEN RESEARCH DATA MANAGEMENT TEAM UNIVERSITY OF ESSEX.. LOOKING AFTER AND MANAGING YOUR RESEARCH DATA (GOING DIGITAL AND ESRC ATN EVENTS),
Digital Asset Manager, Digital Curator. Cultural Informatics, Cultural/ Art ICT Manager
Role title Digital Cultural Asset Manager Also known as Relevant professions Summary statement Mission Digital Asset Manager, Digital Curator Cultural Informatics, Cultural/ Art ICT Manager Deals with
Technical. Overview. ~ a ~ irods version 4.x
Technical Overview ~ a ~ irods version 4.x The integrated Ru e-oriented DATA System irods is open-source, data management software that lets users: access, manage, and share data across any type or number
SURFsara Data Services
SURFsara Data Services SUPPORTING DATA-INTENSIVE SCIENCES Mark van de Sanden The world of the many Many different users (well organised (international) user communities, research groups, universities,
Creating a Data Management Plan for your Research
Creating a Data Management Plan for your Research EPFL Workshop Lausaunne, 28 Oct 2014 Robin Rice, Laine Ruus EDINA and Data Library Course content What is a Data Management Plan? Benefits and drivers
CIP s Open Data & Data Management Guidelines and Procedures
CIP s Open Data & Data Management Guidelines and Procedures 1.1 Scope The CIP Data Management Guidelines and Procedures aim to provide guidance and support throughout the Data Management Cycle to facilitate
Guidelines for applicants
ERACoSysMed 1st Joint Transnational Call for Proposals for European Research Projects to demonstrate the feasibility and benefits of systems medicine Guidelines for applicants Submission deadline for pre-proposals:
Digital Continuity Plan
Digital Continuity Plan Ensuring that your business information remains accessible and usable for as long as it is needed Accessible and usable information Digital continuity Digital continuity is an approach
IBM Unstructured Data Identification and Management
IBM Unstructured Data Identification and Management Discover, recognize, and act on unstructured data in-place Highlights Identify data in place that is relevant for legal collections or regulatory retention.
Ex Libris Rosetta: A Digital Preservation System Product Description
Ex Libris Rosetta: A Digital Preservation System Product Description CONFIDENTIAL INFORMATION The information herein is the property of Ex Libris Ltd. or its affiliates and any misuse or abuse will result
A. Document repository services for EU policy support
A. Document repository services for EU policy support 1. CONTEXT Type of Action Type of Activity Service in charge Associated Services Project Reusable generic tools DG DIGIT Policy DGs (e.g. FP7 DGs,
Report of the DTL focus meeting on Life Science Data Repositories
Report of the DTL focus meeting on Life Science Data Repositories Goal The goal of the meeting was to inform and discuss research data repositories for life sciences. The big data era adds to the complexity
REGULATIONS FOR THE DEGREE OF MASTER OF SCIENCE IN COMPUTER SCIENCE (MSc[CompSc])
305 REGULATIONS FOR THE DEGREE OF MASTER OF SCIENCE IN COMPUTER SCIENCE (MSc[CompSc]) (See also General Regulations) Any publication based on work approved for a higher degree should contain a reference
Using Enterprise Content Management Principles to Manage Research Assets. Kelly Mannix, Manager Deloitte Consulting Perth, WA.
Using Enterprise Content Management Principles to Manage Research Assets Kelly Mannix, Manager Deloitte Consulting Perth, WA November 2010 Agenda Introduction Defining ECM Understanding the Challenges
The data landscape lessons from UK
The data landscape lessons from UK Veerle Van den Eynden UK Data Archive University of Essex Faculty of Psychology and Educational Sciences University of Ghent, Belgium 23 October 2014 UK data landscape
1. Introduction to ehealth:
1. Introduction to ehealth: E-Health is one of the fastest growing areas within the health sector. The scope of e- Health involves application of the knowledge, skills and tools, which enable information
DELIVERABLE. Grant Agreement number: 325091 Europeana Cloud: Unlocking Europe s Research via The Cloud
DELIVERABLE Project Acronym: Grant Agreement number: 325091 Project Title: Europeana Cloud Europeana Cloud: Unlocking Europe s Research via The Cloud D5.3 Europeana Cloud Access and Reuse Framework (originally:
Cambridge University Library. Working together: a strategic framework 2010 2013
1 Cambridge University Library Working together: a strategic framework 2010 2013 2 W o r k i n g to g e t h e r : a s t r at e g i c f r a m e w o r k 2010 2013 Vision Cambridge University Library will
ERA-CAPS Data Sharing Policy ERA-CAPS. Data Sharing Policy
ERA-CAPS Data Sharing Policy March 2014 1 ERA-CAPS Data Sharing Policy 1. Principles ERA-CAPS view on research data is informed by the overarching principles declared by the Organisation for Economic Cooperation
ECM Governance Policies
ECM Governance Policies Metadata and Information Architecture Policy Document summary Effective date 13 June 2012 Last updated 17 November 2011 Policy owner Library Services, ICTS Approved by Council Reviewed
Management of Official Records in a Business System
GPO Box 2343 ADELAIDE SA 5001 Tel (08) 8204 8773 Fax (08) 8204 8777 DX:467 [email protected] www.archives.sa.gov.au Management of Official Records in a Business System October 2011 Version
SHared Access Research Ecosystem (SHARE)
SHared Access Research Ecosystem (SHARE) June 7, 2013 DRAFT Association of American Universities (AAU) Association of Public and Land-grant Universities (APLU) Association of Research Libraries (ARL) This
DIGITAL STEWARDSHIP SUPPLEMENTARY INFORMATION FORM
DIGITAL STEWARDSHIP SUPPLEMENTARY INFORMATION FORM Introduction The Institute of Museum and Library Services (IMLS) is committed to expanding public access to federally funded research, data, software,
2009 ikeep Ltd, Morgenstrasse 129, CH-3018 Bern, Switzerland (www.ikeep.com, [email protected])
CSP CHRONOS Compliance statement for ISO 14721:2003 (Open Archival Information System Reference Model) 2009 ikeep Ltd, Morgenstrasse 129, CH-3018 Bern, Switzerland (www.ikeep.com, [email protected]) The international
Data Governance Policy. Staff Only Students Only Staff and Students. Vice-Chancellor
Name of Policy Description of Policy Policy applies to Data Governance Policy To establish proper standards to assure the quality and integrity of University data. This policy also defines the roles and
H2020 Guidelines on Open Data and Data Management Plan
H2020 Guidelines on Open Data and Data Management Plan CRR Centro Risorse per la Ricerca Multimediale Why? Open scientific research data should be easily discoverable, accessible, assessable, intelligible,
Horizon 2020. Research e-infrastructures Excellence in Science Work Programme 2016-17. Wim Jansen. DG CONNECT European Commission
Horizon 2020 Research e-infrastructures Excellence in Science Work Programme 2016-17 Wim Jansen DG CONNECT European Commission 1 Before we start The material here presented has been compiled with great
ECRIN (European Clinical Research Infrastructures Network)
ECRIN (European Clinical Research Infrastructures Network) Wolfgang Kuchinke University of Duesseldorf (HHU) and ECRIN EUDAT 1st User Forum 7 March 2012 8 March 2012, Barcelona 1 What is ECRIN? European
Research proposal (Part B)
Research and Innovation Actions (RIA) Innovation Actions (IA) Research proposal (Part B) Version 1.1 3 November 2014 Disclaimer This document is aimed at informing potential applicants for Horizon 2020
Scalable End-User Access to Big Data http://www.optique-project.eu/ HELLENIC REPUBLIC National and Kapodistrian University of Athens
Scalable End-User Access to Big Data http://www.optique-project.eu/ HELLENIC REPUBLIC National and Kapodistrian University of Athens 1 Optique: Improving the competitiveness of European industry For many
Research Data Management in Horizon 2020
Research Data Management in Horizon 2020 Dr. Fieke Schoots, UBL 11 / 6 / 2015 From : Guidelines on Open Access to Scientific Publications and Research Data in Horizon 2020 [v.1.0, 11/12/2013] Open access
Check Your Data Freedom: A Taxonomy to Assess Life Science Database Openness
Check Your Data Freedom: A Taxonomy to Assess Life Science Database Openness Melanie Dulong de Rosnay Fellow, Science Commons and Berkman Center for Internet & Society at Harvard University This article
Globus Research Data Management: Introduction and Service Overview
Globus Research Data Management: Introduction and Service Overview Kyle Chard [email protected] Ben Blaiszik [email protected] Thank you to our sponsors! U. S. D E P A R T M E N T OF ENERGY 2 Agenda
D1.3: 1 st Data Management Plan WP1 Project Management
D1.3: 1 st Data Management Plan WP1 Project Management Document Information Grant Agreement Number 649493 Acronym STEP Full Project Title Societal and political engagement of young people in environmental
Our Cloud Offers You a Brighter Future
Our Cloud Offers You a Brighter Future Qube Global Software Cloud Services are used by many diverse organisations including financial institutions, international service providers, property companies,
