DATA MANAGEMENT PLAN IN THE REAL LIFE SCIENCES
|
|
|
- Stanley Ross
- 10 years ago
- Views:
Transcription
1 DATA MANAGEMENT PLAN IN THE REAL LIFE SCIENCES Yvan Le Bras Cyril Monjeaud Olivier Collin Jacques Nicolas CNRS UMR 6074 IRISA-INRIA
2 Context Now : Genomics : Next Generation Sequencing Now : Proteomics Next : Bio-imaging Kahn. On the future of genomic data. Science (2011) vol. 331 (6018) pp Digital data Huge amount Heterogenous Critical situation for some laboratories
3 Context Exchange from one domain to another From ICT / IT to scientific domains Between scientific domains Life science integrators e-science integrators
4 E-BIOGENOUEST From the e-biogenouest project to the first french e- Science center : CeSGO
5 E-Biogenouest Started in May 2012 for 3 years Funded by Brittany and Pays de la Loire E-science initiative for the Biogenouest network Test an e-science approach Roadmap preparation
6 E-Biogenouest Started in May 2012 for 3 years Funded by Brittany and Pays de la Loire E-science initiative for the Biogenouest network Test an e-science approach More than 120 scientists trained! 1669 meetings ;) Roadmap preparation -UEB C@mpus -CPER -FRM -INCa -H2020 Health 7 submitted publications Agro Environment IT More than 200 users! An innovative VRE concept -Mission interdisciplinarité CNRS -PIA -IFB -Fce Génomique -Rapsodyn -Sciences citoyennes
7 VRE: a tool for e-science application Virtual Research Environment Data User Web portal Collaboration softwares Community Processing resources
8 An innovative VRE approach Research Lifecycle Open source solutions Mutualise Don t reinvente the wheel win win Break down silos
9 Continuum HubZero Galaxy EMME Communauté Continuum data management & analysis Collaborative environment Collaboration
10 HUBzero : Scientifique collaborative platform ebgo HUB HUBzero to share knowledge and manage groups and projects Informations 218 users 111 projects 53 groups 729 resources > 400 uniq users uniques by month Purdue University M. McLennan, R. Kennell. Comput Sci Eng, 12:48-53, 2010.
11 ISAtools : Experimental data management EMME ISAtools suite to store data & metadata Fonctionalities -based on biomed ontologies -bridge between existing biomed standards -format publication submission -Pydio to upload data -biological investigation repository (data + metadata) Oxford eresearch Centre P. Rocca-Serra et al. Bioinformatics, 26;254(6), 2010
12 Galaxy : Data analysis web platform GALAXY by GenOuest To analyse & share data as processes and tools Informations jobs 150 users More than 800 outils Share - data - histories - workflows - tools Penn state university J. Goecks, A. Nekrutenko, J. Taylor, et al. Genome Biol, 25;11(8):R86, 2010
13 Pydio : File sharing platform Pydio by GenOuest To store & share data as links Informations -Galaxy workspace -EMME workspace -INCa workspace Share - data via URI - control - safety - privacy Abstrium SAS Charles du jeu, David Gillard et al.
14 What are our goals? For society Open Science and open data For end users scientists communities Data management plan Preserve, access, share & visualise (data & analytics porocesses) Help for project management For ICT Facilitate the use of tools Research Service Accelerate switch between dev to production state Optimise infrastructures use (storage, computing & network ) Infrastructure for data infastructure of data
15 DMP ON THE LINE From data storage to publication
16 CeSGO : Data storage
17 Data storage
18 Data storage URL generation
19 Metadata management
20 Metadata management
21 Metadata management Configuration
22 Metadata management Configuration
23 Metadata management Configuration
24 Metadata management Configuration
25 Metadata management Configuration
26 Metadata management Configuration
27 Metadata management Isacreator
28 Metadata management Isacreator
29 Metadata management Isacreator: genomespace
30 Metadata management Isacreator: local
31 Metadata management Isacreator: choose a config
32 Metadata management Isacreator: existing isatab
33 Metadata management Isacreator: existing isatab
34 Metadata management Isacreator: existing isatab
35 Metadata management Isacreator: Investigation
36 Metadata management Isacreator: Study
37 Metadata management Isacreator: Study 1
38 Metadata management Isacreator: Assay 1
39 Metadata management Isacreator: Assay 1 / Data
40 Metadata management Isacreator: Study
41 Metadata management Isacreator: create an ISArchive
42 Metadata management Isacreator: Study
43 Data analysis Metadata & data analysis: Galaxy
44 Data analysis Metadata & data analysis: Galaxy / Import ISArchive
45 Data analysis Metadata & data analysis: Galaxy / Import ISArchive
46 Data analysis Metadata & data analysis: Galaxy / Extract ISArchive
47 Data analysis Metadata & data analysis: Galaxy / Extract ISArchive
48 Data analysis Metadata & data analysis: Galaxy / Extract ISArchive
49 Data analysis Metadata & data analysis: Galaxy / Extract ISArchive
50 Data analysis Metadata & data analysis: Galaxy / Download data
51 Data analysis Metadata & data analysis: Galaxy / Download raw data
52 Data analysis Metadata & data analysis: Galaxy / Extract ISArchive
53 Data analysis Metadata & data analysis: Galaxy / Extract ISArchive
54 Metadata repository Metadata repository: Bii
55 Metadata repository Metadata repository: Bii 1 study
56 Metadata repository Metadata repository: Bii Data via URL / Protocols
57 CeSGO & DMP Données administratives Dénomination du projet Description du projet Nom / ID du responsable Agence de financement Version du DMP Politique appliquée aux données Responsabilités et ressources Collecte / création de données Description du jeu de données Protocole Méthode Equipements Assurance qualité appliquée Documentation et métadonnées Entrepôt Bii Standard de métadonnées : ISA-TAB
58 CeSGO & DMP Stockage, sauvegarde et sécurité des données Datacenter CeSGO pendant la durée du projet (max : 5 ans) Ethique et cadre légal Protection des données sensibles ou personnelles CC version 4.0 Partage des données Accès libre ou restreint Délai : 3 ans max après leur collecte Entrepôts (GEO, Genbank, SRA, Uniprot, PRIDE,.) Outils nécessaires à la réutilisation / validation des données Data paper Sélection et archivage des données
59 CESGO: 5 GOALS From Data Mangement to Accessibility
60 CeSGO : Western France e-science metadata Data management URI Life sciences protocols
61 CeSGO : Western France e-science New VREs! Open Data
62 CeSGO : Western France e-science New VREs! Connected using semantic web approaches Thanks to DOI attribution Linked Data
63 CeSGO : Western France e-science cloud Reproducibility Galaxy versioning docker
64 CeSGO : Western France e-science wiki Accessibility Analytics processes Public resources Experiments Publications
65 Merci de votre attention La plate-forme Bio-informatique GenOuest Le groupe Symbiose IRISA/INRIA GenOuest-Dyliss-Genscale ebgo HUB (collaboration) Scitizen portal (citizen science) EMME portal (data management) Galaxy instance (data analysis) GO4Bioinformatics (education )
66 CeSGO : Western France e-science New VREs!
e-biogenouest : The Tools
e-biogenouest : The Tools Coordinateur : Olivier Collin Animateur : Yvan Le Bras CNRS UMR 6074 IRISA-INRIA / Plateforme de Bioinformatique GenOuest [email protected] Programme fédérateur Biogenouest
A curated Domain centric shared Docker registry linked to the Galaxy toolshed
A curated Domain centric shared Docker registry linked to the Galaxy toolshed François Moreews 1, Olivier Sallou 2, Yvan le Bras 2, Marie Grosjean 3, Cyril Monjeaud 2, Thomas Darde 4, Olivier Collin 2,
Cloud Ready for Bioinformatics?
IDB acknowledges co-funding by the European Community's Seventh Framework Programme (INFSO-RI-261552) and the French National Research Agency's Arpege Programme (ANR-10-SEGI-001) Cloud Ready for Bioinformatics?
Workprogramme 2014-15
Workprogramme 2014-15 e-infrastructures DCH-RP final conference 22 September 2014 Wim Jansen einfrastructure DG CONNECT European Commission DEVELOPMENT AND DEPLOYMENT OF E-INFRASTRUCTURES AND SERVICES
COPO: Collaborative Open Plant Omics. Rob Davey Data Infrastructure and Algorithms Group Leader [email protected].
: Collaborative Open Plant Omics Rob Davey Data Infrastructure and Algorithms Group Leader [email protected] @froggleston Toni Etuk Felix Shaw Acknowledgements Oxford eresearch Centre Susanna Sansone
Using the Grid for the interactive workflow management in biomedicine. Andrea Schenone BIOLAB DIST University of Genova
Using the Grid for the interactive workflow management in biomedicine Andrea Schenone BIOLAB DIST University of Genova overview background requirements solution case study results background A multilevel
Il est repris ci-dessous sans aucune complétude - quelques éléments de cet article, dont il est fait des citations (texte entre guillemets).
Modélisation déclarative et sémantique, ontologies, assemblage et intégration de modèles, génération de code Declarative and semantic modelling, ontologies, model linking and integration, code generation
Hadoopizer : a cloud environment for bioinformatics data analysis
Hadoopizer : a cloud environment for bioinformatics data analysis Anthony Bretaudeau (1), Olivier Sallou (2), Olivier Collin (3) (1) [email protected], INRIA/Irisa, Campus de Beaulieu, 35042,
Bioinformatique sur Cloud Cas d usage avec le portail Galaxy
Bioinformatique sur Cloud Cas d usage avec le portail Galaxy Christophe Blanchet Institute of Biology and Chemistry of Proteins Head of Service Infrastructure for Biology - IDB CNRS-IBCP FR3302 - LYON
Web and Big Data at LIG. Marie-Christine Rousset (Pr UJF, déléguée scientifique du LIG)
Web and Big Data at LIG Marie-Christine Rousset (Pr UJF, déléguée scientifique du LIG) Data and Knowledge Processing at Large Scale Officers: Massih-Reza Amini - Jean-Pierre Chevallet Teams: AMA EXMO GETALP
Exploitation of ISS scientific data
Cooperative ISS Research data Conservation and Exploitation Exploitation of ISS scientific data Luigi Carotenuto Telespazio s.p.a. Copernicus Big Data Workshop March 13-14 2014 European Commission Brussels
Quel pilote ètes-vous
Quel pilote ètes-vous Mario Andretti Unique Multi-World Champion en Formula 1, Indy Car, World Sportscar, Nascar Copyright 2 3/27/2013 BMC Software, Inc 2 If everything seems under control, you're not
Cloud pour la Bioinformatique
Cloud pour la Bioinformatique Christophe Blanchet Institut Français de Bioinformatique - IFB French Institute of Bioinformatics - ELIXIR French Node CNRS UMS3601 - Gif-sur-Yvette - FRANCE Sequencing data
Sequencing data. And other experimental data. EMBL-EBI data resources growth
Sequencing Institut Français de Bioinformatique, Un loud pour les Sciences du Vivant source: www.genomesonline.org source: www.politigenomics.com/next-generation- hristophe Blanchet Institut Français de
IFB s e-infrastructure
IFB s e-infrastructure Christophe Blanchet Institut Français de Bioinformatique - IFB French Institute of Bioinformatics - ELIXIR-FR CNRS UMS3601 - Gif-sur-Yvette - FRANCE Life Sciences Platforms in France
Exploring the roles and responsibilities of data centres and institutions in curating research data a preliminary briefing.
Exploring the roles and responsibilities of data centres and institutions in curating research data a preliminary briefing. Dr Liz Lyon, UKOLN, University of Bath Introduction and Objectives UKOLN is undertaking
Early Cloud Experiences with the Kepler Scientific Workflow System
Available online at www.sciencedirect.com Procedia Computer Science 9 (2012 ) 1630 1634 International Conference on Computational Science, ICCS 2012 Early Cloud Experiences with the Kepler Scientific Workflow
SURFsara Data Services
SURFsara Data Services SUPPORTING DATA-INTENSIVE SCIENCES Mark van de Sanden The world of the many Many different users (well organised (international) user communities, research groups, universities,
EMBL Identity & Access Management
EMBL Identity & Access Management Rupert Lück EMBL Heidelberg e IRG Workshop Zürich Apr 24th 2008 Outline EMBL Overview Identity & Access Management for EMBL IT Requirements & Strategy Project Goal and
Formation à l ED STIC ED STIC Doctoral education. Hanna Klaudel
Formation à l ED STIC ED STIC Doctoral education Hanna Klaudel Texte de référence / Text of low L arrêté de 7 août 2006 : «Les écoles doctorales proposent aux doctorants les formations utiles à leur projet
EUDAT. Towards a pan-european Collaborative Data Infrastructure. Willem Elbers
EUDAT Towards a pan-european Collaborative Data Infrastructure Willem Elbers EUDAT / MPI-TLA Focus meeting: Data repositories SURF, Utrecht March 3, 2014 Outline EUDAT project EUDAT services Summary and
IEEE International Conference on Computing, Analytics and Security Trends CAST-2016 (19 21 December, 2016) Call for Paper
IEEE International Conference on Computing, Analytics and Security Trends CAST-2016 (19 21 December, 2016) Call for Paper CAST-2015 provides an opportunity for researchers, academicians, scientists and
NIH Commons Overview, Framework & Pilots - Version 1. The NIH Commons
The NIH Commons Summary The Commons is a shared virtual space where scientists can work with the digital objects of biomedical research, i.e. it is a system that will allow investigators to find, manage,
You can choose to install the plugin through Magento Connect or by directly using the archive files.
Magento plugin 1.5.7 installation 1. Plugin installation You can choose to install the plugin through Magento Connect or by directly using the archive files. 1.1 Installation with Magento Connect 1.1.1
Standard Big Data Architecture and Infrastructure
Standard Big Data Architecture and Infrastructure Wo Chang Digital Data Advisor Information Technology Laboratory (ITL) National Institute of Standards and Technology (NIST) [email protected] May 20, 2016
ENABLING DATA TRANSFER MANAGEMENT AND SHARING IN THE ERA OF GENOMIC MEDICINE. October 2013
ENABLING DATA TRANSFER MANAGEMENT AND SHARING IN THE ERA OF GENOMIC MEDICINE October 2013 Introduction As sequencing technologies continue to evolve and genomic data makes its way into clinical use and
Towards a galaxy.prabi.fr
Towards a galaxy.prabi.fr IFB- galaxy Day 04/12/2013 Navra5l V., PhD, UCBL [email protected] www.prabi.fr One among the six IFB regional nodes Region: Rhône- Alpes Director: Guy Perrière 11 Research Team,
Data Intensive Research Initiative for South Africa (DIRISA)
Data Intensive Research Initiative for South Africa (DIRISA) A Reinterpreted Vision A. Vahed 25 November 2014 Outline Background Data Landscape Strategy & Objectives Activities & Outputs Organisational
Preserving French Scientific data
Preserving French Scientific data Marion MASSOL (CINES) [email protected] DARIAH General VCC Meeting November 28 th, 29 th, 30 th 2012 AGENDA 1. Preserving data: our mission and strategy 4. The file
Smart Specialization Regional Innovation Strategy (SRI 3S) in Provence Alpes Côte d Azur
Smart Specialization Regional Innovation Strategy (SRI 3S) in Provence Alpes Côte d Azur 1 PACA Assets for economic growth 3 rd French region in terms of GDP 1st University of France (70 000 students)
#jenkinsconf. Jenkins as a Scientific Data and Image Processing Platform. Jenkins User Conference Boston #jenkinsconf
Jenkins as a Scientific Data and Image Processing Platform Ioannis K. Moutsatsos, Ph.D., M.SE. Novartis Institutes for Biomedical Research www.novartis.com June 18, 2014 #jenkinsconf Life Sciences are
Lecture 11 Data storage and LIMS solutions. Stéphane LE CROM [email protected]
Lecture 11 Data storage and LIMS solutions Stéphane LE CROM [email protected] Various steps of a DNA microarray experiment Experimental steps Data analysis Experimental design set up Chips on catalog
Euro-BioImaging European Research Infrastructure for Imaging Technologies in Biological and Biomedical Sciences
Euro-BioImaging European Research Infrastructure for Imaging Technologies in Biological and Biomedical Sciences WP11 Data Storage and Analysis Task 11.1 Coordination Deliverable 11.3 Selected Standards
Service Road Map for ANDS Core Infrastructure and Applications Programs
Service Road Map for ANDS Core and Applications Programs Version 1.0 public exposure draft 31-March 2010 Document Target Audience This is a high level reference guide designed to communicate to ANDS external
Digital libraries of the future and the role of libraries
Digital libraries of the future and the role of libraries Donatella Castelli ISTI-CNR, Pisa, Italy Abstract Purpose: To introduce the digital libraries of the future, their enabling technologies and their
Le cloud IFB et son instance Galaxy
Le cloud IFB et son instance Galaxy Christophe BLANCHET Institut Français de Bioinformatique - IFB French Institute of Bioinformatics - ELIXIR-FR CNRS UMS3601 - Gif-sur-Yvette - FRANCE Ecole Bioinformatique
Automatic Timeline Construction For Computer Forensics Purposes
Automatic Timeline Construction For Computer Forensics Purposes Yoan Chabot, Aurélie Bertaux, Christophe Nicolle and Tahar Kechadi CheckSem Team, Laboratoire Le2i, UMR CNRS 6306 Faculté des sciences Mirande,
Virginia Commonwealth University Rice Rivers Center Data Management Plan
Virginia Commonwealth University Rice Rivers Center Data Management Plan Table of Contents Objectives... 2 VCU Rice Rivers Center Research Protocol... 2 VCU Rice Rivers Center Data Management Plan... 3
CFT 100000930 ICT review Questions/Answers
CFT 100000930 ICT review Questions/Answers 1. Est-ce que la stratégie métier est formalisée dans un document détaillant les priorités? Yes, there are two strategic documents, the STRATEGIC ORIENTATIONS
Brian Connolly Systems Engineer, LabKey Software [email protected]. LabKey Server in the Cloud
Brian Connolly Systems Engineer, LabKey Software [email protected] LabKey Server in the Cloud 1 Agenda What is the Cloud? Why would I want to use the cloud? What will it cost? Using LabKey in the cloud
DATA MANAGEMENT PLAN DELIVERABLE NUMBER RESPONSIBLE AUTHOR. Co- funded by the Horizon 2020 Framework Programme of the European Union
DATA MANAGEMENT PLAN Co- funded by the Horizon 2020 Framework Programme of the European Union DELIVERABLE NUMBER DELIVERABLE TITLE D7.4 Data Management Plan RESPONSIBLE AUTHOR DFKI GRANT AGREEMENT N. PROJECT
"Internationalization vs. Localization: The Translation of Videogame Advertising"
Article "Internationalization vs. Localization: The Translation of Videogame Advertising" Raquel de Pedro Ricoy Meta : journal des traducteurs / Meta: Translators' Journal, vol. 52, n 2, 2007, p. 260-275.
THE HELMHOLTZ INVENIO REPOSITORY PROJECT :
THE HELMHOLTZ INVENIO REPOSITORY PROJECT : BETWEEN ORGANIZATIONAL TASKS, NEEDS OF SCIENTISTS, AND CONSTRAINTS 1 Structure of the presentation Libraries of DESY, FZJ and GSI in the Helmholtz Association
BIOINFORMATICS Supporting competencies for the pharma industry
BIOINFORMATICS Supporting competencies for the pharma industry ABOUT QFAB QFAB is a bioinformatics service provider based in Brisbane, Australia operating nationwide and internationally. QFAB was established
Making university-industry partnerships work: trials and lessons. Marie-Odile OTT, PhD Inspectrice générale
Making university-industry partnerships work: trials and lessons Marie-Odile OTT, PhD Inspectrice générale University Industry partnership Common views concerning the mission of universities: 1. The dissemination
How To Write A Blog Post On Globus
Globus Software as a Service data publication and discovery Kyle Chard, University of Chicago Computation Institute, [email protected] Jim Pruyne, University of Chicago Computation Institute, [email protected]
Faut-il des cyberarchivistes, et quel doit être leur profil professionnel?
Faut-il des cyberarchivistes, et quel doit être leur profil professionnel? Jean-Daniel Zeller To cite this version: Jean-Daniel Zeller. Faut-il des cyberarchivistes, et quel doit être leur profil professionnel?.
Euro-BioImaging European Research Infrastructure for Imaging Technologies in Biological and Biomedical Sciences
Euro-BioImaging European Research Infrastructure for Imaging Technologies in Biological and Biomedical Sciences WP11 Data Storage and Analysis Task 11.1 Coordination Deliverable 11.2 Community Needs of
Copyright 2014, Oracle and/or its affiliates. All rights reserved.
1 Safe Harbor Statement The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into any contract. It is not a commitment
Data Management Plan. Name of Contractor. Name of project. Project Duration Start date : End: DMP Version. Date Amended, if any
Data Management Plan Name of Contractor Name of project Project Duration Start date : End: DMP Version Date Amended, if any Name of all authors, and ORCID number for each author WYDOT Project Number Any
Managing the Knowledge Exchange between the Partners of the Supply Chain
Managing the Exchange between the Partners of the Supply Chain Problem : How to help the SC s to formalize the exchange of the? Which methodology of exchange? Which representation formalisms? Which technical
Report of the DTL focus meeting on Life Science Data Repositories
Report of the DTL focus meeting on Life Science Data Repositories Goal The goal of the meeting was to inform and discuss research data repositories for life sciences. The big data era adds to the complexity
Semantic Workflows and the Wings Workflow System
To Appear in AAAI Fall Symposium on Proactive Assistant Agents, Arlington, VA, November 2010. Assisting Scientists with Complex Data Analysis Tasks through Semantic Workflows Yolanda Gil, Varun Ratnakar,
OpenAIRE Research Data Management Briefing paper
OpenAIRE Research Data Management Briefing paper Understanding Research Data Management February 2016 H2020-EINFRA-2014-1 Topic: e-infrastructure for Open Access Research & Innovation action Grant Agreement
Information and Communications Technology Strategy 2014-2017
Contents 1 Background ICT in Geoscience Australia... 2 1.1 Introduction... 2 1.2 Purpose... 2 1.3 Geoscience Australia and the Role of ICT... 2 1.4 Stakeholders... 4 2 Strategic drivers, vision and principles...
A brief introduction to Cytoscape
A brief introduction to Cytoscape Scientific day «Data mining of omics data» 5th Feb 2015 CGFB, Bordeaux [email protected] (U1038, EDyP team, CEA-Grenoble) OUTLINES Concepts and context Cytoscape
Stockage distribué sous Linux
Félix Simon Ludovic Gauthier IUT Nancy-Charlemagne - LP ASRALL Mars 2009 1 / 18 Introduction Répartition sur plusieurs machines Accessibilité depuis plusieurs clients Vu comme un seul et énorme espace
The SIST-GIRE Plate-form, an example of link between research and communication for the development
1 The SIST-GIRE Plate-form, an example of link between research and communication for the development Patrick BISSON 1, MAHAMAT 5 Abou AMANI 2, Robert DESSOUASSI 3, Christophe LE PAGE 4, Brahim 1. UMR
CDPP in Europlanet/IDIS FP6 and FP7 C. Jacquey, N. André, B. Cecconi, V. Génot, C. Briand. M. Gangloff, M. Bouchemit, E. Budnik, E.
CDPP in Europlanet/IDIS FP6 and FP7 C. Jacquey, N. André, B. Cecconi, V. Génot, C. Briand M. Gangloff, M. Bouchemit, E. Budnik, E. Pallier Le CDPP Centre National (INSU-CNES) Missions -Archivage et préservation
BBSRC TECHNOLOGY STRATEGY: TECHNOLOGIES NEEDED BY RESEARCH KNOWLEDGE PROVIDERS
BBSRC TECHNOLOGY STRATEGY: TECHNOLOGIES NEEDED BY RESEARCH KNOWLEDGE PROVIDERS 1. The Technology Strategy sets out six areas where technological developments are required to push the frontiers of knowledge
Grid Computing Perspectives for IBM
Grid Computing Perspectives for IBM Atelier Internet et Grilles de Calcul en Afrique Jean-Pierre Prost IBM France [email protected] Agenda Grid Computing Initiatives within IBM World Community Grid Decrypthon
Building Bioinformatics Capacity in Africa. Nicky Mulder CBIO Group, UCT
Building Bioinformatics Capacity in Africa Nicky Mulder CBIO Group, UCT Outline What is bioinformatics? Why do we need IT infrastructure? What e-infrastructure does it require? How we are developing this
