How To Build An Open Source Data Infrastructure
- Suzan Wells
- 5 years ago
Transcription
1 EUDAT Collaborative Data Infrastructure: Towards the convergence of Compute, Data, Knowledge and Scientific Instruments. Giuseppe Fiameni, CINECA. EUDAT receives funding from the European Union's Horizon 2020 programme - DG CONNECT e-infrastructures. Contract No
2 EUDAT is... a pan-European initiative: building a sustainable cross-disciplinary and cross-national data infrastructure providing a set of shared services for accessing and preserving research data; supporting multiple research communities by working closely with them to deliver technical services as part of the EUDAT Collaborative Data Infrastructure (CDI); a consortium of high performance computing (HPC) / data centres, libraries, scientific communities, data scientists.
3 Delivering an integrated suite of common data services covering the full research data lifecycle and addressing both long-tail and big data. EUDAT's contribution to Open Science.
4 B2 SERVICE SUITE
5 Community-Driven: Biomedical & Medical Sciences; Materials & Analytical Facilities; MAPPER; Physical Sciences & Engineering. ICT Networking Session 6
6 Open Science & EUDAT. OPEN: all researchers, communities and infrastructures are encouraged to use / join the Collaborative Data Infrastructure. GLOBAL: pan-European infrastructure with global collaborations (US, Japan, ...). COLLABORATIVE: designed and driven by research communities and end users. CREATIVE: enabling scientists from any research discipline to find, access and process data, so that they can carry out research effectively. CLOSER TO SOCIETY: services and technology offers are available to all.
7 E-Infrastructure Collaboration. Partners shown on the slide: OpenAIRE, LERU, LIBER, RDA, EGI, PRACE, GEANT, Helix Nebula. Topics shown on the slide: DFT, data fabric, PID, metadata, practical policy; policy and guidelines; data management plans (DMP); policy & networking; output adoption; test beds; service integration; cloud catalogue; data cloud; cross-infra services & ops; common protocols, APIs; HPC/HTC/Clouds; DECI calls. Four interoperability pilots fostering the coupling of data and cloud resources. Large communities involved: BBMRI, ICOS, EISCAT-3D, ELIXIR. Contribution to regular PRACE calls by providing medium/long-term storage capacity and services. 10 potential pilots emerging from the last DECI Call (13th).
8 For more info:
9 Services description
10 Sync and Exchange Research Data: B2DROP, EUDAT's Personal Cloud Storage Service. B2DROP is a secure and trusted data exchange service for researchers and scientists to keep their research data synchronized and up-to-date and to exchange it with others. b2drop.eudat.eu
11 An ideal solution for researchers and scientists to: store and exchange data with colleagues and team members, including research data not finalized for publishing; share data with fine-grained access controls; synchronize multiple versions of data across different devices. Features: 20 GB storage per user; living objects, so no PIDs; versioning and offline use; desktop synchronisation.
12 Store and Share Research Data: B2SHARE. B2SHARE is a user-friendly, reliable and trustworthy way for researchers, scientific communities and scientists to store and share small-scale research data from diverse contexts. b2share.eudat.eu
13 A winning solution for researchers, scientists and communities to: store data safely at a trusted and certified data centre; preserve data to guarantee long-term persistence; control access and share data with colleagues and the world. Features: metadata management; permanent PIDs; Open Access support.
14 Replicate Research Data Safely: B2SAFE. B2SAFE is a robust, safe and highly available service which allows community and departmental repositories to implement data management policies on research data across multiple administrative domains in a trustworthy manner. eudat.eu/b2safe
15 The ideal solution for communities with no facility for archival to: replicate research data into secure data stores; archive and preserve research data in the long term; bring data close to powerful compute resources; co-locate data with different communities; benefit from economies of scale. Features: large-scale storage; robust and highly available; permanent PIDs.
16 Get Data to Computation: B2STAGE. B2STAGE is a reliable, efficient, light-weight and easy-to-use service to transfer research data sets between EUDAT storage resources and high-performance computing (HPC) workspaces. eudat.eu/b2stage
17 Facilitating communities to: move large amounts of data between data stores and high-performance compute resources; re-ingest computational results back into EUDAT; deposit large data sets onto EUDAT resources for long-term preservation. Features: high-speed transfer; reliable and light-weight; manages permanent PIDs.
18 Find Research Data: B2FIND. B2FIND is a simple, user-friendly metadata catalogue of research data collections stored in EUDAT data centres and other repositories. b2find.eudat.eu
19 A metadata catalogue service to: seek data objects and collections using powerful metadata searches; catalogue community data by means of selected metadata; browse through multi-disciplinary data collections filtered by content, provenance and temporal keywords. Features: simple to use; standards-based; comprehensive catalogue.
