EUDAT - Open Data Services for Research
|
|
|
- Janel Franklin
- 10 years ago
- Views:
Transcription
1 EUDAT - Open Data Services for Research Per Öster
2 CSC at a Glance Founded in 1971 as a technical support unit for Univac 1108 Connected Finland to the Internet in 1988 Reorganized as a company, CSC Scientific Computing Ltd. in 1993 All shares to the Ministry of Education and Culture of Finland in 1997 Operates on a non-profit principle Facilities in Espoo, close to Otaniemi campus (of 15,000 students and 16,000 technology professionals) and Kajaani (data center in North-east Finland) Staff >250 Turnover 2015 ~34 million euros
3 EUDAT Open Data Services for Research Per Öster Director, Research Infrastructures CSC IT Center for Science Ltd ECMWF,
4 EUDAT: A pan-european e-infrastructure Solution for pan-european RI Data Challenges All Research Infrastructures are facing data challenges Where to store the growing amount of data? How to find it? How to make the most of it? Many communities are developing own solutions This is good but we also need to make sure that the solutions remain interoperable EUDAT offers a pan-european solution Providing a set of generic services to help RIs managing their growing amount of data Providing these services across communities to ensure minimum level of interoperability Linking community specific repositories to the largest European scientific data and HPC centers Collaborative Data Infrastructure (CDI) 4
5 Data Centers and Communities 5
6 User Forums + 30 Communities 1 st User Forum 7-8 March 2012, Barcelona 6
7 Services & Resources Covering both access and deposit, from informal data sharing to long-term archiving, and addressing identification, discoverability and computability of both long-tail and big data, EUDAT s services will address the full lifecycle of research data 7
8 B2DROP is a secure and trusted data exchange service for researchers and scientists to keep their research data synchronized and up-to-date and to exchange with other researchers. An ideal solution to: Store and exchange data with colleagues and team b2drop.eudat.eu Synchronize multiple versions of data Ensure automatic desktop synchronization of large files
9 B2SHARE is a user-friendly, reliable and trustworthy way for researchers, scientific communities and citizen scientists to store and share small-scale research data from diverse contexts. A winning solution to: Store: facilitates research data storage b2share.eudat.eu Preserve: guarantees long-term persistence of data Share: allows data, results or ideas to be shared worldwide
10 A four-click service b2share.eudat.eu
11 B2SAFE is a robust, safe and highly available service which allows community and departmental repositories to implement data management policies on their research data across multiple administrative domains in a trustworthy manner. A solution to: eudat.eu/b2safe Provide an abstraction layer which virtualizes large-scale data resources Guard against data loss in long-term archiving and preservation Optimize access for users from different regions Bring data closer to powerful computers for compute-intensive analysis
12 B2SAFE Features eudat.eu/b2safe based on the execution of auditable data policy rules and the use of persistent identifiers (PIDs) respects the rights of the data owners to define the access rights for their data and to decide how and when it is made publicly referenceable data policies are centrally managed via a Data Policy Manager, and the policy rules are implemented and enforced by site-local rule engines able to aggregate data from different disciplines into a storage system of trustworthy and capable data service providers support for repository packages (e.g. DSPACE, FEDORA) and a lightweight HTTP-based solution
13 B2STAGE is a reliable, efficient, light-weight and easy-to-use service to transfer research data sets between EUDAT storage resources and highperformance computing (HPC) workspaces. The service allows users to: eudat.eu/b2stage Transfer large data collections from EUDAT storage facilities to external HPC facilities for processing In conjunction with B2SAFE, replicate community data sets, ingesting them onto EUDAT storage resources for long-term preservation Ingest computation results into the EUDAT infrastructure Access data through a RESTful HTTP interface (in progress)
14 B2FIND is a simple, user-friendly metadata catalogue of research data collections stored in EUDAT data centres and other repositories. A service which allows users to: b2find.eudat.eu Find collections of scientific data quickly and easily, irrespective of their origin, discipline or community Get quick overviews of available data Browse through collections using standardized facets
15 B2FIND Features supports faceted, geospatial and temporal metadata searches b2find.eudat.eu allows users to search and browse datasets via keyword searches initially available for communities in the EUDAT registered domain of data EUDAT will then extend the service to other interested and reliable data and metadata providers results displayed in user-friendly format and listed in order of relevance access to the scientific data objects is given through references provided in the metadata
16 A Federated and Distributed CDI Community data sites Generic data centres EUDAT is about providing solutions in a federated environment Independent and sustainable centers working within a common framework to develop shared services & policies Partnerships between legal entities relying on OLAs and SLAs
17 A Federated and Distributed CDI Community data sites Generic data centres Using EUDAT services: finding and accessing data, for instance, or storing smaller data sets by interacting with one of the CDI public front-end services vs Joining the CDI: implies a tighter integration with at least one of the EUDAT centre partnership between legal entities relying on OLAs and SLAs
18 Need an EUDAT Specific Offering? Storage capacities located at selected centers across Europe, based on clear SLAs Replication based on policy rules defined by the customer Possibility to use large-scale computing power close to the data Service based on clear SLAs with hosting centers (Open) Data Sharing platform tailored for specific needs (researchers, citizen scientists, etc.) B2DROP, B2SHARE plus extensions Dissemination and better discoverability and reusability of data sets B2FIND for data hosted both inside and outside of EUDAT 18
19 Pilot existing services Conclusion B2DROP, B2SHARE, and B2FIND available immediately through the Web with free access at the point of use Pilots within the EUDAT 2020 project can include customization of the service (e.g. B2SHARE extension, B2FIND mapping, etc.) B2SAFE and B2STAGE requires selection of hosting sites and agreement on resource allocation Limited free resources available for piloting service through the EUDAT 2020 project Contact EUDAT to discuss long term strategy and partnership on data management Data management and preservation policies and requirements Business models and partnership agreements (pre-paid resources, pay-per use, etc.) 19
20 Contacts Project Manager Damien Lecarpentier Scientific Coordinator Peter Wittenburg Web 20
European Data Infrastructure - EUDAT Data Services & Tools
European Data Infrastructure - EUDAT Data Services & Tools Dr. Ing. Morris Riedel Research Group Leader, Juelich Supercomputing Centre Adjunct Associated Professor, University of iceland BDEC2015, 2015-01-28
How To Build An Open Source Data Infrastructure
EUDAT Collaborative Data Infrastructure Towards the convergence of Compute, Data, Knowledge and Scientific Instruments Giuseppe Fiameni CINECA www.eudat.eu EUDAT receives funding from the European Union's
EUDAT. Towards a pan-european Collaborative Data Infrastructure. Willem Elbers
EUDAT Towards a pan-european Collaborative Data Infrastructure Willem Elbers EUDAT / MPI-TLA Focus meeting: Data repositories SURF, Utrecht March 3, 2014 Outline EUDAT project EUDAT services Summary and
How to gain and maintain ISO 27001 certification
Public How to gain and maintain ISO 27001 certification Urpo Kaila, Head of Security CSC IT Center for Science ltd. [email protected], [email protected] GÉANT SIG ISM 1 st Workshop, 2015-05-12, imperial.ac.uk
OpenAIRE Research Data Management Briefing paper
OpenAIRE Research Data Management Briefing paper Understanding Research Data Management February 2016 H2020-EINFRA-2014-1 Topic: e-infrastructure for Open Access Research & Innovation action Grant Agreement
Report of the DTL focus meeting on Life Science Data Repositories
Report of the DTL focus meeting on Life Science Data Repositories Goal The goal of the meeting was to inform and discuss research data repositories for life sciences. The big data era adds to the complexity
How To Write A Blog Post On Globus
Globus Software as a Service data publication and discovery Kyle Chard, University of Chicago Computation Institute, [email protected] Jim Pruyne, University of Chicago Computation Institute, [email protected]
Workprogramme 2014-15
Workprogramme 2014-15 e-infrastructures DCH-RP final conference 22 September 2014 Wim Jansen einfrastructure DG CONNECT European Commission DEVELOPMENT AND DEPLOYMENT OF E-INFRASTRUCTURES AND SERVICES
SURFsara Data Services
SURFsara Data Services SUPPORTING DATA-INTENSIVE SCIENCES Mark van de Sanden The world of the many Many different users (well organised (international) user communities, research groups, universities,
Exploitation of ISS scientific data
Cooperative ISS Research data Conservation and Exploitation Exploitation of ISS scientific data Luigi Carotenuto Telespazio s.p.a. Copernicus Big Data Workshop March 13-14 2014 European Commission Brussels
EUROPEAN COMMISSION Directorate-General for Research & Innovation. Guidelines on Data Management in Horizon 2020
EUROPEAN COMMISSION Directorate-General for Research & Innovation Guidelines on Data Management in Horizon 2020 Version 2.0 30 October 2015 1 Introduction In Horizon 2020 a limited and flexible pilot action
INTEGRATING RECORDS SYSTEMS WITH DIGITAL ARCHIVES CURRENT STATUS AND WAY FORWARD
INTEGRATING RECORDS SYSTEMS WITH DIGITAL ARCHIVES CURRENT STATUS AND WAY FORWARD National Archives of Estonia Kuldar As National Archives of Sweden Karin Bredenberg University of Portsmouth Janet Delve
Local Loading. The OCUL, Scholars Portal, and Publisher Relationship
Local Loading Scholars)Portal)has)successfully)maintained)relationships)with)publishers)for)over)a)decade)and)continues) to)attract)new)publishers)that)recognize)both)the)competitive)advantage)of)perpetual)access)through)
Italian Scientific Big Data Initiative
Italian Scientific Big Data Initiative Sanzio Bassini Director of Supercomputing Application & Innovation Department [email protected] Casalecchio di Reno (BO) Via Magnanelli 6/3, 40033 Casalecchio di
Interagency Science Working Group. National Archives and Records Administration
Interagency Science Working Group 1 National Archives and Records Administration Establishing Trustworthy Digital Repositories: A Discussion Guide Based on the ISO Open Archival Information System (OAIS)
H2020 Guidelines on Open Data and Data Management Plan
H2020 Guidelines on Open Data and Data Management Plan CRR Centro Risorse per la Ricerca Multimediale Why? Open scientific research data should be easily discoverable, accessible, assessable, intelligible,
Open Access and Open Research Data in Horizon 2020
Open Access and Open Research Data in Horizon 2020 Celina Ramjoué Head of Sector Open Access to Scientific Publications and Data Digital Science Unit CONNECT.C3 22 November 2013 Train the Trainer for H2020
Metadata for Data Discovery: The NERC Data Catalogue Service. Steve Donegan
Metadata for Data Discovery: The NERC Data Catalogue Service Steve Donegan Introduction NERC, Science and Data Centres NERC Discovery Metadata The Data Catalogue Service NERC Data Services Case study:
e-infrastructures in Horizon 2020 Vision, approach, drivers, policy background, challenges, WP structure INFODAY France Paris, 25 mars 2014
e-infrastructures in Horizon 2020 Vision, approach, drivers, policy background, challenges, WP structure INFODAY France Paris, 25 mars 2014 Jean-Luc Dorel European Commission DG CNECT einfrastructure Vision
ESRC Research Data Policy
ESRC Research Data Policy Introduction... 2 Definitions... 2 ESRC Research Data Policy Principles... 3 Principle 1... 3 Principle 2... 3 Principle 3... 3 Principle 4... 3 Principle 5... 3 Principle 6...
RESEARCH DATA MANAGEMENT POLICY
Document Title Version 1.1 Document Review Date March 2016 Document Owner Revision Timetable / Process RESEARCH DATA MANAGEMENT POLICY RESEARCH DATA MANAGEMENT POLICY Director of the Research Office Regular
Environment Canada Data Management Program. Paul Paciorek Corporate Services Branch May 7, 2014
Environment Canada Data Management Program Paul Paciorek Corporate Services Branch May 7, 2014 EC Data Management Program (ECDMP) consists of 5 foundational, incremental projects which will implement
Business Proposition. Digital Asset Management. Media Intelligent
Business Proposition Digital Asset Management Executive Summary º º The Changing Face of Digital Asset Management Today, a true enterprise-class DAM solution must be the core component of an integrated
Big Data in the context of Preservation and Value Adding
Big Data in the context of Preservation and Value Adding R. Leone, R. Cosac, I. Maggio, D. Iozzino ESRIN 06/11/2013 ESA UNCLASSIFIED Big Data Background ESA/ESRIN organized a 'Big Data from Space' event
ENHANCED PUBLICATIONS IN THE CZECH REPUBLIC
ENHANCED PUBLICATIONS IN THE CZECH REPUBLIC PETRA PEJŠOVÁ, HANA VYČÍTALOVÁ [email protected], [email protected] The National Library of Technology, Czech Republic Abstract The aim of this
Federated Authentication and Credential Translation in the EUDAT Collaborative Data Infrastructure
Federated Authentication and Credential Translation in the EUDAT Collaborative Data Infrastructure Ahmed Shiraz Memon (JSC - DE) Jens Jensen (STFC escience - UK) Ales Cernivec (XLAB - SL) Krzysztof Benedyczak
Digital Asset Management Developing your Institutional Repository
Digital Asset Management Developing your Institutional Repository Manny Bekier Director, Biomedical Communications Clinical Instructor, School of Public Health SUNY Downstate Medical Center Why DAM? We
DRIVER Providing value-added services on top of Open Access institutional repositories
DRIVER Providing value-added services on top of Open Access institutional repositories Dr Dale Peters Scientific Technical Manager : DRIVER SUB Goettingen Germany Gaining the momentum: Open Access and
The challenges of digital preservation to support research in the digital age
DRAFT FOR DISCUSSION WITH ADVISORY COUNCIL MEMBERS ONLY The challenges of digital preservation to support research in the digital age Lynne Brindley CEO, The British Library November 2005 Agenda UK developments
Why long time storage does not equate to archive
Why long time storage does not equate to archive Jos van Wezel HUF Toronto 2015 STEINBUCH CENTRE FOR COMPUTING - SCC KIT University of the State of Baden-Württemberg and National Laboratory of the Helmholtz
CLASS and Enterprise Solutions Rick Vizbulis. CLASS and Enterprise Solutions
NOAA Science Advisory Board s December 7-8, 2006 CLASS and Enterprise Solutions CLASS and Enterprise Solutions Rick Vizbulis 1 Agenda! CLASS history! What is an archive?! Archive responsibilities! What
ECRIN (European Clinical Research Infrastructures Network)
ECRIN (European Clinical Research Infrastructures Network) Wolfgang Kuchinke University of Duesseldorf (HHU) and ECRIN EUDAT 1st User Forum 7 March 2012 8 March 2012, Barcelona 1 What is ECRIN? European
Introduction to Research Data Management for Social Scientists
Introduction to Research Data Management for Social Scientists Astrid Recker & Sebastian Netscher CESSDA Training at the Data Archive for the Social Sciences GESIS - Leibniz Institute for the Social Sciences
Globus Research Data Management: Introduction and Service Overview
Globus Research Data Management: Introduction and Service Overview Kyle Chard [email protected] Ben Blaiszik [email protected] Thank you to our sponsors! U. S. D E P A R T M E N T OF ENERGY 2 Agenda
How To Useuk Data Service
Publishing and citing research data Research Data Management Support Services UK Data Service University of Essex April 2014 Overview While research data is often exchanged in informal ways with collaborators
Horizon 2020. Research e-infrastructures Excellence in Science Work Programme 2016-17. Wim Jansen. DG CONNECT European Commission
Horizon 2020 Research e-infrastructures Excellence in Science Work Programme 2016-17 Wim Jansen DG CONNECT European Commission 1 Before we start The material here presented has been compiled with great
CAREER TRACKS PHASE 1 UCSD Information Technology Family Function and Job Function Summary
UCSD Applications Programming Involved in the development of server / OS / desktop / mobile applications and services including researching, designing, developing specifications for designing, writing,
Jochen Schirrwagen, Najko Jahn. Bielefeld University Library, Germany. Research in Context
Jochen Schirrwagen, Najko Jahn Bielefeld University Library, Germany Research in Context In the light of recent results from OpenAIREplus and from the Library perspective Seminar to Access of Grey Literature
Data Publishing Workflows with Dataverse
Data Publishing Workflows with Dataverse Mercè Crosas, Ph.D. Twitter: @mercecrosas Director of Data Science Institute for Quantitative Social Science, Harvard University MIT, May 6, 2014 Intro to our Data
Harvard Library Preparing for a Trustworthy Repository Certification of Harvard Library s DRS.
National Digital Stewardship Residency - Boston Project Summaries 2015-16 Residency Harvard Library Preparing for a Trustworthy Repository Certification of Harvard Library s DRS. Harvard Library s Digital
FEDORA @ AWI Fedora User Meeting Copenhagen, Denmark 28 September, 2005
Photo: L. Tadday FEDORA @ AWI Fedora User Meeting Copenhagen, Denmark 28 September, 2005-1- Ana Ana Macario, Macario, Computer Center Center Alfred Wegener Alfred Wegener Institute Institute, for Polar
Mail Archive and Management System. Product White Paper
Mail Archive and Management System Product White Paper Regulatory Compliance Greater Emphasis of Information Security Regulations Placed on Emails Email has been widely adopted by worldwide enterprises
Documenting the research life cycle: one data model, many products
Documenting the research life cycle: one data model, many products Mary Vardigan, 1 Peter Granda, 2 Sue Ellen Hansen, 3 Sanda Ionescu 4 and Felicia LeClere 5 Introduction Technical documentation for social
Data at NIST: A View from the Office of Data and Informatics
Data at NIST: A View from the Office of Data and Informatics Robert Hanisch Office of Data and Informatics Material Measurement Laboratory National Institute of Standards and Technology Data and NIST 1
Data Management Plan. Name of Contractor. Name of project. Project Duration Start date : End: DMP Version. Date Amended, if any
Data Management Plan Name of Contractor Name of project Project Duration Start date : End: DMP Version Date Amended, if any Name of all authors, and ORCID number for each author WYDOT Project Number Any
Intelligent document management for the legal industry
Brochure Intelligent document management for the legal industry HP WorkSite The leading legal enterprise content management solution Sharing documents between legal teams, clients, and service providers
Research Data Management
Research Data Management 1 Why to we need to Manage Data? 2 Data Management Planning Typically covers: - What data will be created (format, types) and how? - How will the data be documented and described?
Big Data in the Digital Cultural Heritage
Big Data in the Digital Cultural Heritage Antonella Fresa, Promoter Srl DCH-RP Technical Coordinator 1 Table of Content Digitisation of Cultural Heritage Toward an e-infrastructure for Digital Cultural
The Czech Digital Library and Tools for the Management of Complex Digitization Processes
The Czech Digital Library and Tools for the Management of Complex Digitization Processes Martin LHOTÁK Library of the Academy of Sciences of the Czech Republic [email protected] INFORUM 2012: 18th Conference
31 December 2011. Dear Sir:
Office of Science and Technology Policy on behalf of National Science and Technology Council Attention: Ted Wackler, Deputy Chief of Staff Re: Response to Notice for Request for Information: Public Access
The data landscape lessons from UK
The data landscape lessons from UK Veerle Van den Eynden UK Data Archive University of Essex Faculty of Psychology and Educational Sciences University of Ghent, Belgium 23 October 2014 UK data landscape
Oxford Digital Asset Management System (DAMS) Update
Oxford Digital Asset Management System (DAMS) Update Neil Jefferies R&D Project Manager Systems & eresearch Services (SERS) Oxford University Library Services (OULS) Agenda Overview Fedora-Commons Honeycomb/ST5800
Research Data Management Guide
Research Data Management Guide Research Data Management at Imperial WHAT IS RESEARCH DATA MANAGEMENT (RDM)? Research data management is the planning, organisation and preservation of the evidence that
OPENGREY: HOW IT WORKS AND HOW IT IS USED
OPENGREY: HOW IT WORKS AND HOW IT IS USED CHRISTIANE STOCK [email protected] INIST-CNRS, France Abstract OpenGrey is a unique repository providing open access to European grey literature references,
Canadian National Research Data Repository Service. CC and CARL Partnership for a national platform for Research Data Management
Research Data Management Canadian National Research Data Repository Service Progress Report, June 2016 As their digital datasets grow, researchers across all fields of inquiry are struggling to manage
Digital Assets Repository 3.0. PASIG User Group Conference Noha Adly Bibliotheca Alexandrina
Digital Assets Repository 3.0 PASIG User Group Conference Noha Adly Bibliotheca Alexandrina DAR 3.0 DAR manages the full lifecycle of a digital asset: its creation, ingestion, metadata management, storage,
GEOG 482/582 : GIS Data Management. Lesson 10: Enterprise GIS Data Management Strategies GEOG 482/582 / My Course / University of Washington
GEOG 482/582 : GIS Data Management Lesson 10: Enterprise GIS Data Management Strategies Overview Learning Objective Questions: 1. What are challenges for multi-user database environments? 2. What is Enterprise
Research Data Management in Horizon 2020
Research Data Management in Horizon 2020 Dr. Fieke Schoots, UBL 11 / 6 / 2015 From : Guidelines on Open Access to Scientific Publications and Research Data in Horizon 2020 [v.1.0, 11/12/2013] Open access
Checklist and guidance for a Data Management Plan
Checklist and guidance for a Data Management Plan Please cite as: DMPTuuli-project. (2016). Checklist and guidance for a Data Management Plan. v.1.0. Available online: https://wiki.helsinki.fi/x/dzeacw
data.bris: collecting and organising repository metadata, an institutional case study
Describe, disseminate, discover: metadata for effective data citation. DataCite workshop, no.2.. data.bris: collecting and organising repository metadata, an institutional case study David Boyd data.bris
Big Data in BioMedical Sciences. Steven Newhouse, Head of Technical Services, EMBL-EBI
Big Data in BioMedical Sciences Steven Newhouse, Head of Technical Services, EMBL-EBI Big Data for BioMedical Sciences EMBL-EBI: What we do and why? Challenges & Opportunities Infrastructure Requirements
Integrated Rule-based Data Management System for Genome Sequencing Data
Integrated Rule-based Data Management System for Genome Sequencing Data A Research Data Management (RDM) Green Shoots Pilots Project Report by Michael Mueller, Simon Burbidge, Steven Lawlor and Jorge Ferrer
Competency frameworks COMPUTING AND INTERNET CERTIFICATE (C2i)
Competency frameworks COMPUTING AND INTERNET CERTIFICATE (C2i) Mission Numérique pour l'enseignement Supérieur (MINES - DGESIP) (Digital Mission for Higher Education) Ministry of Higher Education and Research
RESPONSE FROM GBIF TO QUESTIONS FOR FURTHER CONSIDERATION
RESPONSE FROM GBIF TO QUESTIONS FOR FURTHER CONSIDERATION A. Policy support tools and methodologies developed or used under the Convention and their adequacy, impact and obstacles to their uptake, as well
