data infrastructures framework for action for H2020
  Event: Open Access Policy in Portugal, Lisbon, 17 June 2013
  Carlos Morais Pires, European Commission, e-infrastructures, DG CNECT.C1
  Author's views do not commit the European Commission
summary
  Policy framework related to data infrastructures
  Data as Infrastructure: Europe is "Riding the Wave"
  Implementing Interoperable Data Infrastructure: balancing community-driven and service-driven initiatives
  Data Infrastructure in FP7 (examples of project initiatives)
  H2020 workprogramme under construction
  Main Messages to conclude
Policy context
  A Reinforced European Research Area Partnership for Excellence and Growth, COM(2012) 392, July 2012
  Towards better access to scientific information: boosting the benefits of public investments in research, COM(2012) 401 final, July 2012
  Commission Recommendation on access and preservation of scientific information, C(2012) 4890 final, July 2012
Better Access to Scientific Information
  (slide from Neil Jacobs (JISC) presentation to the EC)
  [Figure: status of national policies in four areas: OA Publications, OA Data, Preservation, e-infrastructures]
  Legend:
    National policy not formulated
    National policy formulated but not implemented
    National policy formulated, implemented but no outcomes yet
    National policy formulated, implemented, and outcomes delivered
data as infrastructure: Europe is "Riding the Wave"
  The High Level Expert Group on Scientific Data presented "Riding the Wave" in October 2010
  Vision: "data e-infrastructure that supports seamless access, use, re-use, and trust of data. In a sense, the physical and technical infrastructure becomes invisible and the data themselves become the infrastructure, a valuable asset on which science, technology, the economy and society can advance".
useful definitions
  Data: digitally recorded factual material commonly accepted in the scientific community as necessary to validate research findings (this does not include lab notebooks, preliminary analyses, drafts of scientific papers, plans for future research, peer review reports, communication with peers, physical objects, lab specimens) [cf. White House Memo on "Increasing Access to the Results of Federally Funded Scientific Research"]
  Data infrastructures: services, applications, tools, knowledge and policies for research data to be discoverable, understandable, accessible, preserved and curated, and available 24/7
implementing interoperable data infrastructure
  (a) data generators: research projects, big research infrastructures, installations or medium-size laboratories, simulation centres, surveys or individual researchers
      community-driven data infrastructure, including ESFRI, ESFRI clusters and others
  (b) discipline-specific data service providers, providing data and workflows as a service
  (c) providers of generic common data services (computing centres, libraries)
  (d) researchers as users, using the data for science and engineering
data infrastructures in FP7
  CNECT: 96 Meuro of EC contribution, 5 Calls for proposals
    First two calls (45 Meuro): probing the European Research Data Space
    Third call (4 Meuro): FP7 OA Pilot/OpenAIRE
    Fourth call (45 Meuro): structuring the European Research Data Space along the "Riding the Wave" strategy
    Fifth call (2 Meuro): iCORDI
  Other projects closely related to data infrastructures were funded in other parts of the programme (~80 Meuro): distributed computing, grids, virtual research environments, earth-server, ...
thematic distributed data infrastructures in FP7
  RTD: Topics targeting thematic distributed data infrastructures or thematic networks of RI providing data services were included in all five FP7 RTD Calls for proposals
  More than 170 Meuro of EC contribution
  Preparatory Phase Projects: ELIXIR, ISBE, ICOS, LIFEWATCH, CLARIN, DARIAH, CESSDA, ...
  Implementation clusters: DASISH, BIOMEDBRIDGES, ENVRI, CRISP
  Integrating Activities: SEADATANET II, UP-GRADE-BS-SCENE, ACTRIS, NERA, IS-ENES, INGOS, JERICO, SLING, BBMRI-LPC, DwB, INGRID, ARIADNE, CENDARI, EHRI, ...
  ERANET and Policy support measures: SIM4RDM, COOPEUS, CREATIVE-B, DARECLIMED
  Legend: Life Science, Environment, SSH, Other
data infrastructure: bridging islands
  [Figure labels: bridges; scientific data infrastructure; distributed computing/software infrastructure; network infrastructure, GÉANT]
Scientific Information Infrastructure
  Open. Share. Re-use.
  Science. Set Free.
  Research results. Linked.
OpenAIRE - information pages
  OA
  The National Research Environment (research institutions, funding)
  Open Access and Repositories (awareness, repositories, journals, organizations)
  Contact details of the Open Access Desk
  22 more countries
  http://www.openaire.eu/en/nlo/country-information.html
OpenAIRE: support to research metrics
Data driven research across disciplinary and geographical boundaries
  Register relevant data objects stored in certified repositories
  Virtually integrate data objects in trusted federations
  Foster advancements in interoperability of object content
  Fragmentation and heterogeneity of data require standardization
  [Figure labels: ARGO, MetaNet, INCF, Health, echild, Collaborative Data Infrastructure, EUDAT Scenario, DESY, European Data Centers]
service-driven data e-infrastructures OA Publication Infrastructure Open Data Infrastructures
community-driven data e-infrastructures
  SCIDIP-ES (Earth Observation Long Term Data Preservation)
  Adapted from a slide of Dr. Mirco Albani (ESA), project leader of SCIDIP-ES
community-driven data e-infrastructures The Virtual Observatory concept is a bold community-led response to the challenges the astronomical community faces in data management and storage. Impressive progress has been made and the momentum of the International Virtual Observatory Alliance will ensure sustained progress.
Implementation Cluster for SSH
  DASISH provides solutions to a number of common issues for the five projects in social sciences and humanities
  The projects work together along four major areas of common concern: data quality, data archiving, data access, and legal and ethical issues
  The outcome of this work will form the basis for educational activities and for outreach to the communities of researchers that will benefit from these infrastructures
Implementation Cluster for Life Science
  BioMedBridges
  EU funding: 10.5 Meuro, started in 2012
  All ESFRI Life Sciences infrastructures, coordinated by EMBL
  Interoperability across data sources and services
  [Figure labels: INSTRUCT, ECRIN, Euro-Bio-imaging, BSL4, EMBRC, BIOBANKS-BBMRI, EU-Openscreen, INFRAFRONTIER, EATRIS, EBI-ELIXIR]
Implementation cluster for Environment
  ENVRI
  EU funding: 3.7 Meuro, started in 2011
  Development of a common reference model, standards, and common components for data pre-processing and post-processing
  Contribution to GEOSS (Global Earth Observation System of Systems) and compliance with the INSPIRE EC Directive
  Large participation of ICT and e-infrastructures actors (key partners from D4SCIENCE, GENESI, EGI, EUDAT, PRACE)
  [Figure labels: EURO-ARGO, SIOS, LIFEWATCH, EPOS, EISCAT, EMSO, ICOS]
Implementation cluster for Physics, Astronomy
  CRISP
  All ESFRI Physics, Astronomy and Analytical infrastructures, coordinated by ESRF
  Seeking synergies between 11 ESFRI Projects totalling more than 9 billion euro of investment volume
  16 project partners from 12 Member States with a total operating budget of 1.5 billion euro per year
  [Figure labels: FAIR, ELI, ESRF, SLHC, SPIRAL2, EUROFEL, ESS, XFEL, ILL upgrade, SKA, ILC-HiGrade]
Research Data Alliance: Common Infrastructure, Policy and Practice Drives Data Sharing and Exchange throughout the Data Life Cycle From Prof. Fran Berman and Prof. John Wood, Members of the RDA Council
consultation towards Horizon 2020
H2020 workprogramme under construction
  Community data services
  E-Infrastructure for Open Access
  Managing, preserving and computing with big research data
  Towards global data e-infrastructures
  Skills and professions for e-infrastructures
  Integration of Core and Basic Operations Services for e-infrastructures
  e-infrastructures for virtual research environments (VRE)
  Centres of Excellence for computing applications
  PRACE
  Network of Competence Centres for SMEs
  GÉANT
  These lines are related to the content of the Framework for Action
main messages to conclude
  Research Data "is" an Infrastructure for modern science
    Data is generated and used by disciplinary communities
    Data is stored, moved and processed by common infrastructures
  Crossing disciplinary and geographic boundaries requires
    exploring the commonalities of data infrastructures
    implementing global and interoperable data infrastructures
  Policies for Open Access
    remove, where possible, barriers to accessing and sharing data
    H2020 will make OA to publications the rule
    H2020 will start a pilot on OA to publicly funded research data
Carlos Morais Pires
  Thank you for your attention