Data at NIST: A View from the Office of Data and Informatics
|
|
|
- Ralf Curtis
- 10 years ago
- Views:
Transcription
1 Data at NIST: A View from the Office of Data and Informatics Robert Hanisch Office of Data and Informatics Material Measurement Laboratory National Institute of Standards and Technology
2 Data and NIST 1 NIST is a national and world resource for fundamental data Access should be easy and open With regard to IP and privacy issues As our nation s standards organization NIST should be a leader in national and international standards efforts for data discovery and access Discovery is fundamental Discovery is enabled by metadata standards Key research at NIST should engage in data sharing strategies from the onset NIST should provide an infrastructure that makes data and information sharing as easy as possible
3 NIST Public Data Access Policy 2 Establish NIST s commitment to providing public access to scientific research results Support governance of and best practices for managing peer-reviewed scholarly publications and digital scientific data across NIST Ensure effective access to and reliable preservation of NIST peer-reviewed scholarly publications and digital scientific data for use in research, development, education, and scientific discovery Enhance innovation and competitiveness by maximizing the potential to create new business opportunities Public-Access.pdf
4 NIST Public Data Access Policy 3
5 Implementation 4 Data management plans Enterprise Data Inventory data.gov
6 Data Management Plans 5
7 Data Management Plans 6
8 Data Management Plans 7
9 Data Management Plans 8
10 JSON Export to EDI, data.gov 9
11 Federated Architecture 10 Data Centers Local Publishing Registry harvest (pull) OAI/PMH Local Publishing Registry Full Searchable Registry search queries replicate Full Searchable Registry Users, applications
12 Office of Data and Informatics 11
13 ODI in context 12 NIST Material Measurement Laboratory Materials Science and Engineering Division Materials Measurement Services Division Biosystems and Biomaterials Science Division Biomolecular Measurement Division Chemical Sciences Division Applied Chemicals and Materials Division Office of Reference Materials Office of Data and Informatics
14 MML in context 13 Associate Director for Laboratory Programs Material Measurement Laboratory Communications Technology Laboratory Physical Measurement Laboratory Engineering Laboratory Information Technology Laboratory Center for Nanoscale Science and Technology NIST Center for Neutron Research
15 ODI Today: People 14 Robert Hanisch, director Data Services Group Lead Two web applications developers SRD sales staff (3) Materials science detailee Chemistry detailee Biology detailee Data systems architect Data interoperability specialist Informatics/analytics consultant ODI Advisory Group SRD Advisory Group
16 ODI Today: Activities 15 Data management plans, Enterprise Data Inventory (EDI) SRD modernization, audit, process MML Strategic Plan MML Data Working Group, Research Reproducibility user s group, electronic laboratory notebook evaluations, seminars, internal & external workshops NIST web site redesign, taxonomy NIST Library Materials Genome Initiative materialsdata.nist.gov Dspace repository Metadata standards MGI community portal genomicsdata.nist.gov in development Software discovery, citation, re-use Nanomaterials registry, nanomaterials metadata standards, National Jet Fuels Program, NIST Center for Automotive Lightweighting, NIH collaboration InChI Trust membership Domain Repositories Sustainability, interoperability DoC Data Working Group DoC Data Council selection committee Research Data Alliance, National Data Services Consortium
17 Materials Genome Initiative 16
18 materialsdata.nist.gov 17
19 materialsdata.nist.gov 18
20 Standard Reference Data 19 SRD Act of 1968 authorized NIST to create Standard Reference Data Copyright Cost recovery 90 databases, most are free to use
21 SRD Examples 20
22 Data Infrastructure Investments 21 NIST investing in infrastructure development to instigate changes in data management behaviors, assure compliance with OMB/OSTP policies on open data Four areas of focus Internal open data processes and tools: data publication process, deployment of Data Management Plan and Enterprise Data Inventory tools in federated architecture Data management infrastructure: cloud-based storage for working data, drag-and-drop user i/f and API, fast network link to cloud storage Data dissemination and public access: NIST data portal, SRD interface re-design, data center infrastructure for SRD and other public-facing data Effective utilization of commercial/public domain data tools: increase pace of review and potential adoption of data and collaboratory tools, move ~20 packages through NIST review processes (Skype, SpiderOak, OwnCloud, SciDrive, Socrata, Evernote, ELNs, etc.)
23 Research Data Alliance 22
24 CODATA 23
25 National Data Service 24
26 International Data Week 25 September 2016, DC area RDA Plenary, CODATA SciDataCon, ICSU World Data Service Sponsors being sought
27 NIST research data: Modernize the Standard Reference Data collection. Revamp interfaces and build applications programming interfaces (APIs) in order to make SRDs much more usable. Audit and rationalize SRD, special databases, etc. Implement the OMB/OSTP policies on providing public access to data. Build a tool to facilitate creation of Data Management Plans by researchers and working to show the value of DMPs independent of the Administration directives. Build a tool to allow researchers to deposit metadata for export to data.gov. Develop production-level infrastructure and populate it with persistent identifiers and metadata for all publicly available NIST data. Establish a solution center for good data management practices. Broker solutions to data management problems, advise on good practices and help staff set up the hardware/software infrastructure needed to support their data management challenges. Become a resource for data analytics and informatics solutions. Provide consultation from an informatics generalist who can help researchers in a variety of situations to find the most appropriate tools for understanding the patterns and characteristics of complex/large data sets. Support the Materials Genome Initiative National initiative to promote new materials discovery through deployment by linking data, models, and experiment. 7
28 NIST research data: ~10 year horizon 27 Expand the Standard Reference Data collection. Identify through internal and external inputs where new SRD are needed. Prioritize, scope, and find resources for development work Establish NIST as an exemplar federal agency in data management. Implement and share best practices for preservation, curation, discovery, re-use, and interoperability Facilitate community-based development of metadata standards & data models Participate in leadership of national and international data federation activities Research Data Alliance, National Data Services Consortium, CODATA and World Data System Contribute to solving the challenge of long-term sustainability of data repositories Share NIST-developed technologies to assist other agencies in improving data access and data services Collaborate with federal and non-federal organizations in developing and deploying common solutions Establish a data-aware, data-savvy culture at NIST Improve efficiency of experimentation and simulation Improve reliability and reproducibility of research results Increase value of NIST to the research and industrial communities
29 Summary 28 Goal of ODI is to help all MML scientists Data management plans Best practices Improved efficiency: do science, not bookkeeping Data publication and citation But MML data are incredibly diverse New metadata standards Must address interoperabiity at the appropriate granularity Can utilize many elements of the Virtual Observatory infrastructure and architecture to improve data management capabilities in MML and NIST Federation rather than centralization
30 Some things to think/worry about 29 Quality metadata is key for discovery, interoperability, re-use Reproducibility Integrity of the scientific process Metadata curation is non-trivial, can be costly Address interoperability at the proper scale Too wide: very expensive, difficult/impossible to reach consensus across disciplines; what is the scientific motivation? Too narrow: Scientific stovepipes, missed opportunities for discovery at the intersections of complementary data collections
31 Some things to think/worry about 30 Standards for metadata, data access protocols, etc., require community participation to assure take-up Major research organizations Professional societies (national, international) Recognized standards organizations RDA, CODATA, NDS, EUDAT, etc.
32 Some things to think/worry about 31 Little national commitment to sustaining infrastructure for open data Domain repositories often must (re)compete for basic resources, rely on complex business models Federal funding agencies require Data Management Plans, but provide no common infrastructure and no consistent review process Commercial academic publishers poised to take on data preservation roles; open data could move behind pay-walls
OpenAIRE Research Data Management Briefing paper
OpenAIRE Research Data Management Briefing paper Understanding Research Data Management February 2016 H2020-EINFRA-2014-1 Topic: e-infrastructure for Open Access Research & Innovation action Grant Agreement
NIH Commons Overview, Framework & Pilots - Version 1. The NIH Commons
The NIH Commons Summary The Commons is a shared virtual space where scientists can work with the digital objects of biomedical research, i.e. it is a system that will allow investigators to find, manage,
Report of the DTL focus meeting on Life Science Data Repositories
Report of the DTL focus meeting on Life Science Data Repositories Goal The goal of the meeting was to inform and discuss research data repositories for life sciences. The big data era adds to the complexity
Public Access Plan. U.S. Department of Energy July 24, 2014 ENERGY.GOV
Public Access Plan U.S. Department of Energy July 24, 2014 ENERGY.GOV Table of Contents Background... 3 Authority... 3 Public Access to Scientific Publications... 4 Scope... 4 Requirements... 5 Applicability...
Response from Oxford University Press, USA
OSTP Request for Information: Public Access to Peer- Reviewed Scholarly Publications Resulting from Federally Funded Research Response from Oxford University Press, USA To: Ted Wackler Deputy Chief of
Canadian National Research Data Repository Service. CC and CARL Partnership for a national platform for Research Data Management
Research Data Management Canadian National Research Data Repository Service Progress Report, June 2016 As their digital datasets grow, researchers across all fields of inquiry are struggling to manage
Databases & Data Infrastructure. Kerstin Lehnert
+ Databases & Data Infrastructure Kerstin Lehnert + Access to Data is Needed 2 to allow verification of research results to allow re-use of data + The road to reuse is perilous (1) 3 Accessibility Discovery,
How To Build An Open Source Data Infrastructure
EUDAT Collaborative Data Infrastructure Towards the convergence of Compute, Data, Knowledge and Scientific Instruments Giuseppe Fiameni CINECA www.eudat.eu EUDAT receives funding from the European Union's
EUROPEAN COMMISSION Directorate-General for Research & Innovation. Guidelines on Data Management in Horizon 2020
EUROPEAN COMMISSION Directorate-General for Research & Innovation Guidelines on Data Management in Horizon 2020 Version 2.0 30 October 2015 1 Introduction In Horizon 2020 a limited and flexible pilot action
Research Data Management Guide
Research Data Management Guide Research Data Management at Imperial WHAT IS RESEARCH DATA MANAGEMENT (RDM)? Research data management is the planning, organisation and preservation of the evidence that
Best Practices for Research Data Management. October 30, 2014
Best Practices for Research Data Management October 30, 2014 Presenters Andrew Johnson Research Data Librarian University Libraries Shelley Knuth Research Data Specialist Research Computing Outline What
How To Write A Blog Post On Globus
Globus Software as a Service data publication and discovery Kyle Chard, University of Chicago Computation Institute, [email protected] Jim Pruyne, University of Chicago Computation Institute, [email protected]
Open Access and Open Research Data in Horizon 2020
Open Access and Open Research Data in Horizon 2020 Celina Ramjoué Head of Sector Open Access to Scientific Publications and Data Digital Science Unit CONNECT.C3 22 November 2013 Train the Trainer for H2020
Summary of Responses to the Request for Information (RFI): Input on Development of a NIH Data Catalog (NOT-HG-13-011)
Summary of Responses to the Request for Information (RFI): Input on Development of a NIH Data Catalog (NOT-HG-13-011) Key Dates Release Date: June 6, 2013 Response Date: June 25, 2013 Purpose This Request
LJMU Research Data Policy: information and guidance
LJMU Research Data Policy: information and guidance Prof. Director of Research April 2013 Aims This document outlines the University policy and provides advice on the treatment, storage and sharing of
Agilent s Kalabie Electronic Lab Notebook (ELN) Product Overview ChemAxon UGM 2008 Agilent Software and Informatics Division Mike Burke
Agilent s Kalabie Electronic Lab Notebook (ELN) Product Overview ChemAxon UGM 2008 Agilent Software and Informatics Division Mike Burke Kalabie: Extending the OpenLAB Architecture Agilent User Interface
Exploring the roles and responsibilities of data centres and institutions in curating research data a preliminary briefing.
Exploring the roles and responsibilities of data centres and institutions in curating research data a preliminary briefing. Dr Liz Lyon, UKOLN, University of Bath Introduction and Objectives UKOLN is undertaking
DRIVER Providing value-added services on top of Open Access institutional repositories
DRIVER Providing value-added services on top of Open Access institutional repositories Dr Dale Peters Scientific Technical Manager : DRIVER SUB Goettingen Germany Gaining the momentum: Open Access and
THE BRITISH LIBRARY. Unlocking The Value. The British Library s Collection Metadata Strategy 2015-2018. Page 1 of 8
THE BRITISH LIBRARY Unlocking The Value The British Library s Collection Metadata Strategy 2015-2018 Page 1 of 8 Summary Our vision is that by 2020 the Library s collection metadata assets will be comprehensive,
The Platform is the Planet
The Platform is the Planet IoT Solutions in a Heterogeneous World Kevin Miller ([email protected]) Principal Program Manager, Azure IoT IoT Solutions Until Now Most earlier successful IoT deployments
Re: Public Access to Peer-Reviewed Scholarly Publications Resulting from Federally Funded Research Request for Information
December 19, 2011 Office of Science and Technology Policy National Science and Technology Council s Task Force on Public Access to Scholarly Publications 725 17 th Street Washington DC 20502 Via Email
RE: OSTP RFI: Public Access to Peer-Reviewed Scholarly Publications Resulting From Federally Funded Research
Attn: Office of Science and Technology Policy 725 17 th Street, Washington, DC 20501 RE: OSTP RFI: Public Access to Peer-Reviewed Scholarly Publications Resulting From Federally Funded Research Massachusetts
Best Practices for Good Data Management. February 19, 2015
Best Practices for Good Data Management February 19, 2015 Presenters Andrew Johnson Research Data Librarian University Libraries University of Colorado Boulder [email protected] Shelley Knuth
Scientific Business Intelligence using Pipeline Pilot
Scientific Business Intelligence using Pipeline Pilot Anneliese Appleton Accelrys, Sydney y What is Scientific Business Intelligence? Biz Analyst Management Scientist Engineer Biz Analyst Management Business
SHared Access Research Ecosystem (SHARE)
SHared Access Research Ecosystem (SHARE) June 7, 2013 DRAFT Association of American Universities (AAU) Association of Public and Land-grant Universities (APLU) Association of Research Libraries (ARL) This
NATIONAL CENTER FOR PUBLIC HEALTH INFORMATICS (CPE)
NATIONAL CENTER FOR PUBLIC HEALTH INFORMATICS (CPE) The National Center for Public Health Informatics (NCPHI) protects and improves the public s health through discovery, innovation, and service in health
data.bris: collecting and organising repository metadata, an institutional case study
Describe, disseminate, discover: metadata for effective data citation. DataCite workshop, no.2.. data.bris: collecting and organising repository metadata, an institutional case study David Boyd data.bris
Data Management Brown-bag/Seminar March 12, 2014
Data Management Brown-bag/Seminar March 12, 2014 Bonnie Bowen, Dept. Ecology, Evolution & Organismal Biology & SP@ISU Megan O Donnell, Scholarly Communications Librarian Data Images: wikipedia, lepidopteralovers.com,
Invenio: A Modern Digital Library for Grey Literature
Invenio: A Modern Digital Library for Grey Literature Jérôme Caffaro, CERN Samuele Kaplun, CERN November 25, 2010 Abstract Grey literature has historically played a key role for researchers in the field
Affiliation: University of Massachusetts Amherst, University Libraries
Date: January 12, 2012 Name: Marilyn S Billings Email: [email protected] Affiliation: University of Massachusetts Amherst, University Libraries City, State: Amherst, MA Summary: Thank you for
Big Data to Knowledge (BD2K)
Big Data to Knowledge () potential funding agency synergies Jennie Larkin, PhD Office of the Associate Director of Data Science National Institutes of Health idash-pscanner meeting UCSD September 16, 2014
Public Cloud Workshop Offerings
Cloud Perspectives a division of Woodward Systems Inc. Public Cloud Workshop Offerings Cloud Computing Measurement and Governance in the Cloud Duration: 1 Day Purpose: This workshop will benefit those
Checklist for a Data Management Plan draft
Checklist for a Data Management Plan draft The Consortium Partners involved in data creation and analysis are kindly asked to fill out the form in order to provide information for each datasets that will
Open Access to publications and research data in Horizon 2020
Open Access to publications and research data in Horizon 2020 Celina Ramjoué Head of Sector Open Access to Scientific Publications and Data Digital Science Unit CONNECT.C3 4 December 2013 Meeting of National
Data Management at UT
Data Management at UT Maria Esteva, TACC, [email protected] Colleen Lyon, UT Libraries, [email protected] Angela Newell, ITS, [email protected] What is data management? systematic organization
MOVING TO THE NEXT-GENERATION MEDICAL INFORMATION CALL CENTER
MOVING TO THE NEXT-GENERATION MEDICAL INFORMATION CALL CENTER Pharma companies are improving personalized relationships across more channels while cutting cost, complexity, and risk Increased competition
Standard Big Data Architecture and Infrastructure
Standard Big Data Architecture and Infrastructure Wo Chang Digital Data Advisor Information Technology Laboratory (ITL) National Institute of Standards and Technology (NIST) [email protected] May 20, 2016
DATA STEWARDSHIP from a geoscience and academic perspective
DATA STEWARDSHIP from a geoscience and academic perspective Margaret Leinen Vice Chancellor for Marine Science, UC San Diego Director, Scripps Institution of Oceanography Research Data Alliance - 5 San
LabArchives Electronic Lab Notebook:
Electronic Lab Notebook: Cloud platform to manage research workflow & data Support Data Management Plans Annotate and prove discovery Secure compliance Improve compliance with your data management plans,
DSpace: An Institutional Repository from the MIT Libraries and Hewlett Packard Laboratories
DSpace: An Institutional Repository from the MIT Libraries and Hewlett Packard Laboratories MacKenzie Smith, Associate Director for Technology Massachusetts Institute of Technology Libraries, Cambridge,
EUDAT. Towards a pan-european Collaborative Data Infrastructure. Willem Elbers
EUDAT Towards a pan-european Collaborative Data Infrastructure Willem Elbers EUDAT / MPI-TLA Focus meeting: Data repositories SURF, Utrecht March 3, 2014 Outline EUDAT project EUDAT services Summary and
Management of Research Data Procedure
Management of Research Data Procedure Related Policy Management of Research Data Policy Responsible Officer Deputy Vice Chancellor (Research) Approved by Deputy Vice Chancellor (Research) Approved and
DIGITAL STEWARDSHIP SUPPLEMENTARY INFORMATION FORM
DIGITAL STEWARDSHIP SUPPLEMENTARY INFORMATION FORM Introduction The Institute of Museum and Library Services (IMLS) is committed to expanding public access to federally funded research, data, software,
A grant number provides unique identification for the grant.
Data Management Plan template Name of student/researcher(s) Name of group/project Description of your research Briefly summarise the type of your research to help others understand the purposes for which
Data Management in NeuroMat and the Neuroscience Experiments System (NES)
Data Management in NeuroMat and the Neuroscience Experiments System (NES) Kelly Rosa Braghetto Department of Computer Science Institute of Mathematics and Statistics - University of São Paulo 1st NeuroMat
Administrative Manual
Administrative Manual AMS 14.40 World Bank Open Access Policy for Formal Publications 67830 April 2012 I. Policy Policy Rationale The World Bank supports the free online communication and exchange of knowledge
Best Practices for Data Management. RMACC HPC Symposium, 8/13/2014
Best Practices for Data Management RMACC HPC Symposium, 8/13/2014 Presenters Andrew Johnson Research Data Librarian CU-Boulder Libraries Shelley Knuth Research Data Specialist CU-Boulder Research Computing
Local Loading. The OCUL, Scholars Portal, and Publisher Relationship
Local Loading Scholars)Portal)has)successfully)maintained)relationships)with)publishers)for)over)a)decade)and)continues) to)attract)new)publishers)that)recognize)both)the)competitive)advantage)of)perpetual)access)through)
Functional Requirements for Digital Asset Management Project version 3.0 11/30/2006
/30/2006 2 3 4 5 6 7 8 9 0 2 3 4 5 6 7 8 9 20 2 22 23 24 25 26 27 28 29 30 3 32 33 34 35 36 37 38 39 = required; 2 = optional; 3 = not required functional requirements Discovery tools available to end-users:
Action full title: Universal, mobile-centric and opportunistic communications architecture. Action acronym: UMOBILE
Action full title: Universal, mobile-centric and opportunistic communications architecture Action acronym: UMOBILE Deliverable: D.6.10 - Data Management Plan Project Information: Project Full Title Project
Data Curation for the Long Tail of Science: The Case of Environmental Sciences
Data Curation for the Long Tail of Science: The Case of Environmental Sciences Carole L. Palmer, Melissa H. Cragin, P. Bryan Heidorn, Linda C. Smith Graduate School of Library and Information Science University
A NEW STRATEGIC DIRECTION FOR NTIS
A NEW STRATEGIC DIRECTION FOR NTIS U.S. Department of Commerce November 5, 2015 Refocusing NTIS to meet a 21st Century National Need 2 Overview Following a rigorous review of NTIS operations, the Commerce
National Data Sharing and Accessibility Policy (NDSAP)
Draft National Data Sharing and Accessibility Policy (NDSAP) 1. Introduction 1.1 Data are recognized at all levels as a valuable resource that should be made publicly available and maintained over time
The UK INCF Node and the CARMEN project. Presenter: Leslie Smith [email protected]
The UK INCF Node and the CARMEN project Presenter: Leslie Smith [email protected] Content Events and activities at UK INCF Node Recent Forthcoming The CARMEN project History, current status, ways
Data Management using irods
Data Management using irods Fundamentals of Data Management September 2014 Albert Heyrovsky Applications Developer, EPCC [email protected] 2 Course outline Why talk about irods? What is irods?
CYBERINFRASTRUCTURE FRAMEWORK FOR 21 ST CENTURY SCIENCE, ENGINEERING, AND EDUCATION (CIF21) $100,070,000 -$32,350,000 / -24.43%
CYBERINFRASTRUCTURE FRAMEWORK FOR 21 ST CENTURY SCIENCE, ENGINEERING, AND EDUCATION (CIF21) $100,070,000 -$32,350,000 / -24.43% Overview The Cyberinfrastructure Framework for 21 st Century Science, Engineering,
NSF Data Management Plan Template Duke University Libraries Data and GIS Services
NSF Data Management Plan Template Duke University Libraries Data and GIS Services NSF Data Management Plan Requirement Overview The Data Management Plan (DMP) should be a supplementary document of no more
Workprogramme 2014-15
Workprogramme 2014-15 e-infrastructures DCH-RP final conference 22 September 2014 Wim Jansen einfrastructure DG CONNECT European Commission DEVELOPMENT AND DEPLOYMENT OF E-INFRASTRUCTURES AND SERVICES
High Performance Computing Initiatives
High Performance Computing Initiatives Eric Stahlberg September 1, 2015 DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Cancer Institute Frederick National Laboratory is
A Holistic Framework for Enterprise Data Management DAMA NCR
A Holistic Framework for Enterprise Data Management DAMA NCR Deborah L. Brooks March 13, 2007 Agenda What is Enterprise Data Management? Why an EDM Framework? EDM High-Level Framework EDM Framework Components
Data Publishing Workflows with Dataverse
Data Publishing Workflows with Dataverse Mercè Crosas, Ph.D. Twitter: @mercecrosas Director of Data Science Institute for Quantitative Social Science, Harvard University MIT, May 6, 2014 Intro to our Data
A Big Picture for Big Data
Supported by EU FP7 SCIDIP-ES, EU FP7 EarthServer A Big Picture for Big Data FOSS4G-Europe, Bremen, 2014-07-15 Peter Baumann Jacobs University rasdaman GmbH [email protected] Our Stds Involvement
Cloud and Big Data Standardisation
Cloud and Big Data Standardisation EuroCloud Symposium ICS Track: Standards for Big Data in the Cloud 15 October 2013, Luxembourg Yuri Demchenko System and Network Engineering Group, University of Amsterdam
Stewarding Big Data: Perspectives on Public Access to Federally Funded Scientific Research Data
Stewarding Big Data: Perspectives on Public Access to Federally Funded Scientific Research Data Big Data and Big Challenges for Law and Legal Information Georgetown Law Library January 30, 2013 William
