Crafting the Data Management Plan. NSF CAREER Workshop Summer 2013



Similar documents
WHAT SHOULD NSF DATA MANAGEMENT PLANS LOOK LIKE

Virginia Commonwealth University Rice Rivers Center Data Management Plan

NSF Data Management Plan Template Duke University Libraries Data and GIS Services

Introduction to Research Data Management. Tom Melvin, Anita Schwartz, and Jessica Cote April 13, 2016

Data Management at UT

Instrument Used to Analyze the DMPs at the University of Minnesota

Best Practices for Research Data Management. October 30, 2014

BACKUP SECURITY GUIDELINE

Lesson 3: Data Management Planning

Research Data Management PROJECT LIFECYCLE

Data Management Plans & the DMPTool. IAP: January 26, 2016

RESEARCH DATA MANAGEMENT POLICY

LabArchives Electronic Lab Notebook:

A grant number provides unique identification for the grant.

Research Data Ownership, Retention, Access, and Security

Best Practices for Data Management. RMACC HPC Symposium, 8/13/2014

Introduction Thanks Survey of attendees Questions at the end

Data management plan

LJMU Research Data Policy: information and guidance

CIP s Open Data & Data Management Guidelines and Procedures

Research Data Management Procedures

STATE COASTAL CONSERVANCY/CA SEA GRANT NORTH CENTRAL COAST MPA BASELINE PROGRAM,

The RDMSG : Data Management Planning and More

Information Technology Acceptable Use Policy

Edinburgh Napier University. Research Data Management Policy

Creating a Data Management Plan for your Research

NSF Data Management Plan Requirement

Data Management. Courtesy of Calisphere: SUSAN BORDA DIGITAL CURATION LIBRARIAN

Best Practices for Good Data Management. February 19, 2015

Litigation Support. Learn How to Talk the Talk. solutions. Document management

FARMINGTON PUBLIC SCHOOLS 2410

An Introduction to Managing Research Data

Larry Patterson, City Manager

Protect the Past, Secure the Future

Twente Grants Week: Data management. Maarten van Bentum (Library & Archive)

Research Data Management Policy. April 2015

Research Data Management Policy

Checklist and guidance for a Data Management Plan

Johns Hopkins University Data Management Services

Library Strategic Planning

SCICEX boat-to-archive route map and data management template Biological and Chemical Samples

Purpose: To ensure that e-discovery Requests and Litigation Hold Notices are received, routed and responded to in a timely and thorough manner.

Record Custodian to Health Information Steward Best Practices in Record Retention, Storage, and Destruction

Data Management Plan Template Guidelines

Research and the HIPAA Security Rule Prepared for the Association of American Medical Colleges by Daniel Masys, M.D. Professor and Chairman,

REBs & Data Management Plans: Conflict & Coexistence Susan Babcock and Chuck Humphrey, University of Alberta CAREB Conference, Vancouver,

Research Data Management For Researchers

University of Pittsburgh Data Center Information Security

Preservation and Production of Electronic Records

The Libraries Role in Research Data Management: A Case Study from the University of Minnesota

Facilitate Open Science Training for European Research

How To Manage The Sas Metadata Server With Ibm Director Multiplatform

STORRE: Stirling Online Research Repository Policy for etheses

NATIONAL UNIVERSITY CORPORATION HOKKAIDO UNIVERSITY TANGIBLE RESEARCH AND DEVELOPMENT RESULTS HANDLING REGULATIONS

Symantec Backup Exec 11d for Windows Small Business Server

Document Management Plan Preparation Guidelines

EPSRC Research Data Management Compliance Report

University-wide Data Management Services: Cross-campus Collaborations at Indiana University

How To Use The Hitachi Content Archive Platform

15 Organisation/ICT/02/01/15 Back- up

NEW FEATURES ORACLE ESSBASE STUDIO

Learning Enhancement Grant Program Grant Application. Applicant Information

OECD SERIES ON PRINCIPLES OF GOOD LABORATORY PRACTICE AND COMPLIANCE MONITORING NUMBER 10 GLP CONSENSUS DOCUMENT

Considerations for Management of Laboratory Data

DATA MANAGEMENT FOR QUALITATIVE DATA USING NVIVO9

Life Cycle of Records

Symantec NetBackup for Microsoft SharePoint Server Administrator s Guide

How To Write A Health Care Security Rule For A University

Information Security Risk Assessment Checklist. A High-Level Tool to Assist USG Institutions with Risk Analysis

VMware vcloud Air HIPAA Matrix

Best Practices for Data Management. September 24, 2014

University of Bristol. Research Data Storage Facility (the Facility) Policy Procedures and FAQs

Using Technology Equipment to Teach Chemistry Laboratory Exercises in Community Colleges

Introduction to the Data Migration Framework (DMF) in Microsoft Dynamics WHITEPAPER

WHITE PAPER. HIPPA Compliance and Secure Online Data Backup and Disaster Recovery

2. SUMMER ADVISEMENT AND ORIENTATION PERIODS FOR NEWLY ADMITTED FRESHMEN AND TRANSFER STUDENTS

CORPORATE RECORD RETENTION IN AN ELECTRONIC AGE (Outline)

WEB PUBLISHING GUIDELINES

Outline of a Research Data Management Policy for Australian Universities / Institutions

Contents The College of Information Science and Technology Undergraduate Course Descriptions

STATE OF NEBRASKA STATE RECORDS ADMINISTRATOR DURABLE MEDIUM WRITTEN BEST PRACTICES & PROCEDURES (ELECTRONIC RECORDS GUIDELINES) OCTOBER 2009

Training for de-identifying human subjects data for sharing: a viable library service

Introduction. What Can an Offline Desktop Processing Tool Provide for a Chemist?

STUDENT RECORD POLICY, PROCEDURES AND DEFINITIONS

Recommendations of the Campus Data Management and Curation Advisory Committee

How Cisco IT Uses SAN to Automate the Legal Discovery Process

<Choose> Addendum Windows Azure Data Processing Agreement Amendment ID M129

Is online backup right for your business? Eight reasons to consider protecting your data with a hybrid backup solution

Harnessing Media Micro-Services for Stewardship of Digital Assets at CUNY TV. NDSR Project:

Nevada NSF EPSCoR Track 1 Data Management Plan

State of Michigan Records Management Services. Guide to E mail Storage Options

Enrollment for Education Solutions Addendum Microsoft Online Services Agreement Amendment 10 EES

MDSplus Automated Build and Distribution System

United Cerebral Palsy of Greater Chicago Records and Information Management Policy and Procedures Manual, December 12, 2008

Document Imaging Trend Report Transportation Sector Prepared by: Pamela Doyle (March, 2007) Table of Contents

Ex Libris Rosetta: A Digital Preservation System Product Description

6. FINDINGS AND SUGGESTIONS

L. Staton Noel III, MS, MBA Director

Ensuring HIPAA Compliance with AcclaimVault Online Backup and Archiving Services

Transcription:

Crafting the Data Management Plan NSF CAREER Workshop Summer 2013

Today s session: What is a DMP? Guiding questions to answer for your DMP Data preservation and strategies for sharing your data Sample DMPs

What is a Data Management Plan? Brief description of how you will comply with funder s data sharing policy Typically reviewed as part of the grant application Requirements vary by funding agency and NSF directorate

Why data management? Why now? Conform to the Jan 2011 NSF guideline on the dissemination and sharing of research results Enable others to access/use data from funded projects Create research networks Move information from lab to the community or commerce more quickly Transparency/quality/credibility

Other outcomes of DMPs Simplify requests for data Increase visibility and impact of your research Clearly document and provide evidence for your research in conjunction with published results Comply with copyright and ethical compliance (ie. HIPAA). Preserve data to safeguard from loss Support open access Facilitate new discoveries

What does the NSF say? Proposals must include a supplementary document of no more than two pages should describe how the proposal will conform to NSF policy on the dissemination and sharing of research results Fastlane will not permit submission of a proposal that is missing a Data Management Plan. may include only the statement that no detailed plan is needed, as long as the statement is accompanied by a clear justification will be reviewed as an integral part of the proposal Directly quoted from http://www.nsf.gov/pubs/policydocs/pappguide/nsf11001/gpg_2.jsp#dmp

DMP templates and guidance DMPTool https://dmp.cdlib.org/ Create ready-to-use data management plans for specific funding agencies with step-by-step instructions and guidance for data management plan SciDaC templates Questions to consider Directorate guidance

Data management guide http://guides.ucf.edu/data

What to include Types of data, samples, and other materials to be produced in the course of the project. Retention period? Data and metadata standards for format and content Access and sharing policies including provisions for appropriate protection of privacy or other rights Re-use policies and provisions re-distribution and the production of derivatives Archiving plans for data, samples, and other research products, and for preservation of access to them.

DMP Examples Data Management Plan. 1. Products of the Research. The data generated from this project primarily includes the structural characterization of new compositions of matter, the kinetic data and the microscopy information (Aims 1-3). All of these will be translated into numerical/digital data for dissemination in the form of publications and presentations. For example, the polyamine derivatives and reagents will be purified and characterized by 1 H and 13 C NMR, mass spectrometry, and elemental analysis. In addition, scans of the NMR spectra for each new compound will be stored as pdf files. These will be stored on a lab server for inclusion as future Supporting Information in publications. In this manner, the lab students can view the spectral data in addition to reading its numeric translation (e.g., δ 2.25 (s, 3H)) described in each student s digital notebook. The raw kinetic data obtained during the silica condensation experiments in Aim 3 will be collected in an Excel spreadsheet and stored for future review, if needed. Interested parties will also have access to all the relevant experimental data upon publication via the Supporting Information section posted online by the publisher. The newly synthesized compounds are also entered into a structure-searchable database created via the ChemBioFinder software package (CambridgeSoft). This database is available to the PI s lab personnel to help them find each synthesized compound in the lab freezers and refrigerators. Labeled plastic boxes contain water-resistant labeled vials, which house each compound as a pure solid or liquid. Note: a complete chemical inventory of the laboratory is stored as an Excel spreadsheet on the central lab server and is updated upon the purchase of new chemicals, etc. In this regard, the PI has complete inventories of all commercial and custom chemicals within the laboratory space. A structure-activity relationship relating platform geometries (dihedral angles which orient the amide carbonyl groups), rate of assembly and final morphology will be developed to provide a predictive model for future silaffin mimic designs. 2. Data Format. The data will be stored in each researcher s digital notebook folder housed on the PI s central lab server as primarily Microsoft Word (.doc) and Excel (.xls) files and Adobe Acrobat (.pdf) files. Microscopy images are stored as.tiff and.jpg formats. In this regard the data is directly compiled into a digital format which facilitates its dissemination. Since the PI s group publishes full papers with complete characterization data and extensive Supporting information included, interested parties can easily obtain this information online after publication. 3. Access to Data, Data Sharing Practices and Policies. a) Access to Data. The PI and UCF maintain three UCF websites which detail the PI s research interests in each of his affiliated departments/schools. The websites are associated with the UCF Department of Chemistry (courtesy appointment), Burnett School of Biomedical Sciences (member of the PhD program) and College of Medicine s (COM) Department of Medical Education (full time faculty member). These sites provide brief summaries of the PI s area of expertise as well as links to his publications. In terms of future publications and associated Supporting Information, the research team plans to provide experimental details and data in the form of scanned NMR data (pdf files) and raw kinetic data in the form of tables or graphs. Data is not released prior to disclosure to UCF or publication. The PI s policy has been to rapidly publish as soon as the project is complete and to post links to his latest papers after they appear online from the publisher. b) Data Sharing. As a biochemistry group developing new technologies, the PI is required by UCF to disclose all new compositions of matter and processes to the UCF Office of Research and Commercialization (UCF-ORC) prior to publication or public disclosure in order to protect the University s (and PI s) intellectual property rights to the data generated. These disclosures are reviewed by the UCF-ORC and often patent applications are filed on behalf of UCF. Prior to sending new compounds or probes to other researchers worldwide, UCF-ORC and the interested party complete a Material Transfer Agreement (MTA). If UCF-ORC elects to pursue a patent on the new compound or technology, then a Non-Disclosure Agreement (NDA) or Confidentiality Agreement (CA) are completed prior to shipment to the interested party. These agreements fully delineate the expectations of both parties and the legal responsibility of the PI, UCF and its new partner regarding publications, patents and disclosures. The primary data is stored on the password-protected lab server for additional security. Summaries of the work will be published in COMMUNIQUE, a COM electronic magazine for central Florida outreach. 4. Policies for Re-Use, Re-Distribution and Production of Derivatives. The policy regarding the use of data provided via general access is that no data will be disclosed to the public prior to disclosure and approval for disclosure by UCF-ORC. This is to protect the intellectual property of UCF and the PI. In this regard, there are no posted disclaimers or conditions placed on the posted PI s website data, since the posted information describes already-published information. There are no restrictions placed on the publicly disclosed data in terms of derivatives. 5. Archiving of Data. In terms of the PI s laboratory, all research information is housed on a central lab server (Black Armor) located in the student office. This involves a RAID backup system using a redundant hard drive system (1 TB). A third copy of the data is backed-up on a remote hard drive located in the adjacent PI s office. In this regard, the PI has three copies of the lab s information backed up nightly. The backup drive in the PI s office is an important additional repository to help protect the information from a localized lab fire or minor water leaks, damage etc. The lab backup systems will be updated by the PI as computer technology/standards change and the system becomes obsolete or non-functional.the PI has the lead responsibility for maintaining the lab database. Since the backup involves automated processes using the Nero software, there is limited maintenance required beyond routine checks of backup scheduling and purging of old incremental archives, etc. Lastly, the ultimate archive of the PI s data is in the form of each publication and its Supporting information section online. In organic chemistry one typically looks in the Supporting Information associated with the pertinent publication for experimental details. Indeed, having the experimental data linked to the publication at the publisher s website (e.g., pubs.acs.org) makes for a more efficient search process. In this regard, the PI elects to continue to have a full disclosure of experimental details upon publication in the form of a detailed Supporting Information component to his publications. The PI s significant publication record of full papers in journals, which have a Supporting Information component (e.g., J. Med. Chem. and J. Org. Chem.), is consistent with this choice. In summary, the PI has a well-organized laboratory space and a clear plan for data acquisition, data archiving and dissemination. The combination of rapid full disclosure in the form of full papers with extensive Supporting Information included makes this an efficient plan for data management.

DMP Examples Sample Data Management Plan: Physical Sciences and Engineering From Rice University 1. Products of the Research The data obtained during the proposed project will consist of measurement records of electronic and optical properties of the nanodevices, as described in the main body of the proposal. These records will consist chiefly of measurements of current as a function of voltage, conduction as a function of time, and Raman spectra as a function of wavenumber obtained via custom software. These data will be recorded via computerized data acquisition software, with essential metadata present either as header in the relevant electronic files, or included along with the indexed laboratory notebook narrative. These data will provide an experimental look at the detailed dissipative processes at work in nanoscale junctions, as delineated in the main body of the proposal. As such, they will be of interest to the nanoelectronics community, as well as to the condensed matter physics and physical chemistry communities. 2. Data format The electronic transport and optical spectra will be computer files generally in the form of tabdelimited numbers with header information. These computer files will be accompanied by dated laboratory notebooks, as well as by numerical data files analyzed using MATLAB and commercial plotting software, such as OriginLab. A copy of the custom Raman spectrometer data acquisition software also will be included, since it is essential for interpreting the raw data files. 3. Access to data, data sharing practices and policies The electronic data will be preserved in multiple on-site backups in the form of DVDs and RAID hard drive storage. Copies of the electronic data will be preserved off-site at Rice University s storage facility. Original laboratory notebooks will be secured by the PI in his campus office or laboratory. If requested, access to the data will be provided via contact with the PI. Data will, in principle, be available for access and sharing as soon as is reasonably possible, and not longer than two years after the acquisition of the data. The data will be preserved for at least three years beyond the award period, as required by NSF guidelines. 4. Policies for re-use, re-distribution, and production of derivatives We do not anticipate that there will be any significant intellectual property issues involved with the acquisition of the data. In the event that discoveries or inventions are made in direct connection with this data, access to the data will be granted upon request once appropriate invention disclosures and/or provisional patent filings are made. 5. Archiving of data The data acquired and preserved in the context of this proposal will be further governed by Rice University s policies pertaining to intellectual property, record retention, and data management Data Management Plan Example From UCSD The data associated with the research project will be systematically managed. The team has multiple backup servers to protect our research findings, and publicly available internet resources to share our results. All aspects of the research will be carefully tracked, stored, and published. The work detailed in the preceding proposal can be anticipated to produce three broad categories of data: computer software, physical devices, and optical characterization measurements. The computer software category includes not only the optimization subroutines that will be produced, but also the scripting written to control the laboratory equipment used in the characterization of the fabricated photonic circuitry. The physical devices category includes the unit cell, as well as the prototype optical circuitry generated during its development. The optical characterization category includes spectral measurements of the fabricated devices, as well as the associated metadata. The algorithm development progression will be logged through both handwritten research notebooks as well as digitally generated documents. To ensure the safety of the data, we will use the UCSD research group's existing server to periodically backup the materials. A Structured Query Language (SQL) database will be created to track the digital documents. The completed design toolbox will be made available by enacting the Release to Public policies of University of California, San Diego. We plan to package our algorithm in a MATLAB toolbox as well as a self-contained software complete with an user interface. All of the computer code produced during the project will be written using the latest version of MATLAB. Code will be developed using volume shadow copy technology, which will allow the recovery of prior iterations for quality control. The spectral measurement data will be written to MATLAB cell arrays that also include the measurement date and time, duration, and a unique identification number. Subroutines will be produced that allow the effortless extraction of this data both graphically, and in raw matrix form. The results of the research performed under this proposal will be disseminated primarily through publication in research journals and conference presentations. All of the computer software and optical characterization measurements will be available to interested parties upon request, and will be transmitted electronically via e-mail. All electronic data generated by proposal research will be redundantly archived. Locally, the laboratory has a secure server on which all information is stored. The server hard drives are set up in a RAID that is capable of full recovery even in the case of multiple simultaneous disk failure. Additionally, the server drives are backed up on an independent server operated by the electrical engineering department. This will allow full recovery of data in the event of catastrophic failure of the local laboratory server. Physical samples will be labeled and stored in a designated area of the measurement laboratory. All of these systems will be in place for the 3 year minimum proscribed by the guidelines.

DMP resources Data Management (or DMP) Research Guide http://guides.ucf.edu/data DMP Tool https://dmp.cdlib.org/ NSF Data Sharing Policy http://www.nsf.gov/bfa/dias/policy/dmp.jsp NSF Grant Proposal Guide (requirements) http://www.nsf.gov/pubs/policydocs/pappguide/nsf11001/gpg_2.jsp Databib http://databib.org/ Schedule a Consultation http://library.ucf.edu/reference/researchconsultations/default.php Meet with a librarian for an in-depth one on one consultation specific to your research topic

Questions? Lee Dotson, Digital Initiatives Librarian, Lee.Dotson@ucf.edu Penny Beile, Interim Head, Reference Services, pbeile@ucf.edu Selma Jaskowski, Assistant Director, Library Systems & Technology, selmaj@ucf.edu Rich Gause, Government Information Librarian, rich.gause@ucf.edu Athena Hoeppner, Electronic Services Librarian, athena@ucf.edu