Examples of Data Management Plans

Similar documents
Lesson 3: Data Management Planning

Data archiving and reproducible research for ecology and evolu6on. March 23 rd 2010 Ian Dworkin

Best Prac*ces for Data Management When Using Instrumenta*on

EXPERIENCE WITH SERVICE OBSERVING

Data organiza+on, data modeling, and databases

Use of Tablet Computers to Implement the Local Governance Performance Index (LGPI) in Tunisia

1. Base Programming. GIORGIO RUSSOLILLO - Cours de prépara+on à la cer+fica+on SAS «Base Programming»

Language Resources, Language Technology, Text Mining, the Seman8c Web: How interoperability of machines can help humans in the mul8lingual web

An Open Dynamic Big Data Driven Applica3on System Toolkit

Privacy- Preserving P2P Data Sharing with OneSwarm. Presented by. Adnan Malik

DEEP FILM ACCESS Project (Digital Transforma4ons in the Arts and Humani4es: Big Data) February 2014 April 2015

Aboriginal Data at Statistics Canada Tim Leonard Social and Aboriginal Statistics Division, Statistics Canada

735 State St

UNIFIED, END- TO- END EDISCOVERY

Open-Source Based Solutions for Processing, Preserving, and Presenting Oral Histories

Python for Data Analysis and Visualiza4on. Fang (Cherry) Liu, Ph.D PACE Gatech July 2013

What is Assessment? Assessment is a process of collec3ng data for the purpose of making decisions about individuals and groups

Kaseya Fundamentals Workshop DAY THREE. Developed by Kaseya University. Powered by IT Scholars

REDCap Longitudinal Studies & Surveys

Confessions of a new (agile) software project manager. Laura Akerman

Discovering Computers Fundamentals, 2010 Edition. Living in a Digital World

Managing Social Media as Official Records

Reports Analyzing Your Performance

Use of Sales Tax Information for Economic Forecasting Mike Erwin

Historically, Film Archives primarily preserved narra>ve feature films

Better Transnational Access and Data Sharing to Solve Common Questions

HIPAA Privacy Policy (Revised Feb. 4, 2015)

A Brief Overview of the Mobile App Ecosystem. September 13, 2012

TIME SERIES ANALYSIS OF COMPOSITIONAL DATA USING A DYNAMIC LINEAR MODEL APPROACH

Expanding Assessment of Analy3cal Skills among Biology Majors: From Introductory labs to Upper Division Elec3ves

Scientific Computing at NCEAS

Mission. To provide higher technological educa5on with quality, preparing. competent professionals, with sound founda5ons in science, technology

Payments to Foreign Individuals and Entities: Complying with IRS Requirements for Services, Rents, Royalties, and Interest

DTCC Data Quality Survey Industry Report

The LCC Network Integrated Data Management Network GREAT NORTHERN LCC STEERING COMMITTEE MEETING MORAN, WY 25 SEPTEMBER 2011

Website Design. A Crash Course. Monique Sherre, monique@boxcarmarke4ng.com

How To Perform a SaaS Applica7on Inventory in. 5Simple Steps. A Guide for Informa7on Security Professionals. Share this ebook

Copernicus Space Component Ground Segment and Opera4ons Concept

CONTAM 02 Observa4ons in Rivers and Urban Streams: San Joaquin River Mixing Dynamics and Mass Balances

Seven Steps to Client Success Understanding the Flow of ediscovery

Establishing Effec/ve Data Governance

The Great Lakes Futures Project Assessing what the state of the Great Lakes St. Lawrence River Basin could be in 50 years.

TRANSLATING TECHNOLOGY INTO BUSINESS. Let s make money from Big Data!

DEFINING COMPONENTS OF NATIONAL REDD+ FINANCIAL PLANNING

Globus Research Data Management: Introduction and Service Overview. Steve Tuecke Vas Vasiliadis

Priority Ac*on Report. Anthropology Crime Scene/Death Investigation Thomas D. Holland, Ph.D. February 13, 2015

Human Genome Sequencing Project: What Did We Learn? How to Analyze Your Own Genome Fall 2013

Introduc)on to the IoT- A methodology

Prac%cal Informa%cs Course 2009 Intersystem Communica%on and Computer Interfaces. Jeffrey Fine MD Magee Womens Hospital of UPMC

Offensive & Defensive & Forensic Techniques for Determining Web User Iden<ty

Organic Exchange 100 Standard 2009, Version 1.3

UAB Cyber Security Ini1a1ve

9. PROGRAM: TEACHING AND RESEARCH. Teaching and research are the principal activities on reserves and the basis for the

Chapter 3. Database Architectures and the Web Transparencies

REST (Representa.onal State Transfer) Ingegneria del So-ware e Lab. Università di Modena e Reggio Emilia Do<. Marzio Franzini

GEOGRAPHIC INFORMATION GATEWAY New York Department of State (NYDOS) data acceptance & metadata standards

Application of Supply Chain Concepts to the Analysis Process

REACCH PNA Data Management Plan

Data Governance Framework: Bank of Canada

oit Manage Digital Images with Picasa March 2010

The author(s) shown below used Federal funds provided by the U.S. Department of Justice and prepared the following final report:

The Big Integra-on Simula'on Pla,orms for Low Carbon Decision Making

The FDA s Mini- Sen*nel Program and the Learning Health System

PowerPoint Presentation to Accompany. Chapter 3. File Management. Copyright 2014 Pearson Educa=on, Inc. Publishing as Pren=ce Hall

LJMU Research Data Policy: information and guidance

Big Data in medical image processing

The Data Reservoir. 10 th September Mandy Chessell FREng CEng FBCS Dis4nguished Engineer, Master Inventor Chief Architect, Informa4on Solu4ons

Scalus A)ribute Workshop. Paris, April 14th 15th

IT Change Management Process Training

Distributed Solar: Joyce McLaren. Illinois Solar Energy Conference. July 16, Issues and Op4ons for U4li4es and Decision Makers

Boulder Creek Critical Zone Observatory Data Management Plan

Na#onal Asbestos Forum 2013: Advance in Medical Research on Asbestos- Related Diseases

Data Science And Big Data Analytics Course

Science Gateways What are they and why are they having such a tremendous impact on science? Nancy Wilkins- Diehr wilkinsn@sdsc.edu

Retaining globally distributed high availability Art van Scheppingen Head of Database Engineering

Annual Reviews: A Nonprofit Scien.fic Publisher. Bringing the Best Review Literature to the Worldwide Scien9fic Community for over 75 Years

Big Data Research at DKRZ

Transcribing documentaries. Can respeaking be used efficiently?

Effects of visualization mode and context on comprehension and decision making in litigation law

Protec'ng Informa'on Assets - Week 8 - Business Continuity and Disaster Recovery Planning. MIS 5206 Protec/ng Informa/on Assets Greg Senko

Put the Magic in Your Marke4ng

HOW TO CREATE APPS FOR TRAINING. A step- by- step guide to crea2ng a great training app for your company

Monitoreo GNSS. Uso prác)co de sistemas GNSS Aplicaciones en el sector de la aviación

What Do Our Data Tell Us: Two Reports Examining Correla;ons in Utah Data

NFPA 70E: What Contractors and Employers Need to Know

Data Backup and Recovery. Heng Sovannarith

Why do we do what we do?

Retaining and Preserving the Scholarly Record: An Update on the Eastern Academic Scholars Trust

Council Monitoring & Assessment Program Development

Virginia Commonwealth University Rice Rivers Center Data Management Plan

ECBDL 14: Evolu/onary Computa/on for Big Data and Big Learning Workshop July 13 th, 2014 Big Data Compe//on

Introduction to LTER Data. Management. LTER Information. Management. Training Materials

FYI - UT Drupal Kit 2.0. December 9, 2015

Greenhouse Produc-on Apps and Smartphone- Ready Business Tips Part I. Amy Fulcher University of Tennessee February 13, 2014

desert conservation program Data Management Guidelines

Welcome! Accelera'ng Pa'ent- Centered Outcomes Research and Methodological Research. Andrea Heckert, PhD, MPH Program Officer, Science

Information Management. Corinna Gries

Securing Business Informa9on in the Cloud

MSc Data Science at the University of Sheffield. Started in September 2014

Transcription:

Stephanie Hampton Na.onal Center for Ecological Analysis and Synthesis University of California, Santa Barbara Credits: Taun.ngpanda, Anita363, Stonebird, NeilsPhotography, Rick Smit, Jschinker Examples of Data Management Plans

Background Project name: Effects of temperature and salinity on popula.on growth of the estuarine copepod, Eurytemora affinis Descrip5on of project aims and purpose: We will rear popula.ons of E. affinis in the laboratory at three temperatures and three salini.es (9 treatments total). We will document the popula.on from hatching to death, no.ng the propor.on of individuals in each stage over.me. The data collected will be used to parameterize popula.on models of E. affinis. We will build a model of popula.on growth as a func.on of temperature and salinity. This will be useful for studies of invasive copepod popula.ons in the Northeast Pacific.

nih.gov Experimental lab data 1. Informa1on about data Every two days, we will subsample E. affinis popula.ons growing at our treatment condi.ons. We will use a microscope to iden.fy the stage and sex of the subsampled individuals. We will document the informa.on first in a laboratory notebook, then copy the data into an Excel spreadsheet. For quality control, values will be entered separately by two different people to ensure accuracy. The Excel spreadsheet will be saved as a comma- separated value (.csv) file daily and backed up to a server. AZer all data are collected, the Excel spreadsheet will be saved as a.csv file and imported into the program R for sta.s.cal analysis. Strasser will be responsible for all data management during and azer data collec.on. ESA 2 011: Data Management Plans Example from Carly Strasser

1. Informa1on about data Wired.com Our short- term data storage plan, which will be used during the experiment, will be to save copies of 1) the.txt metadata file and 2) the Excel spreadsheet as.csv files to an external drive, and to take the external drive off site nightly. We will use the Subversion version control system to update our data and metadata files daily on the University of Alberta Mathema.cs Department server. We will also have the laboratory notebook as a hard copy backup.

2. Metadata format and content We will first document our metadata by taking careful notes in the laboratory notebook that refer to specific data files and describe all columns, units, abbrevia.ons, and missing value iden.fiers. These notes will be transcribed into a.txt document that will be stored with the data file. AZer all of the data are collected, we will then use EML (Ecological Metadata Language) to digi.ze our metadata. EML is on of the accepted formats used in Ecology, and works well for the type of data we will be producing. We will create these metadata using Morpho sozware, available through the Knowledge Network for Biocomplexity (KNB). The documenta.on and metadata will describe the data files and the context of the measurements.

3. Policies for access, sharing, and reuse We are required to share our data with the CAISN network, azer all data have been collected and metadata have been generated. This should be no more than 6 months azer the experiments are completed. Interested par.es must contact the CAISN data manager (data@caisn.ca) or the authors and explain their intended use. Data requests will be approved by authors azer review of the proposed use.

3. Policies for access, sharing, and reuse The authors will retain rights to the data un.l the resul.ng publica.on is produced, within two years of data produc.on. AZer publica.on (or azer two years, whichever is first), the authors will open data to public use. AZer publica.on, we will submit our data to the KNB allowing discovery and use by the wider scien.fic community. Interested par.es will be able to download the data directly from KNB without contac.ng the authors, but will s.ll be required to give credit to the authors for the data used by ci.ng a KNB accession number either in the publica.on s text or in the references list.

4. Long- term storage and data management (archiving) The data set will be submi`ed to KNB for long- term preserva.on and storage. The authors will submit metadata in EML format along with the data to facilitate its reuse. Strasser will be responsible for upda.ng metadata and data author contact informa.on in the KNB.

5. Budget We will use a tablet computer for data collec.on, which will cost approximately $500. We an.cipate that data documenta.on and prepara.on for reuse and storage will require approximately one month of salary for two technicians. These technicians will be responsible for data entry, quality control and assurance, and metadata genera.on. These costs are included in the budget in Lines 10-15.

Image data Background This project generates.me- and loca.on- stamped image files of natural resources in Delaware County, PA. The images serve as a record of the occurrence of creatures, natural ar.facts, and condi.ons at specific places and.mes during the period 2003 through 2011. For many of the photos taxonomic informa.on is also available. The occurrence data are observa.onal and qualita.ve, and in addi.on to documen.ng the presence of various taxa they will be used for posters, books, and a web site promo.ng the apprecia.on of the natural diversity of the County and the preserva.on of its remaining natural areas. ESA 2011: Data Management Plans Example from Carly Strasser

Image data 1. Informa1on about data Data will be captured with a digital camera capable of crea.ng images with sufficient taxonomic detail to allow iden.fica.on to the species level. Images are stored as JPG files with embedded EXIF and IPTC informa.on describing the exposure, camera type, lens, and metering mode, along with photographer- generated cataloging tags that may include taxonomic iden.ty of the organism(s) in the photograph. Images will be stored in a date- hierarchical file structure (year, date) on redundant disk drives and managed using the commercial imatch sozware. Images will be maintained with unique file names. Quality flags are used to differen.ate images of varying quality. J. Smith will be responsible for the data management during the course of the project.

2. Metadata format and content Metadata about.ming and exposure of individual images is automa.cally generated by the camera at the.me the photo is taken. GPS loca.ons are subsequently added by post- processing GPS track data based on shared.me stamps. GPS data stored in image files depic.ng rare or locally sensi.ve species will be obfuscated in the file metadata but can be made available for appropriate, approved uses. Metadata for the image dataset as a whole will be generated by the image management sozware (imatch) and will include.me ranges, loca.ons, and a taxon list. Those metadata will be translated into Ecological Metadata Language (EML), created using the Morpho sozware tool, and will include loca.on and taxonomic summaries.

3. Policies for access, sharing, and reuse The image collec.on will be made available beginning in 2015. They will be available as digital photographs viewable on the web in a restricted form that prevents downloading. Summaries of temporal and spa.al distribu.ons of taxonomic groups will be also be made available on Smith s website for use without permission, but users will need permission to access the original high resolu.on photographs from which these distribu.on data were derived. Smith will retain copyright and the originals will be licensed using a Crea.ve Commons license (Crea.ve Commons A`ribu.on- NonCommercial- NoDerivs 3.0 Unported License). Data can be cited by referring to the image website.

4. Long- term storage and data management (archiving) Long term storage will involve expor.ng the metadata stored within the JPG files (loca.on, EXIF, and tag informa.on) into text files to facilitate access to that metadata and as a safeguard should the specifics of JPG metadata structure change. The image files will be stored in a single zip file containing the date- hierarchical file structure. As noted above, EML metadata will be created and the resul.ng dataset will be submi`ed to the Knowledge Network for Biocomplexity (KNB) archive.