Data Management at UT



Similar documents
Edinburgh Napier University. Research Data Management Policy

Best Practices for Research Data Management. October 30, 2014

Research Data Management Policy

Best Practices for Good Data Management. February 19, 2015

Data Management Plans & the DMPTool. IAP: January 26, 2016

Best Practices for Data Management. RMACC HPC Symposium, 8/13/2014

NSF Data Management Plan Template Duke University Libraries Data and GIS Services

Library Strategic Planning

A grant number provides unique identification for the grant.

Data Management Plan. Name of Contractor. Name of project. Project Duration Start date : End: DMP Version. Date Amended, if any

Data Management Best Practices for Landscape Conservation Cooperatives Part 1: LCC Funded Science

WHAT SHOULD NSF DATA MANAGEMENT PLANS LOOK LIKE

Management of Research Data Procedure

Virginia Commonwealth University Rice Rivers Center Data Management Plan

Research Data Management Guide

Research Data Management in Horizon 2020

CIP s Open Data & Data Management Guidelines and Procedures

Lesson 3: Data Management Planning

Open Access to publications and research data in Horizon 2020

Checklist for a Data Management Plan draft

Open Exeter Research Data Survey

The RDMSG : Data Management Planning and More

Checklist and guidance for a Data Management Plan

Image Data, RDA and Practical Policies

EUROPEAN COMMISSION Directorate-General for Research & Innovation. Guidelines on Data Management in Horizon 2020

4 NUMBER 004 Policy Data base Document Reference Number P

LJMU Research Data Policy: information and guidance

Johns Hopkins University Data Management Services

RESEARCH DATA MANAGEMENT POLICY

DIGITAL STEWARDSHIP SUPPLEMENTARY INFORMATION FORM

Managing and Sharing research Data

Open Access to scientific data. SwissCore Annual Event Brussels, 14 May 2014

NERC Biodiversity and Ecosystem Service Sustainability (BESS) Data Management Strategy

Functional Requirements for Digital Asset Management Project version /30/2006

EXECUTIVE AGENCY HORIZON 2020 PROGRAMME

POSITION DETAILS. Digitisation & Digital Services

OpenAIRE Research Data Management Briefing paper

Data management plan

College Archives Digital Preservation Policy. Created: October 2007 Last Updated: December 2012

Research Data Management PROJECT LIFECYCLE

H2020 Guidelines on Open Data and Data Management Plan

Research Data Management

In ediscovery and Litigation Support Repositories MPeterson, June 2009

Globus Research Data Management: Introduction and Service Overview

Bradford Scholars Digital Preservation Policy

LIBER Case Study: University of Oxford Research Data Management Infrastructure

Introduction to Research Data Management for Social Scientists

The Key Elements of Digital Asset Management

Horizon2020 Data Management Plans. Ma4 Harrison BGS

How To Manage Research Data At Columbia

4.10 Information Management Policy

Long Term Preservation of Earth Observation Space Data. Preservation Workflow

Data Management Brown-bag/Seminar March 12, 2014

Research Data Storage and the University of Bristol

Data Driven Discovery In the Social, Behavioral, and Economic Sciences

REBs & Data Management Plans: Conflict & Coexistence Susan Babcock and Chuck Humphrey, University of Alberta CAREB Conference, Vancouver,

Creating a Data Management Plan for your Research

Records Management and SharePoint 2013

How To Write A Blog Post On Globus

The Scientific Data Mining Process

Exploitation of ISS scientific data

Second EUDAT Conference, October 2013 Data Management Plans and Certification Motivation: increasing importance of Data Management Planning

Riverbed Whitewater/Amazon Glacier ROI for Backup and Archiving

Research Data Management Procedures

Transcription:

Data Management at UT Maria Esteva, TACC, maria@tacc.utexas.edu Colleen Lyon, UT Libraries, c.lyon@austin.utexas.edu Angela Newell, ITS, anewell@austin.utexas.edu

What is data management? systematic organization of data throughout the research lifecycle "[data curation] includes authentication, archiving, management, preservation, retrieval, and representation... these activities enable data discovery and retrieval, maintain data quality, add value, and provide for re-use over time."* *University of Illinois:http://www.lis.illinois.edu/academics/programs/ms/data_curation

Elements of a Data Management Plan 1. Description of the data 2. Metadata 3. Access, sharing and re-use 4. Licensing and confidentiality of data 5. Data storage and preservation 6. Resources needed $$

Data Types and Reproducibility Values Experimental data From labs and equipment (R C) Observational data (N) Captured in real time Derived data (R C) After data mining and statistical processing Simulation data (R C) Data generated from modeling processes Peer reviewed data (R C) Genome banks Software (R C) REPRODUCIBLE: Derives from simulations, reductions, measurements NON-REPRODUCIBLE: Cannot be reproduced or reconstructed COSTLY: Expensive to reproduce Assessment of the reproducibility value of your data in relation to the goals of your research during the early research stages will aid in scheduling your data and shaping your data management activities.

Data Describe the data that will be generated or existing data that will be used Volume File formats and structures Schedule the retention of your data Examples: Raw telemetry files: Satellite telemetry frames acquired by the Direct Broadcast Receiving Station (DBRS). This data has long-term retention to allow for full, end-to-end reprocessing. Raw uncompressed audio files from oral history interviews, 50 MGbytes: This data has long-term retention and will serve archival purposes. For purposes of analysis during the study process, copies of the raw files will be compressed to MPEG-4. The latter will be discarded upon finalizing the study.

Metadata Descriptive information that helps you and others discover and identify data Example 1 Example 2 Structural metadata gives description of how the components are organized Example: information about the database column descriptions, keys, indexes Administrative metadata gives information to help manage the source Example: file type, date of creation, information about machine that created data

Licensing & Confidentiality If you are doing human subjects research, make sure your DMP is compliant with IRB protocols You may also need to consider: Confidentiality agreements Working with copyrighted materials Previous licenses Citation and licensing your data

Sharing Who will have access to the data? When? How? Providing access to non-group members o Restrictions on sharing o Specify approved uses Protecting sensitive information From: http://www.trendmls.com/guest/news/showdoc.aspx?id=771 o This can determine which storage and management systems you can use and how to provide authorization

Storage & Archiving Where will data be stored during project? o Local versus remote o Backing up data o Costs Where will data live after the project ends? o Public repository o Personal/lab/university website o On journal s website

https://dmp.cdlib.org/ Online templates to guide you in creating your DMP Developed by a team of universities and organizations Sign in with your EID Templates for funding agencies and directorates within NSF Save, cut/paste, print

Data Management at UT http:lib.utexas.edu/datamanagement A central location for information to access all data management resources on campus TACC resources ITS resources UT Libraries resources Other campus resources Links to subject specific repositories DMPTool - an online DMP creation tool Complementary services From: http://attractions.uptake.com/blog/ university-texas-tower-austin-texas-1891.html

Quick Links Data management plan help: https://dmp.cdlib.org/ Storage options on campus: http://www.utexas.edu/its/ (GB range) https://www.tacc.utexas.edu/ (TB range) Repository options: http://repositories.lib.utexas.edu/ http://www.re3data.org/ (list of subject specific repositories) Not sure where to start: datamanagement@lib.utexas.edu