Research Data Management. Jeff Moon, Data & Government Information Librarian; Academic Director, Queen's Research Data Centre




Research Data Management
Data management encompasses the entire data life cycle, including data collection, organization, storage, retrieval, and preservation over time.

The Data Life Cycle
Source: http://www.ucl.ac.uk/ich/research-ich/mrc-cech/images/data/life-cycle/research-data-life-cycle.jpg?hires

Why manage & share your data?
  Promotes higher quality research and peer review
  Opens the door to further research
  Demonstrates effective stewardship of publicly funded data
  Ensures long-term preservation of and access to data
  Builds in procedures to ensure anonymity/privacy

Other factors to consider
  Some scientific publications require that data be available to the scientific community before the article is published
  A growing number of funding agencies require that data be deposited in a public archive
  Future researchers can go to the archive for your data rather than trying to find you

OK, but where do I turn?
  There are many resources related to research data management
  Queen's University Library has pulled selected resources together into a guide

Research Data Management http://guides.library.queensu.ca/rdm

Topics covered
  Overview
  Writing a Data Management Plan
  Funding Agency Guidelines
  Metadata
  Data Repositories & Archives
  Citing Data
  Best Practices in Data Management
  Library as Data Partner
  QUL Research Data Archive

Data Management Workflow

The Library as Data Partner
  Writing a Data Management Plan
  Funding Agency Guidelines
  Metadata
  Best Practices in Data Management
  Citing Data
  QUL Research Data Archive
  Data Repositories & Archives

Key things the Library can help with
  Developing a Data Management Plan (DMP)
  Data assessment
  Finding appropriate data archive(s)
  Storing/backing up your data
  Choosing an appropriate metadata standard

Data Management Plans
Recent Tri-Council consultation paper, 16 October 2013
Toward a Policy Framework for Advancing Digital Scholarship in Canada
http://www.sshrc-crsh.gc.ca/news_room-salle_de_presse/latest_news-nouvelles_recentes/big_data_consultation-donnees_massives_consultation-eng.aspx

Recent Tri-Council consultation paper
Establishing a Culture of Stewardship
  a. a requirement that all grant applications include specific data management plans, including identified costs of data collection/analysis and preservation of results and associated datasets;
  b. definition of those specific elements of data plans that will be considered by reviewers in the assessment of funding applications;
  c. guidelines indicating which data must be preserved and in what formats;
  d. consolidated open access policies and guidelines (in concert with work already initiated by the TC3+);
  e. guidelines for researchers in selecting suitable data repositories;

Recent Tri-Council consultation paper, 16 October 2013
Coordination of Stakeholder Engagement
The Canadian research environment is characterized by a high degree of commitment to collaboration and cooperation, with individuals and organizations combining forces to address long-term planning and initiate bottom-up actions to shape the evolution of the digital landscape. These important actions, which bring a richness of community engagement, would benefit from development of a coordinating mechanism to ensure enhanced alignment to optimize returns on the investment of time by all parties. To help ensure the maximum contribution of key players, the TC3+ would work with other organizations and working groups to ensure ongoing consultation and coordination with all stakeholders, including the provinces, in the development of Canada's national digital infrastructure for research.

Recent Tri-Council consultation paper, 16 October 2013
Developing Capacity and Future Funding Parameters
To help create a forward-looking digital research environment, the parameters for the funding of coordinated national-scale infrastructure should be re-examined:
  a. best practices in administration, operations, policy and access;
  b. enhanced networks and infrastructure;
  c. skills development and graduate and researcher training.

Data Management Planning
Takeaway message: the Library can help with this!

Data Assessment
  Data Verification: check dataset integrity, variable names/labels/groups, coding, missing data, etc.
  Disclosure Risk Assessment: check for identifying variables, detailed geography (e.g. postal codes), etc.
  File Formats: assess file formats delivered, alternative formats, and conversion options
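The verification and disclosure-risk checks above can be sketched in a few lines of Python. This is a minimal illustration only, not the Library's actual procedure: the function name `assess`, the list of identifying-name hints, the missing-value codes, and the sample data are all assumptions made up for the example.

```python
import csv
import io
from collections import Counter

# Hypothetical name fragments that often signal identifying variables.
IDENTIFYING_HINTS = ("name", "postal", "address", "phone", "email", "birth")
# Hypothetical codes treated as missing; each study defines its own.
MISSING_CODES = {"", "NA", "-9", "999"}


def assess(csv_text):
    """Return simple verification and disclosure-risk findings for a CSV table."""
    reader = csv.DictReader(io.StringIO(csv_text))
    columns = list(reader.fieldnames or [])
    missing = Counter()
    rows = 0
    for row in reader:
        rows += 1
        for col in columns:
            # Short rows yield None for absent cells; treat those as empty.
            value = (row[col] or "").strip()
            if value in MISSING_CODES:
                missing[col] += 1
    return {
        "rows": rows,
        "columns": columns,
        # Repeated variable names are a common integrity problem.
        "duplicate_columns": [c for c, n in Counter(columns).items() if n > 1],
        "missing_counts": dict(missing),
        # Columns whose names suggest identifying detail, e.g. postal codes.
        "possible_identifiers": [
            c for c in columns if any(h in c.lower() for h in IDENTIFYING_HINTS)
        ],
    }


# Made-up sample data for illustration.
SAMPLE = (
    "resp_id,age,postal_code,income\n"
    "1,34,A1A 1A1,52000\n"
    "2,,B2B 2B2,-9\n"
    "3,51,C3C 3C3,61000\n"
)
report = assess(SAMPLE)
print(report["missing_counts"])
print(report["possible_identifiers"])
```

A name-based scan like this only flags candidates for review; a real disclosure risk assessment also looks at the values themselves and at combinations of variables.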

Finding the right data archive(s) for your data
  Discipline-based archives

QSpace


QSpace data file

Scholars Portal Dataverse

ODESI Data Portal

ODESI


How do YOU store or back up your data?
http://win-win-elec.com/product.asp?classid=13
http://www.forbes.com/sites/benkerschberg/2012/02/15/federal-court-orders-kpmg-to-preserve-2500-hard-drives/
www.google.com
http://www.colourbox.com/image/an-image-of-treasure-chest-on-white-background-image-3343371

Metadata
  Data about data
  Documentation in standardized and structured form
  Ideally computer-ready
  Explains your data: origin, purpose, time reference, geographic location, creator, access conditions and terms of use of a data collection
  You as the researcher are the best person to answer questions about your data!
http://www.dataenthusiast.com/wp-content/uploads/2011/10/meta_data_standard_transmission1.jpg
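To make "standardized, structured, and computer-ready" concrete, here is a minimal Python sketch of a metadata record built from the elements listed above. It does not implement any formal metadata standard; the class name `DatasetMetadata`, the field names, and the sample values are hypothetical and chosen only for illustration.

```python
import json
from dataclasses import asdict, dataclass, fields


@dataclass
class DatasetMetadata:
    """One structured record describing a data collection (illustrative only)."""
    title: str
    creator: str
    origin: str
    purpose: str
    time_reference: str
    geographic_location: str
    access_conditions: str
    terms_of_use: str


def missing_elements(record):
    """List the elements the researcher still needs to fill in."""
    return [f.name for f in fields(record) if not getattr(record, f.name).strip()]


# Made-up example values.
record = DatasetMetadata(
    title="Example household survey (hypothetical)",
    creator="A. Researcher",
    origin="Telephone interviews",
    purpose="Illustrate a structured metadata record",
    time_reference="2012",
    geographic_location="Ontario, Canada",
    access_conditions="Open",
    terms_of_use="",  # left blank on purpose to show the completeness check
)

# Serializing to JSON gives a computer-ready form that can travel with the data.
as_json = json.dumps(asdict(record), indent=2)
print(missing_elements(record))
```

In practice the field list would come from whichever metadata standard suits the discipline; the point of the sketch is only that a fixed set of named fields can be checked and exchanged by software.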

Metadata Standards

The Data Continuum
Statistics
  Tables, charts, graphs
  Open to the world
Microdata
Public-Use Microdata Files (PUMFs)
  Responses of individuals
  Anonymity of respondents is assured
  Generally free
  Under license from Statistics Canada (DLI)
Custom Tabulations
  Generated on request by Statistics Canada using unabridged Master files
  Potential for more detailed tables
  Tables vetted for anonymity
  Cost recovery from Statistics Canada ($$$)
Real-Time Remote Access (RTRA)
  Desktop analysis of unabridged Master files
  Potential for more detailed tables
  Automatic table vetting
  Statistics Canada charges an annual fee for access ($$$) (this service is under consideration)
Research Data Centre (RDC)
  Controlled access to unabridged Master files
  Potential for more detailed tables & analysis
  Tables vetted by Statistical Analyst for anonymity
  No charge to researchers
ODESI, Dataverse, QSpace, Queen's Nesstar, other archives
  A lot of research data fits in here, but other discipline-based archives exist

Examples of archived data http://guides.library.queensu.ca/content.php?pid=437556&sid=3637637

Some early examples
  Baldwin-Green Study - Canada-US Census of Industry, 1867-1940 (1994)
  Historical Canadian Macroeconomic Dataset, 1871-1994 (2001)

The Globalization of Personal Data Project
  Studied implications of surveillance and personal data in nine countries: Canada, U.S.A., France, Spain, Hungary, Mexico, Brazil, China, and Japan.
  Key element of success was a data manager on the project.
  QSpace, Dataverse, ODESI

Cranial Nonmetric Traits Database
  27 separate databases in Borland Paradox format on an older-vintage PC
  Used R to stack the files
  Reviewed, refined, & enriched coding scheme
  Reviewed, refined, & enriched documentation
  QSpace, Dataverse
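The project stacked its files in R; the slides do not show that code. As a rough illustration of what "stacking" separate tables involves, here is a Python sketch under stated assumptions: the tables have already been exported to CSV text, and the function name `stack_tables`, the `source_file` column, and the sample tables are all hypothetical.

```python
import csv
import io


def stack_tables(tables):
    """Stack CSV tables (file name -> CSV text) into one table.

    Columns are the union of all input columns, in first-seen order.
    Cells a table does not supply are left empty, and a source_file
    column records where each row came from.
    """
    columns = ["source_file"]
    rows = []
    for name, text in tables.items():
        for row in csv.DictReader(io.StringIO(text)):
            for col in row:
                if col not in columns:
                    columns.append(col)
            rows.append({**row, "source_file": name})
    # Give every row the full column set so the result is rectangular.
    return columns, [{c: r.get(c, "") for c in columns} for r in rows]


# Made-up sample tables with partly different columns.
tables = {
    "site_a.csv": "id,trait_1\n1,present\n",
    "site_b.csv": "id,trait_2\n7,absent\n",
}
columns, rows = stack_tables(tables)
print(columns)
```

Keeping a column that records the source file makes it possible to trace any stacked row back to the database it came from, which helps when reviewing the coding scheme afterwards.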

Ideally, however, we'd like to catch you early
http://www.actbucks.org.uk/images/research.jpg

The Data Life Cycle
Source: http://www.ucl.ac.uk/ich/research-ich/mrc-cech/images/data/life-cycle/research-data-life-cycle.jpg?hires

Penitentiary study
  Archived Data & Documentation
http://larshagberg.files.wordpress.com/2012/05/lth102.jpg
http://michele-norris.com/wp-content/uploads/2012/02/writing-pencil.jpg

Online shopping
  Archived Data & Documentation
http://scm-l3.technorati.com/12/12/13/73835/online-shopping(1).jpg
http://grantfundingexpert.org/wp-content/uploads/grantsecrets/june/grant%20funding%20expert%20chris%20johnson%20free%20or%20cheap%20legal%20aid%20for%20canadians.jpeg

Mental health
  Pilot data provided
  Archived Data & Documentation
http://www.livingroompharmacy.ca/assets/images/contentimages/mental_health15400988xsmall-1.jpg

Contact information
Jeff Moon
  Data and Government Information Librarian
  Academic Director, Queen's Research Data Centre
  moonj@queensu.ca
  Ext. 77992
Alex Cooper
  Data and Web Support Assistant
  coopera@queensu.ca
  Ext. 77481
Acknowledgement: Some content developed by Matthew Barabash, Library Intern, Fall 2012