BIG DATA: DATA EVERYWHERE

Size: px
Start display at page:

Download "BIG DATA: DATA EVERYWHERE"

Transcription

1 Line Pouchard, PhD Purdue Libraries, Research Data 03/10/2015 BIG DATA INTEREST GROUP Issues in Big Data Cura/on

2 BIG DATA: DATA EVERYWHERE

3 DEFINITIONS OF DATA CURATION Data curation is a term used to indicate management activities required to maintain research data long-term such that it is available for reuse and preservation (Wikipedia) The active and ongoing management of data through its life cycle of interest and usefulness to scholarship, science, and education. Data curation activities enable data discovery and retrieval, maintain its quality, add value, and provide for reuse over time, and this new field includes authentication, archiving, management, preservation, retrieval, and representation (GSLIS)

4 BIG DATA LIFECYCLE Assure

5 ISSUES IN BIG DATA CURATION Storage Data preparation & clean up Quality Discoverability Selection for preservation Privacy and ethics Reproducibility

6 QUESTIONS INFORMING CURATION ACTIVITIES Plan Acquire Prepare Volume What is an es1mate of volume & growth rate? Variety Are the data sensi1ve? What provisions are made to accommodate sensi1ve data? What is the most suited storage (databases, NoSQL, cloud)? What are the data formats and steps needed to integrate them? How do we prepare datasets for analysis? (remove blanks, duplicates, splifng columns, adding/removing headers)? What transforma1ons are needed to aggregate data? Do we need to create a pipeline? Velocity Is bandwidth sufficient to accommodate input rates? Veracity What are the data sources? What allows us to trust them? Will datasets be aggregated into series? Will metadata apply to individual datasets or to series? Who collects the data? Do they have the tools and skills to ensure con1nuity? What type of naming format is needed to keep track of incoming and derived datasets? Are the wrangling steps sufficiently documented to foster trust in the analysis?

7 QUESTIONS INFORMING CURATION ACTIVITIES Analyse Preserve Discover Volume Are adequate compute power and analysis methods available? Should raw data be preserved? What storage space is needed in the long- term? What part of the data (derived, raw, somware code) will be made accessible to searches? Variety Are the various analy1cal methods compa1ble with the different datasets? Velocity At what 1me point does the analy1cal feedback need to inform decisions? Are there different legal considera1ons for each data source? Are there conflicts with privacy and confiden1ality? When does data become obsolete? What search methods best suit this data keyword- based, geo- spa1al searches, metadata- based, seman1c searches? What degree of search latency is tolerable? Veracity What kind of access to scripts, somware, and procedures is needed to ensure transparency and reproducibility? What are the trade- offs if only derived products and no raw data are preserved? Providing well- documented data in open access allows scru1ny. How is veracity supported with sensi1ve and private data?

8 COLLABORATIONS Collaborations on multi-disciplinary proposals and projects Levels of collaboration Developing customized Data Management Plans Organizing your data Describing your data Sharing your data Publishing your datasets Preserving your data Education on best practices

9 A BIG DATA PROJECT AT PURDUE Dr. Yung- Hsiang Lu, PI

10 CURATION ISSUES IN CAM2 PROJECT Data access and re-use - policies of video streams and CCTV - Sparse legal framework except UK - Few policies available Data ownership Data storage Data organization - naming scheme - metadata Protect metadata storage where the intellectual property lies Data information literacy skills for Big Data

11 CONCLUSION Big Data looks very different than small data in maintenance and storage Curation primarily focuses on different areas than small data Planning from the beginning is crucial: without planning, curation will fall short Collaborations are more important Facilitating access is where efforts need to focus, not storing the data.

12 THE END

13 A REPRESENTATION OF THE FOUR Vs hup:// veracity

14 SOLUTION WHAT DID WE GET? Approximately 2.25 PB of IBM GPFS Hardware provided by a pair of Data Direct Networks SFA12k arrays, one in each of MATH and FREH datacenters 160 Gb/sec to each datacenter 5x Dell R620 servers in each datacenter

In 2014, the Research Data group @ Purdue University

In 2014, the Research Data group @ Purdue University EDITOR S SUMMARY At the 2015 ASIS&T Research Data Access and Preservation (RDAP) Summit, panelists from Research Data @ Purdue University Libraries discussed the organizational structure intended to promote

More information

RESEARCH DATA MANAGEMENT POLICY

RESEARCH DATA MANAGEMENT POLICY Document Title Version 1.1 Document Review Date March 2016 Document Owner Revision Timetable / Process RESEARCH DATA MANAGEMENT POLICY RESEARCH DATA MANAGEMENT POLICY Director of the Research Office Regular

More information

Data Management at UT

Data Management at UT Data Management at UT Maria Esteva, TACC, maria@tacc.utexas.edu Colleen Lyon, UT Libraries, c.lyon@austin.utexas.edu Angela Newell, ITS, anewell@austin.utexas.edu What is data management? systematic organization

More information

Research Data Management Policy

Research Data Management Policy Research Data Management Policy Version Number: 1.0 Effective from 06 January 2016 Author: Research Data Manager The Library Document Control Information Status and reason for development New as no previous

More information

SURFsara Data Services

SURFsara Data Services SURFsara Data Services SUPPORTING DATA-INTENSIVE SCIENCES Mark van de Sanden The world of the many Many different users (well organised (international) user communities, research groups, universities,

More information

Data Management Plan. Name of Contractor. Name of project. Project Duration Start date : End: DMP Version. Date Amended, if any

Data Management Plan. Name of Contractor. Name of project. Project Duration Start date : End: DMP Version. Date Amended, if any Data Management Plan Name of Contractor Name of project Project Duration Start date : End: DMP Version Date Amended, if any Name of all authors, and ORCID number for each author WYDOT Project Number Any

More information

LJMU Research Data Policy: information and guidance

LJMU Research Data Policy: information and guidance LJMU Research Data Policy: information and guidance Prof. Director of Research April 2013 Aims This document outlines the University policy and provides advice on the treatment, storage and sharing of

More information

The BEAR Management Group will report to the University Research Committee.

The BEAR Management Group will report to the University Research Committee. Research Data Storage Policy Introduction The University has committed to provide mechanisms and services for storage, backup, registration, deposit and retention of research data assets in support of

More information

Best Practices for Data Management. RMACC HPC Symposium, 8/13/2014

Best Practices for Data Management. RMACC HPC Symposium, 8/13/2014 Best Practices for Data Management RMACC HPC Symposium, 8/13/2014 Presenters Andrew Johnson Research Data Librarian CU-Boulder Libraries Shelley Knuth Research Data Specialist CU-Boulder Research Computing

More information

Web Archiving and Scholarly Use of Web Archives

Web Archiving and Scholarly Use of Web Archives Web Archiving and Scholarly Use of Web Archives Helen Hockx-Yu Head of Web Archiving British Library 15 April 2013 Overview 1. Introduction 2. Access and usage: UK Web Archive 3. Scholarly feedback on

More information

Research Data Management PROJECT LIFECYCLE

Research Data Management PROJECT LIFECYCLE PROJECT LIFECYCLE Introduction and context Basic Project Info. Thesis Title UH or Research Council? Duration Related Policies UH and STFC policies: open after publication as your research is public funded

More information

Image Data, RDA and Practical Policies

Image Data, RDA and Practical Policies Image Data, RDA and Practical Policies Rainer Stotzka and many others KIT University of the State of Baden-Württemberg and National Laboratory of the Helmholtz Association www.kit.edu Data Life Cycle Lab

More information

SharePoint Document and Data Control

SharePoint Document and Data Control SharePoint Document and Data Control This article is concerned with how the management of documents and data in a site related to the delivery of SharePoint, thus allowing the control, storage and management

More information

Archiving and Sharing Big Data Digital Repositories, Libraries, Cloud Storage

Archiving and Sharing Big Data Digital Repositories, Libraries, Cloud Storage Archiving and Sharing Big Data Digital Repositories, Libraries, Cloud Storage Cyrus Shahabi, Ph.D. Professor of Computer Science & Electrical Engineering Director, Integrated Media Systems Center (IMSC)

More information

Globus Research Data Management: Introduction and Service Overview

Globus Research Data Management: Introduction and Service Overview Globus Research Data Management: Introduction and Service Overview Kyle Chard chard@uchicago.edu Ben Blaiszik blaiszik@uchicago.edu Thank you to our sponsors! U. S. D E P A R T M E N T OF ENERGY 2 Agenda

More information

A grant number provides unique identification for the grant.

A grant number provides unique identification for the grant. Data Management Plan template Name of student/researcher(s) Name of group/project Description of your research Briefly summarise the type of your research to help others understand the purposes for which

More information

CIP s Open Data & Data Management Guidelines and Procedures

CIP s Open Data & Data Management Guidelines and Procedures CIP s Open Data & Data Management Guidelines and Procedures 1.1 Scope The CIP Data Management Guidelines and Procedures aim to provide guidance and support throughout the Data Management Cycle to facilitate

More information

Horizon2020 Data Management Plans. Ma4 Harrison BGS

Horizon2020 Data Management Plans. Ma4 Harrison BGS Horizon2020 Data Management Plans Ma4 Harrison BGS Data Management plan What is a Data Management Plan? A data management plan (DMP) describes what data that will be created, the standards used to describe

More information

Overview of state of art in Data management. Stefano Cozzini CNR/IOM and exact lab srl

Overview of state of art in Data management. Stefano Cozzini CNR/IOM and exact lab srl Overview of state of art in Data management Stefano Cozzini CNR/IOM and exact lab srl AIM of this short talk Frame the problem and the discussion around DATA: What are big data? Which kind of challenges

More information

Research Data Management Policy. Glasgow School of Art

Research Data Management Policy. Glasgow School of Art Research Data Management Policy Glasgow School of Art Version 1.4 Last revision April 2013 Responsibility Research Information Manager Department Learning Resources Relevant legislation Data Protection

More information

How To Write A Blog Post On Globus

How To Write A Blog Post On Globus Globus Software as a Service data publication and discovery Kyle Chard, University of Chicago Computation Institute, chard@uchicago.edu Jim Pruyne, University of Chicago Computation Institute, pruyne@uchicago.edu

More information

ALA s Core Competences of Librarianship

ALA s Core Competences of Librarianship ALA s Core Competences of Librarianship Final version Approved by the ALA Executive Board, October 25 th 2008 Approved and adopted as policy by the ALA Council, January 27 th 2009 This document defines

More information

Report of the DTL focus meeting on Life Science Data Repositories

Report of the DTL focus meeting on Life Science Data Repositories Report of the DTL focus meeting on Life Science Data Repositories Goal The goal of the meeting was to inform and discuss research data repositories for life sciences. The big data era adds to the complexity

More information

www.basho.com Technical Overview Simple, Scalable, Object Storage Software

www.basho.com Technical Overview Simple, Scalable, Object Storage Software www.basho.com Technical Overview Simple, Scalable, Object Storage Software Table of Contents Table of Contents... 1 Introduction & Overview... 1 Architecture... 2 How it Works... 2 APIs and Interfaces...

More information

Global Scientific Data Infrastructures: The Big Data Challenges. Capri, 12 13 May, 2011

Global Scientific Data Infrastructures: The Big Data Challenges. Capri, 12 13 May, 2011 Global Scientific Data Infrastructures: The Big Data Challenges Capri, 12 13 May, 2011 Data-Intensive Science Science is, currently, facing from a hundred to a thousand-fold increase in volumes of data

More information

Data Management Planning

Data Management Planning DIY Research Data Management Training Kit for Librarians Data Management Planning Kerry Miller Digital Curation Centre University of Edinburgh Kerry.miller@ed.ac.uk Running Order I. What is Research Data

More information

Big Data Analytics Platform @ Nokia

Big Data Analytics Platform @ Nokia Big Data Analytics Platform @ Nokia 1 Selecting the Right Tool for the Right Workload Yekesa Kosuru Nokia Location & Commerce Strata + Hadoop World NY - Oct 25, 2012 Agenda Big Data Analytics Platform

More information

Capitalizing on Big Data

Capitalizing on Big Data Capitalizing on Big Data CARL s response to the consultation document Capitalizing on Big Data: Toward a Policy Framework for Advancing Digital Scholarship in Canada December 12 th 2013 Who we are The

More information

White Paper Big Data Without Big Headaches

White Paper Big Data Without Big Headaches Vormetric, Inc. 2545 N. 1st Street, San Jose, CA 95131 United States: 888.267.3732 United Kingdom: +44.118.949.7711 Singapore: +65.6829.2266 info@vormetric.com www.vormetric.com THE NEW WORLD OF DATA IS

More information

Management of Research Data Procedure

Management of Research Data Procedure Management of Research Data Procedure Related Policy Management of Research Data Policy Responsible Officer Deputy Vice Chancellor (Research) Approved by Deputy Vice Chancellor (Research) Approved and

More information

INTRODUCTORY NOTE TO THE G20 ANTI-CORRUPTION OPEN DATA PRINCIPLES

INTRODUCTORY NOTE TO THE G20 ANTI-CORRUPTION OPEN DATA PRINCIPLES INTRODUCTORY NOTE TO THE G20 ANTI-CORRUPTION OPEN DATA PRINCIPLES Open Data in the G20 In 2014, the G20 s Anti-corruption Working Group (ACWG) established open data as one of the issues that merit particular

More information

Data Management Plans - How to Treat Digital Sources

Data Management Plans - How to Treat Digital Sources 1 Data Management Plans - How to Treat Digital Sources The imminent future for repositories and their management Paolo Budroni Library and Archive Services, University of Vienna Tomasz Miksa Secure Business

More information

Data sharing and Big Data in the physical sciences. 2 October 2015

Data sharing and Big Data in the physical sciences. 2 October 2015 Data sharing and Big Data in the physical sciences 2 October 2015 Content Digital curation: Data and metadata Why consider the physical sciences? Astronomy: Video Physics: LHC for example. Video The Research

More information

Creating a Data Management Plan for your Research

Creating a Data Management Plan for your Research Creating a Data Management Plan for your Research EPFL Workshop Lausaunne, 28 Oct 2014 Robin Rice, Laine Ruus EDINA and Data Library Course content What is a Data Management Plan? Benefits and drivers

More information

PhD in Information Studies Goals

PhD in Information Studies Goals PhD in Information Studies Goals The goals of the PhD Program in Information Studies are to produce highly qualified graduates for careers in research, teaching, and leadership in the field; to contribute

More information

AppSymphony White Paper

AppSymphony White Paper AppSymphony White Paper Secure Self-Service Analytics for Curated Digital Collections Introduction Optensity, Inc. offers a self-service analytic app composition platform, AppSymphony, which enables data

More information

Enhanced Research Data Management and Publication with Globus

Enhanced Research Data Management and Publication with Globus Enhanced Research Data Management and Publication with Globus Vas Vasiliadis Jim Pruyne Presented at OR2015 June 8, 2015 Presentations and other useful information available at globus.org/events/or2015/tutorial

More information

ESRC Research Data Policy

ESRC Research Data Policy ESRC Research Data Policy Introduction... 2 Definitions... 2 ESRC Research Data Policy Principles... 3 Principle 1... 3 Principle 2... 3 Principle 3... 3 Principle 4... 3 Principle 5... 3 Principle 6...

More information

Scholarly Use of Web Archives

Scholarly Use of Web Archives Scholarly Use of Web Archives Helen Hockx-Yu Head of Web Archiving British Library 15 February 2013 Web Archiving initiatives worldwide http://en.wikipedia.org/wiki/file:map_of_web_archiving_initiatives_worldwide.png

More information

High Performance Compu2ng and Big Data. High Performance compu2ng Curriculum UvA- SARA h>p://www.hpc.uva.nl/

High Performance Compu2ng and Big Data. High Performance compu2ng Curriculum UvA- SARA h>p://www.hpc.uva.nl/ High Performance Compu2ng and Big Data High Performance compu2ng Curriculum UvA- SARA h>p://www.hpc.uva.nl/ Big data was big news in 2012 and probably in 2013 too. The Harvard Business Review talks about

More information

IBM Solution Framework for Lifecycle Management of Research Data. 2008 IBM Corporation

IBM Solution Framework for Lifecycle Management of Research Data. 2008 IBM Corporation IBM Solution Framework for Lifecycle Management of Research Data Aspects of Lifecycle Management Research Utilization of research paper Usage history Metadata enrichment Usage Pattern / Citation Collaboration

More information

Research Data Management

Research Data Management Research Data Management 1 Why to we need to Manage Data? 2 Data Management Planning Typically covers: - What data will be created (format, types) and how? - How will the data be documented and described?

More information

Searching biomedical data sets. Hua Xu, PhD The University of Texas Health Science Center at Houston

Searching biomedical data sets. Hua Xu, PhD The University of Texas Health Science Center at Houston Searching biomedical data sets Hua Xu, PhD The University of Texas Health Science Center at Houston Motivations for biomedical data re-use Improve reproducibility Minimize duplicated efforts on creating

More information

Checklist and guidance for a Data Management Plan

Checklist and guidance for a Data Management Plan Checklist and guidance for a Data Management Plan Please cite as: DMPTuuli-project. (2016). Checklist and guidance for a Data Management Plan. v.1.0. Available online: https://wiki.helsinki.fi/x/dzeacw

More information

EPSRC Research Data Management Compliance Report

EPSRC Research Data Management Compliance Report EPSRC Research Data Management Compliance Report Contents Introduction... 2 Approval Process... 2 Review Schedule... 2 Acknowledgement... 2 EPSRC Expectations... 3 1. Awareness of EPSRC principles and

More information

More Than A Buzzword: Big Data in the Environmental Arena

More Than A Buzzword: Big Data in the Environmental Arena More Than A Buzzword: Big Data in the Environmental Arena 2015 Na>onal Environmental Monitoring Conference July 15, 2015 Brooke Roecker Senior Environmental Data Analyst Mark Packard, PG, CPG President/CEO

More information

YORK REGION DISTRICT SCHOOL BOARD. Policy and Procedure #160.0 Records and Information Management

YORK REGION DISTRICT SCHOOL BOARD. Policy and Procedure #160.0 Records and Information Management YORK REGION DISTRICT SCHOOL BOARD Policy and Procedure #160.0 Records and Information Management Policy and Procedure #160.0 Records and Information Management outline the process for ensuring information

More information

Research Data Management Guide

Research Data Management Guide Research Data Management Guide Research Data Management at Imperial WHAT IS RESEARCH DATA MANAGEMENT (RDM)? Research data management is the planning, organisation and preservation of the evidence that

More information

HPSS Best Practices. Erich Thanhardt Bill Anderson Marc Genty B

HPSS Best Practices. Erich Thanhardt Bill Anderson Marc Genty B HPSS Best Practices Erich Thanhardt Bill Anderson Marc Genty B Overview Idea is to Look Under the Hood of HPSS to help you better understand Best Practices Expose you to concepts, architecture, and tape

More information

A Novel Cloud Based Elastic Framework for Big Data Preprocessing

A Novel Cloud Based Elastic Framework for Big Data Preprocessing School of Systems Engineering A Novel Cloud Based Elastic Framework for Big Data Preprocessing Omer Dawelbeit and Rachel McCrindle October 21, 2014 University of Reading 2008 www.reading.ac.uk Overview

More information

OpenAIRE Research Data Management Briefing paper

OpenAIRE Research Data Management Briefing paper OpenAIRE Research Data Management Briefing paper Understanding Research Data Management February 2016 H2020-EINFRA-2014-1 Topic: e-infrastructure for Open Access Research & Innovation action Grant Agreement

More information

Versity 2013. All rights reserved.

Versity 2013. All rights reserved. From the only independent developer of large scale archival storage systems, the Versity Storage Manager brings enterpriseclass storage virtualization to the Linux platform. Based on Open Source technology,

More information

Cambridge University Library. Working together: a strategic framework 2010 2013

Cambridge University Library. Working together: a strategic framework 2010 2013 1 Cambridge University Library Working together: a strategic framework 2010 2013 2 W o r k i n g to g e t h e r : a s t r at e g i c f r a m e w o r k 2010 2013 Vision Cambridge University Library will

More information

UNINETT Sigma2 AS: architecture and functionality of the future national data infrastructure

UNINETT Sigma2 AS: architecture and functionality of the future national data infrastructure UNINETT Sigma2 AS: architecture and functionality of the future national data infrastructure Authors: A O Jaunsen, G S Dahiya, H A Eide, E Midttun Date: Dec 15, 2015 Summary Uninett Sigma2 provides High

More information

Globus Research Data Management: Introduction and Service Overview. Steve Tuecke Vas Vasiliadis

Globus Research Data Management: Introduction and Service Overview. Steve Tuecke Vas Vasiliadis Globus Research Data Management: Introduction and Service Overview Steve Tuecke Vas Vasiliadis Presentations and other useful information available at globus.org/events/xsede15/tutorial 2 Thank you to

More information

Data Governance in the Hadoop Data Lake. Michael Lang May 2015

Data Governance in the Hadoop Data Lake. Michael Lang May 2015 Data Governance in the Hadoop Data Lake Michael Lang May 2015 Introduction Product Manager for Teradata Loom Joined Teradata as part of acquisition of Revelytix, original developer of Loom VP of Sales

More information

Virginia Commonwealth University Rice Rivers Center Data Management Plan

Virginia Commonwealth University Rice Rivers Center Data Management Plan Virginia Commonwealth University Rice Rivers Center Data Management Plan Table of Contents Objectives... 2 VCU Rice Rivers Center Research Protocol... 2 VCU Rice Rivers Center Data Management Plan... 3

More information

ERA Challenges. Draft Discussion Document for ACERA: 10/7/30

ERA Challenges. Draft Discussion Document for ACERA: 10/7/30 ERA Challenges Draft Discussion Document for ACERA: 10/7/30 ACERA asked for information about how NARA defines ERA completion. We have a list of functions that we would like for ERA to perform that we

More information

Fall 2015-2016 Course Descriptions School of Library and Information Studies March 4, 2015 Subject to Change

Fall 2015-2016 Course Descriptions School of Library and Information Studies March 4, 2015 Subject to Change Fall 2015-2016 Course Descriptions School of Library and Information Studies March 4, 2015 Subject to Change LIS 450: Information Agencies and Their Environment Basic communication theories and models;

More information

Big Data Analytics. Chances and Challenges. Volker Markl

Big Data Analytics. Chances and Challenges. Volker Markl Volker Markl Professor and Chair Database Systems and Information Management (DIMA), Technische Universität Berlin www.dima.tu-berlin.de Big Data Analytics Chances and Challenges Volker Markl DIMA BDOD

More information

How To Manage Cloud Data Safely

How To Manage Cloud Data Safely Information Governance In The Cloud Galina Datskovsky, Ph. D., CRM President of ARMA International SVP Information Governance Solutions Topics Cloud Characteristics And Risks Information Management In

More information

Research Data Storage and the University of Bristol

Research Data Storage and the University of Bristol Introduction: Policy for the use of the Research Data Storage Facility The University s High Performance Computing (HPC) facility went live to users in May 2007. Access to this world-class HPC facility

More information

Big Data in the context of Preservation and Value Adding

Big Data in the context of Preservation and Value Adding Big Data in the context of Preservation and Value Adding R. Leone, R. Cosac, I. Maggio, D. Iozzino ESRIN 06/11/2013 ESA UNCLASSIFIED Big Data Background ESA/ESRIN organized a 'Big Data from Space' event

More information

Survey of Canadian and International Data Management Initiatives. By Diego Argáez and Kathleen Shearer

Survey of Canadian and International Data Management Initiatives. By Diego Argáez and Kathleen Shearer Survey of Canadian and International Data Management Initiatives By Diego Argáez and Kathleen Shearer on behalf of the CARL Data Management Working Group (Working paper) April 28, 2008 Introduction Today,

More information

Digital Preservation Lifecycle Management

Digital Preservation Lifecycle Management Digital Preservation Lifecycle Management Building a demonstration prototype for the preservation of large-scale multi-media collections Arcot Rajasekar San Diego Supercomputer Center, University of California,

More information

Texas State University University Library Strategic Plan 2012 2017

Texas State University University Library Strategic Plan 2012 2017 Texas State University University Library Strategic Plan 2012 2017 Mission The University Library advances the teaching and research mission of the University and supports students, faculty, and other

More information

The Relationship Between Information Governance, Data Governance, and Big Data. Richard Kessler November 2015

The Relationship Between Information Governance, Data Governance, and Big Data. Richard Kessler November 2015 The Relationship Between Information Governance, Data Governance, and Big Data Richard Kessler November 2015 Definitions and Interpretations Data Governance "The exercise of authority and control over

More information

Impact of Big Data in Oil & Gas Industry. Pranaya Sangvai Reliance Industries Limited 04 Feb 15, DEJ, Mumbai, India.

Impact of Big Data in Oil & Gas Industry. Pranaya Sangvai Reliance Industries Limited 04 Feb 15, DEJ, Mumbai, India. Impact of Big Data in Oil & Gas Industry Pranaya Sangvai Reliance Industries Limited 04 Feb 15, DEJ, Mumbai, India. New Age Information 2.92 billions Internet Users in 2014 Twitter processes 7 terabytes

More information

WHAT SHOULD NSF DATA MANAGEMENT PLANS LOOK LIKE

WHAT SHOULD NSF DATA MANAGEMENT PLANS LOOK LIKE WHAT SHOULD NSF DATA MANAGEMENT PLANS LOOK LIKE Bin Ye, College of Agricultural and Life Sciences University of Wisconsin Diane Winter, Inter-university Consortium for Political and Social Research (ICPSR),

More information

Wrangler: A New Generation of Data-intensive Supercomputing. Christopher Jordan, Siva Kulasekaran, Niall Gaffney

Wrangler: A New Generation of Data-intensive Supercomputing. Christopher Jordan, Siva Kulasekaran, Niall Gaffney Wrangler: A New Generation of Data-intensive Supercomputing Christopher Jordan, Siva Kulasekaran, Niall Gaffney Project Partners Academic partners: TACC Primary system design, deployment, and operations

More information

INFORMATION SYSTEMS & HIGHER EDUCATION. Steve Kutay Digital Services Librarian Oviatt Library CSUN

INFORMATION SYSTEMS & HIGHER EDUCATION. Steve Kutay Digital Services Librarian Oviatt Library CSUN INFORMATION SYSTEMS & HIGHER EDUCATION Steve Kutay Digital Services Librarian Oviatt Library CSUN Indicators of Successful Info Systems Intuitive navigation Findable information Meets user needs + organizational

More information

POSITION DETAILS. Digitisation & Digital Services

POSITION DETAILS. Digitisation & Digital Services HR191 JOB DESCRIPTION NOTES Forms must be downloaded from the UCT website: http://www.uct.ac.za/depts/sapweb/forms/forms.htm This form serves as a template for the writing of job descriptions. A copy of

More information

Edinburgh Napier University. Research Data Management Policy

Edinburgh Napier University. Research Data Management Policy Edinburgh Napier University Research Data Management Policy Introduction/Rationale Edinburgh Napier University (the University) is committed to delivering excellent research and, as research data is at

More information

SHared Access Research Ecosystem (SHARE)

SHared Access Research Ecosystem (SHARE) SHared Access Research Ecosystem (SHARE) June 7, 2013 DRAFT Association of American Universities (AAU) Association of Public and Land-grant Universities (APLU) Association of Research Libraries (ARL) This

More information

LIBER Case Study: Author: Mijke Jetten, University Library, Radboud University, m.jetten@ubn.ru.nl

LIBER Case Study: Author: Mijke Jetten, University Library, Radboud University, m.jetten@ubn.ru.nl LIBER Case Study: Research Data Management at Radboud University Author: Mijke Jetten, University Library, Radboud University, m.jetten@ubn.ru.nl Keywords: generic, institutional, policy, support, software

More information

The Key Elements of Digital Asset Management

The Key Elements of Digital Asset Management The Key Elements of Digital Asset Management The last decade has seen an enormous growth in the amount of digital content, stored on both public and private computer systems. This content ranges from professionally

More information

Archive I. Metadata. 26. May 2015

Archive I. Metadata. 26. May 2015 Archive I Metadata 26. May 2015 2 Norstore Data Management Plan To successfully execute your research project you want to ensure the following three criteria are met over its entire lifecycle: You are

More information

Introduction to Research Data Management. Tom Melvin, Anita Schwartz, and Jessica Cote April 13, 2016

Introduction to Research Data Management. Tom Melvin, Anita Schwartz, and Jessica Cote April 13, 2016 Introduction to Research Data Management Tom Melvin, Anita Schwartz, and Jessica Cote April 13, 2016 What Will We Cover? Why is managing data important? Organizing and storing research data Sharing and

More information

Data management plan

Data management plan FACILITATE OPEN SCIENCE TRAINING FOR EUROPEAN RESEARCH 612425 Data management plan Course for Doctoral Students at ECPR Summer School 2015 Faculty of Social Sciences, University of Ljubljana, Slovenia

More information

Open Access to Manuscripts, Open Science, and Big Data

Open Access to Manuscripts, Open Science, and Big Data Open Access to Manuscripts, Open Science, and Big Data Progress, and the Elsevier Perspective in 2013 Presented by: Dan Morgan Title: Senior Manager Access Relations, Global Academic Relations Company

More information

RECOMMENDATIONS AND BEST PRACTICES FOR DATA SHARING IN NEW PROJECTS - THE FOT-NET DATA SHARING FRAMEWORK

RECOMMENDATIONS AND BEST PRACTICES FOR DATA SHARING IN NEW PROJECTS - THE FOT-NET DATA SHARING FRAMEWORK RECOMMENDATIONS AND BEST PRACTICES FOR DATA SHARING IN NEW PROJECTS - THE FOT-NET DATA SHARING FRAMEWORK 10 March 2015 Helena Gellerman, SAFER INTRODUCTION New projects retrieving data in two ways Collecting

More information

UCLA: Data management, governance, and policy issues

UCLA: Data management, governance, and policy issues UCLA: Data management, governance, and policy issues Chris

More information

On the Radar: Tessella

On the Radar: Tessella On the Radar: Tessella Creating an archive for the long-term preservation of digital content Reference Code: IT014-002789 Publication Date: 04 Sep 2013 Author: Sue Clarke SUMMARY Catalyst Ensuring that

More information

The OnApp Federation. Instant scale and reach for your cloud

The OnApp Federation. Instant scale and reach for your cloud The OnApp Federation Instant scale and reach for your cloud Instant scale and reach for your cloud The OnApp Federation is a global network of clouds you can use, on demand, to add scale and reach to your

More information

Digital Continuity Plan

Digital Continuity Plan Digital Continuity Plan Ensuring that your business information remains accessible and usable for as long as it is needed Accessible and usable information Digital continuity Digital continuity is an approach

More information

Lesson 3: Data Management Planning

Lesson 3: Data Management Planning Lesson 3: CC image by Joe Hall on Flickr What is a data management plan (DMP)? Why prepare a DMP? Components of a DMP NSF requirements for DMPs Example of NSF DMP CC image by Darla Hueske on Flickr After

More information

UNH Strategic Technology Plan

UNH Strategic Technology Plan UNH Strategic Technology Plan Joanna Young, UNH Chief Information Officer - April 2010 People increasingly experience or interact with an organization through a technology lens. Accessible, engaging, responsive,

More information

Applications of LTFS for Cloud Storage Use Cases

Applications of LTFS for Cloud Storage Use Cases Applications of LTFS for Cloud Storage Use Cases Version 1.0 Publication of this SNIA Technical Proposal has been approved by the SNIA. This document represents a stable proposal for use as agreed upon

More information

Strategic Plan 2013 2017

Strategic Plan 2013 2017 Plan 0 07 Mapping the Library for the Global Network University NYU DIVISION OF LIBRARIES Our Mission New York University Libraries is a global organization that advances learning, research, and scholarly

More information

Data Wrangling: From the Wild to the Lake

Data Wrangling: From the Wild to the Lake Data Wrangling: From the Wild to the Lake Ignacio Terrizzano Peter Schwarz Mary Roth John Colino IBM Research - Almaden 48 hours of video is uploaded to YouTube every minute Walmart processes million transactions

More information

CLOUD BLOCK STORAGE CONSISTENT AND RELIABLE STORAGE PERFORMANCE IN THE CLOUD

CLOUD BLOCK STORAGE CONSISTENT AND RELIABLE STORAGE PERFORMANCE IN THE CLOUD CLOUD BLOCK STORAGE CONSISTENT AND RELIABLE STORAGE PERFORMANCE IN THE CLOUD Rackspace Cloud Block Storage provides external block-level storage volumes that supplement the storage built into Rackspace

More information

DATA LIFE CYCLE & DATA MANAGEMENT PLANNING

DATA LIFE CYCLE & DATA MANAGEMENT PLANNING DATA LIFE CYCLE & DATA MANAGEMENT PLANNING......... VEERLE VAN DEN EYNDEN RESEARCH DATA MANAGEMENT TEAM UNIVERSITY OF ESSEX.. LOOKING AFTER AND MANAGING YOUR RESEARCH DATA (GOING DIGITAL AND ESRC ATN EVENTS),

More information

Data Management Best Practices for Landscape Conservation Cooperatives Part 1: LCC Funded Science

Data Management Best Practices for Landscape Conservation Cooperatives Part 1: LCC Funded Science Data Management Best Practices for Landscape Conservation Cooperatives Part 1: LCC Funded Science Version 3.4, November 2012 LCC Network Data Management Working Group Sean Finn, Josh Bradley, Emily Fort,

More information

Collaborations between Official Statistics and Academia in the Era of Big Data

Collaborations between Official Statistics and Academia in the Era of Big Data Collaborations between Official Statistics and Academia in the Era of Big Data World Statistics Day October 20-21, 2015 Budapest Vijay Nair University of Michigan Past-President of ISI vnn@umich.edu What

More information

European Data Infrastructure - EUDAT Data Services & Tools

European Data Infrastructure - EUDAT Data Services & Tools European Data Infrastructure - EUDAT Data Services & Tools Dr. Ing. Morris Riedel Research Group Leader, Juelich Supercomputing Centre Adjunct Associated Professor, University of iceland BDEC2015, 2015-01-28

More information

2013-2015. North Carolina Department of Cultural Resources Digital Preservation Plan. State Archives of North Carolina State Library of North Carolina

2013-2015. North Carolina Department of Cultural Resources Digital Preservation Plan. State Archives of North Carolina State Library of North Carolina 2013-2015 North Carolina Department of Cultural Resources Digital Preservation Plan State Archives of North Carolina State Library of North Carolina TABLE OF CONTENTS 1 EXECUTIVE SUMMARY... 3 2 INTRODUCTION...

More information

EXECUTIVE AGENCY HORIZON 2020 PROGRAMME

EXECUTIVE AGENCY HORIZON 2020 PROGRAMME EUROPEAN COMMISSION INNOVATION and NETWORKS EXECUTIVE AGENCY HORIZON 2020 PROGRAMME for RESEARCH and INNOVATION Reducing impacts and costs of freight and service trips in urban areas (Topic: MG-5.2-2014)

More information

Design of Data Management Guideline for Open Data Implementation

Design of Data Management Guideline for Open Data Implementation Design of Data Guideline for Implementation (case study in Indonesia) Arry Akhmad Arman Institut Teknologi Bandung Jl. Ganesha 10 Bandung Indonesia 40132 Phone: +62-22-2502260 arry.arman@yahoo.com Gilang

More information

From Stored Knowledge to Smart Knowledge

From Stored Knowledge to Smart Knowledge From Stored Knowledge to Smart Knowledge The British Library s Content Strategy 2013 2015 From Stored Knowledge to Smart Knowledge: The British Library s Content Strategy 2013 2015 Introduction The British

More information

HYPER-CONVERGED INFRASTRUCTURE STRATEGIES

HYPER-CONVERGED INFRASTRUCTURE STRATEGIES 1 HYPER-CONVERGED INFRASTRUCTURE STRATEGIES MYTH BUSTING & THE FUTURE OF WEB SCALE IT 2 ROADMAP INFORMATION DISCLAIMER EMC makes no representation and undertakes no obligations with regard to product planning

More information