COPO: Collaborative Open Plant Omics. Rob Davey Data Infrastructure and Algorithms Group Leader




1 COPO: Collaborative Open Plant Omics
Rob Davey, Data Infrastructure and Algorithms Group Leader

2

3 Acknowledgements
Toni Etuk, Felix Shaw
Oxford e-Research Centre: Susanna Sansone, Alejandra Gonzalez-Beltran, Philippe Rocca-Serra, Alfie Abdul-Rahman
Warwick: Jim Beynon, Katherine Denby, Ruth Bastow
EMBL-EBI: Paul Kersey
TGAC: Vicky Schneider, Tanya Dickie, Emily Angiolini, Matt Drew

4 Recently awarded BBSRC BBR grant
TGAC, Univ. Oxford, Univ. Warwick, EMBL-EBI
Supported by GARNet, iPlant, Eagle Genomics
Empower bioscience plant researchers to:
1. Enable standards-compliant data collection, curation and integration
2. Enhance access to data analysis and visualisation pipelines
3. Facilitate data sharing and publication to promote reuse
Train plant researchers in best practice for data sharing and producing citable Research Objects

5 (Good) Science is founded on reproducibility
Reproducibility depends on:
- reducing reinvention ("friction")*
- describing methods and data
- maximising benefit to the researcher
Describing methods is well established through traditional publishing
Data description is sorely under-represented and under-used
Benefits are often opaque
- Fear of being scooped, loss of control, reputation, etc.
*

6 What prevents plant scientists from openly depositing their data and metadata?
Lack of interoperability between:
- metadata annotation services
- data repository services
- data analysis services
- data publishing services
Researchers might not:
- be aware that the services exist
- have the expertise to use them
- see the value in properly describing their data

7
Data: Sample, Sequence, Genome, Proteome, Metabolome, Imaging
Code: GitHub, Bitbucket, Zenodo
Analysis: Galaxy, iPlant, Bioconductor, Taverna, local code/services
Publication: figshare, Scientific Data, Dryad, F1000, PeerJ, GigaScience
Beyond the PDF: Utopia, GitHub
Training: Materials, examples, workshops, bootcamps

8 It's not because these services don't exist!
Clearly, barriers exist between the scientist and the service
Infrastructure can help by:
- wiring existing services together
- improving access to services
- facilitating collaboration
- raising profile of the benefits of open science
How do we collaborate successfully to make this happen?
- Mapping services with Application Programming Interfaces

9

10 Grace signs into COPO with her ORCID ID
- This signs her into all other services as required
She starts a new COPO Profile
She uploads to the COPO platform:
- Three FASTQs (two Illumina HiSeq2500, one PacBio P6-C4) representing her velociraptor sequencing reads
She tells COPO to push her data to a Galaxy server and run a workflow, producing:
- An assembly of the reads from ALLPATHS-LG v51551
- A draft automated annotation from RAST v33-1
The interface prompts her to add metadata to her data in order to deposit them in the public repositories
- Metadata fields will be shown based on data, and redundant fields will be merged automatically
- Sample name, sample organism, data type, sequencer used, software name, software version...
She clicks Upload, and everything is submitted
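The metadata prompt in this scenario (fields shown based on the data, with redundant fields merged) could be sketched roughly as follows. This is only an illustration: the data type names and the field lists per type are hypothetical and are not taken from COPO's actual schema.

```python
# Hypothetical mapping of data types to the metadata fields each one needs.
# The field names come from the slide; the grouping per type is an assumption.
REQUIRED_FIELDS = {
    "sequence_reads": ["sample name", "sample organism", "data type", "sequencer used"],
    "assembly": ["sample name", "sample organism", "software name", "software version"],
    "annotation": ["sample name", "software name", "software version"],
}


def merged_fields(data_types):
    """Return each field once, in first-seen order, with the data types that need it."""
    merged = {}
    for data_type in data_types:
        for field in REQUIRED_FIELDS[data_type]:
            merged.setdefault(field, []).append(data_type)
    return merged


fields = merged_fields(["sequence_reads", "assembly", "annotation"])
for name, needed_by in fields.items():
    print(f"{name}: needed by {', '.join(needed_by)}")
```

With a merge like this, a field such as sample name is asked for once even though the reads, the assembly and the annotation all require it.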

11
Single sign-on (SSO), e.g. ORCID
Deposit multi-omics data in one go
- No context-switching between services
Run and deposit analytical workflows
- Describe software used, versions
- Pull into platforms, e.g. Galaxy, iPlant
- Support virtualisation, e.g. iPlant Atmosphere, Docker, Amazon AWS
Data is well-described, open, and everything has DOIs
- Finding and integrating data is improved greatly
- Make suggestions to users based on their data/workflows
Programmatic access to all layers
REPRODUCIBILITY

12 Not just raw/processed data is valuable
COPO supports submission of supplementary data to Figshare
- PDFs (posters, papers)
- CSV/Excel
- movies/images (size permitting)
Zenodo/GitHub releases for code DOIs
- Marked up with ENCODE Digital Curation Center's software metadata descriptors, for example

13 What have we achieved so far?
TGAC infrastructure to support brokering of data
- iRODS and web server virtual machines
- High-speed transfer Aspera links to EBI
Prototype user interface for multi-omics data submissions
- OAuth2 support ("sign in with" ORCID, Google, Twitter)
Developing JSON specification for COPO objects
- Easily stored in document-based databases, e.g. MongoDB
Interconversion between ISA formats
- ISA-Tab (CSV based) to JSON, and vice versa
- Linked Data specifications
Community interactions
- MetaboLights group at EBI
- Setting up this workshop!
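The interconversion between a tabular format and JSON mentioned on this slide can be illustrated with a minimal round trip. This sketch uses only the Python standard library and a generic delimited table; it is not the ISA tools API or the COPO JSON specification, and the column names and sample values are hypothetical.

```python
import csv
import io
import json


def table_to_json(text, delimiter="\t"):
    """Parse a delimited table with a header row into a JSON array of records."""
    reader = csv.DictReader(io.StringIO(text), delimiter=delimiter)
    return json.dumps(list(reader), indent=2)


def json_to_table(doc, delimiter="\t"):
    """Write a non-empty JSON array of flat records back out as a delimited table."""
    records = json.loads(doc)
    out = io.StringIO()
    writer = csv.DictWriter(
        out, fieldnames=list(records[0]), delimiter=delimiter, lineterminator="\n"
    )
    writer.writeheader()
    writer.writerows(records)
    return out.getvalue()


# Hypothetical sample sheet; real ISA files carry many more columns.
table = (
    "Sample Name\tOrganism\tSequencer\n"
    "S1\tVelociraptor\tIllumina HiSeq2500\n"
    "S2\tVelociraptor\tPacBio\n"
)
doc = table_to_json(table)
print(doc)
print(json_to_table(doc) == table)
```

Records in this shape are flat JSON documents, which is the kind of object a document-based database such as MongoDB stores directly.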

14 COPO will:
Facilitate easy relevant data description to:
- Submit data and metadata to multiple public repositories
- The reasons most of you are here
- What are the barriers for you and your data?
Facilitate access to workflows used to analyse the data, e.g. to GigaDB, Scientific Data
- This will form part of another COPO workshop

The open source ISA sooware suite and its internaqonal user community:

The open source ISA sooware suite and its internaqonal user community: The open source ISA sooware suite and its internaqonal user community: Knowledge management of experimental data Alejandra González- Beltrán Senior Software Engineer, ISATeam Oxford e- Research Centre,

More information

NIH Commons Overview, Framework & Pilots - Version 1. The NIH Commons

NIH Commons Overview, Framework & Pilots - Version 1. The NIH Commons The NIH Commons Summary The Commons is a shared virtual space where scientists can work with the digital objects of biomedical research, i.e. it is a system that will allow investigators to find, manage,

More information

BIOINFORMATICS Supporting competencies for the pharma industry

BIOINFORMATICS Supporting competencies for the pharma industry BIOINFORMATICS Supporting competencies for the pharma industry ABOUT QFAB QFAB is a bioinformatics service provider based in Brisbane, Australia operating nationwide and internationally. QFAB was established

More information

Report of the DTL focus meeting on Life Science Data Repositories

Report of the DTL focus meeting on Life Science Data Repositories Report of the DTL focus meeting on Life Science Data Repositories Goal The goal of the meeting was to inform and discuss research data repositories for life sciences. The big data era adds to the complexity

More information

E-SCIENCE IN WESTERN FRANCE :

E-SCIENCE IN WESTERN FRANCE : E-SCIENCE IN WESTERN FRANCE : BEGINS Yvan Le Bras Cyril Monjeaud Olivier Collin & the GenOuest team CNRS UMR 6074 IRISA-INRIA Context Now : Genomics : Next Generation Sequencing Now : Proteomics Next :

More information

Research Data Management Guide

Research Data Management Guide Research Data Management Guide Research Data Management at Imperial WHAT IS RESEARCH DATA MANAGEMENT (RDM)? Research data management is the planning, organisation and preservation of the evidence that

More information

DATA MANAGEMENT PLAN IN THE REAL LIFE SCIENCES

DATA MANAGEMENT PLAN IN THE REAL LIFE SCIENCES DATA MANAGEMENT PLAN IN THE REAL LIFE SCIENCES Yvan Le Bras Cyril Monjeaud Olivier Collin Jacques Nicolas CNRS UMR 6074 IRISA-INRIA Context Now : Genomics : Next Generation Sequencing Now : Proteomics

More information

Three data delivery cases for EMBL- EBI s Embassy. Guy Cochrane www.ebi.ac.uk

Three data delivery cases for EMBL- EBI s Embassy. Guy Cochrane www.ebi.ac.uk Three data delivery cases for EMBL- EBI s Embassy Guy Cochrane www.ebi.ac.uk EMBL European Bioinformatics Institute Genes, genomes & variation European Nucleotide Archive 1000 Genomes Ensembl Ensembl Genomes

More information

Big Data in BioMedical Sciences. Steven Newhouse, Head of Technical Services, EMBL-EBI

Big Data in BioMedical Sciences. Steven Newhouse, Head of Technical Services, EMBL-EBI Big Data in BioMedical Sciences Steven Newhouse, Head of Technical Services, EMBL-EBI Big Data for BioMedical Sciences EMBL-EBI: What we do and why? Challenges & Opportunities Infrastructure Requirements

More information

DATA SCIENTIST TRAINING FOR LIBRARIANS #DST4L. C. Erdmann DST4L @ Designing Libraries IV @libcce

DATA SCIENTIST TRAINING FOR LIBRARIANS #DST4L. C. Erdmann DST4L @ Designing Libraries IV @libcce DATA SCIENTIST TRAINING FOR LIBRARIANS #DST4L C. Erdmann DST4L @ Designing Libraries IV @libcce On the Same Page We started speaking the same language. A side conversation with a Harvard faculty member

More information

OPEN SOURCE AND BOTTOM-UP VRE APPROACH IN WESTERN FRANCE

OPEN SOURCE AND BOTTOM-UP VRE APPROACH IN WESTERN FRANCE OPEN SOURCE AND BOTTOM-UP VRE APPROACH IN WESTERN FRANCE Towards supporting accessible, reproducible, and transparent research in the life sciences Yvan Le Bras Cyril Monjeaud Olivier Collin, the GenOuest

More information

BBSRC TECHNOLOGY STRATEGY: TECHNOLOGIES NEEDED BY RESEARCH KNOWLEDGE PROVIDERS

BBSRC TECHNOLOGY STRATEGY: TECHNOLOGIES NEEDED BY RESEARCH KNOWLEDGE PROVIDERS BBSRC TECHNOLOGY STRATEGY: TECHNOLOGIES NEEDED BY RESEARCH KNOWLEDGE PROVIDERS 1. The Technology Strategy sets out six areas where technological developments are required to push the frontiers of knowledge

More information

FROM DATA REPOSITORIES TO DATA JOURNALS: PUBLISHING AND SHARING HEALTH-RELATED DATA

FROM DATA REPOSITORIES TO DATA JOURNALS: PUBLISHING AND SHARING HEALTH-RELATED DATA FROM DATA REPOSITORIES TO DATA JOURNALS: PUBLISHING AND SHARING HEALTH-RELATED DATA Big Data in Health Care, Oct 28 th, 2015 Andrew L. Hufton Managing Editor, Scientific Data Nature Publishing Group What

More information

In 2014, the Research Data group @ Purdue University

In 2014, the Research Data group @ Purdue University EDITOR S SUMMARY At the 2015 ASIS&T Research Data Access and Preservation (RDAP) Summit, panelists from Research Data @ Purdue University Libraries discussed the organizational structure intended to promote

More information

GenomeSpace Architecture

GenomeSpace Architecture GenomeSpace Architecture The primary services, or components, are shown in Figure 1, the high level GenomeSpace architecture. These include (1) an Authorization and Authentication service, (2) an analysis

More information

Introduction to Research Data Management. Tom Melvin, Anita Schwartz, and Jessica Cote April 13, 2016

Introduction to Research Data Management. Tom Melvin, Anita Schwartz, and Jessica Cote April 13, 2016 Introduction to Research Data Management Tom Melvin, Anita Schwartz, and Jessica Cote April 13, 2016 What Will We Cover? Why is managing data important? Organizing and storing research data Sharing and

More information

GeneProf and the new GeneProf Web Services

GeneProf and the new GeneProf Web Services GeneProf and the new GeneProf Web Services Florian Halbritter florian.halbritter@ed.ac.uk Stem Cell Bioinformatics Group (Simon R. Tomlinson) simon.tomlinson@ed.ac.uk December 10, 2012 Florian Halbritter

More information

E-SCIENCE IN WESTERN FRANCE : THE BEGINNING

E-SCIENCE IN WESTERN FRANCE : THE BEGINNING E-SCIENCE IN WESTERN FRANCE : THE BEGINNING Yvan Le Bras Olivier Collin Jacques Nicolas CNRS UMR 6074 IRISA-INRIA Context Now : Genomics : Next Generation Sequencing Now : Proteomics Next : Bio-imaging

More information

Towards the construction of an integrated Wheat Information System

Towards the construction of an integrated Wheat Information System Towards the construction of an integrated Wheat Information System Mario Caccamo 1, Hadi Quesneville 2 Report- June 2012 1. The Genome Analysis Centre (TGAC), Norwich Research Park, Norwich, UK 2. INRA,

More information

How To Write A Blog Post On Globus

How To Write A Blog Post On Globus Globus Software as a Service data publication and discovery Kyle Chard, University of Chicago Computation Institute, chard@uchicago.edu Jim Pruyne, University of Chicago Computation Institute, pruyne@uchicago.edu

More information

SHared Access Research Ecosystem (SHARE)

SHared Access Research Ecosystem (SHARE) SHared Access Research Ecosystem (SHARE) June 7, 2013 DRAFT Association of American Universities (AAU) Association of Public and Land-grant Universities (APLU) Association of Research Libraries (ARL) This

More information

The National Consortium for Data Science (NCDS)

The National Consortium for Data Science (NCDS) The National Consortium for Data Science (NCDS) A Public-Private Partnership to Advance Data Science Ashok Krishnamurthy PhD Deputy Director, RENCI University of North Carolina, Chapel Hill What is NCDS?

More information

data.bris: collecting and organising repository metadata, an institutional case study

data.bris: collecting and organising repository metadata, an institutional case study Describe, disseminate, discover: metadata for effective data citation. DataCite workshop, no.2.. data.bris: collecting and organising repository metadata, an institutional case study David Boyd data.bris

More information

Enhanced Research Data Management and Publication with Globus

Enhanced Research Data Management and Publication with Globus Enhanced Research Data Management and Publication with Globus Vas Vasiliadis Jim Pruyne Presented at OR2015 June 8, 2015 Presentations and other useful information available at globus.org/events/or2015/tutorial

More information

Big Data in BioMedical Sciences. Steven Newhouse, Head of Technical Services, EMBL-EBI

Big Data in BioMedical Sciences. Steven Newhouse, Head of Technical Services, EMBL-EBI Big Data in BioMedical Sciences Steven Newhouse, Head of Technical Services, EMBL-EBI Big Data for BioMedical Sciences EMBL-EBI: What we do and why? Challenges & Opportunities Infrastructure Requirements

More information

AGILENT S BIOINFORMATICS ANALYSIS SOFTWARE

AGILENT S BIOINFORMATICS ANALYSIS SOFTWARE ACCELERATING PROGRESS IS IN OUR GENES AGILENT S BIOINFORMATICS ANALYSIS SOFTWARE GENESPRING GENE EXPRESSION (GX) MASS PROFILER PROFESSIONAL (MPP) PATHWAY ARCHITECT (PA) See Deeper. Reach Further. BIOINFORMATICS

More information

Open Access to Manuscripts, Open Science, and Big Data

Open Access to Manuscripts, Open Science, and Big Data Open Access to Manuscripts, Open Science, and Big Data Progress, and the Elsevier Perspective in 2013 Presented by: Dan Morgan Title: Senior Manager Access Relations, Global Academic Relations Company

More information

Data Publishing Workflows with Dataverse

Data Publishing Workflows with Dataverse Data Publishing Workflows with Dataverse Mercè Crosas, Ph.D. Twitter: @mercecrosas Director of Data Science Institute for Quantitative Social Science, Harvard University MIT, May 6, 2014 Intro to our Data

More information

Workflow Tools at NERSC. Debbie Bard djbard@lbl.gov NERSC Data and Analytics Services

Workflow Tools at NERSC. Debbie Bard djbard@lbl.gov NERSC Data and Analytics Services Workflow Tools at NERSC Debbie Bard djbard@lbl.gov NERSC Data and Analytics Services NERSC User Meeting August 13th, 2015 What Does Workflow Software Do? Automate connection of applications Chain together

More information

Summary of Responses to the Request for Information (RFI): Input on Development of a NIH Data Catalog (NOT-HG-13-011)

Summary of Responses to the Request for Information (RFI): Input on Development of a NIH Data Catalog (NOT-HG-13-011) Summary of Responses to the Request for Information (RFI): Input on Development of a NIH Data Catalog (NOT-HG-13-011) Key Dates Release Date: June 6, 2013 Response Date: June 25, 2013 Purpose This Request

More information

Exploring the roles and responsibilities of data centres and institutions in curating research data a preliminary briefing.

Exploring the roles and responsibilities of data centres and institutions in curating research data a preliminary briefing. Exploring the roles and responsibilities of data centres and institutions in curating research data a preliminary briefing. Dr Liz Lyon, UKOLN, University of Bath Introduction and Objectives UKOLN is undertaking

More information

The Horizon 2020 Open Data Pilot. Sarah Jones Digital Curation Centre, University of Glasgow sarah.jones@glasgow.ac.

The Horizon 2020 Open Data Pilot. Sarah Jones Digital Curation Centre, University of Glasgow sarah.jones@glasgow.ac. The Horizon 2020 Open Data Pilot Sarah Jones Digital Curation Centre, University of Glasgow sarah.jones@glasgow.ac.uk Twitter: sjdcc Why open access and open data? The European Commission s vision is that

More information

Managing research data and Horizon 2020

Managing research data and Horizon 2020 Managing research data and Horizon 2020 Sarah Jones Digital Curation Centre, Glasgow sarah.jones@glasgow.ac.uk Twitter: @sjdcc Funded by: ConsorcioMadroñoconference on Data Management Plans and Horizon

More information

Integrated Rule-based Data Management System for Genome Sequencing Data

Integrated Rule-based Data Management System for Genome Sequencing Data Integrated Rule-based Data Management System for Genome Sequencing Data A Research Data Management (RDM) Green Shoots Pilots Project Report by Michael Mueller, Simon Burbidge, Steven Lawlor and Jorge Ferrer

More information

Open Source Software in Life Science Research. Woodhead Publishing Series in Biomedicine

Open Source Software in Life Science Research. Woodhead Publishing Series in Biomedicine Brochure More information from http://www.researchandmarkets.com/reports/2719842/ Open Source Software in Life Science Research. Woodhead Publishing Series in Biomedicine Description: The free/open source

More information

Cloud and Big Data Standardisation

Cloud and Big Data Standardisation Cloud and Big Data Standardisation EuroCloud Symposium ICS Track: Standards for Big Data in the Cloud 15 October 2013, Luxembourg Yuri Demchenko System and Network Engineering Group, University of Amsterdam

More information

Big Data Standardisation in Industry and Research

Big Data Standardisation in Industry and Research Big Data Standardisation in Industry and Research EuroCloud Symposium ICS Track: Standards for Big Data in the Cloud 15 October 2013, Luxembourg Yuri Demchenko System and Network Engineering Group, University

More information

NCBI resources III: GEO and ftp site. Yanbin Yin Spring 2013

NCBI resources III: GEO and ftp site. Yanbin Yin Spring 2013 NCBI resources III: GEO and ftp site Yanbin Yin Spring 2013 1 Homework assignment 2 Search colon cancer at GEO and find a data Series and perform a GEO2R analysis Write a report (in word or ppt) to include

More information

Introduction to Dropbox. Jim Miller, LCITO Office 785.296.5566 Mobile 913.484.8013 Email jim.miller@las.ks.gov

Introduction to Dropbox. Jim Miller, LCITO Office 785.296.5566 Mobile 913.484.8013 Email jim.miller@las.ks.gov Introduction to Dropbox Jim Miller, LCITO Office 785.296.5566 Mobile 913.484.8013 Email jim.miller@las.ks.gov Introduction to Dropbox What is it? Why use it? Mitigating the risks of using Dropbox? Dropbox

More information

Bringing Compute to the Data Alternatives to Moving Data. Part of EUDAT s Training in the Fundamentals of Data Infrastructures

Bringing Compute to the Data Alternatives to Moving Data. Part of EUDAT s Training in the Fundamentals of Data Infrastructures Bringing Compute to the Data Alternatives to Moving Data Part of EUDAT s Training in the Fundamentals of Data Infrastructures Introduction Why consider alternatives? The traditional approach Alternative

More information

The Preservation and Sustainability of Research Data

The Preservation and Sustainability of Research Data The Preservation and Sustainability of Research Data Dr Markus Buchhorn, Director, ICT Environments Australian National University; Formerly: Head, ANU Internet Futures Grid Services Architect, APAC Grid

More information

Research Data Alliance: Current Activities and Expected Impact. SGBD Workshop, May 2014 Herman Stehouwer

Research Data Alliance: Current Activities and Expected Impact. SGBD Workshop, May 2014 Herman Stehouwer Research Data Alliance: Current Activities and Expected Impact SGBD Workshop, May 2014 Herman Stehouwer The Vision 2 Researchers and innovators openly share data across technologies, disciplines, and countries

More information

LabArchives Electronic Lab Notebook:

LabArchives Electronic Lab Notebook: Electronic Lab Notebook: Cloud platform to manage research workflow & data Support Data Management Plans Annotate and prove discovery Secure compliance Improve compliance with your data management plans,

More information

Cloud Computing Solutions for Genomics Across Geographic, Institutional and Economic Barriers

Cloud Computing Solutions for Genomics Across Geographic, Institutional and Economic Barriers Cloud Computing Solutions for Genomics Across Geographic, Institutional and Economic Barriers Ntinos Krampis Asst. Professor J. Craig Venter Institute kkrampis@jcvi.org http://www.jcvi.org/cms/about/bios/kkrampis/

More information

Building Success on Acquia Cloud:

Building Success on Acquia Cloud: Building Success on Acquia Cloud: 10 Layers of PaaS TECHNICAL Guide Table of Contents Executive Summary.... 3 Introducing the 10 Layers of PaaS... 4 The Foundation: Five Layers of PaaS Infrastructure...

More information

The World of Data and Its Importance

The World of Data and Its Importance The Preservation and Sustainability of Research Data in Australia Dr Markus Buchhorn, Director, ICT Environments Australian National University; Also in www.apsr.edu.au Formerly: Head, ANU Internet Futures

More information

Globus Research Data Management: Introduction and Service Overview

Globus Research Data Management: Introduction and Service Overview Globus Research Data Management: Introduction and Service Overview Kyle Chard chard@uchicago.edu Ben Blaiszik blaiszik@uchicago.edu Thank you to our sponsors! U. S. D E P A R T M E N T OF ENERGY 2 Agenda

More information

ODIN ORCID and DATACITE Interoperability Network Title

ODIN ORCID and DATACITE Interoperability Network Title ODIN ORCID and DATACITE Interoperability Network Title This project has received funding from the European Union's Seventh Framework Programme for research, technological development and demonstration

More information

-> Integration of MAPHiTS in Galaxy

-> Integration of MAPHiTS in Galaxy Enabling NGS Analysis with(out) the Infrastructure, 12:0512 Development of a workflow for SNPs detection in grapevine From Sets to Graphs: Towards a Realistic Enrichment Analy species: MAPHiTS -> Integration

More information

Service Road Map for ANDS Core Infrastructure and Applications Programs

Service Road Map for ANDS Core Infrastructure and Applications Programs Service Road Map for ANDS Core and Applications Programs Version 1.0 public exposure draft 31-March 2010 Document Target Audience This is a high level reference guide designed to communicate to ANDS external

More information

Euro-BioImaging European Research Infrastructure for Imaging Technologies in Biological and Biomedical Sciences

Euro-BioImaging European Research Infrastructure for Imaging Technologies in Biological and Biomedical Sciences Euro-BioImaging European Research Infrastructure for Imaging Technologies in Biological and Biomedical Sciences WP11 Data Storage and Analysis Task 11.1 Coordination Deliverable 11.3 Selected Standards

More information

Running Agilent GeneSpring MPP on the Cloud

Running Agilent GeneSpring MPP on the Cloud Running Agilent GeneSpring MPP on the Cloud Technical Overview Authors Stephen Madden, Rick A. Fasani, and Michael Rosenberg Agilent Technologies, Inc. Santa Clara, California, USA Introduction Cloud computing

More information

PRIVACY AWARE ACCESS CONTROL FOR CLOUD-BASED DATA PLATFORMS

PRIVACY AWARE ACCESS CONTROL FOR CLOUD-BASED DATA PLATFORMS www.openi-ict.eu Open-Source, Web-Based, Framework for Integrating Applications with Social Media Services and Personal Cloudlets PRIVACY AWARE ACCESS CONTROL FOR CLOUD-BASED DATA PLATFORMS Open-Source,

More information

Research Data Management

Research Data Management Research Data Management 1 Why to we need to Manage Data? 2 Data Management Planning Typically covers: - What data will be created (format, types) and how? - How will the data be documented and described?

More information

Innovative Seed Grant Data Management Plans. February 17, 2015

Innovative Seed Grant Data Management Plans. February 17, 2015 Innovative Seed Grant Data Management Plans February 17, 2015 Presenters Andrew Johnson Research Data Librarian University Libraries andrew.m.johnson@colorado.edu Shelley Knuth Senior Research Data Specialist

More information

Enabling multi-cloud resources at CERN within the Helix Nebula project. D. Giordano (CERN IT-SDC) HEPiX Spring 2014 Workshop 23 May 2014

Enabling multi-cloud resources at CERN within the Helix Nebula project. D. Giordano (CERN IT-SDC) HEPiX Spring 2014 Workshop 23 May 2014 Enabling multi-cloud resources at CERN within the Helix Nebula project D. Giordano (CERN IT-) HEPiX Spring 2014 Workshop This document produced by Members of the Helix Nebula consortium is licensed under

More information

BioMed Central s position statement on open data

BioMed Central s position statement on open data BioMed Central s position statement on open data Increasing transparency in scientific research has always been at the core of BioMed Central s strategy. Now, after more than a decade of open access research

More information

Canadian National Research Data Repository Service. CC and CARL Partnership for a national platform for Research Data Management

Canadian National Research Data Repository Service. CC and CARL Partnership for a national platform for Research Data Management Research Data Management Canadian National Research Data Repository Service Progress Report, June 2016 As their digital datasets grow, researchers across all fields of inquiry are struggling to manage

More information

Implementation of Open Researcher and Contributor ID. (ORCID) at a Large Academic Institution

Implementation of Open Researcher and Contributor ID. (ORCID) at a Large Academic Institution Implementation of Open Researcher and Contributor ID (ORCID) at a Large Academic Institution Merle Rosenzweig*, AMLS Informationist Taubman Health Sciences Library, University of Michigan Abstract ORCID

More information

The Galaxy workflow. George Magklaras PhD RHCE

The Galaxy workflow. George Magklaras PhD RHCE The Galaxy workflow George Magklaras PhD RHCE Biotechnology Center of Oslo & The Norwegian Center of Molecular Medicine University of Oslo, Norway http://www.biotek.uio.no http://www.ncmm.uio.no http://www.no.embnet.org

More information

Reduce and manage operating costs and improve efficiency. Support better business decisions based on availability of real-time information

Reduce and manage operating costs and improve efficiency. Support better business decisions based on availability of real-time information Data Management Solutions Horizon Software Solution s Data Management Solutions provide organisations with confidence in control of their data as they change systems and implement new solutions. Data is

More information

Attach receipt options:

Attach receipt options: Attaching Receipts and Receipt Store There are a few ways to attach receipts to an expense report. You will only need to choose one of the following options when attaching receipts. You can add receipts

More information

Integrating computational data analysis capabilities into analytics applications

Integrating computational data analysis capabilities into analytics applications Integrating computational data analysis capabilities into analytics applications TIBCO Spotfire API Juan Elvira Integromics Deputy CTO About Integromics www.integromics.com Focus on software development

More information

AWS CodePipeline. User Guide API Version 2015-07-09

AWS CodePipeline. User Guide API Version 2015-07-09 AWS CodePipeline User Guide AWS CodePipeline: User Guide Copyright 2015 Amazon Web Services, Inc. and/or its affiliates. All rights reserved. Amazon's trademarks and trade dress may not be used in connection

More information

Cloud Computing for e-science with CARMEN

Cloud Computing for e-science with CARMEN Cloud Computing for e-science with CARMEN Paul Watson, Phillip Lord, Frank Gibson, Panayiotis Periorellis, Georgios Pitsilis School of Computing Science, Newcastle University, Newcastle-upon-Tyne, UK Paul.Watson@newcastle.ac.uk

More information

Introduction to NGS data analysis

Introduction to NGS data analysis Introduction to NGS data analysis Jeroen F. J. Laros Leiden Genome Technology Center Department of Human Genetics Center for Human and Clinical Genetics Sequencing Illumina platforms Characteristics: High

More information

Bioinformatics for programmers

Bioinformatics for programmers Bioinformatics for programmers Scientific software development: best practices and approaches Konstantin Okonechnikov Max Planck Institute For Infection Biology Летняя школа биоинформатики Москва, 2013

More information

The data landscape lessons from UK

The data landscape lessons from UK The data landscape lessons from UK Veerle Van den Eynden UK Data Archive University of Essex Faculty of Psychology and Educational Sciences University of Ghent, Belgium 23 October 2014 UK data landscape

More information

OpenAIRE Research Data Management Briefing paper

OpenAIRE Research Data Management Briefing paper OpenAIRE Research Data Management Briefing paper Understanding Research Data Management February 2016 H2020-EINFRA-2014-1 Topic: e-infrastructure for Open Access Research & Innovation action Grant Agreement

More information

Beyond The Web Drupal Meets The Desktop (And Mobile) Justin Miller Code Sorcery Workshop, LLC http://codesorcery.net/dcdc

Beyond The Web Drupal Meets The Desktop (And Mobile) Justin Miller Code Sorcery Workshop, LLC http://codesorcery.net/dcdc Beyond The Web Drupal Meets The Desktop (And Mobile) Justin Miller Code Sorcery Workshop, LLC http://codesorcery.net/dcdc Introduction Personal introduction Format & conventions for this talk Assume familiarity

More information
