Work Package 13.5: Authors: Paul Flicek and Ilkka Lappalainen. 1. Introduction

Size: px
Start display at page:

Download "Work Package 13.5: Authors: Paul Flicek and Ilkka Lappalainen. 1. Introduction"

Transcription

1 Work Package 13.5: Report summarising the technical feasibility of the European Genotype Archive to collect, store, and use genotype data stored in European biobanks in a manner that complies with all applicable medical and privacy regulations Authors: Paul Flicek and Ilkka Lappalainen 1. Introduction The European Genome-phenome Archive (EGA; promotes the sharing of all types of potentially identifiable genetic and phenotypic data consented for research use but not for full open public release. In this report, the details of the EGA design and implementation are presented both the technical feasibility and the effectiveness of the EGA for its stated purpose is demonstrated. As a demonstrated working and active project of the European Bioinformatics Institute (EBI), it provides a critical service in the ELIXIR infrastructure for European biobanks and other researchers. 2. EGA Infrastructure The EGA infrastructure has been designed to provide a secure and scalable system for the archiving and dissemination of such data. The EGA security policy includes the development of a safe computing facility within EBI and a comprehensive suite of protocols for information management. All implemented protocols are consistent with the European Union Data Protection Directive (95/46/EC) and are subject to regular independent audit. The archived data are required to follow the EGA data access policy model whereby the data access decisions are made by a data access-granting organisation (DAO) and not by the EGA. The EGA project provides data management and distribution services for users of the database. 2.1 Overview of the EGA data model

2 The EGA data model is modularized to provide high security and optimal performance for large data archiving. Concepts in the data model provide storage of information on a subject, any sample that may have been derived from the subject and the phenotypic and genetic variations acquired from a particular sample in separate relational databases. The databases are located in a secure area accessible only to the EGA team. The raw data supporting the variant calls is stored outside these databases in an encrypted format. The links between the data points stored in the databases are made using abstract EGA identifiers. The EGA application programming interface (API) has been developed to provide unified and transparent tools for archiving and distributing data. No databases or direct access to databases is provided through the API for authorized users or members of the public. The access is provided to data files that have been created from the archive based on the agreement with the corresponding DAO. A dataset constitutes a single unit of released data that is governed by a single data access policy. The access to a dataset is provided by the EGA to the user once the authorization has been granted by the governing DAO. 2.2 The EGA data model for phenotypic information A sample object is created for each submitted sample to which the provided phenotype variables are associated as key value pairs. The EGA supports archiving of longitudinal sample types by linking samples to a particular subject once this information has been made available. It is also possible to hold this key outside of the EGA or authorize access to the subject-sample mapping separately to the data. The EGA does not have an internal process to harmonize phenotype data on samples across different submissions. The phenotype data are distributed as it has been submitted into our system. The submission of phenotype data using a standardized ontology vocabulary is encouraged. The EGA supports submission updates. 2.3 The EGA data model for genetic data

3 The accepted data types include manufacturer-specific raw data formats from array-based genotyping and raw DNA sequence data arising from resequencing, transcriptomics or other assays. The raw data files, such as the information for each probe in an array-based experiment or the raw reads from a next generation sequence experiment, are encrypted and archived into a file repository that shares the same design as that used for the European Nucleotide Archive (ENA) 1. Only the EGA team members have access to the archival encryption key. The variant types called from the raw data are stored in relational databases optimized for each data type. The schema requirements for genotypes are very different to those of structural variations. The EGA API facilitates the storage and retrieval of variants, together with any associated information recorded during the calling process, such as intensity values or quality scores. It is possible to archive variants called with a number of different algorithms for the same experiment. The API also allows us to merge genotype data acquired with different technologies, phase submitted data or impute unobserved genotypes using public reference panels such as those being developed in the 1000 Genomes Project. The EGA archives any submitted summary level statistical analysis. Results of the quality analysis connected to the submission are also stored, but without altering the original data. 2.4 Feasibility of the EGA data model In summary, the EGA data model allows for the storage of phenotypic and genetic information for samples in physically separate locations. Security is, therefore, controlled specifically for the stored data type without compromising access across the data archived for a particular subject and provide a scalable system that is able to respond to future storage, analysis and distribution requirements. 3. How users interact with the EGA 1 Leinonen et al., Nucleic Acid Research 2010

4 The EGA supports both submission of data from individual researcher or research groups in support of publications and also for prepublication data release for large-scale community resource projects as recommended by the Toronto workshop Data submissions to the EGA All data files must be encrypted prior to their upload to a dedicated submission account. The EGA only accepts the encryption keys using an out of band method such as telephone, postal mail or a courier. The EGA also provides a public key that allows secure encryption for the data submissions. In addition to data files, the EGA requires each submission to provide accurate information on the experimental and analytical methods used in the study. Each submission must also include DAO contact details, applied policy information and a certification for the authority to submit the data to the EGA for archiving and dissemination on behalf of the submitting organisation. The EGA accepts information in pre-defined submission formats, such as excel sheet based format MAGE-tab or XML. The EGA submission website 3 includes the most recent documentation for the submission process and provides examples of the data submission formats. The experienced EGA help-desk 4 also provides additional help during the submission. Once the data has been submitted to us, the EGA team members work together with the submitter to make sure that the data are correctly presented in our system. The release of any data from the EGA requires DAO authorization. 3.2 Data release from the EGA Submissions to the EGA must be consistent with national laws and regulations. The archived data are required to follow the EGA data access policy model whereby the data access decisions are made by a data accessgranting organisation (DAO) and not by the EGA. The DAO may be the same organisation that approved and monitored the initial study protocol or a designate of this approving organization such as a dedicated data access 2 Toronto International Data Release Workshop Authors, Nature to ega-helpdesk@ebi.ac.uk

5 committee (DAC). Access to the data must be granted in a timely fashion to all bona fide researchers whose use of the data is consistent with the original consent agreements. The data access agreement that dictates how data must be stored, transferred or analysed is made directly between the applicant and the corresponding DAO. The EGA associates the data access rights to personal accounts within our system. All account actions are logged into our audit system. The EGA project provides data management tools that allow the DAO to directly add, remove or summarize permissions for those EGA accounts that are linked to the data they have a mandate to govern. These tools also show the full audit trail which includes of when a particular data access was added to an EGA account and who performed this action. The EGA supports workflows that require multiple authorizations that can include administrators from several organisations. The complex workflows have been implemented to allow strict checks of the applicants prior to data access authorization. The EGA user management tools are integrated into our website and can be linked to an account by DAO authorization. The EGA provides full documentation regarding the use of these tools and further training for data access management is available upon request. 4. Response to changing scientific environment Since the launch of the service, scientific developments have impacted the EGA operations significantly and the data models and procedures have been robust to these changes. As an example, the next generation sequencing influences the size and type of genetic information collected from the samples and submitted to our system resulting in developments to the data model and infrastructure as these data are generally larger in size per sample than arraybased genotype data. Additionally, in the fall of 2008, a publication 5 described computational methods to predict whether a given individual had participated in a particular research project using summary-level data. The EGA, together with the 5 Homer N et al., PLoS Genet 4(8): e

6 DAOs, responded to this publication by removing all public access to data that could lead to the identification of the research participants. These data are now made available to users that either have been granted access only to the summary data or have access to the individual genotypes, and hence, would be able to produce the same results. The EGA is able to provide summary data from studies archived in our system for other EBI resources or to other ESFRI projects should the policy change in the future. Summary The EGA service has been used successfully since April 2008 and currently manages genetic and phenotypic information for approximately samples listed in more than 40 different studies and serve 1700 authorized data users worldwide. These data include samples that are stored in European BioBanks such as the UK DNA Banking Network. The EGA provides appropriate security and has built a robust and scalable infrastructure that is responsive to changes in the science and regulatory environment. Taken as a whole, it is clear that the operation of the EGA as a service is technically feasible and that that EGA can be used as a critical tool for ELIXIR and European ESFRI projects.

European Genome-phenome Archive database of human data consented for use in biomedical research at the European Bioinformatics Institute

European Genome-phenome Archive database of human data consented for use in biomedical research at the European Bioinformatics Institute European Genome-phenome Archive database of human data consented for use in biomedical research at the European Bioinformatics Institute Justin Paschall Team Leader Genetic Variation / EGA ! European Genome-phenome

More information

Computational Requirements

Computational Requirements Workshop on Establishing a Central Resource of Data from Genome Sequencing Projects Computational Requirements Steve Sherry, Lisa Brooks, Paul Flicek, Anton Nekrutenko, Kenna Shaw, Heidi Sofia High-density

More information

Global Alliance. Ewan Birney Associate Director EMBL-EBI

Global Alliance. Ewan Birney Associate Director EMBL-EBI Global Alliance Ewan Birney Associate Director EMBL-EBI Our world is changing Research to Medical Research English as language Lightweight legal Identical/similar systems Open data Publications Grant-funding

More information

Three data delivery cases for EMBL- EBI s Embassy. Guy Cochrane www.ebi.ac.uk

Three data delivery cases for EMBL- EBI s Embassy. Guy Cochrane www.ebi.ac.uk Three data delivery cases for EMBL- EBI s Embassy Guy Cochrane www.ebi.ac.uk EMBL European Bioinformatics Institute Genes, genomes & variation European Nucleotide Archive 1000 Genomes Ensembl Ensembl Genomes

More information

NIH Genomic Data Sharing (GDS) Policy Guidance Memo #2 1

NIH Genomic Data Sharing (GDS) Policy Guidance Memo #2 1 MEMORANDUM TO: Principal Investigators and Research Staff DATE: 2/22/15 FROM: Anne Klibanski, MD, Partners Chief Academic Officer (CAO) Paul Anderson, MD, PhD, BWH CAO Harry Orf, PhD, MGH Sr. Vice President-Research

More information

NIH s Genomic Data Sharing Policy

NIH s Genomic Data Sharing Policy NIH s Genomic Data Sharing Policy 2 Benefits of Data Sharing Enables data generated from one study to be used to explore a wide range of additional research questions Increases statistical power and scientific

More information

Euro-BioImaging European Research Infrastructure for Imaging Technologies in Biological and Biomedical Sciences

Euro-BioImaging European Research Infrastructure for Imaging Technologies in Biological and Biomedical Sciences Euro-BioImaging European Research Infrastructure for Imaging Technologies in Biological and Biomedical Sciences WP11 Data Storage and Analysis Task 11.1 Coordination Deliverable 11.2 Community Needs of

More information

Workshop on Establishing a Central Resource of Data from Genome Sequencing Projects

Workshop on Establishing a Central Resource of Data from Genome Sequencing Projects Report on the Workshop on Establishing a Central Resource of Data from Genome Sequencing Projects Background and Goals of the Workshop June 5 6, 2012 The use of genome sequencing in human research is growing

More information

6 ELIXIR Domain Specific Services

6 ELIXIR Domain Specific Services 6 ELIXIR Domain Specific Services Work stream leads: Alfonso Valencia (ES), Inge Jonassen (NO), Jose Leal (PT) Work stream members: Nils-Peder Willassen (NO), Finn Drablos (NO), Mark Viant (UK), Ferran

More information

The 100,000 genomes project

The 100,000 genomes project The 100,000 genomes project Tim Hubbard @timjph Genomics England King s College London, King s Health Partners Wellcome Trust Sanger Institute ClinGen / Decipher Washington DC, 26 th May 2015 The 100,000

More information

ESTRO PRIVACY AND DATA SECURITY NOTICE

ESTRO PRIVACY AND DATA SECURITY NOTICE ESTRO PRIVACY AND DATA SECURITY NOTICE This Data Privacy and Security Policy is a dynamic document, which will reflect our continuing vigilance to properly handle and secure information that we are trusted

More information

Case Study Life Sciences Data

Case Study Life Sciences Data Case Study Life Sciences Data Centre for Integrative Systems Biology and Bioinformatics www.imperial.ac.uk/bioinfsupport Sarah Butcher s.butcher@imperial.ac.uk www.imperial.ac.uk/bioinfsupport Bio-data

More information

Report of the DTL focus meeting on Life Science Data Repositories

Report of the DTL focus meeting on Life Science Data Repositories Report of the DTL focus meeting on Life Science Data Repositories Goal The goal of the meeting was to inform and discuss research data repositories for life sciences. The big data era adds to the complexity

More information

UKB_WCSGAX: UK Biobank 500K Samples Genotyping Data Generation by the Affymetrix Research Services Laboratory. April, 2015

UKB_WCSGAX: UK Biobank 500K Samples Genotyping Data Generation by the Affymetrix Research Services Laboratory. April, 2015 UKB_WCSGAX: UK Biobank 500K Samples Genotyping Data Generation by the Affymetrix Research Services Laboratory April, 2015 1 Contents Overview... 3 Rare Variants... 3 Observation... 3 Approach... 3 ApoE

More information

Lecture 11 Data storage and LIMS solutions. Stéphane LE CROM lecrom@biologie.ens.fr

Lecture 11 Data storage and LIMS solutions. Stéphane LE CROM lecrom@biologie.ens.fr Lecture 11 Data storage and LIMS solutions Stéphane LE CROM lecrom@biologie.ens.fr Various steps of a DNA microarray experiment Experimental steps Data analysis Experimental design set up Chips on catalog

More information

1.2: DATA SHARING POLICY. PART OF THE OBI GOVERNANCE POLICY Available at: http://www.braininstitute.ca/brain-code-governance. 1.2.

1.2: DATA SHARING POLICY. PART OF THE OBI GOVERNANCE POLICY Available at: http://www.braininstitute.ca/brain-code-governance. 1.2. 1.2: DATA SHARING POLICY PART OF THE OBI GOVERNANCE POLICY Available at: http://www.braininstitute.ca/brain-code-governance 1.2.1 Introduction Consistent with its international counterparts, OBI recognizes

More information

ENABLING DATA TRANSFER MANAGEMENT AND SHARING IN THE ERA OF GENOMIC MEDICINE. October 2013

ENABLING DATA TRANSFER MANAGEMENT AND SHARING IN THE ERA OF GENOMIC MEDICINE. October 2013 ENABLING DATA TRANSFER MANAGEMENT AND SHARING IN THE ERA OF GENOMIC MEDICINE October 2013 Introduction As sequencing technologies continue to evolve and genomic data makes its way into clinical use and

More information

Worldwide Collaborations in Molecular Profiling

Worldwide Collaborations in Molecular Profiling Worldwide Collaborations in Molecular Profiling Lillian L. Siu, MD Director, Phase I Program and Cancer Genomics Program Princess Margaret Cancer Centre Lillian Siu, MD Contracted Research: Novartis, Pfizer,

More information

Using the Grid for the interactive workflow management in biomedicine. Andrea Schenone BIOLAB DIST University of Genova

Using the Grid for the interactive workflow management in biomedicine. Andrea Schenone BIOLAB DIST University of Genova Using the Grid for the interactive workflow management in biomedicine Andrea Schenone BIOLAB DIST University of Genova overview background requirements solution case study results background A multilevel

More information

Committee on WIPO Standards (CWS)

Committee on WIPO Standards (CWS) E CWS/1/5 ORIGINAL: ENGLISH DATE: OCTOBER 13, 2010 Committee on WIPO Standards (CWS) First Session Geneva, October 25 to 29, 2010 PROPOSAL FOR THE PREPARATION OF A NEW WIPO STANDARD ON THE PRESENTATION

More information

Enabling a federated environment to support biomedical research. Gianmauro Cuccuru CRS4

Enabling a federated environment to support biomedical research. Gianmauro Cuccuru CRS4 Enabling a federated environment to support biomedical research Gianmauro Cuccuru CRS4 ELIXIR connects national bioinformatics centres and EMBL- EBI into a sustainable European infrastructure for biological

More information

An Introduction to Managing Research Data

An Introduction to Managing Research Data An Introduction to Managing Research Data Author University of Bristol Research Data Service Date 1 August 2013 Version 3 Notes URI IPR data.bris.ac.uk Copyright 2013 University of Bristol Within the Research

More information

Electronic Document and Record Compliance for the Life Sciences

Electronic Document and Record Compliance for the Life Sciences Electronic Document and Record Compliance for the Life Sciences Kiran Thakrar, SoluSoft Inc. SoluSoft, Inc. 300 Willow Street South North Andover, MA 01845 Website: www.solu-soft.com Email: solusoftsales@solu-soft.com

More information

Towards the construction of an integrated Wheat Information System

Towards the construction of an integrated Wheat Information System Towards the construction of an integrated Wheat Information System Mario Caccamo 1, Hadi Quesneville 2 Report- June 2012 1. The Genome Analysis Centre (TGAC), Norwich Research Park, Norwich, UK 2. INRA,

More information

Q: What browsers will be supported? A: Internet Explorer (from version 6), Firefox (from version 3.0), Safari, Chrome

Q: What browsers will be supported? A: Internet Explorer (from version 6), Firefox (from version 3.0), Safari, Chrome CCV Renewal FAQ General Q: Why is the CCV building a new application? A: The current application was built in 2002, using the latest web technology available at that time. Over the last ten years the number

More information

UCLA Team Sequences Cell Line, Puts Open Source Software Framework into Production

UCLA Team Sequences Cell Line, Puts Open Source Software Framework into Production Page 1 of 6 UCLA Team Sequences Cell Line, Puts Open Source Software Framework into Production February 05, 2010 Newsletter: BioInform BioInform - February 5, 2010 By Vivien Marx Scientists at the department

More information

MANAGED FILE TRANSFER: 10 STEPS TO SOX COMPLIANCE

MANAGED FILE TRANSFER: 10 STEPS TO SOX COMPLIANCE WHITE PAPER MANAGED FILE TRANSFER: 10 STEPS TO SOX COMPLIANCE 1. OVERVIEW Do you want to design a file transfer process that is secure? Or one that is compliant? Of course, the answer is both. But it s

More information

A complete platform for proactive data management

A complete platform for proactive data management Brochure A complete platform for proactive data management HP Structured Data Manager Software for Oracle e-business Suite The right data management strategy The increased size and unmanaged growth of

More information

FTP-Stream Data Sheet

FTP-Stream Data Sheet FTP-Stream Data Sheet Problem FTP-Stream solves four demanding business challenges: Global distribution of files any size. File transfer to / from China which is notoriously challenging. Document control

More information

White Paper: NCBI Database of Genotypes and Phenotypes (dbgap) Security Best Practices Compliance Overview for the New DNAnexus Platform

White Paper: NCBI Database of Genotypes and Phenotypes (dbgap) Security Best Practices Compliance Overview for the New DNAnexus Platform White Paper: NCBI Database of Genotypes and Phenotypes (dbgap) Security Best Practices Compliance Overview for the New DNAnexus Platform April 18, 2013 Overview This White Paper summarizes how the new

More information

CONSUMER DATA RESEARCH CENTRE DATA SERVICE USER GUIDE. Version: August 2015

CONSUMER DATA RESEARCH CENTRE DATA SERVICE USER GUIDE. Version: August 2015 CONSUMER DATA RESEARCH CENTRE DATA SERVICE USER GUIDE Version: August 2015 Introduction The Consumer Data Research Centre (CDRC or Centre) is an academic led, multi-institution laboratory which discovers,

More information

Writing a Wellcome Trust Data Management & Sharing Plan

Writing a Wellcome Trust Data Management & Sharing Plan Writing a Wellcome Trust Data Management & Sharing Plan Report Version Control Version Date Author Change Description 1.3 02 September 2014 Gareth Knight Revision to Q1, Q2, Q4 and Q6 on basis of feedback

More information

Information and Data Sharing Policy* Genomics:GTL Program

Information and Data Sharing Policy* Genomics:GTL Program Appendix 1 Information and Data Sharing Policy* Genomics:GTL Program Office of Biological and Environmental Research Office of Science Department of Energy Appendix 1 Final Date: April 4, 2008 Introduction

More information

INTERNATIONAL CONFERENCE ON HARMONISATION OF TECHNICAL REQUIREMENTS FOR REGISTRATION OF PHARMACEUTICALS FOR HUMAN USE Q5B

INTERNATIONAL CONFERENCE ON HARMONISATION OF TECHNICAL REQUIREMENTS FOR REGISTRATION OF PHARMACEUTICALS FOR HUMAN USE Q5B INTERNATIONAL CONFERENCE ON HARMONISATION OF TECHNICAL REQUIREMENTS FOR REGISTRATION OF PHARMACEUTICALS FOR HUMAN USE ICH HARMONISED TRIPARTITE GUIDELINE QUALITY OF BIOTECHNOLOGICAL PRODUCTS: ANALYSIS

More information

IT 415 Information Visualization Spring Semester

IT 415 Information Visualization Spring Semester The Department of Applied Information Technology The Volgenau School of Information Technology & Engineering George Mason University 4400 University Drive Fairfax. VA 22030-4444 IT 415 Information Visualization

More information

Focusing on results not data comprehensive data analysis for targeted next generation sequencing

Focusing on results not data comprehensive data analysis for targeted next generation sequencing Focusing on results not data comprehensive data analysis for targeted next generation sequencing Daniel Swan, Jolyon Holdstock, Angela Matchan, Richard Stark, John Shovelton, Duarte Mohla and Simon Hughes

More information

How To Write A Blog Post On Globus

How To Write A Blog Post On Globus Globus Software as a Service data publication and discovery Kyle Chard, University of Chicago Computation Institute, chard@uchicago.edu Jim Pruyne, University of Chicago Computation Institute, pruyne@uchicago.edu

More information

SURFsara Data Services

SURFsara Data Services SURFsara Data Services SUPPORTING DATA-INTENSIVE SCIENCES Mark van de Sanden The world of the many Many different users (well organised (international) user communities, research groups, universities,

More information

Major US Genomic Medicine Programs: NHGRI s Electronic Medical Records and Genomics (emerge) Network

Major US Genomic Medicine Programs: NHGRI s Electronic Medical Records and Genomics (emerge) Network Major US Genomic Medicine Programs: NHGRI s Electronic Medical Records and Genomics (emerge) Network Dan Roden Member, National Advisory Council For Human Genome Research Genomic Medicine Working Group

More information

How To Ensure Health Information Is Protected

How To Ensure Health Information Is Protected pic pic CIHI Submission: 2011 Prescribed Entity Review October 2011 Who We Are Established in 1994, CIHI is an independent, not-for-profit corporation that provides essential information on Canada s health

More information

Overview. Overarching observations

Overview. Overarching observations Overview Genomics and Health Information Technology Systems: Exploring the Issues April 27-28, 2011, Bethesda, MD Brief Meeting Summary, prepared by Greg Feero, M.D., Ph.D. (planning committee chair) The

More information

Streamlining the drug development lifecycle with Adobe LiveCycle enterprise solutions

Streamlining the drug development lifecycle with Adobe LiveCycle enterprise solutions White paper Streamlining the drug development lifecycle with Adobe LiveCycle enterprise solutions Using intelligent PDF documents to optimize collaboration, data integrity, authentication, and reuse Table

More information

What s Next for Data Sharing: Insight from the NIH Experience

What s Next for Data Sharing: Insight from the NIH Experience What s Next for Data Sharing: Insight from the NIH Experience Jerry Sheehan Assistant Director for Policy Development National Library of Medicine National Institutes of Health SHARE In-Person Meeting

More information

NOW!! Registry and BioBank Services for! Your Organization/Company/Clinic/Project!

NOW!! Registry and BioBank Services for! Your Organization/Company/Clinic/Project! NOW!! Registry and BioBank Services for! Your Organization/Company/Clinic/Project! What Does Genetic Alliance Registry and BioBank Offer?! Flexible, customizable, registry and biobank options One-on-one

More information

ECRIN (European Clinical Research Infrastructures Network)

ECRIN (European Clinical Research Infrastructures Network) ECRIN (European Clinical Research Infrastructures Network) Wolfgang Kuchinke University of Duesseldorf (HHU) and ECRIN EUDAT 1st User Forum 7 March 2012 8 March 2012, Barcelona 1 What is ECRIN? European

More information

EMC DOCUMENTUM CONTENT ENABLED EMR Enhance the value of your EMR investment by accessing the complete patient record.

EMC DOCUMENTUM CONTENT ENABLED EMR Enhance the value of your EMR investment by accessing the complete patient record. EMC DOCUMENTUM CONTENT ENABLED EMR Enhance the value of your EMR investment by accessing the complete patient record. ESSENTIALS Provide access to records ingested from other systems Capture all content

More information

Version 21 Date: 14th September 2010 ETHICAL GOVERNANCE FRAMEWORK. Drafted by the Ethical Advisory Group of the UK10K project

Version 21 Date: 14th September 2010 ETHICAL GOVERNANCE FRAMEWORK. Drafted by the Ethical Advisory Group of the UK10K project Version 21 Date: 14th September 2010 ETHICAL GOVERNANCE FRAMEWORK Drafted by the Ethical Advisory Group of the UK10K project Table of Contents OVERVIEW... 3 REGULATORY APPROVALS... 5 INFORMED CONSENT...

More information

Trade Repository Service White Paper December 2013

Trade Repository Service White Paper December 2013 Trade Repository Service White Paper December 2013 Copyright IntercontinentalExchange, Inc. 2013. All Rights Reserved. Table of Contents DEFINITIONS... 3 EXECUTIVE SUMMARY... 5 OVERVIEW: TRADE REPOSITORIES...

More information

The Information Commissioner s Office response to HM Treasury s Call for Evidence on Data Sharing and Open Data in Banking

The Information Commissioner s Office response to HM Treasury s Call for Evidence on Data Sharing and Open Data in Banking The Information Commissioner s Office response to HM Treasury s Call for Evidence on Data Sharing and Open Data in Banking The Information Commissioner has responsibility for promoting and enforcing the

More information

Signature Requirements for the etmf

Signature Requirements for the etmf Wingspan Technology Signature Requirements for the etmf A Regulatory and Technological Assessment Kathie Clark Director, Product Management Wingspan Technology 1 November 2012 Signature Requirements for

More information

MANAGED FILE TRANSFER: 10 STEPS TO PCI DSS COMPLIANCE

MANAGED FILE TRANSFER: 10 STEPS TO PCI DSS COMPLIANCE WHITE PAPER MANAGED FILE TRANSFER: 10 STEPS TO PCI DSS COMPLIANCE 1. OVERVIEW Do you want to design a file transfer process that is secure? Or one that is compliant? Of course, the answer is both. But

More information

Integrated Rule-based Data Management System for Genome Sequencing Data

Integrated Rule-based Data Management System for Genome Sequencing Data Integrated Rule-based Data Management System for Genome Sequencing Data A Research Data Management (RDM) Green Shoots Pilots Project Report by Michael Mueller, Simon Burbidge, Steven Lawlor and Jorge Ferrer

More information

Comments of the EDPS in response to the public consultation on

Comments of the EDPS in response to the public consultation on Comments of the EDPS in response to the public consultation on the planned guidelines on recommended standard licences, datasets and charging for the reuse of public sector information initiated by the

More information

A Service for Data-Intensive Computations on Virtual Clusters

A Service for Data-Intensive Computations on Virtual Clusters A Service for Data-Intensive Computations on Virtual Clusters Executing Preservation Strategies at Scale Rainer Schmidt, Christian Sadilek, and Ross King rainer.schmidt@arcs.ac.at Planets Project Permanent

More information

MANAGED FILE TRANSFER: 10 STEPS TO HIPAA/HITECH COMPLIANCE

MANAGED FILE TRANSFER: 10 STEPS TO HIPAA/HITECH COMPLIANCE WHITE PAPER MANAGED FILE TRANSFER: 10 STEPS TO HIPAA/HITECH COMPLIANCE 1. OVERVIEW Do you want to design a file transfer process that is secure? Or one that is compliant? Of course, the answer is both.

More information

HL7 Clinical Genomics and Structured Documents Work Groups

HL7 Clinical Genomics and Structured Documents Work Groups HL7 Clinical Genomics and Structured Documents Work Groups CDA Implementation Guide: Genetic Testing Report (GTR) Amnon Shabo (Shvo), PhD shabo@il.ibm.com HL7 Clinical Genomics WG Co-chair and Modeling

More information

Document process management solutions for MiFID compliance

Document process management solutions for MiFID compliance Adobe Technical White Paper produced in conjunction with Equiduct Document process management solutions for MiFID compliance Adobe technology provides document process management solutions, enabling investment

More information

An Introduction to Genomics and SAS Scientific Discovery Solutions

An Introduction to Genomics and SAS Scientific Discovery Solutions An Introduction to Genomics and SAS Scientific Discovery Solutions Dr Karen M Miller Product Manager Bioinformatics SAS EMEA 16.06.03 Copyright 2003, SAS Institute Inc. All rights reserved. 1 Overview!

More information

Collaborative Computational Projects: Networking and Core Support

Collaborative Computational Projects: Networking and Core Support Collaborative Computational Projects: Networking and Core Support Call type: Invitation for proposals Closing date: 16:00 07 October 2014 Related themes: Engineering, ICT, Mathematical sciences, Physical

More information

Six Challenges for the Privacy and Security of Health Information. Carl A. Gunter University of Illinois

Six Challenges for the Privacy and Security of Health Information. Carl A. Gunter University of Illinois Six Challenges for the Privacy and Security of Health Information Carl A. Gunter University of Illinois The Six Challenges 1. Access controls and audit 2. Encryption and trusted base 3. Automated policy

More information

escience and Post-Genome Biomedical Research

escience and Post-Genome Biomedical Research escience and Post-Genome Biomedical Research Thomas L. Casavant, Adam P. DeLuca Departments of Biomedical Engineering, Electrical Engineering and Ophthalmology Coordinated Laboratory for Computational

More information

SeqScape Software Version 2.5 Comprehensive Analysis Solution for Resequencing Applications

SeqScape Software Version 2.5 Comprehensive Analysis Solution for Resequencing Applications Product Bulletin Sequencing Software SeqScape Software Version 2.5 Comprehensive Analysis Solution for Resequencing Applications Comprehensive reference sequence handling Helps interpret the role of each

More information

Enhancing Functionality of EHRs for Genomic Research, Including E- Phenotying, Integrating Genomic Data, Transportable CDS, Privacy Threats

Enhancing Functionality of EHRs for Genomic Research, Including E- Phenotying, Integrating Genomic Data, Transportable CDS, Privacy Threats Enhancing Functionality of EHRs for Genomic Research, Including E- Phenotying, Integrating Genomic Data, Transportable CDS, Privacy Threats Genomic Medicine 8 meeting Alexa McCray Christopher G Chute Rex

More information

Integration of Genetic and Familial Data into. Electronic Medical Records and Healthcare Processes

Integration of Genetic and Familial Data into. Electronic Medical Records and Healthcare Processes Integration of Genetic and Familial Data into Electronic Medical Records and Healthcare Processes By Thomas Kmiecik and Dale Sanders February 2, 2009 Introduction Although our health is certainly impacted

More information

TRANSFoRm: Vision of a learning healthcare system

TRANSFoRm: Vision of a learning healthcare system TRANSFoRm: Vision of a learning healthcare system Vasa Curcin, Imperial College London Theo Arvanitis, University of Birmingham Derek Corrigan, Royal College of Surgeons Ireland TRANSFoRm is partially

More information

IEEE International Conference on Computing, Analytics and Security Trends CAST-2016 (19 21 December, 2016) Call for Paper

IEEE International Conference on Computing, Analytics and Security Trends CAST-2016 (19 21 December, 2016) Call for Paper IEEE International Conference on Computing, Analytics and Security Trends CAST-2016 (19 21 December, 2016) Call for Paper CAST-2015 provides an opportunity for researchers, academicians, scientists and

More information

The i2b2 Hive and the Clinical Research Chart

The i2b2 Hive and the Clinical Research Chart The i2b2 Hive and the Clinical Research Chart Henry Chueh Shawn Murphy The i2b2 Hive is centered around two concepts. The first concept is the existence of services provided by applications that are wrapped

More information

Security architecture and framework Design and pilot implementation

Security architecture and framework Design and pilot implementation WP 5 Work Package Meeting Security architecture and framework Design and pilot implementation 3 rd AGM in Munich, 17 February 2015 TUM: Raffael Bild, Florian Kohlmayer, Helmut Spengler EBI: Olga Melnichuk,

More information

MOOCdb: Developing Data Standards for MOOC Data Science

MOOCdb: Developing Data Standards for MOOC Data Science MOOCdb: Developing Data Standards for MOOC Data Science Kalyan Veeramachaneni, Franck Dernoncourt, Colin Taylor, Zachary Pardos, and Una-May O Reilly Massachusetts Institute of Technology, USA. {kalyan,francky,colin

More information

Release of Data from EORTC Studies for Use in External Research Projects

Release of Data from EORTC Studies for Use in External Research Projects Release of Data from Studies for Use in External Research Projects POL008 Version 2.03 ALWAYS REFER TO THE INTERNET WEBSITE TO CHECK THE VALIDITY OF THIS DOCUMENT Author: Associate Head of Statistics Department

More information

Delivering the power of the world s most successful genomics platform

Delivering the power of the world s most successful genomics platform Delivering the power of the world s most successful genomics platform NextCODE Health is bringing the full power of the world s largest and most successful genomics platform to everyday clinical care NextCODE

More information

INVESTRAN DATA EXCHANGE

INVESTRAN DATA EXCHANGE INVESTRAN DATA EXCHANGE INVESTRAN DATA EXCHANGE AUTOMATING DATA CAPTURE AND REPORT DISTRIBUTION FOR ALTERNATIVE INVESTMENTS In today s alternative investment world, transparency of data is key, and the

More information

Data controllers and data processors: what the difference is and what the governance implications are

Data controllers and data processors: what the difference is and what the governance implications are ICO lo : what the difference is and what the governance implications are Data Protection Act Contents Introduction... 3 Overview... 3 Section 1 - What is the difference between a data controller and a

More information

Title Draft Pan-Canadian Primary Health Care Electronic Medical Record Content Standard, Version 2.0 Data Extract Specifi cation Business View

Title Draft Pan-Canadian Primary Health Care Electronic Medical Record Content Standard, Version 2.0 Data Extract Specifi cation Business View pic Title Draft Pan-Canadian Primary Health Care Electronic Medical Record Content Standard, Version 2.0 Data Extract Specifi cation Business View Primary Health Care Who We Are Established in 1994, CIHI

More information

Protective Marking for UK Government

Protective Marking for UK Government Protective Marking for UK Government WHITE PAPER Contents Introduction 3 Regulatory Requirements 3 Government Protective Marking System (GPMS) 3 The Value Beyond Regulatory Requirements 4 Leveraging Other

More information

Remote Data Extraction Policy and Procedure

Remote Data Extraction Policy and Procedure Remote Data Extraction Policy and Procedure Prepared by PRIMIS June 2015 The University of Nottingham. All rights reserved. Contents 1. Introduction... 3 2. Purpose and scope... 3 3. Policy Statement...

More information

BIOINFORMATICS Supporting competencies for the pharma industry

BIOINFORMATICS Supporting competencies for the pharma industry BIOINFORMATICS Supporting competencies for the pharma industry ABOUT QFAB QFAB is a bioinformatics service provider based in Brisbane, Australia operating nationwide and internationally. QFAB was established

More information

Enterprise Information Management Services Managing Your Company Data Along Its Lifecycle

Enterprise Information Management Services Managing Your Company Data Along Its Lifecycle SAP Solution in Detail SAP Services Enterprise Information Management Enterprise Information Management Services Managing Your Company Data Along Its Lifecycle Table of Contents 3 Quick Facts 4 Key Services

More information

Check Your Data Freedom: A Taxonomy to Assess Life Science Database Openness

Check Your Data Freedom: A Taxonomy to Assess Life Science Database Openness Check Your Data Freedom: A Taxonomy to Assess Life Science Database Openness Melanie Dulong de Rosnay Fellow, Science Commons and Berkman Center for Internet & Society at Harvard University This article

More information

The New EU Clinical Trials Regulation: The Good, the Bad, the Ugly

The New EU Clinical Trials Regulation: The Good, the Bad, the Ugly A Full-Service International CRO The New EU Clinical Trials Regulation: The Good, the Bad, the Ugly Dr. Martine Dehlinger-Kremer Vice President, Global Medical and Regulatory Affairs The original intent

More information

Big Data for Population Health

Big Data for Population Health Big Data for Population Health Prof Martin Landray Nuffield Department of Population Health Deputy Director, Big Data Institute, Li Ka Shing Centre for Health Information and Discovery University of Oxford

More information

Knowledgent White Paper Series. Developing an MDM Strategy WHITE PAPER. Key Components for Success

Knowledgent White Paper Series. Developing an MDM Strategy WHITE PAPER. Key Components for Success Developing an MDM Strategy Key Components for Success WHITE PAPER Table of Contents Introduction... 2 Process Considerations... 3 Architecture Considerations... 5 Conclusion... 9 About Knowledgent... 10

More information

Test Data Management Concepts

Test Data Management Concepts Test Data Management Concepts BIZDATAX IS AN EKOBIT BRAND Executive Summary Test Data Management (TDM), as a part of the quality assurance (QA) process is more than ever in the focus among IT organizations

More information

Clinical Knowledge Manager. Product Description 2012 MAKING HEALTH COMPUTE

Clinical Knowledge Manager. Product Description 2012 MAKING HEALTH COMPUTE Clinical Knowledge Manager Product Description 2012 MAKING HEALTH COMPUTE Cofounder and major sponsor Member and official submitter for HL7/OMG HSSP RLUS, EIS 'openehr' is a registered trademark of the

More information

The Big Data Bioinformatics System

The Big Data Bioinformatics System The Big Data Bioinformatics System Introduction The purpose of this document is to describe a fictitious bioinformatics research system based in part on the design and implementation of a similar system

More information

Towards Integrating the Detection of Genetic Variants into an In-Memory Database

Towards Integrating the Detection of Genetic Variants into an In-Memory Database Towards Integrating the Detection of Genetic Variants into an 2nd International Workshop on Big Data in Bioinformatics and Healthcare Oct 27, 2014 Motivation Genome Data Analysis Process DNA Sample Base

More information

i2b2 Clinical Research Chart

i2b2 Clinical Research Chart i2b2 Clinical Research Chart Shawn Murphy MD, Ph.D. Griffin Weber MD, Ph.D. Michael Mendis Vivian Gainer MS Lori Phillips MS Rajesh Kuttan Wensong Pan MS Henry Chueh MD Susanne Churchill Ph.D. John Glaser

More information

Euro-BioImaging European Research Infrastructure for Imaging Technologies in Biological and Biomedical Sciences

Euro-BioImaging European Research Infrastructure for Imaging Technologies in Biological and Biomedical Sciences Euro-BioImaging European Research Infrastructure for Imaging Technologies in Biological and Biomedical Sciences WP11 Data Storage and Analysis Task 11.1 Coordination Deliverable 11.3 Selected Standards

More information

Document Change Control

Document Change Control Document Change Control for Quality & Regulatory Compliance Quality and Compliance Solutions Document Control Software for Microsoft Windows Document Change Control Improve efficiency and enforce consistency

More information

Big Data in BioMedical Sciences. Steven Newhouse, Head of Technical Services, EMBL-EBI

Big Data in BioMedical Sciences. Steven Newhouse, Head of Technical Services, EMBL-EBI Big Data in BioMedical Sciences Steven Newhouse, Head of Technical Services, EMBL-EBI Big Data for BioMedical Sciences EMBL-EBI: What we do and why? Challenges & Opportunities Infrastructure Requirements

More information

RMS. Privacy Policy for RMS Hosting Plus and RMS(one) Guiding Principles

RMS. Privacy Policy for RMS Hosting Plus and RMS(one) Guiding Principles RMS Privacy Policy for RMS Hosting Plus and RMS(one) Guiding Principles RMS Privacy Policy for RMS Hosting Plus and RMS(one) Guiding Principles RMS aims to provide the most secure, the most private, and

More information

Big Data Challenges. technology basics for data scientists. Spring - 2014. Jordi Torres, UPC - BSC www.jorditorres.

Big Data Challenges. technology basics for data scientists. Spring - 2014. Jordi Torres, UPC - BSC www.jorditorres. Big Data Challenges technology basics for data scientists Spring - 2014 Jordi Torres, UPC - BSC www.jorditorres.eu @JordiTorresBCN Data Deluge: Due to the changes in big data generation Example: Biomedicine

More information

www.xarios.com Xarios EMEA Xarios Asia / Pacific Xarios North America

www.xarios.com Xarios EMEA Xarios Asia / Pacific Xarios North America Xarios EMEA Unit M1, Cody Court Kansas Avenue Salford Quays Manchester. M50 2GE United Kingdom Telephone: (+44) 845 373 6880 Facsimile: (+44) 845 373 6881 Email: sales@xarios.com Web: www.xarios.com Xarios

More information

Deliverable 7.3.1 First report on sample storage, DNA extraction and sample analysis processes

Deliverable 7.3.1 First report on sample storage, DNA extraction and sample analysis processes Model Driven Paediatric European Digital Repository Call identifier: FP7-ICT-2011-9 - Grant agreement no: 600932 Thematic Priority: ICT - ICT-2011.5.2: Virtual Physiological Human Deliverable 7.3.1 First

More information

Harmonized Use Case for Electronic Health Records (Laboratory Result Reporting) March 19, 2006

Harmonized Use Case for Electronic Health Records (Laboratory Result Reporting) March 19, 2006 Harmonized Use Case for Electronic Health Records (Laboratory Result Reporting) March 19, 2006 Office of the National Coordinator for Health Information Technology (ONC) Table of Contents American Health

More information

ANSYS EKM Overview. What is EKM?

ANSYS EKM Overview. What is EKM? ANSYS EKM Overview What is EKM? ANSYS EKM is a simulation process and data management (SPDM) software system that allows engineers at all levels of an organization to effectively manage the data and processes

More information

INTERNATIONAL PHARMACEUTICAL PRIVACY CONSORTIUM COMMENTS IN RESPONSE TO THE CALL FOR EVIDENCE ON EU DATA PROTECTION PROPOSALS

INTERNATIONAL PHARMACEUTICAL PRIVACY CONSORTIUM COMMENTS IN RESPONSE TO THE CALL FOR EVIDENCE ON EU DATA PROTECTION PROPOSALS INTERNATIONAL PHARMACEUTICAL PRIVACY CONSORTIUM COMMENTS IN RESPONSE TO THE CALL FOR EVIDENCE ON EU DATA PROTECTION PROPOSALS I. INTRODUCTION The International Pharmaceutical Privacy Consortium (IPPC)

More information

European Medicines Agency

European Medicines Agency European Medicines Agency July 1996 CPMP/ICH/139/95 ICH Topic Q 5 B Quality of Biotechnological Products: Analysis of the Expression Construct in Cell Lines Used for Production of r-dna Derived Protein

More information

Digital Pathways. Harlow Enterprise Hub, Edinburgh Way, Harlow CM20 2NQ. 0844 586 0040 intouch@digitalpathways.co.uk www.digpath.co.

Digital Pathways. Harlow Enterprise Hub, Edinburgh Way, Harlow CM20 2NQ. 0844 586 0040 intouch@digitalpathways.co.uk www.digpath.co. Harlow Enterprise Hub, Edinburgh Way, Harlow CM20 2NQ 0844 586 0040 intouch@digitalpathways.co.uk Security Services Menu has a full range of Security Services, some of which are also offered as a fully

More information

A Guide to Horizon 2020 Funding for the Creative Industries

A Guide to Horizon 2020 Funding for the Creative Industries A Guide to Horizon 2020 Funding for the Creative Industries October 2014 Introduction This document is provided as a short guide to help you submit a proposal for the Horizon 2020 funding programme (H2020).

More information