European Genome-phenome Archive database of human data consented for use in biomedical research at the European Bioinformatics Institute
|
|
- Brandon Cross
- 8 years ago
- Views:
Transcription
1 European Genome-phenome Archive database of human data consented for use in biomedical research at the European Bioinformatics Institute Justin Paschall Team Leader Genetic Variation / EGA
2 ! European Genome-phenome Archive! European Variation Archive (EVA)! Clinical Genetics variation databases, data sharing and data discovery 2
3 Most Important Slide 3
4 4
5 Mission of the EGA! Enable collaboration and data sharing of individual patient-level genomic and phenotype data through a controlled-access system. Datasets in scope for EGA are those which are consented for research sharing but cannot be made fully publically available 5
6 What is controlled-access? User discovers a dataset and requests access through an application sent to Data Access Committee (DAC) which governs the study. Request is reviewed by DAC and approved Data is provided by the EGA to user through a secure encrypted channel 6
7 European Genome-phenome Archive (EGA)! Distributed Access Control model! Multiple, study specific Data Access Committees (DACs) Data release policy data access application and data access agreement Each DAC supports tailored data access application process all data users should be named in the application.! EGA supports only data access decisions that are based on the original informed consent! The EGA does not make access decisions, it implements access decisions made by the DACs that manage the data within EGA! Authorized users have personal accounts in our system Access to the data requires account password Data decryption requires a separate key that must be requested and is sent off line
8 Why was the EGA created?! The supporting personally identifiable data may be limited for many reasons National laws Consent agreements Ethical considerations! Data size The 2007 Wellcome Trust Case Control Consortium raw data files are nearly 2 terabytes The 2013 UK10K raw data files will be approximately 200 terabytes Funders, journals, and researchers are reluctant to support broad or mandated data availability without appropriate technical solutions
9 EGA Data scope! Primary archive for any data consented for sharing in the context of research but not for fully public distribution! Raw data from DNA sequencing and array-based genotyping applications, e.g. gene expression experiments, transcriptomics, epigenomics, sequencing or proteomics assays.! Processed datatypes such as genotypes, structural variations or whole genome sequence with any values associated with these calls.! Phenotypic data collected from the subjects and consented for research purpose.! All data must be de-identified and in accordance with the informed consent.! Used for ICGC, IHEC, IHMC, UK10K, DDD and other projects
10 Overview of EGA content! More than 450 studies available for user requests, each ranging from hundreds of thousands of subjects, 4126 users, 400 requests a month.! BAM files, FASTQ, increasing numbers of VCF. 30 TB/month submission! Methods: High throughput array-based genotyping, next gen exome and whole genome, RNA-seq, methylation.! Research domains: Population genetics quantitative traits (WTCCC/ Uk10K), cancer genetics (ICGC/CRUK), rare disease / developmental (Decipher DDD), molecular phenotype (eqtl)! Study meta-data exchange with NCBI, soon more than 1000 studies combined discoverable 10
11 11 UK10K study x coverage whole genome control samples. Exome sequencing of 6000 samples with extreme phenotypes. Rapid data release.
12 12 Finding Datasets in EGA - browse by Data Provider, Study, or Dataset search by accession
13 13 Example of Making a data request - start with UK10K data provider table
14 14
15 15
16 16
17 17
18 18
19 19
20 Data access page (after logging in)
21 For detailed step-by-step instructions for requesting or submitting data see the EGA website.
22 22 WTCCC genome wide case-control consortium.
23 23 Rare undiagnosed syndrome exome sequencing
24 24 Cancer Chronic Lymphocytic Leukemia
25 EGA Data Growth EGA data growth / / / / / / / / / / / / / / / / / / / / / / / / / / /01! EGA hosts more than 450 studies and discoverability to the 732 that are in both EGA and dbgap! EGA supports more than 400 user requests per month
26 Future of EGA and ELIXIR! The EGA is a core data resource of the ELIXIR infrastructure and we are piloting developments in collaboration with ELIXIR partners in Finland and Spain Distributed trust authentication system CSC IT Center for Science Institute for Molecular Medicine Finland (FIMM) Establishment of a peer EGA node Centre for Regulatory Genomics Barcelona! Cloud computing! Both FIMM and the CRG are EMBL partner institutes
27 Acknowledgements People! EGA Ilkka Lappalainen, Vasudev Kumanduri, Jeff Almeida-King, Saif Ur- Rehman, Jagasree Kanda! The EBI Variation Team Dylan Splading, Shyamasree Saham Lisa Skipper! The Ensembl Team Funding European Commission Framework Programme 7
28 Submission guide 28
29 Additional EBI Resources for Variation! Ensembl variation data resources Integrated variation annotation source Multi-species Ensembl Variant Effect Predictor (VEP)! Locus Reference Genomic (LRG) Standard Sequences Reference sequences for reporting and describing genetic variation Clinical diagnostic laboratories and locus specific databases! DGVa (Database of Genomics Variants archive) Structural and copy number variation All species, fully open data Peer database of NCBI s dbvar! European Variation Archive
30 Summary! EGA is a technical solution for secure data storage, management and dissemination. EGA serves large and small data submitters world wide! EGA has a distributed access model and implements decisions made by outside data access committees! Large reference datasets are required to support future biomedical research and personalized medicine EBI provides a number of these reference databases Through ELIXIR some future activities with be integrated and organised across Europe providing worldwide resources Access to the data Data annotation Data integration
31 Four areas of focus Patients - Respect the terms of informed consent, reduce the exposure of identifiable data, provide the tracking and accountability of data use. Make best of use of data to enable medicine Submitting Investigators - Respect the goals and publication timelines, as well as increase the visibility and impact of work of submitting investigators Biomedical Researcher Users Enable science through data sharing: methods development, increases sample size, expand net for rare variants, meta-analysis of raw data. Funders/Consortia - Enable technically feasible, scable, user friendly, and cost efficient solutions to long term archiving and sharing to extract maximum value from these dataset which have been produced through significant investment 31
Work Package 13.5: Authors: Paul Flicek and Ilkka Lappalainen. 1. Introduction
Work Package 13.5: Report summarising the technical feasibility of the European Genotype Archive to collect, store, and use genotype data stored in European biobanks in a manner that complies with all
More informationGlobal Alliance. Ewan Birney Associate Director EMBL-EBI
Global Alliance Ewan Birney Associate Director EMBL-EBI Our world is changing Research to Medical Research English as language Lightweight legal Identical/similar systems Open data Publications Grant-funding
More informationWorkshop on Establishing a Central Resource of Data from Genome Sequencing Projects
Report on the Workshop on Establishing a Central Resource of Data from Genome Sequencing Projects Background and Goals of the Workshop June 5 6, 2012 The use of genome sequencing in human research is growing
More informationImportance of Statistics in creating high dimensional data
Importance of Statistics in creating high dimensional data Hemant K. Tiwari, PhD Section on Statistical Genetics Department of Biostatistics University of Alabama at Birmingham History of Genomic Data
More informationDelivering the power of the world s most successful genomics platform
Delivering the power of the world s most successful genomics platform NextCODE Health is bringing the full power of the world s largest and most successful genomics platform to everyday clinical care NextCODE
More informationLeading Genomics. Diagnostic. Discove. Collab. harma. Shanghai Cambridge, MA Reykjavik
Leading Genomics Diagnostic harma Discove Collab Shanghai Cambridge, MA Reykjavik Global leadership for using the genome to create better medicine WuXi NextCODE provides a uniquely proven and integrated
More informationBIOINF 525 Winter 2016 Foundations of Bioinformatics and Systems Biology http://tinyurl.com/bioinf525-w16
Course Director: Dr. Barry Grant (DCM&B, bjgrant@med.umich.edu) Description: This is a three module course covering (1) Foundations of Bioinformatics, (2) Statistics in Bioinformatics, and (3) Systems
More informationThe National Institute of Genomic Medicine (INMEGEN) was
Genome is...... the complete set of genetic information contained within all of the chromosomes of an organism. It defines the particular phenotype of an individual. What is Genomics? The study of the
More informationThe 100,000 genomes project
The 100,000 genomes project Tim Hubbard @timjph Genomics England King s College London, King s Health Partners Wellcome Trust Sanger Institute ClinGen / Decipher Washington DC, 26 th May 2015 The 100,000
More informationENABLING DATA TRANSFER MANAGEMENT AND SHARING IN THE ERA OF GENOMIC MEDICINE. October 2013
ENABLING DATA TRANSFER MANAGEMENT AND SHARING IN THE ERA OF GENOMIC MEDICINE October 2013 Introduction As sequencing technologies continue to evolve and genomic data makes its way into clinical use and
More informationNIH s Genomic Data Sharing Policy
NIH s Genomic Data Sharing Policy 2 Benefits of Data Sharing Enables data generated from one study to be used to explore a wide range of additional research questions Increases statistical power and scientific
More informationSharing Data from Large-scale Biological Research Projects: A System of Tripartite Responsibility
Sharing Data from Large-scale Biological Research Projects: A System of Tripartite Responsibility Report of a meeting organized by the Wellcome Trust and held on 14 15 January 2003 at Fort Lauderdale,
More informationSEQUENCING INITIATIVE SUOMI (SISU) SYMPOSIUM SPEAKERS August 26, 2014
SEQUENCING INITIATIVE SUOMI (SISU) SYMPOSIUM SPEAKERS August 26, 2014 Professor Aarno Palotie Professor Aarno Palotie is the Research Director of the Human Genomics program at the Finnish Institute for
More informationEuro-BioImaging European Research Infrastructure for Imaging Technologies in Biological and Biomedical Sciences
Euro-BioImaging European Research Infrastructure for Imaging Technologies in Biological and Biomedical Sciences WP11 Data Storage and Analysis Task 11.1 Coordination Deliverable 11.2 Community Needs of
More informationPreparing the scenario for the use of patient s genome sequences in clinic. Joaquín Dopazo
Preparing the scenario for the use of patient s genome sequences in clinic Joaquín Dopazo Computational Medicine Institute, Centro de Investigación Príncipe Felipe (CIPF), Functional Genomics Node, (INB),
More informationA Primer of Genome Science THIRD
A Primer of Genome Science THIRD EDITION GREG GIBSON-SPENCER V. MUSE North Carolina State University Sinauer Associates, Inc. Publishers Sunderland, Massachusetts USA Contents Preface xi 1 Genome Projects:
More informationThree data delivery cases for EMBL- EBI s Embassy. Guy Cochrane www.ebi.ac.uk
Three data delivery cases for EMBL- EBI s Embassy Guy Cochrane www.ebi.ac.uk EMBL European Bioinformatics Institute Genes, genomes & variation European Nucleotide Archive 1000 Genomes Ensembl Ensembl Genomes
More informationComputational Requirements
Workshop on Establishing a Central Resource of Data from Genome Sequencing Projects Computational Requirements Steve Sherry, Lisa Brooks, Paul Flicek, Anton Nekrutenko, Kenna Shaw, Heidi Sofia High-density
More informationBig Data in BioMedical Sciences. Steven Newhouse, Head of Technical Services, EMBL-EBI
Big Data in BioMedical Sciences Steven Newhouse, Head of Technical Services, EMBL-EBI Big Data for BioMedical Sciences EMBL-EBI: What we do and why? Challenges & Opportunities Infrastructure Requirements
More informationKeystones for supporting collaborative research using multiple data sets in the medical and bio-sciences
Keystones for supporting collaborative research using multiple data sets in the medical and bio-sciences David Fergusson Head of Scientific Computing The Francis Crick Institute The Francis Crick Institute
More informationVersion 21 Date: 14th September 2010 ETHICAL GOVERNANCE FRAMEWORK. Drafted by the Ethical Advisory Group of the UK10K project
Version 21 Date: 14th September 2010 ETHICAL GOVERNANCE FRAMEWORK Drafted by the Ethical Advisory Group of the UK10K project Table of Contents OVERVIEW... 3 REGULATORY APPROVALS... 5 INFORMED CONSENT...
More informationSchool of Nursing. Presented by Yvette Conley, PhD
Presented by Yvette Conley, PhD What we will cover during this webcast: Briefly discuss the approaches introduced in the paper: Genome Sequencing Genome Wide Association Studies Epigenomics Gene Expression
More informationDiscovery and Quantification of RNA with RNASeq Roderic Guigó Serra Centre de Regulació Genòmica (CRG) roderic.guigo@crg.cat
Bioinformatique et Séquençage Haut Débit, Discovery and Quantification of RNA with RNASeq Roderic Guigó Serra Centre de Regulació Genòmica (CRG) roderic.guigo@crg.cat 1 RNA Transcription to RNA and subsequent
More information6 ELIXIR Domain Specific Services
6 ELIXIR Domain Specific Services Work stream leads: Alfonso Valencia (ES), Inge Jonassen (NO), Jose Leal (PT) Work stream members: Nils-Peder Willassen (NO), Finn Drablos (NO), Mark Viant (UK), Ferran
More informationGenetic diagnostics the gateway to personalized medicine
Micronova 20.11.2012 Genetic diagnostics the gateway to personalized medicine Kristiina Assoc. professor, Director of Genetic Department HUSLAB, Helsinki University Central Hospital The Human Genome Packed
More informationUsing the Grid for the interactive workflow management in biomedicine. Andrea Schenone BIOLAB DIST University of Genova
Using the Grid for the interactive workflow management in biomedicine Andrea Schenone BIOLAB DIST University of Genova overview background requirements solution case study results background A multilevel
More informationGenomic medicine in Australia. Professor Warwick Anderson Chief Executive Officer National Health and Medical Research Council
Genomic medicine in Australia Professor Warwick Anderson Chief Executive Officer National Health and Medical Research Council This presentation 1. NHMRC s role funding research and translation 2. Genetic/genomic
More informationRETRIEVING SEQUENCE INFORMATION. Nucleotide sequence databases. Database search. Sequence alignment and comparison
RETRIEVING SEQUENCE INFORMATION Nucleotide sequence databases Database search Sequence alignment and comparison Biological sequence databases Originally just a storage place for sequences. Currently the
More informationWorldwide Collaborations in Molecular Profiling
Worldwide Collaborations in Molecular Profiling Lillian L. Siu, MD Director, Phase I Program and Cancer Genomics Program Princess Margaret Cancer Centre Lillian Siu, MD Contracted Research: Novartis, Pfizer,
More informationClinical Genomics at Scale: Synthesizing and Analyzing Big Data From Thousands of Patients
Clinical Genomics at Scale: Synthesizing and Analyzing Big Data From Thousands of Patients Brandy Bernard PhD Senior Research Scientist Institute for Systems Biology Seattle, WA Dr. Bernard s research
More informationEnabling a federated environment to support biomedical research. Gianmauro Cuccuru CRS4
Enabling a federated environment to support biomedical research Gianmauro Cuccuru CRS4 ELIXIR connects national bioinformatics centres and EMBL- EBI into a sustainable European infrastructure for biological
More informationIntegrated Rule-based Data Management System for Genome Sequencing Data
Integrated Rule-based Data Management System for Genome Sequencing Data A Research Data Management (RDM) Green Shoots Pilots Project Report by Michael Mueller, Simon Burbidge, Steven Lawlor and Jorge Ferrer
More informationAn Introduction to Genomics and SAS Scientific Discovery Solutions
An Introduction to Genomics and SAS Scientific Discovery Solutions Dr Karen M Miller Product Manager Bioinformatics SAS EMEA 16.06.03 Copyright 2003, SAS Institute Inc. All rights reserved. 1 Overview!
More informationData integration and modelling in health sciences Science as a conversation across borders
Open data key to the future Helsinki 2011-11-01 Data integration and modelling in health sciences Science as a conversation across borders Juni Palmgren Karolinska Institutet and FIMM, Helsinki University
More informationEnhancing Functionality of EHRs for Genomic Research, Including E- Phenotying, Integrating Genomic Data, Transportable CDS, Privacy Threats
Enhancing Functionality of EHRs for Genomic Research, Including E- Phenotying, Integrating Genomic Data, Transportable CDS, Privacy Threats Genomic Medicine 8 meeting Alexa McCray Christopher G Chute Rex
More informationSubmission Schedule for Descriptive/Raw Data
Research Domain Criteria Database (RDoCdb) Data Sharing Policy 8/7/2014 i. Data Sharing Overview All de-identified data resulting from this NIH-funded research involving human subjects are expected to
More informationNIH Genomic Data Sharing (GDS) Policy Guidance Memo #2 1
MEMORANDUM TO: Principal Investigators and Research Staff DATE: 2/22/15 FROM: Anne Klibanski, MD, Partners Chief Academic Officer (CAO) Paul Anderson, MD, PhD, BWH CAO Harry Orf, PhD, MGH Sr. Vice President-Research
More informationHigh Performance Compu2ng Facility
High Performance Compu2ng Facility Center for Health Informa2cs and Bioinforma2cs Accelera2ng Scien2fic Discovery and Innova2on in Biomedical Research at NYULMC through Advanced Compu2ng Efstra'os Efstathiadis,
More informationModule 1. Sequence Formats and Retrieval. Charles Steward
The Open Door Workshop Module 1 Sequence Formats and Retrieval Charles Steward 1 Aims Acquaint you with different file formats and associated annotations. Introduce different nucleotide and protein databases.
More informationBig Data in BioMedical Sciences. Steven Newhouse, Head of Technical Services, EMBL-EBI
Big Data in BioMedical Sciences Steven Newhouse, Head of Technical Services, EMBL-EBI Big Data for BioMedical Sciences EMBL-EBI: What we do and why? Challenges & Opportunities Infrastructure Requirements
More informationDisease gene identification with exome sequencing
Disease gene identification with exome sequencing Christian Gilissen Dept. of Human Genetics Radboud University Nijmegen Medical Centre c.gilissen@antrg.umcn.nl Contents Infrastructure Exome sequencing
More informationNew solutions for Big Data Analysis and Visualization
New solutions for Big Data Analysis and Visualization From HPC to cloud-based solutions Barcelona, February 2013 Nacho Medina imedina@cipf.es http://bioinfo.cipf.es/imedina Head of the Computational Biology
More informationEmbargoed until 14:30 CEST European time, 13:30 BST UK, 8:30 Eastern US summer time Contacts:
Embargoed until 14:30 CEST European time, 13:30 BST UK, 8:30 Eastern US summer time Contacts: Louisa Wood or Katrina Pavelin, EMBL EBI louisa@ebi.ac.uk katrina@ebi.ac.uk +44 (0)1223 494665 Sonia Furtado,
More informationLarge-scale Research Data Management and Analysis Using Globus Services. Ravi Madduri Argonne National Lab University of Chicago @madduri
Large-scale Research Data Management and Analysis Using Globus Services Ravi Madduri Argonne National Lab University of Chicago @madduri Outline Who we are Challenges in Big Data Management and Analysis
More informationReport of the DTL focus meeting on Life Science Data Repositories
Report of the DTL focus meeting on Life Science Data Repositories Goal The goal of the meeting was to inform and discuss research data repositories for life sciences. The big data era adds to the complexity
More informationProfessor Dame Janet Thornton Director T + 44 (0)1223 494648 F + 44 (0)1223 494496 thornton@ebi.ac.uk
Professor Dame Sally Davies Chief Medical Officer Department of Health Richmond House, Room 123b 79 Whitehall London SW1A 2NS Professor Dame Janet Thornton Director T + 44 (0)1223 494648 F + 44 (0)1223
More informationITT Advanced Medical Technologies - A Programmer's Overview
ITT Advanced Medical Technologies (Ileri Tip Teknolojileri) ITT Advanced Medical Technologies (Ileri Tip Teknolojileri) is a biotechnology company (SME) established in Turkey. Its activity area is research,
More informationData Sharing Initiative: International Cancer Genome Consortium
Data Sharing Initiative: International Cancer Genome Consortium Tom Hudson, MD President and Scientific Director Ontario Institute for Cancer Research 1 Sharing Data Sharing BIG Genome Initiative: DATA
More informationTECHNOLOGIES, PRODUCTS & SERVICES for MOLECULAR DIAGNOSTICS, MDx ABA 298
DIAGNOSTICS BUSINESS ANALYSIS SERIES: TECHNOLOGIES, PRODUCTS & SERVICES for MOLECULAR DIAGNOSTICS, MDx ABA 298 By ADAMS BUSINESS ASSOCIATES MAY 2014. May 2014 ABA 298 1 Technologies, Products & Services
More informationAnalysis and Integration of Big Data from Next-Generation Genomics, Epigenomics, and Transcriptomics
Analysis and Integration of Big Data from Next-Generation Genomics, Epigenomics, and Transcriptomics Christopher Benner, PhD Director, Integrative Genomics and Bioinformatics Core (IGC) idash Webinar,
More informationDirectorate Medical Operations Patients and Information Nursing Policy Commissioning Development
100,000 Genome Project NHS Genomic Medicine Centre Selection Prospectus July 2014 NHS England INFORMATION READER BOX Directorate Medical Operations Patients and Information Nursing Policy Commissioning
More informationCase Study Life Sciences Data
Case Study Life Sciences Data Centre for Integrative Systems Biology and Bioinformatics www.imperial.ac.uk/bioinfsupport Sarah Butcher s.butcher@imperial.ac.uk www.imperial.ac.uk/bioinfsupport Bio-data
More informationAttacking the Biobank Bottleneck
Attacking the Biobank Bottleneck Professor Jan-Eric Litton BBMRI-ERIC BBMRI-ERIC Big Data meets research biobanking Big data is high-volume, high-velocity and highvariety information assets that demand
More informationTowards the construction of an integrated Wheat Information System
Towards the construction of an integrated Wheat Information System Mario Caccamo 1, Hadi Quesneville 2 Report- June 2012 1. The Genome Analysis Centre (TGAC), Norwich Research Park, Norwich, UK 2. INRA,
More informationThe NeurOmics team at a recent project meeting
Introduction Welcome to the NeurOmics project newsletter. This is the second edition and comes after the project has been underway for just over a year. This means that whilst we still have lots of work
More informationUsing the Bionimbus Protected Data Cloud (PDC): Obtaining Access Credentials FAQ
Using the Bionimbus Protected Data Cloud (PDC): Obtaining Access Credentials FAQ It s very important that a PDC user is the only one who logs in with an account. If you have members of your lab that would
More informationWhat s Next for Data Sharing: Insight from the NIH Experience
What s Next for Data Sharing: Insight from the NIH Experience Jerry Sheehan Assistant Director for Policy Development National Library of Medicine National Institutes of Health SHARE In-Person Meeting
More informationIntegration of genomic data into electronic health records
Integration of genomic data into electronic health records Daniel Masys, MD Affiliate Professor Biomedical & Health Informatics University of Washington, Seattle Major portion of today s lecture is based
More informationNazneen Aziz, PhD. Director, Molecular Medicine Transformation Program Office
2013 Laboratory Accreditation Program Audioconferences and Webinars Implementing Next Generation Sequencing (NGS) as a Clinical Tool in the Laboratory Nazneen Aziz, PhD Director, Molecular Medicine Transformation
More informationPop-Up Governance: developing internal governance frameworks for consortia: the example of UK10K
Kaye et al. Life Sciences, Society and Policy (2015) 11:10 DOI 10.1186/s40504-015-0028-9 RESEARCH Pop-Up Governance: developing internal governance frameworks for consortia: the example of UK10K Jane Kaye
More informationescience and Post-Genome Biomedical Research
escience and Post-Genome Biomedical Research Thomas L. Casavant, Adam P. DeLuca Departments of Biomedical Engineering, Electrical Engineering and Ophthalmology Coordinated Laboratory for Computational
More informationRequest for Applications. Sharing Big Data for Health Care Innovation: Advancing the Objectives of the Global Alliance for Genomics and Health
1. Overview Request for Applications Sharing Big Data for Health Care Innovation: Advancing the Objectives of the Global Alliance for Genomics and Health In order for Canada to take full advantage of the
More informationELIXIR Scientific Programme 2014-2018
ELIXIR Scientific Programme 2014-2018 1 ELIXIR Scientific Programme 2014-2018 Contents About ELIXIR 1 Executive Summary 2 Europe s Bioinformatics Infrastructure: key challenges 2014-2018 4 ELIXIR s Strategic
More informationGenomics and Health Data Standards: Lessons from the Past and Present for a Genome-enabled Future
Genomics and Health Data Standards: Lessons from the Past and Present for a Genome-enabled Future Daniel Masys, MD Professor and Chair Department of Biomedical Informatics Professor of Medicine Vanderbilt
More informationSequencing and microarrays for genome analysis: complementary rather than competing?
Sequencing and microarrays for genome analysis: complementary rather than competing? Simon Hughes, Richard Capper, Sandra Lam and Nicole Sparkes Introduction The human genome is comprised of more than
More informationEuropean Educational Programme in Epidemiology
European Educational Programme in Epidemiology 29 th RESIDENTIAL SUMMER COURSE FLORENCE, ITALY Pre-courses 13 17 JUNE 2016 1/13 European Educational Programme in Epidemiology Pre-Course: Introduction to
More informationPractical Solutions for Big Data Analytics
Practical Solutions for Big Data Analytics Ravi Madduri Computation Institute (madduri@anl.gov) Paul Dave (pdave@uchicago.edu) Dinanath Sulakhe (sulakhe@uchicago.edu) Alex Rodriguez (arodri7@uchicago.edu)
More informationNCBI resources III: GEO and ftp site. Yanbin Yin Spring 2013
NCBI resources III: GEO and ftp site Yanbin Yin Spring 2013 1 Homework assignment 2 Search colon cancer at GEO and find a data Series and perform a GEO2R analysis Write a report (in word or ppt) to include
More informationThe Human Genome Project
The Human Genome Project Brief History of the Human Genome Project Physical Chromosome Maps Genetic (or Linkage) Maps DNA Markers Sequencing and Annotating Genomic DNA What Have We learned from the HGP?
More informationBIO 3352: BIOINFORMATICS II HYBRID COURSE SYLLABUS
BIO 3352: BIOINFORMATICS II HYBRID COURSE SYLLABUS NEW YORK CITY COLLEGE OF TECHNOLOGY The City University Of New York School of Arts and Sciences Biological Sciences Department Course title: Bioinformatics
More informationFast. Integrated Genome Browser & DAS. Easy. Flexible. Free. bioviz.org/igb
bioviz.org/igb Integrated Genome Browser & DAS Free tools for visualizing, sharing, and publishing genomes and genome-scale data. Easy Flexible Fast Free Funding: National Science Foundation Arabidopsis
More informationGenes for Good Consent Form
Genes for Good Consent Form Version 2.1 The next few screens contain information about Genes for Good and the benefits and risks of participating. This is called "informed consent", because we want you
More informationChallenges associated with analysis and storage of NGS data
Challenges associated with analysis and storage of NGS data Gabriella Rustici Research and training coordinator Functional Genomics Group gabry@ebi.ac.uk Next-generation sequencing Next-generation sequencing
More informationdixa a data infrastructure for chemical safety Jos Kleinjans Dept of Toxicogenomics Maastricht University
dixa a data infrastructure for chemical safety Jos Kleinjans Dept of Toxicogenomics Maastricht University Current protocol for chemical safety testing Short Term Tests for Genetic Toxicity Bacterial Reverse
More informationBiomedical Big Data and Precision Medicine
Biomedical Big Data and Precision Medicine Jie Yang Department of Mathematics, Statistics, and Computer Science University of Illinois at Chicago October 8, 2015 1 Explosion of Biomedical Data 2 Types
More informationBIO 3350: ELEMENTS OF BIOINFORMATICS PARTIALLY ONLINE SYLLABUS
BIO 3350: ELEMENTS OF BIOINFORMATICS PARTIALLY ONLINE SYLLABUS NEW YORK CITY COLLEGE OF TECHNOLOGY The City University Of New York School of Arts and Sciences Biological Sciences Department Course title:
More informationWhite Paper: NCBI Database of Genotypes and Phenotypes (dbgap) Security Best Practices Compliance Overview for the New DNAnexus Platform
White Paper: NCBI Database of Genotypes and Phenotypes (dbgap) Security Best Practices Compliance Overview for the New DNAnexus Platform April 18, 2013 Overview This White Paper summarizes how the new
More informationAutomated and Scalable Data Management System for Genome Sequencing Data
Automated and Scalable Data Management System for Genome Sequencing Data Michael Mueller NIHR Imperial BRC Informatics Facility Faculty of Medicine Hammersmith Hospital Campus Continuously falling costs
More informationOpenCB a next generation big data analytics and visualisation platform for the Omics revolution
OpenCB a next generation big data analytics and visualisation platform for the Omics revolution Development at the University of Cambridge - Closing the Omics / Moore s law gap with Dell & Intel Ignacio
More informationOverview. Overarching observations
Overview Genomics and Health Information Technology Systems: Exploring the Issues April 27-28, 2011, Bethesda, MD Brief Meeting Summary, prepared by Greg Feero, M.D., Ph.D. (planning committee chair) The
More informationTHE UNIVERSITY OF MANCHESTER Unit Specification
1. GENERAL INFORMATION Title Unit code Credit rating 15 Level 7 Contact hours 30 Other Scheduled teaching and learning activities* Pre-requisite units Co-requisite units School responsible Member of staff
More informationBig Data: Challenges and Opportunities
Big Data: Challenges and Opportunities NGWI & USDA/ARS Meeting USDA Carver Center April 16, 2014 Doreen Ware Acting Chief Science Information Officer USDA ARS Big Data: Challenges and Response Biology
More informationG E N OM I C S S E RV I C ES
GENOMICS SERVICES THE NEW YORK GENOME CENTER NYGC is an independent non-profit implementing advanced genomic research to improve diagnosis and treatment of serious diseases. capabilities. N E X T- G E
More informationBBSRC TECHNOLOGY STRATEGY: TECHNOLOGIES NEEDED BY RESEARCH KNOWLEDGE PROVIDERS
BBSRC TECHNOLOGY STRATEGY: TECHNOLOGIES NEEDED BY RESEARCH KNOWLEDGE PROVIDERS 1. The Technology Strategy sets out six areas where technological developments are required to push the frontiers of knowledge
More informationShouguo Gao Ph. D Department of Physics and Comprehensive Diabetes Center
Computational Challenges in Storage, Analysis and Interpretation of Next-Generation Sequencing Data Shouguo Gao Ph. D Department of Physics and Comprehensive Diabetes Center Next Generation Sequencing
More informationThe Data Publishing Landscape where are we now in 2015
The Data Publishing Landscape where are we now in 2015 Presentation at NFAIS webinar On 12 March 2015 Eefke Smit, STM Director for Standards and Technology 2015, International STM Association What is STM?
More informationBig Data to Knowledge (BD2K)
Big Data to Knowledge () potential funding agency synergies Jennie Larkin, PhD Office of the Associate Director of Data Science National Institutes of Health idash-pscanner meeting UCSD September 16, 2014
More informationElectronic Medical Records and Genomics: Possibilities, Realities, Ethical Issues to Consider
Electronic Medical Records and Genomics: Possibilities, Realities, Ethical Issues to Consider Daniel Masys, M.D. Affiliate Professor Biomedical and Health Informatics University of Washington, Seattle
More informationData deluge (and it s applications) Gianluigi Zanetti. Data deluge. (and its applications) Gianluigi Zanetti
Data deluge (and its applications) Prologue Data is becoming cheaper and cheaper to produce and store Driving mechanism is parallelism on sensors, storage, computing Data directly produced are complex
More informationAlison Yao, Ph.D. July 2014
* Alison Yao, Ph.D. Program Officer, Office of Genomics and Advanced Technologies Division of Microbiology and Infectious Diseases National Institute of Allergy and Infectious Diseases National Institutes
More informationHow does genetic testing work?
How does genetic testing work? What is a genetic test? A genetic test looks at to find changes (variants) that cause disease or put you at greater risk to develop disease. DNA is the code our bodies use
More informationVad är bioinformatik och varför behöver vi det i vården? a bioinformatician's perspectives
Vad är bioinformatik och varför behöver vi det i vården? a bioinformatician's perspectives Dirk.Repsilber@oru.se 2015-05-21 Functional Bioinformatics, Örebro University Vad är bioinformatik och varför
More informationServices. Updated 05/31/2016
Updated 05/31/2016 Services 1. Whole exome sequencing... 2 2. Whole Genome Sequencing (WGS)... 3 3. 16S rrna sequencing... 4 4. Customized gene panels... 5 5. RNA-Seq... 6 6. qpcr... 7 7. HLA typing...
More informationCore Facility Genomics
Core Facility Genomics versatile genome or transcriptome analyses based on quantifiable highthroughput data ascertainment 1 Topics Collaboration with Harald Binder and Clemens Kreutz Project: Microarray
More information1. WHY ARE ELECTRONIC MEDICAL RECORDS IMPORTANT FOR PERSONALIZED MEDICINE?
THE ELECTRONIC MEDICAL RECORD: A CRITICAL ISSUE IN PERSONALIZED MEDICINE 1. WHY ARE ELECTRONIC MEDICAL RECORDS IMPORTANT FOR PERSONALIZED MEDICINE? As initially configured, electronic medical records (EMRs)
More informationDeliverable 7.3.1 First report on sample storage, DNA extraction and sample analysis processes
Model Driven Paediatric European Digital Repository Call identifier: FP7-ICT-2011-9 - Grant agreement no: 600932 Thematic Priority: ICT - ICT-2011.5.2: Virtual Physiological Human Deliverable 7.3.1 First
More information2019 Healthcare That Works for All
2019 Healthcare That Works for All This paper is one of a series describing what a decade of successful change in healthcare could look like in 2019. Each paper focuses on one aspect of healthcare. To
More informationIMPLEMENTING BIG DATA IN TODAY S HEALTH CARE PRAXIS: A CONUNDRUM TO PATIENTS, CAREGIVERS AND OTHER STAKEHOLDERS - WHAT IS THE VALUE AND WHO PAYS
IMPLEMENTING BIG DATA IN TODAY S HEALTH CARE PRAXIS: A CONUNDRUM TO PATIENTS, CAREGIVERS AND OTHER STAKEHOLDERS - WHAT IS THE VALUE AND WHO PAYS 29 OCTOBER 2015 DR. DIRK J. EVERS BACKGROUND TreatmentMAP
More informationLifeScope Genomic Analysis Software 2.5
USER GUIDE LifeScope Genomic Analysis Software 2.5 Graphical User Interface DATA ANALYSIS METHODS AND INTERPRETATION Publication Part Number 4471877 Rev. A Revision Date November 2011 For Research Use
More informationEuropean registered Clinical Laboratory Geneticist (ErCLG) Core curriculum
(February 2015; updated from paper issued by the European Society of Human Genetics Ad hoc committee for the accreditation of clinical laboratory geneticists, published in February 2012) Speciality Profile
More information