ENCODE Data Coordina/on Center

Size: px
Start display at page:

Download "ENCODE Data Coordina/on Center"

Transcription

1 ENCODE Data Coordina/on Center Human Genome Mee/ng, Geneva 28 April 2014 J. Michael Cherry Department of Genetics Stanford University Stanford, California USA 1

2 Intro to the DCC Standards defini/on Using ontologies Accessing metadata

3 ENCODE Experiments 3

4 Types of assays in ENCODE project

5 ENCODE 3 Consor/um Structure Data Produc/on Groups RNA Histone Mods DNase DNAme TF Binding RBP Binding Data Coordina/on Center Technology Development Groups Data Analysis Center Analysis Working Group Computa/onal Analysis Groups Gene Models Chroma/n States Element ID 5

6 hwp://encodeproject.org 6

7 Role of the Data Coordina/on Center Genome Browser Produc/on labs Analysis groups Data files Metadata DCC DCC ENCODE Portal Integrative websites! Scientific! community! Role: Data genera-on Data organiza-on Data access Tasks: Perform assays Data processing & valida/on Web- based searches Perform analyses Data file storage Data downloads Validate data Metadata cura/on Submit data files Submit metadata

8 Challenge: How do you define standards for diverse assays in mul/ple species?

9 Intro to the DCC Standards defini/on Using ontologies Accessing metadata

10 What are metadata? Experimental Variables (/ssue, an/body target, assay) Experimental Design (Replicates and Controls) Non- experimental variables: (sequencing machine, protocols, product lots) Quality Assessment (IDR, unique reads, mapping ra/o) Credit (who, when, where) Validated Experiment Data processing choices (sodware, parameters, versions)

11 Define their rela/onship to each other An/bodies Experiment has Replicate has has Libraries + Biosample has has Donor has Files Biosample has

12 Intro to the DCC Metadata defini/on Using ontologies Accessing metadata

13 Single annota/on, accessed via mul/ple queries circulatory system mesoderm part_of develops_from develops_from part_of heart develops_from myoblast part_of develops_from Explicit rela/onships Inferred rela/onships hwp://uberon.org/ hwp://cellontology.org/ cardiac muscle cell

14 Instant interoperability & func/onality circulatory system mesoderm part_of develops_from develops_from part_of heart develops_from myoblast part_of develops_from Explicit rela/onships Inferred rela/onships hwp://uberon.org/ hwp://cellontology.org/ cardiac muscle cell

15 Why use standards? Consistency of language and iden/fiers facilitates iden/fica/on of data programma/cally. Alterna/ve spellings & phrases are synonyms. Independent of a par/cular data model. F = f = Female = female X X X Biological concepts are defined to provide scope Mitochondria: A semiautonomous, self replica8ng organelle that occurs in varying numbers, shapes, and sizes in the cytoplasm of virtually all eukaryo8c cells. It is notably the site of 8ssue respira8on. Rela/onships between terms can be computed to provide addi/onal annota/on details for grouping, searching, or analysis

16 Intro to the DCC Metadata defini/on Using ontologies Accessing metadata

17 the new ENCODE Portal hwp://

18 the new ENCODE Portal hwp://

19

20

21 21

22 Conclusions We developed a single data model that reflects the experimental process to store the 30+ assays done by the ENCODE produc/on labs Using ontologies to annotate metadata provides instant interoperability with other datasets & search func/onality Intro to the DCC Metadata defini/on Using ontologies Accessing metadata Applica/on built on a REST API & JSON- LD supports programma/c querying across other scien/fic resources

23 Contact encode-

24 Sources of ENCODE data ENCODE Portal hwp://encodeproject.org/ UCSC Genome Browser hwp://genome.ucsc.edu/cgi- bin/hggateway ENSEMBL hwp:// NCBI Bioproject: hwp:// GEO: Integra-ve websites ENCODE and Roadmap Epigenomics: hwp:// roadmap.org/ HaploReg: hwp:// RegulomeDB: hwp://regulomedb.org/

25 ENCODE metadata resources NHGRI project page: hwp:// Protocols hwp://encodeproject.org/encode/protocols/cell/ Standards pages hwp://encodeproject.org/encode/datastandards.html hwp://encodeproject.org/encode/valida/on/ SoGware and tools hwp://encodeproject.org/encode/sodwaretools.html File formats pages hwp://encodeproject.org/encode/fileformats.html hwp://genome.ucsc.edu/faq/faqformat.html

Web Services and Development of Semantic Applications

Web Services and Development of Semantic Applications Web Services and Development of Semantic Applications Trish Whetzel Outreach Coordinator THE NATIONAL CENTER FOR BIOMEDICAL ONTOLOGY Na#onal Center for Biomedical Ontology Mission To create software for

More information

GMQL Functional Comparison with BEDTools and BEDOPS

GMQL Functional Comparison with BEDTools and BEDOPS GMQL Functional Comparison with BEDTools and BEDOPS Genomic Computing Group Dipartimento di Elettronica, Informazione e Bioingegneria Politecnico di Milano This document presents a functional comparison

More information

NCBI resources III: GEO and ftp site. Yanbin Yin Spring 2013

NCBI resources III: GEO and ftp site. Yanbin Yin Spring 2013 NCBI resources III: GEO and ftp site Yanbin Yin Spring 2013 1 Homework assignment 2 Search colon cancer at GEO and find a data Series and perform a GEO2R analysis Write a report (in word or ppt) to include

More information

CS 5150 So(ware Engineering System Architecture: Introduc<on

CS 5150 So(ware Engineering System Architecture: Introduc<on Cornell University Compu1ng and Informa1on Science CS 5150 So(ware Engineering System Architecture: Introduc

More information

BIO 3352: BIOINFORMATICS II HYBRID COURSE SYLLABUS

BIO 3352: BIOINFORMATICS II HYBRID COURSE SYLLABUS BIO 3352: BIOINFORMATICS II HYBRID COURSE SYLLABUS NEW YORK CITY COLLEGE OF TECHNOLOGY The City University Of New York School of Arts and Sciences Biological Sciences Department Course title: Bioinformatics

More information

Theo JD Bothma Department of Informa1on Science theo.bothma@up.ac.za

Theo JD Bothma Department of Informa1on Science theo.bothma@up.ac.za Theo JD Bothma Department of Informa1on Science theo.bothma@up.ac.za Reflec1ons on the role of corpora and big data in e- lexicography in rela1on to end user informa1on needs CILC 2015 7th Interna1onal

More information

«Shanoir : une solu/on pour la ges/on de données distribuées en imagerie in- vivo» Jus/ne Guillaumont Isabelle Corouge

«Shanoir : une solu/on pour la ges/on de données distribuées en imagerie in- vivo» Jus/ne Guillaumont Isabelle Corouge «Shanoir : une solu/on pour la ges/on de données distribuées en imagerie in- vivo» Jus/ne Guillaumont Isabelle Corouge Shanoir: a solu-on for neuro- imaging data management Jus/ne Guillaumont, Isabelle

More information

GeneProf and the new GeneProf Web Services

GeneProf and the new GeneProf Web Services GeneProf and the new GeneProf Web Services Florian Halbritter florian.halbritter@ed.ac.uk Stem Cell Bioinformatics Group (Simon R. Tomlinson) simon.tomlinson@ed.ac.uk December 10, 2012 Florian Halbritter

More information

Euro-BioImaging European Research Infrastructure for Imaging Technologies in Biological and Biomedical Sciences

Euro-BioImaging European Research Infrastructure for Imaging Technologies in Biological and Biomedical Sciences Euro-BioImaging European Research Infrastructure for Imaging Technologies in Biological and Biomedical Sciences WP11 Data Storage and Analysis Task 11.1 Coordination Deliverable 11.3 Selected Standards

More information

Data archiving and reproducible research for ecology and evolu6on. March 23 rd 2010 Ian Dworkin

Data archiving and reproducible research for ecology and evolu6on. March 23 rd 2010 Ian Dworkin Data archiving and reproducible research for ecology and evolu6on. March 23 rd 2010 Ian Dworkin Outline of the ques6ons for the workshop 1. Why should I share my data? 2. When should I share my data? 3.

More information

Module 3. Genome Browsing. Using Web Browsers to View Genome Annota4on. Kers4n Howe Wellcome Trust Sanger Ins4tute zfish- help@sanger.ac.

Module 3. Genome Browsing. Using Web Browsers to View Genome Annota4on. Kers4n Howe Wellcome Trust Sanger Ins4tute zfish- help@sanger.ac. Module 3 Genome Browsing Using Web Browsers to View Genome Annota4on Kers4n Howe Wellcome Trust Sanger Ins4tute zfish- help@sanger.ac.uk Introduc.on Genome browsing The Ensembl gene set Guided examples

More information

Data Mining. Supervised Methods. Ciro Donalek donalek@astro.caltech.edu. Ay/Bi 199ab: Methods of Computa@onal Sciences hcp://esci101.blogspot.

Data Mining. Supervised Methods. Ciro Donalek donalek@astro.caltech.edu. Ay/Bi 199ab: Methods of Computa@onal Sciences hcp://esci101.blogspot. Data Mining Supervised Methods Ciro Donalek donalek@astro.caltech.edu Supervised Methods Summary Ar@ficial Neural Networks Mul@layer Perceptron Support Vector Machines SoLwares Supervised Models: Supervised

More information

Globus Research Data Management: Introduction and Service Overview. Steve Tuecke Vas Vasiliadis

Globus Research Data Management: Introduction and Service Overview. Steve Tuecke Vas Vasiliadis Globus Research Data Management: Introduction and Service Overview Steve Tuecke Vas Vasiliadis Presentations and other useful information available at globus.org/events/xsede15/tutorial 2 Thank you to

More information

Technical Update 2008

Technical Update 2008 Technical Update 2008 Sandy Payette, Chief Executive Dan Davis, Chief Software Architect April 27, 2008 Mission Driven Use Cases Scholarly and Scien.fic Research and Communica.on Data Cura.on, Linking,

More information

Next generation DNA sequencing technologies. theory & prac-ce

Next generation DNA sequencing technologies. theory & prac-ce Next generation DNA sequencing technologies theory & prac-ce Outline Next- Genera-on sequencing (NGS) technologies overview NGS applica-ons NGS workflow: data collec-on and processing the exome sequencing

More information

RETRIEVING SEQUENCE INFORMATION. Nucleotide sequence databases. Database search. Sequence alignment and comparison

RETRIEVING SEQUENCE INFORMATION. Nucleotide sequence databases. Database search. Sequence alignment and comparison RETRIEVING SEQUENCE INFORMATION Nucleotide sequence databases Database search Sequence alignment and comparison Biological sequence databases Originally just a storage place for sequences. Currently the

More information

SBML SBGN SBML Just my 2 cents. Alice C. Villéger COMBINE 2010

SBML SBGN SBML Just my 2 cents. Alice C. Villéger COMBINE 2010 SBML SBGN SBML Just my 2 cents Alice C. Villéger COMBINE 2010 Disclaimer Fuzzy talk work in progress last minute slides Someone else has been working on very similar stuff and should really have been talking

More information

Project Management Introduc1on

Project Management Introduc1on Project Management Introduc1on Session 1 Part I Introduc1on By Amal Le Collen, PMP Dr. Lauren1u Neamtu, PMP Session outline 1. PART I: Introduc1on 1. The Purpose of the PMBOK Guide 2. What is a project?

More information

Chapter 3. Database Architectures and the Web Transparencies

Chapter 3. Database Architectures and the Web Transparencies Week 2: Chapter 3 Chapter 3 Database Architectures and the Web Transparencies Database Environment - Objec

More information

Building the Systems Biology Knowledgebase

Building the Systems Biology Knowledgebase Building the Systems Biology Knowledgebase Tom Brettin Oak Ridge National Laboratory brettints@ornl.gov outreach@kbase.us kbase-users@lists.kbase.us kbase-devel@lists.kbase.us Integrate science and the

More information

Pa#ent Involvement in Clinical Research In Rela#onship with Biobanking BBMRI 15 December 2009

Pa#ent Involvement in Clinical Research In Rela#onship with Biobanking BBMRI 15 December 2009 Pa#ent Involvement in Clinical Research In Rela#onship with Biobanking BBMRI 15 December 2009 Cor Oosterwijk Project Coordinator Pa;entPartner Dutch Gene;c Alliance VSOP European Gene;c Alliances Network

More information

BIOINF 525 Winter 2016 Foundations of Bioinformatics and Systems Biology http://tinyurl.com/bioinf525-w16

BIOINF 525 Winter 2016 Foundations of Bioinformatics and Systems Biology http://tinyurl.com/bioinf525-w16 Course Director: Dr. Barry Grant (DCM&B, bjgrant@med.umich.edu) Description: This is a three module course covering (1) Foundations of Bioinformatics, (2) Statistics in Bioinformatics, and (3) Systems

More information

Bank of America Security by Design. Derrick Barksdale Jason Gillam

Bank of America Security by Design. Derrick Barksdale Jason Gillam Bank of America Security by Design Derrick Barksdale Jason Gillam Costs of Correcting Defects 2 Bank of America The Three P s Product Design and build security into our product People Cultivate a security

More information

Euro-BioImaging European Research Infrastructure for Imaging Technologies in Biological and Biomedical Sciences

Euro-BioImaging European Research Infrastructure for Imaging Technologies in Biological and Biomedical Sciences Euro-BioImaging European Research Infrastructure for Imaging Technologies in Biological and Biomedical Sciences WP11 Data Storage and Analysis Task 11.1 Coordination Deliverable 11.2 Community Needs of

More information

Data Warehousing. Yeow Wei Choong Anne Laurent

Data Warehousing. Yeow Wei Choong Anne Laurent Data Warehousing Yeow Wei Choong Anne Laurent Databases Databases are developed on the IDEA that DATA is one of the cri>cal materials of the Informa>on Age Informa>on, which is created by data, becomes

More information

Strategies for Medical Device So2ware Development Presented By Anthony Giles of Blackwood Embedded Solu;ons And a Case Study by Francis Amoah of Creo

Strategies for Medical Device So2ware Development Presented By Anthony Giles of Blackwood Embedded Solu;ons And a Case Study by Francis Amoah of Creo Strategies for Medical Device So2ware Development Presented By Anthony Giles of Blackwood Embedded Solu;ons And a Case Study by Francis Amoah of Creo Medical Introduc;on Standards 60601-1 in par;cular

More information

Performance Management. Ch. 9 The Performance Measurement. Mechanism. Chiara Demar8ni UNIVERSITY OF PAVIA. mariachiara.demar8ni@unipv.

Performance Management. Ch. 9 The Performance Measurement. Mechanism. Chiara Demar8ni UNIVERSITY OF PAVIA. mariachiara.demar8ni@unipv. UNIVERSITY OF PAVIA Performance Management Ch. 9 The Performance Measurement Mechanism Chiara Demar8ni mariachiara.demar8ni@unipv.it Master in Interna+onal Business and Economics Defini8on Performance

More information

Extrac'ng People s Hobby and Interest Informa'on from Social Media Content

Extrac'ng People s Hobby and Interest Informa'on from Social Media Content Extrac'ng People s Hobby and Interest Informa'on from Social Media Content Thomas Forss, Shuhua Liu and Kaj- Mikael Björk Dept of Business Administra?on and Analy?cs Arcada University of Applied Sciences

More information

Application of Supply Chain Concepts to the Analysis Process

Application of Supply Chain Concepts to the Analysis Process Application of Supply Chain Concepts to the Analysis Process Rob Handfield, PhD Bank of America University Distinguished Professor of Supply Chain Management Executive Director, Supply Chain Resource Cooperative

More information

NCDS Leadership Summit " The Friday Center" Chapel Hill, North Carolina" April 23 & 24, 2013!

NCDS Leadership Summit  The Friday Center Chapel Hill, North Carolina April 23 & 24, 2013! NCDS Leadership Summit " The Friday Center" Chapel Hill, North Carolina" April 23 & 24, 2013! Data Collection Scale of Problem Challenges v Research versus clinical contexts v Science versus medicine v

More information

RNAseq / ChipSeq / Methylseq and personalized genomics

RNAseq / ChipSeq / Methylseq and personalized genomics RNAseq / ChipSeq / Methylseq and personalized genomics 7711 Lecture Subhajyo) De, PhD Division of Biomedical Informa)cs and Personalized Biomedicine, Department of Medicine University of Colorado School

More information

Research Data Management and the role of libraries

Research Data Management and the role of libraries Research Data Management and the role of libraries Sarah Jones Digital Cura9on Centre sarah.jones@glasgow.ac.uk Twi@er: @sjdcc ADBU conference: Quelle(s) stratégie(s) de recherche face à la nouvelle massifica>on

More information

Language Resources, Language Technology, Text Mining, the Seman8c Web: How interoperability of machines can help humans in the mul8lingual web

Language Resources, Language Technology, Text Mining, the Seman8c Web: How interoperability of machines can help humans in the mul8lingual web Language Resources, Language Technology, Text Mining, the Seman8c Web: How interoperability of machines can help humans in the mul8lingual web Felix Sasaki DFKI / University of Appl. Sciences Potsdam W3C

More information

From Big Data to Value

From Big Data to Value From Big Data to Value The Power of Master Data Management 2.0 Sergio Juarez SVP Elemica EMEA & LATAM Reveal Oct 2014 Agenda Master Data Management Why Now? What To Do? How To Do It? What s Next? Today

More information

University of Glasgow - Programme Structure Summary C1G5-5100 MSc Bioinformatics, Polyomics and Systems Biology

University of Glasgow - Programme Structure Summary C1G5-5100 MSc Bioinformatics, Polyomics and Systems Biology University of Glasgow - Programme Structure Summary C1G5-5100 MSc Bioinformatics, Polyomics and Systems Biology Programme Structure - the MSc outcome will require 180 credits total (full-time only) - 60

More information

Migrating to Hosted Telephony. Your ultimate guide to migrating from on premise to hosted telephony. www.ucandc.com

Migrating to Hosted Telephony. Your ultimate guide to migrating from on premise to hosted telephony. www.ucandc.com Migrating to Hosted Telephony Your ultimate guide to migrating from on premise to hosted telephony Intro What is covered in this guide? A professional and reliable business telephone system is a central

More information

IT Change Management Process Training

IT Change Management Process Training IT Change Management Process Training Before you begin: This course was prepared for all IT professionals with the goal of promo9ng awareness of the process. Those taking this course will have varied knowledge

More information

RESTful or RESTless Current State of Today's Top Web APIs

RESTful or RESTless Current State of Today's Top Web APIs RESTful or RESTless Current State of Today's Top Web APIs Frederik Buelthoff, Maria Maleshkova AIFB, Karlsruhe Ins-tute of Technology (KIT), Germany [1] Growing Number of Web APIs Challenges Scalability

More information

Comparing Methods for Identifying Transcription Factor Target Genes

Comparing Methods for Identifying Transcription Factor Target Genes Comparing Methods for Identifying Transcription Factor Target Genes Alena van Bömmel (R 3.3.73) Matthew Huska (R 3.3.18) Max Planck Institute for Molecular Genetics Folie 1 Transcriptional Regulation TF

More information

Vision of Interoperability Jamie Ferguson, Stan Huff, Cris Ross

Vision of Interoperability Jamie Ferguson, Stan Huff, Cris Ross Vision of Interoperability Jamie Ferguson, Stan Huff, Cris Ross Evolu&on of Interoperability As HIE evolves, the interoperability framework standards advance for reliable exchange and data integra=on across

More information

org.rn.eg.db December 16, 2015 org.rn.egaccnum is an R object that contains mappings between Entrez Gene identifiers and GenBank accession numbers.

org.rn.eg.db December 16, 2015 org.rn.egaccnum is an R object that contains mappings between Entrez Gene identifiers and GenBank accession numbers. org.rn.eg.db December 16, 2015 org.rn.egaccnum Map Entrez Gene identifiers to GenBank Accession Numbers org.rn.egaccnum is an R object that contains mappings between Entrez Gene identifiers and GenBank

More information

The Data Reservoir. 10 th September 2014. Mandy Chessell FREng CEng FBCS Dis4nguished Engineer, Master Inventor Chief Architect, Informa4on Solu4ons

The Data Reservoir. 10 th September 2014. Mandy Chessell FREng CEng FBCS Dis4nguished Engineer, Master Inventor Chief Architect, Informa4on Solu4ons Mandy Chessell FREng CEng FBCS Dis4nguished Engineer, Master Inventor Chief Architect, Solu4ons The Reservoir 10 th September 2014 A growing demand Business Teams want Open access to more informa4on More

More information

Building custom memory profilers. In# Gonzalez- Herrera Diverse Team Advisers: Johann Bourcier and Olivier Barais

Building custom memory profilers. In# Gonzalez- Herrera Diverse Team Advisers: Johann Bourcier and Olivier Barais Building custom memory profilers In# Gonzalez- Herrera Diverse Team Advisers: Johann Bourcier and Olivier Barais What are and why we need custom memory profilers? Developers think in terms of high- level

More information

ARTIST Methodology and Tooling. Jesus Gorroñogoitia - Atos SOC Crete, 1 st July 2015

ARTIST Methodology and Tooling. Jesus Gorroñogoitia - Atos SOC Crete, 1 st July 2015 ARTIST Methodology and Tooling Jesus Gorroñogoitia - Atos SOC Crete, 1 st July 2015 Motivation: From SaaP to SaaS So#ware as a Product based Company So#ware as a Service based Company : Cloud Computing

More information

Interna'onal Standards Ac'vi'es on Cloud Security EVA KUIPER, CISA CISSP EVA.KUIPER@HP.COM HP ENTERPRISE SECURITY SERVICES

Interna'onal Standards Ac'vi'es on Cloud Security EVA KUIPER, CISA CISSP EVA.KUIPER@HP.COM HP ENTERPRISE SECURITY SERVICES Interna'onal Standards Ac'vi'es on Cloud Security EVA KUIPER, CISA CISSP EVA.KUIPER@HP.COM HP ENTERPRISE SECURITY SERVICES Agenda Importance of Common Cloud Standards Outline current work undertaken Define

More information

Importance of Statistics in creating high dimensional data

Importance of Statistics in creating high dimensional data Importance of Statistics in creating high dimensional data Hemant K. Tiwari, PhD Section on Statistical Genetics Department of Biostatistics University of Alabama at Birmingham History of Genomic Data

More information

Alison Yao, Ph.D. July 2014

Alison Yao, Ph.D. July 2014 * Alison Yao, Ph.D. Program Officer, Office of Genomics and Advanced Technologies Division of Microbiology and Infectious Diseases National Institute of Allergy and Infectious Diseases National Institutes

More information

Semantic Search in Portals using Ontologies

Semantic Search in Portals using Ontologies Semantic Search in Portals using Ontologies Wallace Anacleto Pinheiro Ana Maria de C. Moura Military Institute of Engineering - IME/RJ Department of Computer Engineering - Rio de Janeiro - Brazil [awallace,anamoura]@de9.ime.eb.br

More information

Tutorial. Reference Genome Tracks. Sample to Insight. November 27, 2015

Tutorial. Reference Genome Tracks. Sample to Insight. November 27, 2015 Reference Genome Tracks November 27, 2015 Sample to Insight CLC bio, a QIAGEN Company Silkeborgvej 2 Prismet 8000 Aarhus C Denmark Telephone: +45 70 22 32 44 www.clcbio.com support-clcbio@qiagen.com Reference

More information

Check Your Data Freedom: A Taxonomy to Assess Life Science Database Openness

Check Your Data Freedom: A Taxonomy to Assess Life Science Database Openness Check Your Data Freedom: A Taxonomy to Assess Life Science Database Openness Melanie Dulong de Rosnay Fellow, Science Commons and Berkman Center for Internet & Society at Harvard University This article

More information

!"#$%&'()*#"+,&-(.#,"*'/'.%-*

!#$%&'()*#+,&-(.#,*'/'.%-* !"#$%&'()*#"+,&-(.#,"*'/'.%-*!01234567* #0894:6;90* '!#'?* 15* =@3* 03A* B30346;90* 98* 10=3B46=3C* 59DA643* 894* %0=34E4153* &359F4G3* -606B3:30=* >%&-?* =@6=* E4921C35* =@3* 836=F435* 60C* 8F0G;90671;35*

More information

Open Access to Manuscripts, Open Science, and Big Data

Open Access to Manuscripts, Open Science, and Big Data Open Access to Manuscripts, Open Science, and Big Data Progress, and the Elsevier Perspective in 2013 Presented by: Dan Morgan Title: Senior Manager Access Relations, Global Academic Relations Company

More information

Three data delivery cases for EMBL- EBI s Embassy. Guy Cochrane www.ebi.ac.uk

Three data delivery cases for EMBL- EBI s Embassy. Guy Cochrane www.ebi.ac.uk Three data delivery cases for EMBL- EBI s Embassy Guy Cochrane www.ebi.ac.uk EMBL European Bioinformatics Institute Genes, genomes & variation European Nucleotide Archive 1000 Genomes Ensembl Ensembl Genomes

More information

Introduc)on to the IoT- A methodology

Introduc)on to the IoT- A methodology 10/11/14 1 Introduc)on to the IoTA methodology Olivier SAVRY CEA LETI 10/11/14 2 IoTA Objec)ves Provide a reference model of architecture (ARM) based on Interoperability Scalability Security and Privacy

More information

CSER & emerge Consor.a EHR Working Group Collabora.on on Display and Storage of Gene.c Informa.on in Electronic Health Records

CSER & emerge Consor.a EHR Working Group Collabora.on on Display and Storage of Gene.c Informa.on in Electronic Health Records electronic Medical Records and Genomics CSER & emerge Consor.a EHR Working Group Collabora.on on Display and Storage of Gene.c Informa.on in Electronic Health Records Brian Shirts, MD, PhD University of

More information

On-line supplement to manuscript Galaxy for collaborative analysis of ENCODE data: Making large-scale analyses biologist-friendly

On-line supplement to manuscript Galaxy for collaborative analysis of ENCODE data: Making large-scale analyses biologist-friendly On-line supplement to manuscript Galaxy for collaborative analysis of ENCODE data: Making large-scale analyses biologist-friendly DANIEL BLANKENBERG, JAMES TAYLOR, IAN SCHENCK, JIANBIN HE, YI ZHANG, MATTHEW

More information

17 July 2014 WEB-SERVER MANUAL. Contact: Michael Hackenberg (hackenberg@ugr.es)

17 July 2014 WEB-SERVER MANUAL. Contact: Michael Hackenberg (hackenberg@ugr.es) WEB-SERVER MANUAL Contact: Michael Hackenberg (hackenberg@ugr.es) 1 1 Introduction srnabench is a free web-server tool and standalone application for processing small- RNA data obtained from next generation

More information

The EcoCyc Curation Process

The EcoCyc Curation Process The EcoCyc Curation Process Ingrid M. Keseler SRI International 1 HOW OFTEN IS THE GOLDEN GATE BRIDGE PAINTED? Many misconceptions exist about how often the Bridge is painted. Some say once every seven

More information

Internet Storage Sync Problem Statement

Internet Storage Sync Problem Statement Internet Storage Sync Problem Statement draft-cui-iss-problem Zeqi Lai Tsinghua University 1 Outline Background Problem Statement Service Usability Protocol Capabili?es Our Explora?on on Protocol Capabili?es

More information

Telephone Related Queries (TeRQ) IETF 85 (Atlanta)

Telephone Related Queries (TeRQ) IETF 85 (Atlanta) Telephone Related Queries (TeRQ) IETF 85 (Atlanta) Telephones and the Internet Our long- term goal: migrate telephone rou?ng and directory services to the Internet ENUM: Deviated significantly from its

More information

Graduate Systems Engineering Programs: Report on Outcomes and Objec:ves

Graduate Systems Engineering Programs: Report on Outcomes and Objec:ves Graduate Systems Engineering Programs: Report on Outcomes and Objec:ves Alice Squires, alice.squires@stevens.edu Tim Ferris, David Olwell, Nicole Hutchison, Rick Adcock, John BrackeL, Mary VanLeer, Tom

More information

Module 1. Sequence Formats and Retrieval. Charles Steward

Module 1. Sequence Formats and Retrieval. Charles Steward The Open Door Workshop Module 1 Sequence Formats and Retrieval Charles Steward 1 Aims Acquaint you with different file formats and associated annotations. Introduce different nucleotide and protein databases.

More information

Using the Grid for the interactive workflow management in biomedicine. Andrea Schenone BIOLAB DIST University of Genova

Using the Grid for the interactive workflow management in biomedicine. Andrea Schenone BIOLAB DIST University of Genova Using the Grid for the interactive workflow management in biomedicine Andrea Schenone BIOLAB DIST University of Genova overview background requirements solution case study results background A multilevel

More information

IMBB 2013. Genomic DNA purifica8on

IMBB 2013. Genomic DNA purifica8on IMBB 2013 Genomic DNA purifica8on Why purify DNA? The purpose of DNA purifica8on from the cell/8ssue is to ensure it performs well in subsequent downstream applica8ons, e.g. Polymerase Chain Reac8on (PCR),

More information

Genome and DNA Sequence Databases. BME 110/BIOL 181 CompBio Tools Todd Lowe March 31, 2009

Genome and DNA Sequence Databases. BME 110/BIOL 181 CompBio Tools Todd Lowe March 31, 2009 Genome and DNA Sequence Databases BME 110/BIOL 181 CompBio Tools Todd Lowe March 31, 2009 Admin Reading: Chapters 1 & 2 Notes available in PDF format on-line (see class calendar page): http://www.soe.ucsc.edu/classes/bme110/spring09/bme110-calendar.html

More information

A Primer of Genome Science THIRD

A Primer of Genome Science THIRD A Primer of Genome Science THIRD EDITION GREG GIBSON-SPENCER V. MUSE North Carolina State University Sinauer Associates, Inc. Publishers Sunderland, Massachusetts USA Contents Preface xi 1 Genome Projects:

More information

Offensive & Defensive & Forensic Techniques for Determining Web User Iden<ty

Offensive & Defensive & Forensic Techniques for Determining Web User Iden<ty Offensive & Defensive & Forensic Techniques for Determining Web User Iden

More information

Usability in bioinformatics mobile applications

Usability in bioinformatics mobile applications Usability in bioinformatics mobile applications what we are working on Noura Chelbah, Sergio Díaz, Óscar Torreño, and myself Juan Falgueras App name Performs Advantajes Dissatvantajes Link The problem

More information

Urban Big Data Centre

Urban Big Data Centre Urban Big Data Centre Piyushimita Thakuriah (Vonu) Director, UBDC Professor and Ch2M Chair of Transport UNIVERSITY OF GLASGOW November 12, 2015 July 10, 2015 UBDC Partners Funded by ESRC Big Data Network

More information

So#ware quality assurance - introduc4on. Dr Ana Magazinius

So#ware quality assurance - introduc4on. Dr Ana Magazinius So#ware quality assurance - introduc4on Dr Ana Magazinius 1 What is quality? 2 What is a good quality car? 2 and 2 2 minutes 3 characteris4cs 3 What is quality? 4 What is quality? How good or bad something

More information

How To Understand The Big Data Paradigm

How To Understand The Big Data Paradigm Big Data and Its Empiricist Founda4ons Teresa Scantamburlo The evolu4on of Data Science The mechaniza4on of induc4on The business of data The Big Data paradigm (data + computa4on) Cri4cal analysis Tenta4ve

More information

Seman&c Web: Benefits For Clinical Decision Support At The Bedside. Emory Fry, MD SemTechBiz 2013

Seman&c Web: Benefits For Clinical Decision Support At The Bedside. Emory Fry, MD SemTechBiz 2013 Seman&c Web: Benefits For Clinical Decision Support At The Bedside Emory Fry, MD SemTechBiz 2013 Clinical Decision Support (CDS) A system providing knowledge and person specific or popula8on informa8on

More information

DEFINING COMPONENTS OF NATIONAL REDD+ FINANCIAL PLANNING

DEFINING COMPONENTS OF NATIONAL REDD+ FINANCIAL PLANNING DEFINING COMPONENTS OF NATIONAL REDD+ FINANCIAL PLANNING WORKSHOP ON BUILDING MULTI- SOURCE REDD+ FINANCING STRATEGIES Antigua, Guatemala July 17 and 18, 2014 Objec'ves of REDD+ Financial Planning Financial

More information

DDC Sequencing and Redundancy

DDC Sequencing and Redundancy DDC Sequencing and Redundancy Presenter Sequencing Importance of sequencing Essen%al piece to designing and delivering a successful project Defines how disparate components interact to make up a system

More information

Cells, tissues and organs

Cells, tissues and organs Chapter 8: Cells, tissues and organs Cells: building blocks of life Living things are made of cells. Many of the chemical reactions that keep organisms alive (metabolic functions) take place in cells.

More information

cheminformatics nomenclature activity binding based data sets knowledge thesauri article

cheminformatics nomenclature activity binding based data sets knowledge thesauri article PubChem chemical biology domain caspase activity standards cheminformatics nomenclature activity semantic enzyme reporter viability fluorescence binding based data sets programming knowledge search screening

More information

SPRING BRANCH ISD Leadership Work Web Mee;ng

SPRING BRANCH ISD Leadership Work Web Mee;ng SPRING BRANCH ISD Leadership Work Web Mee;ng Ellio% Witney Exec. Dir. of Strategic Ini;a;ves & Innova;on Email: elliog.witney@springbranchisd.com LinkedIn: www.linkedin.com/in/ elliogwitney Twi%er: @ejwitney

More information

GenBank, Entrez, & FASTA

GenBank, Entrez, & FASTA GenBank, Entrez, & FASTA Nucleotide Sequence Databases First generation GenBank is a representative example started as sort of a museum to preserve knowledge of a sequence from first discovery great repositories,

More information

Report of the DTL focus meeting on Life Science Data Repositories

Report of the DTL focus meeting on Life Science Data Repositories Report of the DTL focus meeting on Life Science Data Repositories Goal The goal of the meeting was to inform and discuss research data repositories for life sciences. The big data era adds to the complexity

More information

What is Distributed Annotation System?

What is Distributed Annotation System? Contents ISiLS Lecture 12 short introduction to data integration F.J. Verbeek Genome browsers Solutions for integration CORBA SOAP DAS Ontology mapping 2 nd lecture BioASP roadshow 1 2 Human Genome Browsers

More information

Ibis: Scaling Python Analy=cs on Hadoop and Impala

Ibis: Scaling Python Analy=cs on Hadoop and Impala Ibis: Scaling Python Analy=cs on Hadoop and Impala Wes McKinney, Budapest BI Forum 2015-10- 14 @wesmckinn 1 Me R&D at Cloudera Serial creator of structured data tools / user interfaces Mathema=cian MIT

More information

School of Nursing. Presented by Yvette Conley, PhD

School of Nursing. Presented by Yvette Conley, PhD Presented by Yvette Conley, PhD What we will cover during this webcast: Briefly discuss the approaches introduced in the paper: Genome Sequencing Genome Wide Association Studies Epigenomics Gene Expression

More information

Retaining globally distributed high availability Art van Scheppingen Head of Database Engineering

Retaining globally distributed high availability Art van Scheppingen Head of Database Engineering Retaining globally distributed high availability Art van Scheppingen Head of Database Engineering Overview 1. Who is Spil Games? 2. Theory 3. Spil Storage Pla9orm 4. Ques=ons? 2 Who are we? Who is Spil

More information

Fifth Grade Cells: Structures and Processes Assessment

Fifth Grade Cells: Structures and Processes Assessment Fifth Grade Cells: Structures and Processes Assessment 1a. All living things are made up of. a. cells b. tissues c. organisms d. systems 1b. All living things are made up of. 1c. Explain what cells are

More information

Introduction. Overview of Bioconductor packages for short read analysis

Introduction. Overview of Bioconductor packages for short read analysis Overview of Bioconductor packages for short read analysis Introduction General introduction SRAdb Pseudo code (Shortread) Short overview of some packages Quality assessment Example sequencing data in Bioconductor

More information

The importance of supply chain

The importance of supply chain Guidelines for Improving Supply Chain Management at Bulgarian Enterprises Chief Assist. Prof. Miroslava Rakovska, Ph.D. Department of Business Logistics, UNWE tеl. (02) 9435248, 0888704178 e-mail: mirar@unwe.acad.bg

More information

Human Genome Sequencing Project: What Did We Learn? 02-223 How to Analyze Your Own Genome Fall 2013

Human Genome Sequencing Project: What Did We Learn? 02-223 How to Analyze Your Own Genome Fall 2013 Human Genome Sequencing Project: What Did We Learn? 02-223 How to Analyze Your Own Genome Fall 2013 Human Genome Sequencing Project Interna:onal Human Genome Sequencing Consor:um ( public project ) Ini$al

More information

PrimePCR Assay Validation Report

PrimePCR Assay Validation Report Gene Information Gene Name sorbin and SH3 domain containing 2 Gene Symbol Organism Gene Summary Gene Aliases RefSeq Accession No. UniGene ID Ensembl Gene ID SORBS2 Human Arg and c-abl represent the mammalian

More information

Discovery & Modeling of Genomic Regulatory Networks with Big Data

Discovery & Modeling of Genomic Regulatory Networks with Big Data Discovery & Modeling of Genomic Regulatory Networks with Big Data Hamid Bolouri Division of Human Biology Fred Hutchinson Cancer Research Center labs.fhcrc.org/bolouri I have no financial relationships

More information

Distributed Systems Interconnec=ng Them Fundamentals of Distributed Systems Alvaro A A Fernandes School of Computer Science University of Manchester

Distributed Systems Interconnec=ng Them Fundamentals of Distributed Systems Alvaro A A Fernandes School of Computer Science University of Manchester Distributed Systems Interconnec=ng Them Fundamentals of Distributed Systems lvaro Fernandes School of Computer Science University of Manchester Goals 1. To highlight the role of the interconnect in characterizing

More information

Clouds and Other Computa1onal Frameworks. Evere7 Toews, Cybera Inc. Todd King, UCLA

Clouds and Other Computa1onal Frameworks. Evere7 Toews, Cybera Inc. Todd King, UCLA Clouds and Other Computa1onal Frameworks Evere7 Toews, Cybera Inc. Todd King, UCLA Presenta1on Overview The cloud can be a great fit for your computa1onal and storage needs Projects overview The cloud

More information

Update on the Cloud Demonstration Project

Update on the Cloud Demonstration Project Update on the Cloud Demonstration Project Khalil Yazdi and Steven Wallace Spring Member Meeting April 19, 2011 Project Par4cipants BACKGROUND Eleven Universi1es: Caltech, Carnegie Mellon, George Mason,

More information

2015-16 ITS Strategic Plan Enabling an Unbounded University

2015-16 ITS Strategic Plan Enabling an Unbounded University 2015-16 ITS Strategic Plan Enabling an Unbounded University Update: July 31, 2015 IniAaAve: Agility Through Technology Vision Mission Enable Unbounded Learning Support student success through the innovaave

More information

Welcome! Accelera'ng Pa'ent- Centered Outcomes Research and Methodological Research. Andrea Heckert, PhD, MPH Program Officer, Science

Welcome! Accelera'ng Pa'ent- Centered Outcomes Research and Methodological Research. Andrea Heckert, PhD, MPH Program Officer, Science Accelera'ng Pa'ent- Centered Outcomes Research and Methodological Research Emily Evans, PhD, MPH Program Officer, Science Andrea Heckert, PhD, MPH Program Officer, Science June 22, 2015 Welcome! Emily

More information

An Open Dynamic Big Data Driven Applica3on System Toolkit

An Open Dynamic Big Data Driven Applica3on System Toolkit An Open Dynamic Big Data Driven Applica3on System Toolkit Craig C. Douglas University of Wyoming and KAUST This research is supported in part by the Na3onal Science Founda3on and King Abdullah University

More information

Bio-IT World 2013 Best Practices Awards

Bio-IT World 2013 Best Practices Awards Published Resources for the Life Sciences 250 First Avenue, Suite 300, Needham, MA 02494 phone: 781-972-5400 fax: 781-972-5425 Bio-IT World 2013 Best Practices Awards Celebrating Excellence in Innovation

More information

Research Data Networks: Privacy- Preserving Sharing of Protected Health Informa>on

Research Data Networks: Privacy- Preserving Sharing of Protected Health Informa>on Research Data Networks: Privacy- Preserving Sharing of Protected Health Informa>on Lucila Ohno-Machado, MD, PhD Division of Biomedical Informatics University of California San Diego PCORI Workshop 7/2/12

More information

Shouguo Gao Ph. D Department of Physics and Comprehensive Diabetes Center

Shouguo Gao Ph. D Department of Physics and Comprehensive Diabetes Center Computational Challenges in Storage, Analysis and Interpretation of Next-Generation Sequencing Data Shouguo Gao Ph. D Department of Physics and Comprehensive Diabetes Center Next Generation Sequencing

More information

Genome Viewing. Module 2. Using Genome Browsers to View Annotation of the Human Genome

Genome Viewing. Module 2. Using Genome Browsers to View Annotation of the Human Genome Module 2 Genome Viewing Using Genome Browsers to View Annotation of the Human Genome Bert Overduin, Ph.D. PANDA Coordination & Outreach EMBL - European Bioinformatics Institute Wellcome Trust Genome Campus

More information

Main Research Gaps in Cyber Security

Main Research Gaps in Cyber Security Comprehensive Approach to cyber roadmap coordina5on and development Main Research Gaps in Cyber Security María Pilar Torres Bruna everis Aerospace and Defence Index CAMINO WP2: Iden8fica8on and Analysis

More information