Alison Yao, Ph.D. July 2014



Similar documents
NIAID Genomics and Bioinformatics Programs

Big Data to Knowledge (BD2K)

Vivien Bonazzi ADDS Office (OD) George Komatsoulis (NCBI)

An Introduction to Genomics and SAS Scientific Discovery Solutions

Euro-BioImaging European Research Infrastructure for Imaging Technologies in Biological and Biomedical Sciences

Cloud BioLinux: Pre-configured and On-demand Bioinformatics Computing for the Genomics Community

An EVIDENCE-ENHANCED HEALTHCARE ECOSYSTEM for Cancer: I/T perspectives

BIOINF 525 Winter 2016 Foundations of Bioinformatics and Systems Biology

Summary of Responses to the Request for Information (RFI): Input on Development of a NIH Data Catalog (NOT-HG )

Electronic Laboratory Notebook in the Graduate Level Laboratory Informatics Program

Strategies in data integration to predict fish susceptibility to toxicants

NIH Commons Overview, Framework & Pilots - Version 1. The NIH Commons

Cloud BioLinux: Pre-configured and On-demand Bioinformatics Computing for the Genomics Community

BIOINFORMATICS Supporting competencies for the pharma industry

International CEMarin Omics Workshop: Omics Techniques for the Study of Marine Organisms and Ecosystems

Vad är bioinformatik och varför behöver vi det i vården? a bioinformatician's perspectives

Data Science at the NIH Philip E. Bourne Ph.D. Associate Director for Data Science National Institutes of Health

BIO 3352: BIOINFORMATICS II HYBRID COURSE SYLLABUS

Dr Alexander Henzing

AGILENT S BIOINFORMATICS ANALYSIS SOFTWARE

SYMPOSIUM. June 15, Photonics Center Colloquium Room (906) 8 Saint Mary's St., Boston, MA Boston University

University of Glasgow - Programme Structure Summary C1G MSc Bioinformatics, Polyomics and Systems Biology

FACULTY OF MEDICAL SCIENCE

Intro to Bioinformatics

European Genome-phenome Archive database of human data consented for use in biomedical research at the European Bioinformatics Institute

Cloud Computing Solutions for Genomics Across Geographic, Institutional and Economic Barriers

Workshop on Establishing a Central Resource of Data from Genome Sequencing Projects

Guidelines for Establishment of Contract Areas Computer Science Department

Preparing the scenario for the use of patient s genome sequences in clinic. Joaquín Dopazo

CCR Biology - Chapter 9 Practice Test - Summer 2012

General Services Administration Federal Supply Service Authorized Federal Supply Schedule Price List

Ingenuity Pathway Analysis (IPA )

Searching biomedical data sets. Hua Xu, PhD The University of Texas Health Science Center at Houston

CALIFORNIA STATE UNIVERSITY CHANNEL ISLANDS

Degree Level Expectations, Learning Outcomes, Indicators of Achievement and the Program Requirements that Support the Learning Outcomes

A Primer of Genome Science THIRD

NIH/NIGMS Trainee Forum: Computational Biology and Medical Informatics at Georgia Tech

How To Understand The Science Of Genomics

Connecting Basic Research and Healthcare Big Data

dixa a data infrastructure for chemical safety Jos Kleinjans Dept of Toxicogenomics Maastricht University

Globus Genomics Tutorial GlobusWorld 2014

EMBL Identity & Access Management

School of Public Health. Department of Epidemiology & Biostatistics

BIOLOGICAL SCIENCES REQUIREMENTS [63 75 UNITS]

Presenting data: how to convey information most effectively Centre of Research Excellence in Patient Safety 20 Feb 2015

School of Nursing. Presented by Yvette Conley, PhD

Voluntary Genomic Data Submissions at the U.S. FDA

Understanding Big Data Analytics for Research

The Office of Biological and Environmental

OpenCB a next generation big data analytics and visualisation platform for the Omics revolution

The Future of the Electronic Health Record. Gerry Higgins, Ph.D., Johns Hopkins

Personalized Medicine: Humanity s Ultimate Big Data Challenge. Rob Fassett, MD Chief Medical Informatics Officer Oracle Health Sciences

Three data delivery cases for EMBL- EBI s Embassy. Guy Cochrane

How Can Institutions Foster OMICS Research While Protecting Patients?

BIO 3350: ELEMENTS OF BIOINFORMATICS PARTIALLY ONLINE SYLLABUS

FACULTY OF MEDICAL SCIENCE

MD-PhD: Is it Right for Me? Training & Career Paths

Donna J. Dean, Ph.D. October 27, 2009 Brown University

Center for Health Informatics & Bioinformatics. A New Catalyst For Cutting Edge research, Funding Opportunities, and Education at NYULMC

Just the Facts: A Basic Introduction to the Science Underlying NCBI Resources

KNIME Enterprise server usage and global deployment at NIBR

Course Specification

MediSapiens Ltd. Bio-IT solutions for improving cancer patient care. Because data is not knowledge. 19th of March 2015

2015 Concept Paper for Master of Science Program in Biotechnology

Technology funding opportunities at the National Cancer Institute

Big Data and Data Analysis for Personalized Medicine

FBIO - Fundations of Bioinformatics

Transcription:

* Alison Yao, Ph.D. Program Officer, Office of Genomics and Advanced Technologies Division of Microbiology and Infectious Diseases National Institute of Allergy and Infectious Diseases National Institutes of Health July 2014

BIG DATA * NIH Big Data to Knowledge Initiative for Research Data BD2K

* Genomic Other Omic Imaging Phenotypic Exposure Clinical Courtesy of NHGRI

*BIG DATA Experimental metadata Interpreted data/ knowledge Derived data Analytical metadata Primary data Courtesy of Richard Scheuermann

*BIG DATA *Lots and lots of data in individual labs Lab 2 Lab 1 Lab 6 Lab 4 Lab 3 Lab 5 Courtesy of Michael F. Huerta

*NIH is Tackling the Big Data Problem Associate Director for Data Science (ADDS) Scientific Data Council (SDC) Big Data to Knowledge (BD2K) Courtesy of NHGRI

*Big Data to Knowledge (BD2K): Major trans-nih initiative addressing an NIH imperative and key roadblock Aims to be catalytic to biomedical research and synergistic across different scientific communities Overarching goal: BD2K aims to develop the new approaches, standards, methods, tools, software, and competencies that will enhance the use of biomedical Big Data by supporting research, implementation, and training in data science.

Data Computing centers and Software development Advance the science & technology of biomedical big data Data standards, catalog, and data sharing policies Facilitate the broad use of biomedical research data Training *NIH BD2K Initiative Enhance & develop the workforce in biomedical big data

*Impact of NIH BD2K *Increased data sharing will make data available *Promotion of standards will make data useable *Data will be brought into the research ecosystem *Discoverable, citable & linked to data, tools & literature *Data science & tools will enable scientific innovation BD2K will make the biomedical research enterprise more data centric Today Hypothesis driven Transforming Biomedical research Tomorrow Data centric

* *The DDICC will support *Data Discoverability *Data Access *Data Citation *Approaches *Community engagement and Outreach *Task Forces *Pilot Projects *Deliverables: *White paper and examples to help inform development of a fully functional DDI

* NIAID/DMID Genomics Program Sequencing Functional Genomics Proteomics Structural Genomics Systems Biology Genomic Sequencing Centers Functional Genomic Research Centers Clinical Proteomics Centers Structural Genomics Centers Systems Biology Centers Bioinformatics Resource Centers Bioinformatics Genomic Research Resources Genomic/Omics Data Sets, Databases, Bioinformatics Tools, Biomarkers, 3D Structures, Protein Clones, Predictive Models To address key questions in microbiology and infectious disease

* Bioinformatics Resource Centers (BRCs) Genome Sequencing Centers Systems Biology Centers Structure Genomics Centers Clinical Proteomics Centers

*Bioinformatics Resource Centers (BRCs) Goal: Provide integrated bioinformatics resources in support of basic and applied infectious diseases research Data and metadata management and integration solutions Computational analysis and visualization tools Work spaces and web interfaces Training and outreach activities Free bioinformatics services Rapid response to new and emerging pandemic threats

*Bioinformatics Resource Centers (BRCs)

* Software Engineering Data Management & Integration Web interfaces and workspaces Social Engineering Computational analysis tools Collaboration Bioinformatics Services Training Workshop

* Data Tools

* CEIRS ICEMR BRCs DBPs

* *Key Features: *~16,000 bacterial genomes and standardized annotations *Free bioinformatics services * Genome annotation service (RAST) * Comparative genome analysis *Integrated genomic and omics data, metadata and tools *Comparative analyses and interactive visualizations *Personal workspace *TB Portal

* Genomes Metadata Phylogenetic Trees Genes & Proteins

* Protein-protein interactions Structures Transcriptomics (Microarray, RNA-Seq) Pathways Proteomics, ChIP-Seq data coming January 2014

Reference genomes (H37Rv) * tb.patricbrc.org Gene/ Protein search Analysis Tools Omics Data

* Data Generation Infectious disease community CEIRS Insight Hypothesis Data Processing Bioinformatics centers, IRD CEIRS data coordinating center Knowledge Presentation Open access Visualization Analysis Query Analysis Training Services Collaboration

*Acknowledgment DMID/OGAT Maria Giovanni Valentina Di Francesco Julia Puzak Eun Mi Lee Punam Mathur Malu Polanski Vivien Dugan Christina Giblin The Influenza Research Database Team J. Craig Venter Institute Northrop Grumman Health Solutions Vecna Technologies Los Alamos National Laboratory University of California Davis