AmphoraNet: Taxonomic Composition Analysis of Metagenomic Shotgun Sequencing Data



Similar documents
Tribuna Académica. Overview of Metagenomics for Marine Biodiversity Research 1. Barton E. Slatko* Metagenomics defined

Accelerate genomic breakthroughs in microbiology. Gain deeper insights with powerful bioinformatic tools.

Difficult DNA Templates Sequencing. Primer Walking Service

Canadian Microbiome Initiative

History of DNA Sequencing & Current Applications

Metagenomics revisits the one pathogen/one disease postulates and translate the One Health concept into action

Next Generation Sequencing Technologies in Microbial Ecology. Frank Oliver Glöckner

CCR Biology - Chapter 9 Practice Test - Summer 2012

Nicolas Pons INRA Ins(tut Micalis Plateforme MetaQuant Jouy- en- Josas, France

Master's projects at ITMO University. Daniil Chivilikhin PhD ITMO University

Owen White Institute for Genome Sciences University of Maryland

Shouguo Gao Ph. D Department of Physics and Comprehensive Diabetes Center

Influence of the skin mechanical and microbial properties on hair growth

Worksheet - COMPARATIVE MAPPING 1

University of Glasgow - Programme Structure Summary C1G MSc Bioinformatics, Polyomics and Systems Biology

Microbial community profiling for human microbiome projects: Tools, techniques, and challenges

14.3 Studying the Human Genome

BASIC STATISTICAL METHODS FOR GENOMIC DATA ANALYSIS

Lecture 13: DNA Technology. DNA Sequencing. DNA Sequencing Genetic Markers - RFLPs polymerase chain reaction (PCR) products of biotechnology

Biotechnology and Recombinant DNA (Chapter 9) Lecture Materials for Amy Warenda Czura, Ph.D. Suffolk County Community College

Biotechnology: DNA Technology & Genomics

Molecular typing of VTEC: from PFGE to NGS-based phylogeny

Chapter 8: Recombinant DNA 2002 by W. H. Freeman and Company Chapter 8: Recombinant DNA 2002 by W. H. Freeman and Company

A data management framework for the Fungal Tree of Life

Tarmbakterier ved fedme og type 2 diabetes

DNA Fingerprinting. Unless they are identical twins, individuals have unique DNA

A Tutorial in Genetic Sequence Classification Tools and Techniques

Twincore - Zentrum für Experimentelle und Klinische Infektionsforschung Institut für Molekulare Bakteriologie

BIOINF 525 Winter 2016 Foundations of Bioinformatics and Systems Biology

IIID 14. Biotechnology in Fish Disease Diagnostics: Application of the Polymerase Chain Reaction (PCR)

MoBEDAC -- Integrated data and analysis for the indoor and built environment. Folker Meyer Argonne National Laboratory GSC 13 Shenzhen, China

A Primer of Genome Science THIRD

The impact of antibiotics on the gut microbiota

Deliverable First report on sample storage, DNA extraction and sample analysis processes

Nucleic Acid Techniques in Bacterial Systematics

E. coli plasmid and gene profiling using Next Generation Sequencing

IMCAS-BRC: toward better management and more efficient exploitation of microbial resources

MASTER OF SCIENCE IN BIOLOGY

TOOLS FOR T-RFLP DATA ANALYSIS USING EXCEL

Principles of Evolution - Origin of Species

Cloud Computing for Scientific Research

G E N OM I C S S E RV I C ES

Introduction. Contamination sources

Services. Updated 05/31/2016

An introduction to bioinformatic tools for population genomic and metagenetic data analysis, 2.5 higher education credits Third Cycle

The Human Genome Project. From genome to health From human genome to other genomes and to gene function Structural Genomics initiative

Algorithms in Computational Biology (236522) spring 2007 Lecture #1

National Center for Biotechnology Information, National Library of Medicine, NIH, Bethesda, MD 20894, USA

Human Genome and Human Genome Project. Louxin Zhang

Just the Facts: A Basic Introduction to the Science Underlying NCBI Resources

In developmental genomic regulatory interactions among genes, encoding transcription factors

Next Generation Sequencing

The Polyphenol Study. Effect of Dietary Polyphenols on Intestinal Microbiota, Intestinal Inflammation and Metabolic Syndrome

FACULTY OF MEDICAL SCIENCE

Bioinformatics and its applications

Sequence Formats and Sequence Database Searches. Gloria Rendon SC11 Education June, 2011

Statement of ethical principles for biotechnology in Victoria

DNA Barcoding: A New Tool for Identifying Biological Specimens and Managing Species Diversity

MK Gibson et al. Training and Benchmarking of Resfams profile HMMs

High-throughput sequencing and big data: implications for personalized medicine?

Classification of Microorganisms (Chapter 10) Lecture Materials for Amy Warenda Czura, Ph.D. Suffolk County Community College Eastern Campus

KEY CONCEPT Organisms can be classified based on physical similarities. binomial nomenclature

Introduction to Bioinformatics 3. DNA editing and contig assembly

WJEC AS Biology Biodiversity & Classification (2.1 All Organisms are related through their Evolutionary History)

SNP Essentials The same SNP story

2.3 Identify rrna sequences in DNA

DnaSP, DNA polymorphism analyses by the coalescent and other methods.

Protocols. Internal transcribed spacer region (ITS) region. Niklaus J. Grünwald, Frank N. Martin, and Meg M. Larsen (2013)

How To Understand The Science Of Genomics

Genetics Test Biology I

Recombinant DNA and Biotechnology

restriction enzymes 350 Home R. Ward: Spring 2001

CHAPTER 6: RECOMBINANT DNA TECHNOLOGY YEAR III PHARM.D DR. V. CHITRA

AP Biology 2015 Free-Response Questions

New generation sequencing: current limits and future perspectives. Giorgio Valle CRIBI - Università di Padova

Vad är bioinformatik och varför behöver vi det i vården? a bioinformatician's perspectives

MAKING AN EVOLUTIONARY TREE

REPORT OF THE FAST-TRACK ACTION COMMITTEE ON MAPPING THE MICROBIOME

Viruses. Viral components: Capsid. Chapter 10: Viruses. Viral components: Nucleic Acid. Viral components: Envelope

An Overview of DNA Sequencing

The Techniques of Molecular Biology: Forensic DNA Fingerprinting

Cystic Fibrosis Webquest Sarah Follenweider, The English High School 2009 Summer Research Internship Program

GenBank, Entrez, & FASTA

Transcription:

Csaba Kerepesi, Dániel Bánky, Vince Grolmusz: AmphoraNet: Taxonomic Composition Analysis of Metagenomic Shotgun Sequencing Data http://pitgroup.org/amphoranet/ PIT Bioinformatics Group, Department of Computer Science Eötvös Loránd University, Budapest, Hungary

Metagenomics Past 4-5 years: only the 10 percent of the bacterial species are culturable Solution: bioinformatics, metagenomics analysis that is the direct analysis of genetic material of the microorganisms in environmental or clinical samples Metagenomics samples: from seas, lakes, mines, animals, humans associations with diseases Cost of sequencing: in the year 2000 one billion base pairs cost about $400000, recently 1 billion base pairs cost about $50 amount of the public data has grown rapidly Metagenomics analyzing tools: AMPHORA2 and its webserver implementation AmphoraNet The analysis is difficult because it is not possible to sequence DNA molecules separately, only short fragments mixed randomly (shotgun sequencing)

Shotgun sequencing ([6]) (A) Sampling (B) Filtering by size (C) DNA extraction and lysis (D) Cloning and library (E) Sequencing (F) Sequence assembly

Healthy microbiome Human microbiome: 100 times more bacteria than self cells. They are important in our subsistence Human Microbiome Project (USA, 2008-2012): samples from 15, 18 habitats of 242 healthy individuals at two different time points [1] Observation: there are differences at the level of individuals, but we can observe a typical average composition pattern in each body habitat

Microbiome of individuals suffered from diseases Obesity: an increase in the phylum Firmicutes and the phylum Actinobacteria and a decrease in the phylum Bacteroidetes in the gut (2009, 145 sample from twins and parents, [2]) Type 2 diabetes: reduced proportion of the class Clostridiales in the phylum Firmicutes (2010, 36 gut samples from Danish adults, [3]), a decrease in the abundance of some universal butyrate-producing bacteria and an increase in various opportunistic pathogens (2012, 345 gut samples from Chinese individuals, [4]) Autism: lower abundances of the genera Prevotella, Coprococcus, and unclassified Veillonellaceae in the gut microbiome of autistic children (2013, 40 samples, [5]) Application of the results: forecast, prevention or cure of diseases. But a lot of new samples and testing would be needed to confirm the current results and explore many other associations.

Metagenomics analyzing tools Most of the above-mentioned studies used 16S rrna gene sequencing with the 454 platform. But there are tools for analyze shotgun sequencing data as well. Comparison based phylotyping (comparing DNA reads against known complete genomes represented species): MEGAN [7] Phylogeny based phylotyping (finding phylogenetic marker genes in the shotgun sequencing data and using evolutionary trees): AMPHORA2 [8] and its web server implementation AmphoraNet AMPHORA2 uses 31 bacterial and 104 archaeal protein-coding phylogenetic marker genes for metagenomic phylotyping Most of these marker genes are single copy genes, therefore AMPHORA2 is suitable for inferring the taxonomic composition of shotgun sequencing data

The Bacterial Tree of Life [9]

AmphoraNet webserver: amphoranet.pitgroup.org

AmphoraNet sample input files

AmphoraNet sample output file

Visualization of the output files We are working on the automated visualization of the output files generated AmphoraNet as Eisen et al's figure below [9]:

Thank you for your attention!

References [1] The Human Microbiome Project Consortium. Structure, function and diversity of the healthy human microbiome. Nature 486, 207 214 (2012). [2] Turnbaugh, P. J. et al. A core gut microbiome in obese and lean twins. Nature 457,480 484 (2009). [3] Larsen N, Vogensen FK, van den Berg FWJ, Nielsen DS, Andreasen AS, et al. (2010) Gut Microbiota in Human Adults with Type 2 Diabetes Differs from Non-Diabetic Adults. PLoS ONE 5(2): e9085. [4] Qin, J. et al. A metagenome-wide association study of gut microbiota in type 2 diabetes. Nature 490, 55 60 (2012). [5] Kang D-W, Park JG, Ilhan ZE, Wallstrom G, LaBaer J, et al. (2013) Reduced Incidence of Prevotella and Other Fermenters in Intestinal Microflora of Autistic Children. PLoS ONE 8(7): e68322. [6] Wooley JC, Godzik A, Friedberg I (2010) A Primer on Metagenomics. PLoS Comput Biol 6(2): e1000667 [7] D.H. Huson, A.F. Auch, Ji Qi and S.C. Schuster, MEGAN Analysis of Metagenomic Data, Genome Research. 17:377-386, 2007. [8] Wu M, Scott AJ.: Phylogenomic analysis of bacterial and archaeal sequences with AMPHORA2. Bioinformatics. 2012 Apr 1;28(7):1033-4. doi: 10.1093/bioinformatics/bts079. Epub 2012 Feb 12. [9] Wu M, Eisen JA.: A simple, fast, and accurate method of phylogenomic inference. Genome Biol. 2008 Oct 13;9(10):R151.