DnaSP, DNA polymorphism analyses by the coalescent and other methods.
Author affiliation: Julio Rozas 1,*, Juan C. Sánchez-DelBarrio 2,3, Xavier Messeguer 2 and Ricardo Rozas 1

1 Departament de Genètica, Facultat de Biologia, Universitat de Barcelona, Diagonal 645, Barcelona, Spain
2 Departament de Llenguatges i Sistemes Informàtics, Universitat Politècnica de Catalunya, Barcelona, Spain
3 Present address: Departament de Tecnología, Universitat Pompeu Fabra, Barcelona, Spain

Name and address for correspondence: Julio Rozas, Departament de Genètica, Facultat de Biologia, Universitat de Barcelona, Diagonal 645, Barcelona, Spain. Tel.: Fax: [email protected]

Running head: DNA polymorphism analysis

* To whom correspondence should be addressed
ABSTRACT

Summary: DnaSP is a software package for the analysis of DNA polymorphism data. The present version introduces several new modules and features that, among other options, allow: (1) handling large data sets (~5 Mbp per sequence); (2) conducting a large number of coalescent-based tests by Monte Carlo computer simulations; (3) extensive analyses of genetic differentiation and gene flow among populations; (4) analysing the evolutionary pattern of preferred and unpreferred codons; (5) generating graphical outputs for easy visualization of results.

Availability: The software package, including complete documentation and examples, is freely available to academic users from:

Contact: [email protected]
INTRODUCTION

Recent advances in DNA sequencing and polymorphism detection methodologies are generating huge datasets of DNA sequence variation and of single nucleotide polymorphisms (SNPs). Analysis of such DNA polymorphism data will definitively enhance our understanding of both the evolutionary significance of DNA polymorphisms and the evolutionary history of populations and species (Nordborg and Innan 2002). Additionally, DNA polymorphism information has a wide range of applications, including pharmacogenomics, animal and plant breeding, conservation genetics, genetic epidemiology, medicine and forensics.

Current massive datasets are stimulating the development of numerous methods to interpret DNA polymorphism data. These methods capture different features of the data (SNP frequency, association among variants, haplotype structure, synonymous and nonsynonymous changes, recombinational events, codon usage, etc.) (Rosenberg and Nordborg 2002; Bamshad and Wooding 2003). In this context, coalescent theory (see Hudson 1990; Rosenberg and Nordborg 2002) has become the primary framework for analysing the data. Indeed, coalescent-based methods are critical for detecting the signature of positive natural selection, for identifying haplotype blocks across the genome, and for inferring the effect of intragenic recombination.

Here, we describe version 4 of the DnaSP software package (Rozas and Rozas 1999). The present version greatly extends the capabilities of the software, allowing extensive DNA polymorphism analyses through a user-friendly interface.
SYSTEM AND METHODS

DnaSP version 4 is written in Microsoft Visual Basic v. 6.0 and runs on ix86-compatible processors under Microsoft Windows. DnaSP can also run on Apple Macintosh, Linux and Unix-based platforms using Windows emulator software with one of the required Microsoft Windows versions.

MAIN NEW FEATURES

DnaSP provides a user-friendly Microsoft Windows graphic interface and can read (and export) five multiple-aligned nucleotide sequence file formats: FASTA, MEGA, NBRF/PIR, NEXUS and PHYLIP. DnaSP allows the analysis of polymorphism, divergence, genetic differentiation, gene flow, gene conversion, linkage disequilibrium, recombination and codon usage, and also conducts a number of neutrality tests. The analyses can be performed on a subset of sites (including synonymous, nonsynonymous, noncoding and i-fold degenerate sites) or on a subset of DNA sequences. Coding region analysis can be performed using a number of predefined genetic codes and codon usage tables.

Coalescent-based methods

Version 4 greatly extends the coalescent-based analyses. The present version can conduct most of the available neutrality tests (with and without an outgroup) and compute linkage disequilibrium statistics, including, among others: (1) Tajima's, Fu's and Fu and Li's tests (Tajima 1989; Fu and Li 1993; Fu 1997); (2) Depaulis and Veuille's haplotype-based tests (Depaulis and Veuille 1998); (3) the B and Q tests (Wall 1999); (4) the H test (Fay and Wu 2000); (5) the ZnS, ZZ and ZA linkage disequilibrium-based statistics (Kelly 1997; Rozas et al. 2001). DnaSP also computes a number of statistical tests for detecting population growth, including the recently developed R2 test (Ramos-Onsins and Rozas 2002). The Monte Carlo computer simulation module generates the empirical distribution of a very large number of test statistics. Simulations can be conducted for different recombination rates.

Gene Flow and Genetic Differentiation

The Gene Flow module has been completely rewritten. The present version performs a number of analyses of gene flow and genetic differentiation among populations, with different options for treating alignment gaps. To detect genetic differentiation among subpopulations, DnaSP implements several statistics based both on the number of haplotypes and on the number of nucleotide changes (i.e., sequence-based statistics) (Hudson et al. 1992a; Hudson 2000). DnaSP also estimates several standardized measures of the genetic diversity among populations (FST and the related statistics GST and NST) (see Hudson et al. 1992b). From these FST-based estimators, migration rates are obtained in terms of Nm, where N is the effective population size and m is the migration rate. The resulting values can be exported as a distance data file (PHYLIP and MEGA formats) for further phylogenetic analyses. DnaSP incorporates two methods to test for genetic differentiation: (1) the standard χ² homogeneity test, and (2) a Monte Carlo permutation (randomization) test (Hudson et al. 1992a).

Analysis of Preferred and Unpreferred codons

The present version implements a number of algorithms and methods to analyse the impact of natural selection and mutational processes on codon usage bias. In addition to the standard codon usage bias estimators (CBI, ENC, Scaled Chi-Square, etc.), DnaSP also implements an algorithm to identify preferred (P) and unpreferred (U) synonymous changes. This information is critical for determining the effect of weak natural selection on synonymous codons (see Akashi 1999). DnaSP estimates the numbers of preferred and unpreferred changes within species (which requires one outgroup to polarize the mutations), and also the numbers of changes that are polymorphic within species and fixed between species (which requires two outgroups). DnaSP also provides several predefined codon usage tables. The user can additionally define a custom codon usage table; this user-defined information can be stored in a private block of the NEXUS file format.
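As an illustration of the neutrality tests listed under the coalescent-based methods, the following is a minimal Python sketch of Tajima's D (Tajima 1989) computed from a small alignment. It is not DnaSP's code (DnaSP is written in Visual Basic); the function name tajimas_d and the choice to skip any alignment column containing a gap or ambiguous base (complete deletion) are assumptions made for this example.

```python
import math
from collections import Counter

def tajimas_d(seqs):
    """Tajima's D for equal-length aligned DNA sequences (illustrative sketch).

    Assumption: columns containing anything other than A, C, G or T
    are skipped entirely (complete deletion).
    """
    n = len(seqs)
    if n < 4:
        # For n < 4 the variance constants are zero and D is undefined.
        raise ValueError("Tajima's D needs at least 4 sequences")
    if len({len(s) for s in seqs}) != 1:
        raise ValueError("sequences must be aligned (equal length)")

    seg_sites = 0        # S: number of segregating sites
    pair_diffs = 0.0     # pairwise differences summed over all sites
    for column in zip(*(s.upper() for s in seqs)):
        if any(base not in "ACGT" for base in column):
            continue
        counts = Counter(column)
        if len(counts) > 1:
            seg_sites += 1
            # Number of sequence pairs that differ at this site.
            pair_diffs += (n * n - sum(c * c for c in counts.values())) / 2

    if seg_sites == 0:
        return math.nan  # D is undefined without polymorphism

    k = pair_diffs / (n * (n - 1) / 2)  # mean pairwise differences
    a1 = sum(1.0 / i for i in range(1, n))
    a2 = sum(1.0 / (i * i) for i in range(1, n))
    b1 = (n + 1) / (3.0 * (n - 1))
    b2 = 2.0 * (n * n + n + 3) / (9.0 * n * (n - 1))
    c1 = b1 - 1.0 / a1
    c2 = b2 - (n + 2) / (a1 * n) + a2 / (a1 * a1)
    e1 = c1 / a1
    e2 = c2 / (a1 * a1 + a2)
    variance = e1 * seg_sites + e2 * seg_sites * (seg_sites - 1)
    return (k - seg_sites / a1) / math.sqrt(variance)


if __name__ == "__main__":
    alignment = ["AAAA", "AAAT", "AATT", "ATTT"]
    print(f"Tajima's D = {tajimas_d(alignment):.4f}")
```

In DnaSP the significance of such a statistic is assessed against an empirical distribution from coalescent simulations; that step is not reproduced in this sketch.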
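The sequence-based differentiation analysis described under Gene Flow and Genetic Differentiation can be sketched in the same spirit. The example below estimates FST as 1 - Hw/Hb from mean pairwise differences within and between two populations, in the manner of Hudson et al. (1992b), converts it to Nm with Wright's island-model approximation FST ≈ 1/(1 + 4Nm), and runs a simple permutation test. All function names are hypothetical, the Nm conversion is one common approximation rather than necessarily the one DnaSP applies, and the permutation test uses FST as its statistic for simplicity rather than the statistics of Hudson et al. (1992a).

```python
import math
import random
from itertools import combinations, product

def n_diff(a, b):
    """Number of sites at which two aligned sequences differ."""
    return sum(x != y for x, y in zip(a, b))

def mean(values):
    values = list(values)
    return sum(values) / len(values)

def fst_hudson(pop1, pop2):
    """FST = 1 - Hw/Hb for two populations (each with >= 2 sequences)."""
    # Hw: average of the two within-population mean pairwise differences.
    hw = mean(
        mean(n_diff(a, b) for a, b in combinations(pop, 2))
        for pop in (pop1, pop2)
    )
    # Hb: mean pairwise differences between populations.
    hb = mean(n_diff(a, b) for a, b in product(pop1, pop2))
    if hb == 0:
        raise ValueError("FST is undefined: no differences between populations")
    return 1.0 - hw / hb

def nm_from_fst(fst):
    """Nm under the island-model approximation FST = 1 / (1 + 4Nm)."""
    if fst <= 0:
        return math.inf  # no detectable differentiation
    return (1.0 / fst - 1.0) / 4.0

def permutation_p_value(pop1, pop2, n_perm=1000, seed=0):
    """One-sided p-value: share of random relabellings with FST >= observed."""
    rng = random.Random(seed)
    observed = fst_hudson(pop1, pop2)
    pooled = list(pop1) + list(pop2)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(pooled)
        permuted = fst_hudson(pooled[:len(pop1)], pooled[len(pop1):])
        if permuted >= observed - 1e-12:
            hits += 1
    # Add-one correction so the estimate is never exactly zero.
    return (hits + 1) / (n_perm + 1)


if __name__ == "__main__":
    pop_a = ["AAAA", "AAAA", "AAAT"]
    pop_b = ["TTTT", "TTTT", "TTTA"]
    fst = fst_hudson(pop_a, pop_b)
    print(f"FST = {fst:.4f}, Nm = {nm_from_fst(fst):.4f}")
    print(f"permutation p-value = {permutation_p_value(pop_a, pop_b):.3f}")
```

With only six sequences there are few distinct relabellings, so the smallest attainable p-value is coarse; real data sets with larger samples give the permutation test more resolution.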
ACKNOWLEDGEMENTS

We thank M. Aguadé, A. Blanco-García, H. Quesada, C. Segarra and A. Vilella for critical comments on the manuscript. We also thank the numerous people who tested the program with their data, especially members of the Molecular Evolutionary Genetics group in the Departament de Genètica, Universitat de Barcelona. This work was supported by grant BMC from the Dirección General de Investigación Científica y Técnica, Spain, conferred on M. Aguadé, and by grant TXT from the Dirección General de Enseñanza Superior e Investigación Científica, Spain, conferred on J. Rozas.
REFERENCES

Akashi,H. (1999) Detecting the "footprint" of natural selection in within and between species DNA sequence data. Gene, 238.
Bamshad,M. and Wooding,S.P. (2003) Signatures of natural selection in the human genome. Nature Rev. Genetics, 4.
Depaulis,F. and Veuille,M. (1998) Neutrality tests based on the distribution of haplotypes under an infinite-site model. Mol. Biol. Evol., 15.
Fay,J.C. and Wu,C.-I. (2000) Hitchhiking under positive Darwinian selection. Genetics, 155.
Fu,Y.-X. and Li,W.-H. (1993) Statistical tests of neutrality of mutations. Genetics, 133.
Fu,Y.-X. (1997) Statistical tests of neutrality of mutations against population growth, hitchhiking and background selection. Genetics, 147.
Hudson,R.R. (1990) Gene genealogies and the coalescent process. Oxf. Surv. Evol. Biol., 7.
Hudson,R.R. (2000) A new statistic for detecting genetic differentiation. Genetics, 155.
Hudson,R.R., Boos,D.D. and Kaplan,N.L. (1992a) A statistical test for detecting population subdivision. Mol. Biol. Evol., 9.
Hudson,R.R., Slatkin,M. and Maddison,W.P. (1992b) Estimation of levels of gene flow from DNA sequence data. Genetics, 132.
Kelly,J.K. (1997) A test of neutrality based on interlocus associations. Genetics, 146.
Nordborg,M. and Innan,H. (2002) Molecular population genetics. Curr. Op. in Plant Biol., 5.
Ramos-Onsins,S.E. and Rozas,J. (2002) Statistical properties of new neutrality tests against population growth. Mol. Biol. Evol., 19.
Rosenberg,N.A. and Nordborg,M. (2002) Genealogical trees, coalescent theory, and the analysis of genetic polymorphisms. Nature Rev. Genetics, 3.
Rozas,J. and Rozas,R. (1999) DnaSP version 3: an integrated program for molecular population genetics and molecular evolution analysis. Bioinformatics, 15.
Rozas,J., Gullaud,M., Blandin,G. and Aguadé,M. (2001) DNA variation at the rp49 gene region of Drosophila simulans: Evolutionary inferences from an unusual haplotype structure. Genetics, 158.
Tajima,F. (1989) Statistical method for testing the neutral mutation hypothesis by DNA polymorphism. Genetics, 123.
Wall,J.D. (1999) Recombination and the power of statistical tests of neutrality. Genet. Res., 74.
