Comparative genomics of higher primates, including humans and Neandertals (Svante Pääbo)



Similar documents
Human Genome Organization: An Update. Genome Organization: An Update

A Primer of Genome Science THIRD

Just the Facts: A Basic Introduction to the Science Underlying NCBI Resources

CCR Biology - Chapter 9 Practice Test - Summer 2012

Discovery and Quantification of RNA with RNASeq Roderic Guigó Serra Centre de Regulació Genòmica (CRG)


14.3 Studying the Human Genome

Dr Alexander Henzing

The Human Genome Project

Worksheet - COMPARATIVE MAPPING 1

Animal Models of Human Behavioral and Social Processes: What is a Good Animal Model? Dario Maestripieri

Biological Sciences Initiative. Human Genome

AP Biology Essential Knowledge Student Diagnostic

History of DNA Sequencing & Current Applications

BIOINF 525 Winter 2016 Foundations of Bioinformatics and Systems Biology

Mechanisms of Evolution

School of Nursing. Presented by Yvette Conley, PhD

Genetic information (DNA) determines structure of proteins DNA RNA proteins cell structure enzymes control cell chemistry ( metabolism )

DNA Fingerprinting. Unless they are identical twins, individuals have unique DNA

Human Genome and Human Genome Project. Louxin Zhang

Systematic discovery of regulatory motifs in human promoters and 30 UTRs by comparison of several mammals

Appendix 2 Molecular Biology Core Curriculum. Websites and Other Resources

Genetic diagnostics the gateway to personalized medicine

Next Generation Sequencing: Technology, Mapping, and Analysis

Attacking the Biobank Bottleneck

Stem Cells and Inner Ear Cell Regeneration. Stefan Heller Stanford University School of Medicine

Clinical Research Infrastructure

RETRIEVING SEQUENCE INFORMATION. Nucleotide sequence databases. Database search. Sequence alignment and comparison

Grade 12 Psychology (40S) Outcomes Unedited Draft 1

PDF hosted at the Radboud Repository of the Radboud University Nijmegen

FACULTY OF MEDICAL SCIENCE

Algorithms in Computational Biology (236522) spring 2007 Lecture #1

SNP Essentials The same SNP story

Macroevolution: Change above the species level NABT 2006 Evolution Symposium

Genomes and SNPs in Malaria and Sickle Cell Anemia

The Story of Human Evolution Part 1: From ape-like ancestors to modern humans

GENOMICS: REINVIGORATING THE FIELD OF PSYCHIATRIC RESEARCH

Basic Concepts of DNA, Proteins, Genes and Genomes

escience and Post-Genome Biomedical Research

Biochemistry Major Talk Welcome!!!!!!!!!!!!!!

Targeted. sequencing solutions. Accurate, scalable, fast TARGETED

Principles of Evolution - Origin of Species

Evolution (18%) 11 Items Sample Test Prep Questions

Opposition against European Patent EP B1

Core Facility Genomics

Localised Sex, Contingency and Mutator Genes. Bacterial Genetics as a Metaphor for Computing Systems

FINDING RELATION BETWEEN AGING AND

Lab 2/Phylogenetics/September 16, PHYLOGENETICS

1 Mutation and Genetic Change

How To Understand The Science Of Genomics

MAKING AN EVOLUTIONARY TREE

The Making of the Fittest: Evolving Switches, Evolving Bodies

2019 Healthcare That Works for All

DNA and the Cell. Version 2.3. English version. ELLS European Learning Laboratory for the Life Sciences

Accelerated evolution of conserved noncoding sequences in the human genome

M The Nucleus M The Cytoskeleton M Cell Structure and Dynamics

The National Institute of Genomic Medicine (INMEGEN) was

INTERNATIONAL CONFERENCE ON HARMONISATION OF TECHNICAL REQUIREMENTS FOR REGISTRATION OF PHARMACEUTICALS FOR HUMAN USE Q5B

ALLIANCE FOR LUPUS RESEARCH AND PFIZER S CENTERS FOR THERAPEUTIC INNOVATION CHALLENGE GRANT PROGRAM PROGRAM GUIDELINES

Re: Proposal for BAC library construction of Orangutan ( Pongo pygmaeus) Importance

From Africa to Aotearoa Part 1: Out of Africa

SEQUENCING THE CHIMPANZEE GENOME: INSIGHTS INTO HUMAN EVOLUTION AND DISEASE

ATIP Avenir Program Applicant s guide

Okami Study Guide: Chapter 3 1

Sequencing and microarrays for genome analysis: complementary rather than competing?

Micro RNAs: potentielle Biomarker für das. Blutspenderscreening

CONSENT FORM TEMPLATE. Derivation and Distribution of Induced Pluripotent Stem (ips) Cell Lines Created from Donor Specimens

Standards, Guidelines and Best Practices for RNA-Seq V1.0 (June 2011) The ENCODE Consortium

Understanding West Nile Virus Infection

Next Generation Sequencing

Introduction to Bioinformatics 3. DNA editing and contig assembly

Control of Gene Expression

BBSRC TECHNOLOGY STRATEGY: TECHNOLOGIES NEEDED BY RESEARCH KNOWLEDGE PROVIDERS

Genetic testing. The difference diagnostics can make. The British In Vitro Diagnostics Association

DNA Replication & Protein Synthesis. This isn t a baaaaaaaddd chapter!!!

Von Mäusen und Menschen E - 1

July 7th 2009 DNA sequencing

GenBank, Entrez, & FASTA

BioBoot Camp Genetics

Introduction to transcriptome analysis using High Throughput Sequencing technologies (HTS)

PGY 206 ELEMENTARY PHYSIOLOGY. (3) An introductory survey course in basic human physiology. Prereq: One semester of college biology.

Gene mutation and molecular medicine Chapter 15

Genetomic Promototypes

Amazing DNA facts. Hands-on DNA: A Question of Taste Amazing facts and quiz questions

Shouguo Gao Ph. D Department of Physics and Comprehensive Diabetes Center

European Genome-phenome Archive database of human data consented for use in biomedical research at the European Bioinformatics Institute

The sequence of bases on the mrna is a code that determines the sequence of amino acids in the polypeptide being synthesized:

Bioinformatics: course introduction

Big Data Healthcare. Fei Wang Associate Professor Department of Computer Science and Engineering School of Engineering University of Connecticut

Single-Cell DNA Sequencing with the C 1. Single-Cell Auto Prep System. Reveal hidden populations and genetic diversity within complex samples

Molecular Genetics: Challenges for Statistical Practice. J.K. Lindsey

INTERNATIONAL CONFERENCE ON HARMONISATION OF TECHNICAL REQUIREMENTS FOR REGISTRATION OF PHARMACEUTICALS FOR HUMAN USE E15

G E N OM I C S S E RV I C ES

Social cooperation, niche construction, and the core role of intergenerational bonding in human evolution. Agustín Fuentes University of Notre Dame

NATIONAL HEALTH COUNCIL

Integrating Bioinformatics, Medical Sciences and Drug Discovery

Data integration is a feature that clearly expands the role of the GTL

Genetic Variation and Human Evolution Lynn B. Jorde, Ph.D. Department of Human Genetics University of Utah School of Medicine.

1. Over the past century, several scientists around the world have made the following observations:

Control of Gene Expression

Transcription:

Comparative genomics of higher primates, including humans and Neandertals (Svante Pääbo) At a glance The comparison of the human genome to the genomes of our closest living relatives, the apes, as well as our closest extinct relative, the Neandertal, will allow almost all genomic features specific to fully modern humans to be identified. Further studies of these genomes as well as functional tests in model systems will allow those features that have been functionally important during human evolution to be identified. Ultimately, this will uncover the biological basis for what made human cognition, language and culture possible and may provide inroads to a better understanding of diseases affecting human-specific traits. Definition of research The closest living evolutionary relatives of humans are the two chimpanzee species ( common chimpanzees and bonobos) with whom we share a common ancestor about 5 to 7 million years ago. The other African ape, the gorilla, is almost as closely related to us, sharing a common ancestor 6 to 8 million years ago, while the common ancestor shared with the Asian orangutans lived some 12 to 16 million years ago. Since our shared common ancestors with the chimpanzees, many extinct groups related to human ancestors, so-called hominins, existed. Of these, the ones most closely related to us were the Neandertals, with whom we share a common ancestor in the order of 400,000 to 500,000 years ago. They existed in western Eurasia until they became extinct about 30,000 years ago. Genomic comparisons of humans with closely related apes and hominins are important not only in order to better understand humans origins and the biological background of what sets humans apart from other organisms, but also for understanding the basis for diseases or developmental problems that affect uniquely human traits, such as speech disorders, autism or schizophrenia. Status of field Technically, the field of genome sequencing is developing extremely rapidly. Very soon, it will be possible to sequence complete mammalian genomes to high accuracy at very little cost. This affects also other areas of genomics, for example high-throughput sequencing of RNA that now allows comparisons of levels of gene expression as well as patterns of spicing and transcription of parts of the genome other than the classical protein-coding genes. By comparison, mass spectroscopy-based high-throughput quantitative analysis of the proteome (i.e. all the proteins present in a tissue or a cell) and the metabolome (i.e. the small molecules produced by metabolic processes) is still in its infancy but in rapid development. In general, these technical advances are first applied to humans, particularly medical questions and only slowly to other primates. Currently, genome sequences of variable quality are available for humans, a chimpanzee, a gorilla, an orangutan and a rhesus macaque. The analysis of these genomes allows the identifications of millions of nucleotide changes that occurred on the human lineage since the divergence from the chimpanzee lineage as well as tens of thousands of insertions, deletions and duplications of pieces of DNA. The great challenge in the field is to identify those changes that form the biological basis for traits that are shared by all humans and distinguish them from all other species. - 1 -

Important areas in this endeavor are comparisons of not only the DNA sequences of the genomes but also of functional aspects of the genomes, such as gene activity in the form of RNA and protein concentrations in different cells and organs and during various stages of development. Recent technical advances have also made it possible to begin to sequence the genome of our closest extinct relative, the Neandertals, in order to identify those changes that occurred during the last few hundred of thousands of years when fully modern humans appeared. International activities Thanks mainly to efforts in the US, the human genome project and the sequences of a few ape genomes are currently being followed by determination of draft genome sequences of many primates. In the next few years, genome sequences of single representatives of almost all monkeys can be expected. MPG is contributing to this field by sequencing of the bonobo (pygmy chimpanzee) genome. Several large projects are providing resources and insights into the genomic variation in humans (HapMap, ENCODE, 1000 Genomes). In the next few years these activities are expected to expand. No comparable efforts exist for the apes or other primates. Efforts to sequence complete genomes of extinct animals are beginning to emerge thanks to adaptations of high-throughput DNA sequencing technologies. In this endeavor, the MPG has taken a lead with the Neandertal genome effort. Research opportunities and needs It will be important to more fully understand the genomic evolution of humans by both structural and functional comparisons of the human genome to the genomes of extant apes and extinct hominins. In order to achieve this, it will be necessary to obtain the same basic genomic resources that exist for humans in non-human primates, with a view to using those resources as the basis for subsequent evolutionary analyses, population genetics and disease gene identification. To be useful, such comparisons will have to include not only comparisons of single genomes per species but also genomic variation within the species. This is important both because many groups are so closely related that they share variants between species and because many methods to identify genes that have been of importance for the species rest on comparisons of variation within species as well as differences between species. The following areas will be particularly important in the next few years: Primate genomes of high and similar quality will need to be generated to allow differences between species to be defined accurately and comprehensively. Now, large and different numbers of errors in the different genomes make many analyses difficult. For example, currently the gorilla genome is of low quality and the chimpanzee X chromosome is sequenced to very low coverage. Over the next few years, it will be important to sequence all apes genomes to a quality similar to the human genome. Primate genomes need to annotated independently using cdnas generated directly from the species in question. Currently, such annotations for all non-human primate genomes are simply lifted over from the human genome. This will allow differences in gene structure, splicing and regulatory RNAs to be detected. Primate genome variation panels that complement the "1000 genomes project" in humans will need to be created, at least for the great apes. This will allow the identification of genes that have been positively selected in humans as well as in the other primates. Currently the - 2 -

largest studies performed non-human primates have sequenced only 100-200kb in approximately 20 individuals. To achieve the above goals it will be important to systematically collect blood samples and tissues from captive apes. Such material can be obtained when material is removed for medical reasons in primate sanctuaries. It will also be important to collect and establish genomic resources from wild apes. This is crucial since captive apes often represent only a limited sample of the total genetic variation in the species. The collection of such samples is all the more urgent as many of the wild populations can be expected to disappear over the next one or two decades. It is therefore necessary to systematically collect and preserve non-invasive samples from all great apes across their entire current habitats. This can be done in the form of collection of fecal and hair samples, and their preservation in ways that allow the retrieval of DNA in the future. From material collected from both captive and wild apes, immortalized cell lines and sequencing libraries that can be indefinitely replicated should be established whenever possible in order to provide genomic resources for the future. Methods should be developed to establish cell lines from novel sources of material, for example by establishing induced pluripotent stem (ips) cells. Neandertal genome sequencing will be continued with the aim of achieving a sequence of higher accuracy. It should be extended to other hominins whenever possible. In addition, the development and promotion of standards and protocols for working with ancient DNA, particularly ancient hominids, will be important to develop further. The analyses of the genomes of humans, Neandertals and apes will lead to the identification of features in the human genome that are candidates for being of importance for humanspecific traits. Animal models for testing such candidate features are per definition not available since the traits of interest are unique to humans and most experiments cannot be done in apes (who cannot give informed consent) nor in Neandertals (who are extinct). Efforts to establish in vitro systems, for example in the form of tissue cultures that closely mimic physiological tissues from ips cells, should therefore be pursued. Genetic manipulations of these in vitro systems and animal models in which genes or whole pathways are humanized will provide an avenue for testing aspects of traits specific to humans. Expected outcome and benefits The understanding of the biological basis for what made important aspects of human cognition, language and culture possible will be of substantial cultural interest. Since several common diseases affect aspects of human-specific traits, it is also expected that an understanding of the biological basis of these traits may provide an inroad into understanding their pathophysiology in addition to genetic association and linkage studies. This may apply to, for example, speech disorders, autism, and schizophrenia. References Venter, J.C., et al.: The sequence of the human genome. Science 291(5507): 1304-1351 (2001). Lander, E.S., et al.: Initial sequencing and analysis of the human genome. Nature 409(6822): 860-921 (2001). The Chimpanzee Genome Consortium: Initial sequence of the chimpanzee genome and comparison with the human genome. Nature 437(7055): 69-87 (2005). - 3 -

Gibbs, R.A., et al.: Evolutionary and biomedical insights from the rhesus macaque genome. Science 316(5822): 222-234 (2007). <span style="font-size: small;" />Khaitovich, P., Hellmann, I., Enard, W., Nowick, K., Leinweber, M., Franz, H., Weiss, G., Lachmann, M., and Pääbo, S.: Parallel patterns of evolution in the genomes and transcriptomes of humans and chimpanzees. Science 309: 1850-1854 (2005). Green, R.E., Malaspinas, A.-S., Krause, J., Briggs, A.W., Johnson, P.L.F., Uhler, C., Meyer, M., Good, J.M., Maricic, T., Stenzel, U., Prüfer, K., Siebauer, M., Burbano, H.A., Ronan, M., Rothberg, J.M., Egholm, M., Rudan, P., Brajković, D., Kućan, Ž., Gušić, I., Wikström, M., Laakkonen, L., Kelso, J., Slatkin, M., and Pääbo, S.: A complete Neandertal mitochondrial genome sequence determined by high-throughput sequencing. Cell 134: 416-426 (2008). Enard, W., Gehre, S., Hammerschmidt, K., Hölter, S.M., Blass, T., Somel, M., Brückner, M.K., Schreiweis, C., Winter, C., Sohr, R., Becker, L., Wiebe, V., Nickel, B., Giger, T., Müller, U., Groszer, M., Adler, T., Aguilar, A., Bolle, I., Calzada-Wack, J., Dalke, C., Ehrhardt, N., Favor, J., Fuchs, H., Gailus-Durner, V., Hans, W., Hölzlwimmer, G., Javaheri, A., Kalaydjiev, S., Kallnik, M., Kling, E., Kunder, S., Moßbrugger, I., Naton, B., Racz, I., Rathkolb, B., Rozman, J., Schrewe, A., Busch, D., Graw, J., Ivandic, B., Klingenspor, M., Klopstock, T., Ollert, M., Quintanilla-Martinez, L., Schulz, H., Wolf, E., Wurst, W., Zimmer, A., Fisher, S.E., Morgenstern, R., Arendt, T., Hrabé de Angelis, M., Fischer, J., Schwarz, J., and S. Pääbo: A humanized version of Foxp2 affects cortico-basal ganglia circuits in mice. Cell 137, 961-71 (2009). Figures Fig. 1 Fig. 1: Tree of humans and apes. (From: Kaessmann, H., and Pääbo, S.: The genetical history of humans and the great apes. Journal of Internal Medicine 251: 1-18 (2002)). - 4 -

Fig. 2 Fig. 2: A skeleton of a Neandertal and a human. (Photo: K. Mowbray, Reconstruction: G. Sawyer and B. Maley, Copyright: Ian Tattersall) Fig. 3 Fig. 3: Artistic view of murine model systems. - 5 -